Skyler Shapiro

PhD Student · Biostatistics · Duke University

Skyler Shapiro

I'm a first-year PhD student in Biostatistics at Duke University. My research focuses on causal inference, clinical trial emulation, and machine learning for health data. I completed my undergraduate degree at Cornell University where I studied Biometry and Statistics with a minor in Computer Science. My prior research spans drug repurposing with Real-World Data, diversity across clinical trials and medical evidence, and accelerated inference with large language models. I'm passionate about addressing clinically motivated questions using both observational data and clinical trials.

Education

2025 — Present
PhD, Biostatistics
Duke University
2021 — 2025
BS, Biometry and Statistics
Cornell University · Minor in Computer Science
Merrill Presidential Scholar Distinction in Research Cum Laude

Research Experience

May 2023 —
Sep 2024
Research Intern
Harvard Medical School & Massachusetts General Hospital · Laboratory for Systems Pharmacology· Boston, MA
Investigated drug repurposing for Alzheimer's and Parkinson's disease using causal inference and causal discovery methods on patient data. Developed federated approaches to target trial emulation across distributed health systems.
Causal Inference Target Trial Emulation Federated Learning
May 2022 —
May 2023
Data Science Intern
MIT Lab for Computational Physiology · Cambridge, MA
Conducted applied data science research on clinical datasets. Contributed to projects in health data engineering and statistical analysis of physiological signals.
Data Engineering Electronic Health Records
Jun 2020 —
Aug 2021
Machine Learning Intern
Harvard Medical School · Boston, MA
Prepared and managed clinical data using Python and R. Developed an algorithm for cleaning and translating medical records. Contributed to the construction and training of an NLP model. Conducted a literature review of public datasets focused on adverse drug reactions.
NLP Data Engineering
Jan 2022 —
Dec 2023
Team Lead · Project Lead
Cornell Data Science · Ithaca, NY
Led applied machine learning project teams, managed technical roadmaps, and mentored junior members.
PyTorch Project Management Technical Leadership

Publications

Journal
Article
A pilot study evaluating mid-point vancomycin concentrations to estimate pharmacokinetics and area under the curve: a prospective study + Conference Abstract
Aseel AbuSara, Deema Abdelrahman, Wedad Awad, Jennifer Le, Skyler Shapiro, Lama Nazer · BMC Infectious Diseases · September 2025
Presentation: 803: A Pilot Study Evaluating Midpoint Vancomycin Concentrations to Estimate Area Under the Curve · Critical Care Medicine · January 2023
An evaluation of single mid-interval vancomycin concentrations in critically ill cancer patients, assessing their ability to estimate AUC24, clearance, and volume of distribution compared to Bayesian peak–trough methods, and suggesting a simpler alternative for therapeutic drug monitoring.
Workshop Poster
HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis
Andy He*, Darren Key*, Mason Bulling*, Andrew Chang*, Skyler Shapiro*, Everett Lee · International Conference on Machine Learning (ICML), 2024: ES-FoMo II: 2nd Workshop on Efficient Systems for Foundation Models · June 2024
HLSTransform is an open-source FPGA accelerator for Llama 2 that leverages High-Level Synthesis and 8-bit quantization to achieve an 8.25x reduction in energy per token compared to a GPU while maintaining 53% of its inference speed, demonstrating a high-efficiency alternative for LLM deployment on edge hardware.
Journal
Article
Patient diversity and author representation in clinical studies supporting the Surviving Sepsis Campaign guidelines for management of sepsis and septic shock 2021: a systematic review of citations + Conference Abstract
Lama Nazer, Aseel Abusara, Batoul Aloran, et al. · BMC Infectious Diseases · November 2023
Presentation: 1229: Racial, Sex, and Geographic Representation in Clinical Studies Supporting the Sepsis Guidelines · Critical Care Medicine · January 2023
A systematic review assessing the generalizability of the Surviving Sepsis Campaign 2021 guidelines by examining racial, sex, and geographic diversity of patients enrolled in supporting clinical studies, as well as demographic representation among authorship teams.
Journal
Article
Sociodemographic disparities in ophthalmological clinical trials
Luis Filipe Nakayama, William Greig Mitchell, Skyler Shapiro, et al. · BMJ Open Ophthalmology · February 2023
An analysis of sociodemographic representation across ophthalmology clinical trials studying conditions including diabetic retinopathy, myopia, age-related macular degeneration, glaucoma, and keratoconus, identifying enrollment gaps that may limit the generalizability of findings.

Teaching

2025 — Present
Graduate Teaching Assistant
Duke University · Department of Biostatistics & Bioinformatics
BIOSTAT 821 — Software Tools for Data Science
BIOSTAT 721 — Introduction to Statistical Computing
Jul – Aug 2022
Teaching Assistant, BeaverWorks Summer Institute
MIT Lincoln Laboratory · Cambridge, MA
Course Medlytics: data science and deep learning with medical data geared for high school students.
Jul 2022
Teaching Fellow, Summer Program in Clinical Effectiveness
Harvard T.H. Chan School of Public Health · Boston, MA
Course BST 209: Designed a course materials for a week-long datathon challenge geared for healthcare professionals. Coached a physician team building an event-prediction model.