Talks and presentations

Learning Regional Uncertainty to Calibrate Predictive Models in Clinical Settings

September 03, 2025

Poster, Health Data Science Poster Session, Boston, MA, USA

The use of machine learning (ML) in clinical settings has grown rapidly in recent years. These models have shown strong performance in predicting disease risk, treatment outcomes, tumor detection from imaging, and more. Despite this, many clinicians remain hesitant to rely on ML in practice. A key reason is their black-box nature—it is often unclear why the model is making a particular prediction. That lack of transparency can be dangerous in clinical applications. The stakes are especially high when it comes to false positives and false negatives in clinical prediction. In the context of treatment decisions, false negatives can result in patients missing out on necessary care—potentially leading to serious health consequences—while false positives may expose patients to unnecessary treatments and harmful side effects, ultimately reducing their quality of life. This challenge is compounded by a common issue in ML classification: overconfidence. Many classifiers can achieve high accuracy (e.g., 80% or higher) while still assigning high probabilities to incorrect classifications. If a model is uncertain about a particular prediction, this uncertainty should be reflected in the model output. However, reporting uncertainty measures alongside predictions often only serves to complicate the decision-making process. We propose a local similarity-aware post-hoc smoothing algorithm for predictive probabilities. Our method identifies regions of uncertainty in the feature space and directly adjusts predicted probabilities using a shrinkage estimate to reduce overconfidence. In simulated data, this approach has reduced conditional misclassification rates by as much as 16%. We also show that, in terms of conditional misclassification, our smoothed probabilities consistently outperform the original model outputs—even for neural networks. This work underscores the importance of moving beyond accuracy as a pure measure of model success, and towards safe models that represent what they know and what they don’t.

The Genetic Overlap Between Multisite Chronic Pain and PTSD Severity

June 01, 2025

Poster, Behavioral Genetics Association 55th Annual Meeting, Atlanta, GA

This study builds upon twin analyses which suggest that the co-occurrence between chronic pain and posttraumatic stress disorder (PTSD) can be partly explained by genetic factors. The sample consisted of 968 individuals of European ancestry (78% male, meanage = 43.62, SDage = 13.82) recruited into two Boston VA studies with identical assessment procedures, including assessment of PTSD severity by structured clinical interview. We used well-powered GWASs (>180,000) of multisite chronic pain (MCP) and PTSD severity (i.e., Total Score from PTSD Checklist) to compute PTSD and MCP polygenic scores (PGS). Genetic correlations between the two PGSs were small (r=0.24, SE=0.03). The MCP PGS significantly predicted PTSD severity in models covarying for ancestry substructure, sex, and age (β=0.084, p=0.007), but was no longer significant when PTSD PGS was added to the model, suggesting potential genetic confounding. Exploratory post-hoc FUMA analyses revealed enrichment: 46% of the lead SNPs from the MCP GWAS were nominally significant (p<0.05) in the PTSD GWAS, but 72% of these showed opposite directions of association. Fifty-three of the top-100 genes in the MCP GWAS were nominally significant in the PTSD severity GWAS, with three genes reaching genome-wide significance for PTSD severity (UBA7,RNF123,RBM6). Of the top-100 genes in the PTSD severity GWAS, 80 were also nominally significant in the MCP GWAS, including seven genome-wide significant (FOXP2,EXD3,MON1A,SEMA3F,GMPPB,AMIGO3,UBA7). Mendelian randomization indicated a positive significant bi-directional relationship, with larger effects detected for MCP on PTSD severity. These results further support that chronic pain and PTSD comorbidity may be partially due to genetic pleiotropy.

Improving AD Clinical Trial Enrollment Using Machine Learning

May 01, 2025

Talk, Red Rock Data Science Conference, St. George, UT, USA

This talk presents a machine learning approach to improve enrollment efficiency in Alzheimer’s disease (AD) clinical trials targeting individuals with high amyloid levels. Using the NACC Uniform Data Set and XGBoost models, we show that cognitive exam scores significantly enhance the ability to predict AD conversion beyond demographics and genetic risk factors alone. Power simulations demonstrate that screening with cognitive data can increase trial power by up to 45.8% compared to demographic-only models. We then extend this work to a larger population without PET scans, predicting both AD risk and likely amyloid status to identify high-priority candidates for imaging. Overall, our method offers a data-driven framework to optimize trial recruitment and make PET-based trials more scalable and inclusive.

Connecting the Dots: Overlapping Gene Expression Signatures in Human BNST and PTSD Across Brain Regions and Mouse Models

September 01, 2024

Talk, International Society for Traumatic Studies, Boston, MA, USA

The Bed Nucleus of the Stria Terminalis (BNST) serves as a regulator for long-term emotional states and is relevant to the study of post-traumatic stress disorder (PTSD). No human studies of PTSD and BNST expression have been conducted, but BNST gene expression has been studied in conjunction with stress in mice. We performed differential gene expression analyses of RNAseq data from the BNST of 8 PTSD cases and 8 PTSD/depression-free donors from the VA National PTSD Brain Bank and compared the results of an independent PTSD study of other brain regions (Girgenti et al. 2021; UPMC results) and a joint analysis of two mouse BNST studies of chronic social defeat stress (GEO GSE109315 and GSE122840). We identified 1,641 nominally significant (p<0.05) PTSD-associated BNST differentially expressed genes (DEGs; top genes CALB1, ZIC2). Although the set of significant genes did not overlap more than by chance, we found a strong correlation of effect sizes between PTSD BNST DEGs and UPMC DEGs from multiple brain regions, most strongly the OFC (r = 0.86, p = 2.2x10-16). Human BNST DEGs were not enriched for mouse DEGS (p = 0.1) and effect size estimates were not correlated (p>0.05). Although more data is needed, this study is the first step in understanding the role of human BNST gene expression and its role in PTSD genesis and maintenance.

Henry Bayly

Talks and presentations

Learning Regional Uncertainty to Calibrate Predictive Models in Clinical Settings

The Genetic Overlap Between Multisite Chronic Pain and PTSD Severity

Improving AD Clinical Trial Enrollment Using Machine Learning

Connecting the Dots: Overlapping Gene Expression Signatures in Human BNST and PTSD Across Brain Regions and Mouse Models