Go to JCI Insight
  • About
  • Editors
  • Consulting Editors
  • For authors
  • Publication ethics
  • Publication alerts by email
  • Advertising
  • Job board
  • Contact
  • Clinical Research and Public Health
  • Current issue
  • Past issues
  • By specialty
    • COVID-19
    • Cardiology
    • Gastroenterology
    • Immunology
    • Metabolism
    • Nephrology
    • Neuroscience
    • Oncology
    • Pulmonology
    • Vascular biology
    • All ...
  • Videos
    • ASCI Milestone Awards
    • Video Abstracts
    • Conversations with Giants in Medicine
  • Reviews
    • View all reviews ...
    • Clinical innovation and scientific progress in GLP-1 medicine (Nov 2025)
    • Pancreatic Cancer (Jul 2025)
    • Complement Biology and Therapeutics (May 2025)
    • Evolving insights into MASLD and MASH pathogenesis and treatment (Apr 2025)
    • Microbiome in Health and Disease (Feb 2025)
    • Substance Use Disorders (Oct 2024)
    • Clonal Hematopoiesis (Oct 2024)
    • View all review series ...
  • Viewpoint
  • Collections
    • In-Press Preview
    • Clinical Research and Public Health
    • Research Letters
    • Letters to the Editor
    • Editorials
    • Commentaries
    • Editor's notes
    • Reviews
    • Viewpoints
    • 100th anniversary
    • Top read articles

  • Current issue
  • Past issues
  • Specialties
  • Reviews
  • Review series
  • ASCI Milestone Awards
  • Video Abstracts
  • Conversations with Giants in Medicine
  • In-Press Preview
  • Clinical Research and Public Health
  • Research Letters
  • Letters to the Editor
  • Editorials
  • Commentaries
  • Editor's notes
  • Reviews
  • Viewpoints
  • 100th anniversary
  • Top read articles
  • About
  • Editors
  • Consulting Editors
  • For authors
  • Publication ethics
  • Publication alerts by email
  • Advertising
  • Job board
  • Contact
The BIOPREVENT machine-learning algorithm predicts chronic graft-versus-host disease and mortality risk using posttransplant biomarkers
Michael J. Martens, Debjani Dutta, Yongzi Yu, Lisa E. Rein, Jerome Ritz, Brent R. Logan, Sophie Paczesny
Michael J. Martens, Debjani Dutta, Yongzi Yu, Lisa E. Rein, Jerome Ritz, Brent R. Logan, Sophie Paczesny
View: Text | PDF
Clinical Research and Public Health Clinical Research Hematology

The BIOPREVENT machine-learning algorithm predicts chronic graft-versus-host disease and mortality risk using posttransplant biomarkers

  • Text
  • PDF
Abstract

BACKGROUND Chronic graft-versus-host disease (cGVHD) is a major contributor to nonrelapse mortality (NRM) following hematopoietic cell transplantation (HCT). Whether machine-learning (ML) models with biomarkers improve the accuracy for predicting future cGVHD/NRM is not established.METHODS We developed BIOPREVENT (BIOmarkers PREVENTion), a ML algorithm using data from 1,310 HCT recipients, incorporating 7 plasma proteins measured at Day 90/100 post-HCT and 9 clinical variables. Patients were divided into training and validation datasets. ML models — including CoxXGBoost, Group SCAD, Adaptive Group Lasso, Random Survival Forests, and Bayesian Additive Regression Trees (BART) — were used to estimate time-varying Area Under the ROC Curve (AUCt) at Days 180, 270, 360, and 540. Deep learning models were also evaluated.RESULTS ML models with biomarkers outperformed clinical-only models for predicting cGVHD, with BART and CoxXGBoost achieving AUCt greater than 0.65 at 1 year. For NRM, models with biomarkers achieved AUCt ranging from 0.75–0.91. Deep learning did not outperform other ML approaches. BART consistently demonstrated high predictive accuracy and was selected for the final BIOPREVENT model. Calibration curves aligned with observed values. Variable importance analysis identified MMP3 and CXCL9 as key for cGVHD prediction and IL1RL1 and sCD163 for NRM. Cumulative incidences of cGVHD and NRM differed significantly based on BIOPREVENT-defined cutpoints.CONCLUSION BIOPREVENT accurately predicts individual risk of future cGVHD and NRM using biomarkers at 3 months post-HCT. A publicly available R Shiny web application supports its clinical use. Further studies are needed to explore its role in guiding preemptive therapy.TRIAL REGISTRATION BMTCTN 0201, BMTCTN 1202, and NCT02194439.FUNDING R01CA264921, U10HL069294, U24HL138660, R01HD074587, and P01HL158505.

Authors

Michael J. Martens, Debjani Dutta, Yongzi Yu, Lisa E. Rein, Jerome Ritz, Brent R. Logan, Sophie Paczesny

×

Figure 3

Dynamic risk prediction performance and key predictors in machine learning models of moderate/severe chronic GVHD.

Options: View larger image (or click on image) Download as PowerPoint
Dynamic risk prediction performance and key predictors in machine learni...
(A) Time-varying Area Under the ROC curve (AUCt) for moderate/severe cGVHD risk machine learning (ML) models with biomarkers and clinical variable effects. AUCt estimated at days 180, 270, 360, and 540 after HCT in the validation dataset. ML models were fitted using the training dataset. For calculation of AUCt at a given time point, patients with moderate/severe cGVHD onset through that time are classified as cases, while patients who are still alive are classified as controls. (B) BART variable importance metric for model of moderate/severe cGVHD with biomarkers and clinical variables. Importance metric describes how often each variable is used for splitting within the trees of the BART ensemble. These are Dirichlet posterior probabilities (see Methods).

Copyright © 2026 American Society for Clinical Investigation
ISSN: 0021-9738 (print), 1558-8238 (online)

Sign up for email alerts