Machine learning-driven identification of the gene-expression signature associated with a persistent multiple organ dysfunction trajectory in critical illness

EBioMedicine. 2024 Jan:99:104938. doi: 10.1016/j.ebiom.2023.104938. Epub 2023 Dec 23.

Abstract

Background: Multiple organ dysfunction syndrome (MODS) disproportionately drives morbidity and mortality among critically ill patients. However, we lack a comprehensive understanding of its pathobiology. Identification of genes associated with a persistent MODS trajectory may shed light on underlying biology and allow for accurate prediction of those at-risk.

Methods: Secondary analyses of publicly available gene-expression datasets. Supervised machine learning (ML) was used to identify a parsimonious set of genes associated with a persistent MODS trajectory in a training set of pediatric septic shock. We optimized model parameters and tested risk-prediction capabilities in independent validation and test datasets, respectively. We compared model performance relative to an established gene-set predictive of sepsis mortality.

Findings: Patients with a persistent MODS trajectory had 568 differentially expressed genes and characterized by a dysregulated innate immune response. Supervised ML identified 111 genes associated with the outcome of interest on repeated cross-validation, with an AUROC of 0.87 (95% CI: 0.85-0.88) in the training set. The optimized model, limited to 20 genes, achieved AUROCs ranging from 0.74 to 0.79 in the validation and test sets to predict those with persistent MODS, regardless of host age and cause of organ dysfunction. Our classifier demonstrated reproducibility in identifying those with persistent MODS in comparison with a published gene-set predictive of sepsis mortality.

Interpretation: We demonstrate the utility of supervised ML driven identification of the genes associated with persistent MODS. Pending validation in enriched cohorts with a high burden of organ dysfunction, such an approach may inform targeted delivery of interventions among at-risk patients.

Funding: H.R.W.'s NIHR35GM126943 award supported the work detailed in this manuscript. Upon his death, the award was transferred to M.N.A. M.R.A., N.S.P, and R.K were supported by NIHR21GM151703. R.K. was supported by R01GM139967.

Keywords: Critical illness; Multiple organ dysfunction syndrome; Precision medicine; Sepsis; Supervised machine learning.

MeSH terms

  • Child
  • Critical Illness
  • Humans
  • Machine Learning
  • Multiple Organ Failure* / genetics
  • Reproducibility of Results
  • Sepsis* / complications
  • Sepsis* / genetics