Propensity score matching as an effective strategy for biomarker cohort design and omics data analysis

PLoS One. 2024 May 2;19(5):e0302109. doi: 10.1371/journal.pone.0302109. eCollection 2024.

Abstract

Background: Analysis of omics data that contain multidimensional biological and clinical information can be complex and make it difficult to deduce significance of specific biomarker factors.

Methods: We explored the utility of propensity score matching (PSM), a statistical technique for minimizing confounding factors and simplifying the examination of specific factors. We tested two datasets generated from cohorts of colorectal cancer (CRC) patients, one comprised of immunohistochemical analysis of 12 protein markers in 544 CRC tissues and another consisting of RNA-seq profiles of 163 CRC cases. We examined the efficiency of PSM by comparing pre- and post-PSM analytical results.

Results: Unlike conventional analysis which typically compares randomized cohorts of cancer and normal tissues, PSM enabled direct comparison between patient characteristics uncovering new prognostic biomarkers. By creating optimally matched groups to minimize confounding effects, our study demonstrates that PSM enables robust extraction of significant biomarkers while requiring fewer cancer cases and smaller overall patient cohorts.

Conclusion: PSM may emerge as an efficient and cost-effective strategy for multiomic data analysis and clinical trial design for biomarker discovery.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Extramural

MeSH terms

  • Biomarkers, Tumor* / genetics
  • Biomarkers, Tumor* / metabolism
  • Cohort Studies
  • Colorectal Neoplasms* / genetics
  • Data Analysis
  • Female
  • Humans
  • Male
  • Prognosis
  • Propensity Score*

Substances

  • Biomarkers, Tumor

Grants and funding

Michael H. Roehrl acknowledges grants from the National Institute of Health/National Cancer Institute (R21 CA251992, R21 CA263262, and U01 CA263986) and an Investigator Grant from the Neuroendocrine Tumor Research Foundation (NETRF). The funding bodies were not involved in the design of the study and the collection, analysis, and interpretation of data.