A personalized probabilistic approach to ovarian cancer diagnostics

Gynecol Oncol. 2024 Mar:182:168-175. doi: 10.1016/j.ygyno.2023.12.030. Epub 2024 Jan 23.

Abstract

Objective: The identification/development of a machine learning-based classifier that utilizes metabolic profiles of serum samples to accurately identify individuals with ovarian cancer.

Methods: Serum samples collected from 431 ovarian cancer patients and 133 normal women at four geographic locations were analyzed by mass spectrometry. Reliable metabolites were identified using recursive feature elimination coupled with repeated cross-validation and used to develop a consensus classifier able to distinguish cancer from non-cancer. The probabilities assigned to individuals by the model were used to create a clinical tool that assigns a likelihood that an individual patient sample is cancer or normal.

Results: Our consensus classification model is able to distinguish cancer from control samples with 93% accuracy. The frequency distribution of individual patient scores was used to develop a clinical tool that assigns a likelihood that an individual patient does or does not have cancer.

Conclusions: An integrative approach using metabolomic profiles and machine learning-based classifiers has been employed to develop a clinical tool that assigns a probability that an individual patient does or does not have ovarian cancer. This personalized/probabilistic approach to cancer diagnostics is more clinically informative and accurate than traditional binary (yes/no) tests and represents a promising new direction in the early detection of ovarian cancer.

Keywords: Diagnostic; Machine learning; Metabolomics; Ovarian Cancer.

MeSH terms

  • Female
  • Humans
  • Machine Learning
  • Mass Spectrometry
  • Metabolomics
  • Ovarian Neoplasms* / diagnosis