A correlation-based feature analysis of physical examination indicators can help predict the overall underlying health status using machine learning

Sci Rep. 2022 Nov 15;12(1):19626. doi: 10.1038/s41598-022-20474-3.

Abstract

As a systematic investigation of the correlations between physical examination indicators (PEIs) is lacking, most PEIs are currently independently used for disease warning. This results in the general physical examination having limited diagnostic values. Here, we systematically analyzed the correlations in 221 PEIs between healthy and 34 unhealthy statuses in 803,614 individuals in China. Specifically, the study population included 711,928 healthy participants, 51,341 patients with hypertension, 12,878 patients with diabetes, and 34,997 patients with other unhealthy statuses. We found rich relevance between PEIs in the healthy physical status (7662 significant correlations, 31.5%). However, in the disease conditions, the PEI correlations changed. We focused on the difference in PEIs between healthy and 35 unhealthy physical statuses and found 1239 significant PEI differences, suggesting that they could be candidate disease markers. Finally, we established machine learning algorithms to predict health status using 15-16% of the PEIs through feature extraction, reaching a 66-99% accurate prediction, depending on the physical status. This new reference of the PEI correlation provides rich information for chronic disease diagnosis. The developed machine learning algorithms can fundamentally affect the practice of general physical examinations.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • China
  • Health Status*
  • Humans
  • Machine Learning*
  • Physical Examination