Integrating Phenotypic Information of Obstructive Sleep Apnea and Deep Representation of Sleep-Event Sequences for Cardiovascular Risk Prediction

Res Sq [Preprint]. 2024 Mar 15:rs.3.rs-4084889. doi: 10.21203/rs.3.rs-4084889/v1.

Abstract

Background: Advances in mobile, wearable and machine learning (ML) technologies for gathering and analyzing long-term health data have opened up new possibilities for predicting and preventing cardiovascular diseases (CVDs). Meanwhile, the association between obstructive sleep apnea (OSA) and CV risk has been well-recognized. This study seeks to explore effective strategies of incorporating OSA phenotypic information and overnight physiological information for precise CV risk prediction in the general population.

Methods: 1,874 participants without a history of CVDs from the MESA dataset were included for the 5-year CV risk prediction. Four OSA phenotypes were first identified by the K-mean clustering based on static polysomnographic (PSG) features. Then several phenotype-agnostic and phenotype-specific ML models, along with deep learning (DL) models that integrate deep representations of overnight sleep-event feature sequences, were built for CV risk prediction. Finally, feature importance analysis was conducted by calculating SHapley Additive exPlanations (SHAP) values for all features across the four phenotypes to provide model interpretability.

Results: All ML models showed improved performance after incorporating the OSA phenotypic information. The DL model trained with the proposed phenotype-contrastive training strategy performed the best, achieving an area under the Receiver Operating Characteristic (ROC) curve of 0.877. Moreover, PSG and FOOD FREQUENCY features were recognized as significant CV risk factors across all phenotypes, with each phenotype emphasizing unique features.

Conclusion: Models that are aware of OSA phenotypes are preferred, and lifestyle factors should be a greater focus for precise CV prevention and risk management in the general population.

Keywords: Cardiovascular risk prediction; deep representation; model interpretability; obstructive sleep apnea phenotyping; phenotype-aware models; sleep event sequences.

Publication types

  • Preprint