Bayesian integrative analysis and prediction with application to atherosclerosis cardiovascular disease

Biostatistics. 2022 Dec 12;24(1):124-139. doi: 10.1093/biostatistics/kxab016.

Abstract

The problem of associating data from multiple sources and predicting an outcome simultaneously is an important one in modern biomedical research. It has potential to identify multidimensional array of variables predictive of a clinical outcome and to enhance our understanding of the pathobiology of complex diseases. Incorporating functional knowledge in association and prediction models can reveal pathways contributing to disease risk. We propose Bayesian hierarchical integrative analysis models that associate multiple omics data, predict a clinical outcome, allow for prior functional information, and can accommodate clinical covariates. The models, motivated by available data and the need for exploring other risk factors of atherosclerotic cardiovascular disease (ASCVD), are used for integrative analysis of clinical, demographic, and genomics data to identify genetic variants, genes, and gene pathways likely contributing to 10-year ASCVD risk in healthy adults. Our findings revealed several genetic variants, genes, and gene pathways that are highly associated with ASCVD risk, with some already implicated in cardiovascular disease (CVD) risk. Extensive simulations demonstrate the merit of joint association and prediction models over two-stage methods: association followed by prediction.

Keywords: Bayesian variable selection; Biological information; Cardiovascular disease; Factor analysis; Integrative analysis; Joint association and prediction.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Atherosclerosis* / etiology
  • Atherosclerosis* / genetics
  • Bayes Theorem
  • Cardiovascular Diseases* / etiology
  • Cardiovascular Diseases* / genetics
  • Genomics / methods
  • Humans
  • Risk Assessment
  • Risk Factors