Two-Variance-Component Model Improves Genetic Prediction in Family Datasets

Am J Hum Genet. 2015 Nov 5;97(5):677-90. doi: 10.1016/j.ajhg.2015.10.002.

Abstract

Genetic prediction based on either identity by state (IBS) sharing or pedigree information has been investigated extensively with best linear unbiased prediction (BLUP) methods. Such methods were pioneered in plant and animal-breeding literature and have since been applied to predict human traits, with the aim of eventual clinical utility. However, methods to combine IBS sharing and pedigree information for genetic prediction in humans have not been explored. We introduce a two-variance-component model for genetic prediction: one component for IBS sharing and one for approximate pedigree structure, both estimated with genetic markers. In simulations using real genotypes from the Candidate-gene Association Resource (CARe) and Framingham Heart Study (FHS) family cohorts, we demonstrate that the two-variance-component model achieves gains in prediction r(2) over standard BLUP at current sample sizes, and we project, based on simulations, that these gains will continue to hold at larger sample sizes. Accordingly, in analyses of four quantitative phenotypes from CARe and two quantitative phenotypes from FHS, the two-variance-component model significantly improves prediction r(2) in each case, with up to a 20% relative improvement. We also find that standard mixed-model association tests can produce inflated test statistics in datasets with related individuals, whereas the two-variance-component model corrects for inflation.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Cardiovascular Diseases / diagnosis*
  • Cardiovascular Diseases / genetics
  • Computer Simulation
  • Datasets as Topic
  • Family
  • Genetic Association Studies
  • Genetic Markers*
  • Genome-Wide Association Study*
  • Genomics / methods
  • Humans
  • Models, Genetic*
  • Models, Statistical*
  • Phenotype
  • Polymorphism, Single Nucleotide / genetics
  • Principal Component Analysis
  • Quantitative Trait Loci*
  • Selection, Genetic / genetics

Substances

  • Genetic Markers