Whole-genome sequencing to understand the genetic architecture of common gene expression and biomarker phenotypes

Hum Mol Genet. 2015 Mar 1;24(5):1504-12. doi: 10.1093/hmg/ddu560. Epub 2014 Nov 6.

Abstract

Initial results from sequencing studies suggest that there are relatively few low-frequency (<5%) variants associated with large effects on common phenotypes. We performed low-pass whole-genome sequencing in 680 individuals from the InCHIANTI study to test two primary hypotheses: (i) that sequencing would detect single low-frequency, large-effect variants that explained similar amounts of phenotypic variance as single common variants, and (ii) that some common variant associations could be explained by low-frequency variants. We tested two sets of disease-related common phenotypes for which we had statistical power to detect large numbers of common variant-common phenotype associations: 11 132 cis-gene expression traits in 450 individuals and 93 circulating biomarkers in all 680 individuals. From a total of 11 657 229 high-quality variants, of which 6 129 221 were common and 5 528 008 were low frequency (<5%), low-frequency, large-effect associations comprised 7% of detectable cis-gene expression traits [89 of 1314 cis-eQTLs at P < 1 × 10⁻⁶ (false discovery rate ∼5%)] and one of eight biomarker associations at P < 8 × 10⁻¹⁰. Very few (30 of 1232; 2%) common variant associations were fully explained by low-frequency variants. Our data show that whole-genome sequencing can identify low-frequency variants undetected by genotyping-based approaches when sample sizes are sufficiently large to detect substantial numbers of common variant associations, and that common variant associations are rarely explained by single low-frequency variants of large effect.
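
Hypothesis (i) turns on how allele frequency and effect size trade off in the variance a single variant explains. The sketch below illustrates that trade-off with the standard additive-model approximation, 2 × MAF × (1 − MAF) × β². It assumes Hardy-Weinberg equilibrium and a phenotype standardized to unit variance. It is not the analysis pipeline used in the study, and the function names, allele frequencies and effect sizes are hypothetical values chosen for illustration only.

```python
import math


def variance_explained(maf, beta):
    """Fraction of phenotypic variance explained by one variant.

    Assumes an additive model, Hardy-Weinberg equilibrium, and a phenotype
    standardized to unit variance, so beta is in standard-deviation units
    per allele. (Illustrative helper, not taken from the paper.)
    """
    return 2.0 * maf * (1.0 - maf) * beta ** 2


def beta_for_equal_variance(maf_common, beta_common, maf_low):
    """Per-allele effect a low-frequency variant needs in order to explain
    the same variance as a given common variant (same assumptions)."""
    target = variance_explained(maf_common, beta_common)
    return math.sqrt(target / (2.0 * maf_low * (1.0 - maf_low)))


# Hypothetical example values, not results from the study.
common_maf, common_beta = 0.30, 0.25
low_maf = 0.02

common_r2 = variance_explained(common_maf, common_beta)
needed_beta = beta_for_equal_variance(common_maf, common_beta, low_maf)

print(f"Common variant explains {common_r2:.4f} of phenotypic variance")
print(f"A variant at MAF {low_maf} needs |beta| of about {needed_beta:.2f} SD to match")
```

Under these assumptions, the lower the allele frequency, the larger the per-allele effect required to match a common variant, which is why such variants are described as "low-frequency, large-effect".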

Publication types

  • Research Support, N.I.H., Intramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Female
  • Gene Frequency
  • Genetic Association Studies / methods*
  • Genetic Markers*
  • Genetic Variation
  • Genome, Human
  • Genotyping Techniques
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Male
  • Middle Aged
  • Phenotype*
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci
  • Young Adult

Substances

  • Genetic Markers