Leveraging genetically simple traits to identify small-effect variants for complex phenotypes

BMC Genomics. 2016 Nov 3;17(1):858. doi: 10.1186/s12864-016-3175-3.

Abstract

Background: Polymorphisms underlying complex traits often explain a small part (less than 1 %) of the phenotypic variance (σ2P). This makes identification of mutations underlying complex traits difficult, and usually only a subset of large-effect loci is identified. One approach to identifying more loci is to increase the sample size of experiments, but here we propose an alternative. The aim of this paper is to use secondary phenotypes for genetically simple traits during the QTL discovery phase for complex traits. We demonstrate this approach in a dairy cattle data set where the complex traits were milk production phenotypes (fat, milk and protein yield; fat and protein percentage in milk) measured on thousands of individuals, while the secondary (potentially genetically simpler) traits were detailed milk composition traits (measurements of individual protein abundance, mineral and sugar concentrations; and gene expression).

Results: Quantitative trait loci (QTL) were identified using 11,527 Holstein cattle with milk production records and up to 444 cows with milk composition traits. There were eight regions that contained QTL for both milk production and a composition trait, including four novel regions. One region on BTAU1 affected both milk yield and phosphorus concentration in milk. The QTL interval included the gene SLC37A1, a phosphorus antiporter. The most significant imputed sequence variants in this region explained 0.001 σ2P for milk yield, and 0.11 σ2P for phosphorus concentration. Since the polymorphisms were non-coding, association mapping for SLC37A1 gene expression was performed using high-depth mammary RNAseq data from a separate group of 371 lactating cows. This confirmed a strong eQTL for SLC37A1, with peak association at the same imputed sequence variants that were most significant for phosphorus concentration. Fitting any of these variants as covariables in the association analysis removed the QTL signal for milk production traits. Plausible causative mutations in the casein complex region were also identified using a similar strategy.
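
The conditional analysis mentioned in the Results (fitting a candidate variant as a covariable and checking whether the QTL signal at nearby variants disappears) can be illustrated with a minimal sketch on simulated data. Everything here is an assumption made for illustration: the sample size, allele frequency, effect size, the way the linked marker is generated, and the helper name `variant_effect` are hypothetical, and ordinary least squares stands in for whatever association model the study actually used.

```python
import numpy as np

def variant_effect(y, g, covariates=None):
    """Least-squares effect of genotype g on phenotype y,
    optionally adjusting for other variants fitted as covariables."""
    cols = [np.ones_like(y), g]
    if covariates is not None:
        cols.extend(covariates)
    X = np.column_stack(cols)
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta[1]  # coefficient on g (index 0 is the intercept)

rng = np.random.default_rng(0)
n = 5000  # hypothetical number of animals

# Hypothetical causal variant coded as 0/1/2 allele counts.
g_causal = rng.binomial(2, 0.3, size=n).astype(float)

# Hypothetical linked marker: identical to the causal variant
# except for roughly 10% of animals, whose genotypes are re-drawn.
g_linked = g_causal.copy()
redraw = rng.random(n) < 0.1
g_linked[redraw] = rng.binomial(2, 0.3, size=redraw.sum())

# Phenotype depends only on the causal variant plus noise.
y = 0.5 * g_causal + rng.normal(size=n)

# Single-variant analysis: the linked marker picks up the signal.
marginal = variant_effect(y, g_linked)

# Conditional analysis: with the causal variant fitted as a covariable,
# the linked marker's estimated effect shrinks toward zero.
conditional = variant_effect(y, g_linked, covariates=[g_causal])

print(f"marginal effect of linked marker:    {marginal:.3f}")
print(f"conditional effect of linked marker: {conditional:.3f}")
```

In this toy setting the linked marker shows a clear effect on its own but little or none once the causal variant is in the model, which is the pattern the Results describe for the milk production QTL after fitting the top phosphorus-associated variants.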

Conclusions: Milk production traits in dairy cows are typical complex traits where polymorphisms explain only a small portion of the phenotypic variance. However, here we show that these mutations can have larger effects on secondary traits, such as concentrations of minerals, proteins and sugars in the milk, and expression levels of genes in mammary tissue. These larger effects were used to successfully map variants for milk production traits. Genetically simple traits also provide a direct biological link between possible causal mutations and the effect of these mutations on milk production.

Keywords: Complex traits; Gene expression; Pleiotropy; QTL mapping.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Cattle
  • Gene Expression
  • Genetic Association Studies*
  • Genetic Variation*
  • Milk
  • Phenotype*
  • Quantitative Trait Loci
  • Quantitative Trait, Heritable*
  • Sequence Analysis, DNA