Fast eQTL Analysis for Twin Studies

Genet Epidemiol. 2015 Jul;39(5):357-65. doi: 10.1002/gepi.21900. Epub 2015 Apr 10.

Abstract

Twin data are commonly used for studying complex psychiatric disorders, and mixed effects models are one of the most popular tools for modeling dependence structures between twin pairs. However, for eQTL (expression quantitative trait loci) data where associations between thousands of transcripts and millions of single nucleotide polymorphisms need to be tested, mixed effects models are computationally inefficient and often impractical. In this paper, we propose a fast eQTL analysis approach for twin eQTL data where we randomly split twin pairs into two groups, so that within each group the samples are unrelated, and we then apply a multiple linear regression analysis separately to each group. A score statistic that automatically adjusts the (hidden) correlation between the two groups is constructed for combining the results from the two groups. The proposed method has well-controlled type I error. Compared to mixed effects models, the proposed method has similar power but drastically improved computational efficiency. We demonstrate the computational advantage of the proposed method via extensive simulations. The proposed method is also applied to a large twin eQTL data from the Netherlands Twin Register.

Keywords: high correlation; matrix representation; mixed effects model; score statistic; twin eQTL data.

Publication types

  • Twin Study

MeSH terms

  • Algorithms
  • Computational Biology
  • Computer Simulation
  • Data Interpretation, Statistical*
  • Humans
  • Models, Genetic*
  • Polymorphism, Single Nucleotide / genetics*
  • Quantitative Trait Loci*