A comparison of strategies for analyzing dichotomous outcomes in genome-wide association studies with general pedigrees

Genet Epidemiol. 2011 Nov;35(7):650-7. doi: 10.1002/gepi.20614. Epub 2011 Aug 4.

Abstract

Genome-wide association studies (GWAS) have been frequently conducted on general or isolated populations with related individuals. However, there is a lack of consensus on which strategy is most appropriate for analyzing dichotomous phenotypes in general pedigrees. Using simulation studies, we compared several strategies including generalized estimating equations (GEE) strategies with various working correlation structures, generalized linear mixed model (GLMM), and a variance component strategy (denoted LMEBIN) that treats dichotomous outcomes as continuous with special attentions to their performance with rare variants, rare diseases, and small sample sizes. In our simulations, when the sample size is not small, for type I error, only GEE and LMEBIN maintain nominal type I error in most cases with exceptions for GEE with very rare disease and genetic variants. GEE and LMEBIN have similar statistical power and slightly outperform GLMM when the prevalence is low. In terms of computational efficiency, GEE with sandwich variance estimator outperforms GLMM and LMEBIN. We apply the strategies to GWAS of gout in the Framingham Heart Study. Based on our results, we would recommend using GEE ind-san in the GWAS for common variants and GEE ind-fij or LMEBIN for rare variants for GWAS of dichotomous outcomes with general pedigrees.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computer Simulation
  • Data Interpretation, Statistical
  • Genetic Predisposition to Disease
  • Genetic Variation
  • Genome-Wide Association Study* / statistics & numerical data
  • Gout / genetics
  • Humans
  • Linear Models
  • Models, Genetic*
  • Pedigree*
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Sample Size