Identifying rarer genetic variants for common complex diseases: diseased versus neutral discovery panels

Ann Hum Genet. 2009 Jan;73(1):54-60. doi: 10.1111/j.1469-1809.2008.00483.x.

Abstract

The power of genetic association studies to identify disease susceptibility alleles fundamentally relies on the variants studied. The standard approach is to determine a set of tagging SNPs (tSNPs) that capture the majority of genomic variation in regions of interest by exploiting local correlation structures. Typically, tSNPs are selected from neutral discovery panels, that is, collections of individuals comprehensively genotyped across a region. We investigated the implications of discovery panel design on tSNP performance in association studies using realistically simulated sequence data. We found that discovery panels of 24 sequenced 'neutral' individuals (similar to NIEHS or HapMap ENCODE data) were sufficient to select well-powered tSNPs to identify common susceptibility alleles. For less common alleles (0.01-0.05 frequency) we found neutral panels of this size inadequate, particularly if low-frequency variants were removed prior to tSNP selection; superior tSNPs were found using panels of diseased individuals. Only large neutral panels (200 individuals) matched diseased panel performance in selecting well-powered tSNPs to detect both common and rarer alleles. The 1000 Genomes Project initiative may provide the larger neutral panels necessary to identify rarer susceptibility alleles in association studies. In the interim, our results suggest investigators can boost power to detect such alleles by sequencing diseased individuals for tSNP selection.
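
As an illustration of how tSNPs can be chosen by exploiting local correlation, the following minimal sketch greedily picks SNPs until every candidate SNP in a discovery panel is correlated with some chosen tSNP at or above an r² threshold. It is not the selection procedure used in the study, which the abstract does not specify. The function names, the 0.8 threshold, the optional minor-allele-frequency filter, and the toy eight-haplotype panel are all assumptions made for the example.

```python
# Hypothetical sketch of greedy pairwise-r^2 tag SNP selection.
# A panel is a list of haplotypes; each haplotype is a list of 0/1 alleles.

def allele_freq(haps, j):
    """Frequency of allele 1 at SNP j in the panel."""
    return sum(h[j] for h in haps) / len(haps)

def r_squared(haps, i, j):
    """Squared correlation (r^2) between SNPs i and j across haplotypes."""
    p_i = allele_freq(haps, i)
    p_j = allele_freq(haps, j)
    denom = p_i * (1 - p_i) * p_j * (1 - p_j)
    if denom == 0:
        return 0.0  # at least one site is monomorphic in this panel
    p_ij = sum(h[i] * h[j] for h in haps) / len(haps)
    d = p_ij - p_i * p_j  # linkage disequilibrium coefficient D
    return d * d / denom

def select_tag_snps(haps, r2_min=0.8, maf_min=0.0):
    """Greedily choose tSNPs so every candidate SNP is tagged at r^2 >= r2_min.

    SNPs that are monomorphic or have minor allele frequency below maf_min
    are dropped before selection (assumed filter, for illustration only).
    """
    candidates = []
    for j in range(len(haps[0])):
        f = allele_freq(haps, j)
        maf = min(f, 1 - f)
        if maf > 0 and maf >= maf_min:
            candidates.append(j)
    # For each candidate, the set of candidates it would tag (itself included).
    tags_of = {
        i: {j for j in candidates if i == j or r_squared(haps, i, j) >= r2_min}
        for i in candidates
    }
    untagged = set(candidates)
    tags = []
    while untagged:
        # Pick the SNP covering the most untagged SNPs; ties go to lowest index.
        best = max(candidates, key=lambda i: (len(tags_of[i] & untagged), -i))
        tags.append(best)
        untagged -= tags_of[best]
    return tags

def best_r2(haps, tags, target):
    """Highest r^2 between a target SNP and any selected tSNP."""
    return max((r_squared(haps, t, target) for t in tags), default=0.0)

# Toy panel: SNPs 0 and 1 are identical, SNP 2 is independent, SNP 3 is rare.
panel = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [1, 1, 1, 0],
    [0, 0, 1, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 1],
]

all_tags = select_tag_snps(panel)                   # [0, 2, 3]
common_tags = select_tag_snps(panel, maf_min=0.2)   # [0, 2]
rare_capture = best_r2(panel, common_tags, 3)       # about 0.14
```

In this toy panel the rare SNP (index 3) is only weakly correlated with the common SNPs, so filtering it out before selection leaves it poorly captured by the chosen tSNPs. This mirrors, in miniature, the concern about removing low-frequency variants prior to tSNP selection.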

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Disease / genetics*
  • Genetic Predisposition to Disease
  • Genetic Techniques*
  • Genetic Variation*
  • Genetics, Population
  • Genome, Human*
  • Haplotypes
  • Humans
  • Models, Genetic
  • Polymorphism, Single Nucleotide*