An evaluation of the genetic-matched pair study design using genome-wide SNP data from the European population

Eur J Hum Genet. 2009 Jul;17(7):967-75. doi: 10.1038/ejhg.2008.266. Epub 2009 Jan 21.

Abstract

Genetic matching potentially provides a means to alleviate the effects of incomplete Mendelian randomization in population-based gene-disease association studies. We therefore evaluated the genetic-matched pair study design on the basis of genome-wide SNP data (309,790 markers; Affymetrix GeneChip Human Mapping 500K Array) from 2457 individuals, sampled at 23 different recruitment sites across Europe. Using pair-wise identity-by-state (IBS) as a matching criterion, we tried to derive a subset of markers that would allow identification of the best overall matching (BOM) partner for a given individual, based on the IBS status for the subset alone. However, our results suggest that, by following this approach, the prediction accuracy is only notably improved by the first 20 markers selected, and increases proportionally to the marker number thereafter. Furthermore, in a considerable proportion of cases (76.0%), the BOM of a given individual, based on the complete marker set, came from a different recruitment site than the individual itself. A second marker set, specifically selected for ancestry sensitivity using singular value decomposition, performed even more poorly and was no more capable of predicting the BOM than randomly chosen subsets. This leads us to conclude that, at least in Europe, the utility of the genetic-matched pair study design depends critically on the availability of comprehensive genotype information for both cases and controls.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA / analysis
  • DNA / genetics
  • Europe
  • Female
  • Genetic Markers
  • Genetic Variation
  • Genome, Human*
  • Genome-Wide Association Study*
  • Humans
  • Male
  • Matched-Pair Analysis*
  • Polymorphism, Single Nucleotide*
  • Population Groups / genetics
  • Research Design
  • Sequence Analysis, DNA

Substances

  • Genetic Markers
  • DNA