Summary report: Missing data and pedigree and genotyping errors

Genet Epidemiol. 2003:25 Suppl 1:S36-42. doi: 10.1002/gepi.10282.

Abstract

Genetic epidemiology is faced with mapping complex traits to genes with relatively small effects whose phenotypes may be modulated by temporal factors. To do this, detailed and accurate data must be available on families, perhaps collected over time. The Framingham Heart Study data supplied to Genetic Analysis Workshop 13 (GAW13), along with its simulated counterpart, contain longitudinal measurements and genomic scan data on 2,885 individuals in 330 families, and offer an opportunity to examine data quality and completeness issues as they affect analytical conclusions. Six GAW13 contributions applied methods to deal with missing data, both phenotypic and genotypic, at a single time point and longitudinally, and with possible errors in pedigree structure and genotypes. The methods included missing phenotypic data imputation by Markov chain Monte Carlo sampling, propensity scoring, regression, and adjusted mean values, as well as the assessment of transmission-disequilibrium tests when missing marker data may be allele-specific. Pedigree structural errors were found by genome-wide allele-sharing probabilities, while Mendelian consistent genotype errors were evaluated through likelihoods of double-recombination events. Each of the methods reviewed here offered insights into how to better take advantage of large, time-dependent, familial data sets. However, no one of them dealt with the longitudinal and familial aspects simultaneously. Overall, more consideration needs to be given to the effects that missing data and data errors have on our ability to map complex traits efficiently and accurately.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Cardiovascular Diseases / epidemiology*
  • Cardiovascular Diseases / genetics*
  • Genetic Linkage
  • Genotype
  • Humans
  • Longitudinal Studies
  • Models, Genetic*
  • Models, Statistical*
  • Pedigree
  • Phenotype
  • Research Design*