SNPing away at candidate genes

Genet Epidemiol. 2001:21 Suppl 1:S643-8. doi: 10.1002/gepi.2001.21.s1.s643.

Abstract

We develop regression methodology to identify subsets of single nucleotide polymorphisms (SNPs) within candidate genes related to quantitative traits and apply our methods to the simulated Genetic Analysis Workshop (GAW) 12 data set. In the data set we find 694 SNP loci with minimum allele frequencies of at least 0.01. We assume an additive casual model between these SNPs and all five quantitative traits. After initial screening using one-way analysis of variance, we employ a computationally efficient, simulated annealing algorithm to select among all possible subsets of SNP loci, using a generalization of Mallows' Cp as our optimality criterion. The simple transition kernel we develop evaluates new subsets in O(1), by requiring just three arithmetic operations to calculate the proposed RSS based on the Gauss-Jordan pivot. We identify an SNP loci located at 6-5782 related to traits 2 and 3 and several sites on gene 2 related to trait 5 using a subsample of 1,000 individuals and the full data set (n = 8,250) for comparison.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Alleles
  • Chromosome Mapping / statistics & numerical data
  • Gene Frequency
  • Genetic Predisposition to Disease / genetics*
  • Humans
  • Models, Genetic*
  • Polymorphism, Single Nucleotide / genetics*
  • Quantitative Trait, Heritable*
  • Regression Analysis