A note on phasing long genomic regions using local haplotype predictions

J Bioinform Comput Biol. 2006 Jun;4(3):639-47. doi: 10.1142/s0219720006002272.

Abstract

The common approaches for haplotype inference from genotype data are targeted toward phasing short genomic regions. Longer regions are often tackled in a heuristic manner, due to the high computational cost. Here, we describe a novel approach for phasing genotypes over long regions, which is based on combining information from local predictions on short, overlapping regions. The phasing is done in a way, which maximizes a natural maximum likelihood criterion. Among other things, this criterion takes into account the physical length between neighboring single nucleotide polymorphisms. The approach is very efficient and is applied to several large scale datasets and is shown to be successful in two recent benchmarking studies (Zaitlen et al., in press; Marchini et al., in preparation). Our method is publicly available via a webserver at http://research.calit2.net/hap/.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Databases, Genetic
  • Genome, Human*
  • Genotype
  • Haplotypes*
  • Humans
  • Likelihood Functions
  • Polymorphism, Single Nucleotide / genetics
  • Software*