Detecting Recombination Hotspots from Patterns of Linkage Disequilibrium

G3 (Bethesda). 2016 Aug 9;6(8):2265-71. doi: 10.1534/g3.116.029587.

Abstract

With recent advances in DNA sequencing technologies, it has become increasingly easy to use whole-genome sequencing of unrelated individuals to assay patterns of linkage disequilibrium (LD) across the genome. One type of analysis that is commonly performed is to estimate local recombination rates and identify recombination hotspots from patterns of LD. One method for detecting recombination hotspots, LDhot, has been used in a handful of species to further our understanding of the basic biology of recombination. For the most part, the effectiveness of this method (e.g., power and false positive rate) is unknown. In this study, we run extensive simulations to compare the effectiveness of three different implementations of LDhot. We find large differences in the power and false positive rates of these different approaches, as well as a strong sensitivity to the window size used (with smaller window sizes leading to more accurate estimation of hotspot locations). We also compared our LDhot simulation results with comparable simulation results obtained from a Bayesian maximum-likelihood approach for identifying hotspots. Surprisingly, we found that the latter computationally intensive approach had substantially lower power over the parameter values considered in our simulations.

Keywords: composite likelihood; linkage disequilibrium; recombination hotspots.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Bayes Theorem
  • Computer Simulation
  • Genome, Human / genetics*
  • Humans
  • Likelihood Functions
  • Linkage Disequilibrium*
  • Models, Genetic*
  • Polymorphism, Single Nucleotide
  • Recombination, Genetic*
  • Sequence Analysis, DNA