Unstable DNA repair genes shaped by their own sequence modifying phenotypes

J Mol Evol. 2010 Mar;70(3):266-74. doi: 10.1007/s00239-010-9328-0. Epub 2010 Mar 6.

Abstract

The question of whether natural selection favors genetic stability or genetic variability is a fundamental problem in evolutionary biology. Bioinformatic analyses demonstrate that selection favors genetic stability by avoiding unstable nucleotide sequences in protein encoding DNA. Yet, such unstable sequences are maintained in several DNA repair genes, thereby promoting breakdown of repair and destabilizing the genome. Several studies have therefore argued that selection favors genetic variability at the expense of stability. Here we propose a new evolutionary mechanism, with supporting bioinformatic evidence, that resolves this paradox. Combining the concepts of gene-dependent mutation biases and meiotic recombination, we argue that unstable sequences in the DNA mismatch repair (MMR) genes are maintained by their own phenotype. In particular, we predict that human MMR maintains an overrepresentation of mononucleotide repeats (monorepeats) within and around the MMR genes. In support of this hypothesis, we report a 31% excess in monorepeats in 250 kb regions surrounding the seven MMR genes compared to all other RefSeq genes (1.75 vs. 1.34%, P = 0.0047), with a particularly high content in PMS2 (2.41%, P = 0.0047) and MSH6 (2.07%, P = 0.043). Based on a mathematical model of monorepeat frequency, we argue that the proposed mechanism may suffice to explain the observed excess of repeats around MMR genes. Our findings thus indicate that unstable sequences in MMR genes are maintained through evolution by the MMR mechanism. The evolutionary paradox of genetically unstable DNA repair genes may thus be explained by an equilibrium in which the phenotype acts back on its own genotype.

MeSH terms

  • Base Sequence / physiology*
  • DNA Repair / genetics*
  • Evolution, Molecular
  • Gene Frequency
  • Genes / physiology
  • Genetic Variation / physiology*
  • Genomic Instability / physiology*
  • Humans
  • Models, Biological
  • Models, Genetic
  • Models, Theoretical
  • Phenotype
  • Repetitive Sequences, Nucleic Acid / genetics
  • Sequence Analysis, DNA