The Prevalence and Evolutionary Conservation of Inverted Repeats in Proteobacteria

Genome Biol Evol. 2018 Mar 1;10(3):918-927. doi: 10.1093/gbe/evy044.

Abstract

Perfect short inverted repeats (IRs) are known to be enriched in a variety of bacterial and eukaryotic genomes. Currently, it is unclear whether perfect IRs are conserved over evolutionary time scales. In this study, we aimed to characterize the prevalence and evolutionary conservation of IRs across 20 proteobacterial strains. We first identified IRs in Escherichia coli K-12 substr MG1655 and showed that they are overabundant. We next aimed to test whether this overabundance is reflected in the conservation of IRs over evolutionary time scales. To this end, for each perfect IR identified in E. coli MG1655, we collected orthologous sequences from related proteobacterial genomes. We next quantified the evolutionary conservation of these IRs, that is, the presence of the exact same IR across orthologous regions. We observed high conservation of perfect IRs: out of the 234 examined orthologous regions, 145 were more conserved than expected, which is statistically significant even after correcting for multiple testing. Our results together with previous experimental findings support a model in which imperfect IRs are corrected to perfect IRs in a preferential manner via a template switching mechanism.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Conserved Sequence / genetics*
  • Escherichia coli / genetics
  • Evolution, Molecular*
  • Genome, Bacterial / genetics
  • Inverted Repeat Sequences / genetics*
  • Proteobacteria / genetics