A tight link between orthologs and bidirectional best hits in bacterial and archaeal genomes

Genome Biol Evol. 2012;4(12):1286-94. doi: 10.1093/gbe/evs100.

Abstract

Orthologous relationships between genes are routinely inferred from bidirectional best hits (BBH) in pairwise genome comparisons. However, to our knowledge, it has never been quantitatively demonstrated that orthologs form BBH. To test this "BBH-orthology conjecture," we take advantage of the operon organization of bacterial and archaeal genomes and assume that, when two genes in compared genomes are flanked by two BBH show statistically significant sequence similarity to one another, these genes are bona fide orthologs. Under this assumption, we tested whether middle genes in "syntenic orthologous gene triplets" form BBH. We found that this was the case in more than 95% of the syntenic gene triplets in all genome comparisons. A detailed examination of the exceptions to this pattern, including maximum likelihood phylogenetic tree analysis, showed that some of these deviations involved artifacts of genome annotation, whereas very small fractions represented random assignment of the best hit to one of closely related in-paralogs, paralogous displacement in situ, or even less frequent genuine violations of the BBH-orthology conjecture caused by acceleration of evolution in one of the orthologs. We conclude that, at least in prokaryotes, genes for which independent evidence of orthology is available typically form BBH and, conversely, BBH can serve as a strong indication of gene orthology.

Publication types

  • Letter
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Artifacts
  • Escherichia coli / genetics*
  • Evolution, Molecular*
  • Genome, Archaeal*
  • Genome, Bacterial*
  • Haloarcula marismortui / genetics*
  • Likelihood Functions
  • Molecular Sequence Annotation
  • Operon / genetics
  • Phylogeny
  • Synteny