Maximum-Likelihood Tree Estimation Using Codon Substitution Models with Multiple Partitions

Mol Biol Evol. 2015 Aug;32(8):2208-16. doi: 10.1093/molbev/msv097. Epub 2015 Apr 23.

Abstract

Many protein sequences have distinct domains that evolve with different rates, different selective pressures, or may differ in codon bias. Instead of modeling these differences by more and more complex models of molecular evolution, we present a multipartition approach that allows maximum-likelihood phylogeny inference using different codon models at predefined partitions in the data. Partition models can, but do not have to, share free parameters in the estimation process. We test this approach with simulated data as well as in a phylogenetic study of the origin of the leucin-rich repeat regions in the type III effector proteins of the pythopathogenic bacteria Ralstonia solanacearum. Our study does not only show that a simple two-partition model resolves the phylogeny better than a one-partition model but also gives more evidence supporting the hypothesis of lateral gene transfer events between the bacterial pathogens and its eukaryotic hosts.

Keywords: Markov model; amino acid substitution model; codon substitution model; maximum-likelihood tree.

MeSH terms

  • Bacterial Proteins / genetics*
  • Codon*
  • Models, Genetic*
  • Ralstonia solanacearum / genetics*

Substances

  • Bacterial Proteins
  • Codon