Modelling the evolution of protein coding sequences sampled from Measurably Evolving Populations

Genome Inform. 2008:21:150-64.

Abstract

Models of nucleotide or amino acid sequence evolution that implement homogeneous and stationary Markov processes of substitutions are mathematically convenient but are unlikely to represent the true complexity of evolution. With the large amounts of data that next generation sequencing promises, appropriate models of evolution are important, particularly when data are collected from ancient and sub-fossil remains, where changes in evolutionary parameters are the norm and not the exception. In this paper, we describe a new codon-based model of evolution that applies to Measurably Evolving Populations (MEPs). A MEP is defined as a population from which it is possible to detect a statistically significant accumulation of substitutions when sequences are obtained at different times. The new model of codon evolution permits changes to the substitution process, including changes to the intensity of selection and the proportions of sites undergoing different selective pressures. In our serial model of codon evolution, changes in the selective regime occur simultaneously across all lineages. Different regions of the protein may also evolve under distinct selective patterns. We illustrate the application of the new model to a dataset of HIV-1 sequences obtained from an infected individual before and after the commencement of antiretroviral therapy.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acquired Immunodeficiency Syndrome / drug therapy
  • Acquired Immunodeficiency Syndrome / genetics
  • Amino Acid Substitution
  • Animals
  • Anti-HIV Agents / therapeutic use
  • Codon / genetics
  • DNA / chemistry
  • DNA / genetics
  • Eukaryota / genetics
  • Evolution, Molecular*
  • Gene Frequency
  • Genetic Variation
  • HIV-1 / drug effects
  • HIV-1 / genetics
  • Humans
  • Likelihood Functions
  • Models, Genetic
  • Probability
  • Proteins / chemistry
  • Proteins / genetics*

Substances

  • Anti-HIV Agents
  • Codon
  • Proteins
  • DNA