Evolutionary information hidden in a single protein structure

Proteins. 2012 Jun;80(6):1647-57. doi: 10.1002/prot.24058. Epub 2012 Mar 27.

Abstract

The knowledge of conserved sequences in proteins is valuable in identifying functionally or structurally important residues. Generating the conservation profile of a sequence requires aligning families of homologous sequences and having knowledge of their evolutionary relationships. Here, we report that the conservation profile at the residue level can be quantitatively derived from a single protein structure with only backbone information. We found that the reciprocal packing density profiles of protein structures closely resemble their sequence conservation profiles. For a set of 554 nonhomologous enzymes, 74% (408/554) of the proteins have a correlation coefficient > 0.5 between these two profiles. Our results indicate that the three-dimensional structure, instead of being a mere scaffold for positioning amino acid residues, exerts such strong evolutionary constraints on the residues of the protein that its profile of sequence conservation essentially reflects that of its structural characteristics.

MeSH terms

  • Catalytic Domain
  • Conserved Sequence*
  • Databases, Protein
  • Enzymes / chemistry*
  • Enzymes / genetics*
  • Evolution, Molecular*
  • Protein Conformation
  • Protein Structure, Tertiary
  • Protein Subunits
  • Sequence Alignment
  • Sequence Homology, Amino Acid

Substances

  • Enzymes
  • Protein Subunits