The importance of sequence diversity in the aggregation and evolution of proteins

Nature. 2005 Dec 8;438(7069):878-81. doi: 10.1038/nature04195.

Abstract

Incorrect folding of proteins, leading to aggregation and amyloid formation, is associated with a group of highly debilitating medical conditions including Alzheimer's disease and late-onset diabetes. The issue of how unwanted protein association is normally avoided in a living system is particularly significant in the context of the evolution of multidomain proteins, which account for over 70% of all eukaryotic proteins, where the effective local protein concentration in the vicinity of each domain is very high. Here we describe the aggregation kinetics of multidomain protein constructs of immunoglobulin domains and the ability of different homologous domains to aggregate together. We show that aggregation of these proteins is a specific process and that the efficiency of coaggregation between different domains decreases markedly with decreasing sequence identity. Thus, whereas immunoglobulin domains with more than about 70% identity are highly prone to coaggregation, those with less than 30-40% sequence identity do not detectably interact. A bioinformatics analysis of consecutive homologous domains in large multidomain proteins shows that such domains almost exclusively have sequence identities of less than 40%, in other words below the level at which coaggregation is likely to be efficient. We propose that such low sequence identities could have a crucial and general role in safeguarding proteins against misfolding and aggregation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amyloid / chemistry
  • Amyloid / metabolism
  • Connectin
  • Evolution, Molecular*
  • Fibronectins / chemistry
  • Fibronectins / classification
  • Fibronectins / metabolism
  • Humans
  • Immunoglobulins / chemistry
  • Immunoglobulins / metabolism
  • Kinetics
  • Light
  • Muscle Proteins / chemistry
  • Muscle Proteins / metabolism
  • Protein Binding
  • Protein Denaturation
  • Protein Folding
  • Protein Kinases / chemistry
  • Protein Kinases / metabolism
  • Protein Structure, Quaternary
  • Protein Structure, Tertiary
  • Proteins / chemistry*
  • Proteins / metabolism*
  • Scattering, Radiation

Substances

  • Amyloid
  • Connectin
  • Fibronectins
  • Immunoglobulins
  • Muscle Proteins
  • Proteins
  • TTN protein, human
  • Protein Kinases