Consensus sequence design as a general strategy to create hyperstable, biologically active proteins

Proc Natl Acad Sci U S A. 2019 Jun 4;116(23):11275-11284. doi: 10.1073/pnas.1816707116. Epub 2019 May 20.

Abstract

Consensus sequence design offers a promising strategy for designing proteins of high stability while retaining biological activity, since it draws upon an evolutionary history in which residues important for both stability and function are likely to be conserved. Although there have been several reports of successful consensus design of individual targets, it is unclear from these anecdotal studies how often this approach succeeds and how often it fails. Here, we attempt to assess generality by designing consensus sequences for a set of six protein families with a range of chain lengths, structures, and activities. We characterize the resulting consensus proteins for stability, structure, and biological activities in an unbiased way. We find that all six consensus proteins adopt cooperatively folded structures in solution. Strikingly, four of the six consensus proteins show increased thermodynamic stability over naturally occurring homologs. Each consensus protein tested for function maintained at least partial biological activity. Although peptide binding affinity by a consensus-designed SH3 is rather low, Km values for consensus enzymes are similar to values from extant homologs. Although consensus enzymes are slower than extant homologs at low temperature, they are faster than some thermophilic enzymes at high temperature. An analysis of sequence properties shows consensus proteins to be enriched in charged residues and depleted in uncharged polar residues. Sequence differences between consensus and extant homologs are predominantly located at weakly conserved surface residues, highlighting the importance of these residues in the success of the consensus strategy.
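The core idea of consensus design, taking the most frequent residue at each position of a multiple sequence alignment, can be illustrated with a minimal Python sketch. This is not the authors' pipeline, which the abstract does not describe; the function names, the gap threshold of 0.5, the tie-breaking rule, the choice of D, E, K, and R as the charged set, and the toy alignment are all assumptions made here for illustration.

```python
from collections import Counter

GAP = "-"


def consensus_sequence(alignment, max_gap_fraction=0.5):
    """Return the most frequent residue at each aligned column.

    Columns whose gap fraction exceeds max_gap_fraction are dropped
    (an assumed threshold, not one taken from the paper).
    """
    if not alignment:
        raise ValueError("alignment is empty")
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")

    consensus = []
    for column in zip(*alignment):
        gap_fraction = column.count(GAP) / len(column)
        if gap_fraction > max_gap_fraction:
            continue  # mostly-gap column: treat as an insertion and skip it
        counts = Counter(res for res in column if res != GAP)
        if not counts:
            continue  # column contains only gaps
        # Most frequent residue; ties are broken alphabetically so the
        # result is deterministic (an arbitrary illustrative choice).
        residue = min(counts, key=lambda r: (-counts[r], r))
        consensus.append(residue)
    return "".join(consensus)


def charged_fraction(sequence, charged="DEKR"):
    """Fraction of residues in the assumed charged set (D, E, K, R)."""
    if not sequence:
        raise ValueError("sequence is empty")
    return sum(res in charged for res in sequence) / len(sequence)


# Hypothetical toy alignment, not data from the study.
toy_alignment = [
    "MKE-LV",
    "MRE-LI",
    "MKDALV",
    "MKE-LV",
]

design = consensus_sequence(toy_alignment)
print(design)                    # consensus of the toy alignment
print(charged_fraction(design))  # share of D/E/K/R in the consensus
```

Comparing `charged_fraction` for a consensus sequence against the same quantity for each extant homolog is one simple way to examine the kind of enrichment in charged residues that the abstract reports. A real analysis would likely also weight sequences to correct for uneven sampling of the family, which this sketch omits.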

Keywords: consensus sequence; protein design; protein stability.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Consensus Sequence / genetics*
  • Proteins / genetics*
  • Temperature
  • Thermodynamics

Substances

  • Proteins