Order in Disorder as Observed by the "Hydrophobic Cluster Analysis" of Protein Sequences

Proteomics. 2018 Nov;18(21-22):e1800054. doi: 10.1002/pmic.201800054. Epub 2018 Oct 30.

Abstract

Hydrophobic cluster analysis (HCA) is an original approach for protein sequence analysis, which provides access to the foldable repertoire of the protein universe, including as yet unannotated protein segments ("dark proteome"). Foldable segments correspond to ordered regions, as well as to intrinsically disordered regions (IDRs) undergoing disorder-to-order transitions. This review illustrates how HCA can give insight into this last category of foldable segments, with examples matching known 3D structures. After reviewing the HCA principles, examples of short foldable segments are given, which often contain short linear motifs, typically matching hydrophobic clusters. These segments become ordered upon contact with partners, with secondary structure preferences generally corresponding to those observed in the 3D structures within the complexes. Such small foldable segments are sometimes larger than the segments of known 3D structures, including flanking hydrophobic clusters that may be critical for interaction specificity or regulation, as well as intervening sequences allowing fuzziness. Cases of larger conditionally disordered domains are also presented; these domains have either a lower density of hydrophobic clusters than well-folded globular domains or exposed hydrophobic patches, and are stabilized by interaction with partners.

Keywords: HCA; dark proteome; disorder; foldability; secondary structure.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Cluster Analysis*
  • Hydrophobic and Hydrophilic Interactions
  • Protein Structure, Secondary
  • Sequence Analysis, Protein / methods*