Combination of threading potentials and sequence profiles improves fold recognition

J Mol Biol. 2000 Mar 10;296(5):1319-31. doi: 10.1006/jmbi.2000.3541.

Abstract

Using a benchmark set of structurally similar proteins, we conduct a series of threading experiments intended to identify a scoring function with an optimal combination of contact-potential and sequence-profile terms. The benchmark set is selected to include many medium-difficulty fold recognition targets, where sequence similarity is undetectable by BLAST but structural similarity is extensive. The contact potential is based on the log-odds of non-local contacts involving different amino acid pairs, in native as opposed to randomly compacted structures. The sequence profile term is that used in PSI-BLAST. We find that combination of these terms significantly improves the success rate of fold recognition over use of either term alone, with respect to both recognition sensitivity and the accuracy of threading models. Improvement is greatest for targets between 10 % and 20 % sequence identity and 60 % to 80 % superimposable residues, where the number of models crossing critical accuracy and significance thresholds more than doubles. We suggest that these improvements account for the successful performance of the combined scoring function at CASP3. We discuss possible explanations as to why sequence-profile and contact-potential terms appear complementary.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms*
  • Computational Biology / methods*
  • Conserved Sequence
  • Evolution, Molecular
  • Protein Folding*
  • Proteins / chemistry*
  • Sensitivity and Specificity
  • Sequence Alignment / methods*
  • Sequence Homology, Amino Acid*
  • Software
  • Templates, Genetic
  • Thermodynamics

Substances

  • Proteins