The accuracy of NMR protein structures in the Protein Data Bank

Structure. 2021 Dec 2;29(12):1430-1439.e2. doi: 10.1016/j.str.2021.07.001. Epub 2021 Jul 30.

Abstract

The program ANSURR measures the accuracy of NMR structures by comparing rigidity obtained from experimental backbone chemical shifts and from structures. We report on ANSURR analysis of 7,000 PDB NMR ensembles within the Protein Data Bank, which can be found at ansurr.com. The accuracy of NMR structures progressively improved up until 2005, but since then, it has plateaued. Most structures have accurate secondary structure, but are generally too floppy, particularly in loops. Thus, there is a need for more experimental restraints in loops. Currently, the best predictors of accuracy are Ramachandran distribution and the number of NOE restraints per residue. The precision of structures within the ensemble correlates well with accuracy, as does the number of hydrogen bond restraints per residue. Structure accuracy is improved when other components (such as additional polypeptide chains or ligands) are included.

Keywords: NMR; PDB; chemical shift; experimental restraints; protein structure; rigidity; validation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Protein*
  • Magnetic Resonance Spectroscopy*
  • Models, Molecular*
  • Protein Conformation*