Predicting Protein Complex Structure from Surface-Induced Dissociation Mass Spectrometry Data

ACS Cent Sci. 2019 Aug 28;5(8):1330-1341. doi: 10.1021/acscentsci.8b00912. Epub 2019 Jul 2.

Abstract

Recently, mass spectrometry (MS) has become a viable method for elucidation of protein structure. Surface-induced dissociation (SID), colliding multiply charged protein complexes or other ions with a surface, has been paired with native MS to provide useful structural information such as connectivity and topology for many different protein complexes. We recently showed that SID gives information not only on connectivity and topology but also on relative interface strengths. However, SID has not yet been coupled with computational structure prediction methods that could use the sparse information from SID to improve the prediction of quaternary structures, i.e., how protein subunits interact with each other to form complexes. Protein-protein docking, a computational method to predict the quaternary structure of protein complexes, can be used in combination with subunit structures from X-ray crystallography and NMR in situations where it is difficult to obtain an experimental structure of an entire complex. While de novo structure prediction can be successful, many studies have shown that inclusion of experimental data can greatly increase prediction accuracy. In this study, we show that the appearance energy (AE, defined as 10% fragmentation) extracted from SID can be used in combination with Rosetta to successfully evaluate protein-protein docking poses. We developed an improved model to predict measured SID AEs and incorporated this model into a scoring function that combines the RosettaDock scoring function with a novel SID scoring term, which quantifies agreement between experiments and structures generated from RosettaDock. As a proof of principle, we tested the effectiveness of these restraints on 57 systems using ideal SID AE data (AE determined from crystal structures using the predictive model). When theoretical AEs were used, the RMSD of the selected structure improved or stayed the same in 95% of cases. When experimental SID data were incorporated on a different set of systems, the method predicted near-native structures (less than 2 Å root-mean-square deviation, RMSD, from native) for 6/9 tested cases, while unrestrained RosettaDock (without SID data) only predicted 3/9 such cases. Score versus RMSD funnel profiles were also improved when SID data were included. Additionally, we developed a confidence measure to evaluate predicted model quality in the absence of a crystal structure.