Detection and verification of 2.3 million cancer mutations in NCI60 cancer cell lines with a cloud search engine

J Proteomics. 2019 Oct 30:209:103488. doi: 10.1016/j.jprot.2019.103488. Epub 2019 Aug 21.

Abstract

Today we have unprecedented access to human genomic and proteomic data that appear to be rapidly approaching our current understanding of comprehensive coverage. Combining genomic information with shotgun proteomics remains challenging due to the large increase in proteomics search space. However, making this connection between genomic and proteomic information is critical for cancer studies to vaccine development. Furthermore, as we progress towards personalized medicine, it will be essential for proteomics analysis to identify individual mutations and variants in order to fully understand protein networks and to develop personalized therapies. While these advantages are well-established, only a few studies have demonstrated the successful integration of proteomic data with large genomic input. We present and examine the abilities of Bolt, a new cloud-based proteomics search engine to search for the presence of over 2.3 million known cancer mutations in a matter of minutes while still performing a standard proteomics search that includes 31 post translational modifications. We use previously published proteomics data sets and identify mutations that are verified using genomic studies as well as previous proteomics efforts. Our results also emphasize the need to search for mutations in a comprehensive manner while still searching for both common and rare PTMs. SIGNIFICANCE: We present and examine the abilities of Bolt, a new cloud-based proteomics search engine to search for the presence of over 2.3 million known cancer mutations in a matter of minutes while still performing a standard proteomics search that includes 31 post translational modifications. No other proteomics search software can do so.

Keywords: Bolt; Cloud; Isoforms; Mutations; Proteomics; Variants.

MeSH terms

  • Cell Line, Tumor
  • Cloud Computing*
  • Genomics / methods
  • Humans
  • Mutation*
  • Neoplasms / genetics*
  • Protein Processing, Post-Translational
  • Proteomics / methods*
  • Search Engine / methods*
  • Search Engine / standards