Small Molecule Accurate Recognition Technology (SMART) to Enhance Natural Products Research

Sci Rep. 2017 Oct 27;7(1):14243. doi: 10.1038/s41598-017-13923-x.

Abstract

Various algorithms comparing 2D NMR spectra have been explored for their ability to dereplicate natural products as well as determine molecular structures. However, spectroscopic artefacts, solvent effects, and the interactive effect of functional group(s) on chemical shifts combine to hinder their effectiveness. Here, we leveraged Non-Uniform Sampling (NUS) 2D NMR techniques and deep Convolutional Neural Networks (CNNs) to create a tool, SMART, that can assist in natural products discovery efforts. First, an NUS heteronuclear single quantum coherence (HSQC) NMR pulse sequence was adapted to a state-of-the-art nuclear magnetic resonance (NMR) instrument, and data reconstruction methods were optimized, and second, a deep CNN with contrastive loss was trained on a database containing over 2,054 HSQC spectra as the training set. To demonstrate the utility of SMART, several newly isolated compounds were automatically located with their known analogues in the embedded clustering space, thereby streamlining the discovery pipeline for new natural products.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Biological Products / chemistry*
  • Cyanobacteria / chemistry
  • Data Analysis*
  • Magnetic Resonance Spectroscopy / methods*
  • Neural Networks, Computer*
  • Peptide Synthases / chemistry

Substances

  • Biological Products
  • Peptide Synthases
  • non-ribosomal peptide synthase