PECAN Predicts Patterns of Cancer Cell Cytostatic Activity of Natural Products Using Deep Learning

J Nat Prod. 2024 Mar 22;87(3):567-575. doi: 10.1021/acs.jnatprod.3c00879. Epub 2024 Feb 13.

Abstract

Many machine learning techniques are used as drug discovery tools with the intent to speed characterization by determining relationships between compound structure and biological function. However, particularly in anticancer drug discovery, these models often make only binary decisions about the biological activity for a narrow scope of drug targets. We present a feed-forward neural network, PECAN (Prediction Engine for the Cytostatic Activity of Natural product-like compounds), that simultaneously classifies the potential antiproliferative activity of compounds against 59 cancer cell lines. It predicts the activity to be one of six categories, indicating not only if activity is present but the degree of activity. Using an independent subset of NCI data as a test set, we show that PECAN can reach 60.1% accuracy in a six-way classification and present further evidence that it classifies based on useful structural features of compounds using a "within-one" measure that reaches 93.0% accuracy.

MeSH terms

  • Biological Products* / pharmacology
  • Carya*
  • Cytostatic Agents* / pharmacology
  • Deep Learning*
  • Humans
  • Neoplasms*

Substances

  • Cytostatic Agents
  • Biological Products