Cheminformatics Tools for Analyzing and Designing Optimized Small-Molecule Collections and Libraries

Cell Chem Biol. 2019 May 16;26(5):765-777.e3. doi: 10.1016/j.chembiol.2019.02.018. Epub 2019 Apr 4.

Abstract

Libraries of well-annotated small molecules have many uses in chemical genetics, drug discovery, and therapeutic repurposing. Multiple libraries are available, but few data-driven approaches exist to compare them and design new libraries. We describe an approach to scoring and creating libraries based on binding selectivity, target coverage, and induced cellular phenotypes as well as chemical structure, stage of clinical development, and user preference. The approach, available via the online tool http://www.smallmoleculesuite.org, assembles sets of compounds with the lowest possible off-target overlap. Analysis of six kinase inhibitor libraries using our approach reveals dramatic differences among them and led us to design a new LSP-OptimalKinase library that outperforms existing collections in target coverage and compact size. We also describe a mechanism of action library that optimally covers 1,852 targets in the liganded genome. Our tools facilitate creation, analysis, and updates of both private and public compound collections.

Keywords: chemical biology; chemical genetics; chemical library; cheminformatics; drug discovery; drug repurposing; kinase inhibitors; machine learning; mechanism of action; small molecule.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cheminformatics / methods*
  • Drug Discovery
  • Protein Kinase Inhibitors / analysis
  • Protein Kinase Inhibitors / chemistry
  • Small Molecule Libraries / analysis*
  • Small Molecule Libraries / chemistry
  • User-Computer Interface*

Substances

  • Protein Kinase Inhibitors
  • Small Molecule Libraries