Improvement of the Structure Generator DAECS with Respect to Structural Diversity

Mol Inform. 2021 Apr;40(4):e2000225. doi: 10.1002/minf.202000225. Epub 2020 Nov 25.

Abstract

The development of novel organic compounds with desired properties is time consuming and costly. Thus, the quantitative structure-property relationship (QSPR) model is used widely for efficiently discovering compounds with the desired properties. Novel structures can be generated from a variety of input structures in silico by structure generators. We previously developed the structure generator DAECS to yield highly active drug-like structures. However, the structural diversity of the structures generated by DAECS was still small for practical applications such as drug discovery. In this paper, we present structure modification rules and the algorithm to output more diverse structures through the DAECS workflow. Two new types of structural modification rules, bond contraction and ring mergence, were added. The new algorithm, which restricts the search area and subsequently clusters structures on a two-dimensional map generated by generative topographic mapping, was implemented for the repetitive selection of seed structures. A case study was conducted to evaluate our method using ligand structures for the histamine H1 receptor. The results showed improved structural diversity than the previous method.

Keywords: Chemoinformatics; Molecular diversity; Structure generation; Structure-activity relationships; Virtual screening.

MeSH terms

  • Algorithms*
  • Molecular Structure
  • Organic Chemicals / chemical synthesis
  • Organic Chemicals / chemistry*
  • Quantitative Structure-Activity Relationship*

Substances

  • Organic Chemicals