Drug-symptom networking: Linking drug-likeness screening to drug discovery

Pharmacol Res. 2016 Jan:103:105-13. doi: 10.1016/j.phrs.2015.11.015. Epub 2015 Nov 23.

Abstract

Understanding the relationships between drugs and symptoms has broad medical consequences, yet a comprehensive description of the drug-symptom associations is currently lacking. Here, 1441 FDA-approved drugs were collected, and PCA was used to extract 122 descriptors which explained 91% of the variance. Then, a k-means++ method was employed to partition the drug dataset into 3 clusters, and 3 corresponding SVDD models (drug-likeness screening models) were constructed with an overall accuracy of up to 95.6%. Furthermore, 6878 herbal molecules from the TcmSP™ database were screened by the above 3 SVDD model to obtain 5309 candidate drug molecules with highly accept classification of 77.19%. To assess the accuracy of the SVDD models, 8559 herbal molecule-symptom co-occurrences were mined from Pubmed abstracts, involving 697 herbal molecules and 314 symptoms. Most of the 697 herbal molecules could be found in the accepted SVDD data (5309 molecules), showing the potential of the SVDD for the screening of drug candidates. Moreover, a herbal molecule-herbal molecule network and a herbal molecule-symptom were constructed. Overall, the results provided a new drug-likeness screening approach independent to abnormal training data, and the comprehensive collection of herbal molecule-symptom associations formed a new data resource for systematic characterization of the symptom-oriented medicines.

Keywords: Drug-likeness; Drug–symptom relationship; Herbal molecule; SVDD classification; Text mining.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Drug Discovery*
  • Drugs, Chinese Herbal
  • Humans
  • Models, Theoretical*
  • Principal Component Analysis

Substances

  • Drugs, Chinese Herbal