Improving prediction accuracy of drug activities by utilising unlabelled instances with feature selection

Int J Comput Biol Drug Des. 2008;1(1):1-13. doi: 10.1504/ijcbdd.2008.018706.

Abstract

Molecular activities can be predicted by Quantitative Structure Activity Relationship (QSAR). Because of the high cost of experiments, the number of drug molecules with known activity is much less than that of unknown, to predict molecular activities utilising unlabelled instances will be an interesting issue. Here, Semi-Supervised Learning (SSL) is introduced and a SSL method, Co-Training is investigated on predicting drug activities utilising unlabelled instances. At the same time, a novel algorithm called FESCOT is proposed, which applies feature selection to remove redundant and irrelevant features for Co-Training. Numerical experimental results show that Co-Training and feature selection helps to improve the prediction ability of Co-Training.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence
  • Computational Biology
  • Computer Simulation
  • Databases, Factual
  • Drug Design*
  • Models, Biological
  • Mutagenicity Tests / statistics & numerical data
  • Quantitative Structure-Activity Relationship*