Information extraction in molecular biology

Brief Bioinform. 2002 Jun;3(2):154-65. doi: 10.1093/bib/3.2.154.

Abstract

Information extraction has become a very active field in bioinformatics recently and a number of interesting papers have been published. Most of the efforts have been concentrated on a few specific problems, such as the detection of protein-protein interactions and the analysis of DNA expression arrays, although it is obvious that there are many other interesting areas of potential application (document retrieval, protein functional description, and detection of disease-related genes to name a few). Paradoxically, these exciting developments have not yet crystallised into general agreement on a set of standard evaluation criteria, such as the ones developed in fields such as protein structure prediction, which makes it very difficult to compare performance across these different systems. In this review we introduce the general field of information extraction, we outline the status of the applications in molecular biology, and we then discuss some ideas about possible standards for evaluation that are needed for the future development of the field.

Publication types

  • Review

MeSH terms

  • Information Management / methods*
  • Information Storage and Retrieval / methods*
  • MEDLINE
  • Molecular Biology
  • PubMed