Text mining for metabolic pathways, signaling cascades, and protein networks

Sci STKE. 2005 May 10;2005(283):pe21. doi: 10.1126/stke.2832005pe21.

Abstract

The complexity of the information stored in databases and publications on metabolic and signaling pathways, the high throughput of experimental data, and the growing number of publications make it imperative to provide systems to help the researcher navigate through these interrelated information resources. Text-mining methods have started to play a key role in the creation and maintenance of links between the information stored in biological databases and its original sources in the literature. These links will be extremely useful for database updating and curation, especially if a number of technical problems can be solved satisfactorily, including the identification of protein and gene names (entities in general) and the characterization of their types of interactions. The first generation of openly accessible text-mining systems, such as iHOP (Information Hyperlinked over Proteins), provides additional functions to facilitate the reconstruction of protein interaction networks, combine database and text information, and support the scientist in the formulation of novel hypotheses. The next challenge is the generation of comprehensive information regarding the general function of signaling pathways and protein interaction networks.

Publication types

  • Review

MeSH terms

  • Animals
  • Databases, Factual*
  • Dictionaries as Topic
  • Enzymes
  • Genes
  • Humans
  • Information Management / methods*
  • Information Management / organization & administration
  • Information Storage and Retrieval / methods*
  • Knowledge Bases
  • Metabolism*
  • Proteins* / classification
  • Proteins* / genetics
  • Signal Transduction*
  • Terminology as Topic

Substances

  • Enzymes
  • Proteins