GeneSeer: a sage for gene names and genomic resources

BMC Genomics. 2005 Sep 21;6:134. doi: 10.1186/1471-2164-6-134.

Abstract

Background: Independent identification of genes in different organisms and assays has led to a multitude of names for each gene. This balkanization makes it difficult to use gene names to locate genomic resources, homologs in other species and relevant publications.

Methods: We solve the naming problem by collecting data from a variety of sources and building a name-translation database. We have also built a table of homologs across several model organisms: H. sapiens, M. musculus, R. norvegicus, D. melanogaster, C. elegans, S. cerevisiae, S. pombe and A. thaliana. The homolog table allows GeneSeer to draw phylogenetic trees and identify the closest homologs, which in turn allows names from one species to be used to identify homologous genes in another species. A website (http://geneseer.cshl.org/) connected to the database provides user-friendly access to our tools and to external genomic resources using familiar gene names.
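The two structures described above, a name-translation table and a homolog table, can be illustrated with a minimal Python sketch. This is not GeneSeer's implementation or schema: the identifiers (`hs:1`, `mm:1`), the similarity scores, and the function names are all hypothetical and chosen only to show how an alias in one species could be resolved to the closest homolog in another.

```python
from collections import defaultdict

# Toy alias records: (species, alias, canonical_id).
# The canonical IDs are made up for illustration.
ALIASES = [
    ("H. sapiens", "TP53", "hs:1"),
    ("H. sapiens", "p53", "hs:1"),
    ("H. sapiens", "LFS1", "hs:1"),
    ("M. musculus", "Trp53", "mm:1"),
    ("M. musculus", "p53", "mm:1"),
]

# Toy homolog table: canonical_id -> [(homolog_id, similarity)].
# The scores are invented placeholders, not measured values.
HOMOLOGS = {
    "hs:1": [("mm:1", 0.78)],
    "mm:1": [("hs:1", 0.78)],
}


def build_index(aliases):
    """Map each lower-cased alias to the set of (species, gene_id) it names."""
    index = defaultdict(set)
    for species, alias, gene_id in aliases:
        index[alias.lower()].add((species, gene_id))
    return index


def translate(index, name, species=None):
    """Return the sorted canonical IDs for a name, optionally within one species."""
    hits = index.get(name.lower(), set())
    return sorted(gid for sp, gid in hits if species is None or sp == species)


def closest_homolog(gene_id, target_prefix):
    """Return the highest-scoring homolog whose ID starts with target_prefix."""
    candidates = [
        (score, hid)
        for hid, score in HOMOLOGS.get(gene_id, [])
        if hid.startswith(target_prefix)
    ]
    return max(candidates)[1] if candidates else None


INDEX = build_index(ALIASES)

# Resolve a human alias, then look up its closest mouse homolog.
human_ids = translate(INDEX, "LFS1", species="H. sapiens")
mouse_id = closest_homolog(human_ids[0], "mm:")
```

An ambiguous name such as "p53" matches a gene in each species in this toy data, which is why `translate` takes an optional species filter.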

Conclusion: GeneSeer allows access to gene information through common names and can map sequences to names. GeneSeer also allows identification of homologs and paralogs for a given gene. A variety of genomic data, such as sequences, SNPs, splice variants and expression patterns, can be accessed through the GeneSeer interface. It is freely available on the web (http://geneseer.cshl.org/) and can be incorporated into other tools through an HTTP-based software interface described on the website. It is currently used as the search engine in the RNAi Codex resource, which is a portal for short hairpin RNA (shRNA) gene-silencing constructs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing
  • Computational Biology / methods*
  • Database Management Systems
  • Databases, Factual*
  • Databases, Genetic
  • Databases, Protein
  • Genetic Techniques*
  • Genome
  • Genomics / methods*
  • Humans
  • Information Storage and Retrieval
  • Internet
  • Natural Language Processing
  • Phylogeny
  • Polymorphism, Single Nucleotide
  • RNA, Small Interfering / metabolism
  • Software
  • Terminology as Topic

Substances

  • RNA, Small Interfering