MicroRNA identification based on sequence and structure alignment

Bioinformatics. 2005 Sep 15;21(18):3610-4. doi: 10.1093/bioinformatics/bti562. Epub 2005 Jun 30.

Abstract

Motivation: MicroRNAs (miRNA) are approximately 22 nt long non-coding RNAs that are derived from larger hairpin RNA precursors and play important regulatory roles in both animals and plants. The short length of the miRNA sequences and relatively low conservation of pre-miRNA sequences restrict the conventional sequence-alignment-based methods to finding only relatively close homologs. On the other hand, it has been reported that miRNA genes are more conserved in the secondary structure rather than in primary sequences. Therefore, secondary structural features should be more fully exploited in the homologue search for new miRNA genes.

Results: In this paper, we present a novel genome-wide computational approach to detect miRNAs in animals based on both sequence and structure alignment. Experiments show this approach has higher sensitivity and comparable specificity than other reported homologue searching methods. We applied this method on Anopheles gambiae and detected 59 new miRNA genes.

Availability: This program is available at http://bioinfo.au.tsinghua.edu.cn/miralign.

Supplementary information: Supplementary information is available at http://bioinfo.au.tsinghua.edu.cn/miralign/supplementary.htm.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Anopheles / genetics
  • Base Sequence
  • Chromosome Mapping
  • Computational Biology / methods*
  • Conserved Sequence
  • Databases, Genetic
  • Gene Expression Profiling
  • Genome*
  • Internet
  • MicroRNAs / chemistry*
  • Molecular Sequence Data
  • RNA Precursors
  • Sensitivity and Specificity
  • Sequence Analysis, RNA
  • Software

Substances

  • MicroRNAs
  • RNA Precursors