RNAlign program: alignment of RNA sequences using both primary and secondary structures

F Corpet; B Michot

doi:10.1093/bioinformatics/10.4.389

RNAlign program: alignment of RNA sequences using both primary and secondary structures

Comput Appl Biosci. 1994 Jul;10(4):389-99. doi: 10.1093/bioinformatics/10.4.389.

Authors

F Corpet¹, B Michot

Affiliation

¹ Institut National de la Recherche Agronomique (INRA), Laboratoire de Génétique Cellulaire, Castanet Tolosan, France.

PMID: 7528630
DOI: 10.1093/bioinformatics/10.4.389

Abstract

We have developed an algorithm and a computer program for aligning new RNA sequences with a bank of aligned homologous RNA sequences. Given a common folding structure for the bank, the program performs an alignment between the bank and a new sequence, optimal both in terms of primary and secondary structure. This method is useful to align sequences that present a common folding structure despite extensive divergence of their primary structures. It allows these preserved regions to be precisely distinguished from domains with more variable secondary structure. An optimal alignment of a sequence of length N with a bank of homologous sequences of length M is produced in O (M2N3) time and O(M2N2) space. For sequences that are too long for an algorithm of this complexity, a proposed strategy is to use a classical alignment (using only primary structure data) then improve it with the new algorithm in the regions where the bank stems are not aligned with possible stems in the new sequence. The algorithm has been implemented in Turbo Pascal on a PC, and has been used to align RNA sequences of eubacterial large ribosomal subunit.

MeSH terms

Algorithms
Base Sequence
Databases, Factual
Molecular Sequence Data
Nucleic Acid Conformation
RNA / chemistry
RNA / genetics*
RNA, Bacterial / chemistry
RNA, Bacterial / genetics
RNA, Ribosomal, 23S / chemistry
RNA, Ribosomal, 23S / genetics
Sequence Alignment / methods*
Sequence Alignment / statistics & numerical data
Sequence Homology, Nucleic Acid
Software*

Substances

RNA, Bacterial
RNA, Ribosomal, 23S
RNA