Locating well-conserved regions within a pairwise alignment

Comput Appl Biosci. 1993 Aug;9(4):387-96. doi: 10.1093/bioinformatics/9.4.387.

Abstract

Within a single alignment of two DNA sequences or two protein sequences, some regions may be much better conserved than others. Such strong conservation may reveal a region that possesses an important function. When alignments are so long that it is infeasible, or at least undesirable, to inspect them in complete detail, it is helpful to have an automatic process that computes information about the varying degree of conservation along the alignment and displays the information in a graphical representation that is readily assimilated. This paper presents methods for computing several such 'robustness measures' at each position of a given alignment. These methods are all very space-efficient; they use only space proportional to the sum of the two sequence lengths. To illustrate their effectiveness, one of the methods is used to locate particularly well-conserved regions in the beta-globin gene locus control region and in the 5' flank of the gamma-globin gene.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms*
  • Animals
  • Base Sequence
  • Conserved Sequence*
  • DNA / genetics
  • Globins / genetics
  • Humans
  • Molecular Sequence Data
  • Rabbits
  • Sequence Alignment / methods*
  • Sequence Alignment / statistics & numerical data
  • Software

Substances

  • Globins
  • DNA