Computation of repetitions and regularities of biologically weighted sequences

J Comput Biol. 2006 Jul-Aug;13(6):1214-31. doi: 10.1089/cmb.2006.13.1214.

Abstract

Biological weighted sequences are used extensively in molecular biology as profiles for protein families, in the representation of binding sites and often for the representation of sequences produced by a shotgun sequencing strategy. In this paper, we address three fundamental problems in the area of biologically weighted sequences: (i) computation of repetitions, (ii) pattern matching, and (iii) computation of regularities. Our algorithms can be used as basic building blocks for more sophisticated algorithms applied on weighted sequences.

MeSH terms

  • Algorithms*
  • Animals
  • Base Sequence
  • Binding Sites
  • Computational Biology
  • Hemoglobins / chemistry
  • Hemoglobins / genetics
  • Humans
  • Molecular Sequence Data
  • Sequence Alignment*

Substances

  • Hemoglobins