Degenerate primer design via clustering

Proc IEEE Comput Soc Bioinform Conf. 2003:2:75-83.

Abstract

This paper describes a new strategy for designing degenerate primers for a given multiple alignment of amino acid sequences. Degenerate primers are useful for amplifying homologous genes. However, when a large collection of sequences is considered, no consensus region may exist in the multiple alignment, making it impossible to design a single pair of primers for the collection. In such cases, manual methods are used to find smaller groups from the input collection so that primers can be designed for individual groups. Our strategy proposes an automatic grouping of the input sequences by using clustering techniques. Conserved regions are then detected for each individual group. Conserved regions are scored using a BlockSimilarity score, a novel alignment scoring scheme that is appropriate for this application. Degenerate primers are then designed by reverse translating the conserved amino acid sequences to the corresponding nucleotide sequences. Our program, DePiCt, was written in BioPerl and was tested on the Toll-Interleukin Receptor (TIR)and the non-TIR family of plant resistance genes. Existing programs for degenerate primer design were unable to find primers for these data sets.

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Base Sequence
  • Cluster Analysis
  • DNA Primers / genetics*
  • DNA Probes / genetics
  • Equipment Design
  • Equipment Failure Analysis
  • Molecular Sequence Data
  • Oligonucleotide Array Sequence Analysis / methods*
  • Proteins / genetics*
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Sequence Analysis, Protein / methods*
  • Software

Substances

  • DNA Primers
  • DNA Probes
  • Proteins