Clustering and detection of 5' splice sites of mRNA by k weight-matrices model

Pac Symp Biocomput. 1999:171-81. doi: 10.1142/9789814447300_0017.

Abstract

We propose a novel method to detect 5' splice sites of eukaryotic mRNA. We have grouped the 5' splice splice sites into various classes. The clustered sites are represented by a set of PWMs. The clustering algorithm is similar to k-means clustering algorithm but the distance definition and the training score function were arranged. The clustered PWMs were applied to 5' splice site detection. The results showed an improvement in comparison with traditional single PWM. The result of the clusters suggests there are new motifs of 5' splice sites.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Base Sequence
  • Computational Biology / methods
  • Computer Simulation*
  • Consensus Sequence
  • Databases, Factual*
  • Internet
  • Models, Genetic*
  • RNA Splicing*
  • RNA, Messenger / chemistry*
  • RNA, Messenger / genetics*
  • Sensitivity and Specificity
  • Software

Substances

  • RNA, Messenger