Genome-wide analysis of mammalian DNA segment fusion/fission

J Theor Biol. 2006 May 21;240(2):200-8. doi: 10.1016/j.jtbi.2005.09.016. Epub 2005 Nov 14.

Abstract

As a powerful tool for gene function prediction, gene fusion has been widely studied in prokaryotes and certain groups of eukaryotes, but it has been little applied in studies of mammalian genomes. With the first fully sequenced mammalian genomes (human, mouse, rat) now available, we defined and collected a set of fusion/fission event-linked segments (FFLS) based on structured organized genomic alignment. The statistics of the sequence features highlighted the FFLSs against their random context. We found that there are three groups of FFLSs with different component pairs (i.e. gene-gene, gene-noncoding and noncoding-noncoding) in all three mammalian genomes. The proteins encoded by the components of FFLSs in the first group shown a strong tendency to interact with each other. The segmental components in the last two groups which did not contain any protein-coding genes, were found not only to be transcribed to some level, but also more conserved than the random background. Thus, these segments are possibly carrying certain biologically functional elements. We propose that FFLS may be a potential tool for prediction and analysis of function and functional interaction of genetic elements, including both genes and noncoding elements, in mammalian genomes. The full list of the FFLSs in the genomes of the three mammals is available as supporting information at doi:10.1016/j.jtbi.2005.09.016.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Composition
  • Base Sequence
  • DNA Cleavage
  • DNA*
  • Evolution, Molecular*
  • Gene Fusion
  • Genetic Code
  • Genome*
  • Genome, Human
  • Genomics
  • Humans
  • Mammals / genetics*
  • Mice
  • Models, Genetic*
  • Molecular Sequence Data
  • Rats

Substances

  • DNA