Unsupervised learning in detection of gene transfer

J Biomed Biotechnol. 2008:2008:472719. doi: 10.1155/2008/472719.

Abstract

The tree representation as a model for organismal evolution has been in use since before Darwin. However, with the recent unprecedented access to biomolecular data, it has been discovered that, especially in the microbial world, individual genes making up the genome of an organism give rise to different and sometimes conflicting evolutionary tree topologies. This discovery calls into question the notion of a single evolutionary tree for an organism and gives rise to the notion of an evolutionary consensus tree based on the evolutionary patterns of the majority of genes in a genome embedded in a network of gene histories. Here, we discuss an approach to the analysis of genomic data of multiple genomes using bipartition spectral analysis and unsupervised learning. An interesting observation is that genes within genomes that have evolutionary tree topologies, which are in substantial conflict with the evolutionary consensus tree of an organism, point to possible horizontal gene transfer events which often delineate significant evolutionary events.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Archaeoglobus fulgidus / genetics
  • Artificial Intelligence
  • Cluster Analysis
  • Computational Biology / methods*
  • Evolution, Molecular
  • Gene Frequency
  • Gene Transfer, Horizontal*
  • Genes, Archaeal*
  • Genomics / methods
  • Halobacterium salinarum / genetics
  • Methanosarcina / genetics
  • Models, Genetic
  • Phylogeny
  • RNA / classification
  • Selection, Genetic
  • Sequence Homology, Nucleic Acid*
  • Species Specificity

Substances

  • RNA