On the origin of new genes in Drosophila

Genome Res. 2008 Sep;18(9):1446-55. doi: 10.1101/gr.076588.108. Epub 2008 Jun 11.

Abstract

Several mechanisms have been proposed to account for the origination of new genes. Despite extensive case studies, the general principles governing this fundamental process are still unclear at the whole-genome level. Here, we unveil genome-wide patterns for the mutational mechanisms leading to new genes and their subsequent lineage-specific evolution at different time nodes in the Drosophila melanogaster species subgroup. We find that (1) tandem gene duplication has generated approximately 80% of the nascent duplicates that are limited to single species (D. melanogaster or Drosophila yakuba); (2) the most abundant new genes shared by multiple species (44.1%) are dispersed duplicates, and are more likely to be retained and be functional; (3) de novo gene origination from noncoding sequences plays an unexpectedly important role during the origin of new genes, and is responsible for 11.9% of the new genes; (4) retroposition is also an important mechanism, and had generated approximately 10% of the new genes; (5) approximately 30% of the new genes in the D. melanogaster species complex recruited various genomic sequences and formed chimeric gene structures, suggesting structure innovation as an important way to help fixation of new genes; and (6) the rate of the origin of new functional genes is estimated to be five to 11 genes per million years in the D. melanogaster subgroup. Finally, we survey gene frequencies among 19 globally derived strains for D. melanogaster-specific new genes and reveal that 44.4% of them show copy number polymorphisms within a population. In conclusion, we provide a panoramic picture for the origin of new genes in Drosophila species.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Chimerism
  • Drosophila melanogaster / genetics*
  • Evolution, Molecular*
  • Genes, Duplicate
  • Genes, Insect*
  • Genome, Insect
  • Phylogeny
  • Tandem Repeat Sequences