Integration of new genes into cellular networks, and their structural maturation

Genetics. 2013 Dec;195(4):1407-17. doi: 10.1534/genetics.113.152256. Epub 2013 Sep 20.

Abstract

It has been recently discovered that new genes can originate de novo from noncoding DNA, and several biological traits including expression or sequence composition form a continuum from noncoding sequences to conserved genes. In this article, using yeast genes I test whether the integration of new genes into cellular networks and their structural maturation shows such a continuum by analyzing their changes with gene age. I show that 1) The number of regulatory, protein-protein, and genetic interactions increases continuously with gene age, although with very different rates. New regulatory interactions emerge rapidly within a few million years, while the number of protein-protein and genetic interactions increases slowly, with a rate of 2-2.25 × 10(-8)/year and 4.8 × 10(-8)/year, respectively. 2) Gene essentiality evolves relatively quickly: the youngest essential genes appear in proto-genes ∼14 MY old. 3) In contrast to interactions, the secondary structure of proteins and their robustness to mutations indicate that new genes face a bottleneck in their evolution: proto-genes are characterized by high β-strand content, high aggregation propensity, and low robustness against mutations, while conserved genes are characterized by lower strand content and higher stability, most likely due to the higher probability of gene loss among young genes and accumulation of neutral mutations.

Keywords: Saccharomyces cerevisiae; aggregation; de novo genes; protein–protein interaction; regulatory network; secondary structure.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Ascomycota / classification
  • Ascomycota / genetics*
  • Ascomycota / metabolism
  • Binding Sites
  • Evolution, Molecular*
  • Fungal Proteins / chemistry
  • Fungal Proteins / genetics
  • Fungal Proteins / metabolism*
  • Gene Regulatory Networks*
  • Genes, Fungal*
  • Molecular Sequence Data
  • Phylogeny
  • Protein Binding
  • Protein Conformation

Substances

  • Fungal Proteins