Positionally biased gene loss after whole genome duplication: evidence from human, yeast, and plant

Genome Res. 2012 Dec;22(12):2427-35. doi: 10.1101/gr.131953.111. Epub 2012 Jul 26.

Abstract

Whole genome duplication (WGD) has made a significant contribution to many eukaryotic genomes including yeast, plants, and vertebrates. Following WGD, some ohnologs (WGD paralogs) remain in the genome arranged in blocks of conserved gene order and content (paralogons). However, the most common outcome is loss of one of the ohnolog pair. It is unclear what factors, if any, govern gene loss from paralogons. Recent studies have reported physical clustering (genetic linkage) of functionally linked (interacting) genes in the human genome and propose a biological significance for the clustering of interacting genes such as coexpression or preservation of epistatic interactions. Here we conduct a novel test of a hypothesis that functionally linked genes in the same paralogon are preferentially retained in cis after WGD. We compare the number of protein-protein interactions (PPIs) between linked singletons within a paralogon (defined as cis-PPIs) with that of PPIs between singletons across paralogon pairs (defined as trans-PPIs). We find that paralogons in which the number of cis-PPIs is greater than that of trans-PPIs are significantly enriched in human and yeast. The trend is similar in plants, but it is difficult to assess statistical significance due to multiple, overlapping WGD events. Interestingly, human singletons participating in cis-PPIs tend to be classified into "response to stimulus." We uncover strong evidence of biased gene loss after WGD, which further supports the hypothesis of biologically significant gene clusters in eukaryotic genomes. These observations give us new insight for understanding the evolution of genome structure and of protein interaction networks.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Biological Evolution
  • Epistasis, Genetic
  • Evolution, Molecular*
  • Gene Deletion*
  • Gene Duplication*
  • Genome, Fungal*
  • Genome, Human*
  • Genome, Plant*
  • Humans
  • Models, Genetic
  • Multigene Family
  • Plants / genetics
  • Protein Interaction Maps
  • Vertebrates / genetics
  • Yeasts / genetics