Evidence for deep phylogenetic conservation of exonic splice-related constraints: splice-related skews at exonic ends in the brown alga Ectocarpus are common and resemble those seen in humans

Genome Biol Evol. 2013;5(9):1731-45. doi: 10.1093/gbe/evt115.

Abstract

The control of RNA splicing is often modulated by exonic motifs near splice sites. Chief among these are exonic splice enhancers (ESEs). Well-described ESEs in mammals are purine rich and cause predictable skews in codon and amino acid usage toward exonic ends. Looking across species, those with relatively abundant intronic sequence are those with the more profound end of exon skews, indicative of exonization of splice site recognition. To date, the only intron-rich species that have been analyzed are mammals, precluding any conclusions about the likely ancestral condition. Here, we examine the patterns of codon and amino acid usage in the vicinity of exon-intron junctions in the brown alga Ectocarpus siliculosus, a species with abundant large introns, known SR proteins, and classical splice sites. We find that amino acids and codons preferred/avoided at both 3' and 5' ends in Ectocarpus, of which there are many, tend, on average, to also be preferred/avoided at the same exon ends in humans. Moreover, the preferences observed at the 5' ends of exons are largely the same as those at the 3' ends, a symmetry trend only previously observed in animals. We predict putative hexameric ESEs in Ectocarpus and show that these are purine rich and that there are many more of these identified as functional ESEs in humans than expected by chance. These results are consistent with deep phylogenetic conservation of SR protein binding motifs. Assuming codons preferred near boundaries are "splice optimal" codons, in Ectocarpus, unlike Drosophila, splice optimal and translationally optimal codons are not mutually exclusive. The exclusivity of translationally optimal and splice optimal codon sets is thus not universal.

Keywords: ESE; Ectocarpus; splicing; translational selection.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Codon / genetics*
  • Computational Biology
  • Drosophila / genetics
  • Enhancer Elements, Genetic*
  • Evolution, Molecular*
  • Exons
  • Humans
  • Introns
  • Phaeophyceae / genetics
  • Phylogeny
  • RNA Splice Sites / genetics*
  • RNA Splicing / genetics
  • Regulatory Sequences, Nucleic Acid

Substances

  • Codon
  • RNA Splice Sites