Quantification and modeling of turnover dynamics of de novo transcripts in Drosophila melanogaster

Nucleic Acids Res. 2024 Jan 11;52(1):274-287. doi: 10.1093/nar/gkad1079.

Abstract

Most of the transcribed eukaryotic genomes are composed of non-coding transcripts. Among these transcripts, some are newly transcribed when compared to outgroups and are referred to as de novo transcripts. De novo transcripts have been shown to play a major role in genomic innovations. However, little is known about the rates at which de novo transcripts are gained and lost in individuals of the same species. Here, we address this gap and estimate the de novo transcript turnover rate with an evolutionary model. We use DNA long reads and RNA short reads from seven geographically remote samples of inbred individuals of Drosophila melanogaster to detect de novo transcripts that are gained on a short evolutionary time scale. Overall, each sampled individual contains around 2500 unspliced de novo transcripts, with most of them being sample specific. We estimate that around 0.15 transcripts are gained per year, and that each gained transcript is lost at a rate around 5× 10-5 per year. This high turnover of transcripts suggests frequent exploration of new genomic sequences within species. These rate estimates are essential to comprehend the process and timescale of de novo gene birth.

MeSH terms

  • Animals
  • Biological Evolution
  • Drosophila melanogaster* / genetics
  • Evolution, Molecular*
  • Genome
  • Genomics
  • Geography
  • Humans
  • RNA
  • RNA, Untranslated* / chemistry
  • RNA, Untranslated* / genetics
  • RNA, Untranslated* / metabolism
  • Transcription, Genetic*

Substances

  • RNA
  • RNA, Untranslated