Tandem repeat variation in human and great ape populations and its impact on gene expression divergence

Genome Res. 2015 Nov;25(11):1591-9. doi: 10.1101/gr.190868.115. Epub 2015 Aug 19.

Abstract

Tandem repeats (TRs) are stretches of DNA that are highly variable in length and mutate rapidly. They are thus an important source of genetic variation. This variation is highly informative for population and conservation genetics. It has also been associated with several pathological conditions and with gene expression regulation. However, genome-wide surveys of TR variation in humans and closely related species have been scarce because of technical difficulties arising from short-read technology. Here we explored the genome-wide diversity of TRs in a panel of 83 human and nonhuman great ape genomes, spanning six species, and studied their impact on gene expression evolution. We found that population diversity patterns can be efficiently captured with short TRs (repeat unit length, 1-5 bp). We examined the potential evolutionary role of TRs in gene expression differences between humans and other primates by using 30,275 larger TRs (repeat unit length, 2-50 bp). Genes that contained TRs in their promoters, 3' untranslated regions, introns, or exons had higher expression divergence than genes without repeats in those regions. Genes with polymorphic small repeats (1-5 bp) in their promoters also had higher expression divergence than genes with fixed or no TRs in their promoters. Our findings highlight the potential contribution of TRs to human evolution through gene regulation.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • 3' Untranslated Regions
  • Animals
  • Chromosome Mapping
  • Evolution, Molecular
  • Exons
  • Female
  • Gene Expression Regulation*
  • Genetic Loci
  • Genetic Variation*
  • Genome, Human
  • Genotyping Techniques
  • Humans
  • Introns
  • Male
  • Microsatellite Repeats*
  • Primates / genetics*
  • Promoter Regions, Genetic

Substances

  • 3' Untranslated Regions