Genetic variation and recent positive selection in worldwide human populations: evidence from nearly 1 million SNPs

PLoS One. 2009 Nov 18;4(11):e7888. doi: 10.1371/journal.pone.0007888.

Abstract

Background: Genome-wide scans of hundreds of thousands of single-nucleotide polymorphisms (SNPs) have resulted in the identification of new susceptibility variants to common diseases and are providing new insights into the genetic structure and relationships of human populations. Moreover, genome-wide data can be used to search for signals of recent positive selection, thereby providing new insights into the genetic adaptations that occurred as modern humans spread out of Africa and around the world.

Methodology: We genotyped approximately 500,000 SNPs in 255 individuals (5 individuals from each of 51 worldwide populations) from the Human Genome Diversity Panel (HGDP-CEPH). When merged with non-overlapping SNPs typed previously in 250 of these same individuals, the resulting data consist of over 950,000 SNPs. We then analyzed the genetic relationships and ancestry of individuals without assigning them to populations, and we also identified candidate regions of recent positive selection at both the population and regional (continental) level.

Conclusions: Our analyses both confirm and extend previous studies; in particular, we highlight the impact of various dispersals, and the role of substructure in Africa, on human genetic diversity. We also identified several novel candidate regions for recent positive selection, and a gene ontology (GO) analysis identified several GO groups that were significantly enriched for such candidate genes, including immunity and defense related genes, sensory perception genes, membrane proteins, signal receptors, lipid binding/metabolism genes, and genes involved in the nervous system. Among the novel candidate genes identified are two genes involved in the thyroid hormone pathway that show signals of selection in African Pygmies that may be related to their short stature.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Africa
  • Algorithms
  • Databases, Genetic
  • Dwarfism / genetics
  • Genetic Variation*
  • Genetics, Population*
  • Genotype
  • Heterozygote
  • Humans
  • Models, Biological
  • Oligonucleotide Array Sequence Analysis
  • Polymorphism, Single Nucleotide*
  • Principal Component Analysis
  • Regression Analysis
  • Reproducibility of Results