Genome-Wide Association Studies in Arabidopsis thaliana: Statistical Analysis and Network-Based Augmentation of Signals

Methods Mol Biol. 2021:2200:187-210. doi: 10.1007/978-1-0716-0880-7_9.

Abstract

Genome-wide association studies (GWAS) have proven effective at identifying genetic variants and genes that are associated with phenotypes in humans, animals, and plants. Since most phenotypes of plant species are complex traits regulated by many genes and their functional interactions, GWAS are increasing in popularity for genetic dissections of plant phenotypes. For the reference plant, Arabidopsis thaliana, detailed information on genetic variations became available with the completion of the 1001 Genomes Project, enabling highly resolved association mapping between chromosomal loci and complex traits. Improvements have been made in the statistical analysis methods for testing the significance of genotype-to-phenotype associations, thereby substantially reducing the confounding effects of population structures. Furthermore, there have been large efforts toward post-GWAS augmentation of signals via integration with other types of information to overcome the limited statistical power of GWAS. This chapter describes the stepwise procedure of GWAS in Arabidopsis, focusing on data analysis processes including preprocessing of genotype and phenotype data, statistical analysis to identify phenotype-associated chromosomal loci, identification of phenotype-associated genes based on the phenotype-associated loci, and finally network-based augmentation of GWAS signals to identify additional candidate genes for the phenotype.

Keywords: Arabidopsis thaliana; Genome-wide association study; Genotype-to-phenotype association; Network-based augmentation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis* / genetics
  • Arabidopsis* / metabolism
  • Genetic Association Studies*
  • Genetic Variation*
  • Genome-Wide Association Study
  • Quantitative Trait, Heritable*