Uncovering networks from genome-wide association studies via circular genomic permutation

G3 (Bethesda). 2012 Sep;2(9):1067-75. doi: 10.1534/g3.112.002618. Epub 2012 Sep 1.

Abstract

Genome-wide association studies (GWAS) aim to detect single nucleotide polymorphisms (SNP) associated with trait variation. However, due to the large number of tests, standard analysis techniques impose highly stringent significance thresholds, leaving potentially associated SNPs undetected, and much of the trait genetic variation unexplained. Pathway- and network-based methodologies applied to GWAS aim to detect associations missed by standard single-marker approaches. The complex and non-random architecture of the genome makes it a challenge to derive an appropriate testing framework for such methodologies. We developed a rapid and simple permutation approach that uses GWAS SNP association results to establish the significance of pathway associations while accounting for the linkage disequilibrium structure of SNPs and the clustering of functionally related elements in the genome. All SNPs used in the GWAS are placed in a "circular genome" according to their location. Then the complete set of SNP association P values are permuted by rotation with respect to the genomic locations of the SNPs. Once these "simulated" P values are assigned, the joint gene P values are calculated using Fisher's combination test, and the association of pathways is tested using the hypergeometric test. The circular genomic permutation approach was applied to a human genome-wide association dataset. The data consists of 719 individuals from the ORCADES study genotyped for ~300,000 SNPs and measured for 51 traits ranging from physical to biochemical measurements. KEGG pathways (n = 225) were used as the sets of pathways to be tested. Our results demonstrate that the circular genomic permutations provide robust association P values. The non-permuted hypergeometric analysis generates ~1400 pathway-trait combination results with an association P value more significant than P ≤ 0.05, whereas applying circular genomic permutation reduces the number of significant results to a more credible 40% of that value. The circular permutation software ("genomicper") is available as an R package at http://cran.r-project.org/.

Keywords: GWAS; cardiac disease; genomicper R package; pathway-based; permutation method.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cardiomyopathies / genetics
  • Cardiomyopathies / metabolism
  • Computer Simulation
  • Gene Regulatory Networks*
  • Genome, Human*
  • Genome-Wide Association Study*
  • Genomics / methods
  • Humans
  • Internet
  • Models, Genetic
  • Molecular Sequence Annotation
  • Polymorphism, Single Nucleotide*
  • Signal Transduction*
  • Software