Pathway analysis approaches for rare and common variants: insights from Genetic Analysis Workshop 18

Genet Epidemiol. 2014 Sep;38 Suppl 1(0 1):S86-91. doi: 10.1002/gepi.21831.

Abstract

Pathway analysis, broadly defined as a group of methods incorporating a priori biological information from public databases, has emerged as a promising approach for analyzing high-dimensional genomic data. As part of Genetic Analysis Workshop 18, seven research groups applied pathway analysis techniques to whole-genome sequence data from the San Antonio Family Study. Overall, the groups found that the potential of pathway analysis to improve detection of causal variants by lowering the multiple-testing burden and incorporating biologic insight remains largely unrealized. Specifically, there is a lack of best practices at each stage of the pathway approach: annotation, analysis, interpretation, and follow-up. Annotation of genetic variants is inconsistent across databases, incomplete, and biased toward known genes. At the analysis stage insufficient statistical power remains a major challenge. Analyses combining rare and common variants may have an inflated type I error rate and may not improve detection of causal genes. Inclusion of known causal genes may not improve statistical power, although the fraction of explained phenotypic variance may be a more appropriate metric. Interpretation of findings is further complicated by evidence in support of interactions between pathways and by the lack of consensus on how to best incorporate functional information. Finally, all presented approaches warranted follow-up studies, both to reduce the likelihood of false-positive findings and to identify specific causal variants within a given pathway. Despite the initial promise of pathway analysis for modeling biological complexity of disease phenotypes, many methodological challenges currently remain to be addressed.

Keywords: family studies; hypertension; pathway analysis; whole-genome sequence.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Gene Frequency
  • Genetic Variation*
  • Genome, Human
  • Genotype
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Hypertension / genetics
  • Hypertension / metabolism
  • Hypertension / pathology
  • Models, Genetic*
  • Pedigree
  • Phenotype
  • Sequence Analysis, DNA / methods*