PSCAN: Spatial scan tests guided by protein structures improve complex disease gene discovery and signal variant detection

Genome Biol. 2020 Aug 26;21(1):217. doi: 10.1186/s13059-020-02121-0.

Abstract

Germline disease-causing variants are generally more spatially clustered in protein 3-dimensional structures than benign variants. Motivated by this tendency, we develop a fast and powerful protein-structure-based scan (PSCAN) approach for evaluating gene-level associations with complex disease and detecting signal variants. We validate PSCAN's performance on synthetic data and two real data sets for lipid traits and Alzheimer's disease. Our results demonstrate that PSCAN performs competitively with existing gene-level tests while increasing power and identifying more specific signal variant sets. Furthermore, PSCAN enables generation of hypotheses about the molecular basis for the associations in the context of protein structures and functional domains.

Keywords: Gene-level association tests; Protein 3D structures; Risk variant detection; Spatial scan approach.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Alzheimer Disease / genetics
  • Genetic Association Studies*
  • Genetic Predisposition to Disease / genetics*
  • Genetic Variation
  • Genome-Wide Association Study
  • Humans
  • Lipids
  • Models, Genetic
  • Phenotype
  • Proteins / chemistry*
  • Proteins / genetics

Substances

  • Lipids
  • Proteins