STAAR workflow: a cloud-based workflow for scalable and reproducible rare variant analysis

Bioinformatics. 2022 May 26;38(11):3116-3117. doi: 10.1093/bioinformatics/btac272.

Abstract

Summary: We developed the variant-Set Test for Association using Annotation infoRmation (STAAR) workflow, written in the Workflow Description Language (WDL), to facilitate the analysis of rare variants in whole-genome sequencing association studies. The open-access STAAR workflow allows a user to perform rare variant testing with both gene-centric and genetic region approaches, enabling genome-wide, candidate and conditional analyses. It incorporates functional annotations, as introduced in the STAAR method, to boost the power of rare variant analysis. The tool was developed and optimized specifically for cloud-based platforms such as BioData Catalyst Powered by Terra. It provides easy-to-use functionality for rare variant analysis that can be incorporated into an exhaustive whole-genome sequencing analysis pipeline.
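
The summary does not spell out how annotations enter the test. As a rough illustration only: the published STAAR method combines p-values from several annotation-weighted tests using the Cauchy combination rule (ACAT). The sketch below shows that combination rule in Python, which is not the language of the workflow itself. The function name `cauchy_combine`, the equal default weights and the example p-values are assumptions made for illustration; this is not the workflow's own code.

```python
import math


def cauchy_combine(p_values, weights=None):
    """Combine p-values with the Cauchy combination rule.

    Hypothetical helper for illustration; not part of the STAAR workflow.
    """
    if not p_values:
        raise ValueError("p_values must not be empty")
    if weights is None:
        # Assumption: equal weights when none are supplied.
        weights = [1.0] * len(p_values)
    if len(weights) != len(p_values):
        raise ValueError("weights and p_values must have the same length")
    if any(not (0.0 < p < 1.0) for p in p_values):
        raise ValueError("each p-value must lie strictly between 0 and 1")
    if any(w < 0 for w in weights) or sum(weights) <= 0:
        raise ValueError("weights must be non-negative with a positive sum")

    total = sum(weights)
    # Map each p-value to a standard Cauchy variate and take the
    # weighted average; under the null this average is approximately
    # standard Cauchy, which gives the combined p-value in closed form.
    stat = sum(
        w * math.tan((0.5 - p) * math.pi) for w, p in zip(weights, p_values)
    ) / total
    return 0.5 - math.atan(stat) / math.pi


if __name__ == "__main__":
    # Made-up p-values standing in for tests under different annotation weights.
    example = [0.01, 0.5, 0.5]
    print(cauchy_combine(example))
```

One small p-value among several unremarkable ones still yields a small combined p-value, which is why a rule of this kind suits aggregating tests across many annotations when only a few are informative.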

Availability and implementation: The workflow is freely available from https://dockstore.org/workflows/github.com/sheilagaynor/STAAR_workflow.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Cloud Computing*
  • Genome
  • Genome-Wide Association Study
  • Software*
  • Workflow