happi: a hierarchical approach to pangenomics inference

Genome Biol. 2023 Sep 29;24(1):214. doi: 10.1186/s13059-023-03040-6.

Abstract

Recovering metagenome-assembled genomes (MAGs) from shotgun sequencing data is an increasingly common task in microbiome studies, as MAGs provide deeper insight into the functional potential of both culturable and non-culturable microorganisms. However, MAGs vary in quality and may contain omissions and contamination. These errors present challenges for detecting genes and comparing gene enrichment across sample types. To address these challenges, we propose happi, an approach to testing hypotheses about gene enrichment that accounts for genome quality. We illustrate the advantages of happi over existing approaches using published Saccharibacteria and Streptococcus thermophilus MAGs, as well as simulation.

Keywords: Hypothesis testing; Metagenome-assembled genomes; Microbiome; Shotgun metagenomics; Statistical models.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Computer Simulation
  • Metagenome
  • Metagenomics*
  • Microbiota* / genetics
  • Sequence Analysis, DNA