ARGNet: using deep neural networks for robust identification and classification of antibiotic resistance genes from sequences

Microbiome. 2024 May 9;12(1):84. doi: 10.1186/s40168-024-01805-0.

Abstract

Background: The emergence of antibiotic resistance in bacteria is a major threat to global health. Antibiotic resistance genes (ARGs) are key determinants of bacterial resistance and of its spread across environments. Identifying ARGs, particularly from high-throughput sequencing data of specimens, is the state-of-the-art method for comprehensively monitoring their spread and evolution. Current computational methods for identifying ARGs rely mainly on alignment-based sequence similarity to known ARGs. Such approaches are limited by the choice of reference database and may miss novel ARGs. The similarity thresholds are usually simple and cannot accommodate variation across gene families and regions. Alignment-based methods are also difficult to scale as the volume of sequence data grows.
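To make the threshold limitation concrete, the following is a minimal sketch of how a single fixed identity cutoff behaves. It is not taken from any ARG tool: the function names, the toy sequences, and the 0.8 cutoff are hypothetical, and the sequences are assumed to be pre-aligned to equal length (real pipelines compute alignments with dedicated aligners).

```python
def percent_identity(a: str, b: str) -> float:
    """Fraction of matching positions between two pre-aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not a:
        return 0.0
    # Gap characters ('-') never count as matches.
    matches = sum(x == y and x != "-" for x, y in zip(a, b))
    return matches / len(a)


def best_hit(query: str, references: dict, threshold: float = 0.8):
    """Return (name, identity) of the best reference at or above the cutoff, else None.

    The same hypothetical cutoff is applied to every reference, which is the
    'simple threshold' behaviour described in the Background.
    """
    best = None
    for name, ref in references.items():
        ident = percent_identity(query, ref)
        if ident >= threshold and (best is None or ident > best[1]):
            best = (name, ident)
    return best


# Toy reference set (made-up sequences, for illustration only).
REFERENCES = {"refA": "MKTAYIAKQR", "refB": "MKTAYIAKQW"}

close_query = "MKTAYIAKQW"      # identical to refB
divergent_query = "MKTGGGGGQR"  # shares only half of its positions with refA

print(best_hit(close_query, REFERENCES))
print(best_hit(divergent_query, REFERENCES))
```

In this toy case the divergent query falls below the cutoff for every reference and is reported as no hit, which illustrates how a distant or novel homolog can be missed regardless of which gene family it belongs to.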

Results: In this study, we developed ARGNet, a deep neural network that combines an unsupervised autoencoder model to identify ARGs with a multiclass convolutional neural network to classify them, neither of which depends on sequence alignment. This approach enables more efficient discovery of both known and novel ARGs. ARGNet accepts both amino acid and nucleotide sequences of variable length, from partial sequences (30-50 aa; 100-150 nt) to full-length proteins or genes, allowing its application to both targeted sequencing and metagenomic sequencing. Our performance evaluation showed that ARGNet outperformed other deep learning models, including DeepARG and HMD-ARG, in most application scenarios, especially in the quasi-negative test and in the analysis of prediction consistency with the phylogenetic tree. ARGNet also reduced inference runtime by up to 57% relative to DeepARG.
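Neural networks of this kind need numeric, fixed-shape input, so variable-length sequences are commonly one-hot encoded and then padded or truncated. The sketch below shows that general technique for amino acid sequences. It is an assumption for illustration only: the abstract does not describe ARGNet's actual encoding, and the function name, the 20-letter alphabet, and the default `max_len` of 50 are hypothetical choices.

```python
# The 20 standard amino acids; anything else (e.g. 'X') maps to an all-zero row.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
AA_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def one_hot_encode(seq: str, max_len: int = 50) -> list:
    """Encode a protein sequence as a max_len x 20 matrix of 0.0/1.0 values.

    Sequences longer than max_len are truncated; shorter ones are
    zero-padded at the end so that every input has the same shape.
    """
    matrix = [[0.0] * len(AMINO_ACIDS) for _ in range(max_len)]
    for pos, residue in enumerate(seq.upper()[:max_len]):
        idx = AA_INDEX.get(residue)
        if idx is not None:
            matrix[pos][idx] = 1.0
    return matrix


# A short fragment and a longer sequence end up with identical shapes.
short = one_hot_encode("MKT", max_len=5)
long_ = one_hot_encode("MKTAYIAKQR", max_len=5)
print(len(short), len(short[0]))
print(len(long_), len(long_[0]))
```

A nucleotide version would follow the same pattern with a four-letter alphabet. Fixed-shape padding is only one way to handle variable lengths; other designs are possible.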

Conclusions: ARGNet is flexible, efficient, and accurate at predicting a broad range of ARGs from sequencing data. ARGNet is freely available at https://github.com/id-bioinfo/ARGNet, with an online service provided at https://ARGNet.hku.hk.

Keywords: ARGNet; Antibiotic resistance; Antibiotic resistance genes; Autoencoder; Deep learning; Multiclass classification convolutional neural network.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Anti-Bacterial Agents / pharmacology
  • Bacteria* / classification
  • Bacteria* / drug effects
  • Bacteria* / genetics
  • Computational Biology / methods
  • Deep Learning
  • Drug Resistance, Bacterial / genetics
  • Drug Resistance, Microbial / genetics
  • Genes, Bacterial / genetics
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Neural Networks, Computer*

Substances

  • Anti-Bacterial Agents

Grants and funding