ProteoClade: A taxonomic toolkit for multi-species and metaproteomic analysis

PLoS Comput Biol. 2020 Mar 9;16(3):e1007741. doi: 10.1371/journal.pcbi.1007741. eCollection 2020 Mar.

Abstract

We present ProteoClade, a Python toolkit that performs taxa-specific peptide assignment, protein inference, and quantitation for multi-species proteomics experiments. ProteoClade scales to hundreds of millions of protein sequences, requires minimal computational resources, and is open source, multi-platform, and accessible to non-programmers. We demonstrate its utility for processing quantitative proteomic data derived from patient-derived xenografts and its speed and scalability enable a novel de novo proteomic workflow for complex microbiota samples.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Databases, Protein
  • Humans
  • Mice
  • Microbiota / genetics
  • Proteins* / chemistry
  • Proteins* / classification
  • Proteins* / genetics
  • Proteomics / methods*
  • Sequence Analysis, Protein / methods
  • Software*

Substances

  • Proteins