Expanding the coverage of spatial proteomics: a machine learning approach

Bioinformatics. 2024 Feb 1;40(2):btae062. doi: 10.1093/bioinformatics/btae062.

Abstract

Motivation: Multiplexed protein imaging methods use a chosen set of markers and provide valuable information about complex tissue structure and cellular heterogeneity. However, the number of markers that can be measured in the same tissue sample is inherently limited.

Results: In this paper, we present an efficient method to choose a minimal predictive subset of markers that for the first time allows the prediction of full images for a much larger set of markers. We demonstrate that our approach also outperforms previous methods for predicting cell-level protein composition. Most importantly, we demonstrate that our approach can be used to select a marker set that enables prediction of a much larger set than could be measured concurrently.

Availability and implementation: All code and intermediate results are available in a Reproducible Research Archive at https://github.com/murphygroup/CODEXPanelOptimization.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Machine Learning*
  • Proteomics* / methods