Toward deterministic and semiautomated SPADE analysis

Cytometry A. 2017 Mar;91(3):281-289. doi: 10.1002/cyto.a.23068. Epub 2017 Feb 24.

Abstract

SPADE stands for spanning-tree progression analysis for density-normalized events. It combines downsampling, clustering and a minimum-spanning tree to provide an intuitive visualization of high-dimensional single-cell data, which assists with the interpretation of the cellular heterogeneity underlying the data. SPADE has been widely used for analysis of high-content flow cytometry data and CyTOF data. The downsampling and clustering components of SPADE are both stochastic, which lead to stochasticity in the tree visualization it generates. Running SPADE twice on the same data may generate two different tree structures. Although they typically lead to the same biological interpretation of subpopulations present in the data, robustness of the algorithm can be improved. Another avenue of improvement is the interpretation of the SPADE tree, which involves visual inspection of multiple colored versions of the tree based on expression of measured markers. This is essentially manual gating on the SPADE tree and can benefit from automated algorithms. This article presents improvements of SPADE in both aspects above, leading to a deterministic SPADE algorithm and a software implementation for semiautomated interpretation. © 2017 International Society for Advancement of Cytometry.

Keywords: SPADE; deterministic.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Flow Cytometry / methods*
  • Flow Cytometry / statistics & numerical data
  • Genetic Heterogeneity
  • Humans
  • Software*