Tailor: Targeting heavy tails in flow cytometry data with fast, interpretable mixture modeling

Matei Ionita; Richard Schretzenmair; Derek Jones; Jonni Moore; Li-San Wang; Wade Rogers

doi:10.1002/cyto.a.24307

Tailor: Targeting heavy tails in flow cytometry data with fast, interpretable mixture modeling

Cytometry A. 2021 Feb;99(2):133-144. doi: 10.1002/cyto.a.24307.

Authors

Matei Ionita¹, Richard Schretzenmair¹, Derek Jones¹, Jonni Moore¹, Li-San Wang¹, Wade Rogers^{1

2}

Affiliations

¹ Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA.
² Corporate/Research, Still Pond Cytomics, West Chester, PA, USA.

PMID: 33476090
DOI: 10.1002/cyto.a.24307

Abstract

Automated clustering workflows are increasingly used for the analysis of high parameter flow cytometry data. This trend calls for algorithms which are able to quickly process tens of millions of data points, to compare results across subjects or time points, and to provide easily actionable interpretations of the results. To this end, we created Tailor, a model-based clustering algorithm specialized for flow cytometry data. Our approach leverages a phenotype-aware binning scheme to provide a coarse model of the data, which is then refined using a multivariate Gaussian mixture model. We benchmark Tailor using a simulation study and two flow cytometry data sets, and show that the results are robust to moderate departures from normality and inter-sample variation. Moreover, Tailor provides automated, non-overlapping annotations of its clusters, which facilitates interpretation of results and downstream analysis. Tailor is released as an R package, and the source code is publicly available at www.github.com/matei-ionita/Tailor.

Keywords: clustering algorithms; computation and informatics; high-parameter flow cytometry; mixture modeling.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Cluster Analysis
Flow Cytometry
Humans
Normal Distribution
Software*

Abstract

Publication types

MeSH terms

Grants and funding