A decision-tree approach for the differential diagnosis of chronic lymphoid leukemias and peripheral B-cell lymphomas

Comput Methods Programs Biomed. 2019 Sep:178:85-90. doi: 10.1016/j.cmpb.2019.06.014. Epub 2019 Jun 13.

Abstract

Background and objective: Here we propose a decision-tree approach for the differential diagnosis of distinct WHO categories B-cell chronic lymphoproliferative disorders using flow cytometry data. Flow cytometry is the preferred method for the immunophenotypic characterization of leukemia and lymphoma, being able to process and register multiparametric data about tens of thousands of cells per second.

Methods: The proposed decision-tree is composed by logistic function nodes that branch throughout the tree into sets of (possible) distinct leukemia/lymphoma diagnoses. To avoid overfitting, regularization via the Lasso algorithm was used. The code can be run online at https://codeocean.com/2018/03/08/a-decision-tree-approach-for-the-differential-diagnosis-of-chronic-lymphoid-leukemias-and-peripheral-b-cell-lymphomas/ or downloaded from https://github.com/lauramoraes/bioinformatics-sourcecode to be executed in Matlab.

Results: The proposed approach was validated in diagnostic peripheral blood and bone marrow samples from 283 mature lymphoid leukemias/lymphomas patients. The proposed approach achieved 95% correctness in the cross-validation test phase (100% in-sample), 61% giving a single diagnosis and 34% (possible) multiple disease diagnoses. Similar results were obtained in an out-of-sample validation dataset. The generated tree reached the final diagnoses after up to seven decision nodes.

Conclusions: Here we propose a decision-tree approach for the differential diagnosis of mature lymphoid leukemias/lymphomas which proved to be accurate during out-of-sample validation. The full process is accomplished through seven binary transparent decision nodes.

Keywords: Classification; Diagnosis; Flow cytometry; Hierarchical tree; Lymphomas; Machine learning.

MeSH terms

  • Algorithms
  • Chronic Disease
  • Decision Trees*
  • Flow Cytometry*
  • Humans
  • Immunophenotyping*
  • Leukemia, Lymphoid / diagnosis*
  • Lymphoma, B-Cell / diagnosis*
  • Medical Oncology / standards*
  • Models, Statistical
  • Reproducibility of Results