L O Moraes1, C E Pedreira2, S Barrena3, A Lopez4, A Orfao5. 1. Rua Horacio Macedo 2030, Rio de Janeiro/RJ, CEP: 21941-914, Brazil. Electronic address: lmoraes@cos.ufrj.br. 2. Rua Horacio Macedo 2030, Rio de Janeiro/RJ, CEP: 21941-914, Brazil. Electronic address: pedreira56@gmail.com. 3. Lab 11, Centro de Investigacion del Cancer, Paseo de la Universidad de Coimbra, Campus Miguel Unamuno, 37002 Salamanca, España. Electronic address: subadelfa@usal.es. 4. Lab 11, Centro de Investigacion del Cancer, Paseo de la Universidad de Coimbra, Campus Miguel Unamuno, 37002 Salamanca, España. Electronic address: antuam@usal.es. 5. Lab 11, Centro de Investigacion del Cancer, Paseo de la Universidad de Coimbra, Campus Miguel Unamuno, 37002 Salamanca, España. Electronic address: orfao@usal.es.
Abstract
BACKGROUND AND OBJECTIVE: Here we propose a decision-tree approach for the differential diagnosis of distinct WHO categories B-cell chronic lymphoproliferative disorders using flow cytometry data. Flow cytometry is the preferred method for the immunophenotypic characterization of leukemia and lymphoma, being able to process and register multiparametric data about tens of thousands of cells per second. METHODS: The proposed decision-tree is composed by logistic function nodes that branch throughout the tree into sets of (possible) distinct leukemia/lymphoma diagnoses. To avoid overfitting, regularization via the Lasso algorithm was used. The code can be run online at https://codeocean.com/2018/03/08/a-decision-tree-approach-for-the-differential-diagnosis-of-chronic-lymphoid-leukemias-and-peripheral-b-cell-lymphomas/ or downloaded from https://github.com/lauramoraes/bioinformatics-sourcecode to be executed in Matlab. RESULTS: The proposed approach was validated in diagnostic peripheral blood and bone marrow samples from 283 mature lymphoid leukemias/lymphomas patients. The proposed approach achieved 95% correctness in the cross-validation test phase (100% in-sample), 61% giving a single diagnosis and 34% (possible) multiple disease diagnoses. Similar results were obtained in an out-of-sample validation dataset. The generated tree reached the final diagnoses after up to seven decision nodes. CONCLUSIONS: Here we propose a decision-tree approach for the differential diagnosis of mature lymphoid leukemias/lymphomas which proved to be accurate during out-of-sample validation. The full process is accomplished through seven binary transparent decision nodes.
BACKGROUND AND OBJECTIVE: Here we propose a decision-tree approach for the differential diagnosis of distinct WHO categories B-cell chronic lymphoproliferative disorders using flow cytometry data. Flow cytometry is the preferred method for the immunophenotypic characterization of leukemia and lymphoma, being able to process and register multiparametric data about tens of thousands of cells per second. METHODS: The proposed decision-tree is composed by logistic function nodes that branch throughout the tree into sets of (possible) distinct leukemia/lymphoma diagnoses. To avoid overfitting, regularization via the Lasso algorithm was used. The code can be run online at https://codeocean.com/2018/03/08/a-decision-tree-approach-for-the-differential-diagnosis-of-chronic-lymphoid-leukemias-and-peripheral-b-cell-lymphomas/ or downloaded from https://github.com/lauramoraes/bioinformatics-sourcecode to be executed in Matlab. RESULTS: The proposed approach was validated in diagnostic peripheral blood and bone marrow samples from 283 mature lymphoid leukemias/lymphomaspatients. The proposed approach achieved 95% correctness in the cross-validation test phase (100% in-sample), 61% giving a single diagnosis and 34% (possible) multiple disease diagnoses. Similar results were obtained in an out-of-sample validation dataset. The generated tree reached the final diagnoses after up to seven decision nodes. CONCLUSIONS: Here we propose a decision-tree approach for the differential diagnosis of mature lymphoid leukemias/lymphomas which proved to be accurate during out-of-sample validation. The full process is accomplished through seven binary transparent decision nodes.
Authors: Nathan Radakovich; Manja Meggendorfer; Luca Malcovati; C Beau Hilton; Mikkael A Sekeres; Jacob Shreve; Yazan Rouphail; Wencke Walter; Stephan Hutter; Anna Galli; Sara Pozzi; Chiara Elena; Eric Padron; Michael R Savona; Aaron T Gerds; Sudipto Mukherjee; Yasunobu Nagata; Rami S Komrokji; Babal K Jha; Claudia Haferlach; Jaroslaw P Maciejewski; Torsten Haferlach; Aziz Nazha Journal: Blood Adv Date: 2021-11-09
Authors: Yousra El Alaoui; Adel Elomri; Marwa Qaraqe; Regina Padmanabhan; Ruba Yasin Taha; Halima El Omri; Abdelfatteh El Omri; Omar Aboumarzouk Journal: J Med Internet Res Date: 2022-07-12 Impact factor: 7.076