Literature DB >> 31733590

Cell Population Data-Driven Acute Promyelocytic Leukemia Flagging Through Artificial Neural Network Predictive Modeling.

Rana Zeeshan Haider1, Ikram Uddin Ujjan2, Tahir S Shamsi3.   

Abstract

A targeted and timely offered treatment can be a benefitting tool for patients with acute promyelocytic leukemia (APML). Current round of study made use of potential morphological and immature fraction-related parameters (cell population data) generated during complete blood cell count (CBC), through artificial neural network (ANN) predictive modeling for early flagging of APML cases. We collected classical CBC items along with cell population data (CPD) from hematology analyzer at diagnosis of 1067 study subjects with hematological neoplasms. For morphological assessment, peripheral blood films were examined. Statistical and machine learning tools including principal component analysis (PCA) helped in the evaluation of predictive capacity of routine and CPD items. Then selected CBC item-driven ANN predictive modeling was developed to smartly use the hidden trend by increasing the auguring accuracy of these parameters in differentiation of APML cases. We found a characteristic triad based on lower (53.73) platelet count (PLT) with decreased/normal (4.72) immature fraction of platelet (IPF) with addition of significantly higher value (65.5) of DNA/RNA content-related neutrophil (NE-SFL) parameter in patients with APML against other hematological neoplasm's groups. On PCA, APML showed exceptionally significant variance for PLT, IPF, and NE-SFL. Through training of ANN predictive modeling, our selected CBC items successfully classify the APML group from non-APML groups at highly significant (0.894) AUC value with lower (2.3 percent) false prediction rate. Practical results of using our ANN model were found acceptable with value of 95.7% and 97.7% for training and testing data sets, respectively. We proposed that PLT, IPF, and NE-SFL could potentially be used for early flagging of APML cases in the hematology-oncology unit. CBC item-driven ANN modeling is a novel approach that substantially strengthen the predictive potential of CBC items, allowing the clinicians to be confident by the typical trend raised by these studied parameters.
Copyright © 2019 The Authors. Published by Elsevier Inc. All rights reserved.

Entities:  

Year:  2019        PMID: 31733590      PMCID: PMC6859536          DOI: 10.1016/j.tranon.2019.09.009

Source DB:  PubMed          Journal:  Transl Oncol        ISSN: 1936-5233            Impact factor:   4.243


Introduction

Acute promyelocytic leukemia (APML) in comparison with other types of hematological neoplasms, particularly acute myeloid leukemia (AMLs) has highly bizarre clinical, morphological, and biological characteristics and requires a targeted therapy [1]. Start of the treatment without any delay for definitive diagnosis or other concerns is an effective approach in APML [2]. This is particularly because of life-threatening coagulopathy initiated by the abnormal promyelocytes (AP) [3]. The development of innovative technologies and analytical principles in flow cytometry has allowed the commercialization of new-generation hematological analyzers. These are capable to generate data about the morphological characteristics and immature fractions of blood cells termed as "cell population data" (CPD) along with classical complete blood cells (CBC) count parameters. CPD items have a high degree of analytical efficiency for identifying many cellular abnormalities and can be used for the screening of several hematological and nonhematological disorders [[4], [5], [6], [7], [8]]. The aim of this study was to evaluate the clinical utility of WBC morphology (CPD) parameters in early premicroscopic differentiation of APML. Apart from the microscopic classification systems and its limitations, we worked to propose a novel prediction model based on artificial neural network (ANN); multilayer perceptron (MLP) that would utilize routinely generated scattering items along with CBC testing. We hypothesized that an ANN-based model built on CPD items would enhance predictive ability of these morphological parameters and this kind of modeling can be incorporated in real-time clinical practice.

Materials and Methods

We clinically evaluated CPD items generated by one of flow cytometry–based hematology analyzer (Sysmex XN-1000, Kobe, Japan) in peripheral blood for early premicroscopic exclusion of APML from other common hematological neoplasms. From January 2014 to July 2017, a total of 1067 patients with 44 APML (PML-RARA), 181 AML (excluded APML), 89 chronic myeloid leukemia (CML), 51 myelodysplastic syndrome (MDS), 71 myeloproliferative disorders (MPN) except CML, 10 MDS/MPN, 136 acute lymphocytic leukemia (ALL), 9 Hodgkin's lymphoma (HL), 95 non-Hodgkin's lymphoma (NHL), 32 multiple myeloma, and 349 normal control were prospectively enrolled at the National Institute of Blood Disease and Bone Marrow Transplantation (NIBD & BMT) Karachi, Pakistan. Flow cytometry–based modern hematology analyzer (Sysmex XN-1000) was used for CBC testing of 1067 blood samples. Demographic features, classical CBC items, and CPD parameters were compared between the study groups. To explore the subtle pattern and to generate better predictive model on significantly deviated CBC items (PLT, immature platelet fraction (IPF), and NE-SFL), we used unsupervised machine learning tool (principal component analysis [PCA]) and MLP analysis, one of the common types of ANN. It performed prediction and classification along with generation of predicted-by-observed chart, ROC curve, cumulative gains, and lift charts for MLP. The Institutional Review Board of NIBD and BMT approved this study (permit number: NIBD/RD-167/14-2014).

Results

From routine and extended CBC parameters (Table 1), values of platelet count and IPF showed characteristic trend for APML group against other study groups. Although individually lower values for platelet count or IPF is noted in various study groups as in APML but none of these group showed decrease/near normal value of IPF in demand of such degree of thrombocytopenia. While among white cell morphological items, patients with APML showed significantly higher value for NE-SFL (side fluorescent light intensity) compared with its corresponding hematological neoplasms (Table 2).
Table 1

Mean (Along Standard Deviation) Values for Classical and Extended CBC Items Generated by Modern (Sysmex XN-1000) Hematology Analyzer Are Presented for Our Study Groups

For quick visual identification of "hot" and "cold" spots in reference to normal control color-shading (blue for low and red for high count) approach is used.

Hb, hemoglobin; RBC, red blood cell; PCV, pack cell volume; MCV, mean cell volume; MCH, mean cell hemoglobin; MCHC, mean cell hemoglobin concentration; WBC, white blood cell; PLT, platelet; NEUT#, absolute neutrophil count; LYMPH#, absolute lymphocyte count; MONO, absolute monocyte count; EO#, absolute eosinophil count; BASO#, absolute basophil count; NEUT%, neutrophil percent; LYMPH%, lymphocyte percent; MONO%, monocyte percent; EO%, eosinophil percent; BASO%, basophil percent; IG#, absolute immature granulocyte count; IG%, immature granulocyte percent; RDW, red cell distribution width; NRBC#, absolute nucleated red blood cell count; NRBC%, nucleated red blood cell percent; IPF, immature platelet fraction; MM, multiple myeloma.

Table 2

Values of CBC-Based White Cell Scattering (Morphological) Items Among Study Groups

NE-SSC, neutrophils cell complexity; NE-SFL, neutrophils fluorescence intensity; NE-FSC, neutrophils cell size; LY-X, lymphocytes cell complexity; LY-Y, lymphocytes fluorescence intensity; LY-Z, lymphocytes cell size; MO-X, monocytes cells complexity; MO-Y, monocytes fluorescence intensity; MO-Z, monocytes cell size; NE-WX, neutrophils complexity and the width of dispersion; NE-WY, neutrophils fluorescence intensity and the width of dispersion; NE-WZ, neutrophils cell size and the width of dispersion; LY-WX, lymphocytes complexity and width of dispersion; LY-WY, lymphocytes fluorescence intensity and the width of dispersion; LY-WZ, lymphocytes cell size and the width of dispersion; MO-WX, monocytes complexity and the width of dispersion; MO-WY, monocytes fluorescence intensity and the width of dispersion; MO-WZ, monocytes cell size and the width of dispersion; MM, multiple myeloma.

Mean (Along Standard Deviation) Values for Classical and Extended CBC Items Generated by Modern (Sysmex XN-1000) Hematology Analyzer Are Presented for Our Study Groups For quick visual identification of "hot" and "cold" spots in reference to normal control color-shading (blue for low and red for high count) approach is used. Hb, hemoglobin; RBC, red blood cell; PCV, pack cell volume; MCV, mean cell volume; MCH, mean cell hemoglobin; MCHC, mean cell hemoglobin concentration; WBC, white blood cell; PLT, platelet; NEUT#, absolute neutrophil count; LYMPH#, absolute lymphocyte count; MONO, absolute monocyte count; EO#, absolute eosinophil count; BASO#, absolute basophil count; NEUT%, neutrophil percent; LYMPH%, lymphocyte percent; MONO%, monocyte percent; EO%, eosinophil percent; BASO%, basophil percent; IG#, absolute immature granulocyte count; IG%, immature granulocyte percent; RDW, red cell distribution width; NRBC#, absolute nucleated red blood cell count; NRBC%, nucleated red blood cell percent; IPF, immature platelet fraction; MM, multiple myeloma. Values of CBC-Based White Cell Scattering (Morphological) Items Among Study Groups NE-SSC, neutrophils cell complexity; NE-SFL, neutrophils fluorescence intensity; NE-FSC, neutrophils cell size; LY-X, lymphocytes cell complexity; LY-Y, lymphocytes fluorescence intensity; LY-Z, lymphocytes cell size; MO-X, monocytes cells complexity; MO-Y, monocytes fluorescence intensity; MO-Z, monocytes cell size; NE-WX, neutrophils complexity and the width of dispersion; NE-WY, neutrophils fluorescence intensity and the width of dispersion; NE-WZ, neutrophils cell size and the width of dispersion; LY-WX, lymphocytes complexity and width of dispersion; LY-WY, lymphocytes fluorescence intensity and the width of dispersion; LY-WZ, lymphocytes cell size and the width of dispersion; MO-WX, monocytes complexity and the width of dispersion; MO-WY, monocytes fluorescence intensity and the width of dispersion; MO-WZ, monocytes cell size and the width of dispersion; MM, multiple myeloma. We conducted PCA to visualize a typical trend of IPF in lower platelet count along with higher values of NE-SFL for APML against its peer study groups. As shown in Figure 1, APML showed significant variance in comparison with other hematological neoplasms and found on upper extreme left end of PCA pilot. Triad of IPF, plate count, and NE-SFL successfully differentiate APML from other study groups (Figure 1).
Figure 1

Visualization of latent pattern of selected (PLT, IPF, and NE-SFL) CBC parameters for study groups through principal component analysis (PCA). By unsupervised machine learning tools (PCA) three-dimensional data reduced into two dimensions so that we can plot and understand our data in a better way. Together, both components (PC1 and PC2) covered 90.2% of the information (variance). In plot, groups are labeled with their names.

Visualization of latent pattern of selected (PLT, IPF, and NE-SFL) CBC parameters for study groups through principal component analysis (PCA). By unsupervised machine learning tools (PCA) three-dimensional data reduced into two dimensions so that we can plot and understand our data in a better way. Together, both components (PC1 and PC2) covered 90.2% of the information (variance). In plot, groups are labeled with their names. The summary of MLP model (Figure 2) to challenge the typical trend of NE-SFL, platelet count, and IPF for prediction of patients with APML showed various positive signs as "cross entropy error" with smallest error (20.73) for testing data set. In addition to that, it also showed notably lower percentage of incorrect predictions as a whole and particularly for testing (2.3%) as compared with training data set (4.3%). Practical results of using the MLP network were acceptable with 95.7% and 97.7% for training and testing data sets, respectively, as shown in classification table (Figure 2). A visual look of ROC curve and 0.894 AUC value further endorsed the predictive ability of the network.
Figure 2

The model summary, classification table, predicted-by-observed chart, ROC curve, cumulative gains and lift chart for multilayer perceptron for APML vs. non-APML.

The model summary, classification table, predicted-by-observed chart, ROC curve, cumulative gains and lift chart for multilayer perceptron for APML vs. non-APML. The predicted-by-observed chart (Figure 2) for the combined training and testing samples displayed predicted pseudoprobabilities as clustered boxplots. All boxplots were found symmetrical about the horizontal line at 0.5 mark. The cumulative gains chart showed notable trend that if we score a data set with the network and sort all the cases by predicted pseudoprobability of APML, we can expect top 10% to contain approximately 65% of all the cases considering the APML category (Figure 2). The lift chart provides (Figure 2) us a different look on cumulative gains chart; the lift at 10% for the category APML is 65%/10% = 6.5. We must know that both cumulative and lift charts are based on combined testing and training samples.

Discussion

Clinically, APML represents a hematologic emergency that enforces a rapid exclusion for a quick start of its treatment, even before molecular and cytogenetic diagnosis (PML-RARA rearrangement). CPD items generated by hematology analyzers have a strong potential for being introduced as automated morphological items for detection of changes in WBC white cell's morphology. Various studies were conducted to evaluate clinical utility of CPD items for early exclusion of hematological disorders. A study by Yang et al. evaluated white cell scattering items for differentiation of acute leukemia lineage by using Coulter DxH800 analyzer. In this study, the authors derive 21-item–based model and reported very high (100%) specificity and sensitivity for differentiation of APML cases, whereas for ALL, comparatively less-significant specificity and sensitivity was achieved [4]. In another study by Virk et al., the clinical utility of the white cell scattering items, scattergrams, and flags for screening of AML cases with significant specificity was published [9]. In this study, we found an interesting "triad", that is a fraction of immature platelets (IPF) remained decreased or normal in response to thrombocytopenia (lower platelet count) and NE-SFL came with significantly higher values in patients with APML. Higher values of NE-SFL may be justified by an increased infiltration of immature hyperactive (AP) having higher RNA/DNA content than other immature and mature cells that falsely counted in neutrophil area. Increased destruction along with suppression of platelet production may partly explain the low platelet count and decreased IPF value (near normal) in regard to severity of thrombocytopenia. Hence, we propose the use of platelet count, NE-SFL, and IPF as criteria for early flagging of APML, which will enforce the rapid and targeted peripheral film examination for APs and subsequent diagnostic workup. Here, it is also clear that this criterion cannot be used as diagnostic parameters because it is confirmed by PML-RARA. Through this study, we have also demonstrated the clinical utility of applying machine learning tools in predictive modeling on data generated with CBC test, which in our hands could be used as morphological parameters for premicroscopic exclusion and guide the further diagnostic workup of APML. On peripheral blood film's microscopic images, by using image processing tools (image segmentation, feature extraction, cell identification, and other) and ANN-based modeling, various studies were conducted. These studies include the classification of acute lymphocytic leukemic (ALL) cells from healthy cells [[10], [11], [12], [13], [14], [15]], acute myeloid leukemic (AML) cells from healthy cells [16,17], and WBCs grouping into various classes of AML, ALL, and their French-American-British (FAB) subtypes [[18], [19], [20], [21], [22], [23], [24], [25], [26]]. To our knowledge, this is the first study for CPD-driven flagging of APML through an ANN predictive modeling. We suggest that an ANN-based modeling would be able to smartly recognize the typical patterns in selected CBC items for differentiation of APML cases more accurately. Eventually, if successful, CBC (routine, extended, and CPD) items through ANNs could be incorporated into hematology laboratory decision support/information system for flagging of APML. As a future work, this approach could be studied in an extended way by using WBC, red blood cell (RBC), and platelet morphological and immature fraction parameters for short and targeted diagnostic workups especially in common (regional) blood disorders.

Conclusion

A characteristic triad of lower platelet counts with decreased fraction of immature platelet (IPF) along higher value for DNA/RNA content–related neutrophil (NE-SFL) parameter is illustrated for flagging of APML. –driven-driven–The platelet, IPF, and NE-SFL values driven ANN modeling is a novel approach that substantially strengthen the flagging potential of these parameters in augur of APML cases. This approach could potentially reduce the frequency of extra and irrational diagnostic tests, which is time-consuming and extra burden on patients. Hence, the proposed selected CBC item–driven predictive modeling can be used as an assistant diagnostic tool in decision support system of hematology laboratory and clinic for screening of APML.

Author contribution

R.Z.H. conducted the experiments, collected and analyzed the data under supervision of I.U. and T.S.S.; I.U., T.S.S., and R.Z.H. wrote the paper. All authors read and approved the final manuscript.

Conflicts of interest

Authors have no conflict of interest.

Ethics approval

The Institutional Review Board of NIBD, Karachi, Pakistan (Permit number: NIBD/RD-167/14-2014).
  18 in total

1.  Lymphocyte volume and conductivity indices of the haematology analyser Coulter GEN.S in lymphoproliferative disorders and viral diseases.

Authors:  M Silva; C Fourcade; C Fartoukh; B Lenormand; G Buchonnet; M P Callat; C Leclerc; J P Basuyau; M Vasse
Journal:  Clin Lab Haematol       Date:  2006-02

2.  A new high-throughput screening method for the detection of chronic lymphatic leukemia and myelodysplastic syndrome.

Authors:  Elisabeth Haschke-Becher; Michael Vockenhuber; Paul Niedetzky; Uwe Totzke; Christian Gabriel
Journal:  Clin Chem Lab Med       Date:  2008       Impact factor: 3.694

3.  Correction: Segmentation and Classification of Bone Marrow Cells Images Using Contextual Information for Medical Diagnosis of Acute Leukemias.

Authors:  Carolina Reta; Leopoldo Altamirano; Jesus A Gonzalez; Raquel Diaz-Hernandez; Hayde Peregrina; Ivan Olmos; Jose E Alonso; Ruben Lobato
Journal:  PLoS One       Date:  2015-07-24       Impact factor: 3.240

4.  Determination of acute leukemia lineage with new morphologic parameters available in the complete blood cell count.

Authors:  Jin Hyuk Yang; Yonggoo Kim; Jihyang Lim; Myungshin Kim; Eun-Jee Oh; Hae-Kyung Lee; Yeon-Joon Park; Woo Sung Min; Bin Cho; Kwangyoung Lee; Kyungja Han
Journal:  Ann Clin Lab Sci       Date:  2014       Impact factor: 1.256

5.  Utility of cell population data (VCS parameters) as a rapid screening tool for Acute Myeloid Leukemia (AML) in resource-constrained laboratories.

Authors:  Harpreet Virk; Neelam Varma; Shano Naseem; Ishwar Bihana; Dmitry Sukhachev
Journal:  J Clin Lab Anal       Date:  2018-09-29       Impact factor: 2.352

6.  Automated neutrophil morphology and its utility in the assessment of neutrophil dysplasia.

Authors:  Amparo Miguel; Maite Orero; Ramon Simon; Rosa Collado; Pedro L Perez; Alejandro Pacios; Rosario Iglesias; Antonio Martinez; Felix Carbonell
Journal:  Lab Hematol       Date:  2007

7.  Recognition of acute lymphoblastic leukemia cells in microscopic images using k-means clustering and support vector machine classifier.

Authors:  Morteza Moradi Amin; Saeed Kermani; Ardeshir Talebi; Mostafa Ghelich Oghli
Journal:  J Med Signals Sens       Date:  2015 Jan-Mar

8.  Comparison of laboratory characteristics between acute promyelocytic leukemia and other subtypes of acute myeloid leukemia with disseminated intravascular coagulation.

Authors:  Hee-Jeong Lee; Hyung-Jin Park; Hyun-Wook Kim; Sang-Gon Park
Journal:  Blood Res       Date:  2013-12-24

9.  Automatic Recognition of Acute Myelogenous Leukemia in Blood Microscopic Images Using K-means Clustering and Support Vector Machine.

Authors:  Fatemeh Kazemi; Tooraj Abbasian Najafabadi; Babak Nadjar Araabi
Journal:  J Med Signals Sens       Date:  2016 Jul-Sep

10.  Cell Population Data NE-SFL and MO-WX From Sysmex XN-3000 Can Provide Additional Information for Exclusion of Acute Promyelocytic Leukemia From Other Acute Myeloid Leukemias: A Preliminary Study.

Authors:  Sang Hyuk Park; Hyung Hoi Kim; In Suk Kim; Jongyoun Yi; Chulhun L Chang; Eun Yup Lee
Journal:  Ann Lab Med       Date:  2016-11       Impact factor: 3.464

View more
  6 in total

1.  Usefulness of Leucocyte Cell Population Data by Sysmex XN1000 Hematology Analyzer in Rapid Identification of Acute Leukemia.

Authors:  Shruti Mishra; Gaurav Chhabra; Somanath Padhi; Sonali Mohapatra; Ashutosh Panigrahi; Mukund Namdev Sable; Prabodha Kumar Das
Journal:  Indian J Hematol Blood Transfus       Date:  2021-09-28       Impact factor: 0.915

2.  Deep learning identifies Acute Promyelocytic Leukemia in bone marrow smears.

Authors:  Karsten Wendt; Jan Moritz Middeke; Jan-Niklas Eckardt; Tim Schmittmann; Sebastian Riechert; Michael Kramer; Anas Shekh Sulaiman; Katja Sockel; Frank Kroschinsky; Johannes Schetelig; Lisa Wagenführ; Ulrich Schuler; Uwe Platzbecker; Christian Thiede; Friedrich Stölzel; Christoph Röllig; Martin Bornhäuser
Journal:  BMC Cancer       Date:  2022-02-22       Impact factor: 4.430

Review 3.  A Review of Artificial Intelligence Applications in Hematology Management: Current Practices and Future Prospects.

Authors:  Yousra El Alaoui; Adel Elomri; Marwa Qaraqe; Regina Padmanabhan; Ruba Yasin Taha; Halima El Omri; Abdelfatteh El Omri; Omar Aboumarzouk
Journal:  J Med Internet Res       Date:  2022-07-12       Impact factor: 7.076

4.  Leukocyte cell population data from the blood cell analyzer as a predictive marker for severity of acute pancreatitis.

Authors:  Yihui Wang; Zhihong Xu; Yuhua Zhou; Mengqi Xie; Xing Qi; Zhiwei Xu; Qi Cai; Huiqiu Sheng; Erzhen Chen; Bing Zhao; Enqiang Mao
Journal:  J Clin Lab Anal       Date:  2021-06-01       Impact factor: 2.352

5.  Beyond the In-Practice CBC: The Research CBC Parameters-Driven Machine Learning Predictive Modeling for Early Differentiation among Leukemias.

Authors:  Rana Zeeshan Haider; Ikram Uddin Ujjan; Najeed Ahmed Khan; Eloisa Urrechaga; Tahir Sultan Shamsi
Journal:  Diagnostics (Basel)       Date:  2022-01-07

6.  A machine learning PROGRAM to identify COVID-19 and other diseases from hematology data.

Authors:  Patrick A Gladding; Zina Ayar; Kevin Smith; Prashant Patel; Julia Pearce; Shalini Puwakdandawa; Dianne Tarrant; Jon Atkinson; Elizabeth McChlery; Merit Hanna; Nick Gow; Hasan Bhally; Kerry Read; Prageeth Jayathissa; Jonathan Wallace; Sam Norton; Nick Kasabov; Cristian S Calude; Deborah Steel; Colin Mckenzie
Journal:  Future Sci OA       Date:  2021-06-12
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.