| Literature DB >> 26107374 |
Carolina Reta1, Leopoldo Altamirano1, Jesus A Gonzalez1, Raquel Diaz-Hernandez1, Hayde Peregrina1, Ivan Olmos2, Jose E Alonso3, Ruben Lobato3.
Abstract
Morphological identification of acute leukemia is a powerful tool used by hematologists to determine the family of such a disease. In some cases, experienced physicians are even able to determine the leukemia subtype of the sample. However, the identification process may have error rates up to 40% (when classifying acute leukemia subtypes) depending on the physician's experience and the sample quality. This problem raises the need to create automatic tools that provide hematologists with a second opinion during the classification process. Our research presents a contextual analysis methodology for the detection of acute leukemia subtypes from bone marrow cells images. We propose a cells separation algorithm to break up overlapped regions. In this phase, we achieved an average accuracy of 95% in the evaluation of the segmentation process. In a second phase, we extract descriptive features to the nucleus and cytoplasm obtained in the segmentation phase in order to classify leukemia families and subtypes. We finally created a decision algorithm that provides an automatic diagnosis for a patient. In our experiments, we achieved an overall accuracy of 92% in the supervised classification of acute leukemia families, 84% for the lymphoblastic subtypes, and 92% for the myeloblastic subtypes. Finally, we achieved accuracies of 95% in the diagnosis of leukemia families and 90% in the diagnosis of leukemia subtypes.Entities:
Mesh:
Year: 2015 PMID: 26107374 PMCID: PMC4479443 DOI: 10.1371/journal.pone.0130805
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Examples of images with a Wright stain preparation, but with color differences in cells and backgrounds.
Fig 2Blood smear in the CIE L*a*b* color space.
Fig 3Groups creation using similarity features from a channel intensity.
Fig 4Wold's decomposition texture model.
Fig 5Harmonic field parameterization.
Fig 6Generalized evanescent field parameterization.
Fig 7Procedure of MAP estimation for cell segmentation.
Fig 8Decision rules to identify nucleus.
Fig 9Decision rules to identify cells.
Fig 10Cell separation procedure.
Representative features for the cell description.
| No. | Feature | Type | Description |
|---|---|---|---|
| 1 | Area | Morphologic | Actual number of pixels in the ROI. |
| 2 | Perimeter | Morphologic | Distance between each adjoining pair of pixels around the border of the ROI. |
| 3 | Circularity | Morphologic | Complexity of the perimeter of a circular object. Perimeter2 / (4*Area*pi). |
| 4 | Width | Morphologic | Width of the smallest rectangle containing the ROI. |
| 5 | Length | Morphologic | Length of the smallest rectangle containing the ROI. |
| 6 | Elongation | Morphologic | Growth in one direction of the ROI. Length / Width. |
| 7 | Major axis length | Morphologic | Major axis of the ellipse containing the ROI. |
| 8 | Minor axis length | Morphologic | Minor axis of the ellipse containing the ROI. |
| 9 | Eccentricity | Morphologic | Ratio of the distance between the foci of the ellipse containing the ROI and its major axis length. |
| 10 | Extent | Morphologic | Proportion of pixels in the smallest rectangle that are also in the ROI. Area / (Width*Length). |
| 11 | Equivalent diameter | Morphologic | Diameter of the circle with the same area as the ROI. |
| 12 | Euler number | Morphologic | Number of objects in the region minus the number of holes in those objects. |
| 13 | Convex area | Morphologic | Area of the smallest convex polygon containing the ROI. |
| 14 | Solidity | Morphologic | Proportion of pixels in the convex hull that are also in the ROI. Area / Convex area. |
| 15 | Mode | Statistical | Most frequent value of the pixels intensity of the ROI. |
| 16 | Mean | Statistical | Average value of the pixels intensity of the ROI. |
| 17 | Standard deviation | Statistical | Standard deviation of the pixels intensity of the ROI. |
| 18 | Variance | Statistical | Variance value of the pixels intensity of the ROI. |
| 19 | Sum | Statistical | Sum of the pixels intensity of the ROI. |
| 20 | Homogeneity | Texture | Closeness of the distribution of elements in the gray-level co-occurrence matrix to its diagonal. |
| 21 | Contrast | Texture | Intensity contrast between a pixel and its neighbor over the whole image. |
| 22 | Correlation | Texture | Value that measures how correlated a pixel is to its neighbor over the whole image. |
| 23 | Energy | Texture | Sum of squared elements in the gray-level co-occurrence matrix. |
| 24 | Entropy | Texture | Value that measures the smoothness of the image in terms of gray-levels. |
| 25–34 | Eigen R | Eigen | Firsts 10 eigen values of the R channel of the RGB image. |
| 35–44 | Eigen G | Eigen | Firsts 10 eigen values of the G channel of the RGB image. |
| 45–54 | Eigen B | Eigen | Firsts 10 eigen values of the B channel of the RGB image. |
| 55–64 | Eigen gray | Eigen | Firsts 10 eigen values of the gray image. |
| 65 | N-Cyto area | Size ratio | Ratio of the area between the nucleus and the cytoplasm. Nucleus area / Cytoplasm area. |
| 66 | N-Cell area | Size ratio | Proportion of nucleus’ area in the cell’s area. Nucleus area / Cell area. |
| 67 | N-Cell perimeter | Size ratio | Ratio of the perimeter between the nucleus and the cell. Nucleus perimeter / Cell perimeter. |
Fig 11Classification process for acute leukemia subtypes.
Fig 12Cascade classification model for the diagnosis of acute leukemia.
Fig 13Classifiers fusion for the family and subtypes diagnosis of acute leukemia.
Fig 14Diagnosis algorithm of types and subtypes of leukemia.
Samples by subtypes.
| Family | Subtype | No. Samples |
|---|---|---|
|
|
| |
| L1 | 102 | |
| L2 | 135 | |
| No subtype | 58 | |
|
|
| |
| M2 | 95 | |
| M3 | 47 | |
| M5 | 56 | |
| No subtype | 140 |
Patients by subtypes.
| Family | Subtype | No. Samples |
|---|---|---|
|
|
| |
| L1 | 14 | |
| L2 | 15 | |
| No subtype | 5 | |
|
|
| |
| M2 | 6 | |
| M3 | 3 | |
| M5 | 5 | |
| No subtype | 15 |
Evaluation of the cells segmentation algorithm.
| Precision | FP Rate | FN Rate | |
|---|---|---|---|
|
| 95.87% | 4.13% | 2.33% |
|
| 95.75% | 3.16% | 3.83% |
Fig 15Examples of images segmentation with different staining and cell population.
Results of the best classifier for each type or subtype of acute leukemia cells.
| Classification Problem | Classifier | Features | Prec. % | TPR | TNR | AUC |
|---|---|---|---|---|---|---|
|
| SMO | N&C | 92.20 | 0.920 | 0.924 | 0.921 |
|
| SL | C | 81.32 | 0.822 | 0.803 | 0.899 |
|
| IBk | N&C | 84.40 | 0.835 | 0.853 | 0.907 |
|
| SL | C | 76.78 | 0.667 | 0.845 | 0.814 |
|
| RC.RF | N&C | 92.45 | 0.883 | 0.962 | 0.959 |
|
| RC.RF | C | 74.24 | 0.706 | 0.778 | 0.805 |
|
| IBk | N&C | 91.89 | 0.805 | 0.955 | 0.880 |
|
| RC.RF | C | 80.79 | 0.390 | 0.940 | 0.788 |
|
| IBk | N&C | 91.89 | 0.870 | 0.938 | 0.955 |
|
| RF | C | 84.37 | 0.730 | 0.890 | 0.866 |
|
| RC.RF | N&C | 88.39 | 0.904 | 0.894 | 0.945 |
|
| RC.RF | C | 66.63 | 0.801 | 0.612 | 0.784 |
*For the Features column N stands for Nucleus and C for Cytoplasm.
Evaluation of the acute leukemia diagnosis algorithm by merging binary classifiers results.
| Classification | No. examples | Correct | Not determined | Failed |
|---|---|---|---|---|
|
|
|
|
|
|
| ALL | 34 | 88.2352% | 5.8824% | 5.8824% |
| AML | 29 | 96.5517% | 3.4483% | 0% |
|
|
|
|
|
|
| L1 | 15 | 80% | 13.3333% | 6.6667% |
| L2 | 14 | 78.5714% | 14.2857% | 7.1429% |
| M2 | 6 | 83.3333% | 16.6667% | 0% |
| M3 | 3 | 66.6667% | 33.3333% | 0% |
| M5 | 5 | 100% | 0% | 0% |
Evaluation of the acute leukemia diagnosis algorithm by merging multi-class classifiers results.
| Classification | No. examples | Correct | Not determined | Failed |
|---|---|---|---|---|
|
|
|
|
|
|
| ALL | 34 | 94.1176% | 2.9412% | 2.9412% |
| AML | 29 | 96.5517% | 3.4483% | 0.0000% |
|
|
|
|
|
|
| L1 | 15 | 86.6667% | 6.6667% | 6.6667% |
| L2 | 14 | 85.7143% | 7.1429% | 7.1429% |
| M2 | 6 | 100% | 0% | 0% |
| M3 | 3 | 100% | 0% | 0% |
| M5 | 5 | 100% | 0% | 0% |