| Literature DB >> 24683415 |
Sarawut Saichanma1, Sucha Chulsomlee1, Nonthaya Thangrua1, Pornsuri Pongsuchart1, Duangmanee Sanmun1.
Abstract
It is undeniable that laboratory information is important in healthcare in many ways such as management, planning, and quality improvement. Laboratory diagnosis and laboratory results from each patient are organized from every treatment. These data are useful for retrospective study exploring a relationship between laboratory results and diseases. By doing so, it increases efficiency in diagnosis and quality in laboratory report. Our study will utilize J48 algorithm, a data mining technique to predict abnormality in peripheral blood smear from 1,362 students by using 13 data set of hematological parameters gathered from automated blood cell counter. We found that the decision tree which is created from the algorithm can be used as a practical guideline for RBC morphology prediction by using 4 hematological parameters (MCV, MCH, Hct, and RBC). The average prediction of RBC morphology has true positive, false positive, precision, recall, and accuracy of 0.940, 0.050, 0.945, 0.940, and 0.943, respectively. A newly found paradigm in managing medical laboratory information will be helpful in organizing, researching, and assisting correlation in multiple disciplinary other than medical science which will eventually lead to an improvement in quality of test results and more accurate diagnosis.Entities:
Year: 2014 PMID: 24683415 PMCID: PMC3943190 DOI: 10.1155/2014/493706
Source DB: PubMed Journal: Adv Hematol
The data set in this study.
| Number | Code | Description | Domain |
|---|---|---|---|
| 1 | Sex | Sex | Male, female |
| 2 | WBC | White blood cell count (cell/uL) | Integer |
| 3 | RBC | Red blood cell count | Integer |
| 4 | Hb | Hemoglobin (g/dL) | Integer |
| 5 | Hct | Hematocrit (%) | Integer |
| 6 | MCV | Mean corpuscular volume (fL) | Integer |
| 7 | MCH | Mean corpuscular hemoglobin (pg) | Integer |
| 8 | MCHC | Mean corpuscular hemoglobin concentration (g/dL) | Integer |
| 9 | PLT | Platelet count (cell/uL) | Integer |
| 10 | NEU | Neutrophil count (%) | Integer |
| 11 | LYMP | Lymphocyte count (%) | Integer |
| 12 | MONO | Monocyte count (%) | Integer |
| 13 | EO | Eosinophil count (%) | Integer |
| 14 | BASO | Basophil count (%) | Integer |
| 15 | RBC morphology | Red blood cell morphology | Normal, abnormal |
The hematological parameters which were categorized by RBC morphology and sex.
| Hematological parameters | Female | Male | ||||
|---|---|---|---|---|---|---|
| Abnormal blood smear (mean ± SD) | Normal blood smear (mean ± SD) | Significant ( | Abnormal blood smear (mean ± SD) | Normal blood smear (mean ± SD) | Significant ( | |
| WBC (×103cell/µL) | 7.87 ± 1.85 | 7.62 ± 1734.89 | S | 8.27 ± 2.07 | 7.31 ± 1.66 | S** |
| RBC (cell/µL) | 5.28 ± 0.56 | 4.73 ± 0.36 | S | 6.41 ± 0.58 | 5.45 ± 0.38 | S |
| Hb (mg/dL) | 11.64 ± 1.22 | 13.02 ± 0.83 | S | 13.86 ± 1.11 | 15.34 ± 0.86 | S |
| Hct (%) | 36.18 ± 3.33 | 40.03 ± 2.41 | S | 42.66 ± 2.89 | 46.17 ± 2.35 | S |
| MCV (fL) | 69.09 ± 7.45 | 84.9 ± 4.93 | S | 66.98 ± 6.55 | 84.88 ± 4.53 | S |
| MCH (pg) | 22.22 ± 2.56 | 27.62 ± 1.78 | S | 21.76 ± 2.41 | 28.21 ± 1.64 | S |
| MCHC (g/dL) | 32.16 ± 1.36 | 32.53 ± 0.76 | S | 32.48 ± 1.32 | 33.23 ± 0.72 | S |
| PLT (×103 cell/µL) | 313.85 ± 69.64 | 273.12 ± 54.92 | S | 282.08 ± 61.59 | 253.15 ± 47.35 | S |
| NEU (%) | 57.09 ± 8.23 | 57.22 ± 8.06 | NS | 59.67 ± 9.16 | 57.54 ± 7.85 | NS* |
| LYMP (%) | 37.57 ± 7.58 | 37.45 ± 7.41 | NS | 34.89 ± 8.26 | 36.83 ± 6.98 | NS |
| MONO (%) | 2.96 ± 1.03 | 2.99 ± 1.05 | NS | 2.67 ± 0.93 | 2.88 ± 1.06 | NS |
| EO (%) | 2.18 ± 1.95 | 2.12 ± 1.93 | NS | 2.51 ± 1.9 | 2.49 ± 2.45 | NS |
| BASO (%) | 0.2 ± 0.4 | 0.22 ± 0.41 | NS | 0.27 ± 0.45 | 0.26 ± 0.44 | NS |
|
| ||||||
| Total | 318 | 784 | 36 | 224 | ||
*NS: no statistically significant; **S: statistically significant.
The confusion matrix of predicted RBC morphology by J48 algorithm.
| Actual class from manual RBC morphology report | Predicted class from J48 model | |
|---|---|---|
| Abnormal | Normal | |
| Abnormal | TP (338) | FN (16) |
| Normal | FP (66) | TN (942) |
The performance evaluation of predicted normal and abnormal RBC morphology.
| Class | Abnormal | Normal | Average |
|---|---|---|---|
| TP | 0.955 | 0.935 | 0.940 |
| FP | 0.065 | 0.045 | 0.050 |
| Precision | 0.837 | 0.983 | 0.945 |
| Recall | 0.955 | 0.935 | 0.940 |
|
| 0.892 | 0.958 | 0.941 |
| Accuracy | 0.943 | 0.943 | 0.943 |
Figure 1The decision tree from J48 algorithm in predicting RBC morphology. If MCV is less than or equal to 78.3 fL, RBC is labeled as abnormal RBC morphology but if MCV is more than 78.3 fL and MCH is more than 25.2 pg, RBC is still labeled as normal. And if MCH is less than or equal to 25.2 pg but Hct is greater than 37.1%, RBC is normal. On the other hand, if Hct is less than or equal to 36.7%, RBC is abnormal. But if Hct is between 36.7 and 37.1% and RBC is less than or equal to 4.7 × 1012 cell/L, it will most likely be normal RBC morphology. However, if RBC is more than 4.7 × 1012 cell/L, it is abnormal RBC morphology.