| Literature DB >> 32377437 |
Omer F Akmese1, Gul Dogan2, Hakan Kor3, Hasan Erbay4, Emre Demir5.
Abstract
Acute appendicitis is one of the most common emergency diseases in general surgery clinics. It is more common, especially between the ages of 10 and 30 years. Additionally, approximately 7% of the entire population is diagnosed with acute appendicitis at some time in their lives and requires surgery. The study aims to develop an easy, fast, and accurate estimation method for early acute appendicitis diagnosis using machine learning algorithms. Retrospective clinical records were analyzed with predictive data mining models. The predictive success of the models obtained by various machine learning algorithms was compared. A total of 595 clinical records were used in the study, including 348 males (58.49%) and 247 females (41.51%). It was found that the gradient boosted trees algorithm achieves the best success with an accurate prediction success of 95.31%. In this study, an estimation method based on machine learning was developed to identify individuals with acute appendicitis. It is thought that this method will benefit patients with signs of appendicitis, especially in emergency departments in hospitals.Entities:
Year: 2020 PMID: 32377437 PMCID: PMC7196991 DOI: 10.1155/2020/7306435
Source DB: PubMed Journal: Emerg Med Int ISSN: 2090-2840 Impact factor: 1.112
Dataset description.
| Name | Type | Description | Role |
|---|---|---|---|
| Group | Categorical | No surgery/having surgery | Target |
| Gender | Categorical | Female/male | Input |
| HGB | Numeric | Hemoglobin | Input |
| NEU | Numeric | Neutrophil | Input |
| LYM | Numeric | Lymphocytes | Input |
| MCV | Numeric | Mean corpuscular volume | Input |
| MPV | Numeric | Mean platelet volume | Input |
| HTC | Numeric | Hematocrit | Input |
| PLT | Numeric | Thrombosis | Input |
| CRP | Numeric | C-reactive protein | Input |
| WBC | Numeric | White blood cells (leukocytes) | Input |
Categorical data represent types of data which may be divided into groups. Numerical data are data expressed in numbers, unlike letters or words. Target: dependent variable; input: independent variable.
Blood test results in acute appendicitis dataset.
| Name | Value range | Having surgery | No surgery |
|
|---|---|---|---|---|
| Group | 0 or 1 | |||
| Gender | 1 or 2 | 77/137 | 104/110 | 0.008 |
| HGB | 1.4–130 | 13.63 ± 8.18 | 13.02 ± 1.43 | 0.348 |
| NEU | 0.8–29 | 11.82 ± 5.31 | 9.11 ± 5.77 |
|
| LYM | 0.2–94 | 3.09 ± 7.09 | 2.44 ± 1.17 | 0.016 |
| MCV | 4–97.3 | 80.98 ± 8.76 | 81.20 ± 5.79 | 0.434 |
| MPV | 6–99 | 14.03 ± 20.02 | 9.31 ± 7.93 | 0.708 |
| HTC | 3–99 | 40.32 ± 6.24 | 39.69 ± 3.49 | 0.143 |
| PLT | 111–593 | 272.49 ± 73.58 | 274.37 ± 82.73 | 0.954 |
| CRP | 0–302 | 39.61 ± 51.62 | 22.86 ± 40.64 |
|
| WBC | 6–31590 | 15280 ± 5302 | 12541 ± 6259 |
|
Group: 0 refers to those without surgery and 1 refers to those who were operated on. Gender variable: 1 refers to females and 2 refers to males. HGB: hemoglobin, NEU: neutrophil, LYM: lymphocytes, MCV: mean corpuscular volume, MPV: mean platelet volume, HTC: hematocrit, PLT: thrombosis, CRP: C-reactive protein, and WBC: white blood cells (leukocytes). p values < 0.05 are statistically significant. Bold denotes a significant p value less than 0.05.
Attributes and weighting.
| No | Attributes | Weighting (chi-square) | |
|---|---|---|---|
| 1 | NEU | 1 | |
| 2 | WBC | 0.994 | |
| 3 | CRP | 0.436 | |
| 4 | MPV | 0.201 | |
| 5 | LYM | 0.120 | |
| 6 | HTC | 0.119 | |
| 7 | Gender | 0.079 | |
| 8 | MCV | 0.075 | |
| 9 | MPV | 0.049 | |
| 10 | HB | 0 | |
HGB: hemoglobin, NEU: neutrophil, LYM: lymphocytes, MCV: mean corpuscular volume, MPV: mean platelet volume, HTC: hematocrit, PLT: thrombosis, CRP: C-reactive protein, and WBC: white blood cells (leukocytes).
Figure 1The architecture of the proposed system.
Figure 2Graph of those who underwent surgery according to gender after data preprocessing.
Figure 3Accuracy percentages of algorithms.
Figure 4Gradient boosting tree prediction.
Results of gradient boosting tree analysis.
| Accuracy: 95.31% | True 1 | True 0 | Total | Class precision (%) |
|---|---|---|---|---|
| Pred. 1 | 55 (TP) | 2 (FP) | 57 (P′) | 96.49 |
| Pred. 0 | 4 (FN) | 67 (TN) | 71 (N′) | 94.36 |
| Total | 59 (P) | 69 (N) | 128 (P + N) | |
| Class recall | 93.22% | 97.10% |
TP: true positives, TN: true negatives, FN: false negatives, and FP: false positives. Precision: it is the ratio of correctly predicted positive samples to the number of samples estimated in the positive class. Recall: it is the ratio of correctly predicted positive samples the ratio to the number of samples in the true positive class.