| Literature DB >> 34322636 |
Sonia Darvishi1, Omid Hamidi2, Jalal Poorolajal3.
Abstract
INTRODUCTION: Hamedan Province is one of Iran's high-risk regions for Multiple Sclerosis (MS). Early diagnosis of MS based on an accurate system can control the disease. The aim of this study was to compare the performance of four machine learning techniques with traditional methods for predicting MS patients.Entities:
Keywords: Classification; Machine learning; Multiple sclerosis
Mesh:
Year: 2021 PMID: 34322636 PMCID: PMC8283630 DOI: 10.15167/2421-4248/jpmh2021.62.1.1651
Source DB: PubMed Journal: J Prev Med Hyg ISSN: 1121-2233
demographic and clinical characteristics of the case and control groups.
| Variable | Cases (%) | Controls (%) | P-value |
|---|---|---|---|
| Male | 20(20) | 52(52) | 0.000 |
| Female | 80(80) | 48(48) | |
| Single | 25(25) | 19(19) | 0.3 |
| Married | 75(75) | 81(81) | |
| Non-academic | 63(63) | 72(72) | 0.1 |
| Academic | 37(37) | 28(28) | |
| No | 90(90) | 94(94) | 0.2 |
| Yes | 10(10) | 6(6) | |
| Non-smoker | 94(94) | 73(73) | 0.000 |
| Smoker | 6(6) | 27(27) | |
| Non-breast feeding | 22(22) | 6(6) | 0.001 |
| Breast feeding | 78(78) | 94(94) | |
| No | 64(64) | 50(50) | 0.04 |
| Yes | 36(36) | 50(50) | |
| Spring | 23(23) | 33(33) | 0.08 |
| Summer | 27(27) | 31(31) | |
| Autumn | 29(29) | 15(15) | |
| Winter | 21(21) | 21(21) | |
| AB | 5(5) | 14(14) | 0.3 |
| A | 21(21) | 22(22) | |
| B | 20(20) | 24(24) | |
| O | 25(25) | 40(40) | |
| Negative | 13(13) | 24(24) | 0.2 |
| Positive | 60(60) | 73(73) | |
| Underweight | 2(2) | 4(4) | 0.6 |
| Normal weight | 32(32) | 33(33) | |
| Overweight & obesity | 32(32) | 43(43) | |
| B | 30(30) | 40(40) | 0.1 |
| A | 70(70) | 60(60) | |
Mean and standard deviation of sensitivity, specificity, PPV, NPV, total accuracy, positive LR and negative LR for various models.
| Scenario | Models | Sensitivity | Specificity | PPV | NPV | TA | LR+ | LR- | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Mean | Std. | Mean | Std. | Mean | Std. | Mean | Std. | Mean | Std. | Mean | Std.dv | Mean | Std.dv | ||
| 70, 30 | NB | 0.79 | 0.10 | 0.55 | 0.13 | 0.64 | 0.08 | 0.74 | 0.10 | 0.67 | 0.05 | 1.92 | 0.54 | 0.35 | 0.14 |
| LSSVM | 0.61 | 0.09 | 0.67 | 0.09 | 0.65 | 0.08 | 0.64 | 0.07 | 0.64 | 0.05 | 2.06 | 0.86 | 0.51 | 0.01 | |
| RF | 0.71 | 0.08 | 0.67 | 0.08 | 0.68 | 0.07 | 0.70 | 0.08 | 0.68 | 0.04 | 2.06 | 0.61 | 0.51 | 0.11 | |
| SVM | 0.72 | 0.89 | 0.64 | 0.1 | 0.67 | 0.08 | 0.70 | 0.09 | 0.68 | 0.05 | 2.06 | 0.65 | 0.51 | 0.13 | |
| LR | 0.67 | 0.08 | 0.65 | 0.1 | 0.66 | 0.08 | 0.66 | 0.09 | 0.66 | 0.06 | 2.06 | 0.64 | 0.51 | 0.15 | |
| LDA | 0.68 | 0.08 | 0.64 | 0.10 | 0.66 | 0.08 | 0.66 | 0.09 | 0.55 | 0.06 | 2.06 | 0.66 | 0.51 | 0.16 | |
| 50, 50 | NB | 0.77 | 0.15 | 0.56 | 0.15 | 0.64 | 0.07 | 0.74 | 0.10 | 0.66 | 0.05 | 1.90 | 0.45 | 0.37 | 0.17 |
| LSSVM | 0.62 | 0.10 | 0.63 | 0.10 | 0.63 | 0.07 | 0.63 | 0.06 | 0.63 | 0.04 | 1.90 | 0.46 | 0.51 | 0.13 | |
| RF | 0.71 | 0.09 | 0.57 | 0.09 | 0.68 | 0.07 | 0.70 | 0.07 | 0.68 | 0.04 | 1.90 | 0.43 | 0.51 | 0.11 | |
| SVM | 0.69 | 0.10 | 0.63 | 0.1 | 0.65 | 0.06 | 0.68 | 0.08 | 0.66 | 0.04 | 1.90 | 0.42 | 0.51 | 0.13 | |
| LR | 0.67 | 0.09 | 0.63 | 0.08 | 0.65 | 0.06 | 0.66 | 0.07 | 0.66 | 0.04 | 1.90 | 0.65 | 0.51 | 0.11 | |
| LDA | 0.68 | 0.09 | 0.63 | 0.09 | 0.64 | 0.06 | 0.66 | 0.07 | 0.65 | 0.04 | 1.90 | 0.50 | 0.51 | 0.12 | |
Fig. 1.Variable importance in predicting MS disease using RF model.
Fig. 2.Partial plots for variables in predicting MS using RF.