| Literature DB >> 34054342 |
Fumin Xu1, Xiao Chen2, Chenwenya Li3, Jing Liu4, Qiu Qiu5, Mi He4, Jingjing Xiao6, Zhihui Liu7, Bingjun Ji8, Dongfeng Chen1, Kaijun Liu1.
Abstract
BACKGROUND: Multiple organ failure (MOF) may lead to an increased mortality rate of moderately severe (MSAP) or severe acute pancreatitis (SAP). This study is aimed to use machine learning to predict the risk of MOF in the course of disease.Entities:
Mesh:
Year: 2021 PMID: 34054342 PMCID: PMC8112913 DOI: 10.1155/2021/5525118
Source DB: PubMed Journal: Mediators Inflamm ISSN: 0962-9351 Impact factor: 4.711
The optimal feature subset of each machine learning method.
| Features | TG | HDL | LDL | PT | APTT | TT | INR | FIB | R-time | K-time |
| MA | IL-6 | PCT | BUN | Creatinine | K+ | Na+ | AUC |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LR | √ | √ | √ | √ | √ | √ | √ | √ | √ | √ | √ | √ | 0.8401 | ||||||
| QDA | √ | √ | √ | √ | √ | √ | √ | √ | 0.8653 | ||||||||||
| NB | √ | √ | √ | √ | √ | √ | √ | 0.8646 | |||||||||||
| SVM | √ | √ | √ | √ | √ | √ | √ | √ | 0.8390 | ||||||||||
| AdaBoost | √ | √ | √ | √ | √ | √ | √ | √ | 0.8629 | ||||||||||
| BP | √ | √ | √ | √ | √ | √ | √ | √ | 0.8616 |
Abbreviations: TG: triglyceride; HDL: high-density lipoprotein; LDL: low-density lipoprotein; PT: prothrombin time; APTT: activated partial thromboplastin time; TT: thrombin time; INR: international normalized ratio; FIB: fibrinogen; R-time: reaction time; K-time: kinetic time; α: alpha angle; MA: maximum amplitude; IL-6: interleukin-6; PCT: procalcitonin; BUN: blood urea nitrogen; K+: potassium; Na+: sodium.
Figure 1The ROC curves of different models in the validation set. (a) LR. (b) QDA. (c) NB. (d) SVM. (e) AdaBoost. (f) BP.
Comparison of the predictive performance of different models in optimal feature subset in validation set.
| Variable | LR (95% CI) | QDA (95% CI) | NB (95% CI) | SVM (95% CI) | AdaBoost (95% CI) | BP (95% CI) |
|
|---|---|---|---|---|---|---|---|
| SEN | 65.4% (55.2-74.5%)a,c,d,e | 83.2% (75.7-90.6%)b | 81.2% (73.4-88.9%)b | 71.3% (62.3-80.3%) | 80.2% (72.3-88.1%)b | 83.2% (75.7-90.6%)b |
|
| SPE | 90.0% (85.4-93.6%) | 77.4% (71.5-82.4%) | 78.3% (72.4-83.2%) | 83.9% (78.6-88.3%) | 80.4% (75.3-85.6%) | 76.5% (71.0-82.0%) |
|
| FPR | 10.0% (6.4-14.9%) | 22.6% (17.6-28.5%) | 21.7% (16.8-27.6%) | 16.1% (11.7-21.4%) | 19.57% (14.4-24.7%) | 23.5% (18.0-29.0%) | 0.002 |
| FNR | 35.6% (25.5-46.8%)a,c,d,e | 16.8% (9.4-24.3%)b | 9.1% (11.1-26.6%)b | 28.7% (10.7-37.7%) | 19.8% (11.9-27.7%)b | 26.8% (9.4-24.3%)b | 0.008 |
| PPV | 73.3% (64.0-82.6%) | 61.3% (53.1-69.6%)a | 61.7% (53.3-70.0%)b,c | 65.5% (56.4-74.5%)a | 64.3% (55.8-72.8%) | 60.9% (52.6-69.1%) | 0.437 |
| NPV | 85.5% (81.0-90.0%) | 91.2% (87.2-95.3%) | 90.4% (86.3-94.5%) | 86.9% (82.4-91.4%) | 90.2% (86.1-94.3%) | 91.2% (87.2-95.2%) | 0.239 |
| Accuracy | 82.2% (78.0-86.3%) | 78.9% (74.4-83.3%) | 78.9% (74.4-83.3%) | 79.8% (75.4-81.4%) | 80.4% (76.1-84.7%) | 78.5% (74.1-83.0%) | 0.862 |
| AUC | 0.840 (0.796-0.878) | 0.865 (0.824-0.900) | 0.864 (0.823-0.899) | 0.839 (0.795-0.877) | 0.863 (0.821-0.898) | 0.862 (0.820-0.897) | / |
aCompared with QDA, p < 0.05; bCompared with LR, p < 0.05; cCompared with NB, p < 0.05; dCompared with AdaBoost, p < 0.05; eCompared with BP, p < 0.05. p value denoted the overall statistical result for the four models.
Figure 2The importance of each feature in optimal feature subset in the validation set. (a) LR. (b) QDA. (c) NB. (d) SVM. (e) AdaBoost. (f) BP.
Comparison of the predictive performance of different models in optimal feature subset in test set.
| Variable | LR (95%CI) | QDA (95%CI) | NB (95%CI) | SVM (95%CI) | Adaboost (95%CI) | BP (95%CI) |
|
|---|---|---|---|---|---|---|---|
| SEN | 58.54% (42.20%-73.30%) | 60.98% (44.54%-75.38%) | 73.17% (56.69%-85.25%) | 60.98% (44.54%-75.38%) | 80.49% (64.63%-90.63%) | 75.61% (59.36%-87.09%) | 0.15 |
| SPE | 93.33% (84.47%-95.52%)a,c,d,e | 86.67% (76.39%-93.08%)d | 76.00% (64.50%-84.79%)b | 89.33% (79.54%-94.95%)d | 73.33% (61.66%-82.58%)a,b,f | 74.67% (63.08%-83.69%)a,b |
|
| FPR | 6.67% (1.02%-12.32%)a,c,d,e | 13.33% (5.64%-21.03%) d | 24.00% (14.33%-33.67%) b | 10.67% (3.68%-17.66%) d | 26.67% (16.66%-36.67%) a,b,f | 25.33% (15.49%-35.17%) a,b |
|
| FNR | 41.46% (26.38%-56.54%) | 39.02% (24.09%-77.80%) | 26.83% (13.27%-40.39%) | 39.02% (24.09%-77.80%) | 19.51% (7.38%-31.64%) | 24.39% (11.25%-37.53%) | 0.15 |
| PPV | 82.76% (63.51%-93.47%) | 71.43% (53.48%-84.76%) | 62.50% (47.33%-75.68%) | 75.76% (57.37%-88.26%) | 62.26% (47.87%-74.88%) | 62.00% (47.16%-75.00%) | 0.281 |
| NPV | 93.33% (84.47%-97.52%) | 80.25% (69.61%-87.95%) | 83.82% (72.47%-91.27%) | 80.72% (70.29%-88.25%) | 87.30% (75.96%-93.97%) | 84.85% (73.44%-92.11%) | 0.87 |
| Accuracy | 80.3% (73.0-87.7%) | 78.5% (71.1-85.9%) | 75.0% (67.0-83.0%) | 79.3% (71.8-86.8%) | 75.9% (68.0-83.8%) | 75.0% (67.0-83.0%) | 0.831 |
| AUC | 0.782 (0.694-0.853) | 0.785 (0.686-0.848) | 0.779 (0.688-0.849) | 0.772 (0.679-0.842) | 0.826 (0.740-0.888) | 0.805 (0.714-0.869) | / |
aCompared with QDA, p < 0.05; bCompared with LR, p < 0.05; cCompared with NB, p < 0.05; dCompared with AdaBoost, p < 0.05; eCompared with BP, p < 0.05; fCompared with SVM, p < 0.05. p value denoted the overall statistical result for the four models.
Figure 3The ROC curves of optimal feature set of different models in test set.