| Literature DB >> 33051513 |
Davide Chicco1, Giuseppe Jurman2.
Abstract
Sepsis is a life-threatening condition caused by an exaggerated reaction of the body to an infection, that leads to organ failure or even death. Since sepsis can kill a patient even in just one hour, survival prediction is an urgent priority among the medical community: even if laboratory tests and hospital analyses can provide insightful information about the patient, in fact, they might not come in time to allow medical doctors to recognize an immediate death risk and treat it properly. In this context, machine learning can be useful to predict survival of patients within minutes, especially when applied to few medical features easily retrievable. In this study, we show that it is possible to achieve this goal by applying computational intelligence algorithms to three features of patients with sepsis, recorded at hospital admission: sex, age, and septic episode number. We applied several data mining methods to a cohort of 110,204 admissions of patients, and obtained high prediction scores both on this complete dataset (top precision-recall area under the curve PR AUC = 0.966) and on its subset related to the recent Sepsis-3 definition (top PR AUC = 0.860). Additionally, we tested our models on an external validation cohort of 137 patients, and achieved good results in this case too (top PR AUC = 0.863), confirming the generalizability of our approach. Our results can have a huge impact on clinical settings, allowing physicians to forecast the survival of patients by sex, age, and septic episode number alone.Entities:
Mesh:
Year: 2020 PMID: 33051513 PMCID: PMC7555553 DOI: 10.1038/s41598-020-73558-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Meanings, measurement units, and intervals of each feature of the dataset.
| Feature | Explanation | Measurement | Range |
|---|---|---|---|
| Age | Age of the patient at the hospital stay | Years | [0, ..., 100] |
| Episode number | Number of septic episodes experienced by the patient | Integer | [1, ..., 5] |
| Sex | 0: male; 1: female | Binary | 0, 1 |
| [Target] survival | 0: dead; 1: alive | Boolean | 0, 1 |
Ranges refer both to the primary cohort and the study cohort. We used survival as prediction the target in this manuscript.
Statistical quantitative description of the category features.
| Category feature | Primary cohort | Study cohort | ||
|---|---|---|---|---|
| # | % | # | % | |
| Survival (0: dead) | 8105 | 7.35 | 3606 | 18.93 |
| Survival (1: alive) | 102,099 | 92.65 | 15,445 | 81.07 |
| Sex (0: male) | 57,973 | 52.61 | 10,505 | 55.14 |
| Sex (1: female) | 52,231 | 47.39 | 8546 | 44.86 |
#: Number of admissions. %: percentage of admissions. Primary cohort full sample: 110,204 admissions. Study cohort full sample: 19,051 admissions.
Statistical quantitative description of the numeric features.
| Numeric feature | Primary cohort | Study cohort | ||||
|---|---|---|---|---|---|---|
| Median | Mean | Median | Mean | |||
| Age | 68 | 62.74 | 24.13 | 77 | 72.50 | 18.61 |
| Episode number | 1 | 1.35 | 0.75 | 1 | 1.40 | 0.75 |
Primary cohort full sample: 110,204 admissions. Study cohort full sample: 19,051 admissions. : standard deviation.
Figure 1Primary cohort: stacked barplots of the distribution of categories. Distribution of sepsis episode number and sex of the admissions of patients who deceased (left) and survived (right). Admissions of survived patients: positives data instances (class 1). Admissions of deceased patients: negative data instances (class 0).
Figure 2Primary cohort: histograms of the patients’ ages in relation with the number of admissions. On the left, the admissions of the patients who deceased. On the right, the admissions of patients who survived. Admissions of survived patients: positives data instances (class 1). Admissions of deceased patients: negative data instances (class 0).
Figure 3Study cohort: stacked barplots of the distribution of categories. Distribution of sepsis episode number and sex of the admissions of patients who deceased (left) and survived (right). Admissions of survived patients: positives data instances (class 1). Admissions of deceased patients: negative data instances (class 0).
Figure 4Study cohort: histograms of the patients’ ages in relation with the number of admissions. On the left, the admissions of the patients who deceased. On the right, the admissions of patients who survived. Admissions of survived patients: positives data instances (class 1). Admissions of deceased patients: negative data instances (class 0).
Results of the application of univariate biostatistics tests between each feature and the survival target, in the primary cohort.
| Feature | Mann–Whitney | Chi-squared |
|---|---|---|
| Test | Test | |
| Age | ||
| Episode number | ||
| Sex |
Mann–Whitney test p-value: probability value generated by the application of the Mann–Whitney U test to the corresponding feature and survival. chi-squared test p-value: probability value generated by the application of the chi-squared test to sex and survival. We reported the features in alphabetical order.
Results of the survival prediction made with machine learning classifiers, with training phase and testing phase done on the Norwegian primary cohort or study cohort[87].
| Method | PR AUC | ROC AUC | TP rate | TN rate | PPV | NPV | MCC | accuracy | |
|---|---|---|---|---|---|---|---|---|---|
| Radial SVM | 0.492 | 0.807 | 0.112 | 0.652 | 0.515 | ||||
| Gradient boosting | 0.690 | 0.179 | 0.934 | 0.126 | + 0.061 | ||||
| Naïve Bayes | 0.954 | 0.649 | 0.553 | 0.745 | 0.965 | 0.117 | + 0.156 | 0.703 | 0.567 |
| Linear regression | 0.941 | 0.599 | 0.836 | 0.361 | 0.943 | 0.149 | + 0.135 | 0.886 | 0.801 |
| Linear SVM | 0.860 | 0.586 | 0.205 | 0.896 | + 0.104 | 0.333 | 0.337 | ||
| Linear SVM | 0.205 | 0.210 | 0.333 | 0.337 | |||||
| Radial SVM | 0.858 | 0.408 | 0.718 | 0.861 | 0.222 | + 0.102 | 0.553 | 0.467 | |
| Gradient boosting | 0.856 | 0.574 | 0.208 | 0.822 | 0.231 | + 0.038 | |||
| Naïve Bayes | 0.841 | 0.562 | 0.405 | 0.718 | 0.861 | 0.220 | + 0.100 | 0.551 | 0.465 |
| Linear regression | 0.826 | 0.541 | 0.764 | 0.318 | 0.828 | + 0.074 | 0.794 | 0.679 | |
Mean results of 100 executions with random selection of the elements in the training set and test set, with ROSE oversampling[97] applied to the training set. Admissions of survived patients: positives data instances (class 1). Admissions of deceased patients: negative data instances (class 0). Linear SVM: support vector machine with linear kernel. Optimized cost regularization hyper-parameter of the linear SVM, most frequently selected C by the MCC-based grid search: for primary cohort (63 times out of 100) and for study cohort (51 times out of 100). Radial SVM: support vector machine with radial Gaussian kernel. Optimized cost regularization of the radial SVM, most frequently selected C by the MCC-based grid search: for the primary cohort (56 times out of 100) and for the study cohort (51 times out of 100). MCC: Matthews correlation coefficient. MCC worst value and best value . TP rate: true positive rate, sensitivity, recall. TN rate: true negative rate, specificity. PR: precision-recall curve. PPV: positive predictive value, precision. NPV: negative predictive value. ROC: receiver operating characteristic curve. AUC: area under the curve. score, accuracy, TP rate, TN rate, PPV, NPV, PR AUC, ROC AUC: worst value and best value . We report the formulas of these rates in the Supplementary Information. ROSE minority class probability: for SVMs; for gradient boosting, naïve Bayes, and linear regression in the primary cohort; for gradient boosting, naïve Bayes, and linear regression in the study cohort. We highlighted in italic and with an asterisk * the top result for each statistical indicator. We report the mean scores with the standard deviations in Supplementary Table S1.
Results of the survival prediction made with machine learning classifiers on the South Korean external validation cohort[96].
| Training and testing on the validation cohort | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Method | PR AUC | ROC AUC | TP rate | TN rate | PPV | NPV | MCC | accuracy | |
| Linear SVM | 0.676 | 0.911 | 0.388 | 0.873 | 0.490 | + 0.309 | 0.889 | 0.818 | |
| Naïve Bayes | 0.887 | 0.899 | 0.538 | 0.893 | |||||
| Gradient boosting | 0.883 | 0.682 | 0.912 | 0.448 | 0.885 | + 0.378 | |||
| Linear regression | 0.880 | 0.689 | 0.849 | 0.530 | 0.885 | 0.458 | + 0.350 | 0.863 | 0.788 |
| Radial SVM | 0.873 | 0.642 | 0.226 | 0.849 | 0.465 | + 0.179 | 0.883 | 0.806 | |
Mean results of 100 executions with random selection of the elements in the training set and test set, with ROSE oversampling[97] applied to the training set. In this analysis, both the training phase and the testing phase happened on the validation cohort. : standard deviation. Admissions of survived patients: positives data instances (class 1). Admissions of deceased patients: negative data instances (class 0). Linear SVM: support vector machine with linear kernel. Optimized cost regularization hyper-parameter of the linear SVM, most frequently selected C by the MCC-based grid search: (59 times out of 100). Radial SVM: support vector machine with radial Gaussian kernel. Optimized cost regularization of the radial SVM, most frequently selected C by the MCC-based grid search: (70 times out of 100). MCC: Matthews correlation coefficient. MCC worst value and best value . TP rate: true positive rate, sensitivity, recall. TN rate: true negative rate, specificity. PR: precision-recall curve. PPV: positive predictive value, precision. NPV: negative predictive value. ROC: receiver operating characteristic curve. AUC: area under the curve. score, accuracy, TP rate, TN rate, PPV, NPV, PR AUC, ROC AUC: worst value and best value . ROSE p-value: 0.5 for all. We report the results with standard deviations in Table S2 and the formulas of the statistical indicators in the Supplementary Information. We highlighted in italic and with an asterisk * the top result for each statistical indicator.
Results of the survival prediction made with machine learning classifiers, including standard deviation, with training phase done on the Norwegian primary cohort or study cohort[87] and testing phase done on the South Korean external validation cohort[96].
| method | PR AUC | ROC AUC | TP rate | TN rate | PPV | NPV | MCC | accuracy | |
|---|---|---|---|---|---|---|---|---|---|
| Naïve Bayes | 0.715 | 0.777 | 0.663 | ||||||
| Gradient boosting | 0.843 | 0.527 | 0.953 | 0.035 | 0.823 | 0.123 | – 0.018 | 0.882 | |
| Radial SVM | 0.821 | 0.514 | 0.949 | 0.013 | 0.819 | 0.040 | – 0.068 | 0.879 | 0.785 |
| Gradient boosting | 0.552 | 0.061 | 0.830 | ||||||
| Naïve Bayes | 0.848 | 0.747 | 0.244 | + 0.113 | 0.795 | 0.683 | |||
| Radial SVM | 0.829 | 0.537 | 0.955 | 0.011 | 0.820 | 0.043 | – 0.068 | 0.882 | 0.789 |
| Linear regression | 0.824 | 0.499 | 0.956 | 0.042 | 0.824 | 0.166 | – 0.005 | 0.885 | 0.796 |
Mean results of 100 executions with random selection of the elements in the training set and test set, with ROSE oversampling[97] applied to the training set. : standard deviation. Admissions of survived patients: positives data instances (class 1). Admissions of deceased patients: negative data instances (class 0). Linear SVM: support vector machine with linear kernel. Optimized cost regularization hyper-parameter of the linear SVM, most frequently selected C by the MCC-based grid search: for primary cohort (63 times out of 100) and for study cohort (51 times out of 100). Radial SVM: support vector machine with radial Gaussian kernel. Optimized cost regularization of the radial SVM, most frequently selected C by the MCC-based grid search: for the primary cohort (56 times out of 100) and for the study cohort (51 times out of 100). MCC: Matthews correlation coefficient. MCC worst value and best value . TP rate: true positive rate, sensitivity, recall. TN rate: true negative rate, specificity. PR: precision-recall curve. PPV: positive predictive value, precision. NPV: negative predictive value. ROC: receiver operating characteristic curve. AUC: area under the curve. score, accuracy, TP rate, TN rate, PPV, NPV, PR AUC, ROC AUC: worst value and best value . We report the formulas of these rates in the Supplementary Information, and the same results including the standard deviations in Table S3. ROSE minority class probability: for SVMs; for gradient boosting, naïve Bayes, and linear regression in the primary cohort; for gradient boosting, naïve Bayes, and linear regression in the study cohort. We highlighted in italic and with an asterisk * the top result for each statistical indicator. We did not report the results of linear regression trained on the primary cohort and the results of the linear SVM on both the cohorts because these methods predicted all positives in the validation cohort.