| Literature DB >> 35808172 |
Elias Dritsas1, Maria Trigka1.
Abstract
A stroke is caused when blood flow to a part of the brain is stopped abruptly. Without the blood supply, the brain cells gradually die, and disability occurs depending on the area of the brain affected. Early recognition of symptoms can significantly carry valuable information for the prediction of stroke and promoting a healthy life. In this research work, with the aid of machine learning (ML), several models are developed and evaluated to design a robust framework for the long-term risk prediction of stroke occurrence. The main contribution of this study is a stacking method that achieves a high performance that is validated by various metrics, such as AUC, precision, recall, F-measure and accuracy. The experiment results showed that the stacking classification outperforms the other methods, with an AUC of 98.9%, F-measure, precision and recall of 97.4% and an accuracy of 98%.Entities:
Keywords: data analysis; machine learning; risk prediction; stroke
Mesh:
Year: 2022 PMID: 35808172 PMCID: PMC9268898 DOI: 10.3390/s22134670
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Participants distribution per age group and gender type in the balanced dataset.
Figure 2Participants distribution per hypertension and heart disease status in the balanced dataset.
Figure 3Participants distribution per BMI category and smoke status in the balanced dataset.
Figure 4Participants distribution per residence and work type in the balanced dataset.
Features importance in the balanced data.
| Random Forest | Information Gain | ||
|---|---|---|---|
|
|
|
|
|
| Age | 0.4702 | Age | 0.75627 |
| BMI | 0.404 | Ever_married | 0.09382 |
| Avg_glucose_level | 0.1139 | BMI | 0.06991 |
| Ever_married | 0.0929 | Avg_glucose_level | 0.06265 |
| Work_type | 0.0898 | Work_type | 0.05651 |
| Smoking_status | 0.0661 | Heart_disease | 0.02777 |
| Residence_type | 0.0537 | Smoking_status | 0.02554 |
| Gender | 0.0500 | Residence_type | 0.02129 |
| Heart_disease | 0.0499 | Gender | 0.01667 |
| Hypertension | 0.0177 | Hypertension | 0.00523 |
Figure 5Machine learning models AUC and F-measure evaluation for the stroke class.
Figure 6Machine learning models precision and recall evaluation for the stroke class.
Average performance of ML models.
| Precision | Recall | F-Measure | AUC | Accuracy | |
|---|---|---|---|---|---|
|
| 0.812 | 0.860 | 0.835 | 0.867 | 0.84 |
|
| 0.791 | 0.791 | 0.791 | 0.877 | 0.79 |
|
| 0.918 | 0.916 | 0.915 | 0.943 | 0.81 |
|
| 0.791 | 0.791 | 0.791 | 0.791 | 0.88 |
|
| 0.909 | 0.909 | 0.909 | 0.927 | 0.91 |
|
| 0.884 | 0.881 | 0.881 | 0.929 | 0.92 |
|
| 0.93 | 0.93 | 0.93 | 0.93 | 0.93 |
|
| 0.966 | 0.966 | 0.966 | 0.986 | 0.97 |
|
| 0.974 | 0.974 | 0.974 | 0.989 | 0.98 |
Comparison of ML models performance.
| Precision | Recall | F-Measure | Accuracy | |||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
|
| 0.812 | 0.786 | 0.860 | 0.857 | 0.835 | 0.823 | 0.84 | 0.82 |
|
| 0.791 | 0.775 | 0.791 | 0.760 | 0.791 | 0.776 | 0.79 | 0.78 |
|
| 0.918 | 0.774 | 0.916 | 0.838 | 0.915 | 0.804 | 0.81 | 0.80 |
|
| 0.909 | 0.909 | 0.909 | 0.775 | 0.909 | 0.776 | 0.88 | 0.66 |
|
| 0.974 | 0.720 | 0.974 | 0.735 | 0.974 | 0.727 | 0.98 | 0.73 |