Literature DB >> 32082247

Impact of 25-Hydroxyvitamin D on the Prognosis of Acute Ischemic Stroke: Machine Learning Approach.

Chulho Kim1,2, Sang-Hwa Lee1, Jae-Sung Lim3, Yerim Kim4, Min Uk Jang5, Mi Sun Oh3, San Jung6, Ju-Hun Lee4, Kyung-Ho Yu3, Byung-Chul Lee3.   

Abstract

Background and Purpose: Vitamin D is a predictor of poor outcome for cardiovascular disease. We evaluated whether serum 25-hydroxyvitamin D level was associated with poor outcome in patients with acute ischemic stroke (AIS) using machine learning approach. Materials and
Methods: We studied a total of 328 patients within 7 days of AIS onset. Serum 25-hydroxyvitamin D level was obtained within 24 h of hospital admission. Poor outcome was defined as modified Rankin Scale score of 3-6. Logistic regression and extreme gradient boosting algorithm were used to assess association of 25-hydroxyvitamin D with poor outcome. Prediction performances were compared with area under ROC curve and F1 score.
Results: Mean age of patients was 67.6 ± 13.3 years. Of 328 patients, 59.1% were men. Median 25-hydroxyvitamin D level was 10.4 (interquartile range, 7.1-14.8) ng/mL and 47.2% of patients were 25-hydroxyvitamin D-deficient (<10 ng/mL). Serum 25-hydroxyvitamin D deficiency was a predictor for poor outcome in multivariable logistic regression analysis (odds ratio, 3.38; 95% confidence interval, 1.24-9.18, p = 0.017). Stroke severity, age, and 25-hydroxyvitamin D level were also significant predictors in extreme gradient boosting classification algorithm. Performance of extreme gradient boosting algorithm was comparable to those of logistic regression (AUROC, 0.805 vs. 0.746, p = 0.11). Conclusions: 25-hydroxyvitamin D deficiency was highly prevalent in Korea and low 25-hydroxyvitamin D level was associated with poor outcome in patients with AIS. The machine learning approach of extreme gradient boosting was also useful to assess stroke prognosis along with logistic regression analysis.
Copyright © 2020 Kim, Lee, Lim, Kim, Jang, Oh, Jung, Lee, Yu and Lee.

Entities:  

Keywords:  cerebral infarction; machine learning; outcome assessment; stroke; vitamin D

Year:  2020        PMID: 32082247      PMCID: PMC7005206          DOI: 10.3389/fneur.2020.00037

Source DB:  PubMed          Journal:  Front Neurol        ISSN: 1664-2295            Impact factor:   4.003


Introduction

Vitamin D is a prohormone synthesized by sun-exposure and dietary intake (1). Besides its role in bone integrity and calcium homeostasis (2), vitamin D status is also associated with cardiovascular morbidity and mortality (3–6). Vitamin D has a protective effect on endothelial function and vascular remodeling in experimental models (7, 8). One meta-analysis has shown that low vitamin D level is associated with 2.5-fold increase of the risk of ischemic stroke (9). However, the benefit of vitamin D supplementation in improving cardiovascular outcome remains controversial (10–12). Several studies have suggested that low vitamin D is also associated with poor outcome in patients with acute ischemic stroke (AIS) (13–17). In these studies, logistic regression analysis alone was used to assess the relationship between low vitamin D status and prognosis of AIS patients. Logistic regression analysis should consider model complexity including interaction and model appropriateness using goodness of fit (18). Vitamin D is associated with several cardiovascular risk factors, including hypertension (19), insulin resistance (20), cerebral small vessel disease burden (21), stroke severity (16, 17), infarct volume (22), mood (23), and cognition (24) in previous studies. However, studies considering these interactions in logistic regression models are scarce. Recently, machine learning (ML) has been found to be capable of accurately predicting prognosis in several disease categories, including cancer (25), cardiovascular disease (26), and psychiatric disorders (27). ML has advantage of being able to deal with large-size data and having several optimization technique, thus reducing the overfitting of training algorithms (28). In addition, several ML models such as random forest and extreme gradient boosting are tree-based feature selection ML algorithms that enable us to find non-linear relationship and interactions between independent variables more efficiently compared to logistic regression model (29). In this study on the association between vitamin D and stroke prognosis, we hypothesize that low vitamin D is associated with poor outcome in patients with AIS and that the predicting performance of extreme gradient boosting is superior to that of binary logistic regression for considering vitamin D with several risk factors.

Materials and Methods

Study Population

The present study was a single-center retrospective study that screened all patients with AIS within 7 days of symptom onset using prospectively collected hospital registry. From August 2013 to October 2015, a total of 738 AIS patients with a positive diffusion-weighted lesion on brain MRI were selected for screening. We excluded patients with prior history of stroke (n = 243) because dietary intake or physical activity known to be associated with vitamin D intake and biosynthesis could be affected by premorbid neurological status. Additionally, patients without 3-month outcome capture (n = 85) and laboratory measures including admission serum 25-hydroxyvitamin D level (n = 82) were excluded.

Measurement of Serum 25-Hydroxyvitamin D Level

Vitamin D status was determined by serum 25-hydroxyvitamin D concentration, a major circulating form with a long circulating half-life (~3 weeks) (30). 25-hydroxyvitamin D level was measured within 24 h of admission using a radioimmunoassay kit (DiaSorin Liaison, Stillwater, MN, USA) with <10% of inter-assay coefficient of variation. The assay was standardized against NIST Standard Reference Material 2972 (NIST, Gaithersburg, ME, USA) and certified by the Centers for Disease Control and Prevention Vitamin D Standardization Program. 25-hydroxyvitamin D deficiency was defined when its concentration was <10 ng/mL following the criteria of Korean population study (31).

Covariates

Definitions for vascular risk factors were based on our previous reports (32). Hypertension was defined if participants were taking antihypertensive medications or if their average sitting systolic blood pressure was 140/90 mmHg or more. Diabetes was defined if they were taking medical treatments for diabetes, if they had a fasting serum glucose level of 126 mg/dL or more, or if they had a non-fasting random serum glucose level of 200 mg/dL or more with corresponding symptoms of diabetes. Subjects were considered to have hyperlipidemia if they had a fasting total cholesterol level of 240 mg/dL or more or if they were being treated with a lipid-lowering agent. A current smoker was defined as a person smoking one or more cigarettes per day within the last 6 months. Stroke subtypes were classified as large artery atherosclerosis (LAA), small vessel occlusion (SVO), cardioembolism (CE), undetermined etiology (SUE), and other determined etiology (SOE) according to the Trial of Org 10172 in Acute Stroke Treatment (TOAST) criteria (33). Laboratory parameters included serum level of fasting blood sugar, total cholesterol, triglyceride, high-density lipoprotein, low-density lipoprotein, blood urea nitrogen, creatinine, and whole blood level of white blood cell count, hemoglobin, and platelet count.

Outcomes

Primary outcome measure was 3-months poor functional outcome defined as a modified Rankin Scale (mRS) score of 3–6. Certified neurologists and nurses assessed the mRS score. Secondary outcome was F1 score of the prediction model (binary logistic regression vs. extreme gradient boosting) for AIS prognosis classification. F1 score is a measure of the model's accuracy. It represents harmonized mean of precision (positive predictive value) and recall (sensitivity) of the ML classifier.

Statistical Analysis

Baseline characteristics according to tertiles of 25-hydroxyvitamin D were compared with analysis of variance or Kruskal Wallis test for continuous variables. For categorical variables, Pearson's χ2-test or Fisher's exact test was used. We performed univariate and multivariate logistic regression analyses for 3-months poor functional outcome (mRS 3–6) according to 25-hydroxyvitamin D status. All variables that were significant with p < 0.10 in univariable analysis, age, and sex were entered into multivariable models. Two-sided p < 0.05 was considered significant in multivariable analysis performed with R version 3.4.1 (the R Foundation for Statistical Computing).

Machine Learning Algorithm

We used the Extreme Gradient Boosting (XGBoost) R package that could perform regression and classification task well. XGBoost is an ML algorithm widely used in various Kaggle contests because it can perform non-linear prediction well, deal with missing data, and prove that computational speed is faster than other ML methods (34). As previously mentioned, we labeled patient's outcome as good (mRS: 0–2) or poor (mRS: 3–6). All participants' dataset were randomly divided into 2 group: training (60% of 328 subjects) and testing (40%). Proportion of poor outcome in training and testing dataset was identically allocated using R cart package. Input features of this model were independent variables in the multivariate logistic regression model. Precise parameter tuning methods of XGBoost for preventing overfitting of this ML model are shown in Supplemental Material. We calculated the probabilistic score of the prediction model of binary logistic regression and extreme gradient boosting algorithm, in which we set the probabilistic score of 0.5 or more to “poor outcome” (or test positive) and those <0.5 to “good outcome.” We presented the performance of these two classification tasks as precision, recall, accuracy, and F1 score. We compared model performance with area under the ROC curve (AUROC). Additionally, we also examined the performance of other machine learning algorithms such as support vector machines.

Results

Baseline Characteristics According to 25-Hydroxyvitamin D Tertile

A total of 328 patients (mean age, 67.4 ± 13.2 years; male, 59.1%) were included in this study (Table 1). The mean (± standard deviation) and median [interquartile range (IQR)] of 25-hydroxyvitamin D were 11.5 ± 6.1 and 10.3 (7.1–14.7) ng/mL, respectively. The 25-hydroxyvitamin D tertile was inversely correlated with NIHSS score (p for Kruskal Wallis test = 0.029) and blood urea nitrogen level (p for one-way ANOVA = 0.007). It was negatively correlated with hemoglobin level (p for one-way ANOVA < 0.001). Stroke subtype and cardiovascular risk factors such as hypertension, diabetes, smoking, or atrial fibrillation showed no significant associations with 25-hydroxyvitamin D tertile.
Table 1

Baseline characteristic of participants according to 25-hydroxyvitamin D tertile.

Total25-hydroxyvitamin D concentrationp
Tertile 1 (n = 109, 3.0–7.8 ng/mL)Tertile 2 (n = 109, 7.9–12.6 ng/mL)Tertile 3 (n = 110, 12.7–37.4 ng/mL)
Age (±SD), years67.4 ± 13.269.5 ± 14.066.2 ± 13.166.7 ± 12.40.138
Male, %194 (59.1)58 (53.2)65 (59.6)71 (64.5)0.231
BMI, IQR, Kg/m224.0 ± 3.424.2 ± 4.423.8 ± 2.723.9 ± 2.80.720
Hypertension, %220 (67.1)79 (72.5)75 (68.8)66 (60.0)0.130
Diabetes, %104 (31.7)42 (38.5)32 (29.4)30 (27.3)0.164
Hyperlipidemia, %116 (35.4)42 (38.5)41 (37.6)33 (30.0)0.349
Current smoking, %138 (42.1)41 (37.6)45 (41.3)52 (47.3)0.344
Atrial fibrillation, %65 (19.8)20 (18.3)23 (21.1)22 (20.0)0.877
TOAST classification0.469
LAA124 (37.8)35 (32.1)46 (42.2)43 (39.1)
SVO94 (28.7)31 (28.4)29 (26.6)34 (30.9)
CE65 (19.8)22 (20.2)22 (20.2)21 (19.1)
SOE and SUE45 (13.7)21 (19.3)12 (11.0)12 (10.9)
NIHSS score, mean ± SD5.4 ± 6.06.6 ± 7.15.5 ± 5.64.2 ± 5.00.012
Median(IQR)3.0 (1.0–8.8)3.0 (1.5–9.5)3.0 (1.5–11.0)2.0 (1.0–5.3)0.029
IV thrombolysis44 (13.4)13 (11.9)20 (18.3)11 (10.0)0.166
Fasting blood glucose, mg/dL125 ± 53131 ± 62118 ± 39124 ± 540.217
Total cholesterol, mg/dL181 ± 40177 ± 47182 ± 42185 ± 310.281
TG, mg/dL121 ± 74126 ± 82110 ± 58125 ± 780.205
HDL, mg/dL49 ± 1647 ± 1549 ± 1651 ± 170.102
LDL, mg/dL115 ± 37111 ± 43116 ± 38117 ± 290.377
BUN, mg/dL16.9 ± 8.719.1 ± 10.815.8 ± 8.515.9 ± 5.70.007
Creatinine, mg/dL0.9 ± 1.11.1 ± 1.30.8 ± 0.40.9 ± 1.30.206
eGFR, mL/min/1.73 m262.3 ± 84.969.2 ± 97.052.5 ± 23.265.3 ± 107.80.313
WBC count, × 103 cells/mm38.2 ± 3.38.3 ± 3.08.0 ± 2.48.3 ± 4.20.803
Hemoglobin, g/dL13.6 ± 2.113.0 ± 2.413.8 ± 1.914.1 ± 1.7<0.001
Platelet count, × 103/L227 ± 77231 ± 85235 ± 90215 ± 500.146
BPsystolic, mmHg148 ± 26149 ± 26147 ± 27148 ± 260.751
BPdiastolic, mmHg85 ± 1685 ± 1585 ± 1685 ± 160.944

BMI, body mass index; IQR, interquartile range; TOAST, Trial of Org 10172 in Acute Stroke Treatment; LAA, large artery atherosclerosis; SVO, small vessel occlusion; CE, cardioembolism; SOE, stroke of other determined etiology; SUE, stroke of undetermined etiology; NIHSS, National Institute of Health Stroke Scale; IV, intravenous; TG, triglyceride; HDL, high-density lipoprotein; LDL, low-density lipoprotein; BUN, blood urea nitrogen; eGFR, estimated glomerular filtration rate; WBC, white blood cell; BP, blood pressure.

Baseline characteristic of participants according to 25-hydroxyvitamin D tertile. BMI, body mass index; IQR, interquartile range; TOAST, Trial of Org 10172 in Acute Stroke Treatment; LAA, large artery atherosclerosis; SVO, small vessel occlusion; CE, cardioembolism; SOE, stroke of other determined etiology; SUE, stroke of undetermined etiology; NIHSS, National Institute of Health Stroke Scale; IV, intravenous; TG, triglyceride; HDL, high-density lipoprotein; LDL, low-density lipoprotein; BUN, blood urea nitrogen; eGFR, estimated glomerular filtration rate; WBC, white blood cell; BP, blood pressure.

25-Hydroxyvitamin D Level and Poor Outcome

Of 328 participants, proportions of AIS patients with poor 3-months outcome and 25-hydroxyvitamin D deficiency were 22.9 and 48.2%, respectively (Table 2). Patients with 25-hydroxyvitamin D deficiency tended to have higher mRS score compared to those without such deficiency (p for χ2 trend = 0.026, Figure 1). Poor outcome at 3-months was positively associated with age, female gender, stroke severity, blood urea nitrogen, and white blood cell count. Hemoglobin level and 25-hydroxyvitamin D deficiency were inversely associated with poor outcome in patients with AIS. In univariate analysis, 25-hydroxyvitamin D deficiency was a predictor for poor 3-months outcome [OR (odds ratio), 1.86; CI (confidence interval), 1.10–3.14; p = 0.021]. Although age, sex, stroke severity, stroke subtype, intravenous thrombolysis, blood urea nitrogen, white blood cell count, and hemoglobin were adjusted in the multivariate logistic regression model, 25-hydroxyvitamin D deficiency was a significant predictor for 3-months poor outcome (OR, 3.21; CI, 1.22–8.48; p = 0.019, Table 3).
Table 2

Differences of demographic and clinical characteristics between good and poor outcome groups of patients with acute ischemic stroke.

Poor (n = 75)Good (n = 253)Total (n = 328)p
Age, years72.6 ± 14.365.9 ± 12.567.4 ± 13.2<0.001
Male, %37 (49.3)157 (62.1)194 (59.1)0.049
BMI, Kg/m224.2 ± 5.123.9 ± 2.724.0 ± 3.40.628
Hypertension, %54 (72.0)166 (65.6)220 (67.1)0.301
Diabetes, %25 (33.3)79 (31.2)104 (31.7)0.730
Hyperlipidemia, %29 (38.7)87 (34.4)116 (35.4)0.496
Current smoking, %27 (36.0)111 (43.9)138 (42.1)0.225
Atrial fibrillation, %19 (25.3)46 (18.2)65 (19.8)0.172
TOAST classification0.329
LAA27 (36.0)97 (38.3)124 (37.8)
SVO17 (22.7)77 (30.4)94 (28.7)
CE17 (22.7)48 (19.0)65 (19.8)
SOE and SUE14 (18.7)31 (12.3)45 (13.7)
NIHSS, mean ± SD10.2 ± 7.04.0 ± 4.95.4 ± 6.0<0.001
NIHSS, median (IQR)9 (4-15)2 (1-5)3.0 (1.0–8.8)<0.001
IV thrombolysis15 (20.0)29 (11.5)44 (13.4)0.057
Fasting blood glucose, mg/dL131 ± 57123 ± 52125 ± 530.240
Total cholesterol, mg/dL187 ± 47180 ± 38181 ± 400.195
TG, mg/dL110 ± 82124 ± 71121 ± 740.154
HDL, mg/dL52 ± 1448 ± 1649 ± 160.120
LDL, mg/dL119 ± 45113 ± 35115 ± 370.252
BUN, mg/dL18.9 ± 8.716.3 ± 8.616.9 ± 8.70.026
Creatinine, mg/dL0.9 ± 0.41.0 ± 1.20.9 ± 1.10.456
WBC count, × 103 cells/mm38.9 ± 3.28.0 ± 3.38.2 ± 3.30.032
Hemoglobin, g/dL13.2 ± 2.413.8 ± 2.013.6 ± 2.10.040
Platelet count, × 103/L234 ± 92225 ± 72227 ± 770.378
BPsystolic, mmHg149 ± 26147 ± 26148 ± 260.613
BPdiastolic, mmHg83 ± 1586 ± 1685 ± 160.288
25-hydroxyvitamin D deficiency, %45 (60.0)113 (44.7)158 (48.2)0.020

BMI, body mass index; TOAST, Trial of Org 10172 in Acute Stroke Treatment; LAA, large artery atherosclerosis; SVO, small vessel occlusion; CE, cardioembolism; SOE, stroke of other determined etiology; SUE, stroke of undetermined etiology; NIHSS, National Institute of Health Stroke Scale; IQR, interquartile range; IV, intravenous; TG, triglyceride; HDL, high-density lipoprotein; LDL, low-density lipoprotein; BUN, blood urea nitrogen; eGFR, estimated glomerular filtration rate; WBC, white blood cell; BP, blood pressure.

Figure 1

Distribution of 3-month modified Rankin Scale score according to 25-hydroxyvitamin D deficiency status.

Table 3

Binary logistic regression analysis of predictors for poor functional outcome (mRS 3-6) at 3 months.

OR (95% CI)p
25(OH)D deficiency (<10 ng/mL)1.86 (1.10–3.14)0.021
Model 11.74 (1.01–2.97)0.044
Model 23.01 (1.16–7.84)0.024
Model 33.18 (1.21–8.36)0.019
Model 43.21 (1.22–8.48)0.019

mRS, modified Rankin Scale; OR, odds ratio; CI, confidence interval; 25(OH)D, 25-hydroxyvitamin D. Model 1 was adjusted for age and sex. Model 2 included variables in model 1 plus NIHSS and NIHSS.

Differences of demographic and clinical characteristics between good and poor outcome groups of patients with acute ischemic stroke. BMI, body mass index; TOAST, Trial of Org 10172 in Acute Stroke Treatment; LAA, large artery atherosclerosis; SVO, small vessel occlusion; CE, cardioembolism; SOE, stroke of other determined etiology; SUE, stroke of undetermined etiology; NIHSS, National Institute of Health Stroke Scale; IQR, interquartile range; IV, intravenous; TG, triglyceride; HDL, high-density lipoprotein; LDL, low-density lipoprotein; BUN, blood urea nitrogen; eGFR, estimated glomerular filtration rate; WBC, white blood cell; BP, blood pressure. Distribution of 3-month modified Rankin Scale score according to 25-hydroxyvitamin D deficiency status. Binary logistic regression analysis of predictors for poor functional outcome (mRS 3-6) at 3 months. mRS, modified Rankin Scale; OR, odds ratio; CI, confidence interval; 25(OH)D, 25-hydroxyvitamin D. Model 1 was adjusted for age and sex. Model 2 included variables in model 1 plus NIHSS and NIHSS.

Comparison of Logistic Regression and the Other Machine Learning Classifier

We used extreme gradient boosting ML algorithm to classify and predict factors associated with poor 3-months outcome of AIS. Figure 2 shows factors affecting this binary classification weighed by XGBoost algorithm. NIHSS (feature importance, 32.8%) score was the top predictive factor affecting XGBoost classification, followed by age (16.0%) and 25-hydroxyvitamin D level (12.5%) in the training dataset. Table 4 shows result of classification performance of binary logistic regression and extreme gradient boosting to predict 3-months poor outcome. Overall prediction performances including precision, recall, accuracy, and F1 score were higher in extreme gradient boosting model compared to those in the binary logistic regression model. Figure 3 shows performances of these two classifiers with AUROC curve analysis for the test dataset (total 131 subjects). The XGBoost's performance was higher than those of logistic regression in classifying prognosis of patients with AIS, but statistical significance was not achieved (AUROC, 0.805 vs. 0.746, p-value = 0.11). We also examined the performance of the support vector machine for the binary prediction of poor outcome, and overall procedures of the support vector machine was presented in the Supplemental Material.
Figure 2

Variable importance plot for XGBoost. NIHSS, National Institute of Health Stroke Scale; 25-OH D, 25-hydroxyvitamin D, WBC, white blood cell; BUN, blood urea nitrogen; TOAST, Trial of Org 10172 in Acute Stroke Treatment; LAA, large artery atherosclerosis; d2-3, 2nd and 3rd dummy variable; IV, intravenous. X-axis represents variable importance in the extreme gradient boosting prediction.

Table 4

Results of classification performance of binary logistic regression and extreme gradient boosting to predict 3-month poor outcome of AIS patients in the test data.

TPFPFNTNTotalRecallSpecificityPrecisionNPVAccuracyF1 score
BLR1011159513140.089.647.686.480.243.5
XGB1210139613148.090.654.588.182.451.1

TP, true positive; FP, false positive; FN, false negative; TN, true negative; NPV, negative predictive value; BLR, binary logistic regression; XGB, extreme gradient boosting.

Figure 3

Receiver operating characteristic curve of binary logistic regression and extreme gradient boosting algorithm. BLR, binary logistic regression; XGB, extreme gradient boosting.

Variable importance plot for XGBoost. NIHSS, National Institute of Health Stroke Scale; 25-OH D, 25-hydroxyvitamin D, WBC, white blood cell; BUN, blood urea nitrogen; TOAST, Trial of Org 10172 in Acute Stroke Treatment; LAA, large artery atherosclerosis; d2-3, 2nd and 3rd dummy variable; IV, intravenous. X-axis represents variable importance in the extreme gradient boosting prediction. Results of classification performance of binary logistic regression and extreme gradient boosting to predict 3-month poor outcome of AIS patients in the test data. TP, true positive; FP, false positive; FN, false negative; TN, true negative; NPV, negative predictive value; BLR, binary logistic regression; XGB, extreme gradient boosting. Receiver operating characteristic curve of binary logistic regression and extreme gradient boosting algorithm. BLR, binary logistic regression; XGB, extreme gradient boosting.

Discussion

Our study ascertained that low 25-hydroxyvitamin D status in stroke patients was highly prevalent even at the time of admission of AIS patients and that 25-hydroxyvitamin D status was a significant predictor for poor outcome after adjusting multiple variables in logistic regression model. Additionally, serum 25-hydroxyvitamin D level was also associated with a poor outcome in the XGBoost ML model that considered multiple interactions. Are there possible causal relationship between vitamin D deficiency and stroke prognosis? First, it has been shown that endothelial dysfunction has an important role in atherosclerosis and stroke development (35, 36). Markers for endothelial dysfunction are associated with stroke lesion volume or clinical outcome (37). Witham et al. have reported that high dose vitamin D supplementation for 16 weeks can increase flow mediated dilatation in patients with stroke (38) which is one of well-known predictors for the cardiovascular outcome (39). These could explain a possible link between vitamin D status and stroke outcome. Second, low vitamin D status is closely related to “general health status” which could reflect individual's physical activity or dietary intake. On the contrary, vitamin D status can predict physical performance in an aged populations (40). Krarup et al. have reported that pre-stroke physical activity is associated with severity and long-term outcome from first-ever ischemic stroke (41). In this regards, low 25-hydroxyvitamin D could be a result of poor general health or increased frailty in these stroke population. New interventional studies on 25-hydroxyvitamin D supplementation in AIS patients with low vitamin D status are needed to improve our understanding of role of 25-hydroxyvitamin D in cardiovascular disease. We observed that 25-hydroxyvitamin D levels were extremely low in our stroke population compared to the normal Korean population (31). If so, how can we explain these high prevalence of 25-hydroxyvitamin D deficiency or insufficiency in stroke participants? First, the prevalence of 25-hydroxyvitamin D deficiency has been gradually increasing as more people are living in a modernized and indoor place in which sun light exposure is insufficient to produce vitamin D biosynthesis. Analysis of national health surveys in US could support this hypothesis (42). Second, relatively high prevalence of vitamin D deficiency in stroke patients has been reported in other cohorts (43, 44). In these studies, mean level of 25-hydroxyvitamin D concentration ranged from 13.7 to 14.2 ng/mL, comparable to that of our study. Unquestionably, prevalence of vitamin D deficiency is a pandemic phenomenon. It has markedly increased irrespective of age, sex, ethnicity, types of concurrent comorbidity, or socioeconomic state. Despite of these positive correlation between 25-hydroxyvitamin D deficiency and various cardiovascular outcomes, there have been few evidence of randomized controlled trials whether vitamin D supplementation has a positive effect on the cardiovascular outcome (45). These discrepancies could be partly explained by various factors such as supplemental dosage of vitamin D, ethnical and environmental factors for the study participants, study design, and outcome in each trials. It's too early to conclude that vitamin D deficiency is as salient predictor for cardiovascular disease. Therefore, future trial are necessary to consider these differences and the exact causal pathways of vitamin D metabolism. We used logistic regression and XGBoost ML algorithm to predict stroke outcome in this study. Logistic regression is the most popular multivariate statistical method in medical science. Using logistic regression, we can quantitatively estimate the relationship between two variables of interest through an index called “odds ratio.” However, it is impossible to consider all interactions among variables in logistic regression because there might be a variety of combinations of interactions between variables in the real world. Results of XGBoost ML algorithm showed that, in addition to important predictors such as stroke and stroke severity, factors such as 25-hydroxyvitamin D, white blood cell count, and hemoglobin were also important factors predicting prognosis of stroke patients. We could suggest that 25-hydroxyvitamin D is associated with a variety of factors affecting stroke prognosis such as stroke severity, age, several laboratory parameters, stroke subtype, and gender and that XGBoost ML algorithm is useful for dealing with multiple interactions between independent predictors in binary classification of stroke prognosis. In addition, XGBoost algorithm can analyze large size data with a high computational speed. In this era, medical data are becoming so vast that it becomes hard to analyze. An ML approach can be an expeditious and efficient way to analyze such big data. Our study has several limitations. First, our study was a single center retrospective case control study, although measurement of 25-hydroxyvitamin D level and 3-months clinical outcome capture were prospectively collected. Therefore, our results could not presume a causal relationship of vitamin D and stroke severity. In addition, they could not be generalizable to entire ischemic stroke patients or other ethnicities. Second, although pre-stroke physical activity and dietary intake were significant confounders in assessment with vitamin D level and stoke severity, we could not assess these variables in our model. However, some strengths of our study could reinforce the relationship between vitamin D status and stroke outcome. First, we excluded patients with prior history of stroke to minimize the effect of physical activity and dietary intake. Second, analysis of vitamin D and stroke outcome was adjusted for clinically important variables with a multivariate logistic model. Lastly, interactions between all independent variables were statistically considered by means of an ensemble ML mechanism called extreme gradient boosting. In conclusion, 25-hydroxyvitamin D level is associated with 3-months poor outcome in patients with AIS. In addition, extreme gradient boosting ML algorithm could reveal the association of stroke prognosis with 25-hydroxyvitamin D level which might have certain interactions with other predictors.

Data Availability Statement

The datasets analyzed in this article are not publicly available. Requests to access the datasets should be directed to the corresponding author (gumdol52@hallym.or.kr).

Ethics Statement

The studies involving human participants were reviewed and approved by Hallym University Sacred Heart Hospital Institutional Review Board/Ethics Committee (IRB No. 2015-I064). The patients/participants provided their written informed consent to participate in this study.

Author Contributions

CK and B-CL drafted the manuscript. J-HL performed the statistical analysis. S-HL and MO contributed to the statistical analyses. J-SL and YK provided critical suggestions for the discussion. MJ and SJ revised the first draft. K-HY designed the analyses to the final manuscript and technical comments on the results.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  43 in total

Review 1.  Endothelial cell apoptosis: biochemical characteristics and potential implications for atherosclerosis.

Authors:  J C Choy; D J Granville; D W Hunt; B M McManus
Journal:  J Mol Cell Cardiol       Date:  2001-09       Impact factor: 5.000

2.  Serum 25-hydroxyvitamin D predicts the short-term outcomes of Chinese patients with acute ischaemic stroke.

Authors:  Wen-Jun Tu; Sheng-Jie Zhao; Dong-Jiang Xu; Hui Chen
Journal:  Clin Sci (Lond)       Date:  2014-03       Impact factor: 6.124

3.  The effects of calcium and vitamin D supplementation on blood glucose and markers of inflammation in nondiabetic adults.

Authors:  Anastassios G Pittas; Susan S Harris; Paul C Stark; Bess Dawson-Hughes
Journal:  Diabetes Care       Date:  2007-02-02       Impact factor: 19.112

Review 4.  Vitamin D status and cardiovascular outcome.

Authors:  F Saponaro; C Marcocci; R Zucchi
Journal:  J Endocrinol Invest       Date:  2019-06-06       Impact factor: 4.256

5.  Localization of Ventricular Activation Origin from the 12-Lead ECG: A Comparison of Linear Regression with Non-Linear Methods of Machine Learning.

Authors:  Shijie Zhou; Amir AbdelWahab; John L Sapp; James W Warren; B Milan Horáček
Journal:  Ann Biomed Eng       Date:  2018-11-21       Impact factor: 3.934

6.  Is 25(OH)D Associated with Cognitive Impairment and Functional Improvement in Stroke? A Retrospective Clinical Study.

Authors:  Seniz Akcay Yalbuzdag; Banu Sarifakioglu; Sevgi Ikbali Afsar; Canan Celik; Aslı Can; Tugba Yegin; Burcu Senturk; Aliye Yildirim Guzelant
Journal:  J Stroke Cerebrovasc Dis       Date:  2015-04-25       Impact factor: 2.136

7.  Prognostic value of serum 25-hydroxyvitamin D in patients with stroke.

Authors:  Yan Wang; Hui Ji; Ying Tong; Zhuo-bo Zhang
Journal:  Neurochem Res       Date:  2014-05-01       Impact factor: 3.996

8.  Vitamin D insufficiency in Korea--a greater threat to younger generation: the Korea National Health and Nutrition Examination Survey (KNHANES) 2008.

Authors:  Han Seok Choi; Han Jin Oh; Hoon Choi; Woong Hwan Choi; Jung Gu Kim; Kyoung Min Kim; Kwang Joon Kim; Yumie Rhee; Sung-Kil Lim
Journal:  J Clin Endocrinol Metab       Date:  2010-12-29       Impact factor: 5.958

Review 9.  Making Individual Prognoses in Psychiatry Using Neuroimaging and Machine Learning.

Authors:  Ronald J Janssen; Janaina Mourão-Miranda; Hugo G Schnack
Journal:  Biol Psychiatry Cogn Neurosci Neuroimaging       Date:  2018-04-22

10.  Circulating endothelial microparticles in acute ischemic stroke: a link to severity, lesion volume and outcome.

Authors:  J Simak; M P Gelderman; H Yu; V Wright; A E Baird
Journal:  J Thromb Haemost       Date:  2006-06       Impact factor: 5.824

View more
  1 in total

1.  Predicting futile recanalization, malignant cerebral edema, and cerebral herniation using intelligible ensemble machine learning following mechanical thrombectomy for acute ischemic stroke.

Authors:  Weixiong Zeng; Wei Li; Kaibin Huang; Zhenzhou Lin; Hui Dai; Zilong He; Renyi Liu; Zhaodong Zeng; Genggeng Qin; Weiguo Chen; Yongming Wu
Journal:  Front Neurol       Date:  2022-09-28       Impact factor: 4.086

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.