| Literature DB >> 35047529 |
Bin Wang1,2, Xiong Han1,2, Zongya Zhao3, Na Wang1, Pan Zhao1, Mingmin Li1, Yue Zhang2, Ting Zhao1, Yanan Chen1, Zhe Ren2, Yang Hong4.
Abstract
Objective: Antiseizure medicine (ASM) is the first choice for patients with epilepsy. The choice of ASM is determined by the type of epilepsy or epileptic syndrome, which may not be suitable for certain patients. This initial choice of a particular drug affects the long-term prognosis of patients, so it is critical to select the appropriate ASMs based on the individual characteristics of a patient at the early stage of the disease. The purpose of this study is to develop a personalized prediction model to predict the probability of achieving seizure control in patients with focal epilepsy, which will help in providing a more precise initial medication to patients.Entities:
Keywords: EEG complexity; gradient boosting decision tree (GBDT) model; machine learning; precision medicine; prediction model
Year: 2022 PMID: 35047529 PMCID: PMC8761908 DOI: 10.3389/fmed.2021.781937
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Figure 1Flow chart. EEG, electroencephalogram; AEDs, antiepileptic drugs; OXC, oxcarbazepine; fPWEs, patients with focal epilepsy; SF, seizure-free; NSF, not seizure-free.
Demographical and clinical status of the participants.
|
|
|
|
| |
|---|---|---|---|---|
| Sex (Male/female) | 33/19 | 11/11 | 1.162 | 0.281 |
| Age, year | 14.5 ± 11.50 | 16.50 ± 12.00 | −0.681 | 0.496 |
| Age at onset, year | 13.5 ± 12.25 | 15.50 ± 13.38 | −0.361 | 0.718 |
| Follow-up time, months | 32.58 ± 9.83 | 36.18 ± 8.91 | −1.481 | 0.143 |
| Seizure frequency before OXC, times/month | 0.65 ± 0.70 | 15.50 ± 13.37 | −1.983 |
|
| Seizure circadian rhythm (day/night/both) | 19/16/17 | 8/7/7 | 0.009 | 0.995 |
| Comorbidity (Y/N) | 27/25 | 16/6 | 2.749 | 0.097 |
| Inducement (Y/N) | 29/23 | 17/5 | 3.039 | 0.081 |
| History of perinatal injury (Y/N) | 12/40 | 12/10 | 6.986 |
|
| Physical development (N/AN) | 9/43 | 3/19 | 4.469 |
|
| Family history (Y/N) | 2/50 | 2/20 | 0.832 | 0.362 |
| MRI(P/N) | 17/35 | 13/9 | 4.469 |
|
| TLE(Y/N) | 16/36 | 8/14 | 0.221 | 0.638 |
| History of CNS infection (Y/N) | 7/45 | 2/20 | 0.261 | 0.599 |
| History of head injury (Y/N) | 5/47 | 2/20 | 0.005 | 0.944 |
SF, seizure-free; NSF, not seizure-free; OXC, oxcarbazepine; Y/N, yes/no; N/AN, normal/abnormal; P/N, positive/negative; MRI, magnetic resonance imaging; TLE, temporal lobe epilepsy; CNS, central nervous system.
For qualitative data, Chi-square tests were used.
For quantitative data, after Shapiro-Wilk normality test, the Mann-Whitney U-test was applied for data with abnormal distributions, data that did not conform to normal distributions were presented as the median ± interquartile range.
Data with a normal distribution were compared by the independent sample t-tests, mean ± standard deviation was used to describe. p < 0.05 is considered as statistically significant.
Defined as features that have statistically significant between SF group and NSF group.
Bold values are statistically significant.
The top 10 features that impacting the GBDT classifier mostly.
|
|
|
|
| ||
|---|---|---|---|---|---|
|
| |||||
| δ-F8 | 0.0240 ± 0.0214 | 0.0298 ± 0.0132 | −1.656 | 0.098 | 0.098 |
| θ-T3 | 0.0518 ± 0.0356 | 0.0566 ± 0.0558 | −2.010 | 0.044 | 0.060 |
| α-Cz | 0.0631 ± 0.0383 | 0.0745 ± 0.0500 | −2.472 | 0.013 |
|
| θ-F3 | 0.0510 ± 0.0352 | 0.0561 ± 0.0526 | −2.032 | 0.042 | 0.057 |
| α-Fz | 0.0649 ± 0.0384 | 0.0757 ± 0.0582 | −2.424 | 0.015 |
|
| θ-T6 | 0.0531 ± 0.0394 | 0.0587 ± 0.0528 | −1.880 | 0.060 | 0.068 |
| TLE | |||||
| β-T3 | 0.1119 ± 0.0742 | 0.1261 ± 0.1133 | −2.081 | 0.037 | 0.050 |
| β-Pz | 0.1100 ± 0.7230 | 0.1227 ± 0.1024 | −2.081 | 0.037 | 0.050 |
| α-T6 | 0.0624 ± 0.0334 | 0.0742 ± 0.0569 | −2.389 | 0.017 |
|
|
| |||||
| δ-T3 | 0.2245 ± 0.0068 | 0.2531 ± 0.2333 | −1.809 | 0.070 | 0.070 |
| θ-F7 | 0.0503 ± 0.0204 | 0.0575 ± 0.0555 | −1.904 | 0.057 | 0.065 |
| Seizure frequency before OXC | |||||
| θ-FP1 | 0.0521 ± 0.038 | 0.0573 ± 0.0566 | −1.928 | 0.054 | 0.061 |
| θ-T6 | 0.0537 ± 0.0398 | 0.0594 ± 0.0535 | −1.904 | 0.057 | 0.065 |
| θ-Fz | 0.0505 ± 0.0371 | 0.0564 ± 0.0540 | −2.105 | 0.035 |
|
| β-T6 | 0.1116 ± 0.0728 | 0.1250 ± 0.1084 | −2.105 | 0.035 |
|
| β-O2 | 0.1109 ± 0.0704 | 0.1244 ± 0.1148 | −2.200 | 0.028 |
|
| Seizure circadian rhythm | |||||
| α-Pz | 0.0645 ± 0.037 | 0.0753 ± 0.0542 | −2.306 | 0.021 |
|
GBDT, gradient boosting decision tree; LZC, Lempel-Ziv complexity; KC, Kolmogorov complexity; SF, seizure-free; NSF, not seizure-free; TLE, temporal lobe epilepsy; δ-F8, δ band from F8 channel; OXC, oxcarbazepine; P′-value refers to P-value that is corrected by false discovery rate correction.
The features that have statistically significance. Although the selected features may not be statistically significant, they did have a classification value in the model.
Bold values are statistically significant.
Figure 2EEG features that have the most significant impact on classification. θ-T3: θ band from T3 channel, p < 0.05 is considered statistically significant.
The performance of the four classifier models.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
|
| ||||||
| Accuracy (%) | 67 | 87 | 80 | 93 | 79 | 81 |
| Precision (%) | 69 | 80 | 81 | 100 | 90 | 84 |
| Recall (%) | 90 | 100 | 90 | 92 | 82 | 91 |
| F1-score (%) | 78 | 89 | 86 | 96 | 86 | 87 |
| AUC (%) | 64 | 89 | 82 | 100 | 70 | 81 |
| Sensitivity (%) | 90 | 100 | 90 | 92 | 82 | 91 |
| Specificity (%) | 20 | 71 | 60 | 100 | 67 | 64 |
|
| ||||||
| Accuracy (%) | 60 | 53 | 67 | 67 | 64 | 62 |
| Precision (%) | 70 | 56 | 78 | 100 | 80 | 77 |
| Recall (%) | 70 | 63 | 70 | 62 | 73 | 67 |
| F1-score (%) | 70 | 59 | 74 | 76 | 76 | 71 |
| AUC (%) | 54 | 64 | 64 | 100 | 33 | 63 |
| Sensitivity (%) | 70 | 63 | 70 | 62 | 73 | 67 |
| Specificity (%) | 40 | 43 | 60 | 100 | 33 | 55 |
|
| ||||||
| Accuracy (%) | 67 | 87 | 80 | 100 | 79 | 82 |
| Precision (%) | 69 | 80 | 82 | 100 | 90 | 84 |
| Recall (%) | 90 | 100 | 90 | 100 | 82 | 92 |
| F1-score (%) | 78 | 89 | 86 | 100 | 86 | 88 |
| AUC (%) | 66 | 89 | 88 | 100 | 73 | 83 |
| Sensitivity (%) | 66 | 89 | 88 | 100 | 73 | 83 |
| Specificity (%) | 90 | 100 | 90 | 100 | 82 | 92 |
|
| ||||||
| Accuracy (%) | 60 | 53 | 67 | 67 | 64 | 62 |
| Precision (%) | 70 | 56 | 78 | 100 | 80 | 77 |
| Recall (%) | 70 | 63 | 70 | 62 | 73 | 67 |
| F1-score (%) | 70 | 59 | 74 | 76 | 76 | 71 |
| AUC (%) | 54 | 63 | 64 | 100 | 33 | 63 |
| Sensitivity (%) | 70 | 63 | 70 | 62 | 73 | 67 |
| Specificity (%) | 40 | 43 | 60 | 100 | 33 | 55 |
LZC, Lempel-Ziv complexity; GBDT, gradient boosting decision tree; AUC, the area under the curve; SVM, support vector machine; RFE, recursive feature elimination; KC, Kolmogorov complexity.
Figure 3The mean evaluation indexes after five-fold cross-validation.
Figure 4The performance of four models. GBDT, gradient boosting decision tree; LZC, Lempel-Ziv complexity; AUC, area under the curve; ROC, receiver operating-characteristic curve; std. dev, standard deviation; SVM, support vector machine, RFE, recursive feature elimination, CV, cross-validation; KC, Kolmogorov complexity.