OBJECTIVE: Anterior temporal lobectomy (ATL) is a widely performed and successful intervention for drug-resistant temporal lobe epilepsy (TLE). However, up to one third of patients experience seizure recurrence within 1 year after ATL. Despite the extensive literature on presurgical electroencephalography (EEG) and magnetic resonance imaging (MRI) abnormalities to prognosticate seizure freedom following ATL, the value of quantitative analysis of visually reviewed normal interictal EEG in such prognostication remains unclear. In this retrospective multicenter study, we investigate whether machine learning analysis of normal interictal scalp EEG studies can inform the prediction of postoperative seizure freedom outcomes in patients who have undergone ATL. METHODS: We analyzed normal presurgical scalp EEG recordings from 41 Mayo Clinic (MC) and 23 Cleveland Clinic (CC) patients. We used an unbiased automated algorithm to extract eyes closed awake epochs from scalp EEG studies that were free of any epileptiform activity and then extracted spectral EEG features representing (a) spectral power and (b) interhemispheric spectral coherence in frequencies between 1 and 25 Hz across several brain regions. We analyzed the differences between the seizure-free and non-seizure-free patients and employed a Naïve Bayes classifier using multiple spectral features to predict surgery outcomes. We trained the classifier using a leave-one-patient-out cross-validation scheme within the MC data set and then tested using the out-of-sample CC data set. Finally, we compared the predictive performance of normal scalp EEG-derived features against MRI abnormalities. RESULTS: We found that several spectral power and coherence features showed significant differences correlated with surgical outcomes and that they were most pronounced in the 10-25 Hz range. The Naïve Bayes classification based on those features predicted 1-year seizure freedom following ATL with area under the curve (AUC) values of 0.78 and 0.76 for the MC and CC data sets, respectively. Subsequent analyses revealed that (a) interhemispheric spectral coherence features in the 10-25 Hz range provided better predictability than other combinations and (b) normal scalp EEG-derived features provided superior and potentially distinct predictive value when compared with MRI abnormalities (>10% higher F1 score). SIGNIFICANCE: These results support that quantitative analysis of even a normal presurgical scalp EEG may help prognosticate seizure freedom following ATL in patients with drug-resistant TLE. Although the mechanism for this result is not known, the scalp EEG spectral and coherence properties predicting seizure freedom may represent activity arising from the neocortex or the networks responsible for temporal lobe seizure generation within vs outside the margins of an ATL.
OBJECTIVE: Anterior temporal lobectomy (ATL) is a widely performed and successful intervention for drug-resistant temporal lobe epilepsy (TLE). However, up to one third of patients experience seizure recurrence within 1 year after ATL. Despite the extensive literature on presurgical electroencephalography (EEG) and magnetic resonance imaging (MRI) abnormalities to prognosticate seizure freedom following ATL, the value of quantitative analysis of visually reviewed normal interictal EEG in such prognostication remains unclear. In this retrospective multicenter study, we investigate whether machine learning analysis of normal interictal scalp EEG studies can inform the prediction of postoperative seizure freedom outcomes in patients who have undergone ATL. METHODS: We analyzed normal presurgical scalp EEG recordings from 41 Mayo Clinic (MC) and 23 Cleveland Clinic (CC) patients. We used an unbiased automated algorithm to extract eyes closed awake epochs from scalp EEG studies that were free of any epileptiform activity and then extracted spectral EEG features representing (a) spectral power and (b) interhemispheric spectral coherence in frequencies between 1 and 25 Hz across several brain regions. We analyzed the differences between the seizure-free and non-seizure-free patients and employed a Naïve Bayes classifier using multiple spectral features to predict surgery outcomes. We trained the classifier using a leave-one-patient-out cross-validation scheme within the MC data set and then tested using the out-of-sample CC data set. Finally, we compared the predictive performance of normal scalp EEG-derived features against MRI abnormalities. RESULTS: We found that several spectral power and coherence features showed significant differences correlated with surgical outcomes and that they were most pronounced in the 10-25 Hz range. The Naïve Bayes classification based on those features predicted 1-year seizure freedom following ATL with area under the curve (AUC) values of 0.78 and 0.76 for the MC and CC data sets, respectively. Subsequent analyses revealed that (a) interhemispheric spectral coherence features in the 10-25 Hz range provided better predictability than other combinations and (b) normal scalp EEG-derived features provided superior and potentially distinct predictive value when compared with MRI abnormalities (>10% higher F1 score). SIGNIFICANCE: These results support that quantitative analysis of even a normal presurgical scalp EEG may help prognosticate seizure freedom following ATL in patients with drug-resistant TLE. Although the mechanism for this result is not known, the scalp EEG spectral and coherence properties predicting seizure freedom may represent activity arising from the neocortex or the networks responsible for temporal lobe seizure generation within vs outside the margins of an ATL.
We performed a machine learning analysis to evaluate whether visually reviewed normal presurgical scalp electroencephalography (EEG) findings without any interictal abnormalities can predict the outcomes following an anterior temporal lobectomy.We achieved an area under the curve of 0.78 in predicting outcomes using a leave‐one‐out cross‐validation in a data set including 41 patients from the Mayo Clinic.Independent testing of the machine learning model on a data set including 23 patients from the Cleveland Clinic yielded an area under the curve of 0.76.Our results provide strong evidence that quantitative analysis of normal scalp EEG can help in the prognostication of anterior temporal lobectomy outcomes.
INTRODUCTION
Temporal lobe epilepsy (TLE) is a common type of drug‐resistant epilepsy, and anterior temporal lobectomy (ATL) is a widely performed and successful intervention with class 1 evidence of efficacy from randomized clinical trials.
,
Despite the recent advances in brain imaging, electroencephalography (EEG) monitoring, and surgical technologies, up to one third of patients do not achieve long‐term seizure freedom after ATL.
,
Therefore, the identification of prognostic factors that can help predict the likelihood of seizure freedom following ATL remains of great clinical interest. Several previous studies have reported that ictal EEG patterns, interictal EEG abnormalities, magnetic resonance imaging (MRI) structural abnormalities, and their relationship to the resected area are useful for prognosticating surgical outcomes following ATL.
,
However, the potential value of quantitative analysis of normal interictal scalp EEG, that is, in the absence of interictal epileptiform abnormalities or focal slowing, remains unknown for such prognostication. Because nearly 50% of routine EEG studies and 10% of prolonged video‐EEG studies in patients with known epilepsy do not contain any recognizable epileptiform activity,
,
the ability to predict ATL outcomes using normal scalp EEG findings could prove useful as a cost‐effective and expeditious additional feature for evaluating surgical candidacy and predicting outcomes.EEG is a cornerstone in the evaluation of drug‐resistant epilepsy. There is extensive literature on visual and quantitative approaches applied to scalp EEG data for identifying interictal epileptiform activity, seizures, and propagation patterns with the aim of mapping the epileptogenic zone and predicting outcomes.
,
,
,
,
Similarly, there is a large body of literature on visual and quantitative methods applied to invasive intracranial EEG data to answer the same research questions.
,
,
,
Spectral power and coherence measures are among the common quantitative measures studied using both scalp and intracranial EEG.
,
,
,
,
,
,
More recently, the investigation of high‐frequency oscillations recorded on scalp EEG
,
,
and interictal spikes from combined scalp and intracranial EEG
have received significant interest in the study of outcome prognostication. However, the evaluation of interictal EEG without epileptiform activity or slowing, that is, visually classified normal EEG by an epileptologist, has received less attention.Multiple studies have reported that quantitative analysis of EEG segments without interictal epileptiform abnormalities can reveal subtle nonspecific abnormalities in the normal brain function of patients with epilepsy.
,
,
,
,
,
,
,
,
These electrophysiological changes are hypothesized to arise from epilepsy‐related pathologic changes such as neuronal loss, gliosis, and synaptic and network changes. They may also reflect interictal dysfunction in memory and cognition because the same neuronal‐glial circuits underlying seizure generation in TLE may subserve normal brain functions.
,
The EEG biomarkers of normal brain function investigated in previous studies include alpha activity during eyes‐closed wakefulness,
,
,
,
spectral connectivity measures based on phase‐locking factor
,
,
and weighted partial directed coherence,
and EEG‐based local and global synchrony measures.
Furthermore, several of those studies showed that the changes in EEG biomarkers of normal brain function can help determine, a) drug resistance,
,
,
b) focal vs generalized origin of seizures,
,
and c) seizure‐generating hemisphere in focal epilepsy.
Building on those findings, we hypothesize that biomarkers of normal EEG can help predict the likelihood of seizure recurrence following ATL.To test that hypothesis, we analyzed presurgical scalp EEG studies of 41 Mayo Clinic (MC) and 23 Cleveland Clinic (CC) patients who underwent ATL for drug‐resistant TLE. We used an unbiased automated tool to select eyes‐closed awake epochs in scalp EEG recordings determined by epileptologists to be free of any epileptiform activity or other abnormalities and calculated spectral power and interhemispheric spectral coherence measures between 1 and 25 Hz at several brain regions covered by the extended 10–20 scalp EEG montage. We then analyzed the group statistical differences within those measures based on at least 1 year of follow‐up for seizure freedom and used a machine learning analysis to quantify the predictability of surgery outcomes. Furthermore, we compared the predictability of normal scalp EEG‐derived features with MRI abnormalities to evaluate whether there is a distinct, additional predictive value in normal scalp EEG.
MATERIALS AND METHODS
Study participants
In this retrospective cohort study, we identified 64 patients who underwent ATL for drug‐resistant epilepsy at the Mayo Clinic (41 patients) and Cleveland Clinic (23 patients). We used the inclusion criterion that either their routine outpatient EEG or the first day of prolonged video‐EEG study was interictally normal (i.e., free of interictal epileptiform abnormalities, focal slowing, or excessive fast activity). We used this inclusion criterion to ensure that the EEGs we later analyzed did not contain any interictal abnormalities. In the case of prolonged video‐EEG studies, we did not place any restrictions beyond the first day. All patients except one had their habitual seizures recorded during the video‐EEG study. Although this is a select cohort, it reflects the purpose of our study to determine if quantitative analysis of visually reviewed normal scalp EEG can predict ATL outcomes. Our study was approved by the institutional review boards of the respective institutions.
Presurgical evaluations
All patients underwent comprehensive evaluations for epilepsy surgery that included (a) neurological history and examination, (b) routine scalp EEG, (c) MRI, and (d) prolonged video‐EEG to record their seizures. All patients had abnormal video‐EEG results, including recorded seizures prior to ATL. In addition, some patients underwent positron emission tomography (PET), single‐photon emission computerized tomography (SPECT), magneto‐electroencephalography (MEG), and intracranial EEG (iEEG) monitoring as indicated by the clinical evaluation. (See Table S1 for detailed results of the clinical evaluation.)
Surgery and outcomes
All patients underwent standard ATL, which included anterior temporal neocortex and amygdala‐hippocampal resection.
,
,
To binarize seizure outcomes following ATL, we used Engel IA‐IB or International League Against Epilepsy (ILAE) class 1–2 outcomes during the first year following ATL as the threshold to determine seizure‐free outcomes. In the above‐identified cohort, 30 MC and 16 CC patients experienced seizure‐free outcomes for at least 1 year following ATL.
EEG acquisition
The EEG studies at MC were recorded using the XLTEK EMU40EX headbox (from Natus Medical Incorporated) using 31 channels according to the extended 10–20 localization system
at a sampling rate of 256 Hz. The EEG studies at CC were recorded using the Nihon Kohden JE‐921A, JE‐120A, and JE‐208A headboxes (Nihon Kohden Corporation) using the same extended 10–20 localization system at a sampling rate of 200 Hz.
Data curation and preprocessing
We analyzed exactly one EEG per participant, which included 42 routine (32 MC and 10 CC) and 22 prolonged epilepsy monitoring unit (EMU; 9 MC and 13 CC) studies. To be consistent with the effect of antiseizure medications (ASMs), we discarded the EEG recorded beyond the first day of prolonged EMU studies when ASMs are tapered. All EEG preprocessing was performed using the MNE library in the Python programming language.
The EEG recordings were bandpass‐filtered within 0.5–30 Hz, and artifact components reflecting electrocardiography and eye movements were removed using the signal subspace projection method implemented in the MNE library.
Eyes‐closed epoch selection
We then selected multiple 10 s‐long eyes‐closed awake (EC) epochs (maximum of six epochs per participant) using an automated algorithm (58 participants had 6 EC epochs, and 6 participants had <6 epochs). The automated algorithm included the following steps: (a) divide the preprocessed EEG recording into epochs of 10 s; (b) automatically score sleep stages using a previously published algorithm
; (c) select epochs that do not contain eye blinks from the epochs scored as “awake” by the sleep staging algorithm
; (d) rank those eyes‐closed epochs in order of spectral power in the alpha frequency range (8–12 Hz) in posterior channels (O1 and O2); and (e) select the first six epochs (fewer if six epochs were not available) with the highest posterior alpha power. This approach was able to successfully extract six EC epochs from 58 of the 64 EEG records and two to six EC epochs from the rest of the EEG records.
Feature extraction
For each epoch, we estimated the power spectral density (PSD, in decibels) at frequencies between 1 and 25 Hz (25 integer frequencies) using a Welch fast‐Fourier transform approach
across all EEG channels. Those PSD values were then aggregated among the channels located in prefrontal (Fp1 and Fp2), frontal (F3, F7 and F4, F8), temporal (T7 and T8), central (C3 and C4), parietal (P3, P7 and P4, P8), and occipital (O1 and O2) regions to estimate the PSD values for those respective regions. Note that the PSD values in the left and right hemispheres were averaged to produce a single PSD value per region. Hence, this process yielded N × 6 × 25 features per patient (6 regions and 24 frequencies), where N is the number of epochs. Similarly, we also estimated the interhemispheric spectral‐coherence density (ISD) for each region (magnitude squared spectral coherence
between the channels in the left and right hemispheres of the respective region) at the same frequencies, which yielded another N × 6 × 25 features per patient.
Feature normalization
Prior to any statistical or machine learning analysis, PSD features were normalized to eliminate any systematic differences in data acquisition. We calculated a grand average of the total spectral power between 1 and 25 Hz (among all channels and participants) for the two data sets and divided the individual spectrums using that average. We performed this normalization separately for the two data sets because of the differences in the range of absolute spectral power values observed. We did not normalize ISD features, as they were already normalized within the 0–1 range.
Statistical analyses
We first analyzed the statistical differences within the PSD and ISD features between seizure‐free and non–seizure‐free groups. To do so, we aggregated the PSD and ISD values within delta (1–4 Hz), theta (4–8 Hz), alpha1 (8–10 Hz), alpha2 (10–13 Hz), and beta (13–25 Hz) bands. We then calculated Pearson correlations between the band‐limited PSD and ISD values and surgery outcomes and performed a Mann‐Whitney‐Wilcoxon test between those features conditioned on the surgery outcomes with Bonferroni correction for multiple comparisons.
Classification framework
We then employed a Naïve Bayes (NB) classification approach using both the PSD and ISD features to predict surgery outcomes. We chose to utilize the NB classifier for this task because of the relatively small sample size. This approach was evaluated using a leave‐one‐patient‐out cross‐validation strategy within the MC data set and tested using the out‐of‐sample CC data set. We assigned labels of “1” and “0” to seizure‐free and non–seizure‐free outcomes, respectively. During model training, we treated each epoch from a patient's EEG as a separate sample with the same outcome label as the patient and developed a classifier that predicted the probability of seizure freedom for an unseen epoch in the testing set. Note that the epochs of a single patient were never shared between the training and the testing sets during each cross‐validation. We calculated the class probabilities in this fashion for all epochs in the data set and compared them with the ground truth labels to plot the receiver‐operating characteristic (ROC) curve for the Mayo data set. In addition, we utilized the trained model during each cross‐validation and predicted the class probabilities for the CC data set. We then averaged the predicted probabilities across all cross‐validations to calculate the final probabilities, which were used to plot the ROC curve for the CCF data set. Furthermore, the predicted probabilities of all the epochs of a single patient were averaged to produce the probability for that patient. Using the patient‐level probabilities, we also plotted patient‐level ROC curves to ensure that the window‐level classification translated well to individual patients. Apart from the area under the ROC curve (AUC) metric, we calculated other metrics such as precision, recall, and F1 score as the weighted averages of the respective individual class‐specific metrics at an operating point on the ROC curve that provided the best sensitivity and specificity.
This process is illustrated in Figure 1.
FIGURE 1
Classification of ATL outcomes using power spectral and interhemispheric coherence features extracted from presurgical routine EEG studies. We employed a Naïve Bayes classification approach using a leave‐one‐patient‐out cross‐validation within the MC data set and out‐of‐sample testing of the CC data set. ATL, Anterior temporal lobectomy; CC, Cleveland Clinic; EEG, electroencephalography; MC, Mayo Clinic
Classification of ATL outcomes using power spectral and interhemispheric coherence features extracted from presurgical routine EEG studies. We employed a Naïve Bayes classification approach using a leave‐one‐patient‐out cross‐validation within the MC data set and out‐of‐sample testing of the CC data set. ATL, Anterior temporal lobectomy; CC, Cleveland Clinic; EEG, electroencephalography; MC, Mayo Clinic
RESULTS
Table 1 describes the participants of this study, which included 41 TLE patients from MC and 23 TLE patients from CC. All participants were 18 years or older, and 30 MC and 16 CC patients were seizure‐free for at least 1 year following ATL. Twenty‐seven MC patients (65.9%) and 15 CC patients (65.2%) showed abnormalities in MRI during presurgical evaluation, and 11 MC patients (26.8%) and 12 CC patients (52.2%) were determined to have mesial temporal sclerosis (MTS) on pathology assessment of resected tissue. Among those with abnormal MRI, 20 MC patients (74.1%) and 8 CC patients (73.3%) were seizure‐free for 1 year following ATL. Similarly, among those with MTS, 10 MC patients (90.9%) and 9 CC patients (75%) were seizure‐free for 1 year following ATL. Additional details regarding each individual participant can be found in Table S1.
TABLE 1
Participants of our study and their classifications based on 1‐year outcomes of anterior temporal lobectomy, sex, age, seizure focus lateralization, general MRI abnormalities, and mesial temporal sclerosis
Participants of our study and their classifications based on 1‐year outcomes of anterior temporal lobectomy, sex, age, seizure focus lateralization, general MRI abnormalities, and mesial temporal sclerosisAbbreviations: ATL, anterior temporal lobectomy; EEG, electroencephalography; MRI, magnetic resonance imaging; TLE, temporal lobe epilepsy.
Comparison of spectral features
Figure 2 illustrates the average spectral differences between MC and CC patients who experienced 1‐year seizure freedom following ATL and those who did not. Here we analyzed normalized regional power spectral and interhemispheric spectral coherence densities at every integer frequency between 1 and 25 Hz (as described in Methods). Figure 2A,B suggest that the PSD values of those who were seizure‐free are, on average, higher than those who were not. Similarly, Figure 2C,D indicates that the ISD values of those who were seizure‐free are higher in frontal, temporal, parietal, and occipital regions in frequencies above 10 Hz. The lower frequency behavior (i.e., 1–10 Hz), however, is more complex and does not appear consistent across both cohorts.
FIGURE 2
Power spectral (A‐B) and spectral coherence‐based (C‐D) differences between MC and CC patients based on seizure‐free outcomes following ATL. We estimated power spectral and spectral coherence densities using the Welch FFT approach at every integer frequency between 1 and 25 Hz. We then aggregated those densities among the channels located in prefrontal (Fp1 and Fp2), frontal (F3, F7 and F4, F8), temporal (T7, T9 and T8, T10), central (C3 and C4), parietal (P3, P7 and P4, P8), and occipital (O1 and O2) regions to estimate regional power spectral and regional interhemispheric spectral coherence densities. The power spectral densities were then normalized across the two populations within 1–25 Hz to harmonize the two data sets. Here solid lines indicate average values and shaded areas indicate 95% confidence intervals. ATL, Anterior temporal lobectomy; CC, Cleveland Clinic; FFT, Fast Fourier Transform; MC, Mayo Clinic
Power spectral (A‐B) and spectral coherence‐based (C‐D) differences between MC and CC patients based on seizure‐free outcomes following ATL. We estimated power spectral and spectral coherence densities using the Welch FFT approach at every integer frequency between 1 and 25 Hz. We then aggregated those densities among the channels located in prefrontal (Fp1 and Fp2), frontal (F3, F7 and F4, F8), temporal (T7, T9 and T8, T10), central (C3 and C4), parietal (P3, P7 and P4, P8), and occipital (O1 and O2) regions to estimate regional power spectral and regional interhemispheric spectral coherence densities. The power spectral densities were then normalized across the two populations within 1–25 Hz to harmonize the two data sets. Here solid lines indicate average values and shaded areas indicate 95% confidence intervals. ATL, Anterior temporal lobectomy; CC, Cleveland Clinic; FFT, Fast Fourier Transform; MC, Mayo ClinicTo further quantify the differences, we aggregated the PSD and ISD values within delta (1–4 Hz), theta (4–8 Hz), alpha1 (8–10 Hz), alpha2 (10–13 Hz), and beta (13–25 Hz) bands. We then (a) calculated the Pearson correlations between the band‐limited PSD and ISD values and surgery outcomes and (b) performed a Mann‐Whitney‐Wilcoxon test between the features conditioned on the surgery outcomes with the application of Bonferroni correction for multiple comparisons. The results of these analyses are shown in Figure 3. Those results indicate that both MC and CC data sets show significant differences between patients who experienced seizure freedom following ATL and those who did not. However, we note that the commonalities between the two data sets (MC and CC) are complex, although we observed somewhat consistent patterns in ISD features above 10 Hz across several brain regions (prefrontal, frontal, parietal, and occipital).
FIGURE 3
Statistical differences of power spectral density and interhemispheric spectral coherence density features between patients who experienced seizure freedom following anterior temporal lobectomy and those who did not. Those features were aggregated among six brain regions and five frequency bands prior to analyses. We then calculated Pearson correlations and performed Mann‐Whitney‐Wilcoxon tests between the two sets of features conditioned on the surgery outcomes with the application of Bonferroni correction for multiple comparisons. For Pearson correlation, blue entries indicate positive correlations (>0) with darker colors indicating stronger correlations. For significance tests, dark blue entries indicate statistically significant differences (p < .05), and white entries indicate nonsignificant differences (p > .05)
Statistical differences of power spectral density and interhemispheric spectral coherence density features between patients who experienced seizure freedom following anterior temporal lobectomy and those who did not. Those features were aggregated among six brain regions and five frequency bands prior to analyses. We then calculated Pearson correlations and performed Mann‐Whitney‐Wilcoxon tests between the two sets of features conditioned on the surgery outcomes with the application of Bonferroni correction for multiple comparisons. For Pearson correlation, blue entries indicate positive correlations (>0) with darker colors indicating stronger correlations. For significance tests, dark blue entries indicate statistically significant differences (p < .05), and white entries indicate nonsignificant differences (p > .05)
Classification of seizure freedom following ATL using normal interictal scalp EEG
We performed several classification experiments between the patients who experienced 1‐year seizure freedom following ATL and those who did not, using the framework described in Methods. The results of those experiments are shown in Table 2. First, we compared the predictability using all frequencies full bandwidth (1–25 Hz), low frequencies (1–9 Hz), and high frequencies (10–25 Hz), using the PSD and ISD values from all six brain regions (the first three rows of Table 2). Our results suggest that using all the frequencies (1–25 Hz) and high frequencies (10–25 Hz) provide relatively better classification performances between MC and CC data sets compared to lower frequencies (1–9 Hz). Second, we compared the contributions of PSD and ISD features within the low‐ and high‐frequency range (rows 4–7). Our results indicate that ISD features, particularly in the 10–25 Hz frequency range, provide balanced predictability between MC and CC, compared to all other combinations. Overall, all the experiments provided classification performances substantially better than chance‐level (>0.5), and ISD features in the 10–25 Hz range contributed the most to the classification performance. Figure 4A,B shows the ROC curves for MC and CC data sets at the window and patient levels for the classification using ISD features in the 10–25 Hz range across all six brain regions (corresponding to row 7 of Table 2). Figure 4C shows the confusion matrices for MC and CC data sets (at patient level) derived at an operating point that provided the best sensitivity and specificity for the respective ROC curve. The confusion matrices indicate that the proposed approach is very successful in identifying those who may experience seizure recurrence within the first year (top‐left cell in each matrix).
TABLE 2
Results of classification experiments at the patient level
Spectral power
Spectral coherence
Mayo Clinic
Cleveland Clinic (out of sample)
Regions
Frequencies
Regions
Frequencies
AUC
Precision
Recall
F1
AUC
Precision
Recall
F1
All
1–25 Hz
All
1–25 Hz
0.69
0.8
0.63
0.65
0.86
0.88
0.87
0.87
All
1–9 Hz
All
1–9 Hz
0.67
0.76
0.73
0.74
0.63
0.75
0.61
0.62
All
10–25 Hz
All
10–25 Hz
0.71
0.8
0.73
0.75
0.85
0.88
0.87
0.87
All
1–9 Hz
None
None
0.57
0.72
0.73
0.73
0.62
0.72
0.74
0.73
All
10–25 Hz
None
None
0.5
0.66
0.66
0.66
0.75
0.79
0.78
0.79
None
None
All
1–9 Hz
0.72
0.77
0.76
0.76
0.54
0.66
0.61
0.62
None
None
All
10–25 Hz
0.78
0.82
0.68
0.7
0.76
0.79
0.78
0.79
A Naïve Bayes classifier was trained using the MC data set in a leave‐one‐patient‐out cross‐validation setting and tested on the out‐of‐sample CC data set. The area under curve metric was calculated by plotting the ROC curve, and other metrics were calculated at an operating point on the ROC curve that provided the best sensitivity and specificity.
Abbreviations: CC, Cleveland Clinic; MC, Mayo Clinic; ROC, receiver‐operating characteristic.
FIGURE 4
Epoch‐level (A) and patient‐level (B) receiver‐operating characteristic (ROC) curves for MC and CC data sets obtained using a Naïve Bayes classifier trained with features including interhemispheric spectral coherence densities of all six brain regions within 10–25 Hz. The classifier was trained using the MC data set in a leave‐one‐patient‐out cross‐validation (LOOCV) setting and tested on the out‐of‐sample (OOS) CC data set. Note that the classifier was trained using features extracted from epochs, and then the predicted epoch‐level probabilities were averaged to produce patient‐level probabilities. Those probabilities were then used to plot ROC curves. (c) Confusion matrices for the MC and CC data sets (at patient level) were obtained using a threshold on the respective ROC curve that provided the best sensitivity and specificity. The cross‐diagonal terms of the confusion matrices show correctly predicted outcomes. CC, Cleveland Clinic; MC, Mayo Clinic; ROC, receiver‐operating characteristic
Results of classification experiments at the patient levelA Naïve Bayes classifier was trained using the MC data set in a leave‐one‐patient‐out cross‐validation setting and tested on the out‐of‐sample CC data set. The area under curve metric was calculated by plotting the ROC curve, and other metrics were calculated at an operating point on the ROC curve that provided the best sensitivity and specificity.Abbreviations: CC, Cleveland Clinic; MC, Mayo Clinic; ROC, receiver‐operating characteristic.Epoch‐level (A) and patient‐level (B) receiver‐operating characteristic (ROC) curves for MC and CC data sets obtained using a Naïve Bayes classifier trained with features including interhemispheric spectral coherence densities of all six brain regions within 10–25 Hz. The classifier was trained using the MC data set in a leave‐one‐patient‐out cross‐validation (LOOCV) setting and tested on the out‐of‐sample (OOS) CC data set. Note that the classifier was trained using features extracted from epochs, and then the predicted epoch‐level probabilities were averaged to produce patient‐level probabilities. Those probabilities were then used to plot ROC curves. (c) Confusion matrices for the MC and CC data sets (at patient level) were obtained using a threshold on the respective ROC curve that provided the best sensitivity and specificity. The cross‐diagonal terms of the confusion matrices show correctly predicted outcomes. CC, Cleveland Clinic; MC, Mayo Clinic; ROC, receiver‐operating characteristic
Comparison between normal scalp EEG and MRI abnormalities
Table 3 illustrates a comparison between features derived from normal interictal scalp‐EEG segments and MRI abnormalities in predicting ATL outcomes at 1 year. Specifically, we compared the predictability of previously described scalp‐EEG–derived ISD features in the 10–25 Hz frequency range (row 1) against (a) abnormal MRI findings (row 2) and (b) the presence of mesial temporal sclerosis (row 3). Our findings indicate that the features extracted from normal interictal scalp EEG segments provide superior predictability when compared with MRI abnormalities in both the MC and CC cohorts.
TABLE 3
Comparison between the predictability of ATL outcomes using PSD and ISD features derived from normal scalp EEG segments and MRI abnormalities
Comparison between the predictability of ATL outcomes using PSD and ISD features derived from normal scalp EEG segments and MRI abnormalitiesAbbreviations: ATL, anterior temporal lobectomy; ISD, interhemispheric spectral‐coherence density; PSD, power spectral density.
DISCUSSION
Main contribution of the study
The ability to predict surgical outcomes of anterior temporal lobectomy is of great clinical and research interest. Previous studies have reported the impact of ictal and interictal EEG abnormalities and structural MRI abnormalities in prognosticating ATL outcomes.
,
However, to our knowledge, whether visually interpreted normal interictal EEG offers any prognostic value has not been considered previously. Here we analyzed multiple EEG features such as spectral power and interhemispheric spectral coherence within normal interictal EEG segments prior to surgery and demonstrated that there are substantial differences within those features between the TLE patients who experienced 1‐year seizure freedom following ATL and those who did not achieve seizure‐free outcomes. Furthermore, a machine learning analysis revealed that those EEG features could predict 1‐year seizure freedom following ATL with AUCs greater than or equal to 0.75 in both the training and validation data set (MC) and in the out‐of‐sample testing data set from a different institution (CC). Further analyses showed that frequencies in the 10–25 Hz provided higher predictability compared to lower frequencies, particularly in ISD features. In addition, a comparison showed that the features extracted from normal scalp EEG provide superior predictability compared to MRI abnormalities in both the study cohorts. These findings provide strong evidence that quantitative analysis of normal interictal scalp EEG can help in the presurgical assessment of ATL candidates. Considering that routine scalp EEG is relatively inexpensive (compared to intracranial EEG and MRI), and that normal scalp EEG samples can be acquired in ambulatory settings, our findings can form the basis for novel and cost‐effective ways to help assess the surgical candidacy of TLE patients, and, potentially, to identify candidates for iEEG monitoring prior to surgery.
Interpretation of the results
A comparison of the spectral features, illustrated in Figure 2, indicates that spectral power and interhemispheric spectral coherence are significantly lower in several brain regions and frequencies for those who experienced seizure recurrence within 1 year of surgery. Although the exact mechanisms underlying this finding require further exploration, several factors could have contributed to this observation. Previous studies have reported that age at seizure onset, age at surgery, duration of epilepsy, and seizure frequency can influence outcomes of ATL, particularly in the long term.
,
,
,
A longer history and higher frequency of seizures may be associated with more severe chronic structural and functional abnormalities and can lead to progressive hippocampal injury and possibly the formation of secondary epileptogenic zones.
Furthermore, an association between the epilepsy duration and the bilateral decline of hippocampal volumes, brain glucose metabolism, and Wada hemispheric memory performance has been reported.
It is plausible that the spectral differences in visually interpreted normal interictal EEG segments represent an overall measure of epilepsy‐related injury to the circuits involved in normal brain function.Prior investigations using intracranial EEG show that the local synchrony is decreased surrounding the epileptogenic zone.
,
,
Because PSD is a measure of local synchrony and ISD a measure of interhemispheric network synchrony, the widespread decreased PSD and ISD in the patients with poor outcomes compared to those with good outcomes may reflect more severe and widespread epilepsy with more loss of local and network neuronal synchrony. On the other hand, the scalp EEG findings associated with good outcomes are those that are driven by anterior and/or mesial temporal lobe epileptogenic zone, which would be removed or disconnected by a standard ATL. Furthermore, we hypothesize that reduced scalp EEG power in the higher frequency range reflects abnormalities in the neocortex as opposed to deeper brain structures. In summary, we can speculate that changes in limbic networks associated with mesial temporal lobe epilepsy may be distinguishable with quantitative EEG measures used here and would characterize the patients most likely to have seizure‐free outcomes after ATL surgery.
Contribution of antiseizure medications
In addition, we considered whether the changes in EEG spectral power and coherence could be related to ASMs, particularly benzodiazepines.
However, as detailed in Table S1, only 2 MC patients (out of 41) and 3 CC patients (out of 23) were taking benzodiazepines at the time of EEG. Because the number of patients taking ASMs is relatively small compared to the total size of the cohort, we conclude that ASMs, and in particular benzodiazepines, cannot explain the population differences we observed between the seizure‐free and non–seizure‐free groups.
Predictive value of normal scalp EEG when compared with MRI abnormalities
The presence of a distinct anatomical lesion identified by MRI is now arguably the most reliable predictor of seizure freedom following ATL.
However, our results indicate that normal scalp EEG–derived features provide superior and potentially distinct predictability of ATL outcomes compared to MRI abnormalities in the two cohorts we studied. Although this finding needs further validation in a larger population of patients, if validated, this finding could improve the current state of presurgical evaluations, particularly for those with normal MRI studies.
Observations from a multicenter study
Replicating the results of machine learning–based analyses among multiple data sets, particularly when the data sets include physiological measurements such as EEG, is important to evaluate the potential for clinical translation. Our classification results were obtained by training a machine learning model using the EEG data acquired at MC and independently testing that model on out‐of‐sample EEG data acquired at CC. Although the ability to demonstrate similar predictability between the two data sets strengthens our findings, there were noticeable differences between the two data sets. First, the ranges of absolute spectral power values significantly differed between the two data sets. This observation is most likely a result of the different EEG acquisition systems used at the two institutions. Therefore, separate normalizations of the two data sets were necessary to evaluate them under the same machine learning framework. We normalized each data set using a grand average of the total spectral power between 1 and 25 Hz, calculated separately for each data set. However, we anticipate that control EEG data sets acquired using the respective acquisition conditions of the data sets will benefit future evaluations, including multicenter data sets. Second, even after normalization, we noticed that the group differences within the spectral features suggested inconsistent relationships in low frequencies (<10 Hz), particularly close to 1 Hz (Figure 2). This observation might result from hardware filter settings of the respective acquisition systems and needs further exploration. However, both the data sets showed similar relationships in frequencies above 10 Hz. This effect is also evident in the classification results; we obtained relatively worse classification results when low‐frequency features were used for classification, compared to higher frequencies (see Table 2). Nonetheless, we obtained better and more consistent results when features of frequencies above 10 Hz were utilized.
Limitations and future work
Replicating our findings on large multicenter data sets can help clarify the clinical value and potential pitfalls. A larger population‐based data set will elucidate the independent predictive value of our finding when controlled for the effects of confounders such as age and medications.
In addition, the development of a clinically useful individualized EEG‐based ATL outcome prognostication tool will require EEG recordings of healthy controls using the EEG acquisition systems at the respective medical centers. Our future efforts will focus on acquiring control EEG studies from multiple EEG acquisition systems for data normalization to ensure clinical translation.Although our study utilized 1‐year seizure freedom as a measure of outcomes, several studies have reported that the success rate of ATL keeps diminishing with longer follow‐up periods.
In the future, we will investigate whether the same methodology can predict long‐term outcomes beyond 1 year. We believe that our methodology can help quantify the changes in normal brain function following ATL (by comparing pre‐ and postoperative scalp EEG recordings) and hence may be useful in determining long‐term outcomes even if the patient is seizure‐free in the short term.Furthermore, our study included awake eyes‐closed EEG segments extracted from routine or the first day of prolonged EEG studies while the ASM effects were still present. However, it is possible that the pathological features in EEG are more identifiable during the time when ASMs are withdrawn or during other behavioral states, including slow‐wave sleep. Therefore, in the future, we plan to analyze the predictability of ATL outcomes using EEG segments extracted from different stages of prolonged studies accounting for medication variability and behavioral states, including awake and sleep.
CONCLUSION
Our results support that quantitative evaluation of interictal scalp EEG visually interpreted as normal by experienced epileptologists may help prognosticate seizure freedom after ATL in patients with drug‐resistant TLE. Specifically, spectral power and interhemispheric coherence features extracted from normal scalp EEG segments during eyes‐closed wakefulness showed significant differences between patients who became seizure‐free and those who did not. Future studies focused on (a) larger and prospective patient cohorts, (b) effect of behavioral state, and (c) confounders such as age and medication effects can clarify the potential of using scalp EEG recordings to prognosticate surgical outcomes of ATL.
CONFLICT OF INTEREST
Dr. Worrell has licensed intellectual property to NeuroOne Inc. and Cadence Neuroscience Inc. Dr. Brinkmann reports that he has licensed intellectual property to Cadence Neuroscience Inc. Dr. Cendes reports personal fees from UCB Pharma, grants from Sao Paulo Research Foundation (FAPESP), and grants from CNPq (Conselho Nacional de Desenvolvimento Científico e Tecnológico), outside the submitted work; and is a member of the editorial boards of the following journals: Neurology, Epilepsy Research, Epilepsia (Associate Editor), and Frontiers in Neurology (Specialty Chief Editor in Epilepsy). All other authors have nothing to disclose. We confirm that we have read the Journal's position on issues involved in ethical publication and affirm that this report is consistent with those guidelines.Supplementary MaterialClick here for additional data file.
Authors: H Jokeit; A Ebner; S Arnold; M Schüller; C Antke; Y Huang; H Steinmetz; R J Seitz; O W Witte Journal: J Neurol Date: 1999-10 Impact factor: 4.849
Authors: Zhengxiang Cai; Abbas Sohrabpour; Haiteng Jiang; Shuai Ye; Boney Joseph; Benjamin H Brinkmann; Gregory A Worrell; Bin He Journal: Proc Natl Acad Sci U S A Date: 2021-04-27 Impact factor: 11.205