| Literature DB >> 31899530 |
Andrey Zhdanov1,2, Sravya Atluri3,4, Willy Wong5, Yasaman Vaghei1,2, Zafiris J Daskalakis3,6,7, Daniel M Blumberger3,6,7, Benicio N Frey8,9, Peter Giacobbe6,10, Raymond W Lam11, Roumen Milev12, Daniel J Mueller3,6, Gustavo Turecki13, Sagar V Parikh14, Susan Rotzinger6,10,15, Claudio N Soares15,16, Colleen A Brenner17, Fidel Vila-Rodriguez11, Mary Pat McAndrews18, Killian Kleffner11, Esther Alonso-Prieto11, Stephen R Arnott19, Jane A Foster8,20, Stephen C Strother19,21, Rudolf Uher22,23, Sidney H Kennedy6,7,10,15,18,20, Faranak Farzan1,2,3,6,7.
Abstract
Importance: Social and economic costs of depression are exacerbated by prolonged periods spent identifying treatments that would be effective for a particular patient. Thus, a tool that reliably predicts an individual patient's response to treatment could significantly reduce the burden of depression. Objective: To estimate how accurately an outcome of escitalopram treatment can be predicted from electroencephalographic (EEG) data on patients with depression. Design, Setting, and Participants: This prognostic study used a support vector machine classifier to predict treatment outcome using data from the first Canadian Biomarker Integration Network in Depression (CAN-BIND-1) study. The CAN-BIND-1 study comprised 180 patients (aged 18-60 years) diagnosed with major depressive disorder who had completed 8 weeks of treatment. Of this group, 122 patients had EEG data recorded before the treatment; 115 also had EEG data recorded after the first 2 weeks of treatment. Interventions: All participants completed 8 weeks of open-label escitalopram (10-20 mg) treatment. Main Outcomes and Measures: The ability of EEG data to predict treatment outcome, measured as accuracy, specificity, and sensitivity of the classifier at baseline and after the first 2 weeks of treatment. The treatment outcome was defined in terms of change in symptom severity, measured by the Montgomery-Åsberg Depression Rating Scale, before and after 8 weeks of treatment. A patient was designated as a responder if the Montgomery-Åsberg Depression Rating Scale score decreased by at least 50% during the 8 weeks and as a nonresponder if the score decrease was less than 50%.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31899530 PMCID: PMC6991244 DOI: 10.1001/jamanetworkopen.2019.18377
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Demographic and Clinical Data for Study Participants
| Clinical or Demographic Variable | EEG Recording Site | All (N = 122) | All Responders at Week 8 (n = 55) | All Nonresponders at Week 8 (n = 67) | |||
|---|---|---|---|---|---|---|---|
| UBC (n = 52) | TGH (n = 45) | QNS (n = 18) | CAM (n = 7) | ||||
| Age, mean (SD), y | 35.4 (11.5) | 35.7 (12.6) | 42.7 (14.0) | 30.4 (12.1) | 36.3 (12.7) | 36.0 (12.7) | 36.6 (12.6) |
| Sex, No. | |||||||
| Male | 19 | 18 | 9 | 0 | 46 | 19 | 27 |
| Female | 33 | 27 | 9 | 7 | 76 | 36 | 40 |
| MADRS score, mean (SD) | |||||||
| Baseline | 28.5 (5.8) | 32.3 (5.6) | 30.0 (4.6) | 28.0 (4.8) | 30.1 (5.8) | 29.5 (5.8) | 30.5 (5.8) |
| Week 2 | 21.9 (7.3) | 25.1 (10.1) | 23.1 (5.2) | 21.1 (3.7) | 23.2 (8.5) | 20.1 (8.4) | 25.8 (7.8) |
| Week 8 | 14.6 (9.2) | 19.1 (12.0) | 18.1 (9.9) | 15.7 (5.6) | 16.8 (10.5) | 7.9 (5.0) | 24.2 (7.7) |
| Decrease in MADRS score, mean (SD) | |||||||
| Baseline to week 8 | 13.9 (9.2) | 13.2 (11.8) | 11.9 (9.9) | 12.3 (4.1) | 13.2 (10.2) | 21.7 (6.7) | 6.3 (6.8) |
| Baseline to week 8 compared with baseline, % | 48.7 (32.7) | 41.1 (3.8) | 39.2 (34.9) | 44.5 (15.7) | 44.3 (32.6) | 73.3 (16.0) | 20.4 (21.6) |
| Responders as assessed at week 8, No. (%) | 24 (46.2) | 21 (46.7) | 7 (38.9) | 3 (42.9) | 55 (45.1) | NA | NA |
| Nonresponders as assessed at week 8, No. (%) | 28 (53.8) | 24 (53.3) | 11 (61.1) | 4 (57.1) | 67 (54.9) | NA | NA |
Abbreviations: CAM, Centre for Addiction and Mental Health; EEG, electroencephalography; MADRS, Montgomery-Åsberg Depression Rating Scale; NA, not applicable; QNS, Queens University; TGH, Toronto General Hospital; UBC, University of British Columbia.
Figure 1. Number of Votes for Multiscale Entropy Features
The figure shows rankings for only 3 of 4 feature sources: baseline (A), week 2 (B), and early change (C). Combined feature source was the union of baseline and early change sources; therefore, computing the rankings for the combined features yielded results that were similar to the results for the baseline and early change sources. For that reason, feature rankings for combined feature source are not shown. The matrices in the top row display rankings for all the features; the topographic plots (left ear is left, nose is up) in the bottom row show distribution over the scalp of the rankings for the features corresponding to multiscale entropy scales 2 and 17 (marked by vertical green lines in the top row plots).
Figure 2. Number of Votes for Spectral Power Features
The figure shows rankings for only 3 of 4 feature sources: baseline (A), week 2 (B), and early change (C). Combined feature source was the union of baseline and early change sources; therefore, computing the rankings for the combined features yielded results that were similar to the results for the baseline and early change sources. For that reason, feature rankings for combined feature source are not shown. The matrices in the top row display rankings for all the features; the topographic plots (left ear is left, nose is up) in the bottom row show distribution over the scalp of the rankings for the features corresponding to some chosen frequency bands.
Number of Features and Prediction Accuracy for Different Vote Thresholds and Feature Sources
| Vote Threshold | Balanced Accuracy (No. of Features for a Particular Feature Source), % (No.) | |||
|---|---|---|---|---|
| Baseline | Week 2 | Early Change | Combined | |
| ≥50 | 78.6 (54) | 79.7 (75) | 68.4 (149) | 83.3 (185) |
| ≥60 | 79.2 (34) | 74.3 (50) | 68.4 (106) | 82.4 (127) |
| ≥70 | 77.3 (17) | 71.4 (28) | 74.2 (73) | 76.9 (82) |
| ≥80 | 73.6 (9) | 72.8 (14) | 77.1 (50) | 72.5 (52) |
| ≥90 | 68.6 (4) | 68.8 (5) | 70.7 (20) | 72.7 (21) |
Balanced accuracy is defined as the mean of sensitivity and specificity.
Because the combined feature source was the union of baseline and early change feature sources, the feature count for the combined source was similar to the sum of the counts for baseline and early change sources. The count was similar rather than equal owing to the noise introduced into the feature ranking procedure by randomly choosing the 80% of the data to which the 2-tailed, unpaired t test was applied at each of the 100 iterations (see the Feature Ranking subsection of the Methods section).
Cross-site Generalizability of the Treatment Outcome Prediction
| Feature Source | Balanced Prediction Accuracy Testing on Data From a Single Site | Balanced Prediction Accuracy | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| UBC | TGH | QNS | CAM | |||||||
| Accuracy, % | Individuals, No. | Accuracy, % | Individuals, No. | Accuracy, % | Individuals, No. | Accuracy, % | Individuals, No. | Accuracy, % | Individuals, No. | |
| Baseline | 62.0 | 52 | 71.0 | 45 | 82.7 | 18 | 74.9 | 7 | 79.2 | 122 |
| Week 2 | 74.5 | 51 | 74.7 | 40 | 79.2 | 18 | 72.0 | 6 | 74.3 | 115 |
| Early change | 69.9 | 85.0 | 77.4 | 78.6 | 68.4 | |||||
| Combined | 71.3 | 94.6 | 83.1 | 77.4 | 82.4 | |||||
Abbreviations: CAM, Centre for Addiction and Mental Health; QNS, Queens University; TGH, Toronto General Hospital; UBC, University of British Columbia.
Balanced prediction accuracy is defined as the mean of sensitivity and specificity.
Training was performed on data from the remaining sites.
Training and testing were performed on the data from all 4 sites using 10-fold cross-validation.