| Literature DB >> 32558996 |
Aleks Stolicyn1, Mathew A Harris1, Xueyi Shen1, Miruna C Barbu1, Mark J Adams1, Emma L Hawkins1, Laura de Nooij1, Hon Wah Yeung1, Alison D Murray2, Stephen M Lawrie1, J Douglas Steele3, Andrew M McIntosh1, Heather C Whalley1.
Abstract
Major depressive disorder (MDD) has been the subject of many neuroimaging case-control classification studies. Although some studies report accuracies ≥80%, most have investigated relatively small samples of clinically-ascertained, currently symptomatic cases, and did not attempt replication in larger samples. We here first aimed to replicate previously reported classification accuracies in a small, well-phenotyped community-based group of current MDD cases with clinical interview-based diagnoses (from STratifying Resilience and Depression Longitudinally cohort, 'STRADL'). We performed a set of exploratory predictive classification analyses with measures related to brain morphometry and white matter integrity. We applied three classifier types-SVM, penalised logistic regression or decision tree-either with or without optimisation, and with or without feature selection. We then determined whether similar accuracies could be replicated in a larger independent population-based sample with self-reported current depression (UK Biobank cohort). Additional analyses extended to lifetime MDD diagnoses-remitted MDD in STRADL, and lifetime-experienced MDD in UK Biobank. The highest cross-validation accuracy (75%) was achieved in the initial current MDD sample with a decision tree classifier and cortical surface area features. The most frequently selected decision tree split variables included surface areas of bilateral caudal anterior cingulate, left lingual gyrus, left superior frontal, right precentral and paracentral regions. High accuracy was not achieved in the larger samples with self-reported current depression (53.73%), with remitted MDD (57.48%), or with lifetime-experienced MDD (52.68-60.29%). Our results indicate that high predictive classification accuracies may not immediately translate to larger samples with broader criteria for depression, and may not be robust across different classification approaches.Entities:
Keywords: brain structure; classification; depression; diffusion MRI; machine learning; major depressive disorder; structural MRI
Mesh:
Year: 2020 PMID: 32558996 PMCID: PMC7469862 DOI: 10.1002/hbm.25095
Source DB: PubMed Journal: Hum Brain Mapp ISSN: 1065-9471 Impact factor: 5.038
Summary of the main characteristics of the five investigated diagnostic criteria
| Diagnostic criteria | Current symptoms | Past symptoms | Assessment criteria | Assessment method | Cases (morphometry/white matter) | |
|---|---|---|---|---|---|---|
| Current depression | cMDD‐STR |
| ‐ | DSM | Clinical interview | 30/40 |
| cMDD‐UKB |
|
| Manually defined | Self‐report | 735/1,435 | |
| Past depression | rMDD‐STR |
|
| DSM | Clinical interview | 148/202 |
| pMDD‐UKB CIDI | ‐ |
| DSM | Self‐report | 1,665/3,418 | |
| pMDD‐UKB ICD | ‐ |
| ICD | Clinical interview | 140/289 |
Note: Tick symbol denotes symptoms were present, dash denotes symptoms could be either present or absent, cross symbol denotes symptoms were absent.
Abbreviations: cMDD‐STR, current MDD criteria in STRADL cohort; cMDD‐UKB, probable current MDD criteria in UK Biobank cohort; DSM, Diagnostic and Statistical Manual of Mental Disorders; ICD, International Statistical Classification of Diseases and Related Health Problems; pMDD‐UKB‐CIDI, lifetime MDD criteria based on the online Composite International Diagnostic Interview (CIDI) in UK Biobank cohort; pMDD‐UKB‐ICD, lifetime MDD criteria based on ICD and hospital records in UK Biobank cohort; rMDD‐STR, remitted MDD criteria in STRADL cohort.
Summary demographic information for cases and controls in the five analysed samples with brain morphometric measures (cortical thickness, surface areas, and volumes)
| Sample | Characteristic | Controls | Cases | |
|---|---|---|---|---|
| Current depression | cMDD‐STR | Size | 30 | 30 |
| Sex (male/female) | 8/22 | 8/22 | ||
| Age (years) | 54.23 (10.98) | 54.07 (10.96) | ||
| QIDS | 3.3 (2.25) | 13.97 (3.59) | ||
| Medicated | 2 | 18 | ||
| cMDD‐UKB | Size | 735 | 735 | |
| Sex (male/female) | 215/520 | 215/520 | ||
| Age (years) | 59.66 (7.21) | 59.66 (7.21) | ||
| Past depression | rMDD‐STR | Size | 148 | 148 |
| Sex (male/female) | 44/104 | 44/104 | ||
| Age (years) | 58.06 (8.02) | 57.16 (8.81) | ||
| QIDS | 3.49 (2.35) | 5.48 (3.91) | ||
| pMDD‐UKB‐CIDI | Size | 1,665 | 1,665 | |
| Sex (male/female) | 544/1,121 | 544/1,121 | ||
| Age (years) | 60.91 (7.18) | 60.90 (7.19) | ||
| pMDD‐UKB‐ICD | Size | 140 | 140 | |
| Sex (male/female) | 49/91 | 49/91 | ||
| Age (years) | 61.59 (7.64) | 61.59 (7.65) |
Note: Standard deviations for age and QIDS are in brackets. In cMDD‐STR sample participants were considered medicated if they had at least one antidepressant prescription.
Abbreviations: cMDD‐STR, current MDD criteria in STRADL cohort; cMDD‐UKB, probable current MDD criteria in UK Biobank cohort; pMDD‐UKB‐CIDI, lifetime MDD criteria based on the online Composite International Diagnostic Interview (CIDI) in UK Biobank cohort; pMDD‐UKB‐ICD, lifetime MDD criteria based on ICD and hospital records in UK Biobank cohort; QIDS, Quick Inventory of Depressive Symptomatology; rMDD‐STR, remitted MDD criteria in STRADL cohort.
Summary demographic information for cases and controls in the five analysed samples with white matter integrity measures (FA and MD)
| Sample | Characteristic | Controls | Cases | |
|---|---|---|---|---|
| Current depression | cMDD‐STR | Size | 40 | 40 |
| Sex (male/female) | 10/30 | 10/30 | ||
| Age (years) | 55.03 (9.77) | 54.23 (10.35) | ||
| QIDS | 3.58 (2.54) | 14.13 (3.88) | ||
| Medicated | 2 | 28 | ||
| cMDD‐UKB | Size | 1,435 | 1,435 | |
| Sex (male/female) | 451/984 | 451/984 | ||
| Age (years) | 60.11 (7.16) | 60.11 (7.16) | ||
| Past depression | rMDD‐STR | Size | 202 | 202 |
| Sex (male/female) | 56/146 | 56/146 | ||
| Age (years) | 57.89 (8.84) | 56.97 (9.28) | ||
| QIDS | 3.53 (2.36) | 5.38 (3.68) | ||
| pMDD‐UKB‐CIDI | Size | 3,418 | 3,418 | |
| Sex (male/female) | 1,094/2,324 | 1,094/2,324 | ||
| Age (years) | 61.46 (7.09) | 61.44 (7.11) | ||
| pMDD‐UKB‐ICD | Size | 289 | 289 | |
| Sex (male/female) | 97/192 | 97/192 | ||
| Age (years) | 61.77 (7.70) | 61.76 (7.70) |
Note: Standard deviations for age and QIDS are in brackets. In cMDD‐STR sample participants were considered medicated if they had at least one antidepressant prescription.
Abbreviations: cMDD‐STR, current MDD criteria in STRADL cohort; cMDD‐UKB, probable current MDD criteria in UK Biobank cohort; pMDD‐UKB‐CIDI, lifetime MDD criteria based on the online Composite International Diagnostic Interview (CIDI) in UK Biobank cohort; pMDD‐UKB‐ICD, lifetime MDD criteria based on ICD and hospital records in UK Biobank cohort; QIDS, Quick Inventory of Depressive Symptomatology; rMDD‐STR, remitted MDD criteria in STRADL cohort.
Case‐control classification accuracies and ROC AUC measures (on cross‐validation) with brain morphometric features in cMDD‐STR sample (30 cases and 30 controls)
| Classifier type | Feature selection | Hyperparam. optimisation | Outer CV | Inner CV | Feature domain | Classification accuracy (sensitivity/specificity) | ROC AUC |
|---|---|---|---|---|---|---|---|
| PLR | Embedded | Grid search | LOOCV | 10‐fold | Thickness |
| 0.609 |
| Surface area | 51.67% (50.00/53.33%) | 0.547 | |||||
| Volume | 56.67% (60.00/53.33%) | 0.539 | |||||
| Subcortical | 55.00% (56.67/53.33%) | 0.572 | |||||
| Combined | 50.00% (46.67/53.33%) | 0.546 | |||||
| SVM | None | None | LOOCV | ‐ | Thickness |
| 0.682 |
| Surface area | 46.67% (43.33/50.00%) | 0.529 | |||||
| Volume | 51.67% (40.00/63.33%) | 0.550 | |||||
| Subcortical | 60.00% (60.00/60.00%) | 0.582 | |||||
| Combined | 61.67% (70.00/53.33%) | 0.568 | |||||
| Grid search | LOOCV | Thickness | 60.00% (60.00/60.00%) | 0.556 | |||
| Surface area | 50.00% (46.67/53.33%) | 0.500 | |||||
| Volume | 61.67% (50.00/73.33%) | 0.649 | |||||
| Subcortical | 58.33% (50.00/66.67%) | 0.602 | |||||
| Combined | 58.33% (63.33/53.33%) | 0.628 | |||||
| Statistical filter | None | Combined | 53.33% (40.00/66.67%) | 0.540 | |||
| Grid search | 10‐fold | Combined | 45.00% (40.00/50.00%) | 0.513 | |||
| Sequential elimination | None | Thickness | 61.67% (56.67/66.67%) | 0.687 | |||
| Surface area | 48.33% (43.33/53.33%) | 0.519 | |||||
| Volume | 50.00% (36.67/63.33%) | 0.556 | |||||
| Subcortical | 53.33% (50.00/56.67%) | 0.573 | |||||
| Combined | 61.67% (63.33/60.00%) | 0.659 | |||||
| DT | None | None | LOOCV | ‐ | Thickness | 38.33% (40.00/36.67%) | 0.283 |
| Surface area |
| 0.680 | |||||
| Volume | 45.00% (43.33/46.67%) | 0.377 | |||||
| Subcortical | 43.33% (46.67/40.00%) | 0.394 | |||||
| Combined | 55.00% (50.00/60.00%) | 0.473 | |||||
| Grid search | LOOCV | Thickness | 51.67% (53.33/50.00%) | 0.318 | |||
| Surface area | 65.00% (76.67/53.33%) | 0.677 | |||||
| Volume | 46.67% (46.67/46.67%) | 0.442 | |||||
| Subcortical | 58.33% (56.67/60.00%) | 0.500 | |||||
| Combined | 38.33% (46.67/30.00%) | 0.407 | |||||
| Statistical filter | None | Combined | 33.33% (43.33/23.33%) | 0.189 | |||
| Grid search | 10‐fold | Combined | 43.33% (43.33/43.33%) | 0.350 | |||
| Sequential elimination | None | Thickness | 35.00% (46.67/23.33%) | 0.255 | |||
| Surface area | 68.33% (70.00/66.67%) | 0.637 | |||||
| Volume | 40.00% (40.00/40.00%) | 0.292 | |||||
| Subcortical | 51.67% (46.67/56.67%) | 0.426 | |||||
| Combined | 63.33% (56.67/70.00%) | 0.533 |
Note: Top accuracies for SVM, PLR and DT classifiers are in italics.
Abbreviations: CV, cross‐validation; DT, decision tree; LOOCV, leave‐one‐out cross‐validation; PLR, penalised logistic regression; ROC AUC, receiver operating characteristic area under the curve; SVM, support vector machine.
Case‐control classification accuracies and ROC AUC measures (on cross‐validation) with white matter integrity features in the cMDD‐STR sample (40 cases and 40 controls)
| Classifier type | Feature selection | Hyperparam optimisation | Outer CV | Inner CV | Feature domain | Classification accuracy (sensitivity/specificity) | ROC AUC |
|---|---|---|---|---|---|---|---|
| PLR | Embedded | Grid search | LOOCV | 10‐fold | FA | 31.25% (35.00/27.50%) | 0.363 |
| MD | 53.75% (55.00/52.50%) | 0.589 | |||||
| Combined | 48.75% (50.00/47.50%) | 0.474 | |||||
| SVM | None | None | LOOCV | ‐ | FA | 48.75% (40.00/57.50%) | 0.484 |
| MD | 57.50% (55.00/60.00%) | 0.536 | |||||
| Combined | 52.50% (50.00/55.00%) | 0.520 | |||||
| Grid search | LOOCV | FA | 50.00% (40.00/60.00%) | 0.505 | |||
| MD |
| 0.673 | |||||
| Combined | 53.75% (52.50/55.00%) | 0.559 | |||||
| Statistical filter | None | FA | 40.00% (32.50/47.50%) | 0.345 | |||
| MD | 37.50% (30.00/45.00%) | 0.353 | |||||
| Combined | 30.00% (20.00/40.00%) | 0.283 | |||||
| Grid search | 10‐fold | FA | 52.50% (60.00/45.00%) | 0.476 | |||
| MD | 38.75% (40.00/37.50%) | 0.385 | |||||
| Combined | 38.75% (40.00/37.50%) | 0.328 | |||||
| Sequential elimination | None | FA | 47.50% (40.00/55.00%) | 0.488 | |||
| MD | 53.75% (52.50/55.00%) | 0.534 | |||||
| Combined | 51.25% (55.00/47.50%) | 0.501 | |||||
| DT | None | None | LOOCV | ‐ | FA | 53.75% (50.00/57.50%) | 0.434 |
| MD | 56.25% (55.00/57.50%) | 0.514 | |||||
| Combined |
| 0.552 | |||||
| Grid search | LOOCV | FA | 48.75% (65.00/32.50%) | 0.350 | |||
| MD | 47.50% (40.00/55.00%) | 0.372 | |||||
| Combined | 51.25% (47.50/55.00%) | 0.563 | |||||
| Statistical filter | None | FA | 45.00% (35.00/55.00%) | 0.323 | |||
| MD | 43.75% (30.00/57.50%) | 0.331 | |||||
| Combined | 36.25% (30.00/42.50%) | 0.256 | |||||
| Grid search | 10‐fold | FA | 42.50% (47.50/37.50%) | 0.280 | |||
| MD | 40.00% (35.00/45.00%) | 0.204 | |||||
| Combined | 33.75% (35.00/32.50%) | 0.267 | |||||
| Sequential elimination | None | FA | 48.75% (45.00/52.50%) | 0.433 | |||
| MD | 52.50% (52.50/52.50%) | 0.458 | |||||
| Combined | 56.25% (55.00/57.50%) | 0.488 |
Note: Top accuracies for SVM, PLR and DT classifiers are in italics.
Abbreviations: CV, cross‐validation; DT, decision tree; LOOCV, leave‐one‐out cross‐validation; PLR, penalised logistic regression; ROC AUC, receiver operating characteristic area under the curve; SVM, support vector machine.
Best accuracies and related ROC AUC measures for case‐control classification (on cross‐validation) for brain moprhometric and white matter integrity features in cMDD‐UKB, rMDD‐STR, pMDD‐UKB‐CIDI, and pMDD‐UKB‐ICD samples
| Data set | Feature domain | Sample size | Classification approach | Classification accuracy (sensitivity/specificity) | ROC AUC |
|---|---|---|---|---|---|
| cMDD‐UKB | Cortical thickness features | 1,470 |
PLR classifier ‐ Hyperparameter grid search ‐ Embedded feature selection |
52.80% (52.66/52.92%) | 0.540 |
| Combined FA and MD features | 2,870 |
SVM classifier ‐ No hyperparameter optim. ‐ Sequential feat. Elimination |
53.73% (51.08/56.37%) | 0.549 | |
| rMDD‐STR | Combined brain morphometric features | 296 |
Decision tree classifier ‐ Hyperparameter grid search ‐ Filter feature selection |
| 0.572 |
| MD features | 404 |
SVM classifier ‐ Hyperparameter grid search ‐ No feature selection |
55.54% (59.16/51.92%) | 0.560 | |
| pMDD‐UKB‐CIDI | Cortical thickness features | 3,330 |
SVM classifier ‐ No hyperparameter optim. ‐ No feature selection |
53.63% (53.72/53.54%) | 0.532 |
| Combined FA and MD features | 6,836 |
SVM classifier ‐ ‐ No hyperparameter optim. ‐ Sequential feat. Elimination |
52.68% (53.63/51.73%) | 0.531 | |
| pMDD‐UKB‐ICD | Combined brain morphometric features | 280 |
PLR classifier ‐ Hyperparameter grid search ‐ Embedded feature selection |
| 0.645 |
| MD features | 578 |
SVM classifier ‐ No hyperparameter optim. ‐ Filter feature selection |
56.18% (68.56/43.83%) | 0.566 |
Note: Combined brain morphometric features included cortical thickness, surface area, cortical and subcortical volume measures. Nested 10‐fold outer and 10‐fold inner cross‐validation was performed in all analyses, except where otherwise specified. Top two accuracies are in italics.
Abbreviations: cMDD‐UKB, sample with probable current MDD in UK Biobank cohort; FA, fractional anisotropy; MD, mean diffusivity; PLR, penalised logistic regression; pMDD‐UKB‐CIDI, sample with lifetime MDD based on the online Composite International Diagnostic Interview (CIDI) criteria in UK Biobank cohort; pMDD‐UKB‐ICD, sample with lifetime MDD based on the ICD criteria and hospital records in UK Biobank cohort; rMDD‐STR, sample with remitted MDD in STRADL cohort; ROC AUC, receiver operating characteristic area under the curve; SVM, support vector machine.
FIGURE 1Surface area regions consistently selected as decision tree cut features across cross‐validation folds in cMDD‐STR sample. Colour of each region indicates fraction of folds where surface area of the region was selected as one of the cut variables. Regions in dark grey were never selected. Most frequently selected regions include bilateral caudal anterior cingulate, left lingual gyrus, left superior frontal, right precentral and paracentral regions
FIGURE 2Best classification accuracies plotted against sample sizes for all 10 analysed samples (five diagnostic definitions across two feature domains—brain morphometry and white matter integrity). Best accuracy tended to decrease towards chance level with increasing sample size. Sample size/abscissa axis logarithmically scaled. Abbreviation: ROC AUC, receiver operating characteristic area under the curve