| Literature DB >> 30935858 |
Richard Dinga1, Lianne Schmaal2, Brenda W J H Penninx3, Marie Jose van Tol4, Dick J Veltman3, Laura van Velzen3, Maarten Mennes5, Nic J A van der Wee6, Andre F Marquand5.
Abstract
BACKGROUND: Psychiatric disorders are highly heterogeneous, defined based on symptoms with little connection to potential underlying biological mechanisms. A possible approach to dissect biological heterogeneity is to look for biologically meaningful subtypes. A recent study Drysdale et al. (2017) showed promising results along this line by simultaneously using resting state fMRI and clinical data and identified four distinct subtypes of depression with different clinical profiles and abnormal resting state fMRI connectivity. These subtypes were predictive of treatment response to transcranial magnetic stimulation therapy.Entities:
Keywords: Anxiety; Clustering; Machine learning; Major depressive disorder; Replication
Mesh:
Year: 2019 PMID: 30935858 PMCID: PMC6543446 DOI: 10.1016/j.nicl.2019.101796
Source DB: PubMed Journal: Neuroimage Clin ISSN: 2213-1582 Impact factor: 4.881
Fig. 1A scheme of the pipeline used in the original study and our pipeline. Data: in the original study, 220 depressed subjects have been analyzed as a part of a “cluster discovery” set and an additional 92 subjects were used as evaluation set. The clinical data (Clin) consisted of 17 HAM-D items. We have used 187 subjects with depression, anxiety disorder or depression-anxiety comorbidity. The clinical data consisted of 17 IDS items that best-matched the HAM-D item used in the original study. After preprocessing of fMRI data (RS), a correlation matrix between selected regions was created, resulting in ~35,000 features. A small subset of features (178 in the original study and 150 in our study) were selected based on their correlation with clinical symptoms (Sel.RS). Then, CCA was performed using these selected features and clinical symptoms. In the original study, a parametric test was used to the established statistical significance of CCA without taking a previous feature selection into an account. Hierarchical clustering was performed on first two resting state connectivity canonical variates (CV1, CV2). We have included an additional test, to test if the data cluster more than what is expected from data sampled from a Gaussian distribution. Stability of cluster assignment was evaluated in the original study by resampling of CV1 and CV2, We have extended the resampling stability evaluation to feature selection (in addition to the CCA procedures). Out of sample evaluation: in the original study, an additional 92 subjects were assigned to clusters according to a SVM model and clinical profiles of these clusters were compared to clinical profiles of clusters obtained in the cluster discovery set. We have evaluated the reproducibility of canonical correlations directly, using 10-fold cross-validation.
Sample characteristics for the NESDA (stratified by study site) and MOTAR samples.
| NESDA 1 | NESDA 2 | NESDA 3 | MOTAR | Total | |
|---|---|---|---|---|---|
| N | 32 | 57 | 62 | 36 | 187 |
| Age (SD) | 36 (9.52) | 37.26 (10.26) | 35.87 (11.51) | 36.69 (12.37) | 36.48 (10.93) |
| N Female (%) | 19 (59%) | 39 (68%) | 43 (69%) | 23 (66%) | 124 (66%) |
| N MDD (%) (no anx. disorder) | 13 (40%) | 18 (32%) | 22 (35%) | 10 (28%) | 63 (34%) |
| N Anxiety disorder (%) (no MDD) | 11 (34%) | 13 (22%) | 21 (34%) | 2 (6%) | 47 (25%) |
| N Comorbid (%) | 8 (25%) | 26 (45%) | 19 (30%) | 24 (67%) | 77 (41%) |
| Current | 22.31 (9.66) | 24.66 (11.45) | 19.93 (12.42) | 15.81 (10.56) | 21.11 (11.96) |
| Current | 5/16/9/1/1 | 10/18/22/6/1 | 22/20/16/2/2 | 19/10/5/2/0 | 56/64/52/11/4 |
| Current | 13.19 (8.45) | 13.33 (9.73) | 13.34 (9.35) | 21.67 (11.95) | 14.91 (10.34) |
| Current | 11/3/7/1 | 22/19/13/3 | 27/18/12/5 | 5/11/11/9 | 65/61/43/18 |
| N in remission (IDS and BAI) | 3 (9%) | 10 (18%) | 16 (25%) | 5 (14%) | 34 (18%) |
| Baseline antidepressant use | 7 (21%) | 20 (35%) | 23 (37%) | 0 (0%) | 50 (26%) |
| Social phobia | 12 (37%) | 23 (40%) | 23 (37%) | 15 (42%) | 73 (39%) |
| Panic with agoraphobia | 6 (18%) | 13 (22%) | 17 (27%) | 8 (22%) | 44 (24%) |
| Panic without agoraphobia | 4 (13%) | 11 (19%) | 9 (15%) | 1 (3%) | 25 (13%) |
| Agoraphobia | 4 (13%) | 3 (5%) | 3 (5%) | 0 (0%) | 10 (5%) |
| GAD | 7 (22%) | 16 (28%) | 15 (24%) | 7 (19%) | 45 (24%) |
Anxiety disorder: one or more of panic disorder, generalized anxiety disorder, social phobia, and agoraphobia. IDS: inventory of depressive symptomatology. BAI: Beck anxiety inventory. GAD: generalized anxiety disorder. IDS categorical cutoffs: 0–13/14–25/26–38/39–48/49–84. BAI categorical cutoffs (Kabacoff et al., 1997): 0–9/10–18/19–29/30–63.
At the time of scanning.
HAM-D items used in the original study and best-matched IDS items used in this study.
| HAMD item | IDS item |
|---|---|
| Mood | Feeling sad |
| Guilt | Self criticism and blame |
| Suicide | Thoughts of death or suicide |
| Early insomnia | Early insomnia |
| Mid insomnia | Mid insomnia |
| Late insomnia | Late insomnia |
| Anhedonia | Capacity for pleasure or enjoyment (excluding sex) |
| Retardation | Psychomotor retardation |
| Agitation | Psychomotor agitation |
| Anxiety psychological | Feeling anxious or tense |
| Anxiety physiological | Other bodily symptoms/sympathetic arousal |
| Somatic gastro-internal | Gastrointestinal complaints |
| Fatigue/aches/low energy | Energy level/fatiguability |
| Genital | Interest in sex |
| Hypochondria | Somatic complains |
| Weight loss | Weight loss |
| Insight | Sensitivity |
Fig. 2A, B) CCA finds a linear combination (canonical variate) of brain connectivity features that maximizes correlation with a linear combination of clinical symptoms. Canonical correlations are high and comparable to the original study (0.95 and 0.91). C) The null distribution of the first canonical correlation obtained using permutation test. Although canonical correlations in A and B are seemingly high, they are also high under the null hypothesis and thus not statistically significant. D) Out of sample canonical correlation for first two canonical pairs estimated by 10 fold cross-validation. Each point represents out of sample canonical correlation for each cross-validation fold. Although the canonical correlation was high in the training set as showed in A and B, id dropped to a chance level correlation in the test sets. E) Canonical loadings for the first canonical variate and their stability under resampling of the data using leave-one-out (jack-knife) procedure. F) Clinical canonical loadings for all canonical variates (1–17) and first two reported in the original study (D1-D2).
Fig. 3A) obtained 4-cluster solution using hierarchical clustering. B) Stability of the cluster assignment. Each subject is shown with the same color as it had in A, but the connectivity scores are recomputed under a small perturbation of the data i.e. leaving one subject out of the feature selection and CCA procedure. C) Variance ratio criterion is maximized at 3 clusters (4 in the original study). D) Silhouette index is maximized at 3 clusters. E, F) Null distribution of Variance ratio and silhouette indices. Showing that although these indices are maximized at 3 clusters, these results are not unusual even for the data simulated from a distribution with no clusters. Therefore these criteria do not imply evidence for the existence of clusters in our data or in the data presented in the original study.