| Literature DB >> 27642641 |
Andre F Marquand1, Thomas Wolfers2, Maarten Mennes2, Jan Buitelaar3, Christian F Beckmann4.
Abstract
Heterogeneity is a key feature of all psychiatric disorders that manifests on many levels, including symptoms, disease course, and biological underpinnings. These form a substantial barrier to understanding disease mechanisms and developing effective, personalized treatments. In response, many studies have aimed to stratify psychiatric disorders, aiming to find more consistent subgroups on the basis of many types of data. Such approaches have received renewed interest after recent research initiatives, such as the National Institute of Mental Health Research Domain Criteria and the European Roadmap for Mental Health Research, both of which emphasize finding stratifications that are based on biological systems and that cut across current classifications. We first introduce the basic concepts for stratifying psychiatric disorders and then provide a methodologically oriented and critical review of the existing literature. This shows that the predominant clustering approach that aims to subdivide clinical populations into more coherent subgroups has made a useful contribution but is heavily dependent on the type of data used; it has produced many different ways to subgroup the disorders we review, but for most disorders it has not converged on a consistent set of subgroups. We highlight problems with current approaches that are not widely recognized and discuss the importance of validation to ensure that the derived subgroups index clinically relevant variation. Finally, we review emerging techniques-such as those that estimate normative models for mappings between biology and behavior-that provide new ways to parse the heterogeneity underlying psychiatric disorders and evaluate all methods to meeting the objectives of such as the National Institute of Mental Health Research Domain Criteria and Roadmap for Mental Health Research.Entities:
Keywords: European Roadmap for Mental Health Research; Heterogeneity; Latent cluster analysis; Psychiatry; RDoC; ROAMER; Research Domain Criteria; Subgroup
Year: 2016 PMID: 27642641 PMCID: PMC5013873 DOI: 10.1016/j.bpsc.2016.04.002
Source DB: PubMed Journal: Biol Psychiatry Cogn Neurosci Neuroimaging ISSN: 2451-9022
Studies Using Clustering Methods to Stratify Schizophrenia
| Study | Subjects ( | Measures | Algorithm | No. of Clusters (Method) | Cluster Descriptions | External Validation |
|---|---|---|---|---|---|---|
| Farmer | SCZ (65) | Symptoms and case history variables | K means and hierarchical clustering | 2 (maximal agreement between methods) | Good premorbid adjustment, late onset, and well organized delusions | None |
| Poor premorbid functioning, early onset, incoherent speech, and bizarre behavior | ||||||
| Castle | SCZ (447) | Symptoms and case history variables | LCCA | 3 (χ2 test) | Neurodevelopmental | Premorbid, phenomenologic, and treatment response variables [see ( |
| Paranoid | ||||||
| Schizoaffective | ||||||
| Dollfus | SCZ (138) | Symptoms | Ward’s hierarchical clustering method ( | 4 (informal examination of cluster dendrogram) | Positive symptoms | Social variables |
| Negative symptoms | ||||||
| Disorganized symptoms | ||||||
| Mixed symptoms | ||||||
| Kendler | SCZ (348) | Symptoms | LCCA | 6 (not specified) | Classic schizophrenia | Historical data |
| Major depression | ||||||
| Schiophreniform disorder | ||||||
| Bipolar-schizomania | ||||||
| Hebephrenia | ||||||
| Murray | SCZ (387) | “Operational criteria” diagnostic measures (medical records and interview) | LCCA | BIC ( | Depression | None |
| Reality distortion | ||||||
| Mania | ||||||
| Disorganization | ||||||
| Dawes | SCZ and SAD (144) | Neuropsychological measures | K means | 5 (Ward method) | Visual learning and memory (–) | None |
| Verbal comprehension (+), processing speed (+), abstraction (–) auditory and visual learning, and memory (–) | ||||||
| Abstraction (–) | ||||||
| Verbal comprehension (+), visual learning and memory (+), abstraction (–), auditory learning and memory (–) | ||||||
| Verbal comprehension (+), abstraction (–), visual learning and memory (–) | ||||||
| Cole et al., 2012 ( | SCZ (208) | Social and academic adjustment scales | LCGA | 3 [BIC and Lo-Mendell-Rubin test ( | Good—stable | None |
| Insidious onset | ||||||
| Poor deteriorating | ||||||
| Bell | SCZ and SAD (77 + 63 validation) | Symptoms and social cognitive measures | K means | 3 (Ward method) | High negative symptoms | None |
| High social cognition | ||||||
| Low social cognition | ||||||
| Brodersen | SCZ (41) and HC (42) | Dynamic causal model ( | Gaussian mixture | 3 [Bayesian model evidence ( | Subgroups characterized in terms of DCM model parameters | Symptoms and medication |
| Geisler | SCZ (129) | Neuropsychological measures | K-means | 4 (fixed a priori) | Verbal fluency (–), processing speed (–) | fMRI |
| Verbal episodic memory (–), fine motor control (–), signal detection | ||||||
| Face episodic memory (–), processing speed (–) | ||||||
| General intellectual function (–) | ||||||
| Sun et al., 2015 ( | SCZ (113) | White matter integrity measured by diffusion tensor imaging | Hierarchical clustering | 2 [Silhouette, Dunn, and connectivity indices ( | Subgroups characterized in terms of white matter abnormalities | Symptoms |
External validation is defined as a data measure used to validate the derived classes that is of a different type to the data use to derive the classes. Wherever possible, we follow the authors’ own nomenclature for describing clusters, and a (+) or (–) indicates relative improvement or deficit in the specified variable.
BIC, Bayesian information criterion; DCM, dynamic causal modeling; fMRI, functional magnetic resonance imaging; LCCA, latent class cluster analysis; LCGA, latent class growth analysis; SAD, schizoaffective disorder; SCZ, schizophrenia.
Studies Using Clustering Methods to Stratify Depression
| Study | Subjects ( | Measures | Algorithm | No. of Clusters (Method) | Cluster Descriptions | External Validation |
|---|---|---|---|---|---|---|
| Paykel, 1971 ( | Patients with depression (165) | Clinical interviews, case history, and personality variables | Friedman–Rubin algorithm ( | 4 (maximize the ratio of between to within class scatter) | Psychotic | None |
| Anxious | ||||||
| Hostile | ||||||
| Young depressive with personality disorder | ||||||
| Maes | MDD (80) | Symptoms | K means | 2 (not specified) | Vital (i.e., psychomotor disorders, loss of energy, early morning awakening, and nonreactivity) | Biological (e.g., endocrine) measures |
| Nonvital | ||||||
| Kendler | Female twin pairs (2163) | Symptoms | LCCA | 7 (not specified) | Only 3 clusters described: | Body mass index, personality, and concordance of cluster membership among twin pairs |
Mild typical depression Atypical depression Severe typical depression | ||||||
| Sullivan | National comorbidity survey respondents (2836) | Symptoms | LCCA | 6 (χ2 statistic) | Severe typical | Demographic and personality variables |
| Mild typical | ||||||
| Severe atypical | ||||||
| Mild atypical | ||||||
| Intermediate | ||||||
| Minimal symptoms | ||||||
| Hybels | MDD (368) | Symptoms | LCCA | 4 [ | DSM-IV depression: Moderate sadness, lassitude and inability to feel | Demographic, social, and clinical variables |
| Higher severity for all items, especially apparent sadness | ||||||
| Milder profile | ||||||
| Highest severity and most functional limitations | ||||||
| Lamers | MDD (818) | Symptoms plus demographic, psychosocial, and physical health variables | LCCA | 3 [BIC and AIC ( | Severe melancholic (decreased appetite, weight loss) | Stability over time, sociodemographic, clinical, and biological (e.g., metabolic) variables ( |
| Severe atypical (overeating and weight gain) | ||||||
| Moderate severity | ||||||
| Lamers | National comorbidity survey—replication respondents. Adolescents (912) and adults (805) | Symptoms | LCCA | Adolescents: 3, adults: 4 (BIC) | Adolescents: | None |
Moderate typical Severe typical Severe atypical | ||||||
| Adults: | ||||||
Moderate Moderate typical Severe typical Severe atypical | ||||||
| Rhebergen | MDD (804) | Longitudinal symptom scores | LCGA | 5 (BIC and Lo-Mendell-Rubin test) | Remission | Demographic and diagnostic variables, fMRI [see ( |
| Decline (moderate severity) | ||||||
| Decline (severe) | ||||||
| Chronic (moderate severity) | ||||||
| Decline (severe) | ||||||
| Van Loo | MDD (8,261) | Retrospective symptom reports and demographic data that predict disease course | K-means | 3 (Inspection of dichotomization scores and area under the receiver operating characteristic curve [see ( | High risk | None |
| Intermediate risk | ||||||
| Low risk | ||||||
| Milaneschi | MDD (1477) | Symptoms | LCCA | 3 (BIC, AIC, and likelihood ratio test) | Severe melancholic [see Lamers | Polygenic risk scores |
| Severe atypical | ||||||
| Moderate |
External validation is defined as a data measure used to validate the derived classes that is of a different type to the data use to derive the classes. Wherever possible, we follow the authors’ own nomenclature for describing clusters.
AIC, Akaike information criterion; BIC, Bayesian information criterion; fMRI, functional magnetic resonance imaging; LCCA, latent class cluster analysis; LCGA, latent class growth analysis; MDD, major depressive disorder.
Studies Using Clustering Methods to Stratify Attention-Deficit/Hyperactivity Disorder
| Study | Subjects ( | Measures | Algorithm | No. of Clusters (Method) | Cluster Descriptions | External Validation |
|---|---|---|---|---|---|---|
| Fair | ADHD (285) and TDC (213) | Neuropsychologic scores | CD ( | 6 for ADHD (determined implicitly by the algorithm) | Response time variability (+) | None |
| Working memory (–), memory span (–), inhibition (–), and output speed (–) | ||||||
| Working memory (–), memory span (–), inhibition (–), and output speed (–), minor differences in remaining measures | ||||||
| Temporal processing (–) | ||||||
| Arousal (–) | ||||||
| Arousal (–), minor differences in remaining measures | ||||||
| Karalunas | ADHD (247) and TDC (190) | Personality measures (e.g., temperament) | CD | 3 (determined implicitly by the algorithm) | Mild | Physiological (e.g., cardiac) measures, resting state fMRI and 1-year clinical outcomes |
| Surgent (positive apporach motivation) | ||||||
| Irritable (negative emotionality, anger, and poor soothability) | ||||||
| Gates | ADHD (32) and TDC (58) | fMRI (functional connectivity) | CD | 5 (determined implicitly by the algorithm) | Subgroups characterized in terms of functional connectivity profiles | None |
| Costa Dias | ADHD (42) and TDC (63) | fMRI (reward related functional connectivity) | CD | 3 (determined implicitly by the algorithm) | Subgroups characterized in terms of functional connectivity profiles | Clinical variables and reward sensitivity |
| Van Hulst | ADHD (96) and TDC (121) | Neuropsychological scores | LCCA | 5 (BIC) | Quick and accurate | Parent ratings of behavioral problems |
| Poor cognitive control | ||||||
| Slow and variable timing | ||||||
| Remaining 2 groups were too small to characterize | ||||||
| Mostert | ADHD (133) and TDC (132) | Neuropsychological scores | CD | 3 (determined implicitly by the algorithm) | Attention (–), inhibition (–) | Clinical symptoms and case history |
| Reward sensitivity (+) | ||||||
| Working memory (–) and verbal fluency (–) |
External validation is defined as a data measure used to validate the derived classes that is of a different type to the data use to derive the classes. Wherever possible, we follow the authors’ own nomenclature for describing clusters, and a (+) or (–) indicates relative improvement or deficit in the specified variable.
ADHD, attention-deficit/hyperactivity disorder; BIC, Bayesian information criterion; CD, community detection; fMRI, functional magnetic resonance imaging; LCCA, latent class cluster analysis; TDC, typically developing control.
Studies Using Clustering Methods to Stratify Autism
| Study | Subjects ( | Measures | Algorithm | No. of Clusters (Method) | Cluster Descriptions | External Validation |
|---|---|---|---|---|---|---|
| Munson | ASD (245) | IQ scores | LCCA and taxonometric analysis | 4 (BIC, entropy, and Lo-Mendell-Rubin test) | Low IQ | Symptom scores |
| Low verbal IQ/medium nonverbal | ||||||
| Medium IQ | ||||||
| High IQ | ||||||
| Sacco | ASD (245) | Demographic, clinical, case history, and physiologic (e.g., head circumference) variables | K means | 4 (Ward’s method) | Immune + circadian and sensory | None |
| Circadian and sensory | ||||||
| Stereotypic behaviors | ||||||
| Mixed | ||||||
| Fountain | ASD (6795) | Symptoms | LCGA | 6 (BIC) | High functioning | Demographic variables and autism risk factors |
| Bloomers (substantial improvement) | ||||||
| Medium-high functioning | ||||||
| Medium functioning | ||||||
| Low-medium functioning | ||||||
| Low functioning | ||||||
| Georgiades | ASD (391) | Symptom scores | FMM | 3 (AIC and BIC) | Social communication (–), repetitive behaviors (+) | Demographic and cognitive meaures |
| Social communication (+), repetitive behaviors (–) | ||||||
| Social communication (–), repetitive behaviors (–) | ||||||
| Doshi-Velez | ASD (4927) | Electronic medical records | Ward’s method | 4 (Ward’s method) | Seizures | None |
| Multisystem disorders | ||||||
| Auditory disorders and infections | ||||||
| Psychiatric disorders | ||||||
| Not otherwise specified | ||||||
| Veatch | ASD (1261 + 2563 for replication) | Symptoms, demographic, and somatic variables | Ward’s method | 2 [Adjusted Arabie Rand index ( | Severe | Genomic data |
| Less severe |
External validation is defined as a data measure used to validate the derived classes that is of a different type to the data use to derive the classes. Wherever possible, we follow the authors’ own nomenclature for describing clusters, and a (+) or (–) indicates relative improvement or deficit in the specified variable.
ASD, autism spectrum disorder; BIC, Bayesian information criterion; FMM, factor mixture modeling; LCCA, latent class cluster analysis; LCGA, latent class growth analysis.
Studies Employing Clustering Methods to Stratify Patients in a Cross-Diagnostic Setting
| Study | Subjects ( | Measures | Algorithm | No. of Clusters (Method) | Cluster Descriptions | External Validation |
|---|---|---|---|---|---|---|
| Olinio | Adolescents (1653), including MDD (603), ANX (253), SUD (453) | Diagnosis (longitudinal) | LCGA | 6 (BIC) | Persistent depression | Demographic and case history variables |
| Persistent anxiety | ||||||
| Late onset anxiety, increasing depression | ||||||
| Increasing depression | ||||||
| Initially high, decreasing anxiety | ||||||
| Absence of psychopathology | ||||||
| Lewdanowski | SCZ (41), SAD (53), BPDp (73) | Clinical and cognitive measures | K means | 4 (Ward’s method) | Neuropsychologically normal | Diagnosis, demographic variables, and community functioning |
| Globally and significantly impaired | ||||||
| Mixed cognitive profiles (×2) | ||||||
| Kleinman | ADHD (23), BPD (10), BPDa (33), and HCs (18) | Continuous performance test measures | K means | 2 [Silhouette index ( | Sustained attention (–) , inhibitory control (–), impulsiveness (+), and vigilance (–) | Diagnosis |
| The converse of above |
External validation is defined as a data measure used to validate the derived classes that is of a different type to the data use to derive the classes. Wherever possible, we follow the authors’ own nomenclature for describing clusters and a (+) or (–) indicates relative improvement or deficit in the specified variable.
ADHD, attention-deficit/hyperactivity disorder; ANX, anxiety disorders; BPD(p/a), bipolar disorder (with psychosis/ADHD); BIC, Bayesian information criterion; DEP, depressive disorders (major depression and dysthymia); HC, healthy control; LCGA, latent class growth analysis; MDD, major depressive disorder; SAD, schizoaffective disorder; SCZ, schizophrenia; SUD, substance use disorder.
Figure 1Schematic examples of alternative approaches to clustering and finite mixture models based on supervised learning. (A) This example shows the benefit of correcting mislabeled training samples. A supervised classifier trained to separate experimental classes (black and red points) may be forced to use a complex nonlinear decision boundary (blue line) to separate classes if data points are mislabeled (circled). (B) A simpler decision boundary results if the incorrect labels are corrected, for example using a wrapper method (74). (C) In a semisupervised learning context (75), only some data points have labels (black and red points). These can correspond to samples for which a certain diagnosis can be obtained. All other data points are unlabeled, but can still contribute to defining the decision boundary. Hybrid methods (76, 77, 78) combine supervised classification with unsupervised clustering and use multiple linear decision boundaries to separate the healthy class (blue points) from putative disease subgroups (colored points). See text for further details.
Figure 2Schematic examples of alternative approaches to clustering and finite mixture models based on unsupervised learning. (A) Manifold learning techniques aim to find some low-dimensional manifold (right panels) that represent the data more efficiently than the original high-dimensional data (depicted by the cube on the right). Basic dimensionality reduction techniques, such as principal components analysis (PCA), find a single subspace for the data based on maximizing variance. This may not efficiently show structure in high-dimensional data. In contrast, approaches that preserve local distances, such as t-stochastic neighbor (t-SNE) embedding (80), may highlight intrinsic structure more effectively. (B) Novelty detection algorithms, such as the one-class support vector machine (83), aim to find a decision boundary that encloses a set of healthy subjects (blue points), allowing disease profiles to be detected as outliers (red points). Note that this approach does not provide an estimate of the probability density at each point.
Figure 3(A) Normative modeling approaches (22, 85, 86) aim to link a set of clinically relevant predictor variables with a set of quantitative biological response variables while quantifying the variation across this mapping. This is achieved by estimating a nonlinear regression model that provides probabilistic measures of predictive confidence (blue contour lines). These could be certainty estimates derived from a probabilistic model (22) or classical confidence intervals (86) and can be interpreted as centiles of variation within the cohort (blue numerals, right). Predictions for new data points (red) can then be derived that provide measures of predictive confidence to quantify the fit of the new data point to the normative model. [Adapted with permission from (22).] (B) By performing this mapping across different domains of functioning (e.g., different cognitive or clinical domains), many types of abnormal patterns can be detected, including classical disease clusters and also disease continua that describe pathology in terms of a gradual progression rather than in terms of sharply defined clusters (see Supplementary Methods for further details).