| Literature DB >> 35101172 |
Christian Beckmann1,2,3, Andre F Marquand1,2, Saige Rutherford1,2,4, Charlotte Fraza1,2, Richard Dinga1,2, Seyed Mostafa Kia1,2,5, Thomas Wolfers6,7, Mariam Zabihi1,2, Pierre Berthet6,7, Amanda Worker8, Serena Verdi9,10, Derek Andrews11, Laura Km Han12,13, Johanna Mm Bayer14,15, Paola Dazzan8,16, Phillip McGuire17, Roel T Mocking18, Aart Schene1,19, Chandra Sripada4, Ivy F Tso4, Elizabeth R Duval4, Soo-Eun Chang4, Brenda Wjh Penninx12,13, Mary M Heitzeg4, S Alexandra Burt20, Luke W Hyde21, David Amaral11, Christine Wu Nordahl11, Ole A Andreasssen7,22, Lars T Westlye6,7,22, Roland Zahn23, Henricus G Ruhe1,19.
Abstract
Defining reference models for population variation, and the ability to study individual deviations is essential for understanding inter-individual variability and its relation to the onset and progression of medical conditions. In this work, we assembled a reference cohort of neuroimaging data from 82 sites (N=58,836; ages 2-100) and used normative modeling to characterize lifespan trajectories of cortical thickness and subcortical volume. Models are validated against a manually quality checked subset (N=24,354) and we provide an interface for transferring to new data sources. We showcase the clinical value by applying the models to a transdiagnostic psychiatric sample (N=1985), showing they can be used to quantify variability underlying multiple disorders whilst also refining case-control inferences. These models will be augmented with additional samples and imaging modalities as they become available. This provides a common reference platform to bind results from different studies and ultimately paves the way for personalized clinical decision-making.Entities:
Keywords: big data; brain chart; growth chart; human; individual prediction; lifespan; neuroscience; normative model
Mesh:
Year: 2022 PMID: 35101172 PMCID: PMC8828052 DOI: 10.7554/eLife.72904
Source DB: PubMed Journal: Elife ISSN: 2050-084X Impact factor: 8.713
Sample description and demographics.
mQC refers to the manual quality checked subset of the full sample. ‘All’ rows=Train+Test. Clinical refers to the transdiagnostic psychiatric sample (diagnostic details in Figure 2A).
| N (subjects) | N (sites) | Sex (%F/%M) | Age (Mean, S.D) | ||
|---|---|---|---|---|---|
| Full | All | 58,836 | 82 | ||
| Training set | 29,418 | 82 | 51.1/48.9 | 46.9, 24.4 | |
| Test set | 29,418 | 82 | 50.9/49.1 | 46.9, 24.4 | |
| mQC | All | 24,354 | 59 | ||
| Training set | 12,177 | 59 | 50.2/49.8 | 30.2, 24.1 | |
| Test set | 12,177 | 59 | 50.4/49.4 | 30.1, 24.2 | |
| Clinical | Test set | 1985 | 24 | 38.9/61.1 | 30.5, 14.1 |
| Transfer | Test set | 546 | 6 | 44.5/55.5 | 24.8, 13.7 |
Figure 1.Normative model overview.
(A) Age density distribution (x-axis) of each site (y-axis) in the full model train and test, clinical, and transfer validation set. (B) Age count distribution of the full sample (N=58,836). (C, D) Examples of lifespan trajectories of brain regions. Age is shown on x-axis and predicted thickness (or volume) values are on the y-axis. Centiles of variation are plotted for each region. In (C), we show that sex differences between females (red) and males (blue) are most pronounced when modeling large-scale features such as mean cortical thickness across the entire cortex or total gray matter volume. These sex differences manifest as a shift in the mean in that the shape of these trajectories is the same for both sexes, as determined by sensitivity analyses where separate normative models were estimated for each sex. The explained variance (in the full test set) of the whole cortex and subcortex is highlighted inside the circle of (D). All plots within the circle share the same color scale. Visualizations for all ROI trajectories modeled are shared on GitHub for users that wish to explore regions not shown in this figure.
Figure 2.Normative modeling in clinical cohorts.
Reference brain charts were transferred to several clinical samples (described in (A)). Patterns of extreme deviations were summarized for each clinical group and compared to matched control groups (from the same sites). (B) Shows extreme positive deviations (thicker/larger than expected) and (C) shows the extreme negative deviation (thinner/smaller than expected) patterns. (D) Shows the significant (FDR corrected p<0.05) results of classical case-control methods (mass-univariate t-tests) on the true cortical thickness data (top row) and on the deviations scores (bottom row). There is unique information added by each approach which becomes evident when noticing the maps in (B–D) are not identical. ADHD, attention-deficit hyperactive disorder; ASD, autism spectrum disorder; BD, bipolar disorder; EP, early psychosis; FDR, false discovery rate; MDD, major depressive disorder; SZ, schizophrenia.
Figure 3.Evaluation metrics across all test sets.
The distribution of evaluation metrics in four different test sets (full, mQC, patients, and transfer, see Materials and methods) separated into left and right hemispheres and subcortical regions, with the skew and excess kurtosis being measures that depict the accuracy of the estimated shape of the model, ideally both would be around zero. Note that kurtosis is highly sensitive to outlying samples. Overall, these models show that the models fit well in term of central tendency and variance (explained variance and MSLL) and model the shape of the distribution well in most regions (skew and kurtosis). Code and sample data for transferring these models to new sites not included in training is shared.
The patterns appear to be robust and consistent across the different test sets.
Figure 3—figure supplement 1.Comparison of the explained variance in cortical thickness across the different test sets.
The patterns appear to be robust and consistent across the different test sets.
Figure 3—figure supplement 3.Per site explained variance across the different test sets.
Video 1."Demonstation of the functionality of our interactive online visualization tool (https://brainviz-app.herokuapp.com/) that is available for all evaluation metrics across all test sets.
The code for creating this website can be found on GitHub (https://github.com/saigerutherford/brainviz-app; copy archived at swh:1:rev:021fff9a48b26f2d07bbb4b3fb92cd5202418905; Rutherford, 2022b).
Figure 3—figure supplement 2.Showing the explained variance for each brain region across 10 randomized resampling of the full control test set.
Author response image 1.Example of the online interactive visualizations created to help interpret the evaluation metrics.
This interactive figure was created for each evaluation metric (EV, MSLL, skew, and kurtosis) and all test sets (full controls, mQC controls, clinical, and transfer).