| Literature DB >> 36127723 |
Andrew T McKenzie1,2,3,4, Gabriel A Marx1,2,3, Daniel Koenigsberg1,2,3, Mary Sawyer1,2,3, Megan A Iida1,2,3, Jamie M Walker5,6, Timothy E Richardson5,6, Gabriele Campanella1, Johannes Attems7, Ann C McKee8, Thor D Stein8, Thomas J Fuchs1, Charles L White9, Kurt Farrell10,11,12,13, John F Crary14,15,16,17.
Abstract
Age-related cognitive impairment is multifactorial, with numerous underlying and frequently co-morbid pathological correlates. Amyloid beta (Aβ) plays a major role in Alzheimer's type age-related cognitive impairment, in addition to other etiopathologies such as Aβ-independent hyperphosphorylated tau, cerebrovascular disease, and myelin damage, which also warrant further investigation. Classical methods, even in the setting of the gold standard of postmortem brain assessment, involve semi-quantitative ordinal staging systems that often correlate poorly with clinical outcomes, due to imperfect cognitive measurements and preconceived notions regarding the neuropathologic features that should be chosen for study. Improved approaches are needed to identify histopathological changes correlated with cognition in an unbiased way. We used a weakly supervised multiple instance learning algorithm on whole slide images of human brain autopsy tissue sections from a group of elderly donors to predict the presence or absence of cognitive impairment (n = 367 with cognitive impairment, n = 349 without). Attention analysis allowed us to pinpoint the underlying subregional architecture and cellular features that the models used for the prediction in both brain regions studied, the medial temporal lobe and frontal cortex. Despite noisy labels of cognition, our trained models were able to predict the presence of cognitive impairment with a modest accuracy that was significantly greater than chance. Attention-based interpretation studies of the features most associated with cognitive impairment in the top performing models suggest that they identified myelin pallor in the white matter. Our results demonstrate a scalable platform with interpretable deep learning to identify unexpected aspects of pathology in cognitive impairment that can be translated to the study of other neurobiological disorders.Entities:
Keywords: Brain aging; Cognitive impairment; Deep learning; Interpretability; Luxol fast blue; Multiple instance learning; Myelin pathology
Mesh:
Substances:
Year: 2022 PMID: 36127723 PMCID: PMC9490907 DOI: 10.1186/s40478-022-01425-5
Source DB: PubMed Journal: Acta Neuropathol Commun ISSN: 2051-5960 Impact factor: 7.578
Description of cohort subset and whole slide image dataset used in this study
| Category | Non-cognitively impaired | Cognitively impaired | Total group | p-value for difference* |
|---|---|---|---|---|
| Sample size | 367 | 349 | 716 | Not applicable |
| Age (Mean ± SEM) | 83.0 ± 0.58 | 87.4 ± 0.48 | 85.2 ± 0.39 | |
| Proportion female | 0.54 | 0.53 | 0.54 | |
| Mean Braak score | 2.4 | 2.5 | 2.4 | |
| Proportion CERAD neuropathology positive | 0.15 | 0.18 | 0.17 | |
| Proportion hippocampal ARTAG positive | 0.22 | 0.31 | 0.27 | |
| Proportion with hippocampal WSI | 0.99 | 0.99 | 0.99 | |
| Proportion with frontal WSI | 0.46 | 0.46 | 0.46 |
This table describes the pathoclinical characteristics of the subset of brain donors employed in this study. The significance of differences in categorical variables between the non-cognitively impaired and cognitively impaired groups was assessed with a two-proportions z-test, while the significance of differences in numerical variables was assessed with a t-test. WSI = Whole slide image; SEM = Standard error of the mean; ARTAG = Aging-Related Tau Astrogliopathy; CERAD = Consortium to Establish a Registry for Alzheimer's Disease
Fig. 1Workflow for performing weakly supervised deep learning of age-related cognitive impairment. a: Generation of digital neuropathology whole slide images (WSI) with associated cognitive labels. Human brain sections were stained with Luxol fast blue (LFB) and counterstained with hematoxylin & eosin (LH&E). Cognitive labels were generated based on clinical diagnosis, clinical dementia rating (CDR) scores, and/or mini-mental state exam (MMSE) scores. b: WSI were segmented into tiles and passed through a convolutional neural network for feature extraction. The resulting tile-level feature vectors were passed through an attention network. Each feature vector was multiplied by its associated attention score and a weighted summation operation was performed to create slide-level feature vectors. The slide-level feature vectors were then passed through a classification network. The attention and classification networks were trained via backpropagation. c For interpretation analysis, attention heatmaps were created by mapping the attention scores at their associated tile locations in the original WSI. Among the top attention tiles, a dark blue hue range associated with LFB staining was counted and quantified to calculate a slide-level median staining intensity value
Fig. 2Weakly supervised classification predicts cognitive impairment based on whole slide image data from the hippocampus and frontal cortex. a Venn diagram showing the overlap of the measures used for defining the presence of cognitive impairment in brain donors. b Average receiver operating characteristic curves across tenfold cross-validation. Error envelopes show ± 1 standard deviation. Horizontal dotted lines show chance-level predictions. c Summary statistics for test evaluation of model performance across tenfold cross-validation in the frontal cortex and hippocampus. Balanced accuracy refers to the accuracy of predictions weighted by the proportion of labels in both groups in the test split. Horizontal lines are shown at the arithmetic mean values. d, e Probability estimates of cognitive impairment from the top-performing model by each measure of cognitive impairment in the hippocampus (d) and frontal cortex (e). CDR = Clinical Dementia Rating; MMSE = Mini-Mental State Examination; AUC = Area Under the Curve
Fig. 3Differential correlations of cognitive impairment probability estimates and age by clinical cognitive impairment label. Scatter plots for the correlation of age and the deep learning model probability estimates for cognitive impairment in the hippocampus (a) and frontal cortex (b) are shown. Trend lines show predictions using a linear model in each group of data and grey error envelopes show the associated 95% confidence intervals. NCI = Not Cognitively Impaired; CI = Cognitively Impaired
Fig. 4Interpretation of tissue-level attention maps and tile-level staining intensity in the hippocampus suggests myelin loss. a, b Representative WSIs labeled and predicted to be in the non-cognitive impaired (upper) or cognitively impaired (lower) groups (a) and corresponding representative attention heatmaps (b). In these heatmaps, dark red indicates relatively high attention values, while dark blue indicates relatively low attention values. c Top 5 highest attention tiles (upper) and blue hue range positive pixel annotations (lower) from the matching WSIs as shown in sub-figures A/B. Scale bar = 20 μm. d Median z-transformed attention score values in the grey matter and white matter. Each data point is a median attention score from the white matter or the grey matter from one WSI. e, f Median dark blue range pixel counts as a measure of LFB staining intensity (e) and ratio of the dark blue to light blue pixel counts in the top attention tiles of WSIs predicted and labeled to have cognitive impairment or not (f). g Scatter plot and contour lines showing the relationship between dark blue range pixel counts and the ratio of the dark blue to light blue pixel counts in the top attention tiles of WSIs. Orange dots indicate that the WSI was predicted to come from a CI donor, while blue dots indicate NCI. *p < 0.05, ***p < 0.001. GM = Grey Matter; WM = White Matter; CI = Cognitively Impaired; NCI = Not Cognitively Impaired
Fig. 5Interpretation of attention maps and tile-level myelin density in the frontal cortex suggests myelin loss. a, b: Representative WSIs labeled and predicted to be in the non-cognitive impaired (upper) or cognitively impaired (lower) groups (a) and corresponding representative attention heatmaps (b). In these heatmaps, dark red indicates relatively high attention values while dark blue indicates relatively low attention values. c Top 5 highest attention tiles (upper) and blue hue range positive pixel annotations (lower) from the matching WSIs as shown in sub-figures A/B. Scale bar = 20 μm. d Median z-transformed attention score values in the grey matter and white matter. Each data point is a median attention score from the white matter or the grey matter from one WSI. e, f Median dark blue range pixel counts as a measure of LFB staining intensity (e) and ratio of the dark blue to light blue pixel counts in the top attention tiles of WSIs predicted and labeled to have cognitive impairment or not (f). g Scatter plot and contour lines showing the relationship between dark blue range pixel counts and the ratio of the dark blue to light blue pixel counts in the top attention tiles of WSIs. Orange dots indicate that the WSI was predicted to come from a CI individual, while blue dots indicate NCI. ***p < 0.001. GM = Grey Matter; WM = White Matter; CI = Cognitively Impaired; NCI = Not Cognitively Impaired
Fig. 6Deep histopathology features are partially associated with several known clinicopathologic features and partially independent. a Correlation analysis of deep histopathology results and clinicopathologic features: age, Braak score, evidence of cerebrovascular pathology (coded as 0 = not present and 1 = present), ARTAG positivity in the hippocampus (coded as 0 = not present and 1 = present), cognitive label (coded as 0 = not cognitively impaired and 1 = cognitively impaired), probability of cognitive impairment as predicted by the top-performing model trained on the hippocampal data, and median LFB staining intensity in the top attention tiles in the hippocampus data set. Upper right: rank correlation values and associated p-values (*p < 0.05, **p < 0.01, ***p < 0.001). Diagonal: histograms of variables. Lower left: scatterplots with linear model trend lines for the variable pairs (red lines) and 95% confidence intervals (blue envelopes). This plot was made using the R package GGally (v. 2.1.2). b, c Scatter plots for probability of cognitive impairment estimated in the frontal cortex and hippocampus with Braak stage (b) and AT8 staining positive pixel counts in the medial temporal lobe (MTL) (c). Trend lines show predictions via a linear model and grey envelopes show associated 95% confidence intervals. CI = Cognitive impairment; ARTAG = Aging-related tau astrogliopathy; LFB = Luxol Fast Blue