| Literature DB >> 26936938 |
Lorna Harper1, Giorgio G Fumagalli2, Frederik Barkhof3, Philip Scheltens4, John T O'Brien5, Femke Bouwman4, Emma J Burton6, Jonathan D Rohrer1, Nick C Fox1, Gerard R Ridgway7, Jonathan M Schott8.
Abstract
Accurately distinguishing between different degenerative dementias during life is challenging but increasingly important with the prospect of disease-modifying therapies. Molecular biomarkers of dementia pathology are becoming available, but are not widely used in clinical practice. Conversely, structural neuroimaging is recommended in the evaluation of cognitive impairment. Visual assessment remains the primary method of scan interpretation, but in the absence of a structured approach, diagnostically relevant information may be under-utilized. This definitive, multi-centre study uses post-mortem confirmed cases as the gold standard to: (i) assess the reliability of six visual rating scales; (ii) determine their associated pattern of atrophy; (iii) compare their diagnostic value with expert scan assessment; and (iv) assess the accuracy of a machine learning approach based on multiple rating scales to predict underlying pathology. The study includes T1-weighted images acquired in three European centres from 184 individuals with histopathologically confirmed dementia (101 patients with Alzheimer's disease, 28 patients with dementia with Lewy bodies, 55 patients with frontotemporal lobar degeneration), and scans from 73 healthy controls. Six visual rating scales (medial temporal, posterior, anterior temporal, orbito-frontal, anterior cingulate and fronto-insula) were applied to 257 scans (two raters), and to a subset of 80 scans (three raters). Six experts also provided a diagnosis based on unstructured assessment of the 80-scan subset. The reliability and time taken to apply each scale was evaluated. Voxel-based morphometry was used to explore the relationship between each rating scale and the pattern of grey matter volume loss. Additionally, the performance of each scale to predict dementia pathology both individually and in combination was evaluated using a support vector classifier, which was compared with expert scan assessment to estimate clinical value. Reliability of scan assessment was generally good (intraclass correlation coefficient > 0.7), and average time to apply all six scales was <3 min. There was a very close association between the pattern of grey matter loss and the regions of interest each scale was designed to assess. Using automated classification based on all six rating scales, the accuracy (estimated using the area under the receiver-operator curves) for distinguishing each pathological group from controls ranged from 0.86-0.97; and from one another, 0.75-0.92. These results were substantially better than the accuracy of any single scale, at least as good as expert reads, and comparable to previous studies using molecular biomarkers. Visual rating scores from magnetic resonance images routinely acquired as part of the investigation of dementias, offer a practical, inexpensive means of improving diagnostic accuracy.Entities:
Keywords: MRI; brain atrophy; dementia; neuropathology; visual rating
Mesh:
Year: 2016 PMID: 26936938 PMCID: PMC4806219 DOI: 10.1093/brain/aww005
Source DB: PubMed Journal: Brain ISSN: 0006-8950 Impact factor: 13.501
Figure 1Correlation between grey matter volume and visual rating score. Voxel-based morphometry images demonstrating negative partial correlation between grey matter volume and each visual rating scale, adjusted for the other scales (Y = β AC X AC + β OF X OF + β AT X AT + β FI X FI + β MTA X MTA + β PA X PA + β Age X Age + β Gender X Gender + β TIV X TIV + β 1T X 1T + β 3T X 3T + β London X London + β Amsterdam X Amsterdam + μ + e). In all images statistical significance of correlations was corrected for multiple comparisons (family wise error rate P < 0.05). The corresponding visual rating scale reference images are displayed adjacent to each statistical parametric map. R indicates the right hemisphere.
Patient demographics and mean visual rating scores
|
|
|
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 73 | 33 | 40 | 101 | 73 | 28 | 28 | 55 | 24 | 28 | NA |
| Gender (% male) | 52% | 70% | 30% | 61% | 59% | 68% | 75% | 56% | 58% | 50% | None |
| Age at scan (years) | 66.6 (7.9) | 59.9 (4.8) | 72.2 (5.2) | 61.1(11.4) | 55.9 (8.1) | 74.9 (5.5) | 70.1 (5.9) | 61.1 (8.8) | 63.5 (8.5) | 60.4 (8.3) |
c, e, j
|
| Disease duration at scan (years) | NA | NA | NA | 3.7 (3.1) | 4.1 (3.1) | 2.8 (3.1) | 3.0 (2.4) | 3.3 (2.6) | 3.9 (2.1) | 3.0 (3.0) | None |
| Time from scan until death (years) a | NA | NA | NA | 5.6 (3.0) | 5.4 (2.8) | 5.9 (3.4) | 3.5 (2.3) | 5.3 (2.9) | 5.0 (2.9) | 5.6 (3.0) | j, l, m, p |
| MMSE within 6 months b | NA | NA | NA | 17.5 (6.0) | 16.6 (6.3) | 19.4 (4.8) | 20.1 (4.6) | 22.7 (5.9) | 23.3 (5.2) | 21.5 (6.8) |
k
|
| Total intracranial volume (ml) | 1501 (159) | 1442 (126) | 1549 (168) | 1479 (150) | 1478 (158) | 1482 (132) | 1550 (148) | 1498 (149) | 1500 (149) | 1474 (151) | None |
|
| |||||||||||
| Orbito-frontal | 0.9 (0.5) | 0.8 (0.6) | 1.0 (0.5) | 1.6 (0.8) | 1.5 (0.8) | 1.7 (0.7) | 1.5 (0.7) | 2.3 (0.8) | 2.3 (0.8) | 2.2 (0.8) |
c
|
| Anterior cingulate | 1.0 (0.5) | 0.8 (0.5) | 1.2 (0.5) | 1.3 (0.7) | 1.4 (0.7) | 1.2 (0.6) | 1.3 (0.5) | 1.9 (0.8) | 2.0 (0.9) | 1.8 (0.7) |
c, e
|
| Fronto-insula | 1.2 (0.5) | 0.9 (0.4) | 1.5 (0.5) | 1.7 (0.6) | 1.7 (0.6) | 1.7 (0.6) | 1.6 (0.5) | 2.1 (0.6) | 2.3 (0.7) | 2.0 (0.6) |
c
|
| Anterior temporal | 0.9 (0.5) | 0.7 (0.5) | 1.1 (0.4) | 1.5 (0.5) | 1.4 (0.5) | 1.6 (0.5) | 1.3 (0.4) | 2.1 (0.9) | 2.0 (0.9) | 2.2 (0.9) |
c
|
| Medial temporal | 0.6 (0.6) | 0.4 (0.4) | 0.8 (0.7) | 1.6 (0.9) | 1.5 (0.9) | 2.1 (0.9) | 1.1 (0.7) | 2.2 (0.9) | 2.2 (1.0) | 2.2 (1.0) |
c
|
| Posterior | 0.9 (0.7) | 0.7 (0.7) | 1.1 (0.7) | 1.6 (0.9) | 1.8 (0.8) | 1.3 (0.9) | 1.2 (0.8) | 1.3 (0.6) | 1.4 (0.6) | 1.3 (0.7) |
c
|
Data are reported as mean (SD).
a Pathological diagnosis was determined by biopsy in four patients with Alzheimer’s disease and three patients with FTLD patients; therefore, date of death was not available for these patients and they were not included in this analysis.
b MMSE within 6 months of scan date was only available in 116/184 patients.
c Alzheimer’s disease versus control.
d DLB versus control.
e FTLD versus control.
f Early-onset Alzheimer’s disease versus younger controls.
g Late-onset Alzheimer’s disease versus older controls.
h FTLD-Tau versus younger controls.
i FTLD-TDP43 versus younger controls.
j Alzheimer’s disease versus DLB.
k Alzheimer’s disease versus FTLD.
l DLB versus FTLD.
m Early-onset Alzheimer’s disease versus DLB.
n Early-onset Alzheimer’s disease versus FTLD-Tau.
o Early-onset Alzheimer’s disease versus FTLD-TDP43.
p Late-onset Alzheimer’s disease versus DLB.
q Late-onset Alzheimer’s disease versus FTLD-Tau.
r Late-onset Alzheimer’s disease versus FTLD-TDP43.
s DLB versus FTLD-Tau.
t FTLD-Tau versus FTLD-TDP43.
u Early-onset Alzheimer’s disease versus late-onset Alzheimer’s disease.
*lndicates significance at P < 0.001, otherwise P < 0.05; NA = not applicable.
Accuracy of visual assessment for the primary pathology groups
|
Unstructured visual assessment
|
Best single visual rating scale
|
SVC performance based on all scales
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
| AD from controls | 92% (7%) | 86% (10%) | 89% (6%) | MTA (1.5) | 64% (56–72%) | 89% (83–93%) | 77% (69–83%) | 94% (86–97%) | 89% (80–94%) | 92% (83–96%) |
| AD from DLB | 84% (9%) | 58% (17%) | 71% (6%) | MTA (1.5) | 64% (52–75%) | 68% (56–78%) | 66% (54–76%) | 82% (66–91%) | 64% (47–78%) | 73% (56–85%) |
| AD from FTLD | 81% (12%) | 74% (11%) | 77% (10%) | PA (2.5) | 22% (15–30%) | 98% (94–99%) | 60% (50–69%) | 88% (77–94%) | 56% (42–68%) | 72% (59–82%) |
| AD from DLB+FTLD | 67% (13%) | 69% (4%) | 68% (8%) | PA (2.5) | 22% (17–28%) | 86% (81–90%) | 54% (47–61%) | 69% (57–78%) | 68% (56–78%) | 68% (57–78%) |
| DLB from controls | 49% (20%) | 92% (6%) | 70% (9%) | OF (1.5) | 57% (45–69%) | 84% (73–90%) | 70% (58–80%) | 64% (46–79%) | 92% (77–97%) | 78% (60–89%) |
| DLB from AD | 58% (17%) | 84% (9%) | 71% (6%) | OF (1.5) | 57% (45–68%) | 48% (36–59%) | 52% (41–64%) | 64% (47–78%) | 82% (66–91%) | 73% (56–85%) |
| DLB from FTLD | 75% (26%) | 89% (8%) | 82% (12%) | PA (3.0) | 7% (3–16%) | 100% (95–100%) | 54% (41–66%) | 93% (78–98%) | 89% (72–96%) | 91% (75–97%) |
| DLB from AD+FTLD | 34% (16%) | 86% (6%) | 60% (7%) | AC (1.0) | 93% (85–97%) | 11% (6–19%) | 52% (41–63%) | 93% (80–97%) | 55% (39–70%) | 74% (58–85%) |
| FTLD from controls | 82% (5%) | 99% (3%) | 90% (2%) | MTA (1.5) | 82% (73–88%) | 89% (82–94%) | 85% (77–91%) | 89% (77–95%) | 97% (89–99%) | 93% (83–97%) |
| FTLD from AD | 74% (11%) | 81% (12%) | 77% (10%) | OF (2.5) | 55% (45–64%) | 81% (73–87%) | 68% (59–76%) | 56% (42–68%) | 88% (77–94%) | 72% (59–82%) |
| FTLD from DLB | 89% (8%) | 75% (26%) | 82% (12%) | MTA (2.0) | 69% (56–79%) | 82% (70–90%) | 76% (63–85%) | 89% (72–96%) | 93% (78–98%) | 91% (75–97%) |
| FTLD from AD+DLB | 59% (9%) | 80% (11%) | 69% (6%) | AT (2.0) | 64% (55–71%) | 63% (54–71%) | 63% (55–71%) | 78% (66–86%) | 68% (55–78%) | 73% (60–82%) |
a Accuracy of expert diagnosis based on visual assessment of structural imaging ( n = 80, 20 per group). Experts were blinded to all clinical and pathological information except the person’s age. Data are presented as mean (SD) based on six dementia experts. Six raters, n = 80 scans.
b Performance of visual rating scale that most accurately predicts pathology for each binary group comparison. The optimal cut-off points should be interpreted as: < cut-off = normal, ≥ cut-off = abnormal. Sensitivity and specificity values are selected based on the maximum balanced accuracy score. Four raters, n = 257 scans.
c SVC performance based on mean left/right scores for each of the six visual rating scales. All values in parts (b) and (c) are based on mean scores from four raters in the 80-scan subset and two raters in the remaining images, and are presented with 95% confidence intervals in brackets. Four raters, n = 257 scans.
AD = Alzheimer’s disease.
Accuracy of visual rating for the pathology subgroups
|
Best single visual rating scale
|
SVC performance based on all scales
| ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Classification task | Best scale | Sensitivity | Specificity | Balanced accuracy | AUC | Sensitivity | Specificity | Balanced accuracy | AUC |
| EO-AD from younger controls | FI (1.5) | 74% (63–83%) | 94% (86–97%) | 84% (74–90%) | 0.89 (0.80–0.94) | 86% (71–94%) | 100% (91–100%) | 93% (80–98%) | 0.98 (0.87–1.0) |
| EO-AD from LO-AD | PA (1.5) | 75% (63–84%) | 61% (48–72%) | 68% (56–78%) | 0.68 (0.56–0.78) | 67% (49–80%) | 79% (61–89%) | 73% (55–85%) | 0.79 (0.61–0.89) |
| EO-AD from DLB | PA (2.0) | 53% (41–65%) | 75% (63–84%) | 64% (52–75%) | 0.68 (0.55–0.78) | 86% (70–94%) | 71% (53–84%) | 79% (61–89%) | 0.80 (0.63–0.90) |
| EO-AD from FTLD-Tau | PA (2.0) | 53% (40–66%) | 71% (58–81%) | 62% (49–74%) | 0.65 (0.52–0.76) | 78% (59–89%) | 50% (32–68%) | 64% (45–79%) | 0.56 (0.38–0.73) |
| EO-AD from FTLD-TDP43 | PA (2.5) | 26% (17–38%) | 100% (95–100%) | 63% (50–74%) | 0.64 (0.52–0.75) | 67% (49–80%) | 71% (53–84%) | 69% (51–82%) | 0.73 (0.56–0.85) |
| EO-AD from LO-AD+DLB+FTLD | PA (1.5) | 75% (68–81%) | 38% (31–46%) | 57% (49–64%) | 0.60 (0.52–0.67) | 56% (43–67%) | 83% (73–90%) | 69% (57–79%) | 0.70 (0.58–0.79) |
| LO-AD from older controls | MTA (1.5) | 82% (69–90%) | 80% (67–88%) | 81% (68–89%) | 0.86 (0.74–0.93) | 50% (32–68%) | 100% (88–100%) | 75% (55–87%) | 0.78 (0.58–0.89) |
| LO-AD from EO-AD | MTA (2.0) | 68% (55–78%) | 64% (52–75%) | 66% (54–76%) | 0.69 (0.56–0.79) | 79% (61–89%) | 67% (49–80%) | 73% (55–85%) | 0.79 (0.61–0.89) |
| LO-AD from DLB | MTA (1.5) | 82% (68–90%) | 68% (53–80%) | 75% (60–85%) | 0.79 (0.64–0.88) | 43% (25–64%) | 93% (74–98%) | 68% (46–83%) | 0.66 (0.45–0.82) |
| LO-AD from FTLD-Tau | MTA (2.0) | 68% (52–80%) | 42% (28–57%) | 55% (39–69%) | 0.50 (0.35–0.65) | 86% (64–95%) | 58% (37–77%) | 72% (49–87%) | 0.71 (0.49–0.86) |
| LO-AD from FTLD-TDP43 | MTA (0.5) | 96% (87–99%) | 4% (1–13%) | 50% (36–64%) | 0.45 (0.31–0.60) | 86% (65–95%) | 79% (57–90%) | 82% (61–93%) | 0.87 (0.66–0.95) |
| LO-AD from EOAD+DLB+FTLD | MTA (2.0) | 68% (57–77%) | 53% (42–64%) | 60% (49–71%) | 0.61 (0.50–0.71) | 57% (41–72%) | 84% (69–92%) | 71% (54–83%) | 0.71 (0.55–0.83) |
| DLB from (Older) Controls | OF (1.5) | 57% (43–70%) | 83% (70–90%) | 70% (56–80%) | 0.70 (0.56–0.81) | 86% (67–94%) | 40% (24–60%) | 63% (43–79%) | 0.53 (0.34–0.70) |
| DLB from EO-AD | OF (1.5) | 57% (45–69%) | 48% (36–60%) | 53% (40–64%) | 0.50 (0.38–0.63) | 71% (53–84%) | 86% (70–94%) | 79% (61–89%) | 0.80 (0.63–0.90) |
| DLB from LO-AD | AC (0.5) | 100% (93–100%) | 7% (3–18%) | 54% (39–68%) | 0.54 (0.39–0.68) | 93% (74–98%) | 43% (25–64%) | 68% (46–83%) | 0.66 (0.45–0.82) |
| DLB from FTLD-Tau | PA (3.0) | 7% (3–19%) | 100% (92–100%) | 54% (38–68%) | 0.45 (0.30–0.60) | 86% (64–95%) | 67% (44–83%) | 76% (54–89%) | 0.76 (0.53–0.89) |
| DLB from FTLD-TDP43 | PA (2.5) | 7% (3–18%) | 100% (93–100%) | 54% (39–68%) | 0.44 (0.30–0.59) | 71% (50–86%) | 100% (86–100%) | 86% (65–95%) | 0.90 (0.70–0.97) |
| DLB from AD+FTLD | AC (1.0) | 93% (85–97%) | 10% (6–19%) | 52% (41–63%) | 0.43 (0.33–0.54) | 86% (71–93%) | 59% (43–73%) | 72% (56–84%) | 0.76 (0.60–0.86) |
| FTLD-Tau from younger controls | MTA (1.0) | 100% (93–100%) | 82% (68–90%) | 91% (79–96%) | 0.98 (0.89–1.0) | 100% (85–100%) | 100% (85–100%) | 100% (85–100%) | 1.0 (0.85–1.0) |
| FTLD-Tau from EO-AD | OF (2.0) | 83% (72–91%) | 59% (46–71%) | 71% (58–81%) | 0.76 (0.63–0.85) | 50% (32–68%) | 78% (59–89%) | 64% (45–79%) | 0.56 (0.38–0.73) |
| FTLD-Tau from LO-AD | AC (2.0) | 58% (43–72%) | 86% (72–93%) | 72% (56–83%) | 77% (61–87%) | 58% (37–77%) | 86% (64–95%) | 72% (49–87%) | 0.71 (0.49–0.86) |
| FTLD-Tau from DLB | MTA (1.5) | 79% (64–89%) | 68% (52–80%) | 74% (58–84%) | 0.80 (0.65–0.89) | 67% (44–83%) | 86% (64–95%) | 76% (54–89%) | 0.76 (0.53–0.89) |
| FTLD-Tau from FTLD-TDP43 | FI (3.0) | 33% (21–49%) | 86% (72–93%) | 60% (44–73%) | 0.62 (0.46–0.75) | 86% (64–95%) | 17% (6–39%) | 51% (31–71%) | 0.40 (0.22–0.62) |
| FTLD-Tau from AD+DLB+FTLD-TDP43 | FI (2.5) | 50% (38–62%) | 78% (67–86%) | 64% (52–75%) | 0.69 (0.57–0.79) | 50% (34–66%) | 81% (64–90%) | 65% (48–79%) | 0.62 (0.45–0.77) |
| FTLD-TDP43 from younger controls | FI (1.5) | 93% (82–97%) | 94% (84–98%) | 93% (83–97%) | 0.96 (0.87–0.99) | 93% (75–98%) | 100% (86–100%) | 96% (80–99%) | 0.98 (0.83–1.0) |
| FTLD-TDP43 from EO-AD | AT (2.0) | 68% (55–78%) | 73% (60–82%) | 70% (58–80%) | 0.76 (0.64–0.85) | 71% (53–84%) | 67% (49–80%) | 69% (51–82%) | 0.73 (0.56–0.85) |
| FTLD-TDP43 from LO-AD | AC (2.0) | 54% (39–68%) | 86% (72–93%) | 70% (54–81%) | 71% (56–82%) | 79% (57–90%) | 86% (65–95%) | 82% (61–93%) | 0.87 (0.66–0.95) |
| FTLD-TDP43 from DLB | AT (2.0) | 68% (53–80%) | 86% (72–93%) | 77% (62–87%) | 0.81 (0.67–0.90) | 100% (86–100%) | 71% (50–86%) | 86% (65–95%) | 0.90 (0.70–0.97) |
| FTLD-TDP43 from FTLD-Tau | MTA (2.0) | 75% (59–86%) | 42% (28–57%) | 58% (43–72%) | 0.55 (0.39–0.69) | 17% (6–39%) | 86% (64–95%) | 51% (31–71%) | 0.40 (0.22–0.62) |
| FTLD-TDP43 from AD+DLB+FTLD-Tau | AT (2.0) | 68% (57–77%) | 64% (52–73%) | 66% (54–75%) | 0.69 (0.58–0.78) | 71% (55–83%) | 70% (53–82%) | 71% (54–83%) | 0.72 (0.56–0.84) |
a Performance of visual rating scale that most accurately predicts pathology for each binary subgroup comparison. The optimal cut-off points should be interpreted as: < cut-off = normal, ≥ cut-off = abnormal. Sensitivity and specificity values are selected based on the maximum balanced accuracy score. Four raters, n = 254 scans*.
b SVC performance based on mean left/right scores for each of the six visual rating scales. All values in parts (a) and (b) are presented with 95% confidence intervals in brackets. Four raters, n = 254 scans*.
*Three scans from patients with a primary pathology diagnosis of FTLD-FUS were excluded from the classification analysis due to insufficient sample size; AD = Alzheimer’s disease; EO = early onset; LO = late-onset.
Figure 2Receiver operator characteristic curves of support vector machine performance. Receiver operator characteristic plots of SVC performance for prediction of primary pathology groups. AUC values with 95% confidence intervals are displayed for each classifier. AD = Alzheimer’s disease.