| Literature DB >> 23741425 |
Martin Dyrba1, Michael Ewers, Martin Wegrzyn, Ingo Kilimann, Claudia Plant, Annahita Oswald, Thomas Meindl, Michela Pievani, Arun L W Bokde, Andreas Fellgiebel, Massimo Filippi, Harald Hampel, Stefan Klöppel, Karlheinz Hauenstein, Thomas Kirste, Stefan J Teipel.
Abstract
Diffusion tensor imaging (DTI) based assessment of white matter fiber tract integrity can support the diagnosis of Alzheimer's disease (AD). The use of DTI as a biomarker, however, depends on its applicability in a multicenter setting accounting for effects of different MRI scanners. We applied multivariate machine learning (ML) to a large multicenter sample from the recently created framework of the European DTI study on Dementia (EDSD). We hypothesized that ML approaches may amend effects of multicenter acquisition. We included a sample of 137 patients with clinically probable AD (MMSE 20.6±5.3) and 143 healthy elderly controls, scanned in nine different scanners. For diagnostic classification we used the DTI indices fractional anisotropy (FA) and mean diffusivity (MD) and, for comparison, gray matter and white matter density maps from anatomical MRI. Data were classified using a Support Vector Machine (SVM) and a Naïve Bayes (NB) classifier. We used two cross-validation approaches, (i) test and training samples randomly drawn from the entire data set (pooled cross-validation) and (ii) data from each scanner as test set, and the data from the remaining scanners as training set (scanner-specific cross-validation). In the pooled cross-validation, SVM achieved an accuracy of 80% for FA and 83% for MD. Accuracies for NB were significantly lower, ranging between 68% and 75%. Removing variance components arising from scanners using principal component analysis did not significantly change the classification results for both classifiers. For the scanner-specific cross-validation, the classification accuracy was reduced for both SVM and NB. After mean correction, classification accuracy reached a level comparable to the results obtained from the pooled cross-validation. Our findings support the notion that machine learning classification allows robust classification of DTI data sets arising from multiple scanners, even if a new data set comes from a scanner that was not part of the training sample.Entities:
Mesh:
Year: 2013 PMID: 23741425 PMCID: PMC3669206 DOI: 10.1371/journal.pone.0064925
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Demographic data and MMSE of the subjects.
| AD | controls | |
| No. of subjects (women) | 137 (79) | 143 (72) |
| Age (SD) in years | 72.5 (8.3) | 69.2 (5.9) |
| MMSE (SD) | 20.6 (5.3) | 28.8 (1.1) |
| Years of education (SD) | 10.2 (3.3) | 13.1 (3.8) |
not significantly different between groups, χ 2 (1) = 1.5, p = 0.22.
significantly different between groups, t (278) = 3.92, p<0.001.
significantly different between groups, Mann-Whitney U = 263, p<0.001.
significantly different between groups, t (271) = −6.7, p<0.001.
Abbreviations: SD, standard deviation; MMSE, mini-mental state examination; AD, Alzheimer’s disease.
Scan parameters for DTI and number of subjects per scanner.
| Center | Scanner | Tesla | TR | TE | gradients | b-values | voxel size [mm] | Gap [%] | iPAT | averages | number of subjects (AD) |
| I | Allegra | 3.0 | 5000 | 118 | 30 | 0; 1000 | 2×2×6 | 20 | 2 | 1 | 33 (17) |
| II | Achieva | 3.0 | 12396 | 52 | 15 | 0; 800 | 2×2×2 | 0 | 2 | 2 | 29 (9) |
| III | Trio | 3.0 | 146 | 100 | 60 | 0; 1000 | 2×2×2 | 0 | 2 | 1 | 24 (16) |
| IV | Trio | 3.0 | 11800 | 94 | 61 | 0; 1000 | 2×2×2 | 0 | 2 | 1 | 13 (4) |
| V | Sonata | 1.5 | 8000 | 105 | 6 | 0; 1000 | 2×2×3 | 0 | 2 | 1 | 31 (18) |
| VI | Avanto | 1.5 | 6500 | 95 | 12 | 0; 1000 | 2×2×2.5 | 0 | 2 | 3 | 29 (15) |
| VII | Trio | 3.0 | 9300 | 102 | 12 | 0; 1000 | 2×2×2 | 0 | 2 | 4 | 46 (26) |
| VIII | Avanto | 1.5 | 5100 | 85 | 30 | 0; 1000 | 2×2×2.4 | 20 | 2 | 3 | 40 (15) |
| IX | Verio | 3.0 | 8200 | 93 | 20 | 0; 1000 | 2×2×2 | 0 | 2 | 3 | 35 (20) |
Abbreviations: TR, repetition time; TE, echo time; iPAT, integrated parallel imaging techniques; AD, Alzheimer’s disease.
Figure 1Flow chart of the ML analysis.
SVM classification results for the original and PCA variance reduced data (pooled cross-validation).
| Modality | Accuracy [%] | Sensitivity [%] | Specificity [%] | No. of features [103 voxels] | Reduced variance | |
| FA | original | 80.3 [66.0, 94.7] | 78.8 [57.1, 96.6] | 81.9 [64.3, 100.0] | 26 (11%) | – |
| reduced | | 81.8 [71.4, 100.0] | 78.0 [57.1, 100.0] | 85.5 [65.4, 100.0] | 23 (10%) | 29% | |
| reduced | | 79.9 [66.0, 89.5] | 74.5 [53.4, 96.6] | 85.1[64.3, 100.0] | 22 (9%) | 46% | |
| reduced | | 78.3 [62.5, 89.3] | 74.4 [50.0, 100.0] | 82.0 [57.1, 100.0] | 21 (9%) | 58% | |
| MD | original | 83.3 [69.1, 96.4] | 79.6 [57.1, 100.0] | 86.9 [71.4, 100.0] | 128 (54%) | – |
| reduced | | 83.4 [70.1, 94.7] | 75.9 [55.4, 92.9] | 90.7 [71.4, 100.0] | 67 (28%) | 31% | |
| reduced | | 82.9 [71.4, 93.0] | 74.8 [57.1, 92.9] | 90.6 [75.8, 100.0] | 49 (21%) | 56% | |
| reduced | | 82.2 [67.9, 94.7] | 74.2 [51.8, 92.9] | 89.8 [66.7, 100.0] | 43 (18%) | 63% | |
| WMD | original | 82.7 [67.9, 96.4] | 77.9 [55.4, 92.9] | 87.4 [71.4, 100.0] | 41 (17%) | – |
| reduced | | – | – | – | – | – | |
| reduced | | 81.1 [66.0, 93.0] | 74.2 [50.0, 92.9] | 87.8 [65.4, 100.0] | 60 (25%) | 23% | |
| reduced | | 79.1 [64.3, 92.9] | 72.8 [51.8, 92.9] | 85.2 [64.3, 100.0] | 53 (22%) | 45% | |
| GMD | original | 89.3 [78.6, 100.0] | 87.4 [69.2, 100.0] | 91.2 [72.3, 100.0] | 182 (71%) | – |
| reduced | | – | – | – | – | – | |
| reduced | – | – | – | – | – | |
| reduced | | 74.6 [57.1, 89.3] | 66.3 [40.5, 85.7] | 82.7 [64.3, 100.0] | 20 (8%) | 32% | |
For each modality the average number of informative voxels is provided and in parentheses the proportion compared to the respective tissue masks is presented. In the last column the removed variance proportion is given.
Abbreviations: FA, fractional anisotropy; MD, mean diffusivity; WMD, white matter density; GMD, gray matter density.
Cross-validation results using the data of each scanner as fold.
| Modality | ML algorithm | Accuracy [%] | Sensitivity [%] | Specificity [%] | No. of features [103 voxels] | |
| FA | original | SVM | 73.8 [57.8, 86.0] | 73.0 [13.1, 94.0] | 70.4 [19.3, 98.0] | 11 (5%) |
| NB | 69.4 [52.6, 88.6] | 68.3 [19.0, 100.0] | 70.5 [12.0, 100.0] | |||
| mean corrected | SVM | 76.2 [60.5, 91.1] | 65.0 [44.3, 97.8] | 86.3 [52.0, 100.0] | 49 (21%) | |
| NB | 72.7 [63.8, 89.2] | 66.7 [30.7, 93.8] | 77.8 [62.5, 89.8] | |||
| MD | original | SVM | 63.6 [46.6, 82.3] | 72.2 [25.3, 100.0] | 58.5 [20.0, 97.5] | 129 (55%) |
| NB | 68.0 [57.7, 89.0] | 54.0 [26.0, 97.8] | 81.4 [25.0, 100.0] | |||
| mean corrected | SVM | 78.3 [67.0, 94.8] | 60.8 [26.7, 93.1] | 92.4 [80.3, 100.0] | 157 (66%) | |
| NB | 72.7 [58.1, 88.2] | 63.3 [35.4, 94.0] | 82.0 [33.8, 95.0] | |||
| WMD | original | SVM | 78.8 [58.7, 91.8] | 72.6 [42.0, 98.7] | 85.7 [47.0, 100.0] | 42 (18%) |
| NB | 73.4 [58.8, 81.9] | 67.4 [50.0, 87.6] | 77.6 [57.7, 99.0] | |||
| mean corrected | SVM | 85.4 [71.5, 98.6] | 73.8 [43.3, 97.8] | 96.7 [88.6, 100.0] | 55 (23%) | |
| NB | 73.0 [52.3, 83.6] | 61.5 [31.3, 84.3] | 84.5 [66.3, 98.8] | |||
| GMD | original | SVM | 82.4 [71.9, 97.5] | 82.2 [52.0, 100.0] | 84.1 [48.0, 100.0] | 180 (71%) |
| NB | 69.9 [35.8, 91.4] | 65.0 [0.8, 97.6] | 78.0 [13.7, 100.0] | |||
| mean corrected | SVM | 91.1 [82.7, 100.0] | 84.0 [67.9, 100.0] | 98.3 [95.0, 100.0] | 200 (78%) | |
| NB | 70.4 [61.0, 82.1] | 67.1 [47.3, 91.7] | 74.5 [58.2, 86.0] | |||
For each modality the average number of informative voxels is provided and in parentheses the proportion compared to the respective tissue masks is presented.
Abbreviations: FA, fractional anisotropy; MD, mean diffusivity; WMD, white matter density; GMD, gray matter density; ML, machine learning; SVM, Support Vector Machine; NB, Naïve Bayes.
NB classification results for the original and PCA variance reduced data (pooled cross-validation).
| Modality | Accuracy [%] | Sensitivity [%] | Specificity [%] | |
| FA | original | 70.4 [56.1, 84.0] | 65.4 [42.9, 85.7] | 75.1 [57.1, 93.3] |
| reduced | | 72.2 [54.3, 86.2] | 72.7 [51.8, 92.9] | 71.8 [42.9, 93.3] | |
| reduced | | 71.1 [54.5, 85.7] | 71.2 [50.0, 92.9] | 71.0 [44.7, 92.9] | |
| reduced | | 71.3 [53.6, 85.7] | 70.0 [44.4, 92.9] | 72.6 [46.3, 92.9] | |
| MD | original | 68.8 [53.6, 80.8] | 50. 9 [28.6, 75.2] | 85.9 [71.4, 100.0] |
| reduced | | 71.1 [54.3, 85.7] | 55.6 [32.0, 78.6] | 85.9 [64.3, 100.0] | |
| reduced | | 68.6 [54.3, 80.8] | 43.5 [16.0, 70.4] | 92.7 [78.6, 100.0] | |
| reduced | | 68.6 [55.3, 79.0] | 44.5 [21.4, 71.4] | 91.7 [74.8, 100.0] | |
| WMD | original | 74.7 [53.6, 89.5] | 70.9 [40.5, 92.9] | 78.4 [53.3, 100.0] |
| reduced | | – | – | – | |
| reduced | | 72.4[53.6, 91.2] | 62.5 [35.7, 89.2] | 82.0 [57.1, 100.0] | |
| reduced | | 68.6 [57.1, 82.1] | 57.0 [30.8, 78.6] | 79.7 [58.5, 100.0] | |
| GMD | original | 69.9 [51.7, 84.0] | 61.6 [35.7, 82.3] | 78.1 [55.1, 92.9] |
| reduced | | – | – | – | |
| reduced | | – | – | – | |
| reduced | | 66.0 [42.9, 85.7] | 63.3 [40.5, 85.7] | 68.6 [40.0, 92.9] | |
Abbreviations: FA, fractional anisotropy; MD, mean diffusivity; WMD, white matter density; GMD, gray matter density.
Figure 2SVM sensitivity maps (upper 5% percentiles).
Sensitivity maps for (A) FA, (B) MD, (C) WMD, and (D) GMD. The maps display the relative importance of each voxel for the classification decision, with white/yellow areas being more important than red areas. Preceding SVM classification, voxels that did not contribute any information to the group separation of AD and HC were masked out (IG criterion). The slices shown are: −46, −38, −28, −20, −10, −2, 8, 16, 26, 34, and 44 in MNI space.
Anatomic areas of the twenty most informative voxels derived from the averaged SVM sensitivity maps for FA.
| Coordinates (mm) | |||||
| Region | Side | x | y | z | Sensitivity |
| Cuneus WM | L | −29 | −65 | 14 | 1.00 |
| Precentral gyrus WM | L | −17 | −51 | 36 | 0.98 |
| Parietal lobe WM | L | 2 | −18 | 23 | 0.98 |
| Temporal lobe WM | R | 32 | −21 | 20 | 0.94 |
| Parahippocampal gyrus WM | R | 24 | −23 | −21 | 0.93 |
| Uncus WM | L | −35 | −2 | −27 | 0.92 |
| Parietal lobe WM | L | 6 | −5 | 24 | 0.92 |
| Uncus WM | R | 33 | −6 | −24 | 0.91 |
| Parahippocampal gyrus WM | L | −38 | −18 | −14 | 0.90 |
| Temporal lobe WM | L | −2 | −6 | 14 | 0.90 |
| Fornix | L | 35 | −48 | 2 | 0.89 |
| Postcentral gyrus WM | L | −17 | −29 | 27 | 0.88 |
| Cingulate gyrus WM | R | 12 | 20 | 18 | 0.87 |
| Postcentral gyrus WM | L | −17 | −11 | 29 | 0.87 |
| Frontal lobe WM | L | −29 | 2 | −12 | 0.87 |
| Fusiform gyrus WM | L | 48 | −50 | −14 | 0.85 |
| Temporal lobe WM | L | −36 | −65 | −9 | 0.85 |
| Corpus callosum | R | −38 | −35 | −5 | 0.84 |
| Cuneus WM | L | −27 | −39 | 20 | 0.84 |
| Insula WM | R | 27 | −45 | 17 | 0.84 |
These points were restricted to be at least 10.5 mm distant from each other. The coordinates given are in MNI space. For easier interpretation, we first applied the natural logarithm to the sensitivity values and then rescaled them to be between zero and one.
Abbreviations: WM, white matter; L, left hemisphere; R, right hemisphere.
Anatomic areas of the twenty most informative voxels derived from the averaged SVM sensitivity maps for MD.
| Coordinates (mm) | |||||
| Region | Side | x | y | z | Sensitivity |
| Inferior frontal gyrus WM | R | 39 | 27 | 2 | 1.00 |
| Parahippocampal gyrus WM | R | 20 | −11 | −29 | 1.00 |
| Middle occipital gyrus WM | R | 24 | −90 | 18 | 0.98 |
| Fusiform gyrus WM | L | −48 | −62 | −12 | 0.97 |
| Superior parietal lobule WM | L | −29 | −62 | 54 | 0.96 |
| Inferior temporal gyrus WM | R | 60 | −30 | −24 | 0.95 |
| Middle occipital gyrus WM | R | 36 | −89 | 11 | 0.95 |
| Middle frontal gyrus WM | R | 44 | 41 | 14 | 0.94 |
| Inferior frontal gyrus WM | L | −45 | 30 | −2 | 0.94 |
| Cerebellum WM | R | 8 | −54 | −27 | 0.93 |
| Lingual gyrus WM | R | −33 | −14 | −8 | 0.92 |
| Fusiform gyrus WM | R | 30 | −77 | −14 | 0.91 |
| Parahippocampal gyrus WM | L | −29 | −8 | −30 | 0.91 |
| Putamen WM | L | −24 | 2 | −6 | 0.91 |
| Middle frontal gyrus WM | R | 44 | 15 | 32 | 0.91 |
| Inferior temporal gyrus WM | R | 59 | −51 | −17 | 0.91 |
| Putamen WM | L | −14 | 23 | −8 | 0.91 |
| Putamen WM | R | 35 | −20 | −5 | 0.90 |
| Superior temporal gyrus WM | L | −38 | 2 | −18 | 0.90 |
| Supramarginal gyrus WM | L | −57 | −41 | 38 | 0.90 |
These points were restricted to be at least 10.5 mm distant from each other. The coordinates given are in MNI space. For easier interpretation, we first applied the natural logarithm to the sensitivity values and then rescaled them to be between zero and one.
Abbreviations: WM, white matter; L, left hemisphere; R, right hemisphere.
Anatomic areas of the twenty most informative voxels derived from the averaged SVM sensitivity maps for WMD.
| Coordinates (mm) | |||||
| Region | Side | x | y | z | Sensitivity |
| Parahippocampal gyrus WM | R | 24 | −23 | −24 | 1.00 |
| Limbic lobe WM | R | 29 | −29 | −6 | 0.99 |
| Parahippocampal gyrus WM | L | −26 | −24 | −20 | 0.98 |
| Middle temporal gyrus WM | R | 56 | −62 | 2 | 0.97 |
| Middle frontal gyrus WM | L | −29 | 9 | 45 | 0.96 |
| Inferior temporal gyrus WM | L | 50 | 12 | −18 | 0.95 |
| Superior temporal gyrus WM | L | −59 | 2 | −9 | 0.95 |
| Superior temporal gyrus WM | R | 56 | 3 | −12 | 0.95 |
| Middle occipital gyrus WM | L | −36 | −75 | 6 | 0.95 |
| Supramarginal gyrus WM | R | 59 | −42 | 33 | 0.95 |
| Middle temporal gyrus WM | R | 54 | 2 | −27 | 0.94 |
| Lingual gyrus WM | R | 50 | 12 | 2 | 0.93 |
| Middle temporal gyrus WM | R | 63 | −17 | −17 | 0.93 |
| Precentral gyrus WM | R | 24 | −51 | 2 | 0.93 |
| Insula WM | R | 51 | 0 | 6 | 0.92 |
| Fornix | L | −2 | −6 | 11 | 0.92 |
| Inferior parietal Lobule WM | L | −60 | −26 | 38 | 0.92 |
| Cerebellum WM | R | 27 | −56 | −54 | 0.90 |
| Lentiform nucleus, lateral globus pallidus WM | R | 23 | −5 | −14 | 0.90 |
| Superior temporal gyrus WM | R | −56 | −32 | −18 | 0.90 |
These points were restricted to be at least 10.5 mm distant from each other. The coordinates given are in MNI space. For easier interpretation, we first applied the natural logarithm to the sensitivity values and then rescaled them to be between zero and one.
Abbreviations: WM, white matter; L, left hemisphere; R, right hemisphere.
Anatomic areas of the twenty most informative voxels derived from the averaged SVM sensitivity maps for GMD.
| Coordinates (mm) | ||||||
| Region | Brodmann area | Side | x | y | z | Sensitivity |
| Middle frontal gyrus | 6 | L | −26 | −2 | 48 | 1.00 |
| Caudate tail | L | −21 | −9 | −11 | 0.97 | |
| Lentiform nucleus, lateral globus pallidus | R | 27 | −15 | −11 | 0.97 | |
| Precuneus | 7 | R | 11 | −60 | 39 | 0.97 |
| Precuneus | 7 | L | −11 | −60 | 42 | 0.94 |
| Hippocampus | R | 33 | −30 | −5 | 0.94 | |
| Precentral gyrus | 9 | L | −33 | 27 | 35 | 0.90 |
| Posterior cingulate | 23 | R | 8 | −53 | 23 | 0.89 |
| Thalamus, pulvinar | L | −5 | −30 | 17 | 0.88 | |
| Superior temporal gyrus | 21 | R | 63 | −15 | −8 | 0.88 |
| Amygdala | L | −38 | −24 | −11 | 0.88 | |
| Supramarginal gyrus | 40 | L | −51 | −47 | 33 | 0.87 |
| Middle occipital gyrus | 19 | R | −36 | −74 | 12 | 0.86 |
| Middle occipital gyrus | 19 | L | 33 | −80 | 12 | 0.86 |
| Middle temporal gyrus | 39 | R | 47 | −59 | 26 | 0.86 |
| Uncus | 20 | R | 36 | −9 | −35 | 0.86 |
| Caudate head | R | 18 | 30 | −3 | 0.86 | |
| Precuneus | 7 | R | 3 | −45 | 53 | 0.84 |
| Middle frontal gyrus | 9 | L | −30 | 44 | 32 | 0.83 |
| Supramarginal gyrus | 40 | R | 51 | −47 | 36 | 0.83 |
These points were restricted to be at least 10.5 mm distant from each other. The coordinates given are in MNI space. For easier interpretation, we first applied the natural logarithm to the sensitivity values and then rescaled them to be between zero and one.
Abbreviations: L, left hemisphere; R, right hemisphere.
Figure 3Comparison of informative voxel clusters.
Comparison of the original cluster maps with the variance reduced ones for (A) FA and (B) MD. The slices shown are: −46, −38, −28, −20, −10, −2, 8, 16, 26, 34, and 44 in MNI space. Red – IG clusters of the original data, Blue – IG clusters of variance reduced data |r|>0.6, Yellow – overlap of both.
Figure 4Principal components and correlated factors for a randomly selected training data set.
Correlations for (A) FA, (B) MD, (C) WMD, and (D) GMD. The first thirteen components each explain at least 1% of the variance in the selected training data set.