| Literature DB >> 29104527 |
Andreia V Faria1, Zifei Liang2, Michael I Miller3, Susumu Mori1.
Abstract
We explored the performance of structure-based computational analysis in four neurodegenerative conditions [Ataxia (AT, n = 16), Huntington's Disease (HD, n = 52), Alzheimer's Disease (AD, n = 66), and Primary Progressive Aphasia (PPA, n = 50)], all characterized by brain atrophy. The independent variables were the volumes of 283 anatomical areas, derived from automated segmentation of T1-high resolution brain MRIs. The segmentation based volumetric quantification reduces image dimensionality from the voxel level [on the order of [Formula: see text](106)] to anatomical structures [[Formula: see text](102)] for subsequent statistical analysis. We evaluated the effectiveness of this approach on extracting anatomical features, already described by human experience and a priori biological knowledge, in specific scenarios: (1) when pathologies were relatively homogeneous, with evident image alterations (e.g., AT); (2) when the time course was highly correlated with the anatomical changes (e.g., HD), an analogy for prediction; (3) when the pathology embraced heterogeneous phenotypes (e.g., AD) so the classification was less efficient but, in compensation, anatomical and clinical information were less redundant; and (4) when the entity was composed of multiple subgroups that had some degree of anatomical representation (e.g., PPA), showing the potential of this method for the clustering of more homogeneous phenotypes that can be of clinical importance. Using the structure-based quantification and simple linear classifiers (partial least square), we achieve 87.5 and 73% of accuracy on differentiating AT and pre-symptomatic HD patents from controls, respectively. More importantly, the anatomical features automatically revealed by the classifiers agreed with the patterns previously described on these pathologies. The accuracy was lower (68%) on differentiating AD from controls, as AD does not display a clear anatomical phenotype. On the other hand, the method identified PPA clinical phenotypes and their respective anatomical signatures. Although most of the data are presented here as proof of concept in simulated clinical scenarios, structure-based analysis was potentially effective in characterizing phenotypes, retrieving relevant anatomical features, predicting prognosis, and aiding diagnosis, with the advantage of being easily translatable to clinics and understandable biologically.Entities:
Keywords: automated MRI analysis; computer aid; pattern recognition; precision medicine; quantitative MRI
Year: 2017 PMID: 29104527 PMCID: PMC5655969 DOI: 10.3389/fnins.2017.00578
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
Demographic and protocol information.
| 208 | 20–95 | 57.9 ± 18.8 | 98/110 | Phillips, 1.5, 1 × 0.875 × 0.875, 28 | |
| Phillips, 1.5, 1.2 × 0.94 × 0.94, 9 | |||||
| Phillips, 3, 1 × 1 × 1, 7 | |||||
| Phillips, 3, 1.2 × 1 × 1, 26 | |||||
| Phillips, 3, 0.9 × 0.9 × 0.9, 52 | |||||
| Phillips, 3, 1.1 × 0.83 × 0.83, 16 | |||||
| GE, 1.5, 1.2 × 0.94 × 0.94, 6 | |||||
| GE, 3, 1.2 × 1.02 × 1.02, 6 | |||||
| Siemens, 1.5, 1.2 × 1.25 × 1.25, 7 | |||||
| Siemens, 3, 1 × 1 × 1, 28 | |||||
| Siemens, 3, 1.2 × 1 × 1, 23 | |||||
| 16 | 48–73 | 60.8 ± 6.8 | 13/3 | Phillips, 3, 1.1 × 0.83 × 0.83, 16 | |
| Far from onset | 23 | 21–51 | 36.8 ± 9.7 | 10/13 | |
| Near to onset | 16 | 20–55 | 45.1 ± 8.6 | 13/3 | Phillips, 3, 0.9 × 0.9 × 0.9, 52 |
| Early symptoms | 13 | 30–59 | 50.8 ± 7.9 | 7/6 | |
| 66 | 55–93 | 74 ± 10.5 | 40/26 | Siemens, 3, 1.2 × 1 × 1, 27 | |
| Siemens, 1.5, 1.2 × 1.25 × 1.25, 7 | |||||
| Phillips, 1.5, 1.2 × 0.94 × 0.94, 7 | |||||
| Phillips, 3, 1.2 × 1 × 1, 8 | |||||
| GE, 1.5, 1.2 × 0.94 × 0.94, 9 | |||||
| GE, 3, 1.2 × 1.02 × 1.02, 8 | |||||
| Lv | 18 | 51–79 | 68.3 ± 5.4 | 10/8 | Siemens, 3, 1 × 1 × 1, 21 |
| Sv | 16 | 57–77 | 65.5 ± 6.5 | 11/5 | Phillips, 3, 1.2 × 1 × 1, 29 |
| Av | 16 | 48–84 | 68.2 ± 10.7 | 9/7 |
Figure 1Schematic representation of the automated image parcellation using a multi-atlas likelihood fusion (MALF) algorithm. Each brain image is mapped to each atlas, and the pre-defined labels are correlated with each original brain. The labels can be grouped into five ontological hierarchical levels (L1–L5). By this process, the images are converted to matrices of structures by image features; in the present study, we used the regional volumes.
Figure 2Biplot of scores and loadings from a PLS-DA analysis between controls (C) and patients with ataxia (at). The loading weights of the regional volumes, or the importance of regional atrophy in the classifier, are color-coded on the axial MRIs (radiological view). This gives, at a glance, a snapshot of the important features of the disease. In this case, the volume of the cerebellum (component 1) and brainstem/mesencephalon (component 2) had the highest absolute weights, in agreement with the physiopathology of ataxia. At the bottom left, the classifier accuracy in an external test set is reported. The actual brain images of the patients used in the model are shown.
Figure 3PLS-DA biplot of controls (C) and patients with early symptoms of Huntington's Disease (HD). The deep gray matter has the highest absolute weight, in agreement with the anatomical pattern typically described and visually detectable. At the bottom left, we report the accuracy of this model on classifying pre-symptomatic individuals close to HD onset. The actual brain images of two participants are shown.
Figure 4PLS-DA biplot of controls (C) and individuals with AD (A). The overlap between groups is likely due to the heterogeneity and subtleness of imaging features (see the map of loading weights at the bottom left). The anatomical images (brain MRIs) show that individuals at the extremes of the groups show marked anatomical features, while those in the intermediary zone have dubious (both quantitative and qualitative) findings. The colors overlaid in the brain MRIs code the z-scores of the volume (i.e., the regional degree of atrophy); blue is atrophy, red is enlargement. They also show how the quantitative information can be delivered in an understandable way. Using the higher level of granularity, it was possible to create a model with accuracy greater than by chance (bottom right), although lower than in the cases reported before. This was evidenced by the probability plot's showing less segregation between individuals of different groups (bottom center).
Figure 5Potential of detecting subgroups in heterogeneous pathologies. The top row is a supervised analysis, with knowledge about the PPA variant; the bottom row is unsupervised, based only on image features. The colors in the plots code three PPA variants (L = logopenic, S = semantic, A = agrammatic). The PCA plot (top left) shows a natural segregation between the variants. Without any clinical information, the images are clustered with high accuracy (Rand Index = 0.71) (bottom left). The anatomical features extracted in the PLS-DA model (center) when patients are grouped by clinical information (top right) or clustered by image features (bottom right) are very similar, and agree with the anatomical features described for the variants, indicating that both methods yield groups based on the same anatomical pattern.
Figure 6Pattern recognition and probabilistic diagnostic plots. This figure was created by inputting the probability of classification of each individual in different groups, i.e., the individual's chances of belonging to different groups, given by the PLS models. In the star plot (left), each star is an individual, and the colors are their true diagnosis. The x and y axis represent the diagnosis according to our classification models. The point where the stars cross the circles in each axis represents the probability of an individual's being labeled as having the diagnosis coded by that axis. The fact that the stars are elongated where the color (true diagnosis) agrees with the axis diagnosis indicates that the vast majority of patients are correctly classified. At right, a different representation of the same data, easier to visualize the probability of diagnoses in a single individual. Now, the colors represent the diagnosis given by our classification models. Each line represents an individual; the crossing point between the colored lines and the axial lines represents the probability of such an individual's being given that diagnosis. The four small panels at right show the probability curves of diagnosis for four selected individuals (bold arrows), color-coded by the true diagnosis. Y axis ranges from 0 to 1 and encodes the chance of the selected individual of being classified, by the algorithm, with the diagnosis in the X axis. For instance, the individual in the upper left quadrant has almost no chance (close to 0) of being classified as AT, a low chance of being classified as HD, a higher chance of being classified as control, and a high chance of being classified as AD. In fact, this individual had AD, as revealed by the color (purple) that represents the true diagnosis.