| Literature DB >> 30534065 |
Andrei Irimia1,2, Xiaoyu Lei1, Carinna M Torgerson1, Zachary J Jacokes1, Sumiko Abe1, John D Van Horn1.
Abstract
Despite substantial efforts, it remains difficult to identify reliable neuroanatomic biomarkers of autism spectrum disorder (ASD) based on magnetic resonance imaging (MRI) and diffusion tensor imaging (DTI). Studies which use standard statistical methods to approach this task have been hampered by numerous challenges, many of which are innate to the mathematical formulation and assumptions of general linear models (GLM). Although the potential of alternative approaches such as machine learning (ML) to identify robust neuroanatomic correlates of psychiatric disease has long been acknowledged, few studies have attempted to evaluate the abilities of ML to identify structural brain abnormalities associated with ASD. Here we use a sample of 110 ASD patients and 83 typically developing (TD) volunteers (95 females) to assess the suitability of support vector machines (SVMs, a robust type of ML) as an alternative to standard statistical inference for identifying structural brain features which can reliably distinguish ASD patients from TD subjects of either sex, thereby facilitating the study of the interaction between ASD diagnosis and sex. We find that SVMs can perform these tasks with high accuracy and that the neuroanatomic correlates of ASD identified using SVMs overlap substantially with those found using conventional statistical methods. Our results confirm and establish SVMs as powerful ML tools for the study of ASD-related structural brain abnormalities. Additionally, they provide novel insights into the volumetric, morphometric, and connectomic correlates of this epidemiologically significant disorder.Entities:
Keywords: DTI; MRI; autism; machine learning; neuroimaging; support vector machine
Year: 2018 PMID: 30534065 PMCID: PMC6276724 DOI: 10.3389/fncom.2018.00093
Source DB: PubMed Journal: Front Comput Neurosci ISSN: 1662-5188 Impact factor: 2.380
Figure 1Percentage variance (σ2) explained by the PCs of the study dataset. The first ~63 PCs were found to explain ~95% of the total variance and are highlighted by a red rectangle. All PCs are sorted in descending order of the percentage σ2 explained. The log-log plot obviates how components beyond the ~63rd PC contribute to σ2 negligibly.
ASD and HC cohort demographics and DAS scores by domain.
| ASD | 12.74 (2.79) | 1.01:1 | 102.56 (20.23) | 101.17 (17.81) | 99.79 (17.17) | 101.65 (19.44) | 100.78 (18.15) |
| HC | 13.04 (2.95) | 1.08:1 | 109.95 (15.64) | 107.13 (14.82) | 105.13 (13.26) | 108.86 (14.78) | 107.20 (13.73) |
V, verbal; NV, non-verbal; S, spatial; GCA, general conceptual ability; SNC, spatial non-verbal composite.
The sex ratio is reported as the number of males for every female in each sample. Averages are reported, and the standard deviation is shown in parentheses.
ASD volunteers' mean scores for each ADOS domain.
| 9.30 (3.57) | 2.58 (1.76) | 11.88 (4.25) | 6.78 (2.04) | 19.13 (5.41) | 16.06 (4.25) | 5.90 (2.45) |
AB, social affect; AD, behavior; AB + AD, overall total score; C, comparison score and for each ADI domain T, social affect; R, communication; S, behavior, with standard deviations in parentheses.
SVM-identified brain structure measures which can together distinguish among patients based on their diagnosis (i.e., ASD vs. TD, Welch's T test) or on the interaction between sex and diagnosis (two-way ANOVA).
| Medial orbital sulcus | Right | Thickness | 2.388 | 189.649 | 0.018* | 3.113 | 0.027* |
| Straight gyrus | Right | CD | 2.124 | 187.787 | 0.035* | 2.351 | 0.074* |
| Left | Thickness | 2.150 | 183.505 | 0.033* | 3.409 | 0.019* | |
| Left | Volume | 2.190 | 186.758 | 0.030* | 4.618 | 0.004* | |
| Inferior frontal gyrus, orbital part | Left | CD | −2.138 | 170.856 | 0.034* | 2.655 | 0.050* |
| Temporal pole | Left | Curvature | 2.909 | 190.090 | 0.004* | 3.929 | 0.009* |
| Right | Thickness | −1.971 | 183.857 | 0.049* | 2.883 | 0.037* | |
| Parahippocampal gyrus | Right | Volume | −2.026 | 166.292 | 0.044* | 3.394 | 0.019* |
| Left | Volume | −2.304 | 190.792 | 0.022* | 2.254 | 0.083* | |
| Superior temporal gyrus | Right | Curvature | 2.080 | 182.553 | 0.039* | 3.052 | 0.030* |
| Isthmus of the cingulate gyrus | Right | Volume | 2.276 | 188.168 | 0.024* | 3.606 | 0.014* |
| Left | Area | 2.543 | 186.679 | 0.012* | 3.002 | 0.032* | |
| Pericallosal sulcus | Left | Area | 2.318 | 178.371 | 0.022* | 2.320 | 0.077* |
| Right | Area | 1.962 | 189.338 | 0.049* | 1.271 | 0.285* | |
| Cuneus | Right | Area | 2.223 | 186.596 | 0.027* | 3.676 | 0.013* |
| Left | Area | 1.957 | 185.521 | 0.049* | 1.976 | 0.119* | |
| Superior and transverse occipital sulci | Left | CD | 2.069 | 181.599 | 0.040* | 1.837 | 0.142* |
| Occipital pole | Right | Area | 2.454 | 190.459 | 0.015* | 4.570 | 0.004* |
| Left | Area | 2.443 | 180.449 | 0.016* | 3.536 | 0.016* | |
In the first case (diagnosis), a posteriori statistical analysis indicates that all identified measures exhibit statistically significant differences between the study (ASD) group and the control (TD) group. Welch's T-test with a significance level of α < 0.05 was used; t statistics, the associated d. f. and p-values are reported. A positive t statistic indicates that a corresponding measure's mean over the study (ASD) group is significantly larger than its mean over the control (TD) group. A negative t statistic indicates the reverse. To estimate the d. f., Welch's T-test relies on the Welch-Satterthwaite approximation, which involves the variances of the two samples. Thus, unlike in the case of Student's T-test, the d. f. of Welch's T test can differ whenever sample variances also differ. In the second case (diagnosis × sex interaction), a two-way ANOVA (factors: ASD diagnosis, sex) was used to identify the brain features which, in addition to being able to distinguish ASD from TD subjects, can also distinguish these subjects based on their sex. In this second analysis, the F statistic has 3 and 189 d. f. In both analyses, the null hypothesis is rejected at a significance level α < 0.05 subject to a multiple comparison correction, and statistically significant p-values are marked with an asterisk.
Figure 2MDS representation which illustrates the ability of structural brain variables to distinguish ASD patients from TD subjects. The cortical surface of each participant is shown, with the brains of participants belonging to each group being surrounded by a circle whose color indicates cohort membership (ASD in yellow, TD in blue). The spatial coordinates of each brain are specified by the MDS projection of each volunteer's brain structure descriptors from a hyperspace containing all descriptive variables to 3D space. Each coordinate axis in this 3D representation corresponds to each of the first three MDS eigenvectors of the matrix YY (see Methods), accounting for the largest amount of variance in the data. In this representation, any pair of subjects whose brains are located farther apart from each other differ more in their structural features than pairs of subjects whose brains are closer. The excellent separation between the two cohorts is apparent due to the clustering of ASD patients away from the cluster of TD subjects. The visualization was produced within the INVIZIAN software package (Bowman et al., 2012).
Figure 3Visual depiction of SVM-identified brain regions whose volumetric, morphometric, and/or connectomic features can together distinguish among patients based on the interaction between sex and diagnosis. First, a posteriori two-way ANOVA (factors: ASD diagnosis, sex) revealed brain locations whose features, in addition to being able to distinguish ASD from TD subjects, can also distinguish these subjects based on their sex (see Table 1 and text). Then, the ANOVA test statistic (F3, 189) was plotted on the cortical surface of an average brain, with color shades at each cortical location encoding the F statistic, which ranges from 0 (white) to a maximum of 4.618 (bright green).