| Literature DB >> 35411059 |
Caio Pinheiro Santana1, Emerson Assis de Carvalho2,3, Igor Duarte Rodrigues2, Guilherme Sousa Bastos2, Adler Diniz de Souza4, Lucelmo Lacerda de Brito5.
Abstract
Autism Spectrum Disorder (ASD) diagnosis is still based on behavioral criteria through a lengthy and time-consuming process. Much effort is being made to identify brain imaging biomarkers and develop tools that could facilitate its diagnosis. In particular, using Machine Learning classifiers based on resting-state fMRI (rs-fMRI) data is promising, but there is an ongoing need for further research on their accuracy and reliability. Therefore, we conducted a systematic review and meta-analysis to summarize the available evidence in the literature so far. A bivariate random-effects meta-analytic model was implemented to investigate the sensitivity and specificity across the 55 studies that offered sufficient information for quantitative analysis. Our results indicated overall summary sensitivity and specificity estimates of 73.8% and 74.8%, respectively. SVM stood out as the most used classifier, presenting summary estimates above 76%. Studies with bigger samples tended to obtain worse accuracies, except in the subgroup analysis for ANN classifiers. The use of other brain imaging or phenotypic data to complement rs-fMRI information seems promising, achieving higher sensitivities when compared to rs-fMRI data alone (84.7% versus 72.8%). Finally, our analysis showed AUC values between acceptable and excellent. Still, given the many limitations indicated in our study, further well-designed studies are warranted to extend the potential use of those classification algorithms to clinical settings.Entities:
Mesh:
Year: 2022 PMID: 35411059 PMCID: PMC9001715 DOI: 10.1038/s41598-022-09821-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Screening and selection of studies according to inclusion and exclusion criteria at different stages of the meta-analysis. The numbers between parentheses indicate the total of articles remaining after each step. The numbers separated by indicate the count of articles from the first and second search, respectively. Created with Lucidchart Free https://www.lucidchart.com.
Figure 2Distribution of the selected studies by year of publication and type of ML technique used (MV/MT multiview/multitask learning, RF Random Forest, LR Logistic Regression; LDA Linear Discriminant Analysis). The numbers inside the bars indicate each article. Created with Microsoft Excel 2019.
General characteristics of the studies selected in the systematic review (SR) and the studies and samples included in the meta-analysis (MA).
| Characteristics | Studies (SR) | Studies (MA) | Samples (MA) |
|---|---|---|---|
| Total | 93 | 55 | 132 |
| SVM | 33 | 27 | 54 |
| L-SVM | − | 14 | 33 |
| Other | − | 13 | 21 |
| ANN | 30 | 13 | 44 |
| CNN | − | 5 | 16 |
| Other | − | 8 | 28 |
| M | 19 | 2 | 2 |
| MV/MT | 4 | 3 | 15 |
| RF | 4 | 2 | 2 |
| LR | 2 | 3 | 4 |
| LDA | 1 | 2 | 8 |
| Ridge | − | 1 | 1 |
| XGB | − | 1 | 1 |
| Affine | − | 1 | 1 |
| ABIDE (any version) | 79 | 45 | 121 |
| ABIDE without version | 34 | 21 | 41 |
| ABIDE I − preprocessed | 34 | 19 | 54 |
| ABIDE I + ABIDE II | 7 | 5 | 26 |
| ABIDE I | 2 | 0 | 0 |
| ABIDE II | 2 | 0 | 0 |
| UMCD | 3 | 2 | 2 |
| NDAR | 3 | 2 | 2 |
| Own sample | 3 | 4 | 4 |
| Own sample + ABIDE | 3 | 2 | 2 |
| Others | 2 | 1 | 1 |
| Only rs-fMRI | 73 | 49 | 114 |
| rs-fMRI plus other types of brain imaging data | 11 | 3 | 14 |
| rs-fMRI plus phenotypic information | 9 | 3 | 4 |
| Males and females | 62 | 37 | 80 |
| Not enough information | 26 | 13 | 44 |
| Only males | 5 | 6 | 8 |
| Both above and below 18 y.o. | 42 | 26 | 62 |
| Not enough information | 28 | 10 | 25 |
| Below 18 y.o. | 20 | 20 | 39 |
| Above 18 y.o. | 3 | 4 | 6 |
| Not enough information | − | 33 | 90 |
| Both high- and low-functioning | − | 14 | 30 |
| Only high-functioning | − | 8 | 12 |
Note that for the dataset, sex, and age of the subjects, the sum of the column Studies (MA) is greater than 55 due to articles with multiple samples included in different categories.
L-SVM Linear SVM, CNN Convolutional Neural Network, Ridge Ridge classifier, XGB Extreme Gradient Boosting, Affine Affine-Invariant, y.o. years old, FIQ Full Intelligence Quotient.
Significance values are given in italics.
Figure 3Conceptual map of ML techniques used throughout the articles selected for meta-analysis (number of articles/number of samples). Created with Lucidchart Free https://www.lucidchart.com.
Figure 4Risk of bias and applicability concerns by domain in QUADAS-2 for all the studies selected for the systematic review (left) and considering only the ones included in the meta-analysis (right). Created with Microsoft Excel 2019.
Figure 5SROC curves of all the included studies with summary estimate (a) and the studies using SVM and ANN with their summary estimates and confidence region (b). Created with R Statistics[125] version 4.1.1 using the package mada[126] version 0.5.10.
Figure 6Linear regression models with sample size predicting sensitivity (a) and specificity (b) for all the studies. Created with R Statistics[125] version 4.1.1 using the package mada[126] version 0.5.10.
Figure 7SROC curves of the studies using AAL90, AAL116, or CC200 with their summary estimates and confidence regions. Created with R Statistics[125] version 4.1.1 using the package mada[126] version 0.5.10.