| Literature DB >> 33919878 |
Nadire Cavus1,2, Abdulmalik A Lawan1,3, Zurki Ibrahim4, Abdullahi Dahiru5, Sadiya Tahir6, Usama Ishaq Abdulrazak7, Adamu Hussaini3,8.
Abstract
Autism spectrum disorder (ASD) is associated with significant social, communication, and behavioral challenges. The insufficient number of trained clinicians coupled with limited accessibility to quick and accurate diagnostic tools resulted in overlooking early symptoms of ASD in children around the world. Several studies have utilized behavioral data in developing and evaluating the performance of machine learning (ML) models toward quick and intelligent ASD assessment systems. However, despite the good evaluation metrics achieved by the ML models, there is not enough evidence on the readiness of the models for clinical use. Specifically, none of the existing studies reported the real-life application of the ML-based models. This might be related to numerous challenges associated with the data-centric techniques utilized and their misalignment with the conceptual basis upon which professionals diagnose ASD. The present work systematically reviewed recent articles on the application of ML in the behavioral assessment of ASD, and highlighted common challenges in the studies, and proposed vital considerations for real-life implementation of ML-based ASD screening and diagnostic systems. This review will serve as a guide for researchers, neuropsychiatrists, psychologists, and relevant stakeholders on the advances in ASD screening and diagnosis using ML.Entities:
Keywords: artificial intelligence; autism spectrum disorder; diagnosis; machine learning; screening
Year: 2021 PMID: 33919878 PMCID: PMC8070763 DOI: 10.3390/jpm11040299
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Figure 1PRISMA flow diagram of the search results.
Inclusion and exclusion criteria of the study.
| Inclusion Criteria |
|---|
| Journal articles published in the English language |
| Documents published within the last ten years from 2011 to date |
| Full-text papers that are accessible and downloadable |
| Studies that utilized behavioral data |
| Studies that employed machine learning as the main technique |
| Studies that considered autism as the main disorder assessed |
| Exclusion criteria |
| Papers that are written in other languages |
| Duplicated papers |
| Full-text of the document is not accessible on the internet |
| The study aim is not clearly defined |
| Studies that are not relevant to the stated research question |
| Relevant studies, but machine learning is not the main method |
| Relevant studies, but autism is not the main disorder assessed |
| Conferences papers, editorial materials, and literature reviews |
| Studies that utilized data from either brain imaging, genetic, or physical/metabolic biomarkers. |
| Intervention studies |
Figure 2Article distribution over the years.
Figure 3The number of articles published by journals.
Information extracted from the articles.
| Article/ | Aim | Tool | Data Source | FS/FT | FS/FT Method | Modeling Algorithms | Key Findings |
|---|---|---|---|---|---|---|---|
| Goel et al. [ | Proposed Optimization Algorithm for improved performance over common ML | AQ-10 (child, adolescent, adult) | ASDTest | - | - | GOA, BACO, LR, NB, KNN, RF-CART + ID3, * MGOA | The proposed MGOA (GOA with Random Forest classifier) predicted ASD cases with approximate accuracy, specificity, and sensitivity of 100%. |
| Shahamiri and Thabtah [ | Implementation and evaluation of CNN-based ASD scoring system | Q-CHAT-10, AQ-10 | ASDTest | - | - | C4.5, Bayes Net, RIDOR, * CNN | The performance evaluation showed the superior performance of CNN over other algorithms; indicating the robustness of the implemented system. |
| Thabtah and Peebles [ | Demonstrate the superiority of Rules-based ML over other models | Q-CHAT-10, AQ-10 (child, Adolescent, adult) | ASDTest | - | - | RIPPER, RIDOR, Nnge, Bagging, CART, C4.5, and PRISM, * RML | Empirically evaluated rule induction, Bagging, Boosting, and decision trees algorithms on different ASD datasets. The superiority of the RML model was reported in not only classifying ASD but also offer rules that can be utilized in understanding the reasons behind the classification. |
| Wall et al. [ | Streamlining ADR-I and evaluate ML performance | ADI-R | AGRE, SSC, AC | FS | Trial-error | * ADTree, BFTree, ConjunctiveRule, DecisionStump, FilteredClassifier, J48, J48graft, JRip, LADTree, Nnge, OneR, OrdinalClassClassifier, PART, Ridor, and SimpleCart | The best model utilized 7 of the 93 items contained in the ADI-R in classifying ASD with 99.9% accuracy. |
| Duda et al. [ | Streamlining ADOS and demonstrate the superior performance of ADTree over common hand-crafted methods | ADOS | AC, AGRE, SSC, NDAR, SVIP | FS | Trial-error | ADTree | 72% reduction in the items from ADOS-G with >97% accuracy. |
| Küpper et al. [ | Streamlining ADOS and demonstrate the performance of SVM | ADOS | ASD outpatient clinics in Germany | FS | Recursive Feature Selection | SVM | SVM achieved good sensitivity and specificity with fewer ADOS items pointing to 5 behavioral features. |
| Wall et al. [ | Streamlining ADOS and evaluate ML performance | ADOS | AC, AGRE, SSC | FS | Trial-error | * ADTree, BFTree, Decision Stump, Functional Tree, J48, J48graft, Jrip, LADTree, LMT, Nnge, OneR, PART, Random Tree, REPTree, Ridor, Simple Cart | The ADTree model utilized 8 of the 29 items in Module 1 of the ADOS and classified ASD with 100% accuracy. |
| Levy et al. [ | Streamlining ADOS and evaluate ML performance | ADOS | AC, AGRE, SSC, SVIP | FS | Sparsity/parsimony enforcing regularization techniques | LR, Lasso, Ridge, Elastic net, Relaxed Lasso, Nearest shrunken centroids, LDA, * LR, * SVM, ADTree, RF, Gradient boosting, AdaBoost | With at most 10 features from ADOS′s Module 3 and Module 2, AUC of 0.95 and 0.93 was achieved, respectively. |
| Kosmicki et al. [ | Streamlining ADOS and evaluate ML performance | ADOS | AC, AGRE, SSC, NDAR, SVIP | FS | Stepwise Backward Feature Selection | ADTree, * SVM, Logistic Model Tree, * LR, NB, NBTree, RF | The best performing models have utilized 9 of the 28 items from module 2, and 12 of the 28 items from module 3 in classifying ASD with 98.27% and 97.66% accuracy, respectively. |
| Thabtah [ | Propose ASDTest; AQ-based mobile screening app, streamline AQ-10 items, and evaluate the performance of 2 ML models | AQ-10 (child, adolescent, adult) | ASDTest | FS | Trial-error | NB, * LR | Feature and predictive analyses demonstrate small groups of autistic traits improving the efficiency and accuracy of screening processes. |
| Thabtah et al. [ | Demonstrate the superiority of Va over other FS methods based on the performance of ML models on the streamlined datasets | Q-CHAT-10, and AQ-10 (child, adolescent, adult) | ASDTest | FS | Va, IG, Correlation, CFS, and CHI | Repeated Incremental Pruning to Produce Error Reduction (RIPPER), C4.5 (Decision Tree) | Va derived fewer features from adults, adolescents, and child datasets with optimal model performance. Demonstrate the efficacy of Va over IG, Correlation, CFS, and CHI in reducing AQ-10 items |
| Thabtah et al. [ | Streamlining AQ-10 and demonstrate the superior performance of LR over common hand-crafted methods | AQ-10 (adolescent, adult) | ASDTest | FS | IG, CHI | LR | LR showed acceptable performance in terms of sensitivity, specificity, and accuracy among others. |
| Suresh Kumar and Renugadevi [ | Algorithm Optimization (improvement in accuracy compared to common ML) | AQ-10 (child, adolescent, adult) | ASDTest | FS | SFS | SVM, ANN, * DE SVM, DE ANN | DE optimized SVM outperformed ANN and DE optimized ANN in classifying ASD. DE is effective. |
| Pratama et al. [ | Input Optimization using Va | AQ-10 (child, adolescent, adult) | ASDTest | FS | Va | SVM, * RF, ANN | RF succeeded in producing higher adult AQ sensitivity (87.89%), and a rise in the specificity level of AQ-Adolescents was better produced using SVM (86.33%). |
| Usta et al. [ | ML Performance Evaluation | Autism Behavior Checklist, Aberrant Behavior Checklist, Clinical Global Impression | Ondokuz Mayis University Samsun | FS | Trial-error | NB, LR, * ADTree | The ML modeling revealed the significant influence of other demographic parameters in ASD classification. |
| Wingfield et al. [ | Propose PASS; a culturally sensitive app embedded with ML model | PASS | VPASS app | FS | CFS, mRMR | * RF, NB, Adaboost, Multilayer Perceptron, J48, PART, SMO | PASS app overcomes the cultural variation in interpreting ASD symptoms, and the study demonstrated the possibility of removing feature redundancy. |
| Duda et al. [ | ML Performance Evaluation in classifying ASD from ADHD | SRS | AC, AGRE, SSC | FS | Forward Feature Selection | ADTree, RF, SVM, LR, Categorical lasso, LDA | All the models could classify ASD from ADHD by utilizing 5 of the 65 items of SRS with high average accuracy (AUC = 0.965). |
| Duda et al. [ | Improve models’ reliability using expanded datasets for classifying ASD from ADHD | SRS | AC, AGRE, SSC, and crowdsourced data | FS | - | SVM, LR, * LDA | LDA model achieved an AUC of 0.89 with 15 items. |
| Bone et al. [ | Demonstrate the improved accuracy of SVM over common hand-crafted rules | ADI-R, SRS | Balanced Independent Dataset | FT | Tuned parameters across multiple levels of cross-validation | SVM | The SVM model utilized five of the fused ADI-R and SRS items and classified ASD sufficiently with below (above) 89.2% (86.7%) sensitivity and 59.0% (53.4%) specificity. |
| Puerto et al. [ | Propose MFCM-ASD and evaluate its performance against other ML models | ADOS, ADI-R | APADA | FT | Inputs fuzzification | * MFCM-ASD, SVM, Random forest, NB | The superior performance of MFCM characterized by its robustness makes it an effective ASD diagnostic technique. |
| Akter et al. [ | Compare FT methods and evaluate the performance of ML models on the transformed datasets | Q-CHAT-10, and AQ-10 (child, adolescent, adult) | ASDTest | FT | Log, Z-score, and Sine FT | Adaboost, FDA, C5.0, LDA, MDA, PDA, SVM, and CART | Varying superior performances of the ML models and FT approaches were achieved across the datasets. |
| Baadel et al. [ | Input Optimization using a clustering approach | AQ-10 (child, adolescent, adult) | ASDTest | FT | CATC | OMCOKE, RIPPER, PART, * RF, RT, ANN | CATC showed significant improvement in screening ASD based on traits′ similarity as opposed to scoring functions. The improvement was more pronounced with RF classifier. |
ASD, autism spectrum disorder; FS, feature selection; FT, feature transformation; ML, machine learning; ANN, artificial neural network; SVM, support vector machine; CNN, convolutional neural network; RF, random forest; LR, logistic regression; ADTree, alternative decision tree; LDA, linear discriminant analysis; MGOA, modified grasshopper optimization algorithm; BACO, binary ant colony optimization; NB, naïve Bayes; KNN, K-nearest neighbor; RIPPER, repeated incremental pruning to produce error reduction; ADOS, autism diagnostic observation schedule; ADI-R, autism diagnostic interview-revised; Q-CHAT, quantitative checklist for autism toddlers; AQ, autism quotient; SRS, social responsiveness scale; PASS, pictorial autism assessment schedule; AC, boston autism consortium; AGRE, autism genetic resource exchange; SSC, Simons Simplex Collection; NDAR, National Database for Autism Research; SVIP, Simons Variation In Individuals Project; APADA, Association of Parents and Friends for the Support and Defense of the Rights of People with Autism; MFCM, multilayer fuzzy cognitive maps; CATC, clustering-based autistic trait classification. * Best performing models.
Figure 4Sum of citations per journal.
Figure 5Number of citations across years.