| Literature DB >> 36213933 |
Chao Song1, Zhong-Quan Jiang2, Li-Fei Hu1, Wen-Hao Li1, Xiao-Lin Liu1, Yan-Yan Wang1, Wen-Yuan Jin1, Zhi-Wei Zhu1.
Abstract
Background: Early detection of children with autism spectrum disorder (ASD) and comorbid intellectual disability (ID) can help in individualized intervention. Appropriate assessment and diagnostic tools are lacking in primary care. This study aims to explore the applicability of machine learning (ML) methods in diagnosing ASD comorbid ID compared with traditional regression models. Method: From January 2017 to December 2021, 241 children with ASD, with an average age of 6.41 ± 1.96, diagnosed in the Developmental Behavior Department of the Children's Hospital Affiliated with the Medical College of Zhejiang University were included in the analysis. This study trained the traditional diagnostic models of Logistic regression (LR), Support Vector Machine (SVM), and two ensemble learning algorithms [Random Forest (RF) and XGBoost]. Socio-demographic and behavioral observation data were used to distinguish whether autistic children had combined ID. The hyperparameters adjustment uses grid search and 10-fold validation. The Boruta method is used to select variables. The model's performance was evaluated using discrimination, calibration, and decision curve analysis (DCA). Result: Among 241 autistic children, 98 (40.66%) were ASD comorbid ID. The four diagnostic models can better distinguish whether autistic children are complicated with ID, and the accuracy of SVM is the highest (0.836); SVM and XGBoost have better accuracy (0.800, 0.838); LR has the best sensitivity (0.939), followed by SVM (0.952). Regarding specificity, SVM, RF, and XGBoost performed significantly higher than LR (0.355). The AUC of ML (SVM, 0.835 [95% CI: 0.747-0.944]; RF, 0.829 [95% CI: 0.738-0.920]; XGBoost, 0.845 [95% CI: 0.734-0.937]) is not different from traditional LR (0.858 [95% CI: 0.770-0.944]). Only SVM observed a good calibration degree. Regarding DCA, LR, and SVM have higher benefits in a wider threshold range.Entities:
Keywords: artificial intelligence; autism spectrum disorder; child; diagnostic model; intellectual disability; machine learning
Year: 2022 PMID: 36213933 PMCID: PMC9533131 DOI: 10.3389/fpsyt.2022.993077
Source DB: PubMed Journal: Front Psychiatry ISSN: 1664-0640 Impact factor: 5.435
Behavioral observation scoring criteria.
| Behavioral observation variables | 0 point | 1 point | 2 points |
| Stereotyped speech | Scarcely | Partial | Many |
| Pointing/gestures | Many | Partial | Scarcely |
| Unusual eye contact | Scarcely | Partial | Many |
| Facial expression | Many | Partial | Scarcely |
| Social quality | Good | Average | Bad |
| Unusual sensory interest | Scarcely | Partial | Many |
| Complex mannerisms | Scarcely | Partial | Many |
| Repetitive stereotyped behaviors | Scarcely | Partial | Many |
| Overactivity | Scarcely | Partial | Many |
| Negative behaviors | Scarcely | Partial | Many |
| Anxiety | Scarcely | Partial | Many |
The socio-demographic information and behavioral observations of autistic children.
| Variable | Total | Whether or not with ID | ||
| Yes | No | |||
| 241 | 98 | 143 | – | |
|
| ||||
| Male | 202 (83.82%) | 75 (76.53%) | 127 (88.81%) | 0.013 |
| Female | 39 (16.18%) | 23 (23.47%) | 16 (11.19%) | |
| Age | 6.41 ± 1.96 | 6.01 ± 1.75 | 6.67 ± 2.05 | 0.010 |
|
| ||||
| Primary school | 12 (4.98%) | 6 (6.12%) | 6 (4.20%) | 0.032 |
| Secondary school | 37 (15.35%) | 17 (17.35%) | 20 (13.99%) | |
| High school | 30 (12.44%) | 19 (19.39%) | 11 (7.69%) | |
| College/university | 147 (61.00%) | 52 (53.06%) | 95 (66.43%) | |
| Graduate and above | 15 (6.22%) | 4 (4.08%) | 11 (7.69%) | |
|
| ||||
| Primary school | 6 (2.49%) | 2 (2.04%) | 4 (2.80%) | 0.971 |
| Secondary school | 53 (21.99%) | 24 (24.49%) | 29 (20.28%) | |
| High school | 36 (14.94%) | 12 (12.24%) | 24 (16.78%) | |
| College/university | 133 (55.19%) | 50 (51.02%) | 73 (51.05%) | |
| Graduate and above | 23 (9.54%) | 10 (10.20%) | 13 (9.09%) | |
|
| ||||
| Pre-verbal/single words | 42 (17.43%) | 28 (28.57%) | 14 (9.79%) | <0.001 |
| Phrase speech | 122 (50.62%) | 65 (66.33%) | 57 (39.86%) | |
| Fluent speech | 77 (31.95%) | 5 (5.10%) | 72 (50.35%) | |
|
| ||||
| Stereotyped speech | 1.00 (0.00, 1.00) | 1.00 (0.00, 1.25) | 1.00 (0.00, 1.00) | 0.002 |
| Pointing/gestures | 2.00 (1.00, 2.00) | 2.00 (2.00, 2.00) | 2.00 (1.00, 2.00) | <0.001 |
| Unusual eye contact | 2.00 (0.00, 2.00) | 2.00 (2.00, 2.00) | 2.00 (0.00, 2.00) | 0.002 |
| Facial expression | 1.00 (0.00, 1.00) | 1.00 (0.00, 1.00) | 1.00 (0.00, 1.00) | 0.002 |
| Social quality | 1.00 (1.00, 2.00) | 2.00 (1.00, 2.00) | 1.00 (1.00, 1.00) | <0.001 |
| Unusual sensory interest | 0.00 (0.00, 1.00) | 0.00 (0.00, 1.25) | 1.00 (0.00, 1.00) | 0.001 |
| Complex mannerisms | 0.00 (0.00, 0.00) | 0.00 (0.00, 0.00) | 0.00 (0.00, 0.00) | 0.026 |
| Repetitive stereotyped behaviors | 1.00 (0.00, 2.00) | 1.00 (0.00, 1.00) | 2.00 (1.00, 2.00) | <0.001 |
| Overactivity | 0.00 (0.00, 0.00) | 0.00 (0.00, 0.00) | 0.00 (0.00, 0.00) | 0.059 |
| Negative behaviors | 0.00 (0.00, 0.00) | 0.00 (0.00, 0.00) | 0.00 (0.00, 0.00) | 0.319 |
| Anxiety | 0.00 (0.00, 0.00) | 0.00 (0.00, 0.00) | 0.00 (0.00, 0.00) | 0.721 |
FIGURE 1Boruta-based feature selection results.
Comparison of full variables and variables after feature selection.
| Type | Quantity | Variable list |
| Full variable | 16 | Gender, age, Mother.edu, Father.edu, language ability, stereotyped speech, pointing/gestures, unusual eye contact, facial expression, social quality, unusual sensory interest, complex mannerisms, repetitive stereotyped behaviors, overactivity, negative behaviors, and anxiety |
| Feature selection | 10 | Age, Mother.edu, language ability, stereotyped speech, pointing/gestures, facial expression, social quality, unusual sensory interest, negative behaviors, and repetitive stereotyped behaviors |
Improvement index before and after feature selection.
| Model | IDI | 95% CI | |
| LR | 0.049 | −0.018 to 0.117 | 0.150 |
| RF | 0.043 | 0.012–0.075 | 0.006 |
| SVM | 0.382 | 0.244–0.520 | <0.001 |
| XGBoost | 0.103 | 0.022–0.184 | 0.011 |
Diagnostic models performance.
| Model | Accuracy | Precision | Sensitivity | Specificity | AUC |
| LR | 0.712 | 0.672 | 0.976 | 0.355 | 0.858 (0.770–0.944) |
| RF | 0.726 | 0.789 | 0.714 | 0.742 | 0.829 (0.738–0.920) |
| SVM | 0.836 | 0.800 | 0.952 | 0.677 | 0.845 (0.747–0.944) |
| XGBoost | 0.767 | 0.838 | 0.738 | 0.806 | 0.845 (0.734–0.937) |
FIGURE 2Performance of the diagnostic models.
De-long test, pairwise comparison between models.
| Model | Z |
|
| LR vs. RF | 1.016 | 0.309 |
| LR vs. SVM | 0.491 | 0.623 |
| LR vs. XGBoost | 0.190 | 0.848 |
| RF vs. XGBoost | −0.497 | 0.618 |
| RF vs. XGBoost | −0.244 | 0.807 |
| SVM vs. XGBoost | 0.011 | 0.990 |
FIGURE 3Calibration diagram.
FIGURE 4DCA of the four models.
Variable importance order of LR, SVM, RF, and XGBoost.
| Variable | Logistic regression | SVM | Random forest | XGBoost | Average rank | |||
| OR | Rank | Rank | IncNodePurity | Rank | Gain | Rank | ||
| Language ability | 27.16 | 2 | 3 | 6.07 | 2 | 0.35 | 1 | 1 |
| Repetitive stereotyped behaviors | 4.84 | 3 | 1 | 3.15 | 5 | 0.1 | 3 | 2 |
| Mother.edu | 27.69 | 1 | 7 | 3.18 | 4 | 0.06 | 8 | 3 |
| Social quality | 0.11 | 7 | 5 | 2.46 | 6 | 0.09 | 4 | 4 |
| Stereotyped speech | 0.27 | 6 | 2 | 2.12 | 8 | 0.04 | 9 | 5 |
| Pointing/gestures | 0.07 | 8 | 8 | 3.25 | 3 | 0.07 | 6 | 5 |
| Negative behaviors | 0 | 10 | 4 | 0.41 | 10 | 0.13 | 2 | 7 |
| Age | 5.03 | 9 | 10 | 8.58 | 1 | 0.06 | 7 | 8 |
| Facial expression | 0.64 | 4 | 9 | 1.51 | 9 | 0.08 | 5 | 8 |
| Unusual sensory interest | 0.37 | 5 | 6 | 2.46 | 7 | 0.02 | 10 | 10 |
FIGURE 5SHAP value and importance of each feature in SVM.
Schedule 1 Assignment table.
| Variable | Category | Assignment |
| Gender | Categorical | Male = 1; |
| Language ability | Categorical | Pre-verbal/single words = 1; |
| Father’s education attainment | Categorical | Primary school = 1; |
| Mother’s education attainment | Categorical | Primary school = 1; |
| Age | Continuous | – |
| Stereotyped speech | Continuous | – |
| Pointing/gestures | Continuous | – |
| Unusual eye contact | Continuous | – |
| Facial expression | Continuous | – |
| Social quality | Continuous | – |
| Unusual sensory interest | Continuous | – |
| Complex mannerisms | Continuous | – |
| Repetitive stereotyped behaviors | Continuous | – |
| Overactivity | Continuous | – |
| Negative behaviors | Continuous | – |
| Anxiety | Continuous | – |
| Outcome | Categorical | Non-ID = 1; |