| Literature DB >> 33810146 |
Gennaro Tartarisco1, Giovanni Cicceri2, Davide Di Pietro2, Elisa Leonardi1, Stefania Aiello1, Flavia Marino1, Flavia Chiarotti3, Antonella Gagliano4, Giuseppe Maurizio Arduino5, Fabio Apicella6, Filippo Muratori6,7, Dario Bruneo2, Carrie Allison8, Simon Baron Cohen8, David Vagni1, Giovanni Pioggia1, Liliana Ruta1.
Abstract
In the past two decades, several screening instruments were developed to detect toddlers who may be autistic both in clinical and unselected samples. Among others, the Quantitative CHecklist for Autism in Toddlers (Q-CHAT) is a quantitative and normally distributed measure of autistic traits that demonstrates good psychometric properties in different settings and cultures. Recently, machine learning (ML) has been applied to behavioral science to improve the classification performance of autism screening and diagnostic tools, but mainly in children, adolescents, and adults. In this study, we used ML to investigate the accuracy and reliability of the Q-CHAT in discriminating young autistic children from those without. Five different ML algorithms (random forest (RF), naïve Bayes (NB), support vector machine (SVM), logistic regression (LR), and K-nearest neighbors (KNN)) were applied to investigate the complete set of Q-CHAT items. Our results showed that ML achieved an overall accuracy of 90%, and the SVM was the most effective, being able to classify autism with 95% accuracy. Furthermore, using the SVM-recursive feature elimination (RFE) approach, we selected a subset of 14 items ensuring 91% accuracy, while 83% accuracy was obtained from the 3 best discriminating items in common to ours and the previously reported Q-CHAT-10. This evidence confirms the high performance and cross-cultural validity of the Q-CHAT, and supports the application of ML to create shorter and faster versions of the instrument, maintaining high classification accuracy, to be used as a quick, easy, and high-performance tool in primary-care settings.Entities:
Keywords: Q-CHAT; autism; early screening; machine learning
Year: 2021 PMID: 33810146 PMCID: PMC8004748 DOI: 10.3390/diagnostics11030574
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Overview of the entire analytical process. Collected questionnaires processed with machine-learning (ML) models and a feature-selection algorithm. Training phase (ML training) and validation (ML validation) used fivefold cross-validation. Lastly, hyperparameters were automatically tuned on the best-evaluated ML model, and output performance was reported.
Figure 2(a) Area under curve for Quantitative CHecklist for Autism in Toddlers (Q-CHAT; autism vs typically developing (TD)) comparing all 5 machine-learning models with all features; (b) histogram of predictions of best-performing model (SVM).
Tuning support vector machine (SVM) hyperparameters values via grid search.
| Model | Parameters | Kernel | Min | Max | Steps | Scale |
|---|---|---|---|---|---|---|
|
| ||||||
| SVM(14) | C | linear, poly, rbf, sigmoid | 1 | 1000 | 10 | logarithmic |
| degree | poly | 2 | 9 | 1 | linear | |
| γ | poly, rbf, sigmoid | 0.001 | 1 | 10 | logarithmic | |
|
| ||||||
| SVM(14) | C | rbf | 0.2 | 4 | 0.2 | linear |
| γ | rbf | 0.02 | 0.3 | 0.02 | linear | |
Figure 3Learning curves to diagnose SVM model performance.
Figure 4Accuracy selecting an increasing number of Q-CHAT items using SVM–recursive feature elimination (RFE) algorithm.
Comparison between 14 most discriminating Q-CHAT items (ordered by rank) from SVM–RFE algorithm and 10 most predicting items (ordered by PPV) identified by Allison et al. (2012). Colors highlight eight common items.
| Q-CHAT 14 Items (Ordered by Rank SVM–RFE) | Q-CHAT 10 Items (Allison Et Al.; Ordered By PPV) |
|---|---|
| Does your child look at you when you call their name? (1) | Does your child look at you when you call their name? (1) |
| How easy is it for you to have eye contact with your child? (2) | How easy is it for you to have eye contact with your child? (2) |
| Does your child use simple gestures (e.g., wave goodbye)? (19) | Does your child use simple gestures (e.g., wave goodbye)? (19) |
| Can other people easily understand your child’s speech? (4) | Would you describe your child’s first words as typical? (17) |
| Does your child point to indicate that they want something (e.g., a toy that is out of reach) (5) | Does your child point to indicate that they want something (e.g., a toy that is out of reach)? (5) |
| Does your child point to share interest with you (e.g., pointing at an interesting sight)? (6) | Does your child point to share interest with you (e.g., pointing at an interesting sight)? (6) |
| How long can your child’s interest be maintained by a spinning object (e.g., washing machine, electric fan, toy car wheels)? (7) | Does your child follow where you are looking? (10) |
| Does your child pretend (e.g., care for dolls, talk on a toy phone)? (9) | Does your child pretend (e.g., care for dolls, talk on a toy phone)? (9) |
| Does your child do the same thing over and over again (e.g., running the tap, turning the light switch on and off, opening and closing doors)? (16) | Does your child stare at nothing with no apparent purpose? (25) |
| Would you describe your child’s first words as typical? (17) | If you or someone else in the family is visibly upset, does your child show signs of wanting to comfort them? (e.g., stroking their hair, hugging them)? (15) |
| When your child is playing alone, do they line objects up? (3) | |
| Does your child stare at nothing with no apparent purpose? (25) | |
| Does your child echo things they hear (e.g., things that you say, lines from songs or movies, sounds)? (18) | |
| How long can your child’s interest be maintained by just one or two objects? (22) |
Detailed performance metrics of five selected machine-learning classifiers (SVM, random forest (RF), naïve Bayes (NB), logistic regression (LR), k-nearest neighbor (KNN)) with fivefold cross-validation with respect to the original 25-item Q-CHAT, 14 items selected by the SVM–RFE algorithm, 10 items selected by Allison and colleagues, and the 3 most discriminating items in common in the two studies. Note: ASD, autism spectrum disorder.
| Model | Accuracy | Classes | PPV | Sensitivity | F1 Score | No. of Subjects |
|---|---|---|---|---|---|---|
| SVM (25) | 0.95 (± 0.02) | ASD | 1.00 | 0.90 (±0.04) | 0.95 (±0.03) | 24 |
| TD | 0.91 (±0.04) | 1.00 | 0.96 (±0.03) | 28 | ||
| SVM (14) | 0.91 (± 0.02) | ASD | 0.95 (±0.03) | 0.86 (±0.01) | 0.90 (±0.02) | 24 |
| TD | 0.86 (±0.01) | 0.95 (±0.03) | 0.90 (±0.02) | 28 | ||
| SVM (10; Allison et al.) | 0.87 (± 0.03) | ASD | 0.79 (±0.03) | 0.65 (±0.04) | 0.71 (±0.03) | 24 |
| TD | 0.76 (±0.04) | 0.86 (±0.03) | 0.81 (±0.03) | 28 | ||
| SVM (3) | 0.83 (± 0.05) | ASD | 0.90 (±0.07) | 0.78 (±0.07) | 0.84 (±0.04) | 24 |
| TD | 0.84 (±0.06) | 0.93 (±0.07) | 0.89 (±0.06) | 28 | ||
| RF (25) | 0.90 (± 0.06) | ASD | 0.89 (±0.05) | 0.74 (±0.08) | 0.81 (±0.07) | 24 |
| TD | 0.82 (±0.07) | 0.93 (±0.06) | 0.87 (±0.06) | 28 | ||
| RF (14) | 0.88 (± 0.04) | ASD | 0.86 (±0.04) | 0.83 (±0.06) | 0.84 (±0.05) | 24 |
| TD | 0.87 (±0.05) | 0.90 (±0.04) | 0.88 (±0.04) | 28 | ||
| RF (10; Allison et al.) | 0.84 (± 0.03) | ASD | 0.89 (±0.07) | 0.70 (±0.06) | 0.78 (±0.06) | 24 |
| TD | 0.79 (±0.05) | 0.93 (±0.06) | 0.86 (±0.06) | 28 | ||
| RF (3) | 0.83 (± 0.05) | ASD | 0.83 (±0.06) | 0.83 (±0.05) | 0.83 (±0.06) | 24 |
| TD | 0.86 (±0.05) | 0.86 (±0.06) | 0.86 (±0.06) | 28 | ||
| NB (25) | 0.89 (± 0.04) | ASD | 1.00 | 0.70 (±0.02) | 0.82 (±0.01) | 24 |
| TD | 0.81 (±0.02) | 1.00 | 0.89 (±0.01) | 28 | ||
| NB (14) | 0.88 (± 0.04) | ASD | 1.00 | 0.74 (±0.02) | 0.85 (±0.01) | 24 |
| TD | 0.83 (±0.02) | 1.00 | 0.91 (±0.01) | 28 | ||
| NB (10; Allison et al.) | 0.82 (± 0.03) | ASD | 1.00 | 0.65 (±0.02) | 0.79 (±0.01) | 24 |
| TD | 0.78 (±0.02) | 1.00 | 0.88 (±0.01) | 28 | ||
| NB (3) | 0.84 (± 0.03) | ASD | 0.82 (±0.07) | 0.78 (±0.05) | 0.80 (±0.06) | 24 |
| TD | 0.83 (±0.06) | 0.86 (±0.06) | 0.85 (±0.06) | 28 | ||
| KNN(25) | 0.83 (± 0.03) | ASD | 0.95 (±0.03) | 0.71 (±0.05) | 0.81 (±0.03) | 24 |
| TD | 0.75 (±0.03) | 0.96 (±0.02) | 0.84 (±0.02) | 28 | ||
| KNN(14) | 0.85 (± 0.04) | ASD | 0.98 (±0.02) | 0.73 (±0.08) | 0.84 (±0.05) | 24 |
| TD | 0.77 (±0.05) | 0.98 (±0.02) | 0.86 (±0.03) | 28 | ||
| KNN(10; Allison et al.) | 0.83 (± 0.03) | ASD | 0.90 (±0.05) | 0.76 (±0.04) | 0.83 (±0.03) | 24 |
| TD | 0.77 (±0.04) | 0.91 (±0.05) | 0.83 (±0.03) | 28 | ||
| KNN (3) | 0.66 (± 0.05) | ASD | 0.62 (±0.03) | 0.90 (±0.05) | 0.73 (±0.03) | 24 |
| TD | 0.77 (±0.03) | 0.39 (±0.07) | 0.52 (±0.08) | 28 | ||
| LR(25) | 0.89 (± 0.03) | ASD | 0.92 (±0.05) | 0.87 (±0.05) | 0.89 (±0.03) | 24 |
| TD | 0.87 (±0.05) | 0.91 (±0.06) | 0.88 (±0.03) | 28 | ||
| LR (14) | 0.90 (± 0.02) | ASD | 0.93 (±0.03) | 0.87 (±0.03) | 0.90 (±0.02) | 24 |
| TD | 0.87 (±0.03) | 0.93 (±0.03) | 0.90 (±0.02) | 28 | ||
| LR (10; Allison et al.) | 0.88 (± 0.03) | ASD | 0.92 (±0.04) | 0.84 (±0.06) | 0.88 (±0.03) | 24 |
| TD | 0.84 (±0.05) | 0.91 (±0.05) | 0.87 (±0.03) | 28 | ||
| LR (3) | 0.84 (± 0.05) | ASD | 0.84 (±0.05) | 0.87 (±0.06) | 0.85 (±0.04) | 24 |
| TD | 0.84 (±0.06) | 0.81 (±0.08) | 0.82 (±0.06) | 28 |