| Literature DB >> 35308891 |
Nicole Wolff1, Matthias Eberlein2, Sanna Stroth3, Luise Poustka4, Stefan Roepke5, Inge Kamp-Becker3, Veit Roessner1.
Abstract
Objective: Although autism spectrum disorder (ASD) is a relatively common, well-known but heterogeneous neuropsychiatric disorder, specific knowledge about characteristics of this heterogeneity is scarce. There is consensus that IQ contributes to this heterogeneity as well as complicates diagnostics and treatment planning. In this study, we assessed the accuracy of the Autism Diagnostic Observation Schedule (ADOS/2) in the whole and IQ-defined subsamples, and analyzed if the ADOS/2 accuracy may be increased by the application of machine learning (ML) algorithms that processed additional information including the IQ level.Entities:
Keywords: ADOS; IQ; autism spectrum disorders; diagnostic; intellectual disability; intelligence; machine learning
Year: 2022 PMID: 35308891 PMCID: PMC8927055 DOI: 10.3389/fpsyt.2022.826043
Source DB: PubMed Journal: Front Psychiatry ISSN: 1664-0640 Impact factor: 5.435
Sample characteristics.
|
|
|
|
| |||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
| Age | 644 | 13.4 (±9.7) | 440 | 14.9 (±11.1) | 1,024 | 14.1 (±10.4) | 60 | 12.9 (±8.9) |
| IQ | 545 | 99.3 (±21.1) | 370 | 101.2 (±26.0) | 892 | 101.2 (±22.4) | 23 | 57.3 (±9.6) |
| IQ level | 644 | 3.2 (±1.2) | 440 | 3.3 (±1.5) | 1,024 | 3.1 (±1.2) | 60 | 5.5 (±0.7) |
Total number of participants is N = 1,084. Patients were chosen based on the presence of information about their IQ level.
Figure 1IQ level distribution (from IQ level 7-1 in %) of ASD (n = 440) and non-ASD (n = 644) group in a German sample referred to specialized ASD clinics.
Figure 2Comparison of error rates and their composition of false negatives (FN) and false positives (FP) by several sub-cohorts (e.g. sex-specific). For the calculation of the error rates of the respective algorithm [random forest (RF), decision tree (DT), ADOS/2] per sub-cohort, FN and FP are divided through the respective sample size N of the sub-cohort. IQL, IQ level.
Parameters considered for grid search.
|
|
| |
|---|---|---|
| Decision Tree | Maximal depth | 2, 3, 4, 5 |
| Complexity parameter for Minimal-Complexity pruning | 0 | |
| Splitting criterion | Gini index, Entropy | |
| Minimum number of samples for split | 2 | |
| Minimum number of samples per leaf | 1, 2, 4, 8 | |
| Random Forest | ||
| Number of estimators | 2, 4, 8, 16 | |
| Number of samples drawn for the training of the individual trees, in proportion to the number of samples in the train set | 0.3, 0.5, 0.7, 0.8, 1 |
Parameters that were found to be optimal to predict the final BEC ASD diagnosis.
Figure 3Decision tree with optimal parameters, trained on the full dataset. Its in-sample accuracy (i.e. on the training data/the full dataset) is 84.08%. Samples are passing the tree from top to bottom. If the conditional test associated with a node is passed, the left child-node will be visited, else the right node. Furthermore, the number of samples, proportion of ASD samples in the training set are given as well as the estimated class. The color codes the “purity” of the sample: From blue (100 % ASD) over white (50% ASD) to orange (0% ASD). For the ADOS-items, the names of corresponding items in all four different modules are given, separated by “/”. If no corresponding item for a given module could be found, they are marked as “-”.
Figure 4Feature importance for the trained random forest classifier. Features' importance have been normalized so that they sum to 1. All features used in this study are shown. The features corresponding to ADOS items are reported in the order M1/M2/M3/M4 according to their content. Features that do not appear in all modules are marked with a “-” to indicate that they do not appear in the respective module.