| Literature DB >> 31017583 |
Qandeel Tariq1,2, Scott Lanyon Fleming2, Jessey Nicole Schwartz1,2, Kaitlyn Dunlap1,2, Conor Corbin2, Peter Washington2, Haik Kalantarian1,2, Naila Z Khan3, Gary L Darmstadt1,4, Dennis Paul Wall1,2.
Abstract
BACKGROUND: Autism spectrum disorder (ASD) is currently diagnosed using qualitative methods that measure between 20-100 behaviors, can span multiple appointments with trained clinicians, and take several hours to complete. In our previous work, we demonstrated the efficacy of machine learning classifiers to accelerate the process by collecting home videos of US-based children, identifying a reduced subset of behavioral features that are scored by untrained raters using a machine learning classifier to determine children's "risk scores" for autism. We achieved an accuracy of 92% (95% CI 88%-97%) on US videos using a classifier built on five features.Entities:
Keywords: Bangladesh; Biomedical Data Science; autism; autism spectrum disorder; clinical resources; developmental delays; machine learning
Mesh:
Year: 2019 PMID: 31017583 PMCID: PMC6505375 DOI: 10.2196/13822
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Participant demographics collected from Dhaka Shishu Hospital, Bangladesh.
| Demographic | Full cohort (N=150) | ASDa cohort (N=50) | TDb cohort (N=50) | SLCc cohort (N=50) | ||
| Age (years), mean (SD) | 2.55 (0.62) | 2.51 (0.70) | 2.40 (0.59) | 2.73 (0.51) | ||
| Gender (male), n (%) | 90 (60) | 36 (72) | 23 (62) | 31 (46) | ||
| Preterm (ie, <37 weeks), n (%) | 11 (0.7) | 5 (10) | 0 (0) | 6 (12) | ||
| 1,000-10,000 | 16 (10.7) | 0 (0) | 16 (32) | 0 (0) | ||
| >10,000-30,000 | 33 (22) | 2 (4) | 21 (42) | 10 (20) | ||
| >30,000 | 101 (67.3) | 48 (96) | 13 (26) | 40 (80) | ||
| Urban | 139 (92.7) | 50 (100) | 50 (100) | 39 (78) | ||
| Semiurban | 8 (5.3) | 0 (0) | 0 (0) | 8 (16) | ||
| Rural | 3 (2) | 0 (0) | 0 (0) | 3 (6) | ||
| Muslim | 141 (94) | 44 (88) | 49 (98) | 48 (96) | ||
| Hindu | 6 (4) | 4 (8) | 0 (0) | 2 (4) | ||
| Christian | 1 (0.01) | 1 (2) | 0 (0) | 0 (0) | ||
| Buddhist | 2 (0.01) | 1 (2) | 1 (2) | 0 (0) | ||
| Missing stunting information | 60 (40) | 4 (8) | 50 (100) | 6 (12) | ||
| No stunting | 49 (32.7) | 30 (60) | 0 (0) | 19 (48) | ||
| Stunting | 41 (27.3) | 16 (32) | 0 (0) | 25 (50) | ||
| Social affect | N/Ah | 11.57 (5.30) | N/A | N/A | ||
| Restricted and repetitive behavior | N/A | 3.46 (3.29) | N/A | N/A | ||
| Composite | N/A | 5.14 (2.08) | N/A | N/A | ||
| Receptive language delay | N/A | N/A | N/A | 2 (4) | ||
| Expressive language delay | N/A | N/A | N/A | 5 (10) | ||
| Both receptive and expressive language delay | N/A | N/A | N/A | 37 (74) | ||
| Receptive and expressive language disorder | N/A | N/A | N/A | 6 (12) | ||
aASD: autism spectrum disorder.
bTD: neurotypical development.
cSLC: speech and language condition.
d1 US $=84 taka.
eMCHAT: Modified Checklist for Autism in Toddlers
fADOS: Autism Diagnostic Observation Schedule.
gADOS was only performed on a subset of 28 children with ASD.
hN/A: not available.
Figure 1Results from the top performing classifiers trained on US clinical score sheet data and tested on Bangladeshi data with an objective to distinguish between ASD and non-ASD. ROC: receiver operating characteristic; AUC: area under the curve; ASD: autism spectrum disorder.
Average demographic information of the test set calculated by testing the model on 45 videos for both layers.
| Demographic | Layer 1 (distinguishing TDa from ASDb/SLCc) | Layer 2 (distinguishing ASD from SLC) |
| Age (years), average (SD) | 2 years 7 months (5 months) | 2 years 6 months (3 months) |
| Proportion of males, mean % | 62 | 70 |
| Proportion of TD children, mean % | 33 | 22 |
| Proportion of children with ASD, mean % | 33 | 44 |
| Proportion of children with SLC, mean % | 33 | 34 |
aTD: neurotypical development.
bASD: autism spectrum disorder.
cSLC: speech and language condition.
Figure 2(A) ROC curve for layer 1 (distinguishing between children with TD and children with ASD or SLC). (B) ROC curve for layer 2 (distinguishing between ASD and SLC). ASD: autism spectrum disorder; AUC: area under the curve; SLC: speech and language condition; TD: neurotypical development; ROC: receiver operating characteristic.
Results from classifiers to distinguish among autism spectrum disorder, speech and language conditions, and neurotypical development. The results distinguish layer 1 (distinguishing neurotypical development from atypical conditions [autism spectrum disorder/speech and language conditions]) and layer 2 (distinguishing autism spectrum disorder from other delays [speech and language conditions]) from those classified as atypical in layer 1.
| Classifier Layer | Sensitivity, % (SD) | Specificity, % (SD) | Unweighted average recall, % (SD) | Area under the curve, % (SD) | Accuracy, % (SD%) |
| Layer 1a | 76 (SD 4) | 58 (SD 3) | 67 (SD 1) | 76 (SD 3) | 70 (SD 2) |
| Layer 2b | 76 (SD 6) | 77 (SD 24) | 77 (SD 9) | 85 (SD 5) | 76 (SD 11) |
aDistinguishing neurotypical development from autism spectrum disorder/speech and language conditions.
bDistinguishing autism spectrum disorder from other developmental delays (speech and language conditions).
Figure 3Shapley value distributions for two of the most important features in the rater-adaptive ensemble model. These features measure the child’s stereotyped behaviors/repetitive interests and eye contact. They demonstrate that clinical intuition and the inner workings of our classifier align closely. ASD: autism spectrum disorder.
Figure 4Logistic regression (Elastic Net penalty) classifier, trained on Bangladeshi data and tested on US data as well as a held-out test set of the Bangladeshi data. AUC: area under the curve.
Figure 5Logistic regression (Elastic Net penalty) classifier, trained on US data and tested on Bangladeshi data as well as a held-out test set of the US data.
Figure 6Feature selection analysis. Numbers within the cells indicate the frequency of selection. (A) Feature frequency comparison during cross-fold validation with alpha value 0.1 between Bangladeshi data and US data. (B) Feature frequency comparison during cross-fold validation with alpha value 0.01 between Bangladeshi data and US data.