| Literature DB >> 33277527 |
Emilie Leblanc1, Peter Washington2, Maya Varma3, Kaitlyn Dunlap1,4, Yordan Penev1,4, Aaron Kline1,4, Dennis P Wall5,6,7.
Abstract
Autism Spectrum Disorder is a neuropsychiatric condition affecting 53 million children worldwide and for which early diagnosis is critical to the outcome of behavior therapies. Machine learning applied to features manually extracted from readily accessible videos (e.g., from smartphones) has the potential to scale this diagnostic process. However, nearly unavoidable variability in video quality can lead to missing features that degrade algorithm performance. To manage this uncertainty, we evaluated the impact of missing values and feature imputation methods on two previously published autism detection classifiers, trained on standard-of-care instrument scoresheets and tested on ratings of 140 children videos from YouTube. We compare the baseline method of listwise deletion to classic univariate and multivariate techniques. We also introduce a feature replacement method that, based on a score, selects a feature from an expanded dataset to fill-in the missing value. The replacement feature selected can be identical for all records (general) or automatically adjusted to the record considered (dynamic). Our results show that general and dynamic feature replacement methods achieve a higher performance than classic univariate and multivariate methods, supporting the hypothesis that algorithmic management can maintain the fidelity of video-based diagnostics in the face of missing values and variable video quality.Entities:
Mesh:
Year: 2020 PMID: 33277527 PMCID: PMC7719177 DOI: 10.1038/s41598-020-76874-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Pipeline description and contribution.
Figure 2Pipeline training process on instrument scoresheets.
Figure 3Pipeline testing process on YouTube video ratings.
Figure 4General feature replacement process illustration.
Figure 5Dynamic feature replacement process illustration.
Baseline performance of LR9 and ADTree7 on 420 ratings of 140 YouTube videos (average performance and standard deviation). With listwise deletion, a rating is dropped if it contains at least one NULL value in the model's features (this is the case for 135 ratings for LR9). We are unable to rate a video if at least one model feature is missing in all 3 ratings of this video (this is the case for 5 videos for LR9).
| Model | Sensitivity | Specificity | UAR | AUC-ROC | AUC PR | Ratings dropped |
|---|---|---|---|---|---|---|
| LR9 | 0.8939 (0.0152) | 0.7536 (0.0102) | 0.8238 (0.0118) | 0.9109 (0.0021) | 0.9195 (0.0018) | 135 ratings and 5 videos entirely |
| ADTree7 | 0.8172 (0.0397) | 0.8721 (0.0709) | 0.8447 (0.0261) | 0.9083 (0.0117) | 0.8706 (0.0644) | 205 ratings and 21 videos entirely |
Univariate feature imputation methods—performance of LR9 and ADTree7 on 420 ratings of 140 YouTube videos.
| (a) Performance of LR9 (average performance and standard deviation). | |||||
|---|---|---|---|---|---|
| Method | Sensitivity | Specificity | UAR | AUC-ROC | AUC PR |
| Mean | 0.9029 (0.0064) | 0.8571 | 0.8800 | 0.9541 | 0.9658 |
| Median | 0.9143 | 0.8514 | 0.8829 | 0.9577 | 0.9695 |
| Mode | 0.9171 | 0.8400 | 0.8786 | 0.9569 | 0.9671 |
Multivariate feature imputation methods—performance of LR9 and ADTree7 on 420 ratings of 140 YouTube videos.
| (a) Performance of LR9 (average performance and standard deviation). | |||||
|---|---|---|---|---|---|
| Method | Sensitivity | Specificity | UAR | AUC-ROC | AUC PR |
| Gaussian mixture | 0.9029 (0.0186) | 0.8457 (0.1085) | 0.8743 (0.0467) | 0.9477 | 0.9604 |
| Ridge regression | 0.9114 (0.0064) | 0.8657 | 0.8886 | 0.9549 | 0.9660 |
| Decision trees | 0.9057 (0.0128) | 0.8686 | 0.8871 | 0.9576 | 0.9680 |
General feature replacement methods—performance of LR9 and ADTree7 on 420 ratings of 140 YouTube videos.
| (a) Performance of LR9 (average performance and standard deviation). | |||||
|---|---|---|---|---|---|
| Method | Sensitivity | Specificity | UAR | AUC-ROC | AUC PR |
| Most correlated feature | 0.9114 (0.0064) | 0.8629 | 0.8871 | 0.9597 | 0.9708 |
| Nearest-neighbor feature | 0.9171 | 0.8286 | 0.8729 | 0.9582 | 0.9696 |
| Highest mutual information feature | 0.9086 (0.0078) | 0.8657 | 0.8871 | 0.9599 | 0.9706 |
Dynamic feature replacement methods—performance of LR9 and ADTree7 on 420 ratings of 140 YouTube videos.
| (a) Performance of LR9 (average performance and standard deviation). | |||||
|---|---|---|---|---|---|
| Method | Sensitivity | Specificity | UAR | AUC-ROC | AUC PR |
| Dynamic—most correlated feature | 0.9171 | 0.8286 | 0.8729 | 0.9585 | 0.9690 |
| Dynamic—nearest-neighbor feature | 0.9171 | 0.8543 | 0.8857 | 0.9596 | 0.9697 |
| Dynamic—highest mutual information feature | 0.9114 (0.0057) | 0.8800 | 0.8957 | 0.9613 | 0.9711 |