| Literature DB >> 34942856 |
Gianpaolo Alvari1,2, Luca Coviello3,4, Cesare Furlanello5,6.
Abstract
The high level of heterogeneity in Autism Spectrum Disorder (ASD) and the lack of systematic measurements complicate predicting outcomes of early intervention and the identification of better-tailored treatment programs. Computational phenotyping may assist therapists in monitoring child behavior through quantitative measures and personalizing the intervention based on individual characteristics; still, real-world behavioral analysis is an ongoing challenge. For this purpose, we designed EYE-C, a system based on OpenPose and Gaze360 for fine-grained analysis of eye-contact episodes in unconstrained therapist-child interactions via a single video camera. The model was validated on video data varying in resolution and setting, achieving promising performance. We further tested EYE-C on a clinical sample of 62 preschoolers with ASD for spectrum stratification based on eye-contact features and age. By unsupervised clustering, three distinct sub-groups were identified, differentiated by eye-contact dynamics and a specific clinical phenotype. Overall, this study highlights the potential of Artificial Intelligence in categorizing atypical behavior and providing translational solutions that might assist clinical practice.Entities:
Keywords: autism spectrum disorders; behavior imaging; computational phenotyping; eye contact; heterogeneity; preschool children
Year: 2021 PMID: 34942856 PMCID: PMC8699076 DOI: 10.3390/brainsci11121555
Source DB: PubMed Journal: Brain Sci ISSN: 2076-3425
Population characteristics.
| ASD Sample | |
|---|---|
| Age (months), mean (SD) | 46.32 (13.8) |
| GQ, mean (SD) | 71.54 (17.4) |
| ADOS, mean (SD) | 14.82 (1.4) |
| Gender, N (%) | |
| Male | 74 (87.1) |
| Female | 11 (12.9) |
Note: ADOS: Autism Diagnostic Observation Schedule, 2nd edition, raw score; GQ: Global Developmental Quotient (GMDS-ER); ASD: Autism Spectrum Disorders.
Video resolution for the ASD sample.
| Room Size. | Video Resolution (px) | ||||
|---|---|---|---|---|---|
| 384 × 288 | 640 × 480 | 720 × 576 | 1280 × 720 | 1920 × 1080 | |
| Small, N (%) | 10 (11.8) | 5 (5.9) | 43 (50.6) | 1 (1.2) | 0 |
| Large, N (%) | 3 (3.5) | 10 (11.8) | 11 (12.9) | 0 | 2 (2.4) |
Figure 1Eye-contact detection model (EYE-C). The images represent the output of the model run on a video example from YouTube (the video is licensed under a CC licence, and was kindly offered by White, R. [Good Behavior Beginnings]. (15 May 2015). How to Redirect Escape Behavior in 2 year olds (Video). YouTube. https://www.youtube.com/watch?v=GzGLF8GlPmo, accessed on 28 September 2021); (A) OpenPose body keypoints output [47]; (B) Gaze360 gaze vectors output [48] and eye-contact detection system; g0A: headbox center of subject A; g0A/g0B: headbox center of subject A/B; gA/gB: gaze vector of subject A/B; pA: intersection point between gaze of subject B and headbox x-axis coordinates of subject A; dB: distance (pixels) between pA and g0A.
Model evaluation results.
| Video | Frames (N) | Res (px) | Time (min/s) | Room | Acc | Pre | Rec | MCC |
|---|---|---|---|---|---|---|---|---|
| 1 | 13,786 | 640 | 12′31″ | Small | 0.96 | 0.65 | 0.80 | 0.70 |
| 2 | 13,955 | 1280 | 6′20″ | Small | 0.95 | 0.53 | 0.76 | 0.61 |
| 3 | 14,317 | 720 | 3′29″ | Large | 0.96 | 0.79 | 0.65 | 0.69 |
| 4 | 14,497 | 720 | 9′11″ | Small | 0.99 | 0.94 | 0.94 | 0.93 |
| 5 | 14,312 | 384 | 20′42″ | Small | 0.93 | 0.34 | 0.71 | 0.46 |
Note: Time: subsection beginning timing in the original video; Res: Resolution; Acc: Accuracy; Pre: Precision; Rec: Recall; MCC: Matthews Correlation Coefficient.
Variable Inflation Factor (VIF) results.
| Variable | VIF |
|---|---|
| Freq | 5.6 |
| Num | 5.42 |
| D | 1.07 |
| Dur | 1.34 |
Note: freq: eye-contact periods frequency; num: eye-contact periods total number; d: average child gaze distance d; dur: average eye-contact periods duration.
Figure 2(A) HDBSCAN clusters on 2-components UMAP output, the 2 black marks represent the single data points classified as noise; (B) Eye-contact metrics and age pairplot; freq: frequency of eye-contact episodes; num: total number of eye-contact episodes; dur: average duration of eye-contact episodes; d: average distance of children’s gaze vectors from therapists’ headboxes during interaction; age: children’s age at first assessment.
Sub-groups characteristics.
| Sub-Group 0 | Sub-Group 1 | Sub-Group 2 | F/χ2 |
| |
|---|---|---|---|---|---|
| Clinical sample | |||||
| Gender, N (%) | 3.572 | 0.734 | |||
| Male | 22 (95.7) | 17 (80.9) | 12 (75) | ||
| Female | 1 (4.3) | 4 (19.1) | 4 (25) | ||
| Video resolution (px) | - | - | - | 11.069 | 0.748 |
| Interaction duration (min), mean (SD) | 64 (37.1) | 56.3 (16.3) | 67.5 (28.1) | 0.750 | 0.477 |
| Room setting (small/large) | - | - | - | 0.291 | 0.865 |
| Age (months), mean (SD) | 37.9 (10.2) | 43.9 (10.1) | 60.9 (10.1) | ||
| Eye-contact num, mean (SD) | 9 (6.2) | 40.7 (6.2) | 11.6 (6.1) | ||
| Eye-contact freq (N/min), mean (SD) | 0.2 (0.1) | 0.7 (0.1) | 0.2 (0.1) | ||
| Eye-contact dur (sec), mean (SD) | 1.6 (0.3) | 1.9 (0.3) | 1.7 (0.3) | ||
| Eye-contact d (px), mean (SD) | 308.9 (152.1) | 301 (152.1) | 649.4 (152.1) | ||
| GQ, mean (SD) | 69.7 (15.4) | 77.1 (15.4) | 65 (15.4) | ||
| Coordination, mean (SD) | 69.6 (14.5) | 82.2 (14.5) | 64.8 (14.5) | ||
| Language, mean (SD) | 55.7 (26.6) | 67.8 (26.6) | 60.4 (26.6) | ||
| Motor, mean (SD) | 78.4 (13.8) | 79.1 (13.8) | 71.5 (13.8) | ||
| Social, mean (SD) | 64.6 (16.6) | 75.8 (16.6) | 58.7 (16.6) | ||
| Perform, mean (SD) | 89.9 (18) | 88.1(18) | 72.6 (18) | ||
| ADOS, mean (SD) | 15.6 (3.5) | 13.1 (3.5) | 15.4 (3.5) | ||
| SA, mean (SD) | 12.2 (3.5) | 10.2 (3.5) | 11.8 (3.5) | ||
Note: freq: eye-contact periods frequency; num: eye-contact periods total number; d: average child gaze distance d; dur; average eye-contact periods duration. GQ: Global Developmental Quotient; SA: Social Abilities subscale.
Figure 3Boxplots of each dependent and independent variable for resulting sub-groups; num: eye-contact periods total number; freq: eye-contact periods frequency (N/min); dur: eye-contact periods average duration (s); d: average distance of children’s gaze vectors from therapists’ headboxes during interaction (px); age: children’s age at first assessment (months); length: video total duration (min).