| Literature DB >> 35415454 |
Bilikis Banire1, Dena Al Thani1, Marwa Qaraqe1, Bilal Mansoor2.
Abstract
Attention recognition plays a vital role in providing learning support for children with autism spectrum disorders (ASD). The unobtrusiveness of face-tracking techniques makes it possible to build automatic systems to detect and classify attentional behaviors. However, constructing such systems is a challenging task due to the complexity of attentional behavior in ASD. This paper proposes a face-based attention recognition model using two methods. The first is based on geometric feature transformation using a support vector machine (SVM) classifier, and the second is based on the transformation of time-domain spatial features to 2D spatial images using a convolutional neural network (CNN) approach. We conducted an experimental study on different attentional tasks for 46 children (ASD n=20, typically developing children n=26) and explored the limits of the face-based attention recognition model for participant and task differences. Our results show that the geometric feature transformation using an SVM classifier outperforms the CNN approach. Also, attention detection is more generalizable within typically developing children than within ASD groups and within low-attention tasks than within high-attention tasks. This paper highlights the basis for future face-based attentional recognition for real-time learning and clinical attention interventions. Supplementary Information: The online version contains supplementary material available at 10.1007/s41666-021-00101-y.Entities:
Keywords: ASD; Attention recognition; Facial landmarks; Geometric features; Machine learning
Year: 2021 PMID: 35415454 PMCID: PMC8982782 DOI: 10.1007/s41666-021-00101-y
Source DB: PubMed Journal: J Healthc Inform Res ISSN: 2509-498X
Demographics of participants with ASD and TD group
| Group | ASD ( | TD ( |
|---|---|---|
| Age | 8.57 (1.40) | 8.58 (1.36) |
| ASD moderate (mild) | 11 (9) | - |
| CAST score | 17.75 (2.04) | 5.7 (3.2) |
| Gender: male (female) | 16 (4) | 18 (8) |
Fig. 1Real-time face-tracking during the attention task
Observation checklist for attention and inattention annotation
| Attention | Inattention |
|---|---|
| Participants looked at the screen and clicked the keyboard when letter X appears | Participants looked away and clicked the keyboard when letter X appears |
| Participants looked at the screen and called the letters on the screen. | Participants looked at the screen and did not click the keyboard when letter X appears. |
| Participants looked at the screen and did not call the letters on the screen. | |
| Participants did not look at the screen and call the letters on the screen. |
Fig. 4Feature selection process
Fig. 2Block diagram of geometric-based feature extraction
Fig. 3Thirty-four facial landmarks with labels. Where 0, right top jaw; 1, right jaw angle; 2, gnathion; 3, left jaw angle; 4, left top jaw; 5, outer right brow; 6, right brow corner; 7, inner right brow corner; 8, inner left brow corner; 9, left brow center; 10, outer left brow corner; 11, nose root; 13, nose lower right boundary; 14, nose bottom boundary; 15, nose lower left boundary; 16, outer right eye; 17, inner right eye; 18, inner left eye; 19, outer left eye; 20, right lip corner; 21, right apex upper lip; 22, upper lip center; 23, left apex upper lip; 24, left lip corner; 25, left edge lower lip; 26, lower lip center; 27, right edge lower lip; 28, bottom lower lip; 12, nose tip; 29, top lower lip; 30, upper corner right eye; 31, lower corner right eye; 32, upper corner left eye; 33, lower corner left eye
Evaluation of participant-specific model using SVM and CNN
| SVM | CNN | |||||
|---|---|---|---|---|---|---|
| Training | Test | Training | Test | |||
| ACC. | ACC | AUC | ACC | ACC | AUC | |
| P1 | 1.000 | 0.957 | 0.941 | 0.965 | 0.900 | 0.767 |
| P2 | 0.998 | 0.997 | 1.000 | 0.987 | 0.972 | 0.850 |
| P3 | 0.998 | 0.952 | 0.992 | 0.970 | 0.800 | 0.822 |
| P5 | 0.999 | 0.984 | 0.904 | 0.998 | 0.974 | 0.915 |
| P6 | 0.996 | 0.989 | 0.996 | 0.978 | 0.951 | 0.886 |
| P7 | 0.997 | 0.920 | 0.970 | 0.829 | 0.720 | 0.826 |
| P8 | 0.993 | 0.945 | 0.983 | 0.944 | 0.895 | 0.835 |
| P10 | 0.994 | 0.984 | 0.878 | 0.990 | 0.982 | 0.856 |
| P12 | 0.950 | 0.920 | 0.941 | 0.971 | 0.842 | 0.828 |
| P13 | 1.000 | 0.996 | 0.964 | 0.994 | 0.956 | 0.888 |
| P14 | 0.997 | 0.839 | 0.929 | 0.981 | 0.718 | 0.760 |
| P15 | 0.998 | 0.972 | 0.978 | 0.948 | 0.889 | 0.918 |
| P16 | 1.000 | 0.954 | 0.992 | 0.767 | 0.903 | 0.966 |
| P18 | 0.992 | 0.939 | 0.981 | 0.958 | 0.935 | 0.941 |
| P19 | 1.000 | 1.000 | 1.000 | 0.831 | 0.885 | 0.789 |
| P20 | 1.000 | 0.995 | 0.995 | 0.990 | 0.982 | 0.856 |
| Avg. | 0.995 | 0.959 | 0.965 | 0.944 | 0.894 | 0.856 |
Fig. 5Mean intensity frame for attention and inattention
Fig. 6Attention classification using a CNN model structure
Best 20 distance-based features used for the SVM algorithm
| Features | Feature description | Inattention (mean values) | Attention (mean values) | Distance threshold values (mm) |
|---|---|---|---|---|
| D: 3–15 | Left jaw angle-outer right brow corner | 171.45 | 146.9 | 24.55 |
| D: 4–5 | Left top jaw-outer right brow corner | 168.45 | 144.06 | 24.38 |
| D: 4–6 | Left top jaw-right brow center | 149.17 | 125.72 | 23.45 |
| D: 3–6 | Left jaw angle-right brow center | 158.32 | 135.01 | 23.30 |
| D: 4–16 | Left top jaw-outer right eye | 148.79 | 126.22 | 22.56 |
| D: 4–7 | Left top jaw-inner right brow corner | 122.89 | 100.99 | 21.89 |
| D: 4–31 | Left top jaw-lower corner right eye | 132.7 | 110.85 | 21.85 |
| D: 4–30 | Left top jaw-upper corner right eye | 134.16 | 112.37 | 21.79 |
| D: 0–4 | Gnathion-outer right brow corner | 167.05 | 145.36 | 21.69 |
| D: 3–16 | Left jaw angle-outer right eye | 147.11 | 125.43 | 21.67 |
| D: 2_5 | Gnathion-outer right brow corner | 159.94 | 138.36 | 21.58 |
| D: 3_7 | Left jaw angle-inner right brow corner | 136.43 | 114.93 | 21.49 |
| D: 3_31 | Left jaw angle-lower corner right eye | 131.48 | 110.45 | 21.02 |
| D: 3_30 | Left jaw angle-upper corner right eye | 136.74 | 115.77 | 20.96 |
| D: 4_13 | Left top jaw-nose lower right boundary | 111.63 | 90.99 | 20.63 |
| D: 4_12 | Left top jaw-nose tip | 95.09 | 74.58 | 20.51 |
| D: 4_17 | Left top jaw-inner right eye | 117.02 | 96.6 | 20.42 |
| D: 4_11 | Left top jaw-nose root | 98.58 | 78.51 | 20.06 |
| D: 2_6 | Gnathion-right brow center | 154.53 | 134.66 | 19.87 |
| D: 3_17 | Left jaw angle-inner right eye | 121.38 | 101.61 | 19.77 |
Fig. 7Confusion matrix for participant-specific model for P1 (left: SVM; right: CNN)
Evaluation of participant-specific model using SVM and CNN
| SVM | CNN | |||||
|---|---|---|---|---|---|---|
| Training | Test | Training | Test | |||
| ACC. | ACC | AUC | AUC | ACC | AUC | |
| P1 | 0.958 | 0.825 | 0.532 | 0.894 | 0.849 | 0.551 |
| P2 | 0.954 | 0.936 | 0.288 | 0.878 | 0.968 | 0.678 |
| P3 | 0.951 | 0.905 | 0.315 | 0.875 | 0.766 | 0.581 |
| P5 | 0.957 | 0.683 | 0.654 | 0.873 | 0.936 | 0.597 |
| P6 | 0.954 | 0.874 | 0.271 | 0.886 | 0.936 | 0.519 |
| P7 | 0.959 | 0.595 | 0.584 | 0.893 | 0.609 | 0.469 |
| P8 | 0.956 | 0.724 | 0.593 | 0.878 | 0.884 | 0.568 |
| P10 | 0.954 | 0.668 | 0.594 | 0.872 | 0.976 | 0.334 |
| P12 | 0.956 | 0.302 | 0.688 | 0.891 | 0.824 | 0.535 |
| P13 | 0.956 | 0.833 | 0.578 | 0.884 | 0.93 | 0.597 |
| P14 | 0.954 | 0.877 | 0.586 | 0.913 | 0.546 | 0.425 |
| P15 | 0.958 | 0.390 | 0.577 | 0.885 | 0.775 | 0.597 |
| P16 | 0.955 | 0.839 | 0.676 | 0.886 | 0.906 | 0.768 |
| P18 | 0.956 | 0.765 | 0.388 | 0.873 | 0.928 | 0.735 |
| P19 | 0.955 | 0.577 | 0.665 | 0.894 | 0.106 | 0.667 |
| P20 | 0.957 | 0.645 | 0.590 | 0.866 | 0.984 | 0.837 |
| Avg. | 0.956 | 0.715 | 0.536 | 0.884 | 0.808 | 0.591 |
Fig. 8Confusion matrix for generalized model for P1 (left: SVM; right: CNN)
Fig. 9Example of cross-groups and within-group with the participant–independence level
Fig. 10Model generalizations between children with ASD and TD
Fig. 11Model generalization between children with mild ASD and moderate ASD
Fig. 12Model generalizations between children with ASD and TD
Evaluation of landmark distortion across participants (mm)
| Name (ASD) | Error (Attn.) | Name (TD) | Error (Attn.) |
|---|---|---|---|
| P1 | 0.102 | T10 | 0.035 |
| P10 | 0.084 | T11 | 0.085 |
| P11 | 0.015 | T12 | 0.087 |
| P12 | 0.059 | T13 | 0.136 |
| P13 | 0.101 | T14 | 0.041 |
| P14 | 0.128 | T15 | 1.010 |
| P15 | 0.138 | T16 | 0.531 |
| P16 | 0.276 | T17 | 0.156 |
| P18 | 0.096 | T18 | 0.165 |
| P19 | 1.065 | T19 | 0.183 |
| P20 | 0.088 | T2 | 0.059 |
| P2 | 0.094 | T3 | 0.199 |
| P3 | 0.088 | T4 | 0.033 |
| P3 | 0.251 | T5 | 0.101 |
| P4 | 0.026 | T6 | 0.165 |
| P5 | 0.096 | T7 | 0.031 |
| P6 | 0.094 | T8 | 0.129 |
| P7 | 0.508 | T9 | 0.089 |