| Literature DB >> 34804256 |
Pinar Uluer1,2, Hatice Kose2, Elif Gumuslu3, Duygun Erol Barkana3.
Abstract
This study presents an assistive robotic system enhanced with emotion recognition capabilities for children with hearing disabilities. The system is designed and developed for the audiometry tests and rehabilitation of children in a clinical setting and includes a social humanoid robot (Pepper), an interactive interface, gamified audiometry tests, sensory setup and a machine/deep learning based emotion recognition module. Three scenarios involving conventional setup, tablet setup and setup with the robot+tablet are evaluated with 16 children having cochlear implant or hearing aid. Several machine learning techniques and deep learning models are used for the classification of the three test setups and for the classification of the emotions (pleasant, neutral, unpleasant) of children using the recorded physiological signals by E4 wristband. The results show that the collected signals during the tests can be separated successfully and the positive and negative emotions of children can be better distinguished when they interact with the robot than in the other two setups. In addition, the children's objective and subjective evaluations as well as their impressions about the robot and its emotional behaviors are analyzed and discussed extensively.Entities:
Keywords: Deep learning; Emotion recognition; Human-robot interaction; Machine learning; Physiological signals; Social robots
Year: 2021 PMID: 34804256 PMCID: PMC8594648 DOI: 10.1007/s12369-021-00830-5
Source DB: PubMed Journal: Int J Soc Robot ISSN: 1875-4791 Impact factor: 3.802
Fig. 1Test setups for the auditory perception game
Fig. 2Test setups for the experimental studies with children
Fig. 3Work flow of classification task using physiological signal
Classification Results of RT, RC,TC, RTC using ANN and CNN
| Item | Metric | ANN | CNN | ||||||
|---|---|---|---|---|---|---|---|---|---|
| set | RT | RC | TC | RTC | RT | RC | TC | RTC | |
| env | Accuracy | 0.990 | 0.980 | 0.747 | 0.809 | 0.988 | 0.976 | 0.753 | 0.817 |
| F1-score | 0.990 | 0.980 | 0.743 | 0.734 | 0.988 | 0.976 | 0.744 | 0.740 | |
| ton | Accuracy | 0.994 | 0.994 | 0.713 | 0.796 | 0.995 | 0.994 | 0.765 | 0.857 |
| F1-score | 0.994 | 0.994 | 0.712 | 0.718 | 0.995 | 0.994 | 0.763 | 0.798 | |
| env+ton | Accuracy | 0.989 | 0.984 | 0.645 | 0.747 | 0.991 | 0.985 | 0.619 | 0.750 |
| F1-score | 0.989 | 0.984 | 0.642 | 0.656 | 0.991 | 0.985 | 0.615 | 0.655 | |
Classification Results of PU, PN, NU, PNU in Robot Setup using ANN and CNN
| Item | Metric | ANN | CNN | ||||||
|---|---|---|---|---|---|---|---|---|---|
| set | PU | PN | NU | PNU | PU | PN | NU | PNU | |
| Accuracy | 0.771 | 0.562 | 0.628 | 0.486 | 0.792 | 0.521 | 0.654 | 0.445 | |
| Precision | 0.843 | 0.565 | 0.646 | 0.507 | 0.809 | 0.544 | 0.657 | 0.476 | |
| Recall | 0.771 | 0.562 | 0.628 | 0.328 | 0.792 | 0.521 | 0.654 | 0.285 | |
| F1-score | 0.757 | 0.560 | 0.622 | 0.373 | 0.783 | 0.482 | 0.652 | 0.333 | |
| Specificity | 0.771 | 0.562 | 0.628 | 0.671 | 0.792 | 0.521 | 0.654 | 0.645 | |
| GSP | 0.782 | 0.562 | 0.630 | 0.394 | 0.792 | 0.506 | 0.654 | 0.355 | |
| GSS | 0.735 | 0.557 | 0.614 | 0.448 | 0.773 | 0.438 | 0.650 | 0.410 | |
| MCC | 0.610 | 0.127 | 0.273 | 0.001 | 0.600 | 0.062 | 0.311 | -0.071 | |
| Accuracy | 0.521 | 0.542 | 0.682 | 0.472 | 0.458 | 0.604 | 0.636 | 0.389 | |
| Precision | 0.524 | 0.538 | 0.686 | 0.489 | 0.460 | 0.612 | 0.640 | 0.415 | |
| Recall | 0.521 | 0.542 | 0.682 | 0.314 | 0.458 | 0.604 | 0.636 | 0.249 | |
| F1-score | 0.514 | 0.523 | 0.680 | 0.368 | 0.455 | 0.593 | 0.631 | 0.287 | |
| Specificity | 0.521 | 0.542 | 0.682 | 0.650 | 0.458 | 0.604 | 0.636 | 0.584 | |
| GSP | 0.518 | 0.532 | 0.682 | 0.384 | 0.457 | 0.601 | 0.634 | 0.308 | |
| GSS | 0.507 | 0.500 | 0.677 | 0.440 | 0.450 | 0.580 | 0.624 | 0.354 | |
| MCC | 0.045 | 0.081 | 0.368 | -0.035 | -0.082 | 0.216 | 0.276 | -0.179 | |
Classification Results of Emotions in Tablet and Conventional Setups using ANN and CNN
| Item | Metric | NU | Item | Metric | PN | ||
|---|---|---|---|---|---|---|---|
| set | ANN | CNN | set | ANN | CNN | ||
| Accuracy | 0.960 | 0.980 | Accuracy | 0.762 | 0.774 | ||
| Precision | 0.963 | 0.979 | Precision | 0.785 | 0.846 | ||
| Recall | 0.947 | 0.977 | Recall | 0.762 | 0.774 | ||
| F1-score | 0.954 | 0.977 | F1-score | 0.758 | 0.760 | ||
| Specificity | 0.947 | 0.977 | Specificity | 0.762 | 0.774 | ||
| GSP | 0.954 | 0.978 | GSP | 0.766 | 0.784 | ||
| GSS | 0.946 | 0.977 | GSS | 0.751 | 0.738 | ||
| MCC | 0.910 | 0.956 | MCC | 0.546 | 0.614 | ||
| Accuracy | 0.841 | 0.952 | Accuracy | 0.795 | 0.487 | ||
| Precision | 0.904 | 0.968 | Precision | 0.692 | 0.455 | ||
| Recall | 0.762 | 0.929 | Recall | 0.690 | 0.444 | ||
| F1-score | 0.789 | 0.943 | F1-score | 0.671 | 0.441 | ||
| Specificity | 0.762 | 0.929 | Specificity | 0.690 | 0.444 | ||
| GSP | 0.811 | 0.945 | GSP | 0.680 | 0.445 | ||
| GSS | 0.722 | 0.924 | GSS | 0.508 | 0.365 | ||
| MCC | 0.650 | 0.895 | MCC | 0.426 | -0.100 | ||
The results of ANOVA for test metrics across test setups
| Mean (SD) | |||||
|---|---|---|---|---|---|
| Conventional | Robot | Tablet | |||
| Test Score (%) | 0.74 (0.22) | 0.63 (0.18) | 0.66 (0.19) | 1.101 | 0.341 |
| Tutorial (s) | 150 (237) | 190 (69) | 258 (226) | 1.319 | 0.277 |
| Total Test (s) | 461 (109) | 596 (81) | 502 (81) | 9.169 | < 0.001 |
| Response (s) | 15 (4) | 20 (3) | 17 (2) | 9.541 | < 0.001 |
Fig. 4The distributions of the test metrics for all the setups, and t-test significance between setup pairs: (ns) not significant, (**) are significant at
The results of Welch two sample t-test for test setups based on item set
| Setup | Item Set | N | Score ( | t | p |
|---|---|---|---|---|---|
| C | Env | 6 | 0.85, 0.22 | 1.887 | 0.090 |
| Ton | 6 | 0.63, 0.17 | |||
| R | Env | 10 | 0.72, 0.16 | 3.311 | 0.002 |
| Ton | 6 | 0.48, 0.10 | |||
| T | Env | 14 | 0.75, 0.18 | 3.417 | < 0.001 |
| Ton | 8 | 0.51, 0.10 |
Fig. 5The results of Welch two sample t-test for test setups based on item set
The ANOVA results for test metrics of the group tested with the same item set in all the test setups
| Mean (SD) | |||||
|---|---|---|---|---|---|
| Conventional | Robot | Tablet | |||
| Test Score (%) | 0.75 (0.23) | 0.67 (0.2) | 0.72 (0.27) | 0.177 | 0.84 |
| Tutorial (s) | 89 (15) | 152 (38) | 150 (23) | 10.53 | 0.001 |
| Total Test (s) | 450 (113) | 555 (47) | 481 (66) | 2.684 | 0.101 |
| Response (s) | 15 (4) | 18 (2) | 16 (2) | 2.591 | 0.108 |
Fig. 6The distributions of the test metrics of the group tested with the same item set in all the setups, and t-test significance between setup pairs: (ns) not significant, (*) and (**) are significant at and , respectively
Fig. 7Correlation among different factors on impressions of children: Correlations with (*), (**) and (***) are significant at , and respectively
Classification Results of RT, RC,TC, RTC using SVM, RF and LSTM
| Item | Metric | SVM | RF | LSTM | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| set | RT | RC | TC | RTC | RT | RC | TC | RTC | RT | RC | TC | RTC | |
| env | Accuracy | 0.991 | 0.926 | 0.792 | 0.754 | 0.805 | 1.000 | 0.663 | 0.716 | 0.991 | 0.982 | 0.744 | 0.791 |
| F1-score | 0.990 | 0.920 | 0.775 | 0.759 | 0.793 | 1.000 | 0.614 | 0.625 | 0.991 | 0.982 | 0.740 | 0.711 | |
| ton | Accuracy | 0.945 | 0.999 | 0.675 | 0.838 | 0.717 | 0.926 | 0.588 | 0.599 | 0.991 | 0.995 | 0.719 | 0.794 |
| F1-score | 0.948 | 0.999 | 0.536 | 0.829 | 0.691 | 0.926 | 0.541 | 0.564 | 0.991 | 0.995 | 0.717 | 0.714 | |
| env+ton | Accuracy | 0.826 | 0.796 | 0.757 | 0.620 | 0.692 | 0.644 | 0.715 | 0.373 | 0.992 | 0.991 | 0.616 | 0.722 |
| F1-score | 0.821 | 0.795 | 0.748 | 0.609 | 0.658 | 0.593 | 0.681 | 0.248 | 0.992 | 0.991 | 0.612 | 0.620 | |
Classification Results of PU, PN, NU, PNU in Robot Setup using SVM, RF and LSTM
| Item | Metric | SVM | RF | LSTM | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| set | PU | PN | NU | PNU | PU | PN | NU | PNU | PU | PN | NU | PNU | |
| Accuracy | 0.521 | 0.524 | 0.397 | 0.384 | 0.690 | 0.963 | 0.678 | 0.629 | 0.750 | 0.521 | 0.564 | 0.319 | |
| Precision | 0.580 | 0.543 | 0.394 | 0.242 | 0.691 | 0.968 | 0.708 | 0.733 | 0.765 | 0.524 | 0.564 | 0.262 | |
| Recall | 0.521 | 0.524 | 0.397 | 0.287 | 0.690 | 0.963 | 0.678 | 0.537 | 0.750 | 0.521 | 0.564 | 0.205 | |
| F1-score | 0.499 | 0.477 | 0.392 | 0.221 | 0.688 | 0.962 | 0.668 | 0.516 | 0.744 | 0.479 | 0.564 | 0.215 | |
| Specificity | 0.521 | 0.524 | 0.397 | 0.619 | 0.690 | 0.963 | 0.678 | 0.782 | 0.750 | 0.521 | 0.564 | 0.504 | |
| GSP | 0.524 | 0.503 | 0.394 | 0.241 | 0.689 | 0.964 | 0.681 | 0.561 | 0.751 | 0.500 | 0.564 | 0.224 | |
| GSS | 0.440 | 0.428 | 0.387 | 0.149 | 0.683 | 0.961 | 0.655 | 0.548 | 0.736 | 0.435 | 0.563 | 0.234 | |
| MCC | 0.094 | 0.064 | -0.208 | -0.155 | 0.381 | 0.930 | 0.385 | 0.351 | 0.515 | 0.045 | 0.129 | -0.338 | |
| Accuracy | 0.553 | 0.414 | 0.461 | 0.457 | 0.762 | 0.545 | 0.754 | 0.589 | 0.625 | 0.521 | 0.667 | 0.403 | |
| Precision | 0.568 | 0.407 | 0.456 | 0.575 | 0.772 | 0.592 | 0.828 | 0.619 | 0.697 | 0.441 | 0.702 | 0.378 | |
| Recall | 0.553 | 0.414 | 0.461 | 0.312 | 0.762 | 0.545 | 0.754 | 0.423 | 0.625 | 0.521 | 0.667 | 0.256 | |
| F1-score | 0.546 | 0.403 | 0.455 | 0.334 | 0.750 | 0.529 | 0.715 | 0.465 | 0.588 | 0.457 | 0.657 | 0.293 | |
| Specificity | 0.553 | 0.414 | 0.461 | 0.649 | 0.762 | 0.545 | 0.754 | 0.749 | 0.625 | 0.521 | 0.667 | 0.581 | |
| GSP | 0.553 | 0.407 | 0.457 | 0.361 | 0.758 | 0.548 | 0.745 | 0.492 | 0.618 | 0.469 | 0.670 | 0.304 | |
| GSS | 0.536 | 0.390 | 0.448 | 0.391 | 0.736 | 0.509 | 0.674 | 0.527 | 0.550 | 0.338 | 0.645 | 0.335 | |
| MCC | 0.121 | -0.178 | -0.083 | -0.025 | 0.534 | 0.132 | 0.554 | 0.194 | 0.298 | 0.045 | 0.366 | -0.190 | |
Classification Results of Emotions in Tablet and Conventional Setups using SVM, RF and LSTM
| Item | Metric | NU | Item | Metric | PN | ||||
|---|---|---|---|---|---|---|---|---|---|
| set | SVM | RF | LSTM | set | SVM | RF | LSTM | ||
| Accuracy | 0.983 | 0.995 | 0.869 | Accuracy | 0.941 | 1.000 | 0.726 | ||
| Precision | 0.988 | 0.996 | 0.869 | Precision | 0.947 | 1.000 | 0.827 | ||
| Recall | 0.974 | 0.992 | 0.894 | Recall | 0.941 | 1.000 | 0.726 | ||
| F1-score | 0.980 | 0.994 | 0.864 | F1-score | 0.940 | 1.000 | 0.695 | ||
| Specificity | 0.974 | 0.992 | 0.894 | Specificity | 0.941 | 1.000 | 0.726 | ||
| GSP | 0.981 | 0.994 | 0.873 | GSP | 0.942 | 1.000 | 0.733 | ||
| GSS | 0.974 | 0.992 | 0.887 | GSS | 0.938 | 1.000 | 0.657 | ||
| MCC | 0.962 | 0.988 | 0.763 | MCC | 0.888 | 1.000 | 0.537 | ||
| Accuracy | 0.667 | 0.667 | 0.714 | Accuracy | 0.385 | 0.701 | 0.769 | ||
| Precision | 0.333 | 0.333 | 0.687 | Precision | 0.206 | 0.516 | 0.710 | ||
| Recall | 0.500 | 0.500 | 0.738 | Recall | 0.308 | 0.514 | 0.625 | ||
| F1-score | 0.400 | 0.400 | 0.679 | F1-score | 0.242 | 0.438 | 0.607 | ||
| Specificity | 0.500 | 0.500 | 0.738 | Specificity | 0.308 | 0.514 | 0.625 | ||
| GSP | 0.408 | 0.408 | 0.695 | GSP | 0.249 | 0.467 | 0.635 | ||
| GSS | 0.000 | 0.000 | 0.585 | GSS | 0.029 | 0.098 | 0.402 | ||
| MCC | 0.000 | 0.000 | 0.457 | MCC | -0.409 | 0.082 | 0.358 | ||