| Literature DB >> 24240598 |
Fernando Alonso-Martín1, María Malfaz, João Sequeira, Javier F Gorostiza, Miguel A Salichs.
Abstract
In this paper, a multimodal user-emotion detection system for social robots is presented. This system is intended to be used during human-robot interaction, and it is integrated as part of the overall interaction system of the robot: the Robotics Dialog System (RDS). Two modes are used to detect emotions: the voice and face expression analysis. In order to analyze the voice of the user, a new component has been developed: Gender and Emotion Voice Analysis (GEVA), which is written using the Chuck language. For emotion detection in facial expressions, the system, Gender and Emotion Facial Analysis (GEFA), has been also developed. This last system integrates two third-party solutions: Sophisticated High-speed Object Recognition Engine (SHORE) and Computer Expression Recognition Toolbox (CERT). Once these new components (GEVA and GEFA) give their results, a decision rule is applied in order to combine the information given by both of them. The result of this rule, the detected emotion, is integrated into the dialog system through communicative acts. Hence, each communicative act gives, among other things, the detected emotion of the user to the RDS so it can adapt its strategy in order to get a greater satisfaction degree during the human-robot dialog. Each of the new components, GEVA and GEFA, can also be used individually. Moreover, they are integrated with the robotic control platform ROS (Robot Operating System). Several experiments with real users were performed to determine the accuracy of each component and to set the final decision rule. The results obtained from applying this decision rule in these experiments show a high success rate in automatic user emotion recognition, improving the results given by the two information channels (audio and visual) separately.Entities:
Mesh:
Year: 2013 PMID: 24240598 PMCID: PMC3871074 DOI: 10.3390/s131115549
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1.The multimodal interaction system Robotics Dialog System (RDS).
Figure 2.Two kinds of fusion levels: decision and feature extraction level. (a) A unique classifier (fusion at the feature extraction level); (b) one classifier for each channel (fusion at the decision level).
Figure 3.Multimodal emotion detection system.
Figure 4.The three audio domains in which voice feature extraction is performed.
Figure 5.Rotation parameters: roll, pitch and yaw.
Figure 6.Scheme of the process for determining the main user emotion in each communicative act (CA).
Figure 7.The robot used in the experiments.
Figure 8.Image taken during the experiments carried out in the ISTin Lisbon. (a) Gender and Emotion Voice Analysis (GEVA); (b) Computer Expression Recognition Toolbox (CERT); (c) Sophisticated High-speed Object Recognition Engine (SHORE).
Confusion matrices for GEVA (rows: real emotions; columns: detected emotions).
|
|
| |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
| |||||||||
| 50 | 50 | 0 | 0 | 30 | 70 | 0 | 0 | |||
| 0 | 80.28 | 19.71 | 0 | 0 | 87.32 | 12.67 | 0 | |||
| 0 | 33.33 | 66.66 | 0 | 0 | 22.22 | 77.77 | 0 | |||
| 28.57 | 42.85 | 0 | 28.57 | 16.66 | 50 | 0 | 33.33 | |||
|
|
| |||||||||
Confusion matrices for Gender and Emotion Facial Analysis (rows: real emotions; columns: detected emotions).
|
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| ||||||||||
| 100 | 0 | 0 | 0 | 100 | 0 | 0 | 0 | ||||
| 0 | 66.66 | 16.66 | 16.66 | 0 | 85.71 | 14.28 | 0 | ||||
| 0 | 22.22 | 55.55 | 22.22 | 0 | 71.42 | 28.57 | 0 | ||||
| 0 | 10 | 10 | 80 | 9.09 | 54.54 | 0 | 36.36 | ||||
|
|
| ||||||||||
Statistically-computed confusion matrix (rows: real emotions; columns: detected emotions).
| 100 | 0 | 0 | 0 | |
| 0 | 46.77 | 41.46 | 11.76 | |
| 0 | 1.635 | 96.66 | 1.701 | |
| 4.345 | 2.639 | 2.60 | 90.411 |
Figure 9.The robot, Maggie.
Experimental confusion matrix (rows: real emotions; columns: detected emotions).
| 100 | 0 | 0 | 0 | |
| 6 | 67 | 27 | 0 | |
| 15 | 18.33 | 66.66 | 0 | |
| 0 | 12 | 10 | 78 |