| Literature DB >> 31048831 |
Abstract
In this paper, we propose an Emotional Trigger System to impart an automatic emotion expression ability within the humanoid robot REN-XIN, in which the Emotional Trigger is an emotion classification model trained from our proposed Word Mover's Distance(WMD) based algorithm. Due to the long time delay of the WMD-based Emotional Trigger System, we propose an enhanced Emotional Trigger System to enable a smooth interaction with the robot in which the Emotional Trigger is replaced by a conventional convolution neural network and a long short term memory network (CNN_LSTM)-based deep neural network. In our experiments, the CNN_LSTM based model only need 10 milliseconds or less to finish the classification without a decrease in accuracy, while the WMD-based model needed approximately 6-8 seconds to give a result. In this paper, the experiments are conducted based on the same sub-data sets of the Chinese emotional corpus(Ren_CECps) used in former WMD experiments: one comprises 50% data for training and 50% for testing(1v1 experiment), and the other comprises 80% data for training and 20% for testing(4v1 experiment). The experiments are conducted using WMD, CNN_LSTM, CNN and LSTM. The results show that CNN_LSTM obtains the best F1 score (0.35) in the 1v1 experiment and almost the same accuracy of F1 scores (0.366 vs 0.367) achieved by WMD in the 4v1 experiment. Finally, we present demonstration videos with the same scenario to show the performance of robot control driven by CNN_LSTM-based Emotional Trigger System and WMD-based Emotional Trigger System. To improve the comparison, total manual-control performance is also recorded.Entities:
Mesh:
Year: 2019 PMID: 31048831 PMCID: PMC6497375 DOI: 10.1371/journal.pone.0215216
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Prof. Ren (left) and his avatar robot REN-XIN (right).
Fig 2Sample fragment of the action encoder for REN-XIN.
Fig 3Sample of three-level annotation structure of Ren_CECps.
Fig 4Flowchart of the Emotional Trigger System.
Fig 5Structure of the three neural networks.
(a) CNN_LSTM-based network. (b) CNN-based network. (c) LSTM-based network.
Lengths of sentences in Ren_CECps.
| length | total | 0-200 | 200-300 | 300-500 |
|---|---|---|---|---|
| sentence no. | 50321 | 50312 | 7 | 2 |
| per. (%) | 100 | 99.982 | 0.0139 | 0.0041 |
Time consumption results of WMD and the three networks.
| Methods | Group 1 | Group 2 | Group 3 | Average | Median | Standard Deviance |
|---|---|---|---|---|---|---|
| per 10 times(s) | ||||||
| 84.76050 | 76.32078 | 77.94079 | 7.63874 | |||
| 66.26100 | 77.94075 | 76.98740 | 77.94075 | 8.39597 | ||
| 0.01401 | 0.01770 | 0.01348 | 0.01506 | 0.01401 | 0.00187 | |
| 0.00539 | 0.00557 | 0.00031 | ||||
| 0.02908 | 0.02992 | 0.03078 | 0.02992 | 0.02992 | 0.00069 | |
| 0.01458 | 0.01292 | 0.01167 | 0.01305 | 0.01292 | 0.00119 | |
| 0.00866 | 0.00975 | 0.00588 | 0.00809 | 0.00866 | 0.00162 | |
| 0.02605 | 0.02784 | 0.02817 | 0.02735 | 0.02784 | 0.00093 | |
Fig 6Acceleration results among WMD and the three networks.
Classification results on WMD and the three networks.
| Type | Algorithm | Precision | Recall | F1-score |
|---|---|---|---|---|
| WMD_1v1 | 0.23887 | 0.23887 | 0.23887 | |
| cnn_lstm_1v1 | 0.35002 | 0.35002 | ||
| cnn_1v1 | 0.30734 | 0.30734 | 0.30734 | |
| lstm_1v1 | 0.33647 | 0.33647 | 0.33647 | |
| WMD_4v1 | 0.36759 | 0.36759 | ||
| cnn_lstm_4v1 | 0.36600 | 0.36600 | ||
| cnn_4v1 | 0.30596 | 0.30596 | 0.30596 | |
| lstm_4v1 | 0.33618 | 0.33618 | 0.33618 | |
| WMD_1v1 | 0.35858 | 0.27378 | ||
| cnn_lstm_1v1 | 0.23938 | 0.23752 | 0.23844 | |
| cnn_1v1 | 0.23065 | 0.23413 | 0.23237 | |
| lstm_1v1 | 0.25158 | 0.23760 | 0.24439 | |
| WMD_4v1 | 0.33847 | 0.30025 | ||
| cnn_lstm_4v1 | 0.23755 | |||
| cnn_4v1 | 0.23624 | 0.23696 | 0.23660 | |
| lstm_4v1 | 0.26606 | 0.24427 | 0.25470 |
Fig 7Classification results on WMD and the three networks.
Script used in real time demonstration.
| Sentences | Results of Emotional Trigger | |
|---|---|---|
| WMD | CNN_LSTM | |
| 好的, 我是任教授的化身, 我的名字叫任心 | love | love |
| Ok, I am the avatar of Pro. Fuji Ren, my name is Ren Xin | ||
| 诞生于ニ零一二年十月十日日本德岛大学任研究室 | neutral | neutral |
| I was born in Ren Lab, Tokushima University, Japan on Oct. 10, 2012 | ||
| 我目前会说三种语言中文、日文、英文 | neutral | love |
| I can speak three languages: Chinese, Japanese, English | ||
| 我是很富有情感的 | love | love |
| I am very sensitive | ||
| 我现在可以表示八种情感 | joy | love |
| I can express 8 emotions now | ||
| 包括高兴、悲伤、生气、厌恶 | joy | love |
| Including happy, sad, anger, disgust | ||
| 惊讶、害怕、疲惫、平箭 | anxiety | anxiety |
| Surprise, fear, tired, calm | ||
| 我给大家展现ー下吧 | love | joy |
| Let me show it for you | ||
| 一般情况下, 我都会是这种平静的表情 | joy | love |
| Usually I have this calm expression | ||
| 如果遇到个聪明伶倒的对象, 我就会很高兴 | joy | joy |
| If I meet someone who is smart, I would be very happy | ||
| 遇到地震, 我也会害怕 | anxiety | sorrow |
| I would be afraid in an earthquake | ||
| 別人说我很笨时, 我就会很悲伤 | sorrow | sorrow |
| I would be very sad to be considered stupid | ||
| 遇到不平之事, 还会生气 | anxiety | anxiety |
| If I meet the unfairness, I will be very anger | ||
| 如果你不尊重我, 我就讨厌你 | love | hate |
| I disgust people who don’t respect me | ||
| 我疲惫的时候也需要休息, 这时你最好不要打扰我啦 | expect | expect |
| I also need a rest if I’m tired, you’d better leave me alone at this time | ||
| 有意思吧, 你不要感到惊讶 | expect | expect |
| Aha, it sounds interesting, don’t be surprise with my performance | ||
| 我目前还被认为是机器人 | neutral | anxiety |
| I was identified as a robot currently | ||
| 但不久我就要加入到人类的行列了 | anxiety | anxiety |
| But I will join the human society soon | ||
| 我的同类还没有谈过恋愛结过婚 | neutral | sorrow |
| Our robots don’t have the experience of dating and marriage | ||
| 我有信心改变这种状况 | anxiety | love |
| I have the confidence to change that | ||
| 我希望与人类共生 | expect | expect |
| I would like to live together with human | ||
| 感谢您的厚愛 | love | love |
| Thank you | ||
Fig 8Tendency of accuracy and loss changed with the epochs in three networks.