Gulmira Bekmanova, Banu Yergesh, Altynbek Sharipbay, Assel Mukanova.
Abstract
The emotional speech recognition method presented in this article was applied to recognize the emotions of students during online exams held in distance learning because of COVID-19. The method analyzes spoken speech for the presence of emotions using a knowledge base of emotionally charged words, which are stored as a code book. To assess the quality of the method, an experiment was conducted on 420 audio recordings. The accuracy of the proposed method is 79.7% for the Kazakh language. The method can be used for different languages and consists of the following tasks: capturing a signal, detecting speech in it, recognizing the spoken words in a simplified transcription, determining word boundaries, comparing the simplified transcription with the code book, and constructing a hypothesis about the degree of speech emotionality. If emotions are detected, the words are then recognized in full and the emotions in the speech are determined. The advantage of this method is that it can be used widely, since it is not demanding on computational resources. The described method can be applied wherever positive and negative emotions need to be recognized: in a crowd, in public transport, in schools, universities, etc. The experiment carried out has shown the effectiveness of this method. In the future, the results obtained will make it possible to develop devices that begin to record and recognize a speech signal when, for example, negative emotions are detected in spoken speech, and that transmit a message about potential threats or riots if necessary.
Keywords: affective computing; artificial intelligence; crowd emotion recognition; distance learning; e-learning; emotion recognition; speech recognition
Year: 2022 PMID: 35271083 PMCID: PMC8915129 DOI: 10.3390/s22051937
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1. Emotional speech recognition process.
Figure 2. The result of automatic splitting of the voice section of the signal into quasi-periods.
Figure 3. Kazakh speech recognition system.
Symbols of the current alphabet and intermediate alphabet.
| Current Alphabet | Intermediate Alphabet | Transcription | Current Alphabet | Intermediate Alphabet | Transcription |
|---|---|---|---|---|---|
| А | А | (ɑ) | Б | Б | (b) |
| Ә | Ә | (æ) | В | В | (v) |
| Е | Е | (е) | Г | Г | (g) |
| О | О | (ɔ) | Ғ | Ғ | (ɣ) |
| Ө | Ө | (ɵ) | Д | Д | (d) |
| Ұ | Ұ | (ʊ, u) | Ж | Ж | (Ʒ) |
| Ү | Ү | (ү) | З | З | (z) |
| Ы | Ы | (ɯ) | Й | Й | (y) |
| І | І | (ɪ, i) | К | К | (k) |
| Э | Е | (jɪ) | Қ | Қ | (q) |
| Я | ЙА | (yɑ) | Л | Л | (l) |
| Ю | ЙУ | (yw) | М | М | (m) |
| Ё | ЙО | (yɔ) | Н | Н | (n) |
| И | ІЙ | (iy) | Ң | Ң | (ŋ) |
| И | ЫЙ | (ɯj) | П | П | (p) |
| Ч | Ш | (tʃ) | Р | Р | (r) |
| Щ | Ш | (ʃ) | С | С | (s) |
| Ц | С | (tc) | Т | Т | (t) |
| Һ | Х | (h) | У | У | (w) |
| Ъ | - | | Ш | Ш | (ʃ) |
| Ь | - | | Ф | Ф | (f) |
| | | | Х | Х | (h) |
Symbol classes of the intermediate alphabet used for the generalized transcription.
| Classes | Symbols | Meaning |
|---|---|---|
| W | аұыoеәүіөу | vowels and consonant «У» |
| C | бвгғджзйлмнңр | voiced consonants |
| F | сш | voiceless hush consonants |
| P | кқптфх | voiceless consonants |
Examples of words with the structure CWCCWCW.
| Kazakh Word | Transcription | Generalized Transcription |
|---|---|---|
| бұлдану | bʊldɑnw | CWCCWCW |
| бұлдыра | bʊldɯrɑ | CWCCWCW |
| бүлдіру | bүldɪrw | CWCCWCW |
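The generalized transcriptions in the tables can be reproduced by first rewriting a word in the intermediate alphabet and then replacing each symbol with its class letter. The following is a minimal sketch of that idea, not the authors' implementation: the function names `to_intermediate` and `generalize` are hypothetical, the letter И is always expanded to ІЙ (the alphabet table also lists ЫЙ, which yields the same classes W, C), and characters outside the four classes, such as spaces, hyphens, and punctuation, are passed through unchanged.

```python
# Intermediate-alphabet substitutions from the alphabet table (lowercase forms).
INTERMEDIATE = {
    "э": "е", "я": "йа", "ю": "йу", "ё": "йо",
    "и": "ій",  # assumption: the table also allows "ый"; both give classes W, C
    "ч": "ш", "щ": "ш", "ц": "с", "һ": "х",
    "ъ": "", "ь": "",  # the signs have no intermediate symbol
}

# Symbol classes from the class table.
CLASSES = {
    "W": "аұыоеәүіөу",      # vowels and the consonant "у"
    "C": "бвгғджзйлмнңр",   # voiced consonants
    "F": "сш",              # voiceless hush consonants
    "P": "кқптфх",          # voiceless consonants
}
SYMBOL_TO_CLASS = {ch: cls for cls, chars in CLASSES.items() for ch in chars}


def to_intermediate(word: str) -> str:
    """Rewrite a word in the intermediate alphabet (lowercase)."""
    return "".join(INTERMEDIATE.get(ch, ch) for ch in word.lower())


def generalize(word: str) -> str:
    """Replace every intermediate symbol with its class letter.

    Characters that belong to no class are kept as they are.
    """
    return "".join(SYMBOL_TO_CLASS.get(ch, ch) for ch in to_intermediate(word))


if __name__ == "__main__":
    for w in ("бұлдану", "бұлдыра", "бүлдіру"):
        print(w, generalize(w))
```

Because many different words collapse onto the same class string, a generalized transcription can only narrow down the candidates; it cannot identify a word on its own.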
Fragment of the emotional vocabulary.
| Word | Transcription | Translation | POS | Emotion |
|---|---|---|---|---|
| діріл | dɪrɪl | trembling | N | fear |
| қoрқақтық | qɔrqɑqtɯq | cowardice | N | fear |
| ақылсыз | ɑqɯlsɯz | stupid | N | anger |
| қызғаныш | qɯzɣɑnɯʃ | jealousy | N | anger |
| құрмет | qʊrmеt | honor | N | happiness |
| нәзіктік | næzɪktɪk | tenderness | N | happiness |
| шапшаң | ʃɑpʃɑŋ | quick | Adv | happiness |
| шарасыздан | ʃɑrɑsɯzdɑn | involuntarily | Adv | sadness |
| сөзқұмар | sɵzqʊmɑr | garrulous, chatty | Adj | disgust |
Examples of words from the emotion vocabulary with their generalized transcriptions.
| Word | Transcription | Translation | POS | Emotion | Generalized Transcriptions |
|---|---|---|---|---|---|
| қoрқақтық | qɔrqɑqtɯq | cowardice | N | fear | PWCPWPPWP |
| ақылсыз | ɑqɯlsɯz | stupid | N | anger | WPWCFWC |
| көз жасы | kɵz Ʒɑsɯ | tear | N | sadness | PWC CWFW |
| құрмет | qʊrmеt | honor | N | happiness | PWCCWP |
| шапшаң | ʃɑpʃɑŋ | quick | Adv | happiness | FWPFWC |
| шарасыздан | ʃɑrɑsɯzdɑn | involuntarily | Adv | sadness | FWCWFWCCWC |
| сөзқұмар | sɵzqʊmɑr | garrulous, chatty | Adj | disgust | FWCPWCWC |
| тату | tɑtw | amicably | Adj | happiness | PWPW |
| тиянақсыз | tiyyɑnɑqsɯz | fragile | Adj | anger | PWCCWCWPFWC |
| пішту! | pɪʃtw! | my gosh | Intj | disgust | PWFPW! |
| туу | tww | Holy | Intj | sadness | PWW |
| уай | wɑy | Wow | Intj | happiness | WWC |
| бұзықтық істеу | bʊzɯqtɯq ɪstеw | roughhouse | V | anger | CWCWPPWP WFPWW |
| бәрекелді | bærеkеldɪ | Bravo | Intj | happiness | CWCWPWCCW |
| әй | æy | hey | Intj | anger | WC |
| әттеген-ай | ættеgеn-ɑy | What a pity | Intj | sadness | WPPWCWC-WC |
| қап | qɑp | it’s a shame | Intj | sadness | PWP |
| масқарай | mɑsqɑrɑy | What a mess | Intj | sadness | CWFPWCWC |
| мәссаған | mæssɑɣɑn | Gee | Intj | fear | CWFFWCWC |
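The comparison of a simplified transcription with the code book can be pictured as a dictionary lookup keyed by the generalized transcription. The sketch below assumes this dictionary layout; `build_code_book`, `candidate_entries`, and the small `ENTRIES` sample (copied from the table above) are illustrative names and not part of the published method. A hit only produces a hypothesis that emotional words may be present, which is the point at which the method proceeds to full word recognition.

```python
from collections import defaultdict

# A few (word, generalized transcription, emotion) rows from the table above.
ENTRIES = [
    ("қорқақтық", "PWCPWPPWP", "fear"),
    ("ақылсыз", "WPWCFWC", "anger"),
    ("құрмет", "PWCCWP", "happiness"),
    ("шапшаң", "FWPFWC", "happiness"),
    ("қап", "PWP", "sadness"),
    ("әй", "WC", "anger"),
]


def build_code_book(entries):
    """Group vocabulary entries by their generalized transcription."""
    book = defaultdict(list)
    for word, pattern, emotion in entries:
        book[pattern].append((word, emotion))
    return dict(book)


def candidate_entries(patterns, code_book):
    """Return the (word, emotion) candidates for every pattern found in the code book.

    An empty result means no emotionally charged word is hypothesized.
    """
    hits = []
    for pattern in patterns:
        hits.extend(code_book.get(pattern, []))
    return hits


if __name__ == "__main__":
    book = build_code_book(ENTRIES)
    # Generalized transcriptions of the words detected in one utterance (made up).
    print(candidate_entries(["CWC", "PWP", "WC"], book))
```

Storing a list per pattern matters because several vocabulary words can share one generalized transcription.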
Meta-designations.
| Designation | Purpose |
|---|---|
| | Set of words in the language (variables) |
| | |
| | Set of sentences in the language |
| | Set of nouns |
| | Set of adjectives |
| | Set of pronouns |
| | Set of positive verb forms |
| | Set of negative verb forms |
| | Set of interjections |
| | Set of adverbs or intensifiers |
| emo | Emotion establishment (predicate) |
| @ | Negation words “емес/жоқ” (“no”), constants |
| | Transformation to negative form (operation) |
| | Concatenation (operation) |
Emotion classes.
| Emotion Classes | Polarity | Example |
|---|---|---|
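| happiness | positive | Алақай! Мен сәтті аяқтадым (Hooray! I finished successfully) |
| fear | negative | Жауаптарды ұмытып қалдым (I forgot the answers) |
| disgust | negative | Туу, oйдағыдай баға алмадым (Ugh, I did not get the grade I expected) |
| sadness | negative | Қап! кейбір жауапты білмей қалдым. (What a shame! I did not know some of the answers.) |
| anger | negative | Кедергі жасама! Уақыт тығыз (Don't get in the way! Time is tight) |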
| neutral | neutral | Бүгін барлығы емтихан тапсырады (everyone is taking exams today) |
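The class-to-polarity mapping in this table is what allows the six emotion classes to be reduced to positive and negative signals, for example when monitoring a group of speakers. A minimal sketch, assuming the recognizer outputs one class label per utterance; the names `POLARITY` and `polarity_counts` are hypothetical.

```python
from collections import Counter

# Polarity of each emotion class, as listed in the table above.
POLARITY = {
    "happiness": "positive",
    "fear": "negative",
    "disgust": "negative",
    "sadness": "negative",
    "anger": "negative",
    "neutral": "neutral",
}


def polarity_counts(emotions):
    """Count how many recognized emotions fall into each polarity."""
    return Counter(POLARITY[emotion] for emotion in emotions)


if __name__ == "__main__":
    print(polarity_counts(["happiness", "fear", "anger", "neutral"]))
```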
Quantitative evaluation of the different methods on the speech emotion recognizer.
| Method | Dataset Language | Number of Classes | Accuracy |
|---|---|---|---|
| Emotional Speech Recognition Method | Kazakh | 6 | 79.7% |
| DNN model [ | Kazakh, Russian | 3 | 82.07% |
Figure 4. Confusion matrix of Emotional Speech Recognition Method.
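An accuracy figure such as the one reported above is conventionally read off a confusion matrix as the share of recordings on the diagonal. The sketch below shows that calculation under the assumption that rows are true classes and columns are predicted classes; the `toy` matrix is made up for illustration and does not reproduce the values in Figure 4.

```python
def accuracy(confusion):
    """Fraction of correctly classified samples in a square confusion matrix.

    confusion[i][j] is the number of samples of true class i predicted as class j.
    """
    total = sum(sum(row) for row in confusion)
    if total == 0:
        raise ValueError("confusion matrix contains no samples")
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total


if __name__ == "__main__":
    # Made-up 3-class example, NOT the paper's data.
    toy = [
        [8, 1, 1],
        [2, 7, 1],
        [0, 1, 9],
    ]
    print(f"{accuracy(toy):.1%}")
```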