| Literature DB >> 25279686 |
Andoni Arruti1, Idoia Cearreta1, Aitor Alvarez2, Elena Lazkano1, Basilio Sierra1.
Abstract
Study of emotions in human-computer interaction is a growing research area. This paper shows an attempt to select the most significant features for emotion recognition in spoken Basque and Spanish Languages using different methods for feature selection. RekEmozio database was used as the experimental data set. Several Machine Learning paradigms were used for the emotion classification task. Experiments were executed in three phases, using different sets of features as classification variables in each phase. Moreover, feature subset selection was applied at each phase in order to seek for the most relevant feature subset. The three phases approach was selected to check the validity of the proposed approach. Achieved results show that an instance-based learning algorithm using feature subset selection techniques based on evolutionary algorithms is the best Machine Learning paradigm in automatic emotion recognition, with all different feature sets, obtaining a mean of 80,05% emotion recognition rate in Basque and a 74,82% in Spanish. In order to check the goodness of the proposed process, a greedy searching approach (FSS-Forward) has been applied and a comparison between them is provided. Based on achieved results, a set of most relevant non-speaker dependent features is proposed for both languages and new perspectives are suggested.Entities:
Mesh:
Year: 2014 PMID: 25279686 PMCID: PMC4184843 DOI: 10.1371/journal.pone.0108975
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of RekEmozio database scope for recordings.
| Language | #Actors | #male/#female | Mean Age (std dev) |
| Basque | 7 | 4/3 | 31.3 (5.2) |
| Spanish | 10 | 5/5 | 30.7 (4.1) |
| Overall | 17 | 9/8 | 30.9 (4.4) |
Amount of text used for both Spanish and Basque languages.
| Text unit | Specific foreach emotion | Used in allemotions | Total | Per actor |
| Words | 35 | 5 | 40 | 70 |
| Sentences | 21 | 3 | 24 | 42 |
| Paragraphs | 21 | 3 | 24 | 42 |
| Total | 77 | 11 | 88 | 154 |
Lengths of RekEmozio database’s audio recordings.
| Language | Recording’s lengths |
| Basque | 130’41” |
| Spanish | 166’17” |
| Total | 296’58” |
Prosodic Features extracted for each validated recording.
| Feature class | Description | Computed values |
| FundamentalFrequency | F0 curve in thevoiced parts.Estimation basedon Sun algorithm. | Maximum and its position, minimum and its position, mean, variance, standard deviation, maximum positive slope in contour, regression coefficient and its mean square error. |
| Pitch derivative based features | ||
| Energy | Energy, RMSenergy andLoudness. | Maximum and its position, minimum and its position, mean, variance, regression coefficient and its mean square error. |
| RMS: maximum, minimum, mean, range, variance and standard deviation. | ||
| Loudness: absolute loudness based on Zwicker’s model. | ||
| Voiced/Unvoiced | Features basedon Voiced andUnvoiced framesand regions. | F0 value of the first and last voiced frames, number of voiced and unvoiced frames and regions, length of the longest voiced and unvoiced regions, ratio of number of voiced and unvoiced frames and regions. |
| Relations | Relations amongseveral features. | Mean, variance, mean of the maximum, variance of the maximum, mean of the pitch ranges and mean of the flatness of the pitch based on every voiced region pitch values. |
| Pitch increasing and decreasing in voiced parts as well as the mean of the voiced regions duration. | ||
| Many features related with the energy among the voiced regions, such as global energy mean, vehemence, mean of the flatness and tremor in addition to others. | ||
| Rhythm | Alternation betweenspeech and silence. | Duration of voice, silence, maximum voice, minimum voice, maximum silence and minimum silence in the whole utterance are computed. |
Spectral Features extracted for each validated recording.
| Feature class | Description | Computed values |
| Formants | Resonance characteristicsof the vocal tract. | Mean of the first, second and third formant frequencies and their bandwidths among all voiced region as well as the mean, maximum and range of the second formant ratio. |
| Critical Bands | Energy in severalfrequency bands,using two differentspectral distributions. | Energy in three frequency bands: low band (0–1300 Hz), medium band (1300–2600 Hz) and high band (2600–4000 Hz). |
| Energy in four frequency bands: (0 - F0 Hz), (0–1000 Hz), (2500–3500 Hz) and (4000–5000 Hz). | ||
| Relative energy in each band for voiced parts of utterance. |
Quality Features extracted for each validated recording.
| Feature class | Description | Computed values |
| Harmonicity to noise ratio | Ratio of the energy ofharmonic frames to theenergy of remainingpart of the signal. | Maximum harmonicity, minimum, mean, range and standard deviation. |
| Jitter | Pitch perturbationin vocal chordsvibration. | Cycle-to-cycle variation of pitch. |
| Shimmer | Energy perturbationin vocal chordsvibration. | Cycle-to-cycle variation of energy. |
| Active level | Signal activelevel features. | Maximum, minimum, mean and variance of the speech active level among the voiced regions. |
Figure 1Main scheme of the Estimation of Distribution Algorithms (EDA) approach.
10-fold crossvalidation accuracy of first phase for actors in Basque.
| Female | Male | Total | ||||||||
| F1 | F2 | F3 | mean | M1 | M2 | M3 | M4 | mean | ||
| IB | 35.38 | 48.79 | 35.23 | 39.80 | 44.17 | 49.32 | 36.89 | 40.91 | 42.82 | 41.52 |
| ID3 | 38.71 | 45.45 | 44.70 |
| 46.67 | 46.97 | 43.26 | 51.14 | 47.01 | 45.27 |
| C4.5 | 41.52 | 52.20 | 35.00 | 42.90 | 60.38 | 53.26 | 45.08 | 49.47 |
|
|
| NB | 42.95 | 45.76 | 37.65 | 42.12 | 52.20 | 44.09 | 36.21 | 41.44 | 43.48 | 42.90 |
10-fold crossvalidation accuracy of third phase for actors in Spanish applying EDA-FSS.
| Female | Male | Total | |||||||||||
| F1 | F2 | F3 | F4 | F5 | mean | M1 | M2 | M3 | M4 | M5 | mean | ||
| IB | 71.82 | 77.27 | 80.91 | 80.91 | 78.18 |
| 59.09 | 73.64 | 80.91 | 74.55 | 69.09 |
|
|
| ID3 | 68.18 | 75.45 | 80.00 | 70.00 | 75.45 | 73.82 | 50.00 | 70.00 | 80.00 | 72.73 | 67.27 | 68.00 | 70.91 |
| C4.5 | 67.27 | 73.64 | 80.00 | 71.82 | 70.91 | 72.73 | 52.73 | 70.00 | 76.36 | 75.45 | 66.36 | 68.18 | 70.46 |
| NB | 70.00 | 77.27 | 78.18 | 77.27 | 62.73 | 73.09 | 51.82 | 63.64 | 74.55 | 69.09 | 60.00 | 63.82 | 68.46 |
Confusion Matrix of the F2 Basque actor.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
| Sadness |
| 0 | 0 | 0 | 0 | 0 | 0 |
| Fear | 0 |
| 1 | 0 | 0 | 0 | 1 |
| Joy | 0 | 0 |
| 1 | 0 | 0 | 0 |
| Anger | 0 | 0 | 2 |
| 0 | 0 | 1 |
| Surprise | 1 | 1 | 1 | 0 |
| 1 | 1 |
| Disgust | 2 | 0 | 2 | 0 | 0 |
| 0 |
| Neutral | 1 | 0 | 0 | 0 | 0 | 0 |
|
Confusion Matrix of the M3 Spanish actor.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
| Sadness |
| 0 | 0 | 0 | 0 | 0 | 1 |
| Fear | 0 |
| 2 | 1 | 0 | 1 | 0 |
| Joy | 0 | 1 |
| 0 | 0 | 0 | 1 |
| Anger | 0 | 0 | 1 |
| 0 | 2 | 0 |
| Surprise | 0 | 1 | 0 | 1 |
| 2 | 1 |
| Disgust | 0 | 0 | 3 | 1 | 2 |
| 0 |
| Neutral | 0 | 0 | 0 | 0 | 0 | 0 |
|
10-fold crossvalidation accuracy for Basque applying FSS-FORWARD to the whole set.
| Female | Male | Total | ||||||||
| F1 | F2 | F3 | mean | M1 | M2 | M3 | M4 | mean | ||
| IB | 38.91 | 46.55 | 44.00 | 43.15 | 66.18 | 51.45 | 47.45 | 48.55 | 53.41 | 49.01 |
| ID3 | 42.73 | 43.55 | 52.45 | 46.24 | 59.27 | 42.82 | 49.36 | 45.45 | 49.23 | 47.95 |
| C4.5 | 47.18 | 49.45 | 36.00 | 44.21 | 63.00 | 33.73 | 39.64 | 43.64 | 45.00 | 44.66 |
| NB | 47.45 | 62.09 | 31.09 |
| 69.73 | 56.18 | 46.45 | 50.36 |
|
|
10-fold crossvalidation accuracy for Spanish using FSS-FORWARD to the whole set.
| Female | Male | Total | |||||||||||
| F1 | F2 | F3 | F4 | F5 | mean | M1 | M2 | M3 | M4 | M5 | mean | ||
| IB | 45.45 | 46.36 | 56.36 | 52.73 | 32.73 | 46.73 | 23.64 | 26.36 | 47.27 | 44.55 | 35.45 | 35.45 | 41.09 |
| ID3 | 38.18 | 45.45 | 60.00 | 48.18 | 45.45 | 47.45 | 26.36 | 40.91 | 44.55 | 42.73 | 40.00 | 38.91 | 43.18 |
| C4.5 | 35.45 | 46.36 | 57.27 | 55.45 | 39.09 | 46.72 | 29.09 | 28.18 | 44.55 | 35.45 | 37.27 | 34.91 | 40.82 |
| NB | 45.45 | 54.55 | 53.64 | 61.72 | 40.91 |
| 28.18 | 38.18 | 53.64 | 49.09 | 34.55 |
|
|
Figure 2Results for the Basque Language without Feature Subset Selection.
Performance comparison between four Machine Learning paradigms (IB: Instance Based, ID3: Decision Tree, C4.5: Decision Tree, NB: Naive-Bayes) without any kind of FSS. Mean accuracy obtained in the three phases, for the Basque language, is shown.
Figure 5Results for the Spanish Language using EDA Feature Subset Selection.
Performance comparison between four Machine Learning paradigms (IB: Instance Based, ID3: Decision Tree, C4.5: Decision Tree, NB: Naive-Bayes) using EDA-FSS. Mean accuracy obtained in the three phases, for the Basque language, is shown. Results obtained with a standard FSS-Forward approach are also shown.
p-values obtained with Wilcoxon test comparing FSS methods.
| Classifier | FSS-FWD > without FSS ? | EDA > FSS-FWD ? |
| All |
|
|
| IB |
|
|
| ID3 | 0,95359 |
|
| C4.5 | 0,92620 |
|
| NB |
|
|
p-values obtained with Wilcoxon test comparing the best classifier for each FSS method with the others classifiers.
| FSS method | Classifier | > IB ? | > ID3 ? | > C4.5 ? | > NB ? |
| None | ID3 |
| 1 | 0,06477 |
|
| Forward | NB |
| 0,05119 |
| 1 |
| EDA | IB | 1 |
|
|
|
10-fold crossvalidation accuracy of first phase for actors in Spanish.
| Female | Male | Total | |||||||||||
| F1 | F2 | F3 | F4 | F5 | mean | M1 | M2 | M3 | M4 | M5 | mean | ||
| IB | 34.55 | 43.64 | 54.55 | 54.55 | 38.18 | 45.09 | 25.45 | 33.64 | 51.82 | 47.65 | 33.64 |
|
|
| ID3 | 36.36 | 52.73 | 49.09 | 47.27 | 42.73 |
| 20.91 | 30.91 | 40.91 | 47.27 | 40.00 | 36.00 | 40.82 |
| C4.5 | 30.91 | 50.00 | 46.36 | 43.64 | 42.73 | 42.72 | 29.09 | 31.82 | 46.36 | 42.73 | 35.45 | 37.09 | 39.91 |
| NB | 38.18 | 42.73 | 49.09 | 40.00 | 42.73 | 42.54 | 24.55 | 30.91 | 49.09 | 45.45 | 34.55 | 36.91 | 39.73 |
10-fold crossvalidation accuracy of first phase for actors in Basque applying EDA-FSS.
| Female | Male | Total | ||||||||
| F1 | F2 | F3 | mean | M1 | M2 | M3 | M4 | mean | ||
| IB | 63.03 | 68.03 | 59.32 |
| 72.65 | 67.35 | 60.98 | 62.80 |
|
|
| ID3 | 62.73 | 60.48 | 65.45 | 62.88 | 72.65 | 61.97 | 56.52 | 62.65 | 63.44 | 63.20 |
| C4.5 | 60.23 | 65.98 | 60.00 | 62.07 | 71.82 | 62.80 | 60.08 | 63.56 | 64.56 | 63.49 |
| NB | 64.47 | 64.55 | 48.94 | 59.32 | 74.55 | 62.50 | 62.73 | 60.00 | 64.94 | 62.53 |
10-fold crossvalidation accuracy of first phase for actors in Spanish applying EDA-FSS.
| Female | Male | Total | |||||||||||
| F1 | F2 | F3 | F4 | F5 | mean | M1 | M2 | M3 | M4 | M5 | mean | ||
| IB | 61.82 | 66.36 | 75.45 | 71.82 | 68.18 |
| 42.73 | 57.27 | 69.09 | 63.64 | 60.91 |
|
|
| ID3 | 59.09 | 66.36 | 66.36 | 60.00 | 61.81 | 62.72 | 42.73 | 51.82 | 66.36 | 61.82 | 60.00 | 56.54 | 59.63 |
| C4.5 | 57.27 | 62.73 | 64.55 | 65.45 | 63.64 | 62.72 | 43.64 | 56.36 | 65.45 | 64.55 | 56.36 | 57.27 | 60.00 |
| NB | 54.55 | 59.09 | 68.18 | 65.45 | 60.00 | 61.45 | 40.91 | 48.18 | 64.55 | 59.09 | 51.82 | 52.91 | 57.18 |
10-fold crossvalidation accuracy of second phase for actors in Basque.
| Female | Male | Total | ||||||||
| F1 | F2 | F3 | mean | M1 | M2 | M3 | M4 | mean | ||
| IB | 34.00 | 42.91 | 33.91 | 36.94 | 56.18 | 41.00 | 36.91 | 36.82 | 42.73 | 40.25 |
| ID3 | 49.45 | 45.91 | 46.78 |
| 54.27 | 44.00 | 51.45 | 49.45 |
|
|
| C4.5 | 42.73 | 40.09 | 42.73 | 41.85 | 60.36 | 39.55 | 48.45 | 37.82 | 46.55 | 44.54 |
| NB | 39.82 | 31.00 | 46.45 | 39.09 | 60.36 | 29.91 | 36.91 | 41.44 | 42.16 | 40.84 |
10-fold crossvalidation accuracy of second phase for actors in Spanish.
| Female | Male | Total | |||||||||||
| F1 | F2 | F3 | F4 | F5 | mean | M1 | M2 | M3 | M4 | M5 | mean | ||
| IB | 36.46 | 41.92 | 41.92 | 43.64 | 33.64 | 39.52 | 30.00 | 36.46 | 44.55 | 36.46 | 30.00 | 35.49 | 37.51 |
| ID3 | 38.18 | 47.27 | 55.45 | 43.64 | 44.55 | 45.82 | 24.55 | 40.00 | 50.00 | 46.36 | 34.55 |
|
|
| C4.5 | 42.73 | 48.18 | 50.91 | 50.91 | 45.45 |
| 21.82 | 39.09 | 46.36 | 48.18 | 27.27 | 36.54 | 42.00 |
| NB | 34.55 | 34.45 | 40.91 | 32.73 | 31.82 | 34.89 | 20.91 | 39.09 | 40.00 | 35.45 | 21.82 | 31.45 | 33.17 |
10-fold crossvalidation accuracy of second phase for actors in Basque applying EDA-FSS.
| Female | Male | Total | ||||||||
| F1 | F2 | F3 | mean | M1 | M2 | M3 | M4 | mean | ||
| IB | 72.55 | 79.73 | 62.27 |
| 91.36 | 73.00 | 77.82 | 71.82 |
|
|
| ID3 | 71.00 | 71.73 | 66.64 | 69.79 | 78.73 | 65.82 | 72.64 | 66.91 | 71.03 | 70.50 |
| C4.5 | 67.73 | 75.91 | 68.09 | 70.58 | 76.73 | 65.82 | 69.91 | 68.91 | 70.34 | 70.44 |
| NB | 73.00 | 77.73 | 63.36 | 71.36 | 89.45 | 67.27 | 66.18 | 65.36 | 72.07 | 71.76 |
10-fold crossvalidation accuracy of second phase for actors in Spanish applying EDA-FSS.
| Female | Male | Total | |||||||||||
| F1 | F2 | F3 | F4 | F5 | mean | M1 | M2 | M3 | M4 | M5 | mean | ||
| IB | 72.73 | 72.73 | 80.91 | 76.36 | 64.55 |
| 58.18 | 72.73 | 76.36 | 70.00 | 62.73 |
|
|
| ID3 | 67.27 | 75.45 | 73.64 | 72.73 | 68.18 | 71.45 | 51.82 | 63.64 | 76.36 | 69.09 | 59.09 | 64.00 | 67.72 |
| C4.5 | 70.91 | 75.45 | 74.55 | 64.55 | 66.36 | 70.36 | 54.55 | 63.64 | 80.91 | 66.36 | 56.36 | 64.36 | 67.35 |
| NB | 75.45 | 73.64 | 68.18 | 67.27 | 64.55 | 69.82 | 50.00 | 60.00 | 76.36 | 68.18 | 58.18 | 62.54 | 66.18 |
10-fold crossvalidation accuracy of third phase for actors in Basque.
| Female | Male | Total | ||||||||
| F1 | F2 | F3 | mean | M1 | M2 | M3 | M4 | mean | ||
| IB | 36.00 | 46.82 | 33.82 | 38.88 | 59.45 | 44.36 | 40.45 | 36.55 | 45.20 | 42.49 |
| ID3 | 49.55 | 47.64 | 39.91 |
| 61.00 | 49.27 | 53.36 | 50.36 |
|
|
| C4.5 | 50.73 | 47.36 | 35.82 | 44.64 | 63.91 | 35.09 | 48.18 | 38.64 | 46.46 | 45.68 |
| NB | 43.73 | 40.91 | 40.91 | 41.85 | 58.36 | 37.09 | 46.64 | 40.82 | 45.73 | 44.07 |
Figure 3Results for the Spanish Language without Feature Subset Selection.
Performance comparison between four Machine Learning paradigms (IB: Instance Based, ID3: Decision Tree, C4.5: Decision Tree, NB: Naive-Bayes) without any kind of FSS. Mean accuracy obtained in the three phases, for the Spanish language, is shown.
Figure 4Results for the Basque Language using EDA Feature Subset Selection.
Performance comparison between four Machine Learning paradigms (IB: Instance Based, ID3: Decision Tree, C4.5: Decision Tree, NB: Naive-Bayes) using EDA-FSS. Mean accuracy obtained in the three phases, for the Basque language, is shown. Results obtained with a standard FSS-Forward approach are also shown.
10-fold crossvalidation accuracy of third phase for actors in Basque applying EDA-FSS.
| Female | Male | Total | ||||||||
| F1 | F2 | F3 | mean | M1 | M2 | M3 | M4 | mean | ||
| IB | 75.36 | 82.55 | 73.73 |
| 90.45 | 84.27 | 76.27 | 77.73 |
|
|
| ID3 | 68.09 | 75.64 | 71.64 | 71.79 | 78.82 | 69.55 | 73.73 | 69.73 | 72.96 | 72.46 |
| C4.5 | 69.82 | 77.73 | 68.09 | 71.88 | 78.64 | 64.91 | 66.91 | 71.45 | 70.48 | 71.04 |
| NB | 74.82 | 82.55 | 67.27 | 74.88 | 91.27 | 78.73 | 67.91 | 74.73 | 78.16 | 76.75 |
The most relevant features using the IB paradigm with EDA for Basque.
| Feature class | Female | Male |
| FundamentalFrequency | Position of themaximum, minimum andits position, mean, varianceand mean square error ofthe regression coefficient. | Mean, variance, maximum positive slope in contour, mean square error of the regression coefficient. |
| Mean of the derivative and mean square error of the regression coefficient of the derivative. | ||
| Energy | Maximum, mean, varianceand regression coefficient. | Maximum, minimum, mean, variance, mean square error of the regression coefficient. |
| RMS maximum and mean. | RMS maximum and mean. | |
| Loudness. | Loudness | |
| Voiced/Unvoiced | F0 value of the first andlast voiced frames andlength of the longestunvoiced region. | Ratio of number of voiced and unvoiced frames and number of frames. |
| Relations | Mean of the pitch meansin every regions andduration from beginningto pitch maximum. | Mean of the pitch means in every regions. |
| Ratio of the energy maximum. | ||
| Formants | Mean of the second andthird formant frequency,the bandwidths of the firstand second formants andmean of the second formant ratio. | Mean of the first, second and third formant frequency and the bandwidths of the first and second formants |
| Critical Bands | Energy in bands(0–1300 Hz),(0 - F0 Hz) and(2500–3500 Hz). | Energy in band (1300–2600 Hz). Energy in band (2500–3500 Hz) of whole the utterance divided by the energy over all frequencies |
| Rate of the energy of thelongest region and energyover all the utterance. | Rate of energy in longest region and energy over all the utterance. | |
| Harmonicity tonoise ratio | Range. | Range. |
| Jitter | Cycle-to-cyclevariation of pitch. | |
| Shimmer | Cycle-to-cyclevariation of energy. | |
| Active level | Maximum and mean. | Maximum and mean. |
The most relevant features using the IB paradigm with EDA for Spanish.
| Feature class | Female | Male |
| Fundamental Frequency | Minimum, mean, varianceand regression coefficientand its mean square error. | Maximum, minimum, mean, variance and mean square error of the regression coefficient. |
| Maximum, mean and meansquare error of theregression coefficientof the derivative | Mean of the derivative. | |
| Energy | Maximum, minimum,mean, variance andregression coefficientand its mean square error. | Maximum and mean. |
| RMS maximum,minimum and mean. | RMS value, maximum, mean. | |
| Loudness. | Loudness. | |
| Voiced/Unvoiced | F0 value of the first andlast voiced frames andlength of the longest unvoicedregion, ratio of numberof voiced frames andnumber of frames. | F0 value of the first voiced frame, number of unvoiced frames, length of the longest unvoiced region, ratio of unvoiced regions. |
| Relations | Mean and variance of the pitch means in every regions. | Mean, variance, variance of the maximum, mean of the pitch ranges and mean of the flatness of the pitch based on every voiced region pitch values. |
| Global energy mean among voiced regions | ||
| Rhythm | Duration of silence andmaximum voiced parts. | Duration of silence parts. |
| Formants | Mean of the first, secondand third formant frequencyand the bandwidths of thesecond and third formants. | Mean of the first formant frequency and the bandwidths of the first, second and third formants. |
| Critical Bands | Energy in bands (0–1300 Hz)and (2600–4000 Hz).Energy in bands (0–1000 Hz),(2500–3500 Hz)and of whole theutterance divided bythe energy over allfrequencies. | Energy in bands (0–1300 Hz) and (2600–4000 Hz). Energy in band (4000–5000 Hz) of whole the utterance divided by the energy over all frequencies. |
| Rate of the energyof the longestregion and energyover all the utterance. | Rate of the energy of the longest region and energy over all the utterance. | |
| Harmonicity tonoise ratio | Minimum | |
| Shimmer | Perturbation cycle to cycle of the energy. | |
| Active level | Maximum, minimum,mean and variance. | Maximum, mean and variance. |
10-fold crossvalidation accuracy of third phase for actors in Spanish.
| Female | Male | Total | |||||||||||
| F1 | F2 | F3 | F4 | F5 | mean | M1 | M2 | M3 | M4 | M5 | mean | ||
| IB | 32.73 | 36.36 | 48.18 | 45.45 | 40.00 | 40.54 | 28.18 | 40.91 | 47.27 | 37.27 | 31.82 | 37.09 | 38.82 |
| ID3 | 35.45 | 50.00 | 55.45 | 41.92 | 50.91 | 46.75 | 30.00 | 49.09 | 55.45 | 47.27 | 39.09 |
|
|
| C4.5 | 44.55 | 51.82 | 57.27 | 49.09 | 45.45 |
| 25.45 | 44.55 | 46.36 | 45.45 | 34.55 | 39.27 | 44.46 |
| NB | 30.91 | 38.18 | 44.55 | 32.73 | 40.91 | 37.46 | 20.91 | 37.27 | 46.36 | 40.91 | 26.36 | 34.36 | 35.91 |
Confusion Matrix of the M1 Basque actor.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
| Sadness |
| 0 | 0 | 0 | 0 | 0 | 2 |
| Fear | 1 |
| 0 | 0 | 0 | 0 | 0 |
| Joy | 0 | 0 |
| 1 | 0 | 0 | 0 |
| Anger | 0 | 0 | 2 |
| 0 | 1 | 1 |
| Surprise | 0 | 0 | 0 | 2 |
| 0 | 0 |
| Disgust | 0 | 0 | 0 | 0 | 0 |
| 0 |
| Neutral | 0 | 0 | 0 | 0 | 0 | 0 |
|
Confusion Matrix of the F3 Spanish actor.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
| Sadness |
| 0 | 0 | 0 | 0 | 0 | 1 |
| Fear | 0 |
| 2 | 0 | 1 | 1 | 0 |
| Joy | 0 | 0 |
| 2 | 2 | 0 | 2 |
| Anger | 0 | 0 | 1 |
| 0 | 0 | 0 |
| Surprise | 0 | 0 | 2 | 1 |
| 1 | 0 |
| Disgust | 1 | 0 | 1 | 0 | 0 |
| 3 |
| Neutral | 0 | 0 | 0 | 0 | 0 | 0 |
|