| Literature DB >> 31613902 |
E Morales-Vargas1, C A Reyes-García1, Hayde Peregrina-Barreto1.
Abstract
Facial expression recognition is related to the automatic identification of affective states of a subject by computational means. Facial expression recognition is used for many applications, such as security, human-computer interaction, driver safety, and health care. Although many works aim to tackle the problem of facial expression recognition, and the discriminative power may be acceptable, current solutions have limited explicative power, which is insufficient for certain applications, such as facial rehabilitation. Our aim is to alleviate the current limited explicative power by exploiting explainable fuzzy models over sequences of frontal face images. The proposed model uses appearance features to describe facial expressions in terms of facial movements, giving a detailed explanation of what movements are in the face, and why the model is making a decision. The model architecture was selected to keep the semantic meaning of the found facial movements. The proposed model can discriminate between the seven basic facial expressions, obtaining an average accuracy of 90.8±14%, with a maximum value of 92.9±28%.Entities:
Mesh:
Year: 2019 PMID: 31613902 PMCID: PMC6793860 DOI: 10.1371/journal.pone.0223563
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Action Units and their corresponding movements.
| AU | Movement | AU | Movement |
|---|---|---|---|
| 0 | Neutral state | 14 | Dimpler |
| 1 | Inner brow raiser | 15 | Lip corner depressor |
| 2 | Outer brow raiser | 16 | Lower lip depressor |
| 4 | Brow lowerer | 17 | Chin raiser |
| 5 | Upper lid raiser | 18 | Lip puckerer |
| 6 | Cheek raiser | 20 | Lip stretcher |
| 7 | Lid tightener | 22 | Lip funneler |
| 9 | Nose wrinkler | 23 | Lip tightener |
| 10 | Upper lip raiser | 24 | Lips pressor |
| 11 | Nasolabial deepener | 25 | Lips parted |
| 12 | Lip corner puller | 26 | Jaw drop |
| 13 | Cheek puffer | 27 | Mouth stretch |
Fig 1Proposed method for facial expression recognition using fuzzy rules over facial movements.
Fig 2Neuro-Fuzzy representation of the proposed model for facial expression recognition.
A) Full model overview and B) overview of the neuron E1 in the full model.
Frequency for each facial expression from the CK+ dataset.
| Emotion | N |
|---|---|
| Angry (En) | 45 |
| Disgust (Dis) | 59 |
| Contempt (Con) | 18 |
| Happiness (Hap) | 69 |
| Fear (Fe) | 25 |
| Sadness (Sad) | 28 |
| Surprise (Sur) | 83 |
Label example of a sample in the CK+ dataset.
Each sequence in the dataset contains a similar one.
| Subject | Sequence | AU labels | Emotion label |
|---|---|---|---|
| 01 | 04 | {[4, 0], [7, 5], [17, 4], [23, 4], [24, 4]} | Angry |
Emotion description in terms of Action Units.
| Emotion | Action Units |
|---|---|
| Angry (An) | AU23 + AU24 must be present |
| Disgust (Dis) | AU9 o AU10 must be present |
| Contempt (Con) | AU14 must be present, unilateral or bilateral |
| Happiness (Hap) | AU12 must be present |
| Fear (Fe) | AU1+2+4 present, unless AU5 is E, then AU4 can be absent |
| Sadness (Sad) | AU 1 + 4 + 15, with exception of AU6 + 15 |
| Surprise (Sur) | AU1+2 or 5 must be present, and AU5 should not be stronger than B |
Frequency for each facial expression from the Radboud dataset.
| Emotion | N |
|---|---|
| Angry (En) | 201 |
| Disgust (Dis) | 201 |
| Contempt (Con) | 201 |
| Happiness (Hap) | 201 |
| Fear (Fe) | 201 |
| Sadness (Sad) | 201 |
| Surprise (Sur) | 201 |
Confusion matrix of facial expression recognition for γ = 0.4, γ = 0.2.
| 0.54 | 0.06 | 0.00 | 0.04 | 0.02 | 0.00 | ||
| 0.01 | 0.00 | 0.01 | 0.07 | 0.014 | 0.02 | ||
| 0.02 | 0.04 | 0.01 | 0.00 | 0.01 | 0.00 | ||
| 0.00 | 0.01 | 0.03 | 0.06 | 0.07 | 0.01 | ||
| 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | ||
| 0.06 | 0.05 | 0.00 | 0.06 | 0.00 | 0.00 | ||
| 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.01 |
Accuracy obtained with the proposed method.
| Data | Method | Mean Accuracy |
|---|---|---|
| CK+ | AAM | 0.91±0.03 |
| CK+ | Deva | 0.76±0.04 |
| RaFD | Deva | 0.81±0.02 |
| RaFD | OpenFace | 0.45±0.03 |
Fig 3Synthetic generation of a neutral facial expression.
A) All samples without PA, B) aligned landmarks after PA, and C) generated model through a mean operation.
Comparison of the accuracy obtained in a cross-database process.
| Method | Train | Target | Accuracy |
|---|---|---|---|
| [ | 6 databases | RaFD | 0.85±0.04 |
| [ | JAFFE | RaFD | 0.52±N/R |
| [ | TFEID | RaFD | 0.55±N/R |
| FROAUFER | CK+ | RaFD | 0.69±0.01 |
*CK+, MMI, RaFD, KDEF, BU3DFE, ARFace
Comparison with related works that use dynamic approaches.
| Work | Data | Method | Interpretable | FACS | Mean Accuracy |
|---|---|---|---|---|---|
| [ | CK+ | CAPP | no | no | 0.80±N/R |
| [ | CK+ | S+C | no | no | 0.88±N/R |
| [ | CK+ | ORB | no | no | 0.92±N/R |
| [ | JAFFE | MFS | 14 rules | no | 0.87±N/R |
| [ | JAFFE | FRM | 565 rules | no | 0.96±N/R |
| [ | CK+ | RM | no | no | 0.85±N/R |
| [ | JAFFE | FKC & SVMs | no | no | 0.97±N/R |
| Proposed | CK+ | FROAUFERAU0 | yes | yes | 0.70±0.07 |
| Proposed | CK+ | FROAUFER | yes | yes | 0.91±0.03 |
| Proposed | RaFD | FROAUFERAU0 | yes | yes | 0.61±0.05 |
| Proposed | RaFD | FROAUFER | yes | yes | 0.81±0.61 |
Fig 4Fuzzy model for facial expression recognition which can lead to an explanation interface.