| Literature DB >> 30042706 |
Yee-Hui Oh1, John See2, Anh Cat Le Ngo3, Raphael C-W Phan1,4, Vishnu M Baskaran5.
Abstract
Over the last few years, automatic facial micro-expression analysis has garnered increasing attention from experts across different disciplines because of its potential applications in various fields such as clinical diagnosis, forensic investigation and security systems. Advances in computer algorithms and video acquisition technology have rendered machine analysis of facial micro-expressions possible today, in contrast to decades ago when it was primarily the domain of psychiatrists where analysis was largely manual. Indeed, although the study of facial micro-expressions is a well-established field in psychology, it is still relatively new from the computational perspective with many interesting problems. In this survey, we present a comprehensive review of state-of-the-art databases and methods for micro-expressions spotting and recognition. Individual stages involved in the automation of these tasks are also described and reviewed at length. In addition, we also deliberate on the challenges and future directions in this growing field of automatic facial micro-expression analysis.Entities:
Keywords: databases; expressions; facial micro-expressions; recognition; spontaneous; spotting; subtle emotions; survey
Year: 2018 PMID: 30042706 PMCID: PMC6049018 DOI: 10.3389/fpsyg.2018.01128
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Micro-expression databases.
| USF-HD | – | 100 | 30 | P | No | 6 | Macro/micro | – | |
| Polikovsky's | 10 | 42 | 200 | P | No | 6 | Micro | – | |
| YorkDDT | 9 | 18 | 25 | S | No | 2 | Micro | – | |
| Silesian deception | 101 | 101 | 100 | S | No | – | Macro/micro | Eye closures, gaze aversion, micro-tensions | |
| SMIC-sub | 6 | 77 | 100 | S | No | 3 | Micro | – | |
| SMIC | HS | 16 | 164 | 100 | S | No | 3 | Micro | – |
| VIS | 8 | 71 | 25 | S | No | 3 | |||
| NIR | 8 | 71 | 25 | S | No | 3 | |||
| E-HS | 16 | 157 | 100 | S | No | 3 | Micro | Onset,offset | |
| E-VIS | 8 | 71 | 25 | S | No | 3 | |||
| E-NIR | 8 | 71 | 25 | S | No | 3 | |||
| CASME | 19 | 195 | 60 | S | Yes | 7 | Micro | Onset,offset,apex | |
| CASME II | 26 | 247 | 200 | S | Yes | 5 | Micro | Onset,offset,apex | |
| CAS(ME)2 | Part A | 22 | 87 | 30 | S | Yes | 4 | Macro/Micro | Onset,offset,apex |
| Part B | 22 | 57 | 30 | S | Yes | 4 | |||
| SAMM | 32 | 159 | 200 | S | Yes | 7 | Macro/micro | Onset,offset,apex | |
| MEVIEW | 16 | 31 | 25 | S | Yes | 5 | macro/micro | onset,offset |
P/S, Posed/Spontaneous.
Not all samples contain micro-expressions and only a total of 183 occurrences of “micro-tensions” were annotated. No emotion classes were available.
Seven objective classes are also provided (Davison et al., .
Set of emotions are atypical (contempt, surprise, fear, anger, happy), likely in the context of environment. Some sample clips involve person speaking, or only have AUs marked with no emotions observed.
Figure 1Sample frames from a “Surprise” sequence (Subject 1) in SMIC. Images reproduced from the database with permission from Li et al. (2013).
Figure 2Sample frames from a “Happiness” sequence (Subject 6) in CASME II. Images reproduced from the database with permission from Yan et al. (2014a).
Figure 3Sample frames from a “Disgust” sequence (Subject 15) in CAS(ME)2. Images reproduced from the database (©Xiaolan Fu) with permission from Qu et al. (2017).
Figure 4Sample frames from a sequence (Subject 6) in SAMM that contains micro-movements. Images reproduced from the database with permission from Davison et al. (2016a).
Figure 5Sample frames from a “Contempt” sequence in MEVIEW that contains micro-movements marked with AU L12. Images reproduced from the database (Husak et al., 2017) under Fair Use.
Figure 6A video sequence depicting the order in which onset, apex and offset frames occur. Sample frames are from a “Happiness” sequence (Subject 2) in CASME II. Images reproduced from the database with permission from Yan et al. (2014a).
A survey of pre-processing techniques applied in facial micro-expression spotting.
| Polikovsky et al., | Manual | – | – | – | 12 ROIs |
| Shreve et al., | – | – | – | – | 3 ROIs |
| Wu et al., | – | – | – | – | Whole face |
| Shreve et al., | – | – | Face alignment | Eyes, nose | 8 ROIs |
| and mouth | |||||
| Polikovsky and Kameda, | Manual | APF | – | – | 12 ROIs |
| Shreve et al., | SCMS | – | – | Eyes and mouth | 4 Parts |
| Moilanen et al., | Manual | KLT | Face alignment | – | 6 × 6 blocks |
| Davison et al., | Face++ | – | Affine transform | – | 5 × 5 blocks |
| Patel et al., | DRMF | OF | – | – | 49 ROIs |
| Liong et al., | DRMF | – | – | 3 ROIs | |
| Wang et al., | DRMF | – | Non-reflective similarity transformation | – | 6 × 6 blocks |
| Liong et al., | DRMF | – | – | Eyes | 3 ROIs |
| Xia et al., | ASM | – | Procrutes analysis | – | Whole face |
| Liong et al., | DRMF | – | – | – | 3 ROIs |
| Davison et al., | Face++ | – | Affine transform | – | 4 × 4, 5 × 5 blocks |
| Davison et al., | Face++ | – | 2D-DFT and Piecewise affine warping | Binary masking | 26 ROIs |
| Yan and Chen, | CLM | – | – | – | 16 ROIs |
| Li et al., | Manual | KLT | – | – | 6 × 6 blocks |
| Ma et al., | CLNF from OpenFace | KLT | – | – | 5 ROIs |
| Qu et al., | ASM | – | LWM | – | Various block sizes |
| Duque et al., | AAM | KLT | – | – | 5 ROIs |
Facial micro-expression (or micro-movement) spotting works in literature.
| Polikovsky et al., | 3D gradient histogram | – | k mean cluster | High-speed ME database (not available) | |
| Shreve et al., | Optical strain | – | M | Threshold technique | USF |
| Wu et al., | Gabor features | – | M | GentleSVM | METT (48 videos) |
| Shreve et al., | Optical strain | – | Threshold technique | USF-HD | |
| M | Canal-9 (not available) | ||||
| Found videos (not available) | |||||
| Polikovsky and Kameda, | 3D gradient histogram | – | k mean cluster | High-speed ME database (not available) | |
| Shreve et al., | Optical strain | – | M | Threshold technique | USF |
| SMIC | |||||
| Moilanen et al., | LBP | ✓ | Threshold technique | CASME-A | |
| M | CASME-B | ||||
| SMIC-VIS-E | |||||
| Davison et al., | HOG | ✓ | M | Threshold technique | SAMM |
| Patel et al., | Spatio-temporal integration of OF vectors | – | M | Threshold technique | SMIC-VIS-E |
| Liong et al., | LBP correlation | – | Binary search | CASME II | |
| CLM | A | ||||
| Optical strain | |||||
| Wang et al., | MDMD | ✓ | M | Threshold technique | CAS(ME)2 |
| Xia et al., | Geometrical motion | – | M | Random walk model | CASME |
| deformation | SMIC | ||||
| Liong et al., | LBP correlation | – | A | Binary search | CASME II |
| Liong et al., | LBP correlation | – | A | Binary search | CASME II |
| Optical strain | |||||
| Davison et al., | HOG | ✓ | M | Threshold technique | SAMM |
| Davison et al., | 3D HOG | ✓ | Threshold technique | SAMM | |
| LBP | M | CASME II | |||
| OF | |||||
| Li et al., | HOOF | ✓ | Threshold technique | CASME II | |
| LBP | M | SMIC-E-HS | |||
| SMIC-E-VIS | |||||
| SMIC-E-NIR | |||||
| Yan and Chen, | LBP correlation | – | Peak detection | CASME II | |
| CLM | A | ||||
| HOOF | |||||
| Ma et al., | RHOOF | – | A | Threshold technique | CASME |
| CASME II | |||||
| Qu et al., | LBP | ✓ | M | Threshold technique | CAS(ME)2 |
| Duque et al., | Riesz Pyramid | ✓ | M | Threshold technique | SMIC-E-HS |
| CASME II |
Benchmarking facial micro-expression recognition works in literature.
| Li et al., | – | LBP-TOP | SVM | – | 48.78 | – | – |
| Liong et al., | – | OSF + OS and weighted LBP-TOP | SVM | – | 52.44 | – | – |
| Liong et al., | – | OS | SVM | – | 53.56 | – | – |
| Liong et al., | – | OS weighted LBP-TOP | SVM | 42.00 | 53.66 | 0.38 | 0.54 |
| Le Ngo et al., | – | STM | Adaboost | 43.78 | 44.34 | 0.3337 | 0.4731 |
| Wang et al., | – | LBP-MOP | SVM | 44.13 | 50.61 | – | – |
| Xu et al., | – | Facial Dynamics Map | SVM | 45.93 | 54.88 | 0.4053 | 0.538 |
| Oh et al., | – | Monogenic + LBP-TOP | SVM | – | – | 0.41 | 0.44 |
| Oh et al., | – | Riesz wavelet + LBP-TOP | SVM | – | – | 0.43 | – |
| Liong et al., | ROIs | LBP-TOP | SVM | 46.00 | 54.00 | 0.32 | 0.52 |
| Wang et al., | – | LBP-SIP | SVM | 46.56 | 44.51 | 0.448 | 0.4492 |
| Le Ngo et al., | A-EMM | LBP-TOP | SVM | – | – | 0.51 | – |
| Le Ngo et al., | DMDSP | LBP-TOP | SVM | 49.00 | 58.00 | 0.51 | 0.60 |
| Park et al., | Adaptive MM | LBP-TOP | SVM | 51.91 | – | – | – |
| Happy and Routray, | – | HFOFO | SVM | 56.64 | 51.83 | 0.5248 | 0.5243 |
| Liong et al., | – | Bi-WOOF | SVM | – | – | 0.56 | 0.53 |
| Huang et al., | – | STCLQP | SVM | 58.39 | 64.02 | 0.5836 | 0.6381 |
| Huang et al., | – | STLBP-IP | SVM | 59.51 | 57.93 | 0.57* | 0.58* |
| Liong et al., | – | Bi-WOOF (apex frame) | SVM | – | – | 0.61 | 0.62 |
| He et al., | – | MMFL | SVM | 59.81 | 63.15 | – | – |
| Kim et al., | – | CNN + LSTM | Softmax | 60.98 | – | – | – |
| Liong and Wong, | – | Bi-WOOF + Phase | SVM | 62.55 | 68.29 | 0.65 | 0.67 |
| Zheng et al., | – | LBP-TOP | RK-SVD | 63.25 | – | – | |
| Zong et al., | – | Hierarchical STLBP-IP | KGSL | 63.83 | 60.78 | 0.6110 | 0.6126 |
| Huang and Zhao, | TIM | STRBP | SVM | 64.37 | 60.98 | – | – |
| Huang et al., | – | Discriminative STLBP-IP | SVM | 64.78 | 63.41 | – | – |
| Allaert et al., | – | OF Maps | SVM | 65.35 | – | – | – |
| Li et al., | TIM+EVM | HIGO | SVM | 67.21 | 68.29 | – | – |
| Zheng, | – | 2DSGR | SRC | – | 71.19 | – | – |
| Liu et al., | – | MDMO | SVM | 67.37 | 80.00 | – | – |
| Davison et al., | – | HOOF | SVM | 76.60 | – | 0.55 | – |
| Wang et al., | TIM | LBP-TOP on TICS | SVM | 62.30 | – | – | – |
| Yan et al., | – | LBP-TOP | SVM | 63.41 | – | – | – |
| Wang et al., | TIM | DLSTD | SVM | 63.41 | 68.29 | – | – |
| Happy and Routray, | – | HFOFO | SVM | 64.06 | 56.1 | 0.6025 | 0.5536 |
| Liong et al., | – | OS weighted LBP-TOP | SVM | 65.59 | – | – | – |
| Wang et al., | – | LBP-MOP | SVM | 66.8 | 60.98 | – | – |
| Wang et al., | – | LBP-SIP | SVM | 67.21 | – | – | – |
| Ping et al., | LBP-TOP | GSLSR | 67.89 | 70.12 | – | – | |
| Park et al., | Adaptive MM | LBP-TOP | SVM | 69.63 | – | – | – |
| Wang et al., | EVM | LBP-TOP | SVM | 75.3 | – | – | – |
| Li et al., | TIM+EVM | HIGO | SVM | 78.14 | 75 | – | – |
| Zhang et al., | – | LBP-TOP and HOOF | RF | 62.5 | – | -– | – |
| | |||||||
| Jia et al., | – | SVD+ LBP/LBP-TOP | KNN | 65.5 | – | – | – |
| | |||||||
| Peng et al., | – | DTSCNN | SVM | 66.67 | – | – | – |
| | |||||||
| Adegun and Vadapalli, | – | LBP-TOP | ELM | 96.12 | – | – | – |
| | |||||||
Not all the samples in the dataset were used in the experiments.
Different number of emotion classes were used in the experiments.
Combined CASME I/II database was used.
Not reported in paper, but computed from confusion table provided.