Literature DB >> 30524157

Learning Facial Action Units with Spatiotemporal Cues and Multi-label Sampling.

Wen-Sheng Chu1, Fernando De la Torre1, Jeffrey F Cohn2.   

Abstract

Facial action units (AUs) may be represented spatially, temporally, and in terms of their correlation. Previous research focuses on one or another of these aspects or addresses them disjointly. We propose a hybrid network architecture that jointly models spatial and temporal representations and their correlation. In particular, we use a Convolutional Neural Network (CNN) to learn spatial representations, and a Long Short-Term Memory (LSTM) to model temporal dependencies among them. The outputs of CNNs and LSTMs are aggregated into a fusion network to produce per-frame prediction of multiple AUs. The hybrid network was compared to previous state-of-the-art approaches in two large FACS-coded video databases, GFT and BP4D, with over 400,000 AU-coded frames of spontaneous facial behavior in varied social contexts. Relative to standard multi-label CNN and feature-based state-of-the-art approaches, the hybrid system reduced person-specific biases and obtained increased accuracy for AU detection. To address class imbalance within and between batches during training the network, we introduce multi-labeling sampling strategies that further increase accuracy when AUs are relatively sparse. Finally, we provide visualization of the learned AU models, which, to the best of our best knowledge, reveal for the first time how machines see AUs.

Entities:  

Keywords:  00-01; 99-00; Multi-label learning; deep learning; facial action unit detection; multi-label sampling; spatio-temporal learning; video analysis

Year:  2018        PMID: 30524157      PMCID: PMC6277040          DOI: 10.1016/j.imavis.2018.10.002

Source DB:  PubMed          Journal:  Image Vis Comput        ISSN: 0262-8856            Impact factor:   2.818


  14 in total

1.  Meta-Analysis of the First Facial Expression Recognition Challenge.

Authors:  M F Valstar; M Mehu; M Pantic; K Scherer
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  2012-06-20

2.  Automatic Analysis of Facial Affect: A Survey of Registration, Representation, and Recognition.

Authors:  Evangelos Sariyanidi; Hatice Gunes; Andrea Cavallaro
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2015-06       Impact factor: 6.226

3.  Entropy and distance of random graphs with application to structural pattern recognition.

Authors:  A K Wong; M You
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  1985-05       Impact factor: 6.226

4.  Facial Action Unit Event Detection by Cascade of Tasks.

Authors:  Xiaoyu Ding; Wen-Sheng Chu; Fernando De la Torre; Jeffery F Cohn; Qiao Wang
Journal:  Proc IEEE Int Conf Comput Vis       Date:  2013

5.  A Model of the Perception of Facial Expressions of Emotion by Humans: Research Overview and Perspectives.

Authors:  Aleix Martinez; Shichuan Du
Journal:  J Mach Learn Res       Date:  2012-05-01       Impact factor: 3.654

6.  Context-Sensitive Dynamic Ordinal Regression for Intensity Estimation of Facial Action Units.

Authors:  Ognjen Rudovic; Vladimir Pavlovic; Maja Pantic
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2015-05       Impact factor: 6.226

7.  Dynamic Cascades with Bidirectional Bootstrapping for Action Unit Detection in Spontaneous Facial Behavior.

Authors:  Yunfeng Zhu; Fernando De la Torre; Jeffrey F Cohn; Yu-Jin Zhang
Journal:  IEEE Trans Affect Comput       Date:  2011 Apr-Jun       Impact factor: 10.506

8.  Facial action unit recognition by exploiting their dynamic and semantic relationships.

Authors:  Yan Tong; Wenhui Liao; Qiang Ji
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2007-10       Impact factor: 6.226

9.  Joint Patch and Multi-label Learning for Facial Action Unit Detection.

Authors:  Kaili Zhao; Wen-Sheng Chu; Fernando De la Torre; Jeffrey F Cohn; Honggang Zhang
Journal:  Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit       Date:  2015-06

Review 10.  Compound facial expressions of emotion: from basic research to clinical applications.

Authors:  Shichuan Du; Aleix M Martinez
Journal:  Dialogues Clin Neurosci       Date:  2015-12       Impact factor: 5.986

View more
  1 in total

1.  Unmasking the Devil in the Details: What Works for Deep Facial Action Coding?

Authors:  Koichiro Niinuma; Laszlo A Jeni; Itir Onal Ertugrul; Jeffrey F Cohn
Journal:  BMVC       Date:  2019-09
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.