End-to-end face parsing via interlinked convolutional neural networks.

Zi Yin, Valentin Yiu, Xiaolin Hu, Liang Tang.

Abstract

Face parsing is an important computer vision task that requires accurate pixel-level segmentation of facial parts (such as the eyes, nose, and mouth), providing a basis for further face analysis, modification, and other applications. The Interlinked Convolutional Neural Network (iCNN) has proven to be an effective two-stage model for face parsing. However, the two stages of the original iCNN were trained separately, which limited its performance. To solve this problem, we introduce a simple, end-to-end face parsing framework, STN-aided iCNN (STN-iCNN), which extends the iCNN by adding a Spatial Transformer Network (STN) between the two isolated stages. The STN provides a trainable connection between the stages of the original iCNN pipeline, making end-to-end joint training possible. As a by-product, the STN also provides more precise cropped parts than the original cropper. Owing to these two advantages, our approach significantly improves the accuracy of the original model. Our model achieved competitive performance on the Helen dataset, the standard face parsing dataset. It also achieved superior performance on the CelebAMask-HQ dataset, demonstrating good generalization. Our code has been released at https://github.com/aod321/STN-iCNN. © Springer Nature B.V. 2020.
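
The abstract's central idea, a crop that can be trained jointly with the networks around it, rests on the sampling mechanism that spatial transformers generally use: an affine grid followed by bilinear interpolation. The following is a minimal pure-Python sketch of that mechanism only, not the authors' implementation (which is in the repository linked above). The function names `bilinear_sample` and `affine_crop`, the toy 4x4 image, and the coordinate convention (normalized coordinates in [-1, 1] mapped to pixel centers) are assumptions made for illustration.

```python
import math

def bilinear_sample(img, x, y):
    """Sample a 2-D list `img` at real-valued pixel coordinates (x, y).
    Pixels outside the image contribute zero."""
    h, w = len(img), len(img[0])
    x0, y0 = math.floor(x), math.floor(y)
    val = 0.0
    for yi, wy in ((y0, 1 - (y - y0)), (y0 + 1, y - y0)):
        for xi, wx in ((x0, 1 - (x - x0)), (x0 + 1, x - x0)):
            if 0 <= xi < w and 0 <= yi < h:
                val += wx * wy * img[yi][xi]
    return val

def affine_crop(img, theta, out_h, out_w):
    """Crop `img` with a 2x3 affine matrix `theta` (hypothetical helper).
    `theta` maps normalized output coordinates in [-1, 1] to normalized
    input coordinates; the result is resampled bilinearly."""
    h, w = len(img), len(img[0])
    out = []
    for i in range(out_h):
        v = -1 + 2 * i / (out_h - 1) if out_h > 1 else 0.0
        row = []
        for j in range(out_w):
            u = -1 + 2 * j / (out_w - 1) if out_w > 1 else 0.0
            xs = theta[0][0] * u + theta[0][1] * v + theta[0][2]
            ys = theta[1][0] * u + theta[1][1] * v + theta[1][2]
            # Convert normalized coordinates to pixel coordinates.
            px = (xs + 1) * (w - 1) / 2
            py = (ys + 1) * (h - 1) / 2
            row.append(bilinear_sample(img, px, py))
        out.append(row)
    return out

# Toy image whose intensity equals the column index (a horizontal ramp).
ramp = [[float(x) for x in range(4)] for _ in range(4)]

# Two half-size crops that differ only by a small horizontal shift.
crop_a = affine_crop(ramp, [[0.5, 0.0, 0.0], [0.0, 0.5, 0.0]], 2, 2)
crop_b = affine_crop(ramp, [[0.5, 0.0, 0.1], [0.0, 0.5, 0.0]], 2, 2)
```

Because every output pixel is a weighted average of its neighbours, a small change in `theta` produces a proportionally small change in the crop, so gradients can flow back to whatever predicts `theta`. A crop that rounds its box to integer pixel indices is piecewise constant in its parameters and offers no such gradient, which is one way to read the abstract's point about the original two stages being isolated.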

Keywords:  End-to-end; Face parsing; STN-iCNN

Year:  2020        PMID: 33786087      PMCID: PMC7947053          DOI: 10.1007/s11571-020-09615-4

Source DB:  PubMed          Journal:  Cogn Neurodyn        ISSN: 1871-4080            Impact factor:   5.082


  6 in total

1.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex.

Authors:  D H Hubel; T N Wiesel
Journal:  J Physiol       Date:  1962-01       Impact factor: 5.182

2.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.

Authors:  Liang-Chieh Chen; George Papandreou; Iasonas Kokkinos; Kevin Murphy; Alan L Yuille
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2017-04-27       Impact factor: 6.226

3.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.

Authors:  Vijay Badrinarayanan; Alex Kendall; Roberto Cipolla
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2017-01-02       Impact factor: 6.226

4.  EEG classification of driver mental states by deep learning.

Authors:  Hong Zeng; Chen Yang; Guojun Dai; Feiwei Qin; Jianhai Zhang; Wanzeng Kong
Journal:  Cogn Neurodyn       Date:  2018-07-18       Impact factor: 5.082

5.  Banknote recognition: investigating processing and cognition framework using competitive neural network.

Authors:  Oyebade K Oyedotun; Adnan Khashman
Journal:  Cogn Neurodyn       Date:  2016-08-22       Impact factor: 5.082

6.  Detecting prostate cancer using deep learning convolution neural network with transfer learning approach.

Authors:  Adeel Ahmed Abbasi; Lal Hussain; Imtiaz Ahmed Awan; Imran Abbasi; Abdul Majid; Malik Sajjad Ahmed Nadeem; Quratul-Ain Chaudhary
Journal:  Cogn Neurodyn       Date:  2020-04-11       Impact factor: 5.082

  2 in total

1.  Trilateral Attention Network for Real-Time Cardiac Region Segmentation.

Authors:  Ghada Zamzmi; Sivaramakrishnan Rajaraman; Vandana Sachdev; Sameer Antani
Journal:  IEEE Access       Date:  2021-08-24       Impact factor: 3.367

2.  Real-time echocardiography image analysis and quantification of cardiac indices.

Authors:  Ghada Zamzmi; Sivaramakrishnan Rajaraman; Li-Yueh Hsu; Vandana Sachdev; Sameer Antani
Journal:  Med Image Anal       Date:  2022-06-09       Impact factor: 13.828

