Literature DB >> 34290902

Can Deep Learning Recognize Subtle Human Activities?

Vincent Jacquot1, Zhuofan Ying2, Gabriel Kreiman3.   

Abstract

Deep Learning has driven recent and exciting progress in computer vision, instilling the belief that these algorithms could solve any visual task. Yet, datasets commonly used to train and test computer vision algorithms have pervasive confounding factors. Such biases make it difficult to truly estimate the performance of those algorithms and how well computer vision models can extrapolate outside the distribution in which they were trained. In this work, we propose a new action classification challenge that is performed well by humans, but poorly by state-of-the-art Deep Learning models. As a proof-of-principle, we consider three exemplary tasks: drinking, reading, and sitting. The best accuracies reached using state-of-the-art computer vision models were 61.7%, 62.8%, and 76.8%, respectively, while human participants scored above 90% accuracy on the three tasks. We propose a rigorous method to reduce confounds when creating datasets, and when comparing human versus computer vision performance. Source code and datasets are publicly available.

Entities:  

Year:  2020        PMID: 34290902      PMCID: PMC8291217     

Source DB:  PubMed          Journal:  Conf Comput Vis Pattern Recognit Workshops        ISSN: 2160-7508


  5 in total

1.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.

Authors:  Vijay Badrinarayanan; Alex Kendall; Roberto Cipolla
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2017-01-02       Impact factor: 6.226

2.  Focal Loss for Dense Object Detection.

Authors:  Tsung-Yi Lin; Priya Goyal; Ross Girshick; Kaiming He; Piotr Dollar
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2018-07-23       Impact factor: 6.226

3.  OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields.

Authors:  Zhe Cao; Gines Hidalgo Martinez; Tomas Simon; Shih-En Wei; Yaser A Sheikh
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2019-07-17       Impact factor: 6.226

4.  Mask R-CNN.

Authors:  Kaiming He; Georgia Gkioxari; Piotr Dollar; Ross Girshick
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2018-06-05       Impact factor: 6.226

5.  Recurrent computations for visual pattern completion.

Authors:  Hanlin Tang; Martin Schrimpf; William Lotter; Charlotte Moerman; Ana Paredes; Josue Ortega Caro; Walter Hardesty; David Cox; Gabriel Kreiman
Journal:  Proc Natl Acad Sci U S A       Date:  2018-08-13       Impact factor: 11.205

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.