Literature DB >> 33320810

A Review on Deep Learning Techniques for Video Prediction.

Sergiu Oprea, Pablo Martinez-Gonzalez, Alberto Garcia-Garcia, John Alejandro Castro-Vargas, Sergio Orts-Escolano, Jose Garcia-Rodriguez, Antonis Argyros.   

Abstract

The ability to predict, anticipate and reason about future outcomes is a key component of intelligent decision-making systems. In light of the success of deep learning in computer vision, deep-learning-based video prediction emerged as a promising research direction. Defined as a self-supervised learning task, video prediction represents a suitable framework for representation learning, as it demonstrated potential capabilities for extracting meaningful representations of the underlying patterns in natural videos. Motivated by the increasing interest in this task, we provide a review on the deep learning methods for prediction in video sequences. We first define the video prediction fundamentals, as well as mandatory background concepts and the most used datasets. Next, we carefully analyze existing video prediction models organized according to a proposed taxonomy, highlighting their contributions and their significance in the field. The summary of the datasets and methods is accompanied with experimental results that facilitate the assessment of the state of the art on a quantitative basis. The paper is summarized by drawing some general conclusions, identifying open research challenges and by pointing out future research directions.

Entities:  

Mesh:

Year:  2022        PMID: 33320810     DOI: 10.1109/TPAMI.2020.3045007

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  6 in total

1.  Cross-institutional outcome prediction for head and neck cancer patients using self-attention neural networks.

Authors:  William Trung Le; Eugene Vorontsov; Francisco Perdigón Romero; Lotfi Seddik; Mohamed Mortada Elsharief; Phuc Felix Nguyen-Tan; David Roberge; Houda Bahig; Samuel Kadoury
Journal:  Sci Rep       Date:  2022-02-24       Impact factor: 4.379

2.  A texture-aware U-Net for identifying incomplete blinking from eye videography.

Authors:  Qinxiang Zheng; Xin Zhang; Juan Zhang; Furong Bai; Shenghai Huang; Jiantao Pu; Wei Chen; Lei Wang
Journal:  Biomed Signal Process Control       Date:  2022-03-16       Impact factor: 5.076

3.  Use of semantic segmentation for mapping Sargassum on beaches.

Authors:  Javier Arellano-Verdejo; Martin Santos-Romero; Hugo E Lazcano-Hernandez
Journal:  PeerJ       Date:  2022-06-09       Impact factor: 3.061

4.  Traffic-Data Recovery Using Geometric-Algebra-Based Generative Adversarial Network.

Authors:  Di Zang; Yongjie Ding; Xiaoke Qu; Chenglin Miao; Xihao Chen; Junqi Zhang; Keshuang Tang
Journal:  Sensors (Basel)       Date:  2022-04-02       Impact factor: 3.576

5.  Cropland encroachment detection via dual attention and multi-loss based building extraction in remote sensing images.

Authors:  Junshu Wang; Mingrui Cai; Yifan Gu; Zhen Liu; Xiaoxin Li; Yuxing Han
Journal:  Front Plant Sci       Date:  2022-09-06       Impact factor: 6.627

6.  The state prediction method of the silk dryer based on the GA-BP model.

Authors:  Hao Jiang; Zegang Yu; Yonghua Wang; Baowei Zhang; Jiuxiang Song; Jingdian Wei
Journal:  Sci Rep       Date:  2022-08-26       Impact factor: 4.996

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.