Literature DB >> 27608449

Long-Term Recurrent Convolutional Networks for Visual Recognition and Description.

Jeff Donahue1, Lisa Anne Hendricks1, Marcus Rohrbach1, Subhashini Venugopalan2, Sergio Guadarrama1, Kate Saenko3, Trevor Darrell1.   

Abstract

Models based on deep convolutional networks have dominated recent image interpretation tasks; we investigate whether models which are also recurrent are effective for tasks involving sequences, visual and otherwise. We describe a class of recurrent convolutional architectures which is end-to-end trainable and suitable for large-scale visual understanding tasks, and demonstrate the value of these models for activity recognition, image captioning, and video description. In contrast to previous models which assume a fixed visual representation or perform simple temporal averaging for sequential processing, recurrent convolutional models are "doubly deep" in that they learn compositional representations in space and time. Learning long-term dependencies is possible when nonlinearities are incorporated into the network state updates. Differentiable recurrent models are appealing in that they can directly map variable-length inputs (e.g., videos) to variable-length outputs (e.g., natural language text) and can model complex temporal dynamics; yet they can be optimized with backpropagation. Our recurrent sequence models are directly connected to modern visual convolutional network models and can be jointly trained to learn temporal dynamics and convolutional perceptual representations. Our results show that such models have distinct advantages over state-of-the-art models for recognition or generation which are separately defined or optimized.

Year:  2016        PMID: 27608449     DOI: 10.1109/TPAMI.2016.2599174

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  62 in total

1.  Minimal videos: Trade-off between spatial and temporal information in human and machine vision.

Authors:  Guy Ben-Yosef; Gabriel Kreiman; Shimon Ullman
Journal:  Cognition       Date:  2020-04-20

2.  Machine Learning Techniques for Personalized Detection of Epileptic Events in Clinical Video Recordings.

Authors:  Matthew Pediaditis; Anca-Nicoleta Ciubotaru; Thomas Brunschwiler; Peter Hilfiker; Thomas Grunwald; Marcellina Ha Berlin; Lukas Imbach; Carl Muroi; Christian Stra Ssle; Emanuela Keller; Maria Gabrani
Journal:  AMIA Annu Symp Proc       Date:  2021-01-25

3.  Deep Learning Predicts Lung Cancer Treatment Response from Serial Medical Imaging.

Authors:  Yiwen Xu; Ahmed Hosny; Roman Zeleznik; Chintan Parmar; Thibaud Coroller; Idalid Franco; Raymond H Mak; Hugo J W L Aerts
Journal:  Clin Cancer Res       Date:  2019-04-22       Impact factor: 12.531

4.  A Heterogeneous Architecture for the Vision Processing Unit with a Hybrid Deep Neural Network Accelerator.

Authors:  Peng Liu; Zikai Yang; Lin Kang; Jian Wang
Journal:  Micromachines (Basel)       Date:  2022-02-07       Impact factor: 2.891

5.  A Survey on Multi-View Clustering.

Authors:  Guoqing Chao; Shiliang Sun; Jinbo Bi
Journal:  IEEE Trans Artif Intell       Date:  2021-04-05

6.  Prediction of Non-small Cell Lung Cancer Histology by a Deep Ensemble of Convolutional and Bidirectional Recurrent Neural Network.

Authors:  Dipanjan Moitra; Rakesh Kumar Mandal
Journal:  J Digit Imaging       Date:  2020-08       Impact factor: 4.056

7.  REDN: A Recursive Encoder-Decoder Network for Edge Detection.

Authors:  Truc LE; Y E Duan
Journal:  IEEE Access       Date:  2020-05-12       Impact factor: 3.367

8.  Estimation of pharmacokinetic parameters from DCE-MRI by extracting long and short time-dependent features using an LSTM network.

Authors:  Jiaren Zou; James M Balter; Yue Cao
Journal:  Med Phys       Date:  2020-06-03       Impact factor: 4.071

Review 9.  Machine learning for sperm selection.

Authors:  Jae Bem You; Christopher McCallum; Yihe Wang; Jason Riordon; Reza Nosrati; David Sinton
Journal:  Nat Rev Urol       Date:  2021-05-17       Impact factor: 14.432

10.  A Deep Learning-Based Method for Forecasting Gold Price with Respect to Pandemics.

Authors:  Mahtab Mohtasham Khani; Sahand Vahidnia; Alireza Abbasi
Journal:  SN Comput Sci       Date:  2021-06-12
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.