LSTM: A Search Space Odyssey.

Klaus Greff, Rupesh K Srivastava, Jan Koutnik, Bas R Steunebrink, Jurgen Schmidhuber.   

Abstract

Several variants of the long short-term memory (LSTM) architecture for recurrent neural networks have been proposed since its inception in 1995. In recent years, these networks have become the state-of-the-art models for a variety of machine learning problems. This has led to a renewed interest in understanding the role and utility of various computational components of typical LSTM variants. In this paper, we present the first large-scale analysis of eight LSTM variants on three representative tasks: speech recognition, handwriting recognition, and polyphonic music modeling. The hyperparameters of all LSTM variants for each task were optimized separately using random search, and their importance was assessed using the powerful functional ANalysis Of VAriance framework. In total, we summarize the results of 5400 experimental runs (≈15 years of CPU time), which makes our study the largest of its kind on LSTM networks. Our results show that none of the variants can improve upon the standard LSTM architecture significantly, and demonstrate the forget gate and the output activation function to be its most critical components. We further observe that the studied hyperparameters are virtually independent and derive guidelines for their efficient adjustment.
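
To make the two components singled out in the abstract concrete, the following is a minimal NumPy sketch of a single LSTM time step. It assumes a common textbook formulation without peephole connections, which may differ from the exact architecture studied in the paper. The function name `lstm_step`, the weight layout, and the two boolean switches are hypothetical and serve only to show where the forget gate and the output activation function enter the computation.

```python
import numpy as np


def sigmoid(x):
    """Logistic function used for the gates."""
    return 1.0 / (1.0 + np.exp(-x))


def lstm_step(x, h_prev, c_prev, W, R, b,
              use_forget_gate=True, use_output_activation=True):
    """One LSTM time step (illustrative sketch, no peephole connections).

    W: (4*n, d) input weights, R: (4*n, n) recurrent weights, b: (4*n,) biases,
    stacked in the order: block input, input gate, forget gate, output gate.
    The two boolean flags are hypothetical switches for ablating a component.
    """
    n = h_prev.shape[0]
    pre = W @ x + R @ h_prev + b            # all pre-activations at once
    z = np.tanh(pre[0:n])                   # block input
    i = sigmoid(pre[n:2 * n])               # input gate
    # Without a forget gate the cell simply accumulates (f fixed to 1).
    f = sigmoid(pre[2 * n:3 * n]) if use_forget_gate else np.ones(n)
    o = sigmoid(pre[3 * n:4 * n])           # output gate
    c = f * c_prev + i * z                  # new cell state
    # Without an output activation the raw cell state is passed to the gate.
    h = o * (np.tanh(c) if use_output_activation else c)
    return h, c
```

With the output activation disabled, the hidden state is no longer squashed into (-1, 1) and can grow with the cell state; with the forget gate disabled, the cell state can only accumulate. This is one plausible intuition for why removing either component matters, not a result taken from the paper.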

Year:  2016        PMID: 27411231     DOI: 10.1109/TNNLS.2016.2582924

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw Learn Syst        ISSN: 2162-237X            Impact factor:   10.451


  165 in total

1.  Segmenting and classifying activities in robot-assisted surgery with recurrent neural networks.

Authors:  Robert DiPietro; Narges Ahmidi; Anand Malpani; Madeleine Waldram; Gyusung I Lee; Mija R Lee; S Swaroop Vedula; Gregory D Hager
Journal:  Int J Comput Assist Radiol Surg       Date:  2019-04-29       Impact factor: 2.924

2.  The Discriminative Kalman Filter for Bayesian Filtering with Nonlinear and Nongaussian Observation Models.

Authors:  Michael C Burkhart; David M Brandman; Brian Franco; Leigh R Hochberg; Matthew T Harrison
Journal:  Neural Comput       Date:  2020-03-18       Impact factor: 2.026

3.  Classifying symmetrical differences and temporal change for the detection of malignant masses in mammography using deep neural networks.

Authors:  Thijs Kooi; Nico Karssemeijer
Journal:  J Med Imaging (Bellingham)       Date:  2017-10-10

4.  Unsupervised classification of multi-omics data during cardiac remodeling using deep learning.

Authors:  Neo Christopher Chung; Bilal Mirza; Howard Choi; Jie Wang; Ding Wang; Peipei Ping; Wei Wang
Journal:  Methods       Date:  2019-03-07       Impact factor: 3.608

5.  Deep recurrent neural network reveals a hierarchy of process memory during dynamic natural vision.

Authors:  Junxing Shi; Haiguang Wen; Yizhen Zhang; Kuan Han; Zhongming Liu
Journal:  Hum Brain Mapp       Date:  2018-02-12       Impact factor: 5.038

6.  A semi-holographic hyperdimensional representation system for hardware-friendly cognitive computing.

Authors:  A Serb; I Kobyzev; J Wang; T Prodromakis
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2019-12-23       Impact factor: 4.226

7.  Temporal convolutional networks allow early prediction of events in critical care.

Authors:  Finneas J R Catling; Anthony H Wolff
Journal:  J Am Med Inform Assoc       Date:  2020-03-01       Impact factor: 4.497

8.  Learning Inter-Sentence, Disorder-Centric, Biomedical Relationships from Medical Literature.

Authors:  Anton H van der Vegt; Guido Zuccon; Bevan Koopman
Journal:  AMIA Annu Symp Proc       Date:  2020-03-04

9.  De-identification of medical records using conditional random fields and long short-term memory networks.

Authors:  Zhipeng Jiang; Chao Zhao; Bin He; Yi Guan; Jingchi Jiang
Journal:  J Biomed Inform       Date:  2017-10-13       Impact factor: 6.317

10.  Automated Item Generation with Recurrent Neural Networks.

Authors:  Matthias von Davier
Journal:  Psychometrika       Date:  2018-03-12       Impact factor: 2.500
