Literature DB >> 16112549

Framewise phoneme classification with bidirectional LSTM and other neural network architectures.

Alex Graves1, Jürgen Schmidhuber.   

Abstract

In this paper, we present bidirectional Long Short Term Memory (LSTM) networks, and a modified, full gradient version of the LSTM learning algorithm. We evaluate Bidirectional LSTM (BLSTM) and several other network architectures on the benchmark task of framewise phoneme classification, using the TIMIT database. Our main findings are that bidirectional networks outperform unidirectional ones, and Long Short Term Memory (LSTM) is much faster and also more accurate than both standard Recurrent Neural Nets (RNNs) and time-windowed Multilayer Perceptrons (MLPs). Our results support the view that contextual information is crucial to speech processing, and suggest that BLSTM is an effective architecture with which to exploit it.

Mesh:

Year:  2005        PMID: 16112549     DOI: 10.1016/j.neunet.2005.06.042

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  161 in total

1.  NetGO 2.0: improving large-scale protein function prediction with massive sequence, text, domain, family and network information.

Authors:  Shuwei Yao; Ronghui You; Shaojun Wang; Yi Xiong; Xiaodi Huang; Shanfeng Zhu
Journal:  Nucleic Acids Res       Date:  2021-07-02       Impact factor: 16.971

2.  Deep Chronnectome Learning via Full Bidirectional Long Short-Term Memory Networks for MCI Diagnosis.

Authors:  Weizheng Yan; Han Zhang; Jing Sui; Dinggang Shen
Journal:  Med Image Comput Comput Assist Interv       Date:  2018-09-13

3.  deepBioWSD: effective deep neural word sense disambiguation of biomedical text data.

Authors:  Ahmad Pesaranghader; Stan Matwin; Marina Sokolova; Ali Pesaranghader
Journal:  J Am Med Inform Assoc       Date:  2019-05-01       Impact factor: 4.497

4.  Adverse Drug Events Detection in Clinical Notes by Jointly Modeling Entities and Relations Using Neural Networks.

Authors:  Bharath Dandala; Venkata Joopudi; Murthy Devarakonda
Journal:  Drug Saf       Date:  2019-01       Impact factor: 5.606

5.  Automated detection of arrhythmia from electrocardiogram signal based on new convolutional encoded features with bidirectional long short-term memory network classifier.

Authors:  Saroj Kumar Pandey; Rekh Ram Janghel
Journal:  Phys Eng Sci Med       Date:  2021-01-06

6.  FactorNet: A deep learning framework for predicting cell type specific transcription factor binding from nucleotide-resolution sequential data.

Authors:  Daniel Quang; Xiaohui Xie
Journal:  Methods       Date:  2019-03-26       Impact factor: 3.608

7.  Multi-stream LSTM-HMM decoding and histogram equalization for noise robust keyword spotting.

Authors:  Martin Wöllmer; Erik Marchi; Stefano Squartini; Björn Schuller
Journal:  Cogn Neurodyn       Date:  2011-08-09       Impact factor: 5.082

8.  Snuba: Automating Weak Supervision to Label Training Data.

Authors:  Paroma Varma; Christopher Ré
Journal:  Proceedings VLDB Endowment       Date:  2018-11

9.  Context-Aware Emotion Recognition in the Wild Using Spatio-Temporal and Temporal-Pyramid Models.

Authors:  Nhu-Tai Do; Soo-Hyung Kim; Hyung-Jeong Yang; Guee-Sang Lee; Soonja Yeom
Journal:  Sensors (Basel)       Date:  2021-03-27       Impact factor: 3.576

10.  Speaker-Independent Silent Speech Recognition from Flesh-Point Articulatory Movements Using an LSTM Neural Network.

Authors:  Myungjong Kim; Beiming Cao; Ted Mau; Jun Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2017-11-23
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.