Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 SEQUENCE SEGMENTATION USING JOINT RNN AND STRUCTURED PREDICTION MODELS.

Literature DB >> 29033692

SEQUENCE SEGMENTATION USING JOINT RNN AND STRUCTURED PREDICTION MODELS.

Yossi Adi¹, Joseph Keshet¹, Emily Cibelli², Matthew Goldrick².

Abstract

We describe and analyze a simple and effective algorithm for sequence segmentation applied to speech processing tasks. We propose a neural architecture that is composed of two modules trained jointly: a recurrent neural network (RNN) module and a structured prediction model. The RNN outputs are considered as feature functions to the structured model. The overall model is trained with a structured loss function which can be designed to the given segmentation task. We demonstrate the effectiveness of our method by applying it to two simple tasks commonly used in phonetic studies: word segmentation and voice onset time segmentation. Results suggest the proposed model is superior to previous methods, obtaining state-of-the-art results on the tested datasets.

Entities: Chemical Disease Gene Species

Keywords: Sequence segmentation; recurrent neural networks (RNNs); structured prediction; voice onset time; word segmentation

Year: 2017 PMID： 29033692 PMCID： PMC5638122 DOI： 10.1109/ICASSP.2017.7952591

Source DB: PubMed Journal: Proc IEEE Int Conf Acoust Speech Signal Process ISSN： 1520-6149

1 in total

1. Automatic measurement of voice onset time using discriminative structured prediction.

Authors: Morgan Sonderegger; Joseph Keshet
Journal: J Acoust Soc Am Date: 2012-12 Impact factor: 1.840

1 in total

3 in total

1. The influence of lexical selection disruptions on articulation.

Authors: Matthew Goldrick; Rhonda McClain; Emily Cibelli; Yossi Adi; Erin Gustafson; Cornelia Moers; Joseph Keshet
Journal: J Exp Psychol Learn Mem Cogn Date: 2018-07-19 Impact factor: 3.051

2. Using automated acoustic analysis to explore the link between planning and articulation in second language speech production.

Authors: Matthew Goldrick; Yosi Shrem; Oriana Kilbourn-Ceron; Cristina Baus; Joseph Keshet
Journal: Lang Cogn Neurosci Date: 2020-08-19 Impact factor: 2.331

3. Enhancement of Local Crowd Location and Count: Multiscale Counting Guided by Head RGB-Mask.

Authors: Guoyin Ren; Xiaoqi Lu; Jingyu Wang; Yuhao Li
Journal: Comput Intell Neurosci Date: 2022-08-24

3 in total