Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 VOWEL DURATION MEASUREMENT USING DEEP NEURAL NETWORKS.

Literature DB >> 29034132

VOWEL DURATION MEASUREMENT USING DEEP NEURAL NETWORKS.

Yossi Adi¹, Joseph Keshet¹, Matthew Goldrick².

Abstract

Vowel durations are most often utilized in studies addressing specific issues in phonetics. Thus far this has been hampered by a reliance on subjective, labor-intensive manual annotation. Our goal is to build an algorithm for automatic accurate measurement of vowel duration, where the input to the algorithm is a speech segment contains one vowel preceded and followed by consonants (CVC). Our algorithm is based on a deep neural network trained at the frame level on manually annotated data from a phonetic study. Specifically, we try two deep-network architectures: convolutional neural network (CNN), and deep belief network (DBN), and compare their accuracy to an HMM-based forced aligner. Results suggest that CNN is better than DBN, and both CNN and HMM-based forced aligner are comparable in their results, but neither of them yielded the same predictions as models fit to manually annotated data.

Entities: Chemical Disease Species

Keywords: convolution neural networks; deep belief networks; forced alignment; hidden Markov models; vowel duration measurement

Year: 2015 PMID： 29034132 PMCID： PMC5636193 DOI： 10.1109/MLSP.2015.7324331

Source DB: PubMed Journal: IEEE Int Workshop Mach Learn Signal Process

5 in total

1. Erratum to: Grammatical constraints on phonological encoding in speech production.

Authors: Jordana R Heller; Matthew Goldrick
Journal: Psychon Bull Rev Date: 2015-10

2. The effect of phonological neighborhood density on vowel articulation.

Authors: Benjamin Munson; Nancy Pearl Solomon
Journal: J Speech Lang Hear Res Date: 2004-10 Impact factor: 2.297

3. A fast learning algorithm for deep belief nets.

Authors: Geoffrey E Hinton; Simon Osindero; Yee-Whye Teh
Journal: Neural Comput Date: 2006-07 Impact factor: 2.026

4. Random effects structure for confirmatory hypothesis testing: Keep it maximal.

Authors: Dale J Barr; Roger Levy; Christoph Scheepers; Harry J Tily
Journal: J Mem Lang Date: 2013-04 Impact factor: 3.059

5. Grammatical constraints on phonological encoding in speech production.

Authors: Jordana R Heller; Matthew Goldrick
Journal: Psychon Bull Rev Date: 2014-12

5 in total

2 in total

1. Automatic measurement of vowel duration via structured prediction.

Authors: Yossi Adi; Joseph Keshet; Emily Cibelli; Erin Gustafson; Cynthia Clopper; Matthew Goldrick
Journal: J Acoust Soc Am Date: 2016-12 Impact factor: 1.840

2. The influence of lexical selection disruptions on articulation.

Authors: Matthew Goldrick; Rhonda McClain; Emily Cibelli; Yossi Adi; Erin Gustafson; Cornelia Moers; Joseph Keshet
Journal: J Exp Psychol Learn Mem Cogn Date: 2018-07-19 Impact factor: 3.051

2 in total