Literature DB >> 19045664

Modeling the temporal dynamics of distinctive feature landmark detectors for speech recognition.

Aren Jansen1, Partha Niyogi.   

Abstract

This paper elaborates on a computational model for speech recognition that is inspired by several interrelated strands of research in phonology, acoustic phonetics, speech perception, and neuroscience. The goals are twofold: (i) to explore frameworks for recognition that may provide a viable alternative to the current hidden Markov model (HMM) based speech recognition systems and (ii) to provide a computational platform that will facilitate engaging, quantifying, and testing various theories in the scientific traditions in phonetics, psychology, and neuroscience. This motivation leads to an approach that constructs a hierarchically structured point process representation based on distinctive feature landmark detectors and probabilistically integrates the firing patterns of these detectors to decode a phonological sequence. The accuracy of a broad class recognizer based on this framework is competitive with equivalent HMM-based systems. Various avenues for future development of the presented methodology are outlined.

Mesh:

Year:  2008        PMID: 19045664     DOI: 10.1121/1.2956472

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  1 in total

1.  Closure duration analysis of incomplete stop consonants due to stop-stop interaction.

Authors:  Prasanta Kumar Ghosh; Shrikanth S Narayanan
Journal:  J Acoust Soc Am       Date:  2009-07       Impact factor: 1.840

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.