Literature DB >> 12002871

Toward a model for lexical access based on acoustic landmarks and distinctive features.

Kenneth N Stevens1.   

Abstract

This article describes a model in which the acoustic speech signal is processed to yield a discrete representation of the speech stream in terms of a sequence of segments, each of which is described by a set (or bundle) of binary distinctive features. These distinctive features specify the phonemic contrasts that are used in the language, such that a change in the value of a feature can potentially generate a new word. This model is a part of a more general model that derives a word sequence from this feature representation, the words being represented in a lexicon by sequences of feature bundles. The processing of the signal proceeds in three steps: (1) Detection of peaks, valleys, and discontinuities in particular frequency ranges of the signal leads to identification of acoustic landmarks. The type of landmark provides evidence for a subset of distinctive features called articulator-free features (e.g., [vowel], [consonant], [continuant]). (2) Acoustic parameters are derived from the signal near the landmarks to provide evidence for the actions of particular articulators, and acoustic cues are extracted by sampling selected attributes of these parameters in these regions. The selection of cues that are extracted depends on the type of landmark and on the environment in which it occurs. (3) The cues obtained in step (2) are combined, taking context into account, to provide estimates of "articulator-bound" features associated with each landmark (e.g., [lips], [high], [nasal]). These articulator-bound features, combined with the articulator-free features in (1), constitute the sequence of feature bundles that forms the output of the model. Examples of cues that are used, and justification for this selection, are given, as well as examples of the process of inferring the underlying features for a segment when there is variability in the signal due to enhancement gestures (recruited by a speaker to make a contrast more salient) or due to overlap of gestures from neighboring segments.

Mesh:

Year:  2002        PMID: 12002871     DOI: 10.1121/1.1458026

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  92 in total

Review 1.  Computational neuroanatomy of speech production.

Authors:  Gregory Hickok
Journal:  Nat Rev Neurosci       Date:  2012-01-05       Impact factor: 34.870

2.  The effects of selective consonant amplification on sentence recognition in noise by hearing-impaired listeners.

Authors:  Rithika Saripella; Philipos C Loizou; Linda Thibodeau; Jennifer A Alford
Journal:  J Acoust Soc Am       Date:  2011-11       Impact factor: 1.840

3.  Combined spectral and temporal enhancement to improve cochlear-implant speech perception.

Authors:  Aparajita Bhattacharya; Andrew Vandali; Fan-Gang Zeng
Journal:  J Acoust Soc Am       Date:  2011-11       Impact factor: 1.840

Review 4.  Perceptuo-motor interactions in the perceptual organization of speech: evidence from the verbal transformation effect.

Authors:  Anahita Basirat; Jean-Luc Schwartz; Marc Sato
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2012-04-05       Impact factor: 6.237

5.  Effects of introducing low-frequency harmonics in the perception of vocoded telephone speech.

Authors:  Yi Hu; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2010-09       Impact factor: 1.840

6.  Masking release and the contribution of obstruent consonants on speech recognition in noise by cochlear implant users.

Authors:  Ning Li; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2010-09       Impact factor: 1.840

7.  The functional neuroanatomy of language.

Authors:  Gregory Hickok
Journal:  Phys Life Rev       Date:  2009-09       Impact factor: 11.025

8.  Variability of articulator positions and formants across nine English vowels.

Authors:  D H Whalen; Wei-Rong Chen; Mark K Tiede; Hosung Nam
Journal:  J Phon       Date:  2018-02-23

9.  Retrieving Tract Variables From Acoustics: A Comparison of Different Machine Learning Strategies.

Authors:  Vikramjit Mitra; Hosung Nam; Carol Y Espy-Wilson; Elliot Saltzman; Louis Goldstein
Journal:  IEEE J Sel Top Signal Process       Date:  2010-09-13       Impact factor: 6.856

10.  Effect of initial-consonant intensity on the speed of lexical decisions.

Authors:  Daniel Fogerty; Allen A Montgomery; Kimberlee A Crass
Journal:  Atten Percept Psychophys       Date:  2014-04       Impact factor: 2.199

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.