Speaker-Independent Phoneme Alignment Using Transition-Dependent States.

Abstract

Determining the location of phonemes is important to a number of speech applications, including training of automatic speech recognition systems, building text-to-speech systems, and research on human speech processing. Agreement of humans on the location of phonemes is, on average, 93.78% within 20 msec on a variety of corpora, and 93.49% within 20 msec on the TIMIT corpus. We describe a baseline forced-alignment system and a proposed system with several modifications to this baseline. Modifications include the addition of energy-based features to the standard cepstral feature set, the use of probabilities of a state transition given an observation, and the computation of probabilities of distinctive phonetic features instead of phoneme-level probabilities. Performance of the baseline system on the test partition of the TIMIT corpus is 91.48% within 20 msec, and performance of the proposed system on this corpus is 93.36% within 20 msec. The proposed system thus achieves a 22% relative reduction in error over the baseline system, and a 14% reduction in error over results from a non-HMM alignment system. This result of 93.36% agreement is the best known reported result on the TIMIT corpus.
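
The agreement figures and the 22% relative reduction in error quoted in the abstract can be reproduced with a short calculation. The following is a minimal sketch, not the authors' evaluation code: the function names are hypothetical, the boundary times are invented toy values, and treating "within 20 msec" as an inclusive threshold is an assumption.

```python
def agreement_within(reference_ms, hypothesis_ms, tolerance_ms=20.0):
    """Percentage of boundaries placed within tolerance_ms of the reference.

    Assumption: a difference of exactly tolerance_ms counts as agreement.
    """
    if len(reference_ms) != len(hypothesis_ms):
        raise ValueError("boundary lists must have the same length")
    if not reference_ms:
        raise ValueError("at least one boundary is required")
    hits = sum(
        abs(ref - hyp) <= tolerance_ms
        for ref, hyp in zip(reference_ms, hypothesis_ms)
    )
    return 100.0 * hits / len(reference_ms)


def relative_error_reduction(baseline_agreement, proposed_agreement):
    """Relative reduction in error (percent), with error = 100 - agreement."""
    baseline_error = 100.0 - baseline_agreement
    proposed_error = 100.0 - proposed_agreement
    return 100.0 * (baseline_error - proposed_error) / baseline_error


# Toy boundary times in msec (invented for illustration, not TIMIT data).
reference = [100.0, 250.0, 400.0, 620.0]
hypothesis = [105.0, 275.0, 395.0, 610.0]
print(agreement_within(reference, hypothesis))  # 3 of 4 within 20 msec -> 75.0

# Figures from the abstract: 91.48% (baseline) and 93.36% (proposed).
reduction = relative_error_reduction(91.48, 93.36)
print(f"{reduction:.1f}% relative reduction in error")  # -> 22.1%
```

With the abstract's figures, the error falls from 8.52% to 6.64%, and 1.88 / 8.52 is approximately 0.22, which matches the reported 22% relative reduction.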

Year: 2009 PMID: 20161342 PMCID: PMC2682710 DOI: 10.1016/j.specom.2008.11.003

Source DB: PubMed Journal: Speech Commun ISSN: 0167-6393 Impact factor: 2.017

1. Age-related differences in identification and discrimination of temporal cues in speech segments.

2. On the role of spectral transition for speech perception.

3. A diagnostic marker for childhood apraxia of speech: the lexical stress ratio.

4. A diagnostic marker for childhood apraxia of speech: the coefficient of variation ratio.

1. Using automatic alignment to analyze endangered language data: testing the viability of untrained alignment.

2. Spoken Language Derived Measures for Detecting Mild Cognitive Impairment.

3. Automatic analysis of slips of the tongue: Insights into the cognitive architecture of speech production.

4. Determining the relevance of different aspects of formant contours to intelligibility.