Literature DB >> 23967953

Using automatic alignment to analyze endangered language data: testing the viability of untrained alignment.

Christian DiCanio1, Hosung Nam, Douglas H Whalen, H Timothy Bunnell, Jonathan D Amith, Rey Castillo García.   

Abstract

While efforts to document endangered languages have steadily increased, the phonetic analysis of endangered language data remains a challenge. The transcription of large documentation corpora is, by itself, a tremendous feat. Yet, the process of segmentation remains a bottleneck for research with data of this kind. This paper examines whether a speech processing tool, forced alignment, can facilitate the segmentation task for small data sets, even when the target language differs from the training language. The authors also examined whether a phone set with contextualization outperforms a more general one. The accuracy of two forced aligners trained on English (hmalign and p2fa) was assessed using corpus data from Yoloxóchitl Mixtec. Overall, agreement performance was relatively good, with accuracy at 70.9% within 30 ms for hmalign and 65.7% within 30 ms for p2fa. Segmental and tonal categories influenced accuracy as well. For instance, additional stop allophones in hmalign's phone set aided alignment accuracy. Agreement differences between aligners also corresponded closely with the types of data on which the aligners were trained. Overall, using existing alignment systems was found to have potential for making phonetic analysis of small corpora more efficient, with more allophonic phone sets providing better agreement than general ones.

Mesh:

Year:  2013        PMID: 23967953      PMCID: PMC5392066          DOI: 10.1121/1.4816491

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  5 in total

1.  Perception of coarticulatory nasalization by speakers of English and Thai: evidence for partial compensation.

Authors:  P S Beddor; R A Krakow
Journal:  J Acoust Soc Am       Date:  1999-11       Impact factor: 1.840

2.  On the role of perception in shaping phonological assimilation rules.

Authors:  S L Hura; B Lindblom; R L Diehl
Journal:  Lang Speech       Date:  1992 Jan-Jun       Impact factor: 1.500

3.  Information for Mandarin tones in the amplitude contour and in brief segments.

Authors:  D H Whalen; Y Xu
Journal:  Phonetica       Date:  1992       Impact factor: 1.759

4.  Detection of target phonemes in spontaneous and read speech.

Authors:  G Mehta; A Cutler
Journal:  Lang Speech       Date:  1988 Apr-Jun       Impact factor: 1.500

5.  Speaker-Independent Phoneme Alignment Using Transition-Dependent States.

Authors:  John-Paul Hosom
Journal:  Speech Commun       Date:  2009-04       Impact factor: 2.017

  5 in total
  1 in total

1.  Computational Modelling of Tone Perception Based on Direct Processing of f0 Contours.

Authors:  Yue Chen; Yingming Gao; Yi Xu
Journal:  Brain Sci       Date:  2022-03-02
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.