Literature DB >> 31439969

Computing low-dimensional representations of speech from socio-auditory structures for phonetic analyses.

Andrew R Plummer1, Patrick F Reidy2.   

Abstract

Low-dimensional representations of speech data, such as formant values extracted by linear predictive coding analysis or spectral moments computed from whole spectra viewed as probability distributions, have been instrumental in both phonetic and phonological analyses over the last few decades. In this paper, we present a framework for computing low-dimensional representations of speech data based on two assumptions: that speech data represented in high-dimensional data spaces lie on shapes called manifolds that can be used to map speech data to low-dimensional coordinate spaces, and that manifolds underlying speech data are generated from a combination of language-specific lexical, phonological, and phonetic information as well as culture-specific socio-indexical information that is expressed by talkers of a given speech community. We demonstrate the basic mechanics of the framework by carrying out an analysis of children's productions of sibilant fricatives relative to those of adults in their speech community using the phoneigen package - a publicly available implementation of the framework. We focus the demonstration on enumerating the steps for constructing manifolds from data and then using them to map the data to a low-dimensional space, explicating how manifold structure affects the learned low-dimensional representations, and comparing the use of these representations against standard acoustic features in a phonetic analysis. We conclude with a discussion of the framework's underlying assumptions, its broader modeling potential, and its position relative to recent advances in the field of representation learning.

Entities:  

Keywords:  Laplacian Eigenmaps; low-dimensional representations of speech; manifold alignment; phonetic categories; socio-indexical

Year:  2018        PMID: 31439969      PMCID: PMC6706093     

Source DB:  PubMed          Journal:  J Phon        ISSN: 0095-4470


  22 in total

1.  The uncontrolled manifold concept: identifying control variables for a functional task.

Authors:  J P Scholz; G Schöner
Journal:  Exp Brain Res       Date:  1999-06       Impact factor: 1.972

2.  Nonlinear dimensionality reduction by locally linear embedding.

Authors:  S T Roweis; L K Saul
Journal:  Science       Date:  2000-12-22       Impact factor: 47.728

3.  A global geometric framework for nonlinear dimensionality reduction.

Authors:  J B Tenenbaum; V de Silva; J C Langford
Journal:  Science       Date:  2000-12-22       Impact factor: 47.728

4.  Cognition. The manifold ways of perception.

Authors:  H S Seung; D D Lee
Journal:  Science       Date:  2000-12-22       Impact factor: 47.728

5.  Acoustic characteristics of English fricatives.

Authors:  A Jongman; R Wayland; S Wong
Journal:  J Acoust Soc Am       Date:  2000-09       Impact factor: 1.840

6.  Modeling the development of pronunciation in infant speech acquisition.

Authors:  Ian S Howard; Piers Messum
Journal:  Motor Control       Date:  2011-01       Impact factor: 1.422

7.  Developmental and cross-linguistic variation in the infant vowel space: the case of Canadian English and Canadian French.

Authors:  Susan Rvachew; Karen Mattock; Linda Polka; Lucie Ménard
Journal:  J Acoust Soc Am       Date:  2006-10       Impact factor: 1.840

8.  Acoustic and spectral characteristics of young children's fricative productions: a developmental perspective.

Authors:  Shawn L Nissen; Robert Allen Fox
Journal:  J Acoust Soc Am       Date:  2005-10       Impact factor: 1.840

9.  Some cross-linguistic evidence for modulation of implicational universals by language-specific frequency effects in phonological development.

Authors:  Jan Edwards; Mary E Beckman
Journal:  Lang Learn Dev       Date:  2008-04-01

10.  Methodological questions in studying consonant acquisition.

Authors:  Jan Edwards; Mary E Beckman
Journal:  Clin Linguist Phon       Date:  2008-12       Impact factor: 1.346

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.