| Literature DB >> 28367052 |
Catherine T Best1, Louis M Goldstein2, Hosung Nam3, Michael D Tyler4.
Abstract
To become language users, infants must embrace the integrality of speech perception and production. That they do so, and quite rapidly, is implied by the native-language attunement they achieve in each domain by 6-12 months. Yet research has most often addressed one or the other domain, rarely how they interrelate. Moreover, mainstream assumptions that perception relies on acoustic patterns whereas production involves motor patterns entail that the infant would have to translate incommensurable information to grasp the perception-production relationship. We posit the more parsimonious view that both domains depend on commensurate articulatory information. Our proposed framework combines principles of the Perceptual Assimilation Model (PAM) and Articulatory Phonology (AP). According to PAM, infants attune to articulatory information in native speech and detect similarities of nonnative phones to native articulatory patterns. The AP premise that gestures of the speech organs are the basic elements of phonology offers articulatory similarity metrics while satisfying the requirement that phonological information be discrete and contrastive: (a) distinct articulatory organs produce vocal tract constrictions and (b) phonological contrasts recruit different articulators and/or constrictions of a given articulator that differ in degree or location. Various lines of research suggest young children perceive articulatory information, which guides their productions: discrimination of between- versus within-organ contrasts, simulations of attunement to language-specific articulatory distributions, multimodal speech perception, oral/vocal imitation, and perceptual effects of articulator activation or suppression. We conclude that articulatory gesture information serves as the foundation for developmental integrality of speech perception and production.Entities:
Year: 2016 PMID: 28367052 PMCID: PMC5351798 DOI: 10.1080/10407413.2016.1230372
Source DB: PubMed Journal: Ecol Psychol ISSN: 1040-7413
Figure 1. Schematic diagram of the Perceptual Assimilation Model (PAM; Best, 1995; Best & Tyler, 2007), illustrating an adult's native language phonological space, in which the conical “islands” represent native consonant categories that have been delineated and sharpened by experience with perceiving and producing native speech, and the major predicted patterns of perceptual assimilation of nonnative consonant contrasts to the native phonological system. Pairs of black circles represent nonnative consonant contrasts, with the various predicted contrast assimilation patterns indicated by arrows and labels.
Figure 2. Schematics of the three dimensions of articulatory gestures as proposed for the revised Perceptual Assimilation Model with Articulatory Organ Hypothesis (PAM-AOH): (A) articulatory geometry (modeled after Browman & Goldstein, 1989, 1992); (B) articulatory organ hierarchy (active articulators and their nested nodes), which is an unfolded, straightened version of A; (C) articulatory actions (constriction degrees) represented along a straightened side view of the vocal tract's ventral (lower surface: active articulators) and dorsal (upper surface: passive articulators/locations) surfaces.
Summary of prior findings on infants' perception of native and nonnative consonant contrasts at 6–8 versus 10–12 months of age, as interpreted in terms of the epigenetic trajectories posited for infant speech perception development (Aslin & Pisoni, 1980) and suborganized according to Articulatory Organ Hypothesis predictions for between-organ [unshaded] versus within-organ [light shading] contrasts (AOH; Best & McRoberts, 2003; L. M. Goldstein, 2003; L. M. Goldstein & Fowler, 2003; Studdert-Kennedy & Goldstein, 2003) and our newly introduced privative contrasts [medium shading] (+/– gesture of a given articulator).
| Consonant contrast | Stimuli/Infant language | AOH contrast | Organ gesture distinction | References |
|---|---|---|---|---|
| English /b/-/d/ | Native/English | Between-organ | Lips vs. tongue tip closure | Best et al. ( |
| English /b/-/g/ | Native/English | Between-organ | Lips vs. tongue dorsum closure | Moffitt ( |
| English /s/-/ʃ/ | Native/English | Between-organ | Tongue tip vs. dorsum critical | Eilers & Minifie ( |
| English /f/-/θ/ | Native/English | Between-organ | Lip vs. tongue tip critical | Holmberg et al. ( |
| Zulu /ǀ/-/ǁ/ | Nonnative/English | Between-organ | Tongue tip vs. dorsum closure | |
| Tigrinya /p′/-/t′/ | Nonnative English | Between-organ | Lips vs. tongue tip closure | Best & McRoberts ( |
| !Xóõ /ʘx/-/ǀx/ | Nonnative English | Between-organ | Lips vs. tongue tip closure | Tyler et al. ( |
| Nuu-Chah-Nulth /x/-/χ/ | Nonnative English | Between-organ | Tongue dorsum vs. root critical | Tyler et al. ( |
| Nuu-Chah-Nulth /χ/-/ħ/ | Nonnative English | Between-organ | Tongue root vs. aryepiglottis critical | Tyler et al. ( |
| English /b/-/v/ | Native English | Within-organ | Lip: bilabial closure vs. dental critical | Polka et al. ( |
| English /tʃ/-/ʃ/ | Native English | Within-organ | Tongue tip closure vs. critical | Tsao et al. ( |
| English /s/-/θ/ | Native English | Within-organ | Tongue tip critical alveolar vs. dental | Tyler et al. ( |
| English /b/-/m/ | Native English | Privative | +/– velum-lowering gesture | Eimas & Miller ( |
| English /b/-/p/ | Native English | Privative | +/– glottal-opening gesture phased with release of lip closure | Eilers et al. ( |
| Spanish /b/-/p/ | Native Spanish | Privative | +/– glottal-opening gesture phased with lip closure | Eilers et al. ( |
| Native*/Kikuyu | Streeter ( | |||
| English /r/-/l/ | Native English | Within-organ | Tongue tip closure vs. narrow + root narrow pharyngeal vs. uvular | Kuhl et al. ( |
| English /d/-/ð/ | Native English | Within-organ | Tongue tip alveolar closure vs. interdental critical | Polka et al. ( |
| English /s/-/z/ voiced vs. voiceless | Native English | Privative | +/– glottal-opening gesture | Best & McRoberts ( |
| Hindi /t̪/-/ʈ/ | Nonnative English | Within-organ | Tongue tip closure dental vs. retroflex | Anderson et al. ( |
| Czech /z/-/r̝/ (ř) | Nonnative English | Within-organ | Tongue tip alveolar critical vs. (loose) closure | Trehub ( |
| Nthlakampx /k′/-/q′/ | Nonnative English | Within-organ | Tongue dorsum closure velar vs. uvular | Best et al. ( |
| Zulu /k/-/k′/ | Nonnative English | Within-organ | Glottal closure vs. opening gestures | Best & McRoberts ( |
| Mandarin /tɕ/-/ɕ/ | Nonnative English | Within-organ | Tongue dorsum closure vs. critical | Tsao et al. ( |
| English /r/-/l/ | Nonnative/Japanese | Within-organ | Tongue tip closure vs. narrow + root narrow pharyngeal vs. uvular | Kuhl et al. ( |
| Zulu /b/-/ɓ/ | Nonnative English | Privative | +/– larynx-lowering gesture | Best & McRoberts ( |
| Zulu /ɬ/-/ɮ/ | Nonnative English | Privative | +/– glottal opening | Best & McRoberts ( |
| English /tʃ/-/ʃ/ | Nonnative/Mandarin | Privative | Tongue tip closure vs. critical | Tsao et al. ( |
| English /d/-/ð/ | Nonnative/French | Within-organ | Tongue tip alveolar closure vs. interdental critical | Polka et al. ( |
| Spanish /b/-/p/ | Nonnative/English | Privative | +/– glottal-opening gesture phased with lip closure | Eilers et al. ( |
Figure 3. (A) Frequency distribution of tongue tip (TT) constriction location along hard palate (normalized to 0–1), as measured by electromagnetic articulometry (EMA), for all coronal stops in a natural spoken English passage of ∼1,000 words produced by an adult female native speaker (L. M. Goldstein et al., 2008); (B) corresponding distribution for all coronal stops in a natural spoken Hindi passage of ∼6,000 words produced by an adult female native speaker (L. M. Goldstein et al., 2008).
Figure 4. Simulation of “blank slate” infant (lower left) attunement to English “parent” input (top center) based on articulatory data from a native English speaker, which shows a unimodal frequency distribution of alveolar Tongue Tip (TT) constriction locations along the hard palate (normalized 0–1). The time series (lower portion of diagram) indicates successive 2,500-iteration steps in the 10,000-iteration simulation.
Figure 5. Simulation of “blank slate” infant (lower left) attunement to Hindi “parent” input (top center) based on articulatory data from a native Hindi speaker, which shows a bimodal frequency distribution of dental versus retroflex Tongue Tip (TT) constriction locations along the hard palate (normalized 0–1). The time series (lower portion of diagram) indicates successive 2,500-iteration steps in the 10,000-iteration simulation.
Figure 6. Simulation of an adult second language (L2) learner of Hindi showing native-language (L1) ingle ategory (SC) assimilation (lower left) and L2 attunement (time series) to input from an idealized Hindi “teacher” (top center). The L2 learner is an idealized speaker of English with a well-established unimodal distribution of English coronal stops centered at alveolar position, which does not line up with either Hindi mode. The time series shows the same simulation steps as in Figure 5.
Figure 7. Simulation of a different adult L2 Hindi learner showing initial L1 ategory oodness difference (CG) assimilation (lower left) and L2 attunement (time series) to input from the same idealized Hindi “teacher” (top center) and simulation steps as in Figure 6. The second language (L2) learner is an idealized native language (L1) speaker of Spanish with a well-established unimodal distribution of Spanish coronal stops centered at dental position, which does line up with one of the two Hindi modes (dental).