| Literature DB >> 31149647 |
Alejandrina Cristia1, Emmanuel Dupoux1, Nan Bernstein Ratner2, Melanie Soderstrom3.
Abstract
Previous computational modeling suggests it is much easier to segment words from child-directed speech (CDS) than adult-directed speech (ADS). However, this conclusion is based on data collected in the laboratory, with CDS from play sessions and ADS between a parent and an experimenter, which may not be representative of ecologically collected CDS and ADS. Fully naturalistic ADS and CDS collected with a nonintrusive recording device as the child went about her day were analyzed with a diverse set of algorithms. The difference between registers was small compared to differences between algorithms; it reduced when corpora were matched, and it even reversed under some conditions. These results highlight the interest of studying learnability using naturalistic corpora and diverse algorithmic definitions.Entities:
Keywords: computational modeling; infant word segmentation; learnability; lexicon; statistical learning
Year: 2019 PMID: 31149647 PMCID: PMC6515859 DOI: 10.1162/opmi_a_00022
Source DB: PubMed Journal: Open Mind (Camb) ISSN: 2470-2986
Characteristics of the ADS and CDS portions of the corpus, depending on whether the human or automatic utterance boundaries were considered.
|
|
| |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| |
| ADS | 10,051 | 8,224 | 1,342 | 0.93 | 1,772 | 10,100 | 8,267 | 1,342 | 0.93 | 1,892 |
| CDS | 24,933 | 20,786 | 2,015 | 0.89 | 5,320 | 24,933 | 20,777 | 2,012 | 0.89 | 5,630 |
Note. Tokens differ for the Human versus Automatic because utterances where human coders (mistakenly) changed register within a continuation were dropped from the Human analyses. ADS = adult-directed speech; CDS = child-directed speech; Sylls = syllables; tokens and types refer to words; MTTR = Moving average Type to Token Ratio (over a sliding 10-word window); Utts = utterances.
Token F-score (in percentage) achieved by each algorithm in child-directed speech (CDS) as a function of that in adult-directed speech (ADS) in the full Winnipeg corpus with human-set utterance boundaries. Error bars indicate two standard deviations (over 10 resamples; see main text and Supplemental Materials, Cristia, 2018a, for details).
CDS F-score minus ADS F-score (in percentages) by algorithm, type of match, and whether human (H) or automatic (A) utterance boundaries were considered.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| base-utt | 2.9 | 1.4 | 1.5 | 3.3 | 1.8 | 1.9 | 1.85 |
| base-syll | 1.3 | −0.2 | 0 | 1.2 | −0.2 | −0.3 | −0.1 |
| phonotactic-unsupervised | −1.6 | −2.5 | −2.5 | −1.3 | −2.3 | −2.4 | −2.35 |
| phonotactic-gold | 3 | 2.6 | 2.8 | 3.1 | 2.8 | 2.8 | 2.8 |
| TP-relative | 5 | 0.7 | 0.9 | 5 | 0.4 | 0.7 | 0.8 |
| TP-average | −1.6 | −2.9 | −2.8 | −1.5 | −2.9 | −3.1 | −2.85 |
| lexical-incremental | 7.9 | −0.6 | 1.6 | 7.1 | 1.2 | 2.3 | 1.95 |
| lexical-unigram | 3.8 | 2.9 | 3.6 | 2.8 | 2.3 | 2.4 | 2.85 |
| lexical-multigram | 3.4 | 1.9 | 0.2 | 3.1 | −0.8 | 0.4 | 1.15 |
| Median | 3 | 0.7 | 0.9 | 3.1 | 0.4 | 0.7 | 1.15 |
Note. Full means the full child-directed speech (CDS) corpus was used; UM = utterance match: CDS corpus shortened to have as many utterances as the adult-directed speech (ADS) corpus; WM = word match: idem for words.
Characteristics of ADS and CDS studied in past and present work.
|
|
|
|
|
|
|---|---|---|---|---|
| Bernstein Ratner | Experimenter | 19,753 | 1,797 | 2,668 |
| Children 9–27 months | 30,996 | 1,501 | 8,252 | |
| Riken | Experimenter | 22,844 | 2,022 | 3,582 |
| Children 18–24 months | 51,315 | 2,850 | 14,570 | |
| Winnipeg | Adults | 8,224 | 1,342 | 1,772 |
| Children 13–38 months | 20,786 | 2,015 | 5,320 |
Note. ADS = adult-directed speech; CDS = child-directed speech; MTTR for the Bernstein Ratner ADS was .93; CDS .88.