| Literature DB >> 24416119 |
Martijn Wieling1, John Nerbonne2, Jelke Bloem3, Charlotte Gooskens2, Wilbert Heeringa2, R Harald Baayen4.
Abstract
In this study we develop pronunciation distances based on naive discriminative learning (NDL). Measures of pronunciation distance are used in several subfields of linguistics, including psycholinguistics, dialectology and typology. In contrast to the commonly used Levenshtein algorithm, NDL is grounded in cognitive theory of competitive reinforcement learning and is able to generate asymmetrical pronunciation distances. In a first study, we validated the NDL-based pronunciation distances by comparing them to a large set of native-likeness ratings given by native American English speakers when presented with accented English speech. In a second study, the NDL-based pronunciation distances were validated on the basis of perceptual dialect distances of Norwegian speakers. Results indicated that the NDL-based pronunciation distances matched perceptual distances reasonably well with correlations ranging between 0.7 and 0.8. While the correlations were comparable to those obtained using the Levenshtein distance, the NDL-based approach is more flexible as it is also able to incorporate acoustic information other than sound segments.Entities:
Mesh:
Year: 2014 PMID: 24416119 PMCID: PMC3886970 DOI: 10.1371/journal.pone.0075734
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Basic Levenshtein distance alignment.
| w | ε | n | z | d | e |
| |
| w | ε | n |
| s | d | e | |
| 1 | 1 | 1 |
Levenshtein distance alignment with sensitive sound distances.
| w | ε | n | z | d | e |
| |
| w | ε | n |
| s | d | e | |
| 0.031 | 0.020 | 0.030 |
Part of the table used for estimating the association strengths. The ‘#’ marks the word boundary.
| Speaker | Outcome | Pronunciation | Cues | Frequency |
| english23 | with | [w | #w | 28,169,384 |
| english167 | with | [w | #w | 28,169,384 |
| english23 | her | [h | #h | 852,131 |
| english167 | her | [ | # | 852,131 |
The association strengths for the cues and outcomes in Table 1 for our simulated native AE listener after these have been estimated on the basis of the input of 58 randomly selected native AE speakers.
| Cue | Association strength for ‘with’ | Association strength for ‘her’ |
| #w | 0.2519 | 0.0000 |
| w | 0.3738 | 0.0000 |
|
| 0.3738 | 0.0000 |
| w | 0.3741 | 0.0000 |
|
| 0.3741 | 0.0000 |
| #h | 0.0000 | 0.4973 |
| h | 0.0000 | 0.2433 |
|
| 0.0000 | 0.2594 |
| # | 0.0000 | 1.0000 |
The activations of different outcomes on the basis of the association strengths between the cues and outcomes for our simulated native AE listener (shown in Table 2).
| Speaker | Outcome | Pronunciation | Cues | Activation of outcome |
| english23 | with | [w | #w | 0.9995 |
| english167 | with | [w | #w | 1.0000 |
| english23 | her | [h | #h | 1.0000 |
| english167 | her | [ | # | 1.0000 |
| mandarin10 | with | [w | #w | 0.2519 |
| serbian10 | her | [x | #x | 0.2594 |
Part of the NDL-based Norwegian dialect pronunciation distances.
| Bergen | Bjugn | Bodø | |
| Bergen | X | 0.545 | 0.584 |
| Bjugn | 0.559 | X | 0.319 |
| Bodø | 0.574 | 0.314 | X |
Figure 1Logarithmic relationship between NDL-based pronunciation distances and perceptual distances.