| Literature DB >> 23082167 |
Nozomi Naoi1, Shigeru Watanabe, Kikuo Maekawa, Junko Hibiya.
Abstract
In human verbal communication, not only lexical information, but also paralinguistic information plays an important role in transmitting the speakers' mental state. Paralinguistic information is conveyed mainly through acoustic features like pitch, rhythm, tempo and so on. These acoustic features are generally known as prosody. It is known that some species of birds can discriminate certain aspects of human speech. However, there have not been any studies on the discrimination of prosody in human language which convey different paralinguistic meanings by birds. In the present study, we have shown that the Java sparrow (Padda oryzivora) can discriminate different prosodic patterns of Japanese sentences. These birds could generalize prosodic discrimination to novel sentences, but could not generalize sentence discrimination to those with novel prosody. Moreover, unlike Japanese speakers, Java sparrows used the first part of the utterance as the discrimination cue.Entities:
Mesh:
Year: 2012 PMID: 23082167 PMCID: PMC3474804 DOI: 10.1371/journal.pone.0047446
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Results of generalization tests.
The abscissa is the probability of a false alarm, and the ordinate is the probability of a hit. ROC (Relative Operating Characteristics) curve and the value of d’ are depicted in the figure. The bar graph displays the means of hits and false alarms. * p<0.05, ** p<0.01, *** p<0.001 A. Results of the prosody generalization tests (Test 1) in Experiment 1. Open circles indicate birds in the Admiration Group, and closed circles, birds in the Suspicion Group. B. Results of the test with hybrid stimuli (Test 2-1) in Experiment 1. C. Results of the test with hybrid stimuli (Test 2-2) in Experiment 1. D. Results of the text generalization test in Experiment 2. Open circles indicate birds trained to respond to the text “so'H desu ka” and closed circles indicate birds trained to respond to the text “ana'ta desu ka” respectively. NS: non-significant.
Figure 2The typical pitch contours of the sentence “so’H desu ka” (Maekawa & Kitagawa (2000), partly corrected by the authors).
The time axis is in seconds, and the zero point corresponds to the beginning of/s/. Arrows denote the timing of accentual fall. Sentences were spoken by speaker ST.
Figure 3A diagram of the apparatus.
Acoustic features of training and test stimuli.
| Total | The Beginning Part | The Latter Part | ||||
| Mean Pitch (Hz) | Mean Pitch Range (Hz) | Mean Pitch (Hz) | Mean Pitch Range (Hz) | Mean Pitch (Hz) | Mean Pitch Range (Hz) | |
| Training Stimuli | ||||||
| Stimulus A | 146.9 | 193.3 | 180.4 | 188.0 | 93.3 | 45.1 |
| Stimulus S | 174.3 | 267.0 | 184.9 | 203.4 | 162.8 | 267.0 |
| Test Stimuli | ||||||
| Test Stimulus A | 154.9 | 217.4 | 203.2 | 217.4 | 92.9 | 48.3 |
| Test Stimulus S | 150.8 | 273.4 | 160.3 | 252.7 | 139.2 | 268.7 |