Virginia Best, Timothy Streeter, Elin Roverud, Christine R Mason, Gerald Kidd.
Abstract
This report introduces a new speech task based on simple questions and answers. The task differs from a traditional sentence recall task in that it involves an element of comprehension and can be implemented in an ongoing fashion. It also contains two target items (the question and the answer) that may be associated with different voices and locations to create dynamic listening scenarios. A set of 227 questions was created, covering six broad categories (days of the week, months of the year, numbers, colors, opposites, and sizes). All questions and their one-word answers were spoken by 11 female and 11 male talkers. In this study, listeners were presented with question-answer pairs and asked to indicate whether the answer was true or false. Responses were given as simple button or key presses, which are quick to make and easy to score. Two preliminary experiments are presented that illustrate different ways of implementing the basic task. In the first experiment, question-answer pairs were presented in speech-shaped noise, and performance was compared across subjects, question categories, and time, to examine the different sources of variability. In the second experiment, sequences of question-answer pairs were presented amidst competing conversations in an ongoing, spatially dynamic listening scenario. Overall, the question-and-answer task appears to be feasible and could be implemented flexibly in a number of different ways.
Keywords: dynamic conversations; realistic tests; speech comprehension
Year: 2016 PMID: 27888257 PMCID: PMC5131808 DOI: 10.1177/2331216516678706
Source DB: PubMed Journal: Trends Hear ISSN: 2331-2165 Impact factor: 3.293
Description of the Six Question Categories.
| Category | Number of questions | Number of valid answers | Example question | Correct answer | Example incorrect answer |
|---|---|---|---|---|---|
| Days | 14 | 7 | What day comes after Monday? | Tuesday | Friday |
| Months | 24 | 12 | What month comes before April? | March | October |
| Colors | 20 | 8 | What color is the sky? | Blue | Green |
| Opposites | 20 | 20 | What is the opposite of up? | Down | Open |
| Sizes | 20 | 2 | Which is bigger, an elephant, or a mouse? | Elephant | Mouse |
| Numbers | 129 | 23 | What is two plus two? | Four | Eight |
Mean Word Duration and Mean F0, Calculated Across all Questions, for Each of the 22 Talkers. Also Shown Are the Across-Talker Mean and Standard Deviations for Females and Males.
| Talker | Female duration (ms) | Female F0 (Hz) | Male duration (ms) | Male F0 (Hz) |
|---|---|---|---|---|
| 1 | 394 | 231 | 343 | 121 |
| 2 | 378 | 210 | 337 | 110 |
| 3 | 351 | 227 | 298 | 139 |
| 4 | 341 | 204 | 290 | 117 |
| 5 | 357 | 214 | 350 | 133 |
| 6 | 328 | 200 | 363 | 99 |
| 7 | 334 | 220 | 349 | 132 |
| 8 | 306 | 193 | 360 | 153 |
| 9 | 353 | 226 | 339 | 115 |
| 10 | 402 | 207 | 367 | 127 |
| 11 | 418 | 223 | 409 | 120 |
| Mean (± SD) | 360 ± 34 | 214 ± 12 | 346 ± 32 | 124 ± 15 |
Figure 1. (a) Psychometric functions for each listener (gray lines) and the mean psychometric function (black lines and circles). (b) Mean psychometric functions for each category type. (c) Mean psychometric functions based on the first half (circles) and second half (squares) of the trials completed by each subject at each SNR. (d) Mean psychometric function in units of d′ (circles) and bias (squares). Error bars, where shown, represent across-subject standard deviations.
Figure 2. Spatial and temporal configuration of the targets and maskers in the dynamic listening condition. The nine horizontal lines represent the nine stimulus positions (from −60° to +60° azimuth), of which three were potential target locations (−30°, 0°, +30°), and the remaining six were occupied by three pairs of masker talkers engaged in conversation. The shaded bars indicate the times during which a particular talker was speaking (colors: targets; gray: maskers). In this example, three questions and answers (labeled Q1, A1, etc.) out of the sequence of 12 are shown. The fixed condition was identical except that all questions and answers came from one of the three target locations.
Figure 3. (a) Psychometric functions for the fixed and dynamic listening conditions (averaged across all locations and all listeners). (b) Psychometric functions in units of d′ (circles) and bias (squares) for the fixed and dynamic listening conditions (averaged across all locations and all listeners). Error bars in both panels show across-subject standard deviations.
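The figure captions report performance in units of d′ and bias. As a rough illustration of how such values can be derived from true/false responses, the sketch below uses the standard equal-variance signal detection formulas, d′ = z(H) − z(F) and criterion c = −[z(H) + z(F)]/2. Several things here are assumptions rather than details from this record: that "true" trials are treated as signal trials, that bias is expressed as the criterion c, and that a log-linear correction is used to avoid infinite z-scores. The function name `dprime_and_bias` is hypothetical.

```python
from statistics import NormalDist


def dprime_and_bias(hits, misses, false_alarms, correct_rejections):
    """Return (d_prime, criterion) from true/false response counts.

    Assumed mapping (not taken from the source):
      hit               = answer was true,  listener responded "true"
      miss              = answer was true,  listener responded "false"
      false alarm       = answer was false, listener responded "true"
      correct rejection = answer was false, listener responded "false"
    """
    z = NormalDist().inv_cdf  # inverse of the standard normal CDF

    # Log-linear correction (an assumption): keeps rates away from 0 and 1,
    # where the z-transform would be infinite.
    hit_rate = (hits + 0.5) / (hits + misses + 1)
    fa_rate = (false_alarms + 0.5) / (false_alarms + correct_rejections + 1)

    d_prime = z(hit_rate) - z(fa_rate)
    criterion = -0.5 * (z(hit_rate) + z(fa_rate))
    return d_prime, criterion


if __name__ == "__main__":
    # Made-up counts for illustration only, not data from the study.
    d, c = dprime_and_bias(hits=40, misses=10, false_alarms=10, correct_rejections=40)
    print(f"d' = {d:.2f}, c = {c:.2f}")
```

Under this convention, a positive c indicates a tendency to respond "false" and a negative c a tendency to respond "true"; a listener responding at chance yields d′ = 0.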