| Literature DB >> 28203463 |
Yang Guo1, Yuanyuan Sun1, Yanmei Feng1, Yujun Zhang1, Shankai Yin1.
Abstract
Acoustic temporal envelope (E) cues containing speech information are distributed across the frequency spectrum. To investigate the relative weight of E cues in different frequency regions for Mandarin sentence recognition, E information was extracted from 30 contiguous bands across the range of 80-7,562 Hz using Hilbert decomposition and then allocated to five frequency regions. Recognition scores were obtained with acoustic E cues from 1 or 2 random regions from 40 normal-hearing listeners. While the recognition scores ranged from 8.2% to 16.3% when E information from only one region was available, the scores ranged from 57.9% to 87.7% when E information from two frequency regions was presented, suggesting a synergistic effect among the temporal E cues in different frequency regions. Next, the relative contributions of the E information from the five frequency regions to sentence perception were computed using a least-squares approach. The results demonstrated that, for Mandarin Chinese, a tonal language, the temporal E cues of Frequency Region 1 (80-502 Hz) and Region 3 (1,022-1,913 Hz) contributed more to the intelligence of sentence recognition than other regions, particularly the region of 80-502 Hz, which contained fundamental frequency (F0) information.Entities:
Mesh:
Year: 2017 PMID: 28203463 PMCID: PMC5288535 DOI: 10.1155/2017/7416727
Source DB: PubMed Journal: Neural Plast ISSN: 1687-5443 Impact factor: 3.599
Cutoff frequency for extracting temporal envelope information.
| Frequency regions | Bands | Lower frequency (Hz) | Upper frequency (Hz) |
|---|---|---|---|
| 1 | 1 | 80 | 115 |
| 2 | 115 | 154 | |
| 3 | 154 | 198 | |
| 4 | 198 | 246 | |
| 5 | 246 | 300 | |
| 6 | 300 | 360 | |
| 7 | 360 | 427 | |
| 8 | 427 | 502 | |
|
| |||
| 2 | 9 | 502 | 585 |
| 10 | 585 | 677 | |
| 11 | 677 | 780 | |
| 12 | 780 | 894 | |
| 13 | 894 | 1022 | |
|
| |||
| 3 | 14 | 1022 | 1164 |
| 15 | 1164 | 1322 | |
| 16 | 1322 | 1499 | |
| 17 | 1499 | 1695 | |
| 18 | 1695 | 1913 | |
|
| |||
| 4 | 19 | 1913 | 2157 |
| 20 | 2157 | 2428 | |
| 21 | 2428 | 2729 | |
| 22 | 2729 | 3066 | |
| 23 | 3066 | 3440 | |
| 24 | 3440 | 3856 | |
|
| |||
| 5 | 25 | 3856 | 4321 |
| 26 | 4321 | 4837 | |
| 27 | 4837 | 5413 | |
| 28 | 5413 | 6054 | |
| 29 | 6054 | 6767 | |
| 30 | 6767 | 7562 | |
Figure 1Averaged percent-correct scores for sentence recognition using acoustic temporal envelope as a function of condition in Group 1. The error bars indicate standard errors.
Figure 2Averaged percent-correct scores for sentence recognition using acoustic temporal envelope as a function of conditions in Group 2 and the condition with a full frequency region in Group 1. The error bars indicate standard errors.
Comparison of percent-correct scores for conditions with two adjacent frequency regions in two groups.
| Conditions | Scores of Group 1 | Scores of Group 2 |
|
|---|---|---|---|
| Region 1 + 2 | 89.6 ± 5.4 (%) | 87.7 ± 4.3 (%) |
|
| Region 2 + 3 | 74.0 ± 5.0 (%) | 77.2 ± 6.4 (%) |
|
| Region 3 + 4 | 79.2 ± 6.7 (%) | 79.9 ± 7.9 (%) |
|
| Region 4 + 5 | 68.8 ± 9.2 (%) | 69.7 ± 7.7 (%) |
|
Figure 3The relative weights of different frequency regions for Mandarin sentence recognition using acoustic temporal envelope. The error bars indicate standard errors.