Literature DB >> 19194286

The influence of age, hearing, and working memory on the speech comprehension benefit derived from an automatic speech recognition system.

Adriana A Zekveld1, Sophia E Kramer, Judith M Kessens, Marcel S M G Vlaming, Tammo Houtgast.   

Abstract

OBJECTIVE: The aim of the current study was to examine whether partly incorrect subtitles that are automatically generated by an Automatic Speech Recognition (ASR) system, improve speech comprehension by listeners with hearing impairment. In an earlier study (Zekveld et al. 2008), we showed that speech comprehension in noise by young listeners with normal hearing improves when presenting partly incorrect, automatically generated subtitles. The current study focused on the effects of age, hearing loss, visual working memory capacity, and linguistic skills on the benefit obtained from automatically generated subtitles during listening to speech in noise.
DESIGN: In order to investigate the effects of age and hearing loss, three groups of participants were included: 22 young persons with normal hearing (YNH, mean age = 21 years), 22 middle-aged adults with normal hearing (MA-NH, mean age = 55 years) and 30 middle-aged adults with hearing impairment (MA-HI, mean age = 57 years). The benefit from automatic subtitling was measured by Speech Reception Threshold (SRT) tests (Plomp & Mimpen, 1979). Both unimodal auditory and bimodal audiovisual SRT tests were performed. In the audiovisual tests, the subtitles were presented simultaneously with the speech, whereas in the auditory test, only speech was presented. The difference between the auditory and audiovisual SRT was defined as the audiovisual benefit. Participants additionally rated the listening effort. We examined the influences of ASR accuracy level and text delay on the audiovisual benefit and the listening effort using a repeated measures General Linear Model analysis. In a correlation analysis, we evaluated the relationships between age, auditory SRT, visual working memory capacity and the audiovisual benefit and listening effort.
RESULTS: The automatically generated subtitles improved speech comprehension in noise for all ASR accuracies and delays covered by the current study. Higher ASR accuracy levels resulted in more benefit obtained from the subtitles. Speech comprehension improved even for relatively low ASR accuracy levels; for example, participants obtained about 2 dB SNR audiovisual benefit for ASR accuracies around 74%. Delaying the presentation of the text reduced the benefit and increased the listening effort. Participants with relatively low unimodal speech comprehension obtained greater benefit from the subtitles than participants with better unimodal speech comprehension. We observed an age-related decline in the working-memory capacity of the listeners with normal hearing. A higher age and a lower working memory capacity were associated with increased effort required to use the subtitles to improve speech comprehension.
CONCLUSIONS: Participants were able to use partly incorrect and delayed subtitles to increase their comprehension of speech in noise, regardless of age and hearing loss. This supports the further development and evaluation of an assistive listening system that displays automatically recognized speech to aid speech comprehension by listeners with hearing impairment.

Entities:  

Mesh:

Year:  2009        PMID: 19194286     DOI: 10.1097/AUD.0b013e3181987063

Source DB:  PubMed          Journal:  Ear Hear        ISSN: 0196-0202            Impact factor:   3.570


  7 in total

1.  Combining partial information from speech and text.

Authors:  Daniel Fogerty; Irraj Iftikhar; Rachel Madorskiy
Journal:  J Acoust Soc Am       Date:  2020-02       Impact factor: 1.840

2.  User evaluation of a communication system that automatically generates captions to improve telephone communication.

Authors:  Adriana A Zekveld; Sophia E Kramer; Judith M Kessens; Marcel S M G Vlaming; Tammo Houtgast
Journal:  Trends Amplif       Date:  2009-01-05

3.  Text as a Supplement to Speech in Young and Older Adults.

Authors:  Vidya Krull; Larry E Humes
Journal:  Ear Hear       Date:  2016 Mar-Apr       Impact factor: 3.570

Review 4.  How linguistic closure and verbal working memory relate to speech recognition in noise--a review.

Authors:  Jana Besser; Thomas Koelewijn; Adriana A Zekveld; Sophia E Kramer; Joost M Festen
Journal:  Trends Amplif       Date:  2013-08-13

5.  Left temporal alpha-band activity reflects single word intelligibility.

Authors:  Robert Becker; Maria Pefkou; Christoph M Michel; Alexis G Hervais-Adelman
Journal:  Front Syst Neurosci       Date:  2013-12-27

6.  Effect of signal to noise ratio on the speech perception ability of older adults.

Authors:  Elahe Shojaei; Hassan Ashayeri; Zahra Jafari; Mohammad Reza Zarrin Dast; Koorosh Kamali
Journal:  Med J Islam Repub Iran       Date:  2016-03-09

7.  Objective Assessment of Listening Effort: Coregistration of Pupillometry and EEG.

Authors:  Kelly Miles; Catherine McMahon; Isabelle Boisvert; Ronny Ibrahim; Peter de Lissa; Petra Graham; Björn Lyxell
Journal:  Trends Hear       Date:  2017 Jan-Dec       Impact factor: 3.293

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.