Julie M Liss1, Sue LeGendre, Andrew J Lotto. 1. Motor Speech Disorders Laboratory, Arizona State University Coor, 870102, Tempe, AZ 85287, USA. julie.liss@asu.edu
Abstract
PURPOSE: Previous research demonstrated the ability of temporally based rhythm metrics to distinguish among dysarthrias with different prosodic deficit profiles (J. M. Liss et al., 2009). The authors examined whether comparable results could be obtained by an automated analysis of speech envelope modulation spectra (EMS), which quantifies the rhythmicity of speech within specified frequency bands. METHOD: EMS was conducted on sentences produced by 43 speakers with 1 of 4 types of dysarthria and healthy controls. The EMS consisted of the spectra of the slow-rate (up to 10 Hz) amplitude modulations of the full signal and 7 octave bands ranging in center frequency from 125 to 8000 Hz. Six variables were calculated for each band relating to peak frequency and amplitude and relative energy above, below, and in the region of 4 Hz. Discriminant function analyses (DFA) determined which sets of predictor variables best discriminated between and among groups. RESULTS: Each of 6 DFAs identified 2-6 of the 48 predictor variables. These variables achieved 84%-100% classification accuracy for group membership. CONCLUSIONS: Dysarthrias can be characterized by quantifiable temporal patterns in acoustic output. Because EMS analysis is automated and requires no editing or linguistic assumptions, it shows promise as a clinical and research tool.
PURPOSE: Previous research demonstrated the ability of temporally based rhythm metrics to distinguish among dysarthrias with different prosodic deficit profiles (J. M. Liss et al., 2009). The authors examined whether comparable results could be obtained by an automated analysis of speech envelope modulation spectra (EMS), which quantifies the rhythmicity of speech within specified frequency bands. METHOD: EMS was conducted on sentences produced by 43 speakers with 1 of 4 types of dysarthria and healthy controls. The EMS consisted of the spectra of the slow-rate (up to 10 Hz) amplitude modulations of the full signal and 7 octave bands ranging in center frequency from 125 to 8000 Hz. Six variables were calculated for each band relating to peak frequency and amplitude and relative energy above, below, and in the region of 4 Hz. Discriminant function analyses (DFA) determined which sets of predictor variables best discriminated between and among groups. RESULTS: Each of 6 DFAs identified 2-6 of the 48 predictor variables. These variables achieved 84%-100% classification accuracy for group membership. CONCLUSIONS:Dysarthrias can be characterized by quantifiable temporal patterns in acoustic output. Because EMS analysis is automated and requires no editing or linguistic assumptions, it shows promise as a clinical and research tool.
Authors: Julie M Liss; Laurence White; Sven L Mattys; Kaitlin Lansford; Andrew J Lotto; Stephanie M Spitzer; John N Caviness Journal: J Speech Lang Hear Res Date: 2009-08-28 Impact factor: 2.297
Authors: Kristin M Rosen; Joanne E Folker; Adam P Vogel; Louise A Corben; Bruce E Murdoch; Martin B Delatycki Journal: J Neurol Date: 2012-06-06 Impact factor: 4.849
Authors: Alexandra Basilakos; Grigori Yourganov; Dirk-Bart den Ouden; Daniel Fogerty; Chris Rorden; Lynda Feenaughty; Julius Fridriksson Journal: J Speech Lang Hear Res Date: 2017-12-20 Impact factor: 2.297
Authors: Visar Berisha; Julie Liss; Steven Sandoval; Rene Utianski; Andreas Spanias Journal: Proc IEEE Int Conf Acoust Speech Signal Process Date: 2014-05