Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors.

Daniel Bone, Ming Li, Matthew P Black, Shrikanth S Narayanan.

Abstract

Segmental and suprasegmental speech signal modulations offer information about paralinguistic content such as affect, age and gender, pathology, and speaker state. Speaker state encompasses medium-term, temporary physiological phenomena influenced by internal or external biochemical actions (e.g., sleepiness, alcohol intoxication). Perceptual and computational research indicates that detecting speaker state from speech is a challenging task. In this paper, we present a system constructed with multiple representations of prosodic and spectral features that provided the best result at the Intoxication Subchallenge of Interspeech 2011 on the Alcohol Language Corpus. We discuss the details of each classifier and show that fusion improves performance. We additionally address the question of how best to construct a speaker state detection system in terms of robust and practical marginalization of associated variability, such as through modeling speakers, utterance type, gender, and utterance length. As is the case in human perception, speaker normalization provides significant improvements to our system. We show that a held-out set of baseline (sober) data can be used to achieve gains comparable to those of other speaker normalization techniques. Our fused frame-level statistic-functional systems, fused GMM systems, and final combined system achieve unweighted average recalls (UARs) of 69.7%, 65.1%, and 68.8%, respectively, on the test set. Test-set results that are more consistent with the development-set results are obtained with matched-prompt training, where the UARs are 70.4%, 66.2%, and 71.4%, respectively. The combined system improves over the Challenge baseline by 5.5% absolute (8.4% relative) and also improves upon our previous best result.
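The abstract reports results as unweighted average recall (UAR) and describes normalizing each speaker against held-out sober baseline data. The following is a minimal Python sketch of both ideas, not the authors' implementation. The function names, the toy labels, and the choice of z-scoring as the normalization are assumptions made for illustration; the abstract does not specify the exact normalization used.

```python
from collections import defaultdict
from statistics import mean, pstdev


def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recalls, so each class counts equally regardless of size."""
    total = defaultdict(int)
    correct = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    return sum(correct[c] / total[c] for c in total) / len(total)


def normalize_by_baseline(features, baseline):
    """Z-score one speaker's feature values against that speaker's sober baseline.

    Assumption: z-scoring stands in for whatever normalization the paper uses.
    """
    mu = mean(baseline)
    sigma = pstdev(baseline) or 1.0  # guard against a constant baseline
    return [(x - mu) / sigma for x in features]


# Toy, made-up labels: 8 sober and 2 intoxicated utterances.
y_true = ["sober"] * 8 + ["intox"] * 2
y_pred = ["sober"] * 8 + ["sober", "intox"]
uar = unweighted_average_recall(y_true, y_pred)

# Toy, made-up pitch-like values for one hypothetical speaker.
normalized = normalize_by_baseline([12.0, 8.0], baseline=[8.0, 10.0, 12.0])
```

In the toy labels, plain accuracy would be 9 of 10 correct, while UAR averages the sober recall (8 of 8) with the intoxicated recall (1 of 2). This illustrates why UAR is a more informative measure than accuracy when classes are imbalanced.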

Keywords:  GMM supervectors; cognitive and motor load; hierarchical features; intoxication detection; speaker normalization; speaker state

Year:  2014        PMID: 24376305      PMCID: PMC3872081          DOI: 10.1016/j.csl.2012.09.004

Source DB:  PubMed          Journal:  Comput Speech Lang        ISSN: 0885-2308            Impact factor:   1.899


