Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 FRAUG: A FRAME RATE BASED DATA AUGMENTATION METHOD FOR DEPRESSION DETECTION FROM SPEECH SIGNALS.

Literature DB >> 35531125

FRAUG: A FRAME RATE BASED DATA AUGMENTATION METHOD FOR DEPRESSION DETECTION FROM SPEECH SIGNALS.

Vijay Ravi¹, Jinhan Wang¹, Jonathan Flint², Abeer Alwan¹.

Abstract

In this paper, a data augmentation method is proposed for depression detection from speech signals. Samples for data augmentation were created by changing the frame-width and the frame-shift parameters during the feature extraction process. Unlike other data augmentation methods (such as VTLP, pitch perturbation, or speed perturbation), the proposed method does not explicitly change acoustic parameters but rather the time-frequency resolution of frame-level features. The proposed method was evaluated using two different datasets, models, and input acoustic features. For the DAIC-WOZ (English) dataset when using the DepAudioNet model and mel-Spectrograms as input, the proposed method resulted in an improvement of 5.97% (validation) and 25.13% (test) when compared to the baseline. The improvements for the CONVERGE (Mandarin) dataset when using the x-vector embeddings with CNN as the backend and MFCCs as input features were 9.32% (validation) and 12.99% (test). Baseline systems do not incorporate any data augmentation. Further, the proposed method outperformed commonly used data-augmentation methods such as noise augmentation, VTLP, Speed, and Pitch Perturbation. All improvements were statistically significant.

Entities: Chemical

Keywords: data augmentation; depression detection; frame rate; time-frequency resolution; x-vector

Year: 2022 PMID： 35531125 PMCID： PMC9070766 DOI： 10.1109/icassp43922.2022.9746307

Source DB: PubMed Journal: Proc IEEE Int Conf Acoust Speech Signal Process ISSN： 1520-6149

Keyword Cloud
References

8 in total

1. Note on the sampling error of the difference between correlated proportions or percentages.

Authors: Q McNEMAR
Journal: Psychometrika Date: 1947-06 Impact factor: 2.500

2. Automated depression analysis using convolutional neural networks from speech.

Authors: Lang He; Cui Cao
Journal: J Biomed Inform Date: 2018-05-29 Impact factor: 6.317

3. Detection of depressive disorder for patients receiving prepaid or fee-for-service care. Results from the Medical Outcomes Study.

Authors: K B Wells; R D Hays; M A Burnam; W Rogers; S Greenfield; J E Ware
Journal: JAMA Date: 1989-12-15 Impact factor: 56.272

4. The PHQ-8 as a measure of current depression in the general population.

Authors: Kurt Kroenke; Tara W Strine; Robert L Spitzer; Janet B W Williams; Joyce T Berry; Ali H Mokdad
Journal: J Affect Disord Date: 2008-08-27 Impact factor: 4.839

5. Global, regional, and national incidence, prevalence, and years lived with disability for 354 diseases and injuries for 195 countries and territories, 1990-2017: a systematic analysis for the Global Burden of Disease Study 2017.

Authors:
Journal: Lancet Date: 2018-11-08 Impact factor: 79.321

6. Patterns of co-morbidity with anxiety disorders in Chinese women with recurrent major depression.

Authors: Y Li; S Shi; F Yang; J Gao; Youhui Li; M Tao; G Wang; K Zhang; C Gao; L Liu; Kan Li; Keqing Li; Y Liu; Xumei Wang; J Zhang; L Lv; Xueyi Wang; Q Chen; J Hu; L Sun; J Shi; Y Chen; D Xie; J Flint; K S Kendler; Z Zhang
Journal: Psychol Med Date: 2011-11-30 Impact factor: 7.723

7. Projections of global mortality and burden of disease from 2002 to 2030.

Authors: Colin D Mathers; Dejan Loncar
Journal: PLoS Med Date: 2006-11 Impact factor: 11.069

8. Multilayer perceptron neural network model development for mechanical ventilator parameters prediction by real time system learning.

Authors: Sita Radhakrishnan; Suresh G Nair; Johney Isaac
Journal: Biomed Signal Process Control Date: 2021-09-20 Impact factor: 3.880

8 in total