| Literature DB >> 35023093 |
Eleana E I Almaloglou1, Geronikolou S2, George Chrousos3, Kotropoulos K1.
Abstract
Pathological speech, in its many forms, is a symptom of numerous serious diseases affecting millions of people worldwide, including more than 10 million Parkinson patients. Here, a powerful method is proposed for detecting pathological speech, using a two-dimensional (2D) convolutional neural network (CNN). Spectrograms are extracted from voice recordings of healthy and Parkinson diagnosed patients, which are fed into the CNN architecture. The voice samples comprise a subset of the benchmark mobile Parkinson Disease (mPower) study. The proposed model achieves 98% accuracy in Parkinson detection (i.e., a two-class problem). Moreover, an average accuracy exceeding 94% is measured in binary tests (i.e., pathological versus healthy) employing six voice pathologies conducted on the Saarbruecken Voice Database. These pathologies are dysphonia, functional dysphonia, hyperfunctional dysphonia, spasmodic dysphonia, vocal fold polyp, and dysody.Entities:
Keywords: Audio classification; Convolutional neural network; Deep learning; Pathological speech; Saarbruecken voice database; Spectrogram; mPower study
Mesh:
Year: 2021 PMID: 35023093 DOI: 10.1007/978-3-030-78787-5_11
Source DB: PubMed Journal: Adv Exp Med Biol ISSN: 0065-2598 Impact factor: 2.622