| Literature DB >> 33916993 |
Seedahmed S Mahmoud1, Akshay Kumar1, Youcun Li1, Yiting Tang1, Qiang Fang1.
Abstract
Speech assessment is an essential part of the rehabilitation procedure for patients with aphasia (PWA). It is a comprehensive and time-consuming process that aims to discriminate between healthy individuals and aphasic patients, determine the type of aphasia syndrome, and determine the patients' impairment severity levels (these are referred to here as aphasia assessment tasks). Hence, the automation of aphasia assessment tasks is essential. In this study, the performance of three automatic speech assessment models based on the speech dataset-type was investigated. Three types of datasets were used: healthy subjects' dataset, aphasic patients' dataset, and a combination of healthy and aphasic datasets. Two machine learning (ML)-based frameworks, classical machine learning (CML) and deep neural network (DNN), were considered in the design of the proposed speech assessment models. In this paper, the DNN-based framework was based on a convolutional neural network (CNN). Direct or indirect transformation of these models to achieve the aphasia assessment tasks was investigated. Comparative performance results for each of the speech assessment models showed that quadrature-based high-resolution time-frequency images with a CNN framework outperformed all the CML frameworks over the three dataset-types. The CNN-based framework reported an accuracy of 99.23 ± 0.003% with the healthy individuals' dataset and 67.78 ± 0.047% with the aphasic patients' dataset. Moreover, direct or transformed relationships between the proposed speech assessment models and the aphasia assessment tasks are attainable, given a suitable dataset-type, a reasonably sized dataset, and appropriate decision logic in the ML framework.Entities:
Keywords: Mandarin; aphasia assessment; deep neural network; machine learning framework; speech impairment
Mesh:
Year: 2021 PMID: 33916993 PMCID: PMC8067696 DOI: 10.3390/s21082582
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Aphasic patients’ details.
| Number of Patients | Gender Male/Female | Age, Yrs. | Cardinal Symptom (#) | Native Dialect (#) |
|---|---|---|---|---|
| 12 | 7/5 | 61.8 ± 14.4 | Broca (6) | Mandarin (6) |
| Dysarthria (3) | ||||
| Anomic (1) | Teochew (2) | |||
| Combined (1) | ||||
| Transcortical motor (1) | Jiaxing (4) |
Figure 1A typical classical machine learning framework for the three speech assessment models.
Figure 2A typical convolutional neural network (CNN)-based classification framework for the three speech assessment models.
Figure 3A general framework for aphasia assessment tasks.
Figure 4Performance evaluation for the CNN and classical machine learning (CML) classification frameworks, Model-A, on (a) the Only words (20 classes) of healthy dataset-type and (b) the Vowels + Words (26 classes) of healthy dataset-type.
Figure 5Performance evaluation for the CNN and CML classification frameworks, Model-B, on (a) the Only words of aphasic patients’ dataset-type and (b) the Vowels + Words of aphasic patients’ dataset-type.
Figure 6Performance evaluation for the CNN and CML classification frameworks, Model-C, on (a) the Only words of aphasic patients and healthy subjects’ dataset-types and (b) the Vowels + Words of aphasic patients and healthy subjects’ dataset-types.