| Literature DB >> 27652095 |
S S Mostafa1, M A Awal2, M Ahmad1, M A Rashid3.
Abstract
Some people cannot produce sound although their facial muscles work properly due to having problem in their vocal cords. Therefore, recognition of alphabets as well as sentences uttered by these voiceless people is a complex task. This paper proposes a novel method to solve this problem using non-invasive surface Electromyogram (sEMG). Firstly, eleven Bangla vowels are pronounced and sEMG signals are recorded at the same time. Different features are extracted and mRMR feature selection algorithm is then applied to select prominent feature subset from the large feature vector. After that, these prominent features subset is applied in the Artificial Neural Network for vowel classification. This novel Bangla vowel classification method can offer a significant contribution in voice synthesis as well as in speech communication. The result of this experiment shows an overall accuracy of 82.3 % with fewer features compared to other studies in different languages.Entities:
Keywords: ANN; Bangla vowel; Classification; Feature selection; Wavelet transform; sEMG
Year: 2016 PMID: 27652095 PMCID: PMC5017969 DOI: 10.1186/s40064-016-3170-9
Source DB: PubMed Journal: Springerplus ISSN: 2193-1801
Fig. 1Anatomy of a a normal person (with larynx) and b a laryngectomy patient
Fig. 2General pipeline of voiceless Bangla vowel classification
Bengali vowel letter chart
| Numerical | Alphabetical full form | Name of full form | IPA transcription | Numerical | Alphabetical full form | Name of full form | IPA transcription |
|---|---|---|---|---|---|---|---|
| 1 | অ | shôro ô (shôre ô) “vowel ô” | /ɔ/and/o/ | 6 | ঊ | dirgho u “long u” | /u/ |
| 2 | আ | shôro a (shôre a) “vowel a” | /a/ | 7 | ঋ | ri | /ri/ |
| 3 | ই | hrôshsho i (hrôshsho i) “short i” | /i/ | 8 | এ | e | /e/and/æ/ |
| 4 | ঈ | dirgho i “long i” | /i/ | 9 | ঐ | oi | /oj/ |
| 5 | উ | hrôshsho u (rôshsho u) “short u” | /u/ | 10 | ও | o | /o/ |
| 11 | ঔ | ou | /ow/ |
Fig. 3Facial EMG recording of a subject
Fig. 4EMG and integrated EMG data extraction
Fig. 5Raw sEMG signal recorded during experiments 1–11 represent the bengali vowel letter from অ to ঔ
Features employed in this study
| Domain | Name of the features | Number of features |
|---|---|---|
| Time based | Average, maximum, standard deviation, minimum, variance, CoV, skewness, kurtosis, RMS | 9 |
| Entropy based | Hjorth mobility, Hjorth complexity, mean of lower envelop, mean of upper envelop, mean diff operator, Higuchi fractal dimension, Higuchi algorithmic entropy, renyi entropy, shannon entropy, Tsallis entropy, Hurst exponential, approximate entropy, sample entropy | 13 |
| Frequency based | Spectral flatness, spectral flux, spectral entropy, spectral edge frequency (SEF) 80, spectral edge power (SEF) 80, spectral edge frequency (SEF) 90, spectral edge power (SEF) 90, spectral edge frequency (SEF) 95, spectral edge power (SEF) 95 | 9 |
| Time–frequency | Symlet 4 wavelet family and decomposition level:3. Entropy (e), variance(V), standard deviation, median (s), average, ration of maximum and min of 4 decomposing s vector’s. Percentage of energy corresponding to the approximation, percentage of energy corresponding to the details | 26 |
| – | – | Total 57 |
Fig. 6Confusion matrix for the Bangla vowels
Fig. 7ROC curve for Bangla vowels. The zoomed version of the figure is shown inside the figure
Fig. 8The performance curve showing the train, validation and test as well as best performance of the system