| Literature DB >> 35494799 |
Hira Zahid1, Munaf Rashid2, Samreen Hussain3, Fahad Azim4, Sidra Abid Syed5, Afshan Saad6.
Abstract
Background and Objective: Humans communicate with one another using language systems such as written words or body language (movements), hand motions, head gestures, facial expressions, lip motion, and many more. Comprehending sign language is just as crucial as learning a natural language. Sign language is the primary mode of communication for those who have a deaf or mute impairment or are disabled. Without a translator, people with auditory difficulties have difficulty speaking with other individuals. Studies in automatic recognition of sign language identification utilizing machine learning techniques have recently shown exceptional success and made significant progress. The primary objective of this research is to conduct a literature review on all the work completed on the recognition of Urdu Sign Language through machine learning classifiers to date. Materials and methods: All the studies have been extracted from databases, i.e., PubMed, IEEE, Science Direct, and Google Scholar, using a structured set of keywords. Each study has gone through proper screening criteria, i.e., exclusion and inclusion criteria. PRISMA guidelines have been followed and implemented adequately throughout this literature review.Entities:
Keywords: Deep learning; Machine learning; Pakistani sign language; Pattern recognition; Sign language recognition; Urdu sign language
Year: 2022 PMID: 35494799 PMCID: PMC9044266 DOI: 10.7717/peerj-cs.883
Source DB: PubMed Journal: PeerJ Comput Sci ISSN: 2376-5992
Figure 1(A) Sign of Huroof-e- tahaji. (B) Sign of numbers (anonymous, b).
Detail conditions that are set for the adding and eliminating of published articles.
| Conditions to add published articles | Conditions to eliminate published articles |
|---|---|
| Research articles that use Urdu sign language as a language for detection | Research articles that do not use Urdu sign language for detection. |
| Research articles that use machine learning classifiers as a problem solution. | Research articles that do not use machine learning classifiers as a problem solution |
| Research articles that report quantitative outcomes of machine learning. | Research articles that do not report quantitative outcomes of machine learning. |
| Research articles that only uses gesture-based, character bases, or EMG signal-based as a dataset to recognize Urdu Sign Language. | Research articles that do not use either gesture- based, character-based, or EMG signal-based as a dataset to recognize Urdu Sign Language. |
| Research articles that are written in the English language | Research articles that do not write in the English language. |
| Research articles that are published in Journals or conference proceedings. | Research articles that are published in either Journals or conference proceedings. |
Summarized detail of dataset for 20 included articles.
| Author/Year | Dataset name | Dataset type | Dataset demographics | Publically available or not? | No. of images/ signals | Overall accuracy |
|---|---|---|---|---|---|---|
|
| NA | PSL signs images | No | NA | 91% | |
|
| NA | PSL signs images | No | NA | 90% | |
|
| NA | PSL signs images | NA | No | NA | 97% ; 86% |
|
| NA | Text images from environment | NR | Yes | Isolated Urdu character image dataset contains 19901 images; cropped word image dataset contains 14100 words. | 95%; 78.13% |
|
| NA | Signs of 37 characters of Urdu language | NA | Yes | 40 images of each letter of Urdu language. 3˘7 x 40 = 1480 images. | 90% |
|
| NA | Counting from 1 till 10 | NA | No | Each sign’s dataset ranged from 1500 to 2000 images. The total number of imagesk was around 21000. | NA |
|
| CENPARMI | Images of s isolated digits, numeral strings with/without decimal points, five special symbols, 44 isolated characters, 57 Urdu words (mostly financial related), | NR | No | 14,407 samples images | 97% |
|
| UPTI | Images of sentences | NR | Yes | 10063 sentence images | 96%; 95% |
|
| NA | 11 phrases of PSL using sEMG | NA | No | 550 EMG signals | 85.4% |
|
| NA | 37 Urdu alphabets | NA | No | NA | 75% |
|
| NA | 38 Urdu character and 10 numerals | 500 native Urdu speakers | No | 800 images (800 × 10 = 8000 numeral images and | 96.04%; 98.3% |
|
| NA | 26 alphabets in PSL using sEMG | NA | No | 30 signals Per alphabet was recorded. | 81%; 63% |
|
| NA | NA | NA | No | NA | 92.5% |
|
| NA | NA | NA | No | NA | 93.4% |
|
| CLE | Synthetic image with ligature | NR | Yes | 3801 training images and 423 test images. | 94% |
|
| UNHD | Sentences of Urdu language | 500 writers | Yes | The dataset consist of 312,000 words written | 92% |
Notes.
Pakistani Sign Language
Not Available
Not Required
Centre for Pattern Recognition and Machine Intelligence
Urdu Printed Text Images
Urdu-Nasta’liq Handwritten Dataset
Figure 2Pie Chart depicts the total number of research studies.
Figure 3Publication trends in the last 17 years.
Figure 4Flowchart as per PRISMA (Liberati et al., 2009) guidelines.
Figure 5Pie chart mainly representing the used dataset.
Figure 6Graph of reported accuracies of the publicly available dataset.
Figure 7Graph of reported accuracies of the dataset that is not publicly available.
Summarized detail of machine learning classifiers and feature extraction of 20 included articles.
| Author/Year | Journal/ Conference | Dataset | Classification technique | Feature selection/filter | Overall accuracy | Overall sensitivity | Overall specificity |
|---|---|---|---|---|---|---|---|
|
| Journal | NA | DTW | Normalization of skeleton frame data | 91% | NA | NA |
|
| Conference | NA | Euclidean Distance | PCA coefficients | 90% | NA | NA |
|
| Conference | NA | Bagging Ensemble | Dimension reduction by PCA | 97%; 86% | NA | NA |
|
| Journal | NA | SVM; RNN-CNN | HOG, LBP; CTC | 95%; 78.13% | NA | NA |
|
| Journal | NA | SVM | Saturation and Hue components of image | 90% | NA | NA |
|
| Journal | NA | CNN | Inception V3 architecture | NA | NA | NA |
|
| Conference | CENPARMI | SVM | Structural and gradient features | 97% | NA | NA |
|
| Journal | UPTI | ULR-SDA; SVM | jitter, elastic elongation, threshold and sensitivity | 96%; 95% | NA | NA |
|
| Conference | NA | Linear SVM | Time domain, spectral domain, shape, and texture | 85.4% | 85.36% | 85.81% |
|
| Conference | NA | Cross correlation | Edge detection of second derivative | 75% | NA | NA |
|
| Journal | NA | CNN | Pixel- and geometrical-based | 98.3% | NA | NA |
|
| Conference | NA | Linear Discriminant; Quadratic discriminant | Time domain, statistical domain, shape, spectral domain, cepstral domain | 81%; 63% | 84.05%; 66.4% | 84.7%; 65.9% |
|
| Conference | NA | HMM | 3D vectors | 92.5% | NA | NA |
|
| Journal | NA | NN | Segmentation of characters | 93.4% | NA | NA |
|
| Journal | CLE | DLN (FasterRCNN, RRNN, TSDNN) | Resnet50, Googlenet | 94% | NA | NA |
|
| Conference | CENPARMI | SVM | 400D gradient feature | 98.61% | NA | NA |
|
| Journal | UNHD | RNN | Textline segmentation | 92% | NA | NA |
|
| Conference | UPTI | Line segmentation | Contour extraction, shape context | 91% | NA | NA |
|
| Journal | UPTI | MD-RNN | vertical and horizontal edges intensities, foreground distribution, density function, intensity feature, horizontal projection, contrast intensity | 94.97% | NA | NA |
|
| Conference | UPTI | BLSTM-RNN | Baseline information | 86.42% | NA | NA |
|
| journal | NA | SVM | Achromatic decomposition | 90% | NA | NA |
Notes.
Pakistani Sign Language
Not Available
Not Required
Dynamic Time Warping
Principal Component Analysis
Support Vector Machine
Histogram of Oriented Gradients
Local Binary Pattern
Connectionist Temporal Classification
Recurrent Neural Network
Convolutional Neural Network
Centre for Pattern Recognition and Machine Intelligence
Urdu Printed Text Images
Urdu Ligature Recognition Stacked Denoising Auto encoder
Electro MyoGraphy
Hidden Markov Model
Neural Network
Deep Learning Network
Regression Residual Neural Network
Two Stream Deep Neural Network
Center of Language Engineering
Urdu-Nasta’liq Handwritten Dataset
Multi-Dimensional Recurrent Neural Network
Bidirectional Long Short Term Memory
Figure 8Graph of reported accuracies of detection of Urdu Sign Language using SVM.
Figure 9Graph of reported accuracies of detection of Urdu Sign Language using Neural Network.
Figure 10Graph of reported accuracies for detection of Urdu Printed Text Images.