| Literature DB >> 32715044 |
Abstract
Automatic sign language recognition provides better services to the deaf as it avoids the existing communication gap between them and the rest of the society. Hand gestures, the primary mode of sign language communication, plays a key role in improving sign language recognition. This article presents a video dataset of the hand gestures of Indian sign language (ISL) words used in emergency situations. The videos of eight ISL words have been collected from 26 individuals (including 12 males and 14 females) in the age group of 22 to 26 years with two samples from each individual in an indoor environment with normal lighting conditions. Such a video dataset is highly needed for automatic recognition of emergency situations from the sign language for the benefit of the deaf. The dataset is useful for the researchers working on vision based sign language recognition (SLR) as well as hand gesture recognition (HGR). Moreover, support vector machine based classification and deep learning based classification of the emergency gestures has been carried out and the base classification performance shows that the database can be used as a benchmarking dataset for developing novel and improved techniques for recognizing the hand gestures of emergency words in Indian sign language.Entities:
Keywords: Emergency words; Hand gestures; Indian sign language recognition; Video data
Year: 2020 PMID: 32715044 PMCID: PMC7378574 DOI: 10.1016/j.dib.2020.106016
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1The key frame sequences of the hand gestures of the ISL words included in the `Cropped_Data’ set.
Organization of raw videos in the dataset.
| Folder | File Name | Description |
|---|---|---|
| accident_Raw | accident_001_01 to accident_026_01, accident_001_02 to accident_026_02 | 52 sample videos of ISL hand gestures for the word `accident' presented by 26 subjects. |
| call_Raw | call_001_01 to call_026_01, call_001_02 to call_026_02 | 52 sample videos of ISL hand gestures for the word `call' presented by 26 subjects. |
| doctor_Raw | doctor_001_01 to doctor_026_01, doctor_001_02 to doctor_026_02 | 52 sample videos of ISL hand gestures for the word `doctor' presented by 26 subjects. |
| help_Raw | help_001_01 to help_026_01, help_001_02 to help_026_02 | 52 sample videos of ISL hand gestures for the word `help' presented by 26 subjects. |
| hot_Raw | hot_001_01 to hot_026_01, hot_001_02 to hot_026_02 | 52 sample videos of ISL hand gestures for the word `hot' presented by 26 subjects. |
| lose_Raw | lose_001_01 to lose_018_01, lose_020_01 to lose_026_01, lose_001_02 to lose_018_02, lose_020_02 to lose_026_02 | 50 sample videos of ISL hand gestures for the word `lose' presented by 25 subjects. |
| pain_Raw | pain_001_01 to pain_026_01, pain_001_02 to pain_026_02 | 52 sample videos of ISL hand gestures for the word `pain' presented by 26 subjects. |
| thief_Raw | thief_001_01 to thief_019_01, thief_021_01 to thief_026_01, thief_001_02 to thief_019_02, thief_021_02 to thief_026_02 | 50 sample videos of ISL hand gestures for the word `thief' presented by 25 subjects. |
Organization of cropped videos in the dataset
| Folder | File Name | Description |
|---|---|---|
| accident_Cropped | accident_crop_xxx_yy | 52 sample videos of ISL hand gestures for the word `accident' presented by 26 subjects. |
| call_Cropped | call_crop_xxx_yy | 52 sample videos of ISL hand gestures for the word `call' presented by 26 subjects. |
| doctor_Cropped | doctor_crop_xxx_yy | 52 sample videos of ISL hand gestures for the word `doctor' presented by 26 subjects. |
| help_Cropped | help_crop_xxx_yy | 52 sample videos of ISL hand gestures for the word `help' presented by 26 subjects. |
| hot_Cropped | hot_crop_xxx_yy | 52 sample videos of ISL hand gestures for the word `hot' presented by 26 subjects. |
| lose_Cropped | lose_crop_xxx_yy | 50 sample videos of ISL hand gestures for the word `lose' presented by 25 subjects. |
| pain_Cropped | pain_crop_xxx_yy | 52 sample videos of ISL hand gestures for the word `pain' presented by 26 subjects. |
| thief_Cropped | thief_crop_xxx_yy | 50 sample videos of ISL hand gestures for the word `thief' presented by 25 subjects. |
Fig. 2(a) A single frame of the video for the hand gesture of the word ‘accident’ in original form (b) the corresponding frame obtained after cropping and downsampling.
Classification performance of multiclass SVM as well as deep learning model on the ISL words in the cropped set.
| ISL Word | SVM Classifier | Deep Learning | ||||
|---|---|---|---|---|---|---|
| Precision (%) | Recall (%) | F-score (%) | Precision (%) | Recall (%) | F-score (%) | |
| Accident | 96.55 | 93.33 | 94.92 | 100 | 100 | 100 |
| Call | 96.15 | 83.33 | 89.29 | 90.32 | 93.33 | 91.80 |
| Doctor | 90.63 | 96.67 | 93.55 | 93.75 | 100 | 96.77 |
| Help | 96.55 | 93.33 | 94.92 | 100 | 93.33 | 96.55 |
| Hot | 92.59 | 83.33 | 87.72 | 100 | 93.33 | 96.55 |
| Lose | 96.30 | 86.67 | 91.23 | 96.67 | 96.67 | 96.67 |
| Pain | 93.10 | 90 | 91.53 | 96.55 | 93.33 | 94.92 |
| Thief | 68.29 | 93.33 | 78.87 | 93.75 | 100 | 96.77 |
| Subject | Computer Vision and Pattern Recognition |
| Specific subject area | Automatic sign language recognition |
| Type of data | Videos |
| How data were acquired | The videos in this dataset were collected by asking the participants to stand comfortably behind a black colored board and present the hand gestures, in front of the board. A Sony cyber shot DSC-W810 digital camera with 20.1 mega pixel resolution has been used for capturing the videos. |
| Data format | Raw videos as well as cropped videos. |
| Parameters for data collection | All the videos have been collected with plain black background by placing the camera at a fixed distance. Both male and female subjects from various parts of India with varying hand sizes and skin tones have been included for collecting the data. Two sample videos have been collected from each participant with the gap of small time duration. The data collection is done on different days and at different times in an indoor environment with normal lighting conditions. No restriction has been imposed on the speed of hand movements so as to get the gesture presentations as natural as possible. |
| Description of data collection | Videos for a set of eight hand gestures representing the ISL words namely, ‘accident’, ‘call’, ‘doctor’, ‘help’, ‘hot’, ‘lose’, ‘pain’ and ‘thief’ have been included in the dataset. |
| Data source location | Department of Computer Science, Central University of Kerala |
| Data accessibility | Repository name: Mendeley data. |