| Literature DB >> 31372425 |
Ghazanfar Latif1,2, Nazeeruddin Mohammad1, Jaafar Alghazo1, Roaa AlKhalaf1, Rawan AlKhalaf1.
Abstract
A fully-labelled dataset of Arabic Sign Language (ArSL) images is developed for research related to sign language recognition. The dataset will provide researcher the opportunity to investigate and develop automated systems for the deaf and hard of hearing people using machine learning, computer vision and deep learning algorithms. The contribution is a large fully-labelled dataset for Arabic Sign Language (ArSL) which is made publically available and free for all researchers. The dataset which is named ArSL2018 consists of 54,049 images for the 32 Arabic sign language sign and alphabets collected from 40 participants in different age groups. Different dimensions and different variations were present in images which can be cleared using pre-processing techniques to remove noise, center the image, etc. The dataset is made available publicly at https://data.mendeley.com/datasets/y7pckrw6z2/1.Entities:
Year: 2019 PMID: 31372425 PMCID: PMC6661066 DOI: 10.1016/j.dib.2019.103777
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Representation of the Arabic Sign Language for Arabic Alphabets.
Input Arabic Alphabet Sign classes with their labels and number of images.
| # | Letter name in English Script | Letter name in Arabic script | # of Images | # | Letter name in English Script | Letter name in Arabic script | # of images |
|---|---|---|---|---|---|---|---|
| 1 | Alif | أَلِف)أ) | 1672 | 17 | Zā | ظَاء)ظ) | 1723 |
| 2 | Bā | بَاء) ب) | 1791 | 18 | Ayn | عَين)ع) | 2114 |
| 3 | Tā | أتَاء) ت) | 1838 | 19 | Ghayn | غَين)غ) | 1977 |
| 4 | Thā | ثَاء) ث) | 1766 | 20 | Fā | فَاء)ف) | 1955 |
| 5 | Jīm | جِيمْ) ج) | 1552 | 21 | Qāf | قَاف) ق) | 1705 |
| 6 | Hā | حَاء) ح) | 1526 | 22 | Kāf | كَاف)ك) | 1774 |
| 7 | Khā | خَاء) خ) | 1607 | 23 | Lām | لاَمْ)ل) | 1832 |
| 8 | Dāl | دَالْ) د) | 1634 | 24 | Mīm | مِيمْ)م) | 1765 |
| 9 | Dhāl | ذَال) ذ) | 1582 | 25 | Nūn | نُون)ن) | 1819 |
| 10 | Rā | رَاء) ر) | 1659 | 26 | Hā | هَاء)ه) | 1592 |
| 11 | Zāy | زَاي) ز) | 1374 | 27 | Wāw | وَاو)و) | 1371 |
| 12 | Sīn | سِينْ) س) | 1638 | 28 | Yā | يَا) ئ) | 1722 |
| 13 | Shīn | شِينْ) ش) | 1507 | 29 | Tāa | ة)ة) | 1791 |
| 14 | Sād | صَادْ)ص) | 1895 | 30 | Al | ال)ال) | 1343 |
| 15 | Dād | ضَاد)ض) | 1670 | 31 | Laa | ﻻ)ﻻ) | 1746 |
| 16 | Tā | طَاء)ط) | 1816 | 32 | Yāa | يَاء) يَاء) | 1293 |
Specifications table
| Subject area | |
| More specific subject area | |
| Type of data | |
| How data was acquired | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Data source location | |
| Data accessibility | |
| Related research article |
The current trend of machine learning and deep learning in developing applications helpful in our daily lives such as fingerprint or face recognition and other application in fields such as healthcare, assistive technology, and others. The main core of these applications is image pre-processing, classification and recognition to automate tasks usually done by humans. The ArSL2018 dataset is a valuable resource for researchers in the machine learning and deep learning community for development of assistive technology applications for persons with disability. The ArSL2018 dataset collected in Al Khobar, Saudi Arabia is a collection of 54,000 images of the 32 Arabic Sign Language Signs and Alphabet. The ArSL2018 is a comprehensive Arabic Sign Language Image repository fully-labelled for purposes of classification and recognition, and for the purpose of applications automating the recognition of sign language for Arabic deaf and hard of hearing individuals. The ArSL2018 dataset would assist researchers and allow for faster application development, and faster prototyping of different applications and devices in the assistive technology field. The ArSL2018 is a base for the research community to build on this dataset to produce a dataset with more image variations. |