| Literature DB >> 28409178 |
Mithun Biswas1, Rafiqul Islam1, Gautam Kumar Shom1, Md Shopon2, Nabeel Mohammed1, Sifat Momen1, Anowarul Abedin1.
Abstract
BanglaLekha-Isolated, a Bangla handwritten isolated character dataset is presented in this article. This dataset contains 84 different characters comprising of 50 Bangla basic characters, 10 Bangla numerals and 24 selected compound characters. 2000 handwriting samples for each of the 84 characters were collected, digitized and pre-processed. After discarding mistakes and scribbles, 1,66,105 handwritten character images were included in the final dataset. The dataset also includes labels indicating the age and the gender of the subjects from whom the samples were collected. This dataset could be used not only for optical handwriting recognition research but also to explore the influence of gender and age on handwriting. The dataset is publicly available at https://data.mendeley.com/datasets/hf6sf8zrkc/2.Entities:
Year: 2017 PMID: 28409178 PMCID: PMC5382023 DOI: 10.1016/j.dib.2017.03.035
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Example of a filled data collection form.
Fig. 2Age distribution of the subjects.
Fig. 3Sample images from the dataset.
| Subject area | |
| More specific subject area | |
| Type of data | |
| How data was acquired | Subjects filled prearranged forms, which were then scanned. |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Data source location | |
| Data accessibility |