Literature DB >> 36164293

GC3558: An open-source annotated dataset of Ghana currency images for classification modeling.

Kwabena Adu1, Patrick Kwabena Mensah1, Mighty Abra Ayidzoe1, Obed Appiah1, Ebenezer Quayson1, Christopher Bombie Ninfaakang1, Michael Opoku1.   

Abstract

The field of deep learning has led to remarkable advancements in many areas, including banking. Identifying currency denomination type and model is challenging due to intraclass variation and different illumination conditions. Although, in this domain, many datasets regarding currency denomination type and model, e.g., Indian Currency, Thai Currency, Chinese Currency, U.K. currency, etc., have already been experimented with by different researchers. More datasets are needed from a variety of currencies, especially Ghana currency (cedi). This article presents the Ghana Currency image dataset (GC3558) of 3558 color images in 13 classes created from a high-resolution camera. The dataset is comprised of only genuine currency. The class consists of coin and paper notes: 10 pesewas coin, 20 pesewas coin, 50 pesewas coin, 1 cedi coin, 2 cedis coin, 1 cedi note, 2 cedis note, 5 cedis note, 10 cedis note, 20 cedis note, 50 cedis note, 100 cedis note and 200 cedis note. All images are de-identified, validated, and freely available for download to A.I. researchers. The dataset will help researchers evaluate their machine learning models on real-world data.
© 2022 The Author(s).

Entities:  

Keywords:  Banknote recognition; Classification; Currency detection; Dataset; Deep learning

Year:  2022        PMID: 36164293      PMCID: PMC9508434          DOI: 10.1016/j.dib.2022.108616

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Value of the Data

The dataset is comprehensive and consists of 3558 high-quality images of 13 different classes. The dataset consists of coins and paper notes denomination of the Ghana Currency. This dataset is useful for building applications for Ghana Currency classification and detection. It can also be used by researchers working in currency classification and identification. This dataset is useful for training, testing, and validating Ghana Currency or for classification and identification models. The dataset will play an important role in the value identification of Ghana Currency. The dataset will help build an application for currency classification, identification, and detection that can be used by visually impaired people, bank customers, governments, and various agencies.

Data Description

The currency dataset's creation is vital for the following reasons: Correct recognition of currency denomination is an essential task for automated teller machines (ATMs) and currency identification machines [1,2]. In addition, it is necessary to design a system that detects a genuine currency [3]. Furthermore, recognizing currency denominations is a problem for visually impaired people [4,5]. The dataset associated with this paper contains 3558 color images and consists of thirteen (13) classes. The original captured were in varied sizes of (1512×2016), (1560×2080), (2080×1560), and (1080×1440). This paper considers deep learning classification tasks on single and multiple models and input image resolution or size. Increasing image resolution for training with deep learning models often has a trade-off with the maximum possible batch size. Yet, the optimal selection of image resolution can further increase neural network performance for various image processing tasks [6]. Since the originally captured images were in varied resolutions, such as (1512×2016), (1560×2080), (2080×1560), and (1080×1440), and hence very large for training with deep learning techniques. Moreover, large input image sizes introduce memory constraints. As a result, there is an intense computational complexity and requirement, which leads to long training and inference time of the deep learning models [7]. For example, the training time on computer hardware with Graphical Processing Units for 2080 × 2080 pixels input images may take approximately 40 days of consecutive model training, which can be seen as impractical. To alleviate this problem of the time budget of training, we downscaled the image size to a dimension of 128×128. The downscaled 128×128 image pixels are in jpg file format. The dataset can be downloaded as a 1.98 GB zip file GC3558.zip. After unzipping, the main folder Ghana_Cedis Currency contains the Ghana Cedis Currency folder, which contains two subfolders: train and validation folder. Each of the two folders contains thirteen subfolders. The subfolders are 10_pesewas_coin (328), 20_pesewas_coin (261), 50_pesewas_coin (327), 1_cedi_coin (257), 2_cedi_coin (264), 1_cedi_note (329), 2_cedi_note (200), 5_cedi_note (370), 10_cedi_note (241), 20_cedi_note (200), 50_cedi_note (123), 100_cedi_note (353, and 200_cedi_note (305). Table 1 presents the camera specification used to capture the dataset. The resolution quality of the image dataset depends on the quality of the camera used. Therefore, the camera specification presented in Table 1 was used in capturing the GC3558 dataset. Table 2 shows the description of the dataset. The Table shows the various denominations, the direction of image capturing, backgrounds, and the number of images of each denomination. Fig. 1 illustrates the percentage of each denomination presented in the dataset. The Fig. 1 shows that the 5 cedi and 100 cedi notes contain 10% each, which is the highest representation of the total dataset. The 10pesewas coin, 50pesewas coin, 1 cedi, and 200 cedi notes comprise 9% each of the total dataset. The 20pesewas coin, 1 cedi coin, 2 cedi coin, and 10 cedi note comprise 7% each of the total dataset. The 2 cedi and 20 cedi notes comprise 6% each of the total dataset. The 50 cedi denomination comprises 4% of the total dataset, which is the least representation.
Table 1

Camera specifications.

Description
Camera NameNikon D3500
TypeDSLR
SensorAPS-C
Megapixels24.2MP
Lens MountNikon F
VideofinderOptical
Max View ResolutionFull HD
Table 2

Description of Ghana Currency (GC3558) dataset.

S.N.Denomination ConsideredDirection of image CapturingDifferent Backgrounds considered for image capturingNo. of Images of each denomination
110 pesewas coinFront Direction, Front Direction Rotated 1800, Backward Direction, Backward Direction Rotated 1800white, dark, yellow, and illuminated.328
220 pesewas coin261
350 pesewas coin327
41 cedi coin257
52 cedi coin264
61 cedi note329
72 cedi note200
85 cedi note370
910 cedi note241
1020 cedi note200
1150 cedi note123
12100 cedi note353
13200 cedi note305
Total No. of Images3558
Fig. 1

Percentage of each currency denomination in the GC3558 dataset.

Camera specifications. Description of Ghana Currency (GC3558) dataset. Percentage of each currency denomination in the GC3558 dataset. Fig. 2 shows data samples of the GC3558 images presenting the various currency denomination. The Figure shows both the coins (left) and banknotes (paper note) currencies (right). The directory structure of the currency dataset is shown in Fig. 3. Fig. 3 describes the folder structure of the GC3558 dataset. The first folder is Ghana Cedi Currency which contains a subfolder named Ghana Cedi Currency. In the subfolder, there are two (2) additional subfolders; train and validation, which contains the 13 classes of the Ghana currency images.
Fig. 2

Data Samples of the GC3558 images.

Fig. 3

Ghana Currency dataset directory structure.

Data Samples of the GC3558 images. Ghana Currency dataset directory structure.

Experimental Design, Materials and Methods

Experimental design

Fig. 4 illustrates the image data acquisition process. The images were captured using Nikon D3500 high-resolution rear camera. All images were captured using a camera and then separated and saved in their respective folders per their denomination values. The images were annotated using labelIMG tool the annotated txt file was saved in a respective folder.
Fig. 4

Ghana Cedis Currency (GC3558) dataset acquisition process.

Ghana Cedis Currency (GC3558) dataset acquisition process. Table 3 gives a detailed description of the dataset acquisition process, and a description of the cameras is specified in Table 1. The Ghana Cedis Currency (GC3558) images were captured daily and during day time from November 2021 to January 2022. The images were captured in different directions and backgrounds and with variant sizes, as mentioned in Table 2. After the captured images were further separated into specific folders. The folder structure of images is shown in Fig. 3. The images were resized to 128×128 dimensions using python script and then annotated using the LabelImg tool from 2022 February to April 2022. The dataset is comprised of only genuine currency. Therefore, the authors have planned to update the dataset with counterfeit currencies in the future version, which is believed to help further improve the identification of genuine and counterfeit currency.
Table 3

Data acquisition steps.

No.StepDurationActivity
1Data GatheringNovember 2021 to January 2022Daily and during daytime capturing of the currency images
2Image LabelingFebruary 2022 to April 2022Labeled the 3558 images of Ghana Cedis Currency images
Data acquisition steps.

Materials or specifications of the image acquisition system

The Ghana currency images were captured using Nikon D3500 with a rear camera of 24.2 MP. All the original image datasets were of varied sizes (1512×2016), (1560×2080), (2080×1560), and (1080×1440) and were resized to 128×128 dimensions using a python script. The images were saved in .jpg format. Robots perceive objects effectively and efficiently, highlighting the need to understand the environmental factors and their impact on visual perception, such as illumination changes. In object recognition and classification, illumination of the scene and differences in sensor capturing are two factors [8]. The inconsistency of quality between the training and testing images reduces the performance of deep learning models. Furthermore, inconsistency in lighting also reduces deep learning performance; however, introducing different lighting conditions can alleviate the reduction of performance [9]. In this paper, the images are captured in various environmental conditions such as different light conditions, different backgrounds, and from different angles. Capturing the images in these conditions serve as data augmentation, where more data is generated to train the deep learning model. Additionally, this technique helps to achieve better generalizability and improve the robustness of the deep learning model. After capturing the images, they were organized as Ghana Cedis Currency. The Ghana currency dataset consists of 13 different folders. The dataset directory structure of images is shown in Fig. 3. The images are annotated using the LabelImg tool. The annotations images of currency are stored in their respective folders.

Method

The images were acquired using the Nikon D3500 camera in different angles and backgrounds. The original images were of different varied sizes (1512×2016), (1560×2080), (2080×1560), and (1080×1440) and were resized to 128 × 128 using a python script and then labeled using the LabelImg tool. Table 2. describes the classes, number of images, and the environments in which images were taken.

Ethics Statement

There is no funding present for the present effort. There is no conflict of interest. The data is available in the public domain.

CRediT Author Statement

Kwabena Adu: Methodology, Software, Writing – original draft; Patrick Kwabena Mensah: Data curation, Conceptualization, Supervision; Mighty Abra Ayidzoe: Writing – review & editing; Obed Appiah: Software, Validation; Ebenezer Quayson, Christopher Bombie Ninfaakang and Michael Opoku: Data curation, Investigation.

Declaration of Competing Interest

The authors declare that there is no conflict of interest regarding the publication of this paper.
SubjectMachine Learning / Deep Learning
Specific subject areaCurrency detection and identification
Type of dataGhana currency images
How the data were acquiredThe Ghana currency images were collected by taking images using a high-resolution camera device. Table 1 shows a description of the camera used to collect the dataset.
Data formatRawAnnotated
Parameters for the data collectionThe Ghana currency dataset images are .jpg images of 1512×2016 dimension, and resolution is 96 dpi
Description of data collectionThe denominations of the Ghana currency were collected using a high-resolution camera device. The original .jpg images of currency were in varied dimensions (1512×2016), (1560×2080), (2080×1560), and (1080×1440). These images are resized to 128×128 dimensions. There are total 13 classes of the Ghana currency namely 10_pesewas_coin, 20_pesewas_coin, 50_pesewas_coin, 1_cedi_coin, 2_cedi_coin, 1_cedi_note, 2_cedi_note, 5_cedi_note, 10_cedi_note, 20_cedi_note, 50_cedi_note, 100_cedi_note, and 200_cedi_note. The images were captured from various environmental conditions like white background, dark background, yellow background, and illuminated background.
Data source locationUniversity of Energy and Natural ResourcesP.O. Box 214, Sunyani – Ghana
Data accessibilityRepository name: Dataset of Ghana Currency with AnnotationsData identification number(doi): 10.17632/vws5r8mj4wDirect URL to data: https://data.mendeley.com/datasets/vws5r8mj4w/draft?a=42fcc651-6826-49f0-b20c-6adb0f632a91
  6 in total

1.  On the Illumination Influence for Object Learning on Robot Companions.

Authors:  Ingo Keller; Katrin S Lohan
Journal:  Front Robot AI       Date:  2020-01-21

2.  Euro Banknote Recognition System for Blind People.

Authors:  Larisa Dunai Dunai; Mónica Chillarón Pérez; Guillermo Peris-Fajarnés; Ismael Lengua Lengua
Journal:  Sensors (Basel)       Date:  2017-01-20       Impact factor: 3.576

3.  Deep Learning Fundus Image Analysis for Diabetic Retinopathy and Macular Edema Grading.

Authors:  Jaakko Sahlsten; Joel Jaskari; Jyri Kivinen; Lauri Turunen; Esa Jaanio; Kustaa Hietala; Kimmo Kaski
Journal:  Sci Rep       Date:  2019-07-24       Impact factor: 4.379

4.  Dataset of Indian and Thai banknotes with annotations.

Authors:  Vidula Meshram; Kailas Patil; Prawit Chumchu
Journal:  Data Brief       Date:  2022-03-02

5.  The Effect of Image Resolution on Deep Learning in Radiography.

Authors:  Carl F Sabottke; Bradley M Spieler
Journal:  Radiol Artif Intell       Date:  2020-01-22

6.  Efficient Banknote Recognition Based on Selection of Discriminative Regions with One-Dimensional Visible-Light Line Sensor.

Authors:  Tuyen Danh Pham; Young Ho Park; Seung Yong Kwon; Kang Ryoung Park; Dae Sik Jeong; Sungsoo Yoon
Journal:  Sensors (Basel)       Date:  2016-03-04       Impact factor: 3.576

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.