Literature DB >> 35341036

Updating thermal imaging dataset of hand gestures with unique labels.

Sreenivasa Reddy Yeduri1, Daniel Skomedal Breland1, Om Jee Pandey2, Linga Reddy Cenkeramaddi1.   

Abstract

An update to the previously published low resolution thermal imaging dataset is presented in this paper. The new dataset contains high resolution thermal images corresponding to various hand gestures captured using the FLIR Lepton 3.5 thermal camera and Purethermal 2 breakout board. The resolution of the camera is 160 × 120 with calibrated array of 19,200 pixels. The images captured by the thermal camera are light-independent. The dataset consists of 14,400 images with equal share from color and gray scale. The dataset consists of 10 different hand gestures. Each gesture has a total of 24 images from a single person with a total of 30 persons for the whole dataset. The dataset also contains the images captured under different orientations of the hand under different lighting conditions.
© 2022 The Author(s).

Entities:  

Keywords:  Hand Gestures; Machine learning models; Sensor; Thermal Camera; Thermal imaging

Year:  2022        PMID: 35341036      PMCID: PMC8943409          DOI: 10.1016/j.dib.2022.108037

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table This section list the details of the hardware, procedure for collecting the data, and the format of the data.

Value of the Data

The existing dataset contains the images of pixel thermal camera [1], [2]. However, the new dataset is created with pixel thermal camera. Efficient machine learning models can be developed to process the data for hand gesture recognition. The academic or research communities working on thermal imaging data with efficient machine learning algorithms for hand gesture recognition or classification. The data is also helpful in developing and testing efficient algorithms to work on thermal imaging dataset. The data is collected with high thermal camera with no constraints on the environment and captured images are independent of back ground lighting conditions. This will be helpful for testing the algorithms with thermal imaging data.

Data Description

The previous dataset in [1], [2] is captured with a low resolution thermal camera of pixels resolution. The thermal images in this dataset correspond to ten hand gestures representing 0 to 9 sign language digits. This dataset has been created from various people with different hand orientations. On the other hand, the dataset present in this paper contains the images captured from the thermal camera with the resolution of pixels. The thermal images contain ten different hand gestures captured from various people. We also captured images of both color and gray scale under varying environment conditions. Further, different hand orientations are also considered while creating the dataset.

Data file description

The data repository structure is shown in Fig. 1. The root folder consists of ten folders namely Gesture_a to Gesture_j corresponding to Gesture a to j, respectively. Further, each gesture folder contains two folders such as Colorscale and Grayscale. The Colorscale folder consists of 360 color images of.png format. Similarly, Grayscale folder consists of 360 gray images in.png format. There are a total of 30 people considered for the creation of dataset. The total size of the dataset is 154 MB [3].
Fig. 1

Data structure of the repository.

Data structure of the repository. Fig. 2 shows the plasma thermal images corresponding to hand gestures from a to j. Fig. 2a is corresponding to image a and Fig. 2b corresponds to image b. Fig. 2c, 2d, 2e, 2f, 2g, 2h, 2i, and 2j are the color scale thermal images c, d, e, f, g, h, i, and j, respectively.
Fig. 2

A colored fusion thermal images: (a) Image a; (b) Image b; (c) Image c; (d) Image d; (e) Image e; (f) Image f; (g) Image g; (h) Image h; (i) Image i; and, (j) Image j.

A colored fusion thermal images: (a) Image a; (b) Image b; (c) Image c; (d) Image d; (e) Image e; (f) Image f; (g) Image g; (h) Image h; (i) Image i; and, (j) Image j. Fig. 3 shows the gray thermal images corresponding to hand gestures from a to j. Fig. 3a is corresponding to image a and Fig. 3b corresponds to image b. Fig. 3c, 3d, 3e, 3f, 3g, 3h, 3i, and 3j are the gray scale thermal images c, d, e, f, g, h, i, and j, respectively.
Fig. 3

A gray fusion thermal images: (a) Image a; (b) Image b; (c) Image c; (d) Image d; (e) Image e; (f) Image f; (g) Image g; (h) Image h; (i) Image i; and, (j) Image j.

A gray fusion thermal images: (a) Image a; (b) Image b; (c) Image c; (d) Image d; (e) Image e; (f) Image f; (g) Image g; (h) Image h; (i) Image i; and, (j) Image j.

Experimental Design, Materials and Methods

Fig. 4 shows the experimental setup of the thermal camera considered for the data collection. We used FLIR Leptop 3.5 thermal camera module to be fitted in embedded systems [4]. It has a horizontal field of view of 57 which indicates that it captures more of image details than object details. The images can be captured by connecting the thermal camera to a computer with Windows OS and SDK [5]. A portable and simple setup is to connect the thermal camera to Raspberry Pi 4 Model B [6] which make use of the python script to capture the images. For the creation of the dataset, we connected the thermal camera to Raspberry Pi 4 model B.
Fig. 4

Thermal camera setup.

Thermal camera setup. The thermal camera do not have a port to directly connect to the Raspberry Pi. Thus, we fitted the camera on purethermal 2 breakout board [7], [8] which has an interface to connect to the Raspberry Pi through USB. Thereafter, we placed the breakout board on a fixed stand to stable the camera as well as to add height for a easy capturing of the hand gestures. Fig. 5 shows the steps in the python script to capture the images for the dataset. The program makes use of the Lepton library from flirpy in Python. This library enables the thermal camera for capturing the images and then the OpenCV and matplotlib libraries are used to save the images in the Raspberry Pi. The main loop takes the inputs from 1 to 5 to capture the images. When 2 is the input, it will take second input. The second input defines how many images the thermal camera should capture before asking for the first input again.
Fig. 5

Thermal image capturing procedure.

Thermal image capturing procedure.

Ethics Statement

The data consists solely of hand gestures and contains no personal information. It was a free-for-all campaign, and people gave the hand gestures at their own discretion.

CRediT authorship contribution statement

Sreenivasa Reddy Yeduri: Conceptualization, Writing – original draft, Writing – review & editing. Daniel Skomedal Breland: Software, Methodology, Data curation, Investigation, Visualization. Om Jee Pandey: Writing – review & editing, Supervision. Linga Reddy Cenkeramaddi: Conceptualization, Supervision, Validation, Writing – review & editing.

Declaration of Competing Interest

The authors declare that there is no influence from known competing financial interests or personal relationships which have, or could be perceived for the work reported in this article.
SubjectHuman-Computer Interaction, Biomedical, Electrical and Electronic Engineering
Specific subject areaThermal images of different hand gestures
Type of dataImage (.png)
How data were acquiredThermal Camera (Flir Lepton 3.5 thermal camera)
Camera Stand
Purethermal 2 breakout board
Raspberry Pi 4 Model B
Same as in original data article
Data formatRaw (from acquisition)
Parameters for data collectionImages are collected from 30 people with 160×120 pixel camera with a radiometric calibrated array of 19200 pixels
Description of data collectionThe camera setup is mounted on a tripod to capture the images. Further, hand gestures are captured while hand is mostly static position. We have placed both camera setup and hand on top of a table to capture effective images. The software program was designed to save images based on the number that is being pressed as the first input in the range 1 to 5. Then, the second input is given to define the total number of images to be captured.
Data source locationACPS group, Department of Information and Communication Technology, University of Agder, Grimstad, Norway
Data accessibilityRepository Name:
Plain_Background_Thermal_Imaging_Dataset
https://zenodo.org/record/6247463#.YhZ2gujMJaQ
Related data articleSreenivasa Reddy Yeduri, Daniel Skomedal Breland, Simen Birkeland Skriubakken, Om Jee Pandey, Linga Reddy Cenkeramaddi, Low Resolution Thermal Imaging Dataset of Sign Language Digits, Data in Brief, 2022, 107977, ISSN 2352-3409, https://doi.org/10.1016/j.dib.2022.107977
Related research articleD. S. Breland, A. Dayal, A. Jha, P. K. Yalavarthy, O. J. Pandey and L. R. Cenkeramaddi, ”Robust Hand Gestures Recognition Using a Deep CNN and Thermal Images,” in IEEE Sensors Journal, vol. 21, no. 23, pp. 26602-26614, 1 Dec.1, 2021, doi: 10.1109/JSEN.2021.3119977
  1 in total

1.  Low resolution thermal imaging dataset of sign language digits.

Authors:  Sreenivasa Reddy Yeduri; Daniel Skomedal Breland; Simen Birkeland Skriubakken; Om Jee Pandey; Linga Reddy Cenkeramaddi
Journal:  Data Brief       Date:  2022-02-23
  1 in total
  1 in total

1.  Sign and Human Action Detection Using Deep Learning.

Authors:  Shivanarayna Dhulipala; Festus Fatai Adedoyin; Alessandro Bruno
Journal:  J Imaging       Date:  2022-07-11
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.