Literature DB >> 32462061

A dataset for Wi-Fi-based human-to-human interaction recognition.

Rami Alazrai1, Ali Awad1, Baha'A Alsaify2, Mohammad Hababeh1, Mohammad I Daoud1.   

Abstract

This paper presents a dataset for Wi-Fi-based human-to-human interaction recognition that comprises twelve different interactions performed by 40 different pairs of subjects in an indoor environment. Each pair of subjects performed ten trials of each of the twelve interactions and the total number of trials recorded in our dataset for all the 40 pairs of subjects is 4800 trials (i.e., 40 pairs of subjects × 12 interactions × 10 trials). The publicly available CSI tool [1] is used to record the Wi-Fi signals transmitted from a commercial off-the-shelf access point, namely the Sagemcom 2704 access point, to a desktop computer that is equipped with an Intel 5300 network interface card. The recorded Wi-Fi signals consist of the Received Signal Strength Indicator (RSSI) values and the Channel State Information (CSI) values. Unlike the publicly available Wi-Fi-based human activity datasets, which mainly have focused on activities performed by a single human, our dataset provides a collection of Wi-Fi signals that are recorded for 40 different pairs of subjects while performing twelve two-person interactions. The presented dataset can be exploited to advance Wi-Fi-based human activity recognition in different aspects, such as the use of various machine learning algorithms to recognize different human-to-human interactions.
© 2020 The Author(s).

Entities:  

Keywords:  Channel State Information (CSI); Human Activity Recognition; Received Signal Strength Indicator (RSSI); Two-Person Interaction; Wi-Fi

Year:  2020        PMID: 32462061      PMCID: PMC7240209          DOI: 10.1016/j.dib.2020.105668

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications table

Value of the data

The dataset contains a collection of Wi-Fi signals, including the RSSI and CSI values, which are recorded for 40 different pairs of subjects while performing twelve human-to-human interactions in an indoor environment. To the best of our knowledge, this is the first Wi-Fi-based dataset that considers the activities performed by two individuals. Researchers in the field of human activity recognition can utilize the acquired data to evaluate the performance of Wi-Fi-based human-to-human interaction recognition systems that can be developed for various application domains. The acquired data can be exploited to advance human activity recognition technology in different aspects. For example, various pattern recognition and machine learning methods can be used to accurately recognize different human-to-human interactions. Another potential use of our data is to explore the utilization of various signal processing techniques to analyze the recorded Wi-Fi signals and extract salient features that can be used to recognize different human-to-human interactions. The existing publicly available Wi-Fi-based human activity datasets have focused on activities that are performed by a single human. On the contrary, our dataset contributes to the ongoing research in the field of Wi-Fi-based human activity recognition by providing a collection of Wi-Fi signals that are recorded from 40 different pairs of subjects while performing twelve human-to-human interactions.

Data description

The raw data are grouped into one main folder that comprises 40 subfolders, where each sub-folder contains the data files recorded for a particular pair of subjects. Specifically, a total of 120 trials (i.e., 10 trials per each interaction x 12 interactions) were recorded for each pair of subjects, where each trial was stored in a separate MATLAB data file (.mat). The name of each data file follows the form “Sx_Sy_In_Tk.mat”. The first part of the name of each data file, denoted as Sx_Sy, represents the pair of subjects who have performed the interaction recorded in a specific data file. In particular, Sx and Sy are the first and second subjects within the pair Sx_Sy, respectively. The pairs of the subjects were formed from a pool of 66 different subjects, as described in the next section. Hence, x and y are integer numbers between 1 and 66. The second part of the name of each data file, denoted as ln, represents one of the twelve human-to-human interactions, where n is an integer number between 1 and 12. Specifically, In = {approaching (I1), departing (I2), handshaking (I3), high five (I4), hugging (I5), kicking with the left leg (I6), kicking with the right leg (I7), pointing with the left hand (I8), pointing with the right hand (I9), punching with the left hand (I10), punching with the right hand (I11), and pushing (I12)}. Finally, the last part of the name of each data file, denoted as Tk, represents the trial number, where k is an integer number between 1 and 10. For example, the data file name “S15_S3_I2_T3.mat” represents a file containing the data recorded for the pair of subjects S15_S3 while performing the departing interaction during the third trial. The data file associated with each trial contains a cell array of dimension L × 1, where L represents the number of Wi-Fi packets captured during the recording of a particular trial. Moreover, each Wi-Fi packet is stored within an element of the cell array in the form of a structure that consists of several fields as described in Table 1.
Table 1

The description of the fields in the stucture that contains a Wi-Fi packet.

FieldDescription
timestamp_lowThe arrival time of the Wi-Fi packet, which is represented by the lower 32 bits of NIC's clock [1]. This timestamp also represents the arrival time of the RSSI and CSI values comprised within the Wi-Fi packet.
NrxNrx represents the number of antennas used at the receiver side (i.e., the NIC) and its value is set to 3.
NtxNtx represents the number of antennas used at the transmitter side (i.e., the access point) and its value is set to 2.
noiseThe measured noise over the channel.
agcRepresents the automatic gain control parameter of the NIC measured in dB. The value of this field along with the value in the noise field are necessary to convert the unit of the RSSI values from dB to dBm as described in the CSI tool [1].
RSSI_aRSSI_a represents the RSSI value received at the first antenna of the NIC measured in dB.
RSSI_bRSSI_b represents the RSSI value received at the second antenna of the NIC measured in dB.
RSSI_cRSSI_c represents the RSSI value received at the third antenna of the NIC measured in dB.
CSIThe channel state information in the form of a complex three-dimensional matrix that has a dimension of Ntx × Nrx × Nsc. Nsc represents the number of subcarriers constructed using the Orthogonal Frequency-Division Multiplexing (OFDM) modulation scheme, which is applied to the utilized 20 MHz wide channel. The CSI tool specifies the value of the Nsc parameter to 30 subcarriers [1].
labelThe recorded trial for any of the twelve human-to-human interactions consists of two types of intervals, namely the steady-state and the interaction intervals. During the steady state interval, the pair of subjects are standing against each other without performing any activity. On the other hand, during the interaction interval, the pair of subjects perform one of the twelve different human-to-human interactions.Thus, this field assigns a label to the Wi-Fi packet to specify whether the packet has arrived during the steady state interval or the interaction interval. In particular, the assigned label is a string of the form In, where n is an integer in the range of 1 to 13 that is assigned to each Wi-Fi packet as follows:
LabelDescription
I1A Wi-Fi packet is labeled as I1 if it arrived while the pair of subjects were performing the approaching interaction.
I2A Wi-Fi packet is labeled as I2 if it arrived while the pair of subjects were performing the departing interaction.
I3A Wi-Fi packet is labeled as I3 if it arrived while the pair of subjects were performing the handshaking interaction.
I4A Wi-Fi packet is labeled as I4 if it arrived while the pair of subjects were performing the high five interaction.
I5A Wi-Fi packet is labeled as I5 if it arrived while the pair of subjects were performing the hugging interaction.
I6A Wi-Fi packet is labeled as I6 if it arrived while the pair of subjects were performing the kicking with the left leg interaction.
I7A Wi-Fi packet is labeled as I7 if it arrived while the pair of subjects were performing the kicking with the right leg interaction.
I8A Wi-Fi packet is labeled as I8 if it arrived while the pair of subjects were performing the pointing with the left hand interaction.
I9A Wi-Fi packet is labeled as I9 if it arrived while the pair of subjects were performing the pointing with the right hand interaction.
I10A Wi-Fi packet is labeled as I10 if it arrived while the pair of subjects were performing the punching with the left hand interaction.
I11A Wi-Fi packet is labeled as I11 if it arrived while the pair of subjects were performing the punching with the right hand interaction.
I12A Wi-Fi packet is labeled as I12 if it arrived while the pair of subjects were performing the pushing interaction.
I13A Wi-Fi packet is labeled as I13 if it arrived while the pair of subjects were in the steady state interval.
The description of the fields in the stucture that contains a Wi-Fi packet. Fig. 1 shows the average ± standard deviation number of Wi-Fi packets recorded in all the intervals within a trial, the interaction interval within a trial, and the steady state interval within a trial computed for each of the twelve interactions over all the pairs of subjects.
Fig. 1

The average number of packets recorded for all the intervals, interaction interval, and steady state interval within a trial computed for each interaction over all the pairs of subjects. The black bars represent the standard deviation values.

The average number of packets recorded for all the intervals, interaction interval, and steady state interval within a trial computed for each interaction over all the pairs of subjects. The black bars represent the standard deviation values. Fig. 2 shows the CSI signals recorded for the pair of subjects that constitutes subject 6 and subject 24 while performing the twelve human-to-human interactions. Moreover, the three-dimensional mesh plots presented in Fig. 2 show the different intervals comprised within each of the twelve interactions, including the steady state and human-to-human interaction intervals.
Fig. 2

The raw CSI signals of the twelve human-to-human interactions performed by the pair of subjects that constitutes subject 6 and subject 24.

The raw CSI signals of the twelve human-to-human interactions performed by the pair of subjects that constitutes subject 6 and subject 24.

Experimental design, materials, and methods

Subjects

A total of 66 healthy subjects (63 males and three females, average ± standard deviation age of 22.1 ± 3.7 years) have volunteered to participate in the experiments. All subjects received a thorough explanation of the experimental procedure. The experimental procedure was conducted according to the Declaration of Helsinki and approved by the research ethics committee at the German Jordanian University. A signed consent form was collected from each subject. To perform the twelve human-to-human interactions, we have constructed 40 different pairs of subjects from the 66 subjects who have volunteered to participate in this experiment. In particular, the pairs of subjects were constructed according to the following criteria [4]: (1) each subject was selected to be a member of at most two different pairs of subjects, and (2) each subject that was selected as a member of two different pairs has to have different roles in the two pairs, where the role of a subject can be either active role or passive role depending on whether the subject has initiated the interaction or not. Table 2 shows the constructed pairs of subjects along with the role, gender, age, height, and weight of the subjects within each pair.
Table 2

The constructed pairs of subjects along with the role, gender, age, height, and weight of the subjects within each pair.

Pair no.Subjects
Active subject
Passive subject
Subject IDGenderAge (years)Weight (kg)Height (cm)Subject IDGenderAge (years)Weight (kg)Height (cm)
1S1male2070173S47male2065175
2S2male1876175S22male1878172
3S3male2265169S44male2072174
4S4male2157185S15male2057185
5S6male20101178S24male2065179
6S7male2386175S12male2473184
7S8male2181176S31male2975171
8S13male2075170S21male2082176
9S14male2696180S5male2480165
10S16male2380160S41male2473175
11S18male2769178S57male2792180
12S19male1878180S11male1868168
13S20male29125183S61male3082179
14S25male2270170S9male2168167
15S26male1857171S60male1864168
16S27male2067170S40male2071174
17S28male2883170S43male2680174
18S32male2782186S64male3387190
19S33male2281185S3male2265169
20S34female2170168S30female2080160
21S35male2094178S52male1962170
22S36male2691167S16male2380160
23S37male1957177S54female2065171
24S38male19118186S35male2094178
25S41male2473175S36male2691167
26S42male2356177S14male2696180
27S44male2072174S33male2281185
28S46male2992184S28male2883170
29S48male2072173S45male2118173
30S49male1874179S10male1817686
31S50male2170173S17male2268172
32S51male2073170S23male2671175
33S52male1962170S62male1972178
34S53male3484176S12male2473184
35S55male2164173S66male2076180
36S56male22110179S63male2086178
37S58male2158167S39male2162174
38S59male20104179S29male22105179
39S62male1972178S38male19118186
40S65male1964170S42male2356177
The constructed pairs of subjects along with the role, gender, age, height, and weight of the subjects within each pair.

Experimental procedure

Each pair of subjects was asked to perform ten different trials of each of the twelve human-to-human interactions. Fig. 3 shows sample images of the twelve human-to-human interactions considered our dataset. Each of the twelve human-to-human interactions consists of two types of intervals, namely the steady state and the interaction intervals. During the steady state interval, the pair of subjects are standing against each other without performing any activity. On the other hand, during the interaction interval, the pair of subjects perform one of the twelve different human-to-human interactions.
Fig. 3

Sample images of the twelve human-to-human interactions considered in our dataset.

Sample images of the twelve human-to-human interactions considered in our dataset. In order to accurately perform the different human-to-human interactions, we have designed twelve timing diagrams that describe how to perform each of the twelve human-to-human interaction. Moreover, we developed a group of pre-programmed beep sounds, where each one of these sounds can be played at a preset time instance to notify the subject to perform a specific action during the time interval following the beep sound. In particular, a short beep sound is used to initiate the recording of each trial, a medium beep sound is used to indicate an interval transition, and a long beep sound is used to announce the end of the recording of each trial. Table 3 shows the timing diagram associated with each of the twelve human-to-human interactions along with the steady state interval, interaction interval, and the time instances associated with the added beep sounds. These timing diagrams were thoroughly explained to the subjects before the beginning of data recording. Moreover, the subjects were asked to follow the timing diagrams during the performance of the twelve interactions and to perform their roles within the amount of time allocated to each interval of a particular interaction.
Table 3

The timing diagram associated with each of the twelve human-to-human interactions along with the steady state interval, interaction interval, and the time instances of the added beep sounds. A sound icon is used to mark the locations of the added beep sounds.

InteractionTiming Diagram
ApproachingImage, table 3
DepartingImage, table 3
HandshakingImage, table 3
High fiveImage, table 3
HuggingImage, table 3
Kicking with the left legImage, table 3
Kicking with the right legImage, table 3
Pointing with the left handImage, table 3
Pointing with the right handImage, table 3
Punching with the left handImage, table 3
Punching with the right handImage, table 3
PushingImage, table 3
The timing diagram associated with each of the twelve human-to-human interactions along with the steady state interval, interaction interval, and the time instances of the added beep sounds. A sound icon is used to mark the locations of the added beep sounds.

Software and equipment

The publicly available CSI tool [1] was used to record the Wi-Fi signals transmitted from a commercial off-the-shelf access point, namely the Sagemcom 2704 access point, to a desktop computer that is equipped with an Intel 5300 NIC. Fig. 4 shows the utilized access point and NIC. The access point was configured to operate at a frequency band of 2.4 GHz, wireless channel number 6, channel bandwidth of 20 MHz, and modulation coding scheme of index 8. Moreover, the utilized access point and the NIC are compliant with the IEEE 802.11n standard, which supports Multiple Input Multiple Output (MIMO) with the OFDM modulation scheme that allows sending and receiving information over multiple antennas [2,3,5].
Fig. 4

The equipment used for transmitting and receiving Wi-Fi packets.

The equipment used for transmitting and receiving Wi-Fi packets. The access point comprises two internal transmit antennas (i.e., Ntx=2), and the NIC has three external receive antennas (i.e., Nrx=3). Therefore, the resultant MIMO system consists of 2 × 3 Wi-Fi streams, where each MIMO stream is established between a unique pair of transmit-receive antennas. Moreover, for each OFDM-modulated MIMO stream, the CSI tool is capable of capturing the CSI for 30 subcarriers (i.e., Nsc=30) that are evenly spread over the selected channel bandwidth, which is equal to 20 MHz. Thus, our MIMO system is capable of capturing 6 × 30 subcarriers. Fig. 5 shows the MIMO streams established between the utilized access point and the NIC, which are used to record the CSI while the subjects are performing the twelve human-to-human interactions.
Fig. 5

The MIMO streams established between the access point and the NIC. Tx1 and Tx2 represent the two transmit antennas at the access point, while Rx1, Rx2, and Rx3 represent the three receive antennas at the NIC.

The MIMO streams established between the access point and the NIC. Tx1 and Tx2 represent the two transmit antennas at the access point, while Rx1, Rx2, and Rx3 represent the three receive antennas at the NIC.

Environment

The Wi-Fi signals were captured in a furnished room of dimensions 5.3 m × 5.3 m, as shown in Fig. 6. The access point and the NIC were mounted in a line-of-sight configuration at a distance of 4.3 m apart from each other. The pairs of subjects performed the twelve human-to-human interactions in the center of the area located between the access point and the NIC.
Fig. 6

The layout of the room used to collect the data.

The layout of the room used to collect the data.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships which have, or could be perceived to have, influenced the work reported in this article.
SubjectComputer Science ApplicationsSignal ProcessingHuman-Computer InteractionComputer Vision and Pattern Recognition
Specific subject areaActivity recognition, Human-to-human interaction recognition using Wi-Fi signals.
Type of dataRaw dataset, table, image
How data were acquiredThe Wi-Fi signals were captured in an indoor environment (furnished room) from 40 different pairs of subjects while performing twelve different human-to-human interactions. The CSI tool [1] was used to record the Wi-Fi signals transmitted from a commercial off-the-shelf access point, namely the Sagemcom 2704 access point, to a desktop computer that is equipped with an Intel 5300 Network Interface Card (NIC). The number of transmit antennas in the utilized access point is two antennas, while the number of receive antennas in the employed NIC is three antennas.
Data formatRaw
Parameters for data collectionThe twelve interactions were thoroughly explained to the subjects before the beginning of data recording. All the interactions were performed in a line-of-sight manner with respect to the access point and the NIC. The access point was configured to operate at a frequency band of 2.4 GHz, wireless channel number 6, channel bandwidth of 20 MHz, and a modulation coding scheme of index 8.
Description of data collectionThe Wi-Fi signals were recorded from 40 different pairs of subjects while performing twelve different human-to-human interactions. Each pair of subjects performed ten trials of each interaction. The recorded Wi-Fi signals consist of the Received Signal Strength Indicator (RSSI) values and the Channel State Information (CSI) values.
Data source locationInstitution: German Jordanian University, Department of Computer EngineeringCity/Town/Region: Amman, 11180Country: JordanLatitude and longitude (and GPS coordinates) for collected samples/data: 31.7767° N, 35.8025° E
Data accessibilityRepository name: Mendeley DataData identification number: 10.17632/3dhn4xnjxw.1Direct URL to data: https://data.mendeley.com/datasets/3dhn4xnjxw/draft?a=90c726d4-5493-4efc-9ee6-973bcd922b31
  3 in total

1.  OPERAnet, a multimodal activity recognition dataset acquired from radio frequency and vision-based sensors.

Authors:  Mohammud J Bocus; Wenda Li; Shelly Vishwakarma; Roget Kou; Chong Tang; Karl Woodbridge; Ian Craddock; Ryan McConville; Raul Santos-Rodriguez; Kevin Chetty; Robert Piechocki
Journal:  Sci Data       Date:  2022-08-03       Impact factor: 8.501

2.  HHI-AttentionNet: An Enhanced Human-Human Interaction Recognition Method Based on a Lightweight Deep Learning Model with Attention Network from CSI.

Authors:  Islam Md Shafiqul; Mir Kanon Ara Jannat; Jin-Woo Kim; Soo-Wook Lee; Sung-Hyun Yang
Journal:  Sensors (Basel)       Date:  2022-08-12       Impact factor: 3.847

3.  SlideAugment: A Simple Data Processing Method to Enhance Human Activity Recognition Accuracy Based on WiFi.

Authors:  Junyan Li; Kang Yin; Chengpei Tang
Journal:  Sensors (Basel)       Date:  2021-03-20       Impact factor: 3.576

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.