| Literature DB >> 29160799 |
Alejandra García-Hernández1, Carlos E Galván-Tejada2, Jorge I Galván-Tejada3, José M Celaya-Padilla4, Hamurabi Gamboa-Rosales5, Perla Velasco-Elizondo6, Rogelio Cárdenas-Vargas7.
Abstract
Human Activity Recognition (HAR) is one of the main subjects of study in the areas of computer vision and machine learning due to the great benefits that can be achieved. Examples of the study areas are: health prevention, security and surveillance, automotive research, and many others. The proposed approaches are carried out using machine learning techniques and present good results. However, it is difficult to observe how the descriptors of human activities are grouped. In order to obtain a better understanding of the the behavior of descriptors, it is important to improve the abilities to recognize the human activities. This paper proposes a novel approach for the HAR based on acoustic data and similarity networks. In this approach, we were able to characterize the sound of the activities and identify those activities looking for similarity in the sound pattern. We evaluated the similarity of the sounds considering mainly two features: the sound location and the materials that were used. As a result, the materials are a good reference classifying the human activities compared with the location.Entities:
Keywords: human activity recognition; mel frequency cepstral coefficients; similarity networks
Mesh:
Year: 2017 PMID: 29160799 PMCID: PMC5713102 DOI: 10.3390/s17112688
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Workflow of the analysis process.
Activities’ general description.
| Activity | Description |
|---|---|
| Brew coffee | Brewing coffee in the stove using coffee pots and in coffee machines. |
| Cook | Cooking meat and scrambled eggs in the stove. |
| Use microwave oven | Using several microwave ovens to heat up water and a meal. |
| Take a shower | Taking a shower in different environments, in some cases water fall was interrupted at intervals. |
| Dish washing | Dishes were washed by hand individually or in groups of different dishes, water noise is in the background. |
| Hand washing | Washing hands with bar soap. |
| Teeth brushing | Audio clips include from opening the tap to closing it. |
| No activity | No activity audio clips, which are mostly noises added by the device used to record (reading in silence, resting in a coach, sleeping without snoring). |
Selected mobile phones’ system on chip (SoC) and operating system.
| Smartphone | System on Chip (SoC) | Operating System |
|---|---|---|
| Lanix Ilium s600 | Qualcomm Snapdragon 210 MSM8909 | Android 5.1 |
| LG G Pro Lite | MediaTek MT6577 | Android 4.1.2 |
| iPhone 4 | Apple A4 APL0398 | iOS 4 |
| iPhone 3GS | Samsung S5PC100 | iOS 3 |
| HTC One M7 | Qualcomm Snapdragon 600 APQ8064T | Android 4.1.2 |
Figure 2Average distance between the mobile device and activity.
Audio clips’ meta-data per activity.
| Activity | Sample Rate | Encoding Format | Channels |
|---|---|---|---|
| Brew coffee | 8000 Hz–44,100 Hz | m4a, amr | Stereo, Mono |
| Cook | 44,100 Hz | m4a | Stereo |
| Use microwave oven | 44,100 Hz | m4a | Stereo |
| Take a shower | 44,100 Hz | m4a, mp3 | Stereo |
| Dish washing | 44,100 Hz | m4a | Stereo |
| Hand washing | 8000 Hz–44,100 Hz | m4a, amr | Stereo, Mono |
| Brushing teeth | 44,100 Hz | m4a | Stereo |
| No activity | 8000 Hz–44,100 Hz | m4a, amr | Stereo, Mono |
Average Mel-Frequency Cepstral Coefficients (MFCC) per second during 10 s.
| Activities | CC1 | CC2 | CC3 | CC4 | CC5 | CC6 | CC7 | CC8 | CC9 | CC10 |
|---|---|---|---|---|---|---|---|---|---|---|
| Brew coffee (1) | 0.316 | −2.689 | 4.503 | −4.484 | 5.154 | −2.251 |
| 5.202 |
| 5.621 |
| Cook (2) | 0.277 | −5.423 | 6.552 |
| 7.445 |
|
| 7.166 |
| 5.802 |
| Use microwave (3) | 0.310 |
| 4.645 |
| 6.267 |
|
| 5.755 |
| 5.202 |
| No activity (4) | 0.163 |
| 4.369 |
| 5.194 |
|
| 7.975 |
| 5.675 |
| Take a shower (5) | 0.673 |
| 9.892 |
| 9.068 |
|
| 9.492 |
| 9.757 |
| Dish washing (6) | 0.722 |
| 8.595 |
| 7.863 |
|
| 6.834 |
| 7.690 |
| Hand washing (7) | 0.510 |
|
| 6.109 |
|
| 1.893 |
| 1.180 | 0.961 |
| Brushing teeth (8) | 0.407 |
| 5.246 |
| 5.437 |
|
| 7.878 |
| 6.750 |
MFCC dissimilarity matrix for second 1.
| Activities | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
|---|---|---|---|---|---|---|---|---|
| 1 | 0.000 | 0.039 | 0.007 | 0.154 | 0.357 | 0.405 | 0.194 | 0.090 |
| 2 | 0.039 | 0.000 | 0.032 | 0.115 | 0.396 | 0.444 | 0.233 | 0.129 |
| 3 | 0.007 | 0.032 | 0.000 | 0.147 | 0.364 | 0.412 | 0.200 | 0.097 |
| 4 | 0.154 | 0.115 | 0.147 | 0.000 | 0.511 | 0.559 | 0.347 | 0.244 |
| 5 | 0.357 | 0.396 | 0.364 | 0.511 | 0.000 | 0.048 | 0.163 | 0.267 |
| 6 | 0.405 | 0.444 | 0.412 | 0.559 | 0.048 | 0.000 | 0.212 | 0.315 |
| 7 | 0.194 | 0.233 | 0.200 | 0.347 | 0.163 | 0.212 | 0.000 | 0.103 |
| 8 | 0.090 | 0.129 | 0.097 | 0.244 | 0.267 | 0.315 | 0.103 | 0.000 |
MFCC similarity matrix for second 1.
| Activities | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
|---|---|---|---|---|---|---|---|---|
| 1 | 0.00 | 25.61 | 147.00 | 6.50 | 2.80 | 2.47 | 5.16 | 11.09 |
| 2 | 25.61 | 0.00 | 31.01 | 8.71 | 2.52 | 2.25 | 4.30 | 7.74 |
| 3 | 147.00 | 31.01 | 0.00 | 6.80 | 2.75 | 2.43 | 4.99 | 10.31 |
| 4 | 6.50 | 8.71 | 6.80 | 0.00 | 1.96 | 1.79 | 2.88 | 4.10 |
| 5 | 2.80 | 2.52 | 2.75 | 1.96 | 0.00 | 20.71 | 6.12 | 3.75 |
| 6 | 2.47 | 2.25 | 2.43 | 1.79 | 20.71 | 0.00 | 4.72 | 3.17 |
| 7 | 5.16 | 4.30 | 4.99 | 2.88 | 6.12 | 4.72 | 0.00 | 9.66 |
| 8 | 11.09 | 7.74 | 10.31 | 4.10 | 3.75 | 3.17 | 9.66 | 0.00 |
Figure 3Degree centrality per second.
Figure 4Closeness centrality per second.
Figure 5Similarity networks in 10 s., where each sub-figure represents a second.
Power law distribution statistics for audio similarity networks per second.
| Second | Alpha | X | LogLik | KS.stat | KS.p |
|---|---|---|---|---|---|
| 1 | 2.29 | 32.73 |
| 0.1789 | 0.9355 |
| 2 | 2.16 | 2.83 |
| 0.1750 | 0.9454 |
| 3 | 1.78 | 1.91 |
| 0.2512 | 0.6933 |
| 4 | 4.23 | 3.16 |
| 0.2770 | 0.7463 |
| 5 | 2.14 | 4.97 |
| 0.2467 | 0.7875 |
| 6 | 5.62 | 9.90 |
| 0.2057 | 0.9282 |
| 7 | 11.79 | 19.92 |
| 0.2361 | 0.9789 |
| 8 | 1.97 | 2.67 |
| 0.2261 | 0.8077 |
| 9 | 7.37 | 30.99 |
| 0.2320 | 0.9823 |
| 10 | 1.57 | 1.89 |
| 0.2010 | 0.9027 |
Figure 6Power law distribution per second.