| Literature DB >> 30071601 |
Joan Navarro1, Ester Vidaña-Vila2, Rosa Ma Alsina-Pagès3, Marcos Hervás4.
Abstract
Ambient Assisted Living (AAL) has become a powerful alternative to improving the life quality of elderly and partially dependent people in their own living environments. In this regard, tele-care and remote surveillance AAL applications have emerged as a hot research topic in this domain. These services aim to infer the patients' status by means of centralized architectures that collect data from a set of sensors deployed in their living environment. However, when the size of the scenario and number of patients to be monitored increase (e.g., residential areas, retirement homes), these systems typically struggle at processing all associated data and providing a reasonable output in real time. The purpose of this paper is to present a fog-inspired distributed architecture to collect, analyze and identify up to nine acoustic events that represent abnormal behavior or dangerous health conditions in large-scale scenarios. Specifically, the proposed platform collects data from a set of wireless acoustic sensors and runs an automatic two-stage audio event classification process to decide whether or not to trigger an alarm. Conducted experiments over a labeled dataset of 7116 s based on the priorities of the Fundació Ave Maria health experts have obtained an overall accuracy of 94.6%.Entities:
Keywords: acoustic sensor network; ambient assisted living; graphics processor unit; home monitoring; residence assistance; surveillance
Mesh:
Year: 2018 PMID: 30071601 PMCID: PMC6112031 DOI: 10.3390/s18082492
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Network of housing alternatives of flats or retirement homes to be supported by the proposed AAL monitoring service and possible sensors arrangement with coverage areas. (a,c) Retirement home with several buildings/houses with common spaces (dining room, etc.) and with private facilities for the elderly or pseudo-dependent; (b,d) independent flats where elderly or pseudo-dependent people need various surveillance support.
Figure 2Spectrograms of the nine types of sound.
Figure 3Network topology of the proposed system.
Figure 4Proposed system architecture particularized for the Fundació Ave Maria use-case.
Figure 5Block diagram of the proposed acoustic event classification system.
Accuracy at the real-time early event detection layer with the following ANN configuration: network topology: full mesh; number of layers: 5; number of neurons per layer: 100, 70, 50, 30, 10; hidden layers’ activation function: rectified linear unit [69]; output layer activation function: softmax [70]. On the top, confusion matrix. On the bottom, detailed accuracy by class computed as one-vs.-all.
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| ||
|
|
| 93.21% | 0.01% | 0.58% | 0.10% | 2.03% | 0.01% | 3.39% | 0.19% | 0.48% |
|
| 4.63% | 79.01% | 2.23% | 0.12% | 1.32% | 6.57% | 2.41% | 2.02% | 1.69% | |
|
| 0.72% | 5.43% | 91.67% | 0.23% | 1.04% | 0.23% | 0.03% | 0.28% | 0.37% | |
|
| 1.87% | 0.46% | 3.85% | 69.23% | 0.19% | 0.21% | 0.40 % | 0.71% | 23.08% | |
|
|
| 2.98% | 4.23% | 0.49% |
| 0.08% | 0.68% | 0.19% | 1.47% | |
|
| 4.13% | 2.33% | 0.04% | 0.02% | 1.94% | 80.83% | 0.39% | 4.49% | 5.83% | |
|
| 0.23% | 0.12% | 4.88% | 0.06% | 0.19% | 0.16% | 94.12% | 0.20% | 0.04% | |
|
| 0.77% | 0.68% | 0.11% | 0.34% | 0.21% |
| 1.05% |
|
| |
|
| 1.82% | 0.88% | 0.02% | 0.02% | 0.28% | 0.65% | 0.06% | 0.38% | 95.86% | |
|
|
|
|
|
|
|
|
| |||
|
|
| 0.9321 | 0.0305 | 0.6973 | 0.9695 | 0.7978 | 0.7899 | 0.9508 | 0.6174 | |
|
| 0.7901 | 0.0132 | 0.6051 | 0.9868 | 0.6853 | 0.6826 | 0.8885 | 0.5925 | ||
|
| 0.9167 | 0.0203 | 0.8221 | 0.9797 | 0.8668 | 0.8539 | 0.9482 | 0.5473 | ||
|
| 0.6923 | 0.0014 | 0.9965 | 0.9986 | 0.817 | 0.7628 | 0.8454 | 0.3479 | ||
|
| 0.0417 | 0.0075 | 0.0606 | 0.9925 | 0.0494 | 0.0411 | 0.5171 | 0.4906 | ||
|
| 0.8083 | 0.0175 | 0.9093 | 0.9825 | 0.8558 | 0.8288 | 0.8954 | 0.4495 | ||
|
| 0.9412 | 0.0071 | 0.8756 | 0.9929 | 0.9072 | 0.9028 | 0.9671 | 0.5328 | ||
|
| 0.6737 | 0.0138 | 0.8828 | 0.9862 | 0.7642 | 0.7421 | 0.8300 | 0.3954 | ||
|
| 0.9589 | 0.1346 | 0.3370 | 0.8654 | 0.4987 | 0.5244 | 0.9122 | 0.8109 | ||
Figure 6Temporal evolution of the first level (early event real-time detection) and second level (high level event analysis) classifiers. The 10-s delay of the high-level event analysis layer has been removed to ease the output comparison between two layers.