| Literature DB >> 35808297 |
Viorica Rozina Chifu1, Cristina Bianca Pop1, Alexandru Miron Rancea1, Andrei Morar2, Tudor Cioara1, Marcel Antal1, Ionut Anghel1.
Abstract
The monitoring of the daily life activities routine is beneficial, especially in old age. It can provide relevant information on the person's health state and wellbeing and can help identify deviations that signal care deterioration or incidents that require intervention. Existing approaches consider the daily routine as a rather strict sequence of activities which is not usually the case. In this paper, we propose a solution to identify flexible daily routines of older adults considering variations related to the order of activities and activities timespan. It combines the Gap-BIDE algorithm with a collaborative clustering technique. The Gap-BIDE algorithm is used to identify the most common patterns of behavior considering the elements of variations in activities sequence and the period of the day (i.e., night, morning, afternoon, and evening) for increased pattern mining flexibility. K-means and Hierarchical Clustering Agglomerative algorithms are collaboratively used to address the time-related elements of variability in daily routine like activities timespan vectors. A prototype was developed to monitor and detect the daily living activities based on smartwatch data using a deep learning architecture and the InceptionTime model, for which the highest accuracy was obtained. The results obtained are showing that the proposed solution can successfully identify the routines considering the aspects of flexibility such as activity sequences, optional and compulsory activities, timespan, and start and end time. The best results were obtained for the collaborative clustering solution that considers flexibility aspects in routine identification, providing coverage of monitored data of 89.63%.Entities:
Keywords: activities patterns; collaborative clustering; daily activity detection; daily routine; deep learning model; pattern mining
Mesh:
Year: 2022 PMID: 35808297 PMCID: PMC9269491 DOI: 10.3390/s22134803
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Abbreviations and letter symbols.
| Abbreviation | Unit or Term |
|---|---|
|
| Activity |
|
| frequency of occurrence of a length of activity |
|
| Ratio between the timespan of an activity and the timespan of the same activity in the cluster centroid |
|
| Sub routine corresponding to a period of the day |
|
| Low and High thresholds for the variation of the timespan of an activity |
|
| end time of the activities in a specific period of the day |
|
| start time of the activities in a specific period of the day |
|
| distance within the cluster |
| ADL | Activities of Daily Living |
|
| number of clusters |
| CNN | Convolutional Neural Network |
| distance between the centroids of the two clusters, | |
| the distances between all frequent daily activity patterns in the cluster | |
|
| number of days from which the routine is extracted |
| DBSCAN | Density-based spatial clustering of applications with noise |
|
| Set of days containing frequent sequence of activities |
| HMM | Hidden Markov models |
|
| the length of some sequence of activities |
| LTSM | Long Short-Term Memory |
|
| Average duration of an activity. |
| WHO | World Health Organization |
| α | Percentage value that can be adjusted based on the experimental results |
|
| Cluster of activities |
|
| Davies-Bouldin Index |
|
| Dunn index |
|
| the average distance between the frequent daily activities pattern |
|
| the smallest average distance between the frequent pattern |
|
| Activity corresponding to clusters’ centroids |
|
| Routine for a day |
|
| activity sequence representing the baseline |
|
| timespan of activities for all days and all periods of the dataset |
| Specifies if an activity is mandatory or not. | |
|
| Specific day period. It can be night, morning, afternoon, or evening. |
|
| Silhouette score |
|
| distance between the cluster |
Comparison of research literature approaches.
| Ref. | Method | ADL Features | Flexibility | Routine Definition | Solution Features |
|---|---|---|---|---|---|
| [ | DBSCAN algorithm | Activity start time, duration | Not considered | Sequence of activities | Single routine, deviations are not considered |
| [ | FMR–AD algorithm & fuzzy rules | Activity frequency, regularity | Not considered | Sequence of activities and their frequency | Single routine, deviations detection |
| [ | Partition Around Medoids algorithm | Activity label | Not considered | Sequence of activities | A single rigid routine |
| [ | Fourier series representation combined with K-means | Activity label, activity duration | Activities time-related variability | Sequence of activities and associated durations | Three routines correspond to morning, afternoon, and night. |
| [ | Weighted kernel k-means algorithm & nominal matrix factorization method | Activity label | Not considered | Sequence of activities | A set of routines for the monitoring period |
| [ | Hidden Markov models combined with Baum–Welch algorithm and Viterbi algorithm | Activity label, location, posture of the person, activity duration | Not considered | Sequence of activities, associated durations, and locations | A single routine |
| [ | Markov Decision Process combined with relative entropy inverse reinforcement learning | Sensor’s locations, sensors activation time | Not considered | Trajectory vector extracted based on the sensor’s locations and sensors activation time | A single rigid routine represented as a trajectory vector |
| [ | Transition probability matrix | Time spent by a person in a particular room | Not allowed | State transition model where states are the home’s rooms, and the connections are the transitions | Normal mobility behavior of a person and deviations from it |
| [ | Grey model with a Markovian model | Activity frequency, activity duration | Variability related to the duration and frequency of activities | Sequence of activities, associated durations, and frequency | A single routine |
| [ | Probabilistic spatio-temporal model combined with K-means | The location of the subject and the sensor activation timespan | Not allowed | Sequence of location events, start time, end time, and location label | Two rigid behavioral patterns; Normal and deviations from it |
| [ | Shapiro-Wilk test combined with a non-parametrical statistical method | Time duration and frequency of visiting the rooms | Variability related to the duration | Activity’s duration and transitions between the rooms | Single routine |
| Our solution | Deep learning, GAP-BIDE algorithm, collaborative clustering | ADL timespan, start time, end time | Variability related to Activities time, sequence gaps variability | Sequence of four sub-routines corresponding to each period of the day | A single flexible routine composed of mandatory activities, optional activities, alternative variants of activities |
Figure 1Daily routine identification considering flexibility aspects. Frequent activities are extracted with the Gap-BIDE algorithm while Collaborative Clustering is used to group and detect activity patterns. The identified clusters are represented in the graphs as geometric forms with different colors.
Figure 2Frequent sequence identification using Gap-BIDE algorithm. Six steps are defined for extracting days that could represent routines or patterns.
Day division in periods.
| Period | Time |
|---|---|
| Night | 00:00:00 PM–06:59:59 AM |
| Morning | 07:00:00 AM–11:59:59 AM |
| afternoon | 12:00:00 AM–6:59:59 PM |
| Evening | 7:00:00 PM–11:59:59 PM |
Figure 3Filtering the sequences of activities based on the frequency of occurrence. Example for five different length sequences.
Figure 4Timespan of an activity and clusters centroids. Example for computing the timespan for activity a1.
Figure 5Architecture for data gathering and activity patterns detection. A smartwatch is used for collecting raw data that is ushed through a gateway (e.g., smartphone) in a cloud database.
Figure 6The confusion matrix for activity detection. The better the model, the values outside the main diagonal are lower compared to the ones on the main diagonal.
Figure 7Daily activities detected for a person in a day (24 h).
Figure 8Frequent patterns (most frequent sequences of activities) for the afternoon identified by the Gab-BIDE algorithm.
Figure 9Silhouette analysis for K-Means clustering on time span vectors with k = 4 (left) and k = 7 (right). Cluster labels are colored differently. The average Silhouette score is shown with the dotted red line.
Figure 10Silhouette score (blue) and Davies-Bouldin index (orange) relative to the number of clusters.
Figure 11Dendrogram obtained when running the Agglomerative Clustering algorithm for Night sequences. On the X-axis the sequence ids are represented, and on the Y-axis the dissimilarity distance. Red, blue, and green colors are used to different clusters of activities created.
Dunn index and Davies-Bouldin index values for Agglomerative Hierarchical Clustering.
| Number of Clusters | Dunn Index | Davies-Bouldin Index | Distribution of Elements in Clusters |
|---|---|---|---|
|
| 0.49747 | 0.50333 | Cluster 0: 11 elements; Cluster 1: 3 elements |
|
| 0.75388 | 0.47348 | Cluster 0: 3 elements; Cluster 1: 8 elements |
|
| 0.76397 | 0.37467 | Cluster 0: 8 elements; Cluster 1: 2 elements |
|
| 0.46472 | 0.457 | Cluster 0: 3 elements; Cluster 1: 4 elements |
|
| 0.46472 | 0.36311 | Cluster 0: 4 elements; Cluster 1: 4 elements |
Figure 12Example of flexible daily routine identified for a person for a full day (24 h).
Coverage Metric Values.
| Clustering Configuration | Routine Coverage Value |
|---|---|
| Collaborative Clustering (Union) | 89.63% |
| K-Means | 77.44% |
| Agglomerative Clustering | 88.41% |
Number of clusters and clustering execution time variation.
| Nr Clusters | Execution Time (s) | |
|---|---|---|
| Agglomerative Clustering | K-Means | |
| 2 | 0.01 | 1.49 |
| 3 | 0.02 | 1.17 |
| 4 | 0.02 | 1.00 |
| 5 | 0.03 | 1.20 |
| 6 | 0.02 | 1.42 |
Data size impact on clustering execution time.
| Data Set Size | Execution Time |
|---|---|
| 400 | 1.19 |
| 300 | 0.62 |
| 200 | 0.23 |
Parameters effect on deep learning models performance.
| Parameters Varied | Average Values | ||||
|---|---|---|---|---|---|
| Window Size | Stride | Number of Models | Accuracy | Loss | Learning Rate |
| 200 | 50 | 15 | 0.945 | 0.143 | 0.003 |
| 200 | 50 | 15 | 0.936 | 0.164 | 0.025 |
| 600 | 30 | 30 | 0.977 | 0.059 | 0.015 |
| 600 | 35 | 30 | 0.979 | 0.054 | 0.012 |
| 600 | 40 | 10 | 0.978 | 0.059 | 0.015 |
| 600 | 50 | 40 | 0.976 | 0.063 | 0.011 |
| 900 | 30 | 35 | 0.978 | 0.056 | 0.008 |