| Literature DB >> 24721766 |
Oresti Banos1, Juan-Manuel Galvez2, Miguel Damas3, Hector Pomares4, Ignacio Rojas5.
Abstract
Signal segmentation is a crucial stage in the activity recognition process; however, this has been rarely and vaguely characterized so far. Windowing approaches are normally used for segmentation, but no clear consensus exists on which window size should be preferably employed. In fact, most designs normally rely on figures used in previous works, but with no strict studies that support them. Intuitively, decreasing the window size allows for a faster activity detection, as well as reduced resources and energy needs. On the contrary, large data windows are normally considered for the recognition of complex activities. In this work, we present an extensive study to fairly characterize the windowing procedure, to determine its impact within the activity recognition process and to help clarify some of the habitual assumptions made during the recognition system design. To that end, some of the most widely used activity recognition procedures are evaluated for a wide range of window sizes and activities. From the evaluation, the interval 1-2 s proves to provide the best trade-off between recognition speed and accuracy. The study, specifically intended for on-body activity recognition systems, further provides designers with a set of guidelines devised to facilitate the system definition and configuration according to the particular application requirements and target activities.Entities:
Mesh:
Year: 2014 PMID: 24721766 PMCID: PMC4029702 DOI: 10.3390/s140406474
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Principal segmentation techniques.
| Activity-defined windows | Sekine |
| Event-defined windows | Aminian |
| Sliding windows | Mantyjarvi |
Figure 1.Distribution of the activity recognition research studies presented in Tables 2 and 3 based on the window size.
Studies that use the sliding window approach (Part 1).
| Mantyjarvi | Level walking, stairs up/down, opening doors (4) | Left and right sides of the hip (2) | MLP (83%–90%) | 2 |
| Kern | Sitting, standing, shaking hands, writing on a keyboard and more (8) | Ankle, knee, hip, wrist, elbow, shoulder on both sides (12) | NB (∼90%) | ∼0.5 |
| Krause | Walking, running, sitting, knee-bends, waving arms, climbing stairs and more (8) | Back of the upper arm (2) | K-means clustering, 1st order Markov | 8 |
| Bao and Intille (2004) (20 subjects) [ | Walking, running, scrubbing, brushing teeth and more (20) | Upper arm, wrist, thigh, hip, ankle (5) | DT (84%) kNN (83%) NB (52%) | ∼6.7 |
| Huynh and Schiele (2005) (2 subjects) [ | Walking, jogging, hopping, skipping and more | Shoulder strap (1) | NCC (∼80%) | 0.25, 0.5, 1, 2, 4 |
| Ravi | Walking, running, standing, vacuuming and more (8) | Waist (pelvic region) (1) | NB (64%) SVM (63%) DT (57%) kNN (50%) | 5.12 |
| Maurer | Walking, running, standing, sitting, upstairs, downstairs (6) | Wrist, belt, shirt pocket, trouser pocket, backpack, necklace (6) | DT (87%) kNN (<87%) NB (<87%) | 0.5 |
| Parkka | Walking, running, rowing, Nordic walking and more (8) | Chest, wrist (2) | DT (86%) MLP (82%) Hierarchical (82%) | 4, 10 |
| Pirttikangas | Walking, lying down, cycling, typing, vacuuming, drinking and more (17) | Right thigh and wrist, left wrist and necklace (4) | MLP (80%) kNN (90%) | 0.1, 0.2, 0.5, 0.7, 1, 1.5 |
| Huynh | High-level (going shopping, preparing for work, doing housework) (3) + Low-level (brushing teeth, taking a shower and more) (16) | Wrist, hip, thigh (3) | SVM (91.8%) kNN (83.4%) k-means (84.9%) HMMs (80.6%) for high-level SVM (79.1%) kNN (77%) k-means (69.4%) HMMs (67.4%) for low-level | 6 |
| Lovell | Walking patterns (slope-down, slope-up, flat, stairs-down, stairs-up) (5) | Waist (1) | MLP-RFS (92%) MLP-RR (88.5%) | ∼2.56 |
| Suutala | Lying down, vacuuming, typing, cycling, reading a newspaper, drinking and more (17) | Right thigh and wrist, left wrist, necklace (4) | 17 activities (SVM (90.6%) HMM (84.2%) SVM-HMM (84.4%) DTS (93.6%)) | 0.7 |
| 9 activities (SVM (94.1%) HMM (88.7%) SVM-HMM (90.4%) DTS (96.4%)) | ||||
| Amft and Troster (2008) (6 subjects) [ | Arm movements, chewing, swallowing (3) | Upper and lower arms (4) | Arm movements (79%) Chewing (86%) Swallowing (70%) | 0.5 |
| Stikic | Housekeeping (vacuuming, sweeping, dusting, ironing, mopping and more) (10) | Wrist (1) | NB (57%) HMMs (60%) JB (68%) | 0.5, 1, 2, 4, 8, 16, 32, 64, 128 |
| Preece | 2 datasets: jogging, running, hopping, jumping and more (8) + Walking, climbing stairs up/down (3) | Waist, thigh, ankle (3) | kNN (96% with 8 activities; 98% with 3 activities) | 2 |
| Altun and Barshan (2010) (8 subjects) [ | Sitting, playing basketball, standing, rowing, jumping and more (19) | Chest, both wrists and sides of the knees (5) | BDM (99.2%) LSM (89.6%) kNN (98.7%) DTW1 (83.2%) DTW2 (98.5%) SVM (98.8%) ANN (96.2%) | 5 |
| Han | Walking, running, standing, lying, falling, jumping (6) | Waist belt (1) | Fixed: HMM-P (78.8%) HMM-PNP (80.2%) Tilted: HMM-P (79.4%) HMM-PNP (53.2%) | 0.32 |
Studies that use the sliding window approach (Part 2).
| Khan | Walking, upstairs, downstairs, running, sitting (5) | Smartphone in 5 different pocket locations (shirt's top, jeans' rear/front-left/front-right, coat's inner) (1) | ANN-OF (46%) ANN-LDA (60%) ANN-KDA (96%) | 2 |
| Marx (2010) (1 subject) [ | Ball interactions (throwing, shaking, jerking sideways, holding very still) (4) | Embedded in iBall (1) | Heuristic (90%–95%) | 0.666 |
| Sun | Walking, running, stationary, upstairs, downstairs, driving, bicycling (7) | Front/rear pockets on the trousers, front pockets on the coat (6) | SVM (93% with acceleration magnitude in 4 s; 92% without acceleration magnitude in 5 s) | 1, 2, 3, 4, 5, 6 |
| Atallah | Reading, socializing, vacuuming and more (15) | Chest, arm, wrist, waist, knee, ankle, right ear (7) | kNN with k = 5 (∼56%) and k = 7 (∼64%), NB with Gaussian priors (∼61%) | 5 |
| Gjoreski and Gams (2011) (11 subjects) [ | Standing, sitting, lying, sitting on the ground, on all fours, going down, standing up (7) | Chest, left thigh, right ankle (3) | Random Forest (93% only with chest; 96% adding left thigh; 98% with all accelerometers) | 1 |
| Jiang | Walking, jogging, weight lifting, cycling, rowing and more (10) | Both forearms and shanks (4) | SVM ideal (95.1%) SVM with errors (75.2%) SVM without orientation errors (91.2%) SVM without errors (91.9%) | 6.4 |
| Kwapisz | Walking, jogging, upstairs, downstairs and more (6) | Smartphone (1) | DT (85.1%) LR (78.1%) MLP (91.7%) | 10 |
| Lee and Cho (2011) (3 subjects) [ | 3 actions (walking, standing, climbing stairs) + 3 activities (shopping, moving by walk, taking bus) | Smartphone in the hand (1) | HHMM (84%) HMM (65%) ANN (65%) | 5 |
| Siirtola and Röning (2012) (8 subjects) [ | Walking, running, cycling, sitting/standing, driving a car (5) | Smartphone in trousers' front pocket (1) | Offline (QDA (95.4%) kNN (94.5%)) Real-Time with Nokia (QDA (95.8%) kNN (93.9%)) Real-time with Samsung Galaxy (QDA (96.5%)) | 7.5 |
| Wang | Walking, jogging, upstairs, downstairs (4) | Smartphone (1) | GMM (91.2%) J48 (88.8%) LR (93.3%) | 0.5, 0.8 |
| Hemalatha and Vaidehi (2013) (5 subjects) [ | Walking, sitting/standing, lying, falling (4) | Chest (1) | FBPAC (92%) | 10 |
| Mannini | 4 broad activity classes (ambulation, cycling, sedentary and other), daily activities (26) | Wrist or ankle (1) | SVM (84.7% with wrist, 95% with ankle) for 12.8 s | 2, 4, 12.8 |
| Nam and Park (2013) (3 subjects) [ | Walking, toddling, crawling, wiggling, rolling and more (11) | Waist (1) | NB (81%) BN (87%) DT (75%) SVM (95%) kNN (96.2%) J48 (94.7%) MLP (96.3%) LR (93.2%) | ∼2.7 |
| Nam and Park (2013) (11 subjects) [ | Walking, toddling, crawling, wiggling, rolling and more (10) | Waist (1) | NB (73%) BN (84.8%) DT (74%) SVM (86.2%) kNN (84.1%) J48 (88.3%) MLP (84.8%) LR (86.9%) | ∼2.7 |
| Zheng | 3 datasets: Walking, running, dancing and more (7) in 1 | Wrist (1 in 1 | SWEM-SVM (94%/90%/82%) SVM (93%/89%/79%) ANN (91%/78%/74%) | 10 |
Figure 2.Different stages of the activity recognition chain (ARC). An example of the correlation of the windowing approach and subsequent levels of the ARC is shown. Here, different window sizes are depicted particularly. Concretely, M sensors deliver raw signals (u1, u2, …, uM), which are subsequently processed (p1, p2, …, p). The signals are partitioned into data windows of size W (e.g., s1, s2, …, s). For each window, k, a set of features are extracted and aggregated in a single feature vector (f1(s1), f(s2), …, f(s)) that is used as the input to a classifier. The classifier yields a class (c) that represents the identified activity.
Warm up, cool down and fitness exercises considered for the activity set.
| L1: Walking | L12: Waist rotation | L23: Shoulders high-amplitude rotation |
| L2: Jogging | L13: Waist bends (reach foot with opposite hand) | L24: Shoulders low-amplitude rotation |
| L3: Running | L14: Reach heels backwards | L25: Arms inner rotation |
| L4: Jump up | L15: Lateral bend | L26: Knees (alternating) to the breast |
| L5: Jump front and back | L16: Lateral bend with arm up | L27: Heels (alternatively) to the backside |
| L6: Jump sideways | L17: Repetitive forward stretching | L28: Knees bending (crouching) |
| L7: Jump leg/arms open/closed | L18: Upper trunk and lower body opposite twist | L29: Knees (alternating) bending forward |
| L8: Jump rope | L19: Lateral elevation of arms | L30: Rotation on the knees |
| L9: Trunk twist (arms outstretched) | L20: Frontal elevation of arms | L31: Rowing |
| L10: Trunk twist (elbows bent) | L21: Frontal hand claps | L32: Elliptical bike |
| L11: Waist bends forward | L22: Frontal crossing of arms | L33: Cycling |
Figure 3.Effect of the data window size on the activity recognition system performance (F1-score). Twelve recognition systems, respectively, corresponding to the combination of three feature sets (FS1, FS2, FS3) and four classification models (DT, NB, NCC, KNN) are evaluated.
Figure 4.Activity-specific recognition performance for diverse window sizes and methodologies (
Figure 5.Activity-specific recognition performance for diverse window sizes and methodologies (
Figure 6.Minimum window size required for diverse performance thresholds. The threshold values are respectively calculated from the maximum F1 – score that could be achieved for the recognition of each activity (represented on top). The results for two particular recognition methodologies are shown: (a) DT-FS2; and (b) KNN-FS2. Non-colored spots (not defined, ND) correspond to performance values for which no window enhancement may be obtained.
Summary of the windowing guidelines defined for diverse activity categories when prioritizing the recognition speed (W_) or the recognition performance (W_).
| Arms (19–25) | 0.5–1 | 0.75–2.25 |
| Legs (26–30) | 0.25–1.75 | 1.25–5.75 |
| Trunk (12,15,17) | 0.5–1.25 | 0.75–6.25 |
| Trunk + arms (9–11,13,16) | 0.5–1.25 | 1–4.5 |
| Trunk + legs (30,33) | 0.25 | 1–1.25 |
| All body parts (2–8,18,28,29,31,32) | 0.25–3.25 | 0.5–4 |
| Translation (1–3) | 0.25–0.5 | 1–1.5 |
| Jumps (4–8) | 0.5–3.25 | 1.75–6.75 |
| Energetic (1–8,18,23,26,28,29,31–33) | 0.25–3.25 | 1–3.25 |
| Non-energetic (9–17,19–22,24,25,27,30) | 0.25–2 | 1.25–5.75 |
| Rehab lower body (2–8,18,26-33) | 0.25–3.25 | 1.25–3.25 |
| Rehab upper body (9–11,13,16,19–25) | 1–1.25 | 1.25–2.25 |
| Security/Military (1–6,18,26,27) | 0.25–2 | 1–5.75 |
| Gaming jumps (4–8) | 0.5–3.25 | 1.75–6.75 |
| Gaming hits (20,23) | 1 | 1.25–1.75 |
| Gaming dance (12,18,19,22,28) | 0.5–2 | 0.75–4 |
| Sport/Wellness (1–33) | 0.25-3.25 | 0.5–6.75 |