| Literature DB >> 35684589 |
Sonia Das1, Sukadev Meher1, Upendra Kumar Sahoo1.
Abstract
Smartphone-based gait recognition has been considered a unique and promising technique for biometric-based identification. It is integrated with multiple sensors to collect inertial data while a person walks. However, captured data may be affected by several covariate factors due to variations of gait sequences such as holding loads, wearing types, shoe types, etc. Recent gait recognition approaches either work on global or local features, causing failure to handle these covariate-based features. To address these issues, a novel weighted multi-scale CNN (WMsCNN) architecture is designed to extract local to global features for boosting recognition accuracy. Specifically, a weight update sub-network (Ws) is proposed to increase or reduce the weights of features concerning their contribution to the final classification task. Thus, the sensitivity of these features toward the covariate factors decreases using the weight updated technique. Later, these features are fed to a fusion module used to produce global features for the overall classification. Extensive experiments have been conducted on four different benchmark datasets, and the demonstrated results of the proposed model are superior to other state-of-the-art deep learning approaches.Entities:
Keywords: gait recognition; inertial sensor; multi-scale CNN; smartphone sensor
Mesh:
Year: 2022 PMID: 35684589 PMCID: PMC9182843 DOI: 10.3390/s22113968
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Overview of the proposed framework.
Figure 2Detailed design of the WMs–CNN–Local–Global model.
Figure 3Architecture of a weight update sub-network (Ws) to achieve discriminative features.
Details of four challenging datasets.
| Database | No. | Number of | Sampling | Challenges |
|---|---|---|---|---|
|
| #1 | 745 | 100 Hz | A large database with fewer samples on each subject and |
|
| #1 | 118 | 50 Hz | |
|
| #1 | 10 | Variation of walking speed: normal and fast with seven different covariates: | |
|
| - | 50 | 100 Hz | Wear different shoe types and different clothes |
Detailed parameters of the proposed single-scale CNN network.
| Layer Name | Input | Kernel | Number of | Feature | Number of |
|---|---|---|---|---|---|
| Conv1_1 |
|
| 32 |
| 1760 |
| MaxPool1 |
|
| / |
| |
| Conv2_1 |
|
| 64 |
| 10,304 |
| Conv2_2 |
|
| 128 |
| 41,088 |
| MaxPool2 |
|
| / |
| |
| Conv3_1 |
|
| 128 |
| 49,280 |
Rank-1 and Rank-5 identification rates, and verification rate (VR) of different gait datasets are reported by integrating 2/3/4/5 numbers of the Ws sub-network layers into various CNN architectures at the presence of different time scales (). Bold font indicates the best performance.
| sub-Networks (s) | whuGait | IDnet | OU-ISIR | Gait-Mob-ACC | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Rank1 | Rank-5 | VR | Rank-1 | Rank-5 | VR | Rank-1 | Rank-5 | VR | Rank-1 | Rank-5 | VR | |||||
| Id | Id | (FAR = | Id | Id | (FAR = | Id | Id | (FAR = | Id | Id | (FAR = | |||||
| 2 | AlexNet | 78.69 | 82.94 | 0.79 | 81.98 | 83.99 | 0.79 | 55.89 | 61.82 | 0.43 | 75.88 | 80.75 | 0.74 | |||
| VGG-14 | 83.66 | 86.43 | 0.83 | 86.24 | 90.91 | 0.86 | 57.76 | 61.83 | 0.44 | 75.94 | 79.04 | 0.74 | ||||
| VGG-16 | 84.56 | 87.06 | 0.83 | 86.31 | 91.65 | 0.87 | 57.88 | 62.95 | 0.44 | 76.32 | 79.99 | 0.75 | ||||
| ResNet-50 |
|
|
|
|
|
|
|
|
|
|
|
| ||||
| CWs-AlexNet | 80.98 | 85.91 | 0.82 | 84.24 | 93.46 | 0.81 | 56.91 | 61.98 | 0.43 | 78.88 | 83.18 | 0.77 | ||||
| CWs-VGG14 | 85.87 | 90.05 | 0.85 | 88.06 | 93.32 | 0.88 | 60.78 | 64.21 | 0.46 | 83.19 | 88.06 | 0.79 | ||||
| CWs-VGG16 | 86.33 | 91.87 | 0.86 | 89.87 | 94.54 | 0.89 | 61.43 | 65.32 | 0.47 | 84.67 | 88.42 | 0.8 | ||||
| CWs-ResNet50 | 95.04 | 97.76 | 0.91 | 96.11 | 97.97 | 0.93 | 69.53 | 73.89 | 0.51 | 89.55 | 93.67 | 0.94 | ||||
| proposed |
|
|
|
|
|
|
|
|
|
|
|
| ||||
| 3 | AlexNet | 89.01 | 92.64 | 0.81 | 88.87 | 92.57 | 0.84 | 59.56 | 61.84 | 0.45 | 80.01 | 83.65 | 0.87 | |||
| VGG14 | 91.32 | 94.54 | 0.82 | 92.01 | 95.36 | 0.87 | 61.19 | 65.92 | 0.47 | 83.88 | 87.73 | 0.88 | ||||
| VGG16 | 91.78 | 95.81 | 0.83 | 92.42 | 95.75 | 0.88 | 61.76 | 64.20 | 0.46 | 84.88 | 88.78 | 0.89 | ||||
| ResNet50 |
|
|
|
|
|
|
|
|
|
|
|
| ||||
| CWs-AlexNet | 90.54 | 93.04 | 0.83 | 90.76 | 94.35 | 0.85 | 61.82 | 65.47 | 0.47 | 83.71 | 85.45 | 0.90 | ||||
| CWs-VGG14 | 93.12 | 96.13 | 0.89 | 94.02 | 97.61 | 0.89 | 64.89 | 68.78 | 0.48 | 87.21 | 91.65 | 0.91 | ||||
| CWs-VGG16 | 93.03 | 96.43 | 0.89 | 94.24 | 97.86 | 0.90 | 65.56 | 69.35 | 0.49 | 88.45 | 93.67 | 0.93 | ||||
| CWs-ResNet50 | 96.32 | 98.53 | 0.93 | 98.24 | 100 | 0.96 | 72.86 | 76.59 | 0.53 | 94.01 | 98.15 | 0.96 | ||||
| Proposed |
|
|
|
|
|
|
|
|
|
|
|
| ||||
| 4 ( | AlexNet | 87.43 | 92.56 | 0.83 | 86.21 | 90.64 | 0.83 | 53.54 | 58.43 | 0.40 | 72.88 | 77.67 | 0.86 | |||
| VGG14 | 87.43 | 91.01 | 0.84 | 88.12 | 92.89 | 0.86 | 60.19 | 64.84 | 0.46 | 80.32 | 84.43 | 0.88 | ||||
| VGG16 | 88.34 | 92.43 | 0.84 | 88.51 | 93.30 | 0.86 | 60.48 | 65.48 | 0.46 | 82.98 | 85.13 | 0.89 | ||||
| ResNet50 |
|
|
|
|
|
|
|
|
|
|
|
| ||||
| CWs-AlexNet | 90.34 | 95.89 | 0.86 | 88.13 | 93.03 | 0.85 | 55.76 | 60.20 | 0.41 | 76.71 | 80.51 | 0.87 | ||||
| CWs-VGG14 | 91.03 | 95.96 | 0.86 | 90.07 | 95.46 | 0.87 | 64.98 | 68.78 | 0.48 | 84.23 | 88.89 | 0.92 | ||||
| CWs-VGG16 | 91.89 | 96.65 | 0.87 | 90.45 | 94.55 | 0.88 | 65.16 | 69.98 | 0.49 | 86.89 | 91.78 | 0.94 | ||||
| CWs-ResNet50 | 94.12 | 98.27 | 0.9 | 94.39 | 97.06 | 0.92 | 69.97 | 73.32 | 0.52 | 91.13 | 91.98 | 0.95 | ||||
| Proposed |
|
|
|
|
|
|
|
|
|
|
|
| ||||
| 5 ( | AlexNet | 79.45 | 83.65 | 0.76 | 79.63 | 84.32 | 0.82 | 52.17 | 56.99 | 0.41 | 77.44 | 80.21 | 0.81 | |||
| VGG14 | 81.69 | 84.32 | 0.78 | 86.33 | 90.56 | 0.82 | 53.11 | 57.42 | 0.4 | 79.64 | 82.43 | 0.83 | ||||
| VGG16 | 81.94 | 84.87 | 0.79 | 87.44 | 91.21 | 0.83 | 54.98 | 57.96 | 0.4 | 81.01 | 85.78 | 0.85 | ||||
| ResNet50 |
|
|
|
|
|
|
|
|
|
|
|
| ||||
| CWs-AlexNet | 83.17 | 85.54 | 0.79 | 82.54 | 87.54 | 0.83 | 54.12 | 58.73 | 0.42 | 80.32 | 84.04 | 0.82 | ||||
| CWs-VGG14 | 89.12 | 91.35 | 0.85 | 89.32 | 93.76 | 0.87 | 61.41 | 64.56 | 0.45 | 84.36 | 87.55 | 0.89 | ||||
| CWs-VGG16 | 89.54 | 92.98 | 0.85 | 91.07 | 93.86 | 0.88 | 61.59 | 65.32 | 0.46 | 86.14 | 90.76 | 0.91 | ||||
| CWs-ResNet50 | 93.32 | 97.89 | 0.91 | 95.89 | 97.32 | 0.91 | 65.76 | 71.34 | 0.50 | 90.12 | 94.32 | 0.93 | ||||
| Proposed |
|
|
|
|
|
|
|
|
|
|
|
| ||||
Figure 4Performance evaluation of the proposed network in terms of accuracy on the five different types of gait sequences, with the influence of varying (a) batch size (b) number of steps per gait cycle, (c) amount of training data.
Comparison of state-of-the-art methods on different benchmark datasets in terms of Rank-1 identification rate.
| Methods | whuGait | whuGait | IDNet | OU-ISIR | OU-ISIR | Gait-Mob-ACC |
|---|---|---|---|---|---|---|
| IdNet [ | 92.91% | 96.78% | 99.58% | 44.29% | 46.20% | 74.75% |
| CNN [ | 92.89% | 97.02% | 99.71% | 40.60% | 47.14% | 90.2% |
| LSTM [ | 91.88% | 96.98% | 99.46% | 66.36% | 65.32% | 81.65% |
| DeepConv [ | 92.25% | 96.80% | 99.24% | 37.33% | 41.32% | 86.23% |
| CNN+LSTM [ | 92.51% | 96.82% | 99.61% | 34.28% | 53.96% | 89.22% |
| 92.94% | 97.04% | 99.64% | - | - | - | |
| CNN+ | 93.52% | 97.33% | 99.75% | - | - | - |
| WMsCNN-Local | 93.36% | 98.28% | 99.81% | 65.74% | 72.13% | 90.49% |
| WMsCNN-Local-Global | 95.75% | 98.98% | 99.96% | 73.56% | 76.42% | 94.71% |
Figure 5A comparative ROC curves of state-of-the-art deep learning networks: IDnet, CNN, LSTM, CNN+LSTM, and the proposed models. The performance of four benchmarks, each having different sub-datasets as referred in Table 1, is shown in (a–f). (a,b) refer to whuGait Dataset #1 and Dataset #2, respectively, (c) refers to the IdNet dataset, (d,e) refer to sub-dataset #1 and sub-dataset #2 of the OU-ISIR dataset, respectively, and (f) refers to the Gait-mob-ACC dataset.