| Literature DB >> 35154616 |
Zia Ur Rahman1, Syed Irfan Ullah1, Abdus Salam1, Taj Rahman2, Inayat Khan3, Badam Niazi4.
Abstract
According to statistics, stroke is the second or third leading cause of death and adult disability. Stroke causes losing control of the motor function, paralysis of body parts, and severe back pain for which a physiotherapist employs many therapies to restore the mobility needs of everyday life. This research article presents an automated approach to detect different therapy exercises performed by stroke patients during rehabilitation. The detection of rehabilitation exercise is a complex area of human activity recognition (HAR). Due to numerous achievements and increasing popularity of deep learning (DL) techniques, in this research article a DL model that combines convolutional neural network (CNN) and long short-term memory (LSTM) is proposed and is named as 3-Layer CNN-LSTM model. The dataset is collected through RGB (red, green, and blue) camera under the supervision of a physiotherapist, which is resized in the preprocessing stage. The 3-layer CNN-LSTM model takes preprocessed data at the convolutional layer. The convolutional layer extracts useful features from input data. The extracted features are then processed by adjusting weights through fully connected (FC) layers. The FC layers are followed by the LSTM layer. The LSTM layer further processes this data to learn its spatial and temporal dynamics. For comparison, we trained CNN model over the prescribed dataset and achieved 89.9% accuracy. The conducted experimental examination shows that the 3-Layer CNN-LSTM outperforms CNN and KNN algorithm and achieved 91.3% accuracy.Entities:
Mesh:
Year: 2022 PMID: 35154616 PMCID: PMC8837430 DOI: 10.1155/2022/1563707
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Figure 1Architecture of 3-Layer CNN-LSTM model.
Figure 2Process diagram of LSTM.
Figure 3Data augmentation.
Algorithm 1Data preprocessing for training.
Algorithm 23-layer CNN-LSTM model for the detection of rehabilitation exercise.
Comparison of 3-layer CNN-LSTM model with other standard models.
| S. No | Other standard models | Proposed CNN-LSTM model |
|---|---|---|
| 1 | RNN-LSTM model used in [ | CNN is used for feature extraction and selection of useful features, while LSTM is used for exercise recognition. This model maintains a balance between spatial and temporal information. |
|
| ||
| 2 | In [ | In the proposed model, 3-layer CNN is applied first to process spatial input data. The data is then fed to the LSTM layer to further refine the extracted data and detect the rehabilitation exercise. |
|
| ||
| 3 | The CNN model for exercise recognition was tested and observed that CNN learn too many complex parameters of about 2,575,753 during training. | The model learned about 392,765 parameters which conclude that CNN-LSTM model is lightweight which has reduced complexity and achieved better accuracy. |
|
| ||
| 4 | In [ | The 3-layer CNN-LSTM model learns activity feature automatically and handles these issues efficiently. We used an RGB camera instead of Kinect sensors to reduce complexity and processing time. |
List of hyperparameters selected for training.
| Processing stage | Hyperparameters | Values selected |
|---|---|---|
| Convolution_1 | Filters | 64 |
| Kernel size | 5 | |
| Stride | 1 | |
| Max pooling | 4 | |
|
| ||
| Convolution_2 | Filters | 128 |
| Kernel size | 3 | |
| Max pooling | 2 | |
|
| ||
| Convolution_3 | Filters | 256 |
| Kernel size | 3 | |
| Max pooling | 2 | |
|
| ||
| Training parameters | Learning rate | 0.1 |
| Epochs | 36 | |
| Batch size | 32 | |
| Optimizer | Adam | |
Figure 4List of rehabilitation exercises, from left to right: (a) extension of the neck; (b) rotation of neck; (c) flexion of the trunk side view; (d) flexion of the trunk front view; (e), (f) extension of the elbow joint (front and side view); (g) dorsiflexion of the foot; (h) plantar flexion of the foot; (i), (j) extension and flexion of the knee joint; (k) extension of the trunk; and (l) flexion of the wrist.
Description of the dataset.
| Total samples of rehabilitation exercise: 2250 | |
|---|---|
| Number of participants: 20 | |
| Rehabilitation exercise | No. of samples |
| Flexion and extension of neck | 332 |
| Flexion and extension of trunk | 328 |
| Flexion of knee joint | 348 |
| Flexion and extension of wrist | 489 |
| Dorsiflexion and plantar flexion | 419 |
| Abduction of upper limb | 334 |
Figure 5Confusion matrix of 3-layer CNN-LSTM model.
Classification report of 3-layer CNN-LSTM and CNN model.
| CNN-LSTM model | CNN model | |||||
|---|---|---|---|---|---|---|
| Exercises | Precision | Recall | F1-score | Precision | Recall | F1-score |
| Dorsiflexion | 0.95 | 0.96 | 0.95 | 0.93 | 0.99 | 0.96 |
| Neck exercise | 0.87 | 0.88 | 0.88 | 0.87 | 0.87 | 0.87 |
| Plantar flexion | 0.94 | 0.87 | 0.91 | 0.91 | 0.86 | 0.89 |
| Trunk extension | 0.94 | 0.87 | 0.90 | 0.93 | 0.81 | 0.86 |
| Trunk flexion | 0.89 | 0.93 | 0.91 | 0.95 | 0.84 | 0.89 |
| Wrist extension | 0.91 | 0.92 | 0.92 | 0.85 | 0.90 | 0.88 |
| Wrist flexion | 0.84 | 0.90 | 0.87 | 0.84 | 0.90 | 0.87 |
Figure 6Performance in terms of precision, recall, and f1-score.
Performance comparison of different models.
| S. No | Model | Accuracy (%) |
|---|---|---|
| 1 | KNN [ | 86.1 |
| 2 | CNN | 89.9 |
| 3 | 3-layer CNN-LSTM | 91.3 |
Figure 7Line graph showing the average accuracy of CNN-LSTM and CNN model.