| Literature DB >> 35271162 |
Yan-Ying Li1, Shoue-Jen Wang2, Yi-Ping Hung1.
Abstract
Sleep quality is known to have a considerable impact on human health. Recent research shows that head and body pose play a vital role in affecting sleep quality. This paper presents a deep multi-task learning network to perform head and upper-body detection and pose classification during sleep. The proposed system has two major advantages: first, it detects and predicts upper-body pose and head pose simultaneously during sleep, and second, it is a contact-free home security camera-based monitoring system that can work on remote subjects, as it uses images captured by a home security camera. In addition, a synopsis of sleep postures is provided for analysis and diagnosis of sleep patterns. Experimental results show that our multi-task model achieves an average of 92.5% accuracy on challenging datasets, yields the best performance compared to the other methods, and obtains 91.7% accuracy on the real-life overnight sleep data. The proposed system can be applied reliably to extensive public sleep data with various covering conditions and is robust to real-life overnight sleep data.Entities:
Keywords: deep multi-task learning; head and upper-body detection; head and upper-body pose classification; sleep monitoring; sleep posture
Mesh:
Year: 2022 PMID: 35271162 PMCID: PMC8914692 DOI: 10.3390/s22052014
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Comparison of previous studies and the proposed method on sleep posture classification.
| Sensor | Method | Dataset Used | Advantages | Limitations |
|---|---|---|---|---|
| Pressure-sensing mat | 3D human pose estimation based on deep learning method [ | Simulation dataset | A pressure-sensing mat is robust to covering. | A pressure-sensing mat has high cost |
| Thermal camera | Human pose estimation based on deep learning method [ | Simulation dataset | A thermal camera is robust to illuminationchanges and covering. | A thermal camera has high cost for home use. |
| Depth camera | Sleep posture classification | Simulation dataset | A depth camera is robust to low light intensity. |
Depth measurements of depth cameras suffer from various noise factors. The depth camera is not prevalent at home surveillance. |
| Infrared camera | Sleep vs. wake states detection in young children based on motion analysis [ | Real sleep data |
A single infrared camera is convenient and low-cost. A general-purpose head detector is used. | This method only succeeds for 50% of nights. |
| Human pose estimation based on deep learning method (OpenPose) [ | Simulation dataset | The method can extract features of the skeleton effectively. | Their data [ | |
| Sleep posture classificationbased on deep learning method [ | Simulation dataset | The deep learning method can achieve good accuracy. | It focuses on classifying posture without detecting the upper-body and head region. | |
| Sleep posture detection and classification based on deep learning method (proposed method) | Simulation and real sleep dataset | A unified framework for simultaneously detecting and classifying upper-body pose and head pose is proposed. | Training personal data to learn CNN is required. |
Figure 1Architecture of the proposed framework.
Figure 2SleePose-FRCNN-Net architecture.
Indicators of sleep quality.
| Indicator | Description | Unit |
|---|---|---|
| Shifts in sleep posture | Count of posture changes | Count |
| Number of postures | Count of postures with duration longer than 15 min | Count |
| Average duration in a posture | Mean of the posture duration | Minutes |
| Sleep efficiency | Percentage of time without turning | Percentage |
Figure 3Sample images for each pose class from the SLP database [42].
The details of the SLP dataset.
| Dataset | Sleep Postures Categories | Covering Condition | Set | Recorded Environment | Number of Subjects | Number of Images per Covering Condition |
|---|---|---|---|---|---|---|
| SLP [ | Supine, left side, and right side | No covering, thin sheet, and thick blanket | Train | Living room | 90 | 4050 |
| Validation | Living room | 12 | 540 | |||
| Test | Hospital room | 7 | 315 |
Figure 4Illustration of the system in the experiment [43].
Figure 5Examples of images from the iSP pilot dataset [43].
Figure 6The 10 types of sleep postures [43].
The details of the iSP pilot dataset.
| Dataset | Sleep Postures Categories | Covering Condition | Set | Recorded Environment | Number of Subjects | Number of Images per Covering Condition |
|---|---|---|---|---|---|---|
| iSP [ | Supine, left side, right side, and prone | No covering, | Train | Lab | 25 | 15,000 |
| Test | Lab | 11 | 6600 |
The iSP real-life dataset summary.
| Subject | Hours | Number of Frames |
|---|---|---|
| S1 | 8 | 576,000 |
| S2 | 6 | 432,000 |
| S3 | 7 | 504,000 |
| S4 | 8 | 576,000 |
Figure 7Examples of RGB/IR images from the iSP real-life dataset.
YouTube dataset summary.
| Subject | Hours | Number of Frames |
|---|---|---|
| S1 | 8 | 28,800 |
| S2 | 5 | 18,800 |
| S3—Day 1 | 8 | 28,800 |
| S3—Day 2 | 7.5 | 27,000 |
Results of detection on the SLP dataset.
| Model | mAP | mAP | mAP | |||
|---|---|---|---|---|---|---|
| Val | Test | Val | Test | Val | Test | |
| YOLOv3 [ | 99.26 | 96.84 | 99.58 | 69.30 | 99.70 | 64.34 |
| YOLOv4 [ | 98.67 | 95.59 | 99.54 | 91.27 | 99.29 | 83.84 |
| SleePose-FRCNN-Net | 99.82 | 99.86 | 99.97 | 99.91 | 99.64 | 92.31 |
Results of head pose classification on the SLP dataset.
| Model | Accuracy | Accuracy | Accuracy | |||
|---|---|---|---|---|---|---|
| Val | Test | Val | Test | Val | Test | |
| Akbarian [ | 95.61 | 87.50 | 96.70 | 83.66 | 97.00 | 85.52 |
| Mohammadi [ | 85.56 | 78.47 | 79.42 | 71.90 | 85.19 | 71.03 |
| Inception [ | 85.77 | 78.47 | 79.61 | 72.22 | 88.20 | 70.69 |
| SleePose-FRCNN-Net | 99.25 | 96.15 | 99.07 | 90.48 | 98.70 | 92.70 |
Results of upper-body pose classification on the SLP dataset.
| Model | Accuracy | Accuracy | Accuracy | |||
|---|---|---|---|---|---|---|
| Val | Test | Val | Test | Val | Test | |
| Akbarian [ | 98.27 | 85.06 | 96.29 | 88.56 | 96.19 | 88.64 |
| Mohammadi [ | 85.36 | 77.60 | 91.84 | 73.20 | 88.57 | 68.18 |
| Inception [ | 86.51 | 66.12 | 89.98 | 82.20 | 88.57 | 64.29 |
| SleePose-FRCNN-Net | 99.44 | 95.24 | 99.07 | 91.11 | 98.70 | 91.11 |
Figure 8Confusion matrix for head pose classification on the SLP dataset.
Figure 9Confusion matrix for upper-body pose classification on the SLP dataset.
Results of detection on the iSP pilot dataset.
| Model | mAP | mAP | mAP |
|---|---|---|---|
| YOLOv3 [ | 97.11 | 93.63 | 95.38 |
| YOLOv4 [ | 99.61 | 94.11 | 98.59 |
| SleePose-FRCNN-Net | 99.82 | 94.67 | 99.64 |
Results of head pose classification on the iSP pilot dataset.
| Model | Accuracy | Accuracy | Accuracy |
|---|---|---|---|
| Akbarian [ | 87.62 | 92.10 | 91.80 |
| Mohammadi [ | 77.90 | 74.82 | 71.05 |
| Inception [ | 83.95 | 84.16 | 78.61 |
| SleePose-FRCNN-Net | 96.65 | 97.33 | 94.58 |
Results of upper-body pose classification on the iSP pilot dataset.
| Model | Accuracy | Accuracy | Accuracy |
|---|---|---|---|
| Akbarian [ | 87.62 | 92.10 | 91.80 |
| Mohammadi [ | 77.90 | 74.82 | 71.05 |
| Inception [ | 83.95 | 84.16 | 78.61 |
| SleePose-FRCNN-Net | 97.79 | 98.33 | 94.72 |
Figure 10Confusion matrix for head pose classification on the iSP pilot dataset.
Figure 11Confusion matrix for upper-body pose classification on the iSP pilot dataset.
Figure 12Repurposed SimpleBaseline [49] (first row) and OpenPose [50] (second row) performance on the iSP dataset.
Figure 13Sleep posture detection on real-life data. The left column shows images in IR mode and the right column shows in RGB mode.
Results of detection on the YouTube dataset.
| Method | mAP | |||
|---|---|---|---|---|
| S1 | S2 | S3—Day 1 | S3—Day 2 | |
| SleePose-FRCNN-Net | 96.33 | 99.57 | 99.66 | 96.30 |
Results of head pose classification on the YouTube dataset.
| Method | Accuracy | |||
|---|---|---|---|---|
| S1 | S2 | S3—Day 1 | S3—Day 2 | |
| SleePose-FRCNN-Net | 89.87 | 90.67 | 95.44 | 92.38 |
Results of upper-body pose classification on the YouTube dataset.
| Method | Accuracy | |||
|---|---|---|---|---|
| S1 | S2 | S3—Day 1 | S3—Day 2 | |
| SleePose-FRCNN-Net | 98.21 | 96.10 | 97.62 | 98.25 |
Figure 14Pictorial representation of sleep poses and motion over time of subjects.
Sleep indicators of subjects.
| Subject | ||||
|---|---|---|---|---|
| S1 | S2 | S3—Day1 | S3—Day2 | |
| Shifts in sleep posture (n/hour) | 1.63 | 1.00 | 2.38 | 2.53 |
| Number of postures that last longer than 15 min (n/hour) | 1.38 | 1.20 | 1.50 | 1.75 |
| Average duration in a posture (min) | 19.20 | 60.00 | 25.26 | 23.68 |
| Sleep efficiency (percentage) | 89.38 | 96.00 | 91.67 | 83.56 |
Results of the proposed model that was trained on the SLP dataset and tested on the iSP pilot dataset.
| Pose | Accuracy | Accuracy | Accuracy |
|---|---|---|---|
| Head | 92.57 | 94.74 | 86.44 |
| Upper-Body | 92.92 | 91.10 | 83.51 |