| Literature DB >> 35401733 |
Abstract
With the explosive growth of the number of sports videos, the traditional sports video analysis method based on manual annotation has been difficult to meet the growing demand because of its high cost and many limitations. The traditional model is usually based on the target detection algorithm of manual features, and the detection of human posture features is not accurate. Compared with global image features such as line features, texture features and structure features, local image features have the characteristics of rich quantity in the image, low correlation between features, and will not affect the detection and matching of other features due to the disappearance of some features in the case of occlusion. Referring to the practice of Deep-ID network considering both local and global features, this paper adjusts the traditional neural network, and combines the improved neural network with the human joint model to form a human pose detection method based on graph neural network, and then applies the algorithm to multiperson human pose estimation. The results of several groups of comparative experiments show that the algorithm can better estimate the human posture in sports competition video, and has a good performance in solving multiperson pose estimation in sports game video.Entities:
Mesh:
Year: 2022 PMID: 35401733 PMCID: PMC8993548 DOI: 10.1155/2022/4727375
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Improved graph neural network structure.
Figure structural parameters of neural network.
| Layer | Conv1 | Pool1 | Conv2 | Pool2 | Conv3 |
|---|---|---|---|---|---|
| Input | 64 ∗ 64 ∗ 3 | 60 ∗ 60 ∗ 32 | 30 ∗ 30 ∗ 32 | 24 ∗ 24 ∗ 32 | 12 ∗ 12 ∗ 32 |
| Output | 60 ∗ 60 ∗ 32 | 30 ∗ 30 ∗ 32 | 24 ∗ 24 ∗ 32 | 12 ∗ 12 ∗ 32 | 9 ∗ 9 ∗ 32 |
| Size | 5 ∗ 5 ∗ 3 | 2 ∗ 2 ∗ 1 | 5 ∗ 5 ∗ 32 | 2 ∗ 2 ∗ 1 | 5 ∗ 5 ∗ 32 |
| Layer | Pool3 | Pool4 | FC1 | FC2 | FC3 |
| Input | 9 ∗ 9 ∗ 32 | 5 ∗ 5 ∗ 32 | 1 ∗ 1 ∗ 2400 | 1 ∗ 1 ∗ 2400 | 1 ∗ 1 ∗ 500 |
| Output | 5 ∗ 5 ∗ 32 | 1 ∗ 1 ∗ 32 | 1 ∗ 1 ∗ 2400 | 1 ∗ 1 ∗ 500 | 1 ∗ 1 ∗ 100 |
| Size | 2 ∗ 2 ∗ 1 | 5 ∗ 5 ∗ 1 | — | — | — |
Figure 2Structural model of human upper body diagram.
Figure 3Data processing module.
Figure 4Multilevel characteristic diagram of athletes in sports competition.
Figure 5Neural network structure diagram of human posture in sports video.
Human pose estimate datasets.
| Dataset | Feature |
|---|---|
| LSP | The data comes from sports category labels, image zoom, only annotated in each picture-one person, 14 key points |
| FLIC | Data comes from hollywood movies, 10 key points |
| MPII human pose | The data comes from YouTube videos, which covers 410 human activities, each image has an activity tag, and 16 key points |
| MSCOCO | The data comes from the internet, it contains various activities, 17 key points |
| Al challenge | The data is captured from the internet. It is currently the largest human pose image data set, and 14 key point |
| PoseTrack | The data comes from the MPII human pose data set and 15 key points |
| Buffy | The data comes from a TV show. Line segments are provided to indicate the location, and the size and direction of the body parts are also provided |
Figure 6Loss curve of human posture estimation network.
Figure 7Running time comparison of human pose estimate.
Figure 8PDJ curve comparing this algorithm with other algorithms on VIPS-VideoPose database.
Comparison of PCP value results between this algorithm and other algorithms on vipsvediopose.
| Methods | Algorithm (%) [ | Algorithm (%) [ | Algorithm (%) [ | Algorithm (%) [ | Ours (%) |
|---|---|---|---|---|---|
| Neck | 78.1 | 62.8 | 78.2 | 57.8 | 87.1 |
| Elbow | 80.9 | 69.3 | 79.4 | 54.7 | 88.2 |
| Shoulder | 91.7 | 83.3 | 90.5 | 34.2 | 95.1 |
| Wrist | 65.4 | 84.2 | 93.2 | 78.5 | 92.7 |
| Waist | 63.8 | 83.0 | 92.5 | 81.0 | 95.9 |