| Literature DB >> 27722048 |
Qiannan Jiang1, Mingzhou Liu1, Xiaoqiao Wang1, Maogen Ge1, Ling Lin1.
Abstract
The observation, decomposition and record of motion are usually accomplished through artificial means during the process of motion analysis. This method not only has a heavy workload, its efficiency is also very low. To solve this problem, this paper proposes a novel method to segment and recognize continuous human motion automatically based on machine vision for mechanical assembly operation. First, the content-based dynamic key frame extraction technology was utilized to extract key frames from video stream, and then automatic segmentation of action was implemented. Further, the SIFT feature points of the region of interest (ROIs) were extracted, on the basis of which the characteristic vector of the key frame was derived. The feature vector can be used not only to represent the characteristic of motion, but also to describe the connection between motion and environment. Finally, the classifier is constructed based on support vector machine (SVM) to classify feature vectors, and the type of therblig is identified according to the classification results. Our approach enables robust therblig recognition in challenging situations (such as changing of light intensity, dynamic backgrounds) and allows automatic segmentation of motion sequences. Experimental results demonstrate that our approach achieves recognition rates of 96.00 % on sample video which captured on the assembly line.Entities:
Keywords: Key frame extraction; Mechanical assembly operation; Motion recognition; SIFT feature points; Support vector machine
Year: 2016 PMID: 27722048 PMCID: PMC5031581 DOI: 10.1186/s40064-016-3279-x
Source DB: PubMed Journal: Springerplus ISSN: 2193-1801
Fig. 1Key frame extraction process
Fig. 2The process of hand detection. a Original hand image. b Skin color regions. c Hand regions
Fig. 3Algorithm process of motion recognition
Fig. 4The mapping process Displacement vector 1 indicates that two ROIs are close to each other; Displacement vector 2 indicates that two ROIs are not close to each other; Input space is the low dimensional space; Feature space is the high dimensional space which determined by kernel functions
Fig. 5The process of therblig recognition. Class I displacement vector indicate that hand is close to workpiece; Class III displacement vector indicate that hands are close to each other; Class V displacement vector indicate that workpieces are close to each other. Other kinds of displacement vectors indicate that ROIs are not close
Fig. 6Prototype system in the mechanical product assembly line
Fig. 7Four key frames
Fig. 8Confusion matrix of motion categories, with an average performance of 96 %. Rows represent actual motion categories; Columns represent recognition results of our method
Comparison of different methods on the video of bolt assembly operations
| Methods | Recognition objects | Accuracy (%) |
|---|---|---|
|
| Continuous motion sequence | 96.00 |
| Schuldt et al. ( | Motion segments | 71.75 |
| Niebles et al. ( | Motion segments | 83.30 |
| Jiang et al. ( | Motion segments | 84.35 |
| Reddy and Shah ( | Motion segments | 93.40 |
| Lu et al. ( | Motion segments | 95.05 |
Fig. 9The comparison results of the first four cycles. The actual therbligs were obtained by the experts through the observation, decomposition and recording of motion. The recognition therbligs were the results obtained with such method in this paper