| Literature DB >> 24465420 |
Chunhui Zhang1, Jun Zhang1, Heng Zhao1, Jimin Liang1.
Abstract
The part-based method has been a fast rising framework for object detection. It is attracting more and more attention for its detection precision and partial robustness to the occlusion. However, little research has been focused on the problem of occlusion overlapping of the part regions, which can reduce the performance of the system. This paper proposes a part-based probabilistic model and the corresponding inference algorithm for the problem of the part occlusion. The model is based on the Bayesian theory integrally and aims to be robust to the large occlusion. In the stage of the model construction, all of the parts constitute the vertex set of a fully connected graph, and a binary variable is assigned to each part to indicate its occlusion status. In addition, we introduce a penalty term to regularize the argument space of the objective function. Thus, the part detection is formulated as an optimization problem, which is divided into two alternative procedures: the outer inference and the inner inference. A stochastic tentative method is employed in the outer inference to determine the occlusion status for each part. In the inner inference, the gradient descent algorithm is employed to find the optimal positions of the parts, in term of the current occlusion status. Experiments were carried out on the Caltech database. The results demonstrated that the proposed method achieves a strong robustness to the occlusion.Entities:
Mesh:
Year: 2014 PMID: 24465420 PMCID: PMC3894947 DOI: 10.1371/journal.pone.0084624
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Two kinds of occlusions faced by the part-based model.
(a) Occlusion which does not overlap the part areas. (b) Occlusion overlapping the part areas.
Figure 2Spatial relationship among parts.
(a) Bag model. (b) Constellation model. (c) Pictorial model. (d) -fan model ( from left to right).
Figure 3Influence of the disabled parts on the detection of the normal ones.
(a) is the manual label of the face. In (b), the eyes have been occluded (the occlusion is represented by the dotted box). (c) shows the detection results that the disabled parts degrade the detection of the normal ones. (d) illustrates the detection results after the occluded parts are discarded from the model.
Figure 4Flow chart of the outer inference.
Algorithm 1: Outer inference.
|
|
| Step 1. Use the simulated annealing algorithm to obtain the initial position, then obtain current optimal position |
| Step 2. Use all of the two-part sub-model (illustrated in |
| Step 3. |
| (a) Calculate gradient vector |
| (b) If |
| (c) In |
| (d) If there is no feasible descending bit, invert the most irresolute bit in |
| (e) If there is only one feasible descending bit, invert it, and go to Step 3g, else go to Step 3f; |
| (f) If there are at least two feasible descent bits, therein invert the corresponding bit with a probability proportional to its gradient absolute value; |
| (g) Carry out the GD algorithm for |
| (h) Choose a different bit to invert again randomly, go to Step 3g; |
|
|
|
|
Figure 61-fan model and discarded 1-fan model.
Figure 7Influence of disabled parts on the Faces dataset.
Figure 8Sample images in the two-part-shaded test dataset.
(a) is an image with part(1,2) being shaded (the occlusion degree is 81%). (b) is an image with part(4,6) being shaded (the occlusion degree is 64%). The purple solid boxes represent the part regions and the black dotted boxes reperesent the occlusion regions.
Figure 9Average distance error of the partially occluded experiment.
Average of the 1-fan model and our method when part(1,2) was shaded.
| Occlusion degree of each part | Average | Average |
| 44.00% | 6.6096 | 2.6243 |
| 49.00% | 6.9762 | 2.6026 |
| 54.00% | 7.3334 | 2.5952 |
| 59.00% | 7.7257 | 2.6025 |
| 64.00% | 8.1099 | 2.5939 |
| 69.44% | 8.7296 | 2.5946 |
| 75.11% | 11.6345 | 2.6098 |
| 81.00% | 11.9330 | 2.6090 |
| 87.11% | 14.0736 | 2.6091 |
| 93.44% | 16.0132 | 2.6021 |
| 100.00% | 18.3148 | 2.5845 |
The results of complete shading experiment on the Faces dataset.
| Test set |
|
| Average |
| One-part-shaded test datasets | 0.13% | 0.67% | 2.4057 |
| Two-part-shaded test datasets | 0.42% | 0.50% | 2.6009 |
| Three-part-shaded test datasets | 2.78% | 1.11% | 3.8830 |
| Four-part-shaded test datasets | 5.33% | 1.83% | 5.8602 |