| Literature DB >> 28333088 |
Jie Guo1, Tingfa Xu2,3, Guokai Shi4, Zhitao Rao5, Xiangmin Li6.
Abstract
In this paper, we propose a multi-view structural local subspace tracking algorithm based on sparse representation. We approximate the optimal state from three views: (1) the template view; (2) the PCA (principal component analysis) basis view; and (3) the target candidate view. Then we propose a unified objective function to integrate these three view problems together. The proposed model not only exploits the intrinsic relationship among target candidates and their local patches, but also takes advantages of both sparse representation and incremental subspace learning. The optimization problem can be well solved by the customized APG (accelerated proximal gradient) methods together with an iteration manner. Then, we propose an alignment-weighting average method to obtain the optimal state of the target. Furthermore, an occlusion detection strategy is proposed to accurately update the model. Both qualitative and quantitative evaluations demonstrate that our tracker outperforms the state-of-the-art trackers in a wide range of tracking scenarios.Entities:
Keywords: PCA; multi-view; sparse representation; structural local appearance model; visual tracking
Year: 2017 PMID: 28333088 PMCID: PMC5419779 DOI: 10.3390/s17040666
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1The multi-view structural local subspace model. (a) The target templates, the spatial layout, and the target candidates; (b) the patch templates, structural PCA basis template, and the patch candidates; (c) the three sub-models and the unified model; and (d) the sparse coefficients of the three sub-models.
Figure 2The alignment-weighting average.
Figure 3Tracking results of the proposed method and the 12 state-of-the-art tracking methods on representative frames of total 51 sequences in the benchmark [2] (Football, Faceocc1, Fish, Suv, Doll, CarScale, Jogging-1, Subway, Jogging-2, Crossing, Boy, Walking, Singer1, Dog1, Deer, Freeman3, Couple, Liquor, Mhyang, Sylvester, Skiing, CarDark, Car4, Boy, Ironman, MotorRolling, Soccer, Coke, Bolt, Tiger1, Singer2, FaceOcc2, Tiger2, Girl, Lemming, David3, David, David2, Woman, Trellis, Dudek, MountainBike, Freeman1, Skaking1, Matrix, Walking2, Freeman4, FleetFace, Football1, Basketball, Shaking, from left to right, and top to bottom).
Overall performance of our tracker in terms of the value of parameter .
| 5 | 2 | 1.3 | 1.1 | 1 | 0.9 | 0.8 | 0.5 | 0.2 | |
|---|---|---|---|---|---|---|---|---|---|
| 0.277 | 0.429 | 0.485 | 0.491 | 0.488 | 0.442 | 0.416 | 0.223 | ||
| 0.352 | 0.536 | 0.604 | 0.623 | 0.610 | 0.582 | 0.519 | 0.295 |
The distribution of all of the sequences (the number of sequences which have the corresponding attribute).
| OCC | DEF | FM | IV | SV | MB | IPR | OPR | BC | OV | LR | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 29 | 19 | 17 | 25 | 28 | 12 | 31 | 39 | 21 | 6 | 4 |
Figure 4Precision plots (a) and success plots (b). The legend of the precision plot reports the score of precision plots for each method and the legend of the success plot reports the score of the success plots.
Average precision scores on different attributes: fast motion (FM), scale variation (SV), occlusion (OCC), background clutter (BC), deformation (DEF), motion blur (MB), illumination variation (IV), low-resolution (LR), in-plane rotation (IPR), out-of-plane rotation (OPR), and out-of-view (OV). The best three results are shown in red, blue, and green fonts.
| Attributes | IVT | MTT | L1APG | LSK | ASLA | DSSM | VTD | TLD | JSRFFT | SST | SCM | Struck | OURS |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.220 | 0.413 | 0.365 | 0.375 | 0.253 | 0.397 | 0.353 | 0.401 | 0.393 | 0.331 | ||||
| 0.494 | 0.461 | 0.472 | 0.480 | 0.552 | 0.422 | 0.597 | 0.606 | 0.513 | 0.541 | ||||
| 0.455 | 0.433 | 0.461 | 0.534 | 0.460 | 0.401 | 0.546 | 0.563 | 0.557 | 0.486 | ||||
| 0.421 | 0.424 | 0.425 | 0.504 | 0.496 | 0.319 | 0.571 | 0.428 | 0.511 | 0.503 | ||||
| 0.409 | 0.332 | 0.383 | 0.481 | 0.445 | 0.519 | 0.501 | 0.512 | 0.482 | |||||
| 0.222 | 0.308 | 0.375 | 0.324 | 0.278 | 0.320 | 0.375 | 0.426 | 0.339 | 0.410 | ||||
| 0.418 | 0.359 | 0.341 | 0.449 | 0.516 | 0.359 | 0.557 | 0.537 | 0.307 | 0.558 | ||||
| 0.278 | 0.460 | 0.304 | 0.156 | 0.358 | 0.168 | 0.349 | 0.274 | 0.305 | 0.385 | ||||
| 0.457 | 0.528 | 0.518 | 0.534 | 0.511 | 0.405 | 0.584 | 0.510 | 0.584 | 0.596 | ||||
| 0.464 | 0.478 | 0.478 | 0.525 | 0.518 | 0.319 | 0.596 | 0.493 | 0.532 | 0.597 | ||||
| 0.307 | 0.374 | 0.329 | 0.515 | 0.333 | 0.384 | 0.462 | 0.396 | 0.490 | 0.429 | ||||
| 0.499 | 0.479 | 0.485 | 0.505 | 0.532 | 0.438 | 0.576 | 0.608 | 0.558 | 0.563 |
Average success scores on different attributes: fast motion (FM), scale variation (SV), occlusion (OCC), background clutter (BC), deformation (DEF), motion blur (MB), illumination variation (IV), low-resolution (LR), in-plane rotation (IPR), out-of-plane rotation (OPR), out-of-view(OV). The best three results are shown in red, blue, and green fonts. The last row shows comparison results regarding computational loads in terms of fps.
| Attributes | IVT | MTT | L1APG | LSK | ASLA | DSSM | VTD | TLD | JSRFFT | SST | SCM | Struck | OURS |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.202 | 0.338 | 0.311 | 0.328 | 0.248 | 0.332 | 0.303 | 0.420 | 0.341 | 0.296 | ||||
| 0.344 | 0.348 | 0.350 | 0.373 | 0.318 | 0.405 | 0.424 | 0.367 | 0.405 | 0.425 | ||||
| 0.325 | 0.345 | 0.353 | 0.409 | 0.376 | 0.349 | 0.404 | 0.405 | 0.411 | 0.365 | ||||
| 0.291 | 0.337 | 0.350 | 0.388 | 0.408 | 0.321 | 0.425 | 0.348 | 0.401 | 0.394 | ||||
| 0.281 | 0.280 | 0.311 | 0.377 | 0.372 | 0.342 | 0.377 | 0.381 | 0.360 | 0.382 | 0.393 | |||
| 0.197 | 0.274 | 0.310 | 0.302 | 0.258 | 0.297 | 0.309 | 0.313 | 0.336 | 0.298 | ||||
| 0.306 | 0.308 | 0.283 | 0.371 | 0.429 | 0.317 | 0.420 | 0.402 | 0.291 | 0.427 | ||||
| 0.238 | 0.235 | 0.157 | 0.284 | 0.177 | 0.312 | 0.191 | 0.279 | 0.372 | 0.370 | ||||
| 0.330 | 0.398 | 0.391 | 0.411 | 0.425 | 0.347 | 0.430 | 0.419 | 0.413 | 0.443 | ||||
| 0.323 | 0.364 | 0.360 | 0.400 | 0.422 | 0.331 | 0.423 | 0.411 | 0.409 | 0.431 | ||||
| 0.274 | 0.342 | 0.303 | 0.430 | 0.312 | 0.348 | 0.446 | 0.350 | 0.384 | 0.361 | ||||
| 0.358 | 0.378 | 0.381 | 0.395 | 0.434 | 0.362 | 0.416 | 0.437 | 0.429 | 0.415 | ||||
| 1.2 | 2.1 | 5.3 | 8.8 | 1.2 | 5.8 | 1.7 | 1.3 | 0.6 | 3.1 |