| Literature DB >> 24919014 |
Jiulu Gong1, Guoliang Fan2, Liangjiang Yu3, Joseph P Havlicek4, Derong Chen5, Ningjun Fan6.
Abstract
We propose a new integrated target tracking, recognition and segmentation algorithm, called ATR-Seg, for infrared imagery. ATR-Seg is formulated in a probabilistic shape-aware level set framework that incorporates a joint view-identity manifold (JVIM) for target shape modeling. As a shape generative model, JVIM features a unified manifold structure in the latent space that is embedded with one view-independent identity manifold and infinite identity-dependent view manifolds. In the ATR-Seg algorithm, the ATR problem formulated as a sequential level-set optimization process over the latent space of JVIM, so that tracking and recognition can be jointly optimized via implicit shape matching where target segmentation is achieved as a by-product without any pre-processing or feature extraction. Experimental results on the recently released SENSIAC ATR database demonstrate the advantages and effectiveness of ATR-Seg over two recent ATR algorithms that involve explicit shape matching.Entities:
Year: 2014 PMID: 24919014 PMCID: PMC4118327 DOI: 10.3390/s140610124
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Descriptions of all mathematical symbols.
| Symbols used in Problem Formulation (Section 3) | |
|
| |
| the target's 3D position | |
| the target type | |
| the view angle | |
| shape related variables ([ | |
| a target shape segmentation | |
| an observed image frame at time | |
|
| |
| Symbols used in JVIM-based shape modeling (Section 4) | |
|
| |
| JVIM training data | |
| JVIM latent space | |
| the aspect angle of a target | |
| the elevation angle of a target | |
| the kernel hyper-parameters of JVIM | |
| the dimension of the shape space | |
| the LLE coefficients for local topology encoding | |
| the JVIM objective function | |
| the data term in | |
| the topology term in | |
| the covariance matrix of JVIM learning | |
| a reference latent point in JVIM learning | |
| the neighborhood of | |
| the size of | |
| the corresponding shape for | |
| the size of training data | |
| a new latent point for JVIM-based shape interpolation | |
| the neighborhood of | |
| the size of | |
| the corresponding shape data for | |
| a RBF kernel function in JVIM | |
| an interpolated shape at | |
|
| uncertainty of shape interpolation at |
|
| |
| Symbols used in shape-aware level set (Section 5) | |
|
| |
| a 2D pixel location in an image frame | |
| a pixel intensity value | |
| foreground/background models | |
| the smoothed Heaviside step function | |
|
| |
| Symbols used in sequential inference (Section 6) | |
|
| |
| the heading direction of a ground vehicle in frame | |
| the target velocity along | |
| the time interval of two adjacent frames | |
| the state vector in frame | |
Figure 1.Graphical modeling for the proposed ATR-Seg algorithm, where I represents an image frame, p 3D target position, Φ target segmentation, and Θ the set of shape variables.
Figure 2.Flowchart for ATR-Seg.
Figure 3.JVIM learning and shape interpolation, where one view-independent identity manifold and six identity-dependent view manifolds are color-coded according to the uncertainty of GP mapping. (Adapted from [24], with permission from Elsevier.)
Figure 4.Shape-aware level set model for implicit shape matching. (a) Illustration of a target in an infrared image: foreground Ω and background Ω, foreground/background intensity models M, and the 3D-2D camera projection W(x,p). (b) The shape embedding function Φ. (c) The graphical model for shape-aware level set, where p is the target 3D location of a ground-vehicle, and Θ is the shape parameter in JVIM.
Figure 5.Optimization of ATR-Seg by a gradient descent method.
Figure 6.Possible reasons for the failure of the gradient descent method.
Figure 7.The graphical model representation of ATR-Seg and the 3D camera coordinate. (Reprint from [28] with permission from IEEE).
Pseudo-code for ATR-Seg algorithm.
| • Initialization: Initialize the target position, |
| • For |
| 1. For |
| 1.1 Draw samples
|
| 1.2 Generate the target shape according to the target state using |
| 1.3 Compute weights
|
| End |
| 2. Normalize the weights, such that
|
| 3. Compute the mean estimates of the target state,
|
| 4. Set
|
| • End |
Figure 8.All 36 CAD models used in this work, which are ordered according to the class-constrained shortest-closed-path [8]. (Reprint from [24], with permission from Elsevier.)
Figure 9.Qualitative analysis of JVIM shape interpolation: (a) along six identity-specific view manifolds. (b) along the view-independent identity manifold between two training target types. (Reprint from [24], with permission from Elsevier.)
Figure 10.Comparison of the tracking errors of the horizontal position, slant range and heading direction. In each plot, the results for each method averaged over eight target types for each range. From left to right, the plot gives the results for JVIM (first, red), couplet of view and identity manifolds (CVIM) (second, blue), local linear (LL)-Gaussian process latent variable model (GPLVM) (third, cyan) and nearest neighbor (NN) (forth, green). (Reprint from [24], with permission from Elsevier).
Tracking errors for three ATR methods (Method I/Method II/Method III). (Reprint from [28], with permission from IEEE).
| 1 km | 0.22/0.25/0.22 | 0.22/0.25/0.18 | 0.16/0.17/0.19 | 0.21/0.23/ | |
| 5.06/8.67/7.53 | 4.19/4.48/5.14 | 9.03/8.36/10.95 | |||
| 0.13/0.17/0.18 | 0.15/0.32/0.15 | 0.11/0.24/0.22 | |||
| 1.5 km | 0.24/0.19/0.18 | 0.15/0.21/0.20 | 0.16/0.56/0.60 | ||
| 4.40/7.20/7.28 | 4.70/5.88/5.96 | – – NA – – | |||
| 0.16/0.22/0.24 | 0.18/0.53/0.51 | 0.11/0.32/0.35 | |||
| 2 km | 0.31/0.27/0.28 | 0.23/0.19/0.36 | 0.13/0.17/0.35 | 0.24/ | |
| 8.68/10.6/8.58 | 8.95/9.28/7.95 | 5.35/8.09/14.25 | |||
| 0.19/0.38/0.26 | 0.41/0.18/0.38 | 0.08/0.41/0.31 |
Recognition accuracy (%) for Methods I, II and III. (Reprint from [28], with permission from IEEE).
| 1 km | 100/96/96 | 100/100/94 | 100/100/100 | |
| 1.5 km | 98/96/94 | 99/100/89 | 100/100/100 | |
| 2 km | 98/92/86 | 100/100/85 | 100/100/98 |
Figure 11.ATR-Seg results for five IR sequences. Column 1: truth target types. Columns 2–5: selected IR frames overlayed with the segmentation results. Columns 6–7: the two best matched training targets along the identity manifold. (Reprint from [28], with permission from IEEE.)
Figure 12.Segmentation results of five targets at the range of 1.5 km.
Figure 13.Tracking failure for the pick-up and 2S3 sequences at 1.5 km.