Literature DB >> 33265349

Information Geometry for Covariance Estimation in Heterogeneous Clutter with Total Bregman Divergence.

Xiaoqiang Hua¹, Yongqiang Cheng¹, Hongqiang Wang¹, Yuliang Qin¹.

Abstract

This paper presents a covariance matrix estimation method based on information geometry in a heterogeneous clutter. In particular, the problem of covariance estimation is reformulated as the computation of geometric median for covariance matrices estimated by the secondary data set. A new class of total Bregman divergence is presented on the Riemanian manifold of Hermitian positive-definite (HPD) matrix, which is the foundation of information geometry. On the basis of this divergence, total Bregman divergence medians are derived instead of the sample covariance matrix (SCM) of the secondary data. Unlike the SCM, resorting to the knowledge of statistical characteristics of the sample data, the geometric structure of matrix space is considered in our proposed estimators, and then the performance can be improved in a heterogeneous clutter. At the analysis stage, numerical results are given to validate the detection performance of an adaptive normalized matched filter with our estimator compared with existing alternatives.

Entities: Chemical Disease

Keywords: adaptive normalized matched filter; covariance matrix estimation; information geometry; total Bregman divergence

Year: 2018 PMID： 33265349 PMCID： PMC7512773 DOI： 10.3390/e20040258

Source DB: PubMed Journal: Entropy (Basel) ISSN： 1099-4300 Impact factor: 2.524

1. Introduction

The estimation of the disturbance covariance matrix is an important subject in the field of advanced radar signal processing. Many algorithms manipulate the covariance matrix of sample data, such as array signal processing [1,2], multichannel signal processing [3,4], adaptive radar detection [5,6,7], and space-time adaptive processing [8,9,10]. The commonly used estimator is the sample covariance matrix (SCM), which is often derived from the maximum-likelihood (ML) of K N-dimensional secondary data. A condition is assumed that these K secondary data are independent and identical distributed zero-mean complex circular Gaussian vectors. Usually, the solution of this ML exists when the amount of secondary data K is greater than the matrix dimension N . In particular, it can achieve a good performance for [11]. Unfortunately, in real heterogeneous clutter, the statistical distribution of the whole environment is very difficult to obtain, since the secondary data could be contaminated by power variations, clutter discretes, and/or outliers. Therefore, it is very necessary and meaningful to obtain the estimation of disturbance matrix that is not relying on the statistical characterization of the whole environment. Many covariance estimation algorithms derived from the geometry of matrix space, not resorting to the statistical characterization of sample data, are reported in the literature. For instance, the Riemannian mean and median of covariance matrices is proposed to design the radar target detector [12,13,14,15,16]. In [17,18,19,20], the median is used for covariance estimation in many radar processing applications. The Riemannian mean is utilized to estimate the covariance matrix in space-time adaptive processing [21,22,23]. The results have shown that the projection algorithm with Riemannian mean can yield significant performance gains. In [24,25], some geometric barycenter and medians are proposed for radar training data selection in a homogeneous environment. In [26], a geometric method is presented for covariance estimation, where each estimator is associated with a given unitary invariant norm and performs the sample covariance matrix projection into a specific set of structured covariance matrices. Recently, geometric Barycenters of symmetric positive definite matrices have been used for covariance estimators in compound-Gaussian clutter [27]. Moreover, in our previous work [28,29], we have proposed a lot of divergence means and medians of covariance matrices computed in a neighborhood of the cell under test for radar target detection. Finally, the geometric approach is used also in many other applications; for instance, the Bhattacharyya mean and median are exploited for diffusion tensor magnetic resonance (DT-MRI) image segmentation [30,31]. In these contexts, the geometric approaches have achieved good performances. In this paper, a new class of total Bregman divergence (tBD) is presented on the Riemannian manifold. Based on the tBD, the three geometric medians, including total square loss (TSL) median, total von Neumann (TVN) median, and total LogDet (TLD) median, are derived, and are used as the estimators of disturbance matrix. As a matter of fact, the SCM of K secondary data can be seen as the arithmetic mean of K autocovariance matrices of rank one. The arithmetic mean ignores the fact that these matrices lie in a nonlinear matrix space, the Riemannian manifold, which is the foundation of information geometry. As the geometric medians are not relying on the statistical characteristics of the sample data in heterogeneous clutter, the performance of covariance estimation can be improved. The rest of this paper is organized as follows: Section 2 formulates the problem of covariance estimation from information geometry; the total Bregman divergence-based estimators are presented in Section 3; Section 4 gives the theoretical analysis about the robustness of the proposed estimators; experimental results are presented in Section 5; Section 6 concludes our work. Notation: Here are some notations for the description of this article. A matrix and a vector are noted as uppercase bold and lowercase bold, respectively. The conjugate transpose of matrix is denoted as . is the trace of matrix . is the determinant of matrix . denotes the identity matrix. Finally, denotes statistical expectation.

2. Problem Formulated from Information Geometry

In this section, a heterogeneous environment is considered for covariance estimation. The secondary data is a spherically invariant random vector, and can be expressed as, where is a nonnegative scalar random variable, and is a N-dimensional circularly symmetric zero-mean vectors with an arbitrary joint statistical distribution and sharing the same covariance matrix, where † denotes the conjugate transpose. Since the knowledge of the statistical characterization of the secondary data is not known in heterogeneous clutter, the classic approaches, e.g., ML and minimum mean-square error, cannot be applied for covariance estimation of the sample data. Thus, other covariance estimators, not dependent on the probability distribution of the whole environment, are very promising. In this paper, a covariance estimation method based on information geometry is proposed. Recall that the SCM of K secondary data can be given as where is an autocovariance matrix of rank one, and is singular. It can be noted from Equation (3) that is the arithmetic mean of K autocovariance matrices. In fact, these matrices lie in the nonlinear Hermitian matrix space, as the sample data is complex. It is well known that Hermitian positive-definite (HPD) matrices form a differentiable Riemannian (and also a Finsler) manifold [32,33], that is the most studied example of a manifold with nonpositive curvature [34,35]. In order to facilitate the analysis, the singular matrix is positive by adding an identity matrix, as = + . Then, the disturbance covariance matrix can be estimated by a median related to a divergence on the Riemannian manifold of HPD matrices. As illustrated in Figure 1, the geometric median is performed on the Riemannian manifold of HPD matrices with a non-Euclidean metric, whereas the arithmetic mean is considered in the Euclidean space. The difference implies that the different geometric structures are considered in these two estimators. Moreover, for different metrics on the Riemannian manifold, the performance of covariance estimation may be very different. These results will be found in our other reports [36]. In the next section, a new class of tBD is proposed on the Riemannian manifold. On basis of the tBD, the tBD median is derived and used for the estimator.

Figure 1

The arithmetic mean and geometric median.

3. Total Bregman Divergence-Based Estimators on the Manifold

In this section, a new class of tBD is proposed on the Riemannian manifold. Then, the medians associated with the tBD are derived.

3.1. The Geometry of HPD Matrices

Let denote the space of Hermitian matrix. For a Hermitian matrix , if the quadratic form , then is an HPD matrix, where is the space of n-dimensional complex vectors. All HPD matrices consist of a positive-definite Hermitian matrix space , which forms a Riemannian manifold of dimension with a constant non-positive curvature [35]. For a point on the Riemannian manifold, the infinitesimal arclength between and is given by where defines a metric on the Riemannian manifold [37]. is the Frobenius norm of a matrix. The inner product and corresponding norm on the tangent space at the point can be defined as [38] For two points and on the Riemannian manifold, the affine invariant (Riemannian) distance is given by [39] where is the logarithmic map on the Riemannian manifold of HPD matrices.

3.2. Total Bregman Divergence

The tBD has been proposed on the space of convex functions by Baba C. Vemuri, and has been used for DT-MRI analysis [40], shape retrieval [41,42], and object tracking [43]. For , f is a differentiable, strictly convex function, the total Bregman divergence between x, y is defined as [40] where is the derivative of at y. As illustrated in Figure 2, the tBD between x and y is defined as the orthogonal distance between the value of a convex and differentiable function f at the first argument of y and its tangent at the second argument. It can be also noted from Figure 2 that when the coordinate system rotates at an angle, the tBD between x and y does not change. This implies that the tBD is invariant to linear transformation; in other words, it is statistically robust.

Figure 2

The geometric definition of total Bregman divergence.

In the following, we extend the definition of tBD to the Riemannian manifold of HPD matrices. Let f be a strictly convex and differentiable function, for two HPD matrices In the following, we give the definitions of the three new divergences when the function f has various forms. Let , then , , and Equation (9) denotes the TSL If , then , , and Equation (9) yields the divergence, which is called the TVN, When , then , , and we obtain the divergence referred to as the TLD or the total Stein loss, Based on the three divergences on the Riemannian manifold, the medians for a finite set of HPD matrices are derived in the next section.

3.3. Total Bregman Divergence Median for HPD Matrices

For a set of HPD matrices, the median related to a measure is defined as the minimizer of the sum of the distance to the given matrices. Let f be a differentiable convex function; then, the median related to the tBD Equation ( It is worth pointing out that will be the arithmetic mean, if the Frobenius norm, instead of the tBD, is utilized. The median for a finite set of HPD matrices with respect to the tBD exists, and is unique. To find the median Setting the gradient equal to zero, Solving Equation (15) yields The tBD is a convex function, and , the sum of m convex functions, is also a convex function. Hence, the median indeed exists. Moreover, since f is a convex function, and is monotonic, the median is unique. In the following, three concrete cases of the tBD are derived when f is given as different forms. The median related to the TSL (10), of m HPD matrices The median related to the TVN divergence (11) of m HPD matrices The median related to the TLD divergence (12), of m HPD matrices With the help of the formulas These three equations are for the three propositions above in the same order. Equating these gradients to zero and solving for yield the medians.

4. Robustness Analysis of Total Bregman Divergence Median

This section is devoted to analyzing the robustness of total Bregman divergence median and arithmetic mean via the influence function. Let be the median, associated with total Bregman divergence, of m HPD matrices . is the median by adding a set of n outliers with a weight to . Then, we can define , and denotes the influence function. In the following, four propositions are presented. The influence function of arithmetic mean related to the Frobenius norm, of m HPD matrices Let The derivative of objection function is Note that is the median of m matrices and n outliers, and is the median of m matrices; then, we have and Substitute into Equation (26), and we have The influence function of TSL median related to TSL, of m HPD matrices The influence function of TVN median related to TVN, of m HPD matrices The influence function of TLD median related to TLD, of m HPD matrices According to Equation ( Note that is the median of m matrices and n outliers; then, we have, Using Taylor expansions on on the derivative function , we can obtain Substituting Equation (34) into Equation (33), As is the median of m matrices, we have Substituting Equation (36) into Equation (35), and ignoring the term containing , Finally, the influence function of total Bregman divergence median can be derived Let ; then, and . Substitute these into Equation (38), and we can obtain Proposition 6. When , then and . Substitute these into Equation (38), and we can obtain Proposition 7. If , then and . Substitute these into Equation (38), and we can obtain Proposition 8. In addition, we show that the influence value of total Bregman divergence median has an upper bound when the number of outlier . Let Then, Equation (38) can be rewritten as Note that and ; then, we have the following inequality: where c is a constant. From the inequality (41), we can know that the influence function has an upper bound when varies. It implies that our proposed tBD medians are robust.

5. Numerical Simulations

In this section, the performances of adaptive normalized matched filter with the proposed estimators and the normalized sample covariance matrix (NSCM) are evaluated by means of the standard Monte Carlo techniques, as the analytical expression for the probability of detection is not available. Particularly, consider a classical target detection problem, which is formulated as follows: where and denote the received signal and the clutter data in the cell under test, respectively. is an unknown complex quantity, accounting for the target radar cross section and the channel propagation effects. is the target steering vector, where is the normalized Doppler frequency. The terms and are the spherically invariant random vectors, and can be formulated as, where and are N-dimensional circularly symmetric zero-mean Gaussian vectors, and share the same covariance matrix . and are positive and real random variables, which are also independent and identical distributed. In particular, the terms and are assumed to follow the inverse gamma distribution, where and denote the shape and scale parameters, respectively. is the gamma function. For the target detection problem (42), the adaptive normalized matched filtering (ANMF) is often used, which can be given by where is an estimator that is based on the secondary data . denotes the threshold, which is derived by the Monte Carlo method in order to maintain the false alarm constant. In addition, the covariance matrix of and are given as the sum of two parts, where is accounting for the thermal noise. is related to the clutter, modeled as, where is the one-lag correlation coefficient. is the clutter-to-noise power ratio. is the clutter normalized Doppler frequency. In this simulation, we set = , = , and = 25 dB. The parameters = 3, and = 1. Performances of ANMFs with our proposed estimators are compared with the Frechet median estimator [18] and the NSCM estimator. The plots of versus signal-to-clutter ratio for different size of K, the number of secondary data, are shown in Figure 3 and Figure 4. The is estimated by the relative frequencies of 1000 simulations. The detection threshold is obtained using Monte Carlo trials, for a fixed nominal . It can be noted from Figure 3 and Figure 4 that the detection performances of ANMFs with our proposed estimators and the Frechet median estimator outperform that of ANMF with the NSCM estimator. The ANMF with the TLD estimator has the best performance, followed by the TVN estimator. The Frechet median estimator has similar performance to the TVN estimator. However, the difference is not very clear. This implies that our proposed estimators are robust for the size of K, unlike the NSCM estimator.

Figure 3

Pd versus SCR plots of ANMFs with the proposed estimators and the NSCM estimator, . (a) ; (b) .

Figure 4

Pd versus SCR plots of ANMFs with the proposed estimators and the NSCM estimator, . (a) ; (b) .

In order to analyze the robustness of total Bregman divergence median and arithmetic mean estimators, we inject a outlier with normalized Doppler frequency in 30 sample data, and compute the influence value according to (28), (29), (30), and (31). The results are shown in Figure 5. From Figure 5, we can know that the TLD median estimator has the smallest influence value, followed by the TSL median estimator. However, all of our proposed three estimators have smaller influence value than the arithmetic mean estimator, which is the NSCM estimator. These results imply that our proposed estimators are more robust than the NSCM estimator.

Figure 5

The influence value of arithmetic mean and total Bregman divergence median.

To enhance the persuasiveness of the results, we inject a outlier with the = , and = 20 dB. Then, the plots of versus of ANMF with different estimators are given in Figure 6 and Figure 7. It is clear from Figure 6 and Figure 7 that the proposed TSL, TLD, and TVN estimators outperform the NSCM in a contaminated clutter. Moreover, the performance of the Frechet median estimator is close to our proposed estimators.

Figure 6

Pd versus SCR plots of ANMFs with the proposed estimators and the NSCM estimator, . (a) ; (b) .

Figure 7

Pd versus SCR plots of ANMFs with the proposed estimators and the NSCM estimator, . (a) ; (b) .

6. Conclusions

In this work, we have proposed a covariance estimation method based on information geometry in a heterogeneous clutter. The problem of covariance estimation has been reformulated as the geometric median related to a measure. In particular, the three tBDs, including the TSL, the TLD, and the TVN, have been proposed on the Riemannian manifold. Then, we have derived the TSL, the TLD, and the TVN median estimators. At the analysis stage, the results of numerical experiments have highlighted that our proposed estimators outperform the NSCM estimator, and have similar performance to the Frechet median estimator in heterogeneous clutter.

3 in total

1. BHATTACHARYYA MEDIAN OF SYMMETRIC POSITIVE-DEFINITE MATRICES AND APPLICATION TO THE DENOISING OF DIFFUSION-TENSOR FIELDS.

Authors: Malek Charfi; Zeineb Chebbi; Maher Moakher; Baba C Vemuri
Journal: Proc IEEE Int Symp Biomed Imaging Date: 2013

2. Shape retrieval using hierarchical total Bregman soft clustering.

Authors: Meizhu Liu; Baba C Vemuri; Shun-Ichi Amari; Frank Nielsen
Journal: IEEE Trans Pattern Anal Mach Intell Date: 2012-12 Impact factor: 6.226

3. Total Bregman divergence and its applications to DTI analysis.

Authors: Baba C Vemuri; Meizhu Liu; Shun-Ichi Amari; Frank Nielsen
Journal: IEEE Trans Med Imaging Date: 2010-10-14 Impact factor: 10.048

3 in total