| Literature DB >> 33286178 |
Julianna Pinele1, João E Strapasson2, Sueli I R Costa3.
Abstract
The Fisher-Rao distance is a measure of dissimilarity between probability distributions, which, under certain regularity conditions of the statistical model, is up to a scaling factor the unique Riemannian metric invariant under Markov morphisms. It is related to the Shannon entropy and has been used to enlarge the perspective of analysis in a wide variety of domains such as image processing, radar systems, and morphological classification. Here, we approach this metric considered in the statistical model of normal multivariate probability distributions, for which there is not an explicit expression in general, by gathering known results (closed forms for submanifolds and bounds) and derive expressions for the distance between distributions with the same covariance matrix and between distributions with mirrored covariance matrices. An application of the Fisher-Rao distance to the simplification of Gaussian mixtures using the hierarchical clustering algorithm is also presented.Entities:
Keywords: Fisher–Rao distance; Gaussian mixture simplification; information geometry; multivariate normal distributions
Year: 2020 PMID: 33286178 PMCID: PMC7516881 DOI: 10.3390/e22040404
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
The lower bound and the upper bounds , and for the Fisher–Rao distance, , between distributions and in . is the distance between univariate normal distributions given in Equation (22).
|
|
|
|
| |
|
| |
|
|
|
Figure 1A comparison between the bounds , , , , and . (, , and are fixed, and varies from zero to ).
Figure 2A comparison between the bounds , , , , and . (, , and are fixed, and varies from zero to ).
Figure 3(a) A comparison between the bounds , , , , and . (, , and the rotation angle are fixed, and varies from zero to 10). (b) A comparison between the bounds , , and . (, , and the rotation angle are fixed, and varies from zero to 10).
Figure 5Example of level curves of mirrored distributions where and are given by Equation (40).
Figure 6Approximation of the geodesic curve connecting and via the geodesic shooting algorithm. The level curve of is the dashed one.
Figure 7Contour curves of distributions and .
A time comparison between the numerical method proposed here and the geodesic shooting to calculate the distance between two mirrored distributions.
|
|
| Time Systems (s) | Time G.Shooting (s) |
|---|---|---|---|
| 1 | 2.77395 | 0.046875 | 4.70313 |
| 2 | 3.67027 | 0.046875 | 5.60938 |
| 3 | 4.52933 | 0.0625 | 7.10938 |
| 4 | 5.26093 | 0.078125 | 9.17188 |
| 5 | 5.87480 | 0.046875 | 12.5313 |
| 6 | 6.39439 | 0.0625 | 18.4219 |
| 7 | 6.84043 | 0.078125 | 492.563 |
| 8 | 7.22903 | 0.0625 | 574.422 |
| 9 | 7.57221 | 0.046875 | 917.859 |
| 10 | 7.87896 | 0.046875 | 1007.13 |
Closed forms for the Fisher–Rao distance in submanifolds of and the distance in between pairs of special distributions.
| Distance in Non-totally Geodesic Submanifolds | |
|---|---|
| Submanifold | Distance |
|
|
|
|
|
|
|
| |
|
| |
|
|
|
|
| |
| Distributions with Common Covariance Matrices, |
|
| Mirrored Distributions, | |
Figure 8Illustration of the mixture simplification using the Fisher–Rao clustering, where l is the number of components of the mixture (the last column is the original figure).
Figure 9Illustration of the simplification quality of the mixture modeling Baboon image.
Figure 10Illustration of the simplification quality of the mixture modeling Lena image.
Figure 11Illustration of the simplification quality of the mixture modeling Clown image.