| Literature DB >> 35874116 |
Ting-Kam Leonard Wong1, Jiaowen Yang2.
Abstract
Optimal transport and information geometry both study geometric structures on spaces of probability distributions. Optimal transport characterizes the cost-minimizing movement from one distribution to another, while information geometry originates from coordinate invariant properties of statistical inference. Their relations and applications in statistics and machine learning have started to gain more attention. In this paper we give a new differential-geometric relation between the two fields. Namely, the pseudo-Riemannian framework of Kim and McCann, which provides a geometric perspective on the fundamental Ma-Trudinger-Wang (MTW) condition in the regularity theory of optimal transport maps, encodes the dualistic structure of statistical manifold. This general relation is described using the framework of c-divergence under which divergences are defined by optimal transport maps. As a by-product, we obtain a new information-geometric interpretation of the MTW tensor on the graph of the transport map. This relation sheds light on old and new aspects of information geometry. The dually flat geometry of Bregman divergence corresponds to the quadratic cost and the pseudo-Euclidean space, and the logarithmic L ( α ) -divergence introduced by Pal and the first author has constant sectional curvature in a sense to be made precise. In these cases we give a geometric interpretation of the information-geometric curvature in terms of the divergence between a primal-dual pair of geodesics.Entities:
Keywords: Bregman divergence; Information geometry; Logarithmic divergence; Ma–Trudinger–Wang tensor; Optimal transport; Pseudo-Riemannian geometry; c-Divergence
Year: 2021 PMID: 35874116 PMCID: PMC9296067 DOI: 10.1007/s41884-021-00053-7
Source DB: PubMed Journal: Inf Geom ISSN: 2511-249X
Fig. 1Illustration of the c-divergence. The black curve shows the graph G of an optimal transport map. The colour represents the value of which is non-negative and zero only when belongs to the graph G. This value defines the divergence where and are respectively the horizontal and vertical projections of onto G
Fig. 2The graph G of optimal transport as an n-dimensional submanifold of . The primal and dual coordinates correspond to projections onto M and respectively
Fig. 3The projection maps