| Literature DB >> 33266683 |
Mateu Sbert1,2, Min Chen3, Jordi Poch2, Anton Bardera2.
Abstract
Cross entropy and Kullback-Leibler (K-L) divergence are fundamental quantities of information theory, and they are widely used in many fields. Since cross entropy is the negated logarithm of likelihood, minimizing cross entropy is equivalent to maximizing likelihood, and thus, cross entropy is applied for optimization in machine learning. K-L divergence also stands independently as a commonly used metric for measuring the difference between two distributions. In this paper, we introduce new inequalities regarding cross entropy and K-L divergence by using the fact that cross entropy is the negated logarithm of the weighted geometric mean. We first apply the well-known rearrangement inequality, followed by a recent theorem on weighted Kolmogorov means, and, finally, we introduce a new theorem that directly applies to inequalities between K-L divergences. To illustrate our results, we show numerical examples of distributions.Entities:
Keywords: Kolmogorov mean; Kullback–Leibler divergence; cross entropy; generalized mean; likelihood; stochastic dominance; stochastic order; weighted mean
Year: 2018 PMID: 33266683 PMCID: PMC7512543 DOI: 10.3390/e20120959
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1Plotting the sums in Equation (1) for relations 1–5 in Example 2.
Figure 2Plotting the logarithm of products in Equation (4) for relations 1, 3, 6, and 7 in Example 3.