| Literature DB >> 35626567 |
Abstract
Shannon's entropy is one of the building blocks of information theory and an essential aspect of Machine Learning (ML) methods (e.g., Random Forests). Yet, it is only finitely defined for distributions with fast decaying tails on a countable alphabet. The unboundedness of Shannon's entropy over the general class of all distributions on an alphabet prevents its potential utility from being fully realized. To fill the void in the foundation of information theory, Zhang (2020) proposed generalized Shannon's entropy, which is finitely defined everywhere. The plug-in estimator, adopted in almost all entropy-based ML method packages, is one of the most popular approaches to estimating Shannon's entropy. The asymptotic distribution for Shannon's entropy's plug-in estimator was well studied in the existing literature. This paper studies the asymptotic properties for the plug-in estimator of generalized Shannon's entropy on countable alphabets. The developed asymptotic properties require no assumptions on the original distribution. The proposed asymptotic properties allow for interval estimation and statistical tests with generalized Shannon's entropy.Entities:
Keywords: Shannon’s entropy; asymptotic normality; generalized Shannon’s entropy; plug-in estimation
Year: 2022 PMID: 35626567 PMCID: PMC9141039 DOI: 10.3390/e24050683
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.738
Figure 1Effectiveness of the 95% confidence intervals as a function of sample size. Simulations from Zeta distribution with and GSE with order . The horizontal dashed line is at 0.95.
Figure 2Effectiveness of the 95% confidence intervals as a function of sample size. Simulations from Zeta distribution with and GSE with order . The horizontal dashed line is at 0.95.
Figure 3Effectiveness of the 95% confidence intervals as a function of sample size. Simulations from Zeta distribution with and GSE with order . The horizontal dashed line is at 0.95.
Figure 4Effectiveness of the 95% confidence intervals as a function of sample size. Simulations from Zeta distribution with and GSE with order . The horizontal dashed line is at 0.95.