Literature DB >> 33445650

Distribution-Dependent Weighted Union Bound.

Luca Oneto1, Sandro Ridella2.   

Abstract

In this paper, we deal with the classical Statistical Learning Theory's problem of bounding, with high probability, the true risk R(h) of a hypothesis h chosen from a set H of m hypotheses. The Union Bound (UB) allows one to state that PLR^(h),δqh≤R(h)≤UR^(h),δph≥1-δ where R^(h) is the empirical errors, if it is possible to prove that P{R(h)≥L(R^(h),δ)}≥1-δ and P{R(h)≤U(R^(h),δ)}≥1-δ, when h, qh, and ph are chosen before seeing the data such that qh,ph∈[0,1] and ∑h∈H(qh+ph)=1. If no a priori information is available qh and ph are set to 12m, namely equally distributed. This approach gives poor results since, as a matter of fact, a learning procedure targets just particular hypotheses, namely hypotheses with small empirical error, disregarding the others. In this work we set the qh and ph in a distribution-dependent way increasing the probability of being chosen to function with small true risk. We will call this proposal Distribution-Dependent Weighted UB (DDWUB) and we will retrieve the sufficient conditions on the choice of qh and ph that state that DDWUB outperforms or, in the worst case, degenerates into UB. Furthermore, theoretical and numerical results will show the applicability, the validity, and the potentiality of DDWUB.

Entities:  

Keywords:  distribution-dependent weights; finite number of hypothesis; statistical learning theory; union bound; weighted union bound

Year:  2021        PMID: 33445650      PMCID: PMC7827710          DOI: 10.3390/e23010101

Source DB:  PubMed          Journal:  Entropy (Basel)        ISSN: 1099-4300            Impact factor:   2.524


  3 in total

1.  A local Vapnik-Chervonenkis complexity.

Authors:  Luca Oneto; Davide Anguita; Sandro Ridella
Journal:  Neural Netw       Date:  2016-07-18

2.  Genome-Wide Significance Levels and Weighted Hypothesis Testing.

Authors:  Kathryn Roeder; Larry Wasserman
Journal:  Stat Sci       Date:  2009-11       Impact factor: 2.901

3.  A Novel Boolean Kernels Family for Categorical Data.

Authors:  Mirko Polato; Ivano Lauriola; Fabio Aiolli
Journal:  Entropy (Basel)       Date:  2018-06-06       Impact factor: 2.524

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.