| Literature DB >> 33495693 |
Benjamin Kedem1, Saumyadipta Pyne2,3.
Abstract
Synthetic data, when properly used, can enhance patterns in real data and thus provide insights into different problems. Here, the estimation of tail probabilities of rare events from a moderately large number of observations is considered. The problem is approached by a large number of augmentations or fusions of the real data with computer-generated synthetic samples. The tail probability of interest is approximated by subsequences created by a novel iterative process. The estimates are found to be quite precise. © Grace Scientific Publishing 2021.Entities:
Keywords: B-curve; Density ratio model; Iterative process; Repeated out of sample fusion; Residential radon; Upper bounds
Year: 2021 PMID: 33495693 PMCID: PMC7816841 DOI: 10.1007/s42519-020-00152-1
Source DB: PubMed Journal: J Stat Theory Pract ISSN: 1559-8608
Fig. 1B-Curves, 10,000 B’s, from residential radon sample . , , , , . values: top left 77.9, top right 107. Bottom left 143, bottom right 193.7. The point “” moves to the left as increases relative to . The fusion samples are uniform with support covering T
, , , , , , p-increment 0.000018
| Starting | Convergence to | Iterations | |
|---|---|---|---|
| 1000 | 0.0007009389 | 3 | Down |
| 802 | 0.0002869389 | 1 | Down |
| 761 | 0.0002689389 | 1 | Down |
| 757 | 0.0002689389 | 1 | Down |
| 755 | 0.0002689389 | 1 | Down |
| 754 | 0.0002689389 | 1 | Up |
| 751 | 0.0002689389 | 1 | Up |
| 750 | 0.0002689389 | 1 | Up |
| 740 | 0.0002689389 | 1 | Up |
| 738 | 0.0002689389 | 1 | Up |
, , , , , , p-increment 0.00002
| Starting | Convergence to | Iterations | |
|---|---|---|---|
| 800 | 0.0003401254 | 18 | Down |
| 750 | 0.0003001254 | 18 | Down |
| 140 | 0.0002801254 | 2 | Down |
| 135 | 0.0002601254 | 1 | Down |
| 133 | 0.0002601254 | 1 | Down |
| 130 | 0.0002601254 | 1 | Up |
| 122 | 0.0002601254 | 1 | Up |
| 121 | 0.0002601254 | 1 | Up |
| 120 | 0.0002601254 | 1 | Up |
| 112 | 0.0002601254 | 1 | Up |
, , , , , , p-increment 0.00001
| Starting | Convergence to | Iterations | |
|---|---|---|---|
| 800 | 0.0003600818 | 21 | Down |
| 600 | 0.0002600818 | 19 | Down |
| 440 | 0.0002700818 | 9 | Down |
| 300 | 0.0002600818 | 4 | Down |
| 246 | 0.0002600818 | 1 | Down |
| 245 | 0.0002600818 | 1 | Down |
| 244 | 0.0002600818 | 1 | Up |
| 243 | 0.0002600818 | 1 | Up |
| 240 | 0.0002600818 | 1 | Up |
| 237 | 0.0002600818 | 1 | Up |
| 222 | 0.0002500818 | 1 | Up |
| 200 | 0.0002400818 | 1 | Up |
, , , , , , p-increment 0.00004583
| Starting | Convergence to | Iterations | |
|---|---|---|---|
| 1000 | 0.0002744504 | 2 | Down |
| 999 | 0.0002744504 | 1 | Down |
| 998 | 0.0002744504 | 1 | Down |
| 997 | 0.0002286204 | 2 | Down |
| 996 | 0.0002286204 | 1 | Down |
| 994 | 0.0002286204 | No | |
| 993 | 0.0002286204 | 1 | Up |
| 992 | 0.0002286204 | 1 | Up |
| 991 | 0.0002286204 | 1 | Up |
| 990 | 0.0002286204 | 1 | Up |
| 989 | 0.0002286204 | 1 | Up |
| 988 | 0.0002286204 | 1 | Up |
, , , ,
| Error | |||
|---|---|---|---|
| 77.9 | 0.00004583 | 0.0002286204 | 4.073987e–05 |
| 107.0 | 0.00002000 | 0.0002589389 | 1.042137e–05 |
| 107.0 | 0.00002500 | 0.0002739389 | 4.578631e–06 |
| 107.0 | 0.00003000 | 0.0002689389 | 4.213694e–07 |
| 107.0 | 0.00001800 | 0.0002689389 | 4.213694e–07 |
| 107.0 | 0.00002686 | 0.0002675389 | 1.821369e–06 |
| 107.0 | 0.00001175 | 0.0002574389 | 1.192137e–05 |
| 113.7 | 0.00002200 | 0.0002637656 | 5.594700e–06 |
| 123.1 | 0.00002000 | 0.0002601254 | 9.234869e–06 |
| 125.2 | 0.00002000 | 0.0002600310 | 9.329269e–06 |
| 130.7 | 0.00003000 | 0.0002639057 | 5.454600e–06 |
| 143.0 | 0.00002140 | 0.0002565210 | 1.283927e–05 |
| 193.7 | 0.00001000 | 0.0002600818 | 9.278469e–06 |
| 193.7 | 0.00002000 | 0.0002600818 | 9.278469e–06 |