| Literature DB >> 35534439 |
Tiziana Tuoto1,2, Davide Di Cecco1,2, Andrea Tancredi2.
Abstract
The identification and treatment of "one-inflation" in estimating the size of an elusive population has received increasing attention in capture-recapture literature in recent years. The phenomenon occurs when the number of units captured exactly once clearly exceeds the expectation under a baseline count distribution. Ignoring one-inflation has serious consequences for estimation of the population size, which can be drastically overestimated. In this paper we propose a Bayesian approach for Poisson, geometric, and negative binomial one-inflated count distributions. Posterior inference for population size will be obtained applying a Gibbs sampler approach. We also provide a Bayesian approach to model selection. We illustrate the proposed methodology with simulated and real data and propose a new application in official statistics to estimate the number of people implicated in the exploitation of prostitution in Italy.Entities:
Keywords: Bayesian model selection; capture-recapture; illegal populations; zero-truncated one-inflated count data models
Mesh:
Year: 2022 PMID: 35534439 PMCID: PMC9314905 DOI: 10.1002/bimj.202100187
Source DB: PubMed Journal: Biom J ISSN: 0323-3847 Impact factor: 1.715
Simulation scenarios with data‐generating models, parameter values, and expected sample size (the expected values of are common to all three scenarios)
| Scenario I | Scenario II | Scenario III | Distribution | ||
|---|---|---|---|---|---|
| No inflation | Low inflation, | Substantial inflation, |
| Parameter |
|
| Poi | OIP | OIP | 500 |
| 316 |
|
| 432 | ||||
| 1000 |
| 632 | |||
|
| 865 | ||||
| Geo | OIG | OIG | 500 |
| 300 |
|
| 200 | ||||
| 1000 |
| 600 | |||
|
| 400 | ||||
Relative bias (%) of the unobserved units estimates,
| Generating model |
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Model | Parameter | Inflation | Poi | Geo | OIP | OIG | Poi | Geo | OIP | OIG |
| Poi | 1 | None | 1.67 | 198 | −12 | 189 | 0.37 | 196 | −9 | 190 |
| Poi | 2 | None | 1.28 | 391 | −5.49 | 389 | 0.88 | 390 | −4.12 | 388 |
| Geo | 0.4 | None | −82 | −0.80 | −91 | −5.48 | −82 | −1.13 | −91 | −4.33 |
| Geo | 0.6 | None | −68 | 0.27 | −80 | −9.34 | −68 | 0.73 | −82 | −6.84 |
| OIP | 1 | 0.2 | 52 | 514 | 3.41 | 501 | 52 | 514 | 2.32 | 507 |
| OIP | 2 | 0.2 | 37 | 273 | 0.71 | 246 | 37 | 272 | 0.38 | 254 |
| OIP | 1 | 0.5 | 147 | 497 | 14 | 339 | 146 | 496 | 6.04 | 146 |
| OIP | 2 | 0.5 | 218 | 883 | 5.38 | 619 | 219 | 886 | 3.54 | 614 |
| OIG | 0.4 | 0.2 | −72 | 25 | −91 | 0.92 | −73 | 23 | −91 | −0.03 |
| OIG | 0.6 | 0.2 | −55 | 26 | −79 | 1.50 | −56 | 26 | −81 | 1.21 |
| OIG | 0.4 | 0.5 | −39 | 100 | −91 | 1.72 | −39 | 100 | −91 | 2.07 |
| OIG | 0.6 | 0.5 | −16 | 108 | −76 | 15 | −18 | 104 | −79 | 7.74 |
FIGURE 1Box‐plot of posterior model probabilities when ; the data‐generating model is indicated above each panel
FIGURE 2Box‐plot of posterior model probabilities when ; the data‐generating model is indicated above each panel
Boundary cases for and , %bias and %MSE of for some prior specifications of . Results from MLE in the bottom row, for comparison
|
| ||||
|---|---|---|---|---|
| Prior distribution of | % Boundary cases | % Boundary cases | % bias of | % MSE of |
| for | for | |||
| Gamma(0.1,0.1) | 33 | 30 | 218.59 | 1618.82 |
| Gamma(1,1) | 11 | 11 | 97.64 | 859.51 |
| InvGamma(0.1,0.1) | 0 | 0 | −10.52 | 6.71 |
| InvGamma(0.5,0.5) | 0 | 0 | −15.58 | 5.13 |
| InvGamma(1,1) | 0 | 0 | −19.06 | 5.27 |
| InvGamma(1,2) | 0 | 0 | −26.70 | 7.91 |
| MLE | 16 | 3 | 91.75 | 2217.32 |
Results on %bias and %MSE of
| Generating model: OINB with | ||||
|---|---|---|---|---|
|
|
| |||
| % bias of | % MSE of | % bias of | % MSE of | |
| Poi | −38.11 | 14.55 | −7.25 | 0.54 |
| Geo | 5.19 | 0.38 | 42.31 | 17.94 |
| NB (Gamma) |
|
|
|
|
| NB (InvGamma) | 2518 |
|
|
|
| OIP | −56.38 | 31.80 | −19.32 | 3.74 |
| OIG | ‐29.75 | 8.89 | 12.78 | 1.65 |
| OINB (Gamma) | 246 | 2898 | 1.81 | 0.25 |
| OINB (InvGamma) | −11.73 | 5.68 | 0.49 | 0.19 |
FIGURE 3Relative frequencies of observed counts for prostitution exploitation data in Italy in 2014
The posterior mode and credible intervals for the population size , posterior mean for and model parameters for prostitution exploitation data
| Estimator/model |
| 95%CI. |
|
|
| |
|---|---|---|---|---|---|---|
| Ignoring one–inflation | ||||||
| Poi | 7210 | [6780, 7689] | 0.476 | |||
| Geo | 13332 | [12415, 14394] | 0.795 | |||
| NB | 89140 | [35162, 188368] | 0.665 | 0.088 | ||
| Chao | 9851 | [8961, 10868] | ||||
| Zelterman | 10030 | [9033, 11027] | 0.319 | |||
| Modeling one–inflation |
| |||||
| OIP | 3895 | [3656, 4156] | 1.213 | 0.645 | ||
| OIG | 8182 | [7406, 9233] | 0.669 | 0.478 | ||
| OINB | 19566 | [6174, 71710] | 0.580 | 0.213 | 0.363 | |
| Mod.Chao.OIP | 6493 | [4163, 8823] | ||||
| Mod.Chao.OIG | 19628 | [9143, 30112] |
FIGURE 4Posterior distributions of and of the parameters of all one‐inflated models for prostitution exploitation data. Vertical lines show the posterior medians
Observed count distribution for three real cases
| Real cases | Counts | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1. Prostitutes |
|
|
|
|
|
|
| ||||
| 541 | 169 | 95 | 37 | 21 | 23 | 886 | |||||
| 2. Opiate users |
|
|
|
|
|
|
|
|
|
|
|
| 1206 | 474 | 198 | 95 | 29 | 19 | 5 | 2 | 0 | 1 | 2029 | |
| 3. Heroin users |
|
|
|
|
|
|
|
|
|
|
|
| 2176 | 1600 | 1278 | 976 | 748 | 570 | 455 | 368 | 281 | 254 | 188 | |
|
|
|
|
|
|
|
|
|
|
|
| |
| 138 | 99 | 67 | 44 | 34 | 17 | 3 | 3 | 2 | 1 | 9302 |
The posterior mode and credible intervals for the population size , posterior mean for , and model parameters, for real cases
| 1. Prostitutes in Vancouver |
| 95%HPD( |
|
|
|
| |
|---|---|---|---|---|---|---|---|
| Model | Poi | 1240 | 1177–1300 | 1.254 | |||
| Geo | 2045 | 1906–2217 | 0.570 | ||||
| NB | 3340 | 1977–167925 | 0.145 | 0.395 | |||
| OIP | 1017 | 982–1058 | 0.438 | 2.037 | |||
| OIG | 1820 | 1669–2003 | 0.192 | 0.517 | |||
| OINB | 1040 | 991–1238 | 0.399 | 19.104 | 0.862 | ||
| Mod.Chao.OIP | 1005 | 933–1077 | |||||
| Mod.Chao.OIG | 1421 | 1097–1745 | |||||