| Literature DB >> 29095866 |
Jiacheng Wu1, Forrest W Crawford2,3,4, Mait Raag5, Robert Heimer6, Anneli Uusküla5.
Abstract
Estimating the size of key risk populations is essential for determining the resources needed to implement effective public health intervention programs. Several standard methods for population size estimation exist, but the statistical and practical assumptions required for their use may not be met when applied to HIV risk groups. We apply three approaches to estimate the number of people who inject drugs (PWID) in the Kohtla-Järve region of Estonia using data from a respondent-driven sampling (RDS) study: the standard "multiplier" estimate gives 654 people (95% CI 509-804), the "successive sampling" method gives estimates between 600 and 2500 people, and a network-based estimate that uses the RDS recruitment chain gives between 700 and 2800 people. We critically assess the strengths and weaknesses of these statistical approaches for estimating the size of hidden or hard-to-reach HIV risk groups.Entities:
Mesh:
Year: 2017 PMID: 29095866 PMCID: PMC5667832 DOI: 10.1371/journal.pone.0185711
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Descriptive summary of RDS data of PWID in Estonia in 2012.
The top left panel shows the recruitment tree of 600 recruited subjects originating from six seeds. The top right panel shows the number of recruited subjects daily; gaps correspond to weekends. The bottom left panel illustrates the cumulative number of recruitment at each day of the study. The bottom right panel shows a histogram of subjects’ reported degrees.
Estimates from the SS method of the number of people who inject drugs in the Kohtla-Järve region, Estonia.
We obtain posterior estimates under two degree conditions (imputed and raw degree) and two priors for population size (beta and uniform). Imputed degree substitutes the raw degree with the fitted degree by Conway-Maxwell-Poisson distribution. The beta prior models the proportion of sampled subjects among target population. We set the maximum possible number of population size as 1500 and 2500 in uniform prior. Posterior mean, 5% and 95% quantiles are reported.
| Degree | Prior size | Posterior Mean | 95% Posterior Quantile | Implied Prevalence |
|---|---|---|---|---|
| Imputed | Beta | 801 | (621,1106) | 1.8% |
| Imputed | Uniform[0,1500] | 1104 | (686,1463) | 2.5% |
| Imputed | Uniform[0,2500] | 1546 | (739,2399) | 3.5% |
| Raw | Beta | 918 | (600,2002) | 2.1% |
| Raw | Uniform[0,1500] | 1107 | (600,1497) | 2.5% |
| Raw | Uniform[0,2500] | 1320 | (600,2489) | 3.0% |
Estimates of the number of PWID in the Kohtla-Järve region from the network-based method.
The first two columns are prior mean of the link probability, and α is a scale parameter for the prior. Columns 3 to 5 shows the result of point estimates with posterior mean, 95% posterior quantiles and implied prevalence of injection drug use among all 44,721 people in Kohtla-Järve region. The last column shows semi-parametric bounds for the population size.
| Prior Parameter | Point Estimate | Bound Estimates | |||
|---|---|---|---|---|---|
| 95% Posterior Quantile | Implied Prevalence | Posterior Quantile of Lower and Upper Bound | |||
| 0.00393 | 3 | 2202 | (1851, 2713) | 4.9% | (700, 2805) |
| 5 | 2218 | (1866, 2739) | 5.0% | (700, 2920) | |
| 0.01617 | 3 | 2089 | (1791, 2504) | 4.7% | (700, 2635) |
| 5 | 2027 | (1738, 2419) | 4.5% | (700, 2631) | |
| 0.02841 | 3 | 2016 | (1716, 2439) | 4.5% | (700, 2581) |
| 5 | 1937 | (1679, 2286) | 4.3% | (700, 2589) | |
| 0.04065 | 3 | 2048 | (1733, 2466) | 4.6% | (700, 2644) |
| 5 | 1911 | (1655, 2270) | 4.3% | (700, 2552) | |
Fig 2Comparison of results from the multiplier, network-based, and SS methods for population size estimation.
The dashed horizontal line is the minimum number of population size (600); this is a lower bound for the PWID population size. For the multiplier method, results from the raw and weighted proportion of traits are presented. For the network-based method, results from Table 2 are shown, where the prior mean of the link probability and α vary. Black points and lines correspond to point estimates and posterior quantiles while grey lines represent the semi-parametric bounds. For the SS method, results from Table 1 are shown where prior selection and degree specification vary.