| Literature DB >> 24205221 |
Nicky McCreesh1, Andrew Copas, Janet Seeley, Lisa G Johnston, Pam Sonnenberg, Richard J Hayes, Simon D W Frost, Richard G White.
Abstract
INTRODUCTION: Respondent-driven sampling (RDS) is a variant of a link-tracing design intended for generating unbiased estimates of the composition of hidden populations that typically involves giving participants several coupons to recruit their peers into the study. RDS may generate biased estimates if coupons are distributed non-randomly or if potential recruits present for interview non-randomly. We explore if biases detected in an RDS study were due to either of these mechanisms, and propose and apply weights to reduce bias due to non-random presentation for interview.Entities:
Mesh:
Year: 2013 PMID: 24205221 PMCID: PMC3814964 DOI: 10.1371/journal.pone.0078402
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Diagram of the RDS recruitment process.
Percentages, unadjusted and adjusted odds ratios (OR) for being reported to have been offered a coupon (N = 2402) and for being recruited if reported to have been offered a coupon (N = 656).
| Variables | Category | Reported to have been offered a coupon (among whole target population) N = 2402 | Recruited (among men reported to have been offered a coupon) N = 656 | ||||
| % (n/N) | Unadjusted OR (95% CI) (p-value) | Adjusted* OR (95% CI) (p-value) | % (n/N) | Unadjusted OR (95% CI) (p-value) | Adjusted* OR (95% CI) (p-value) | ||
|
|
| 33% (236/714) | 1 | 1 | 80% (189/236) | 1 | 1 |
|
| 32% (160/496) | 0.96 (0.76–1.23) | 1.00 (0.77–1.29) | 77% (123/160) | 0.83 (0.51–1.35) | 0.88 (0.54–1.46) | |
|
| 26% (170/660) | 0.70 (0.56–0.89) | 0.71 (0.56–0.91) | 76% (129/170) | 0.78 (0.49–1.26) | 0.77 (0.47–1.25) | |
|
| 18% (89/485) | 0.46 (0.34–0.60) | 0.42 (0.31–0.56) | 71% (63/89) | 0.60 (0.35–1.05) | 0.61 (0.34–1.10) | |
|
| 2% (1/47) | 0.04 (0.01–0.32) | 0.04 (0.01–0.33) | 100% (1/1) | - | ||
| (p<0.0001) | (p<0.0001) | (p = 0.4) | (p = 0.4) | ||||
|
|
| 22% (134/617) | 1 | 1 | 69% (93/134) | 1 | 1 |
|
| 28% (170/597) | 1.44 (1.11–1.86) | 1.40 (1.07–1.84) | 76% (130/170) | 1.43 (0.86–2.39) | 1.64 (1.27–2.13) | |
|
| 35% (190/550) | 1.90 (1.47–2.47) | 1.83 (1.39–2.39) | 78% (149/190) | 1.60 (0.97–2.65) | 2.32 (1.78–3.02) | |
|
| 27% (139/514) | 1.34 (1.02–1.76) | 1.17 (0.88–1.56) | 85% (118/139) | 2.48 (1.37–4.48) | 2.27 (1.73–2.96) | |
|
| 19% (23/124) | 0.82 (0.5–1.34) | 0.97 (0.58–1.61) | 65% (15/23) | 0.83 (0.33–2.10) | 1.34 (0.85–2.12) | |
| (p<0.0001) | (p = 0.0001) | (p = 0.02) | (p = 0.06) | ||||
|
|
| 17% (190/1114) | 1 | 1 | 65% (123/190) | 1 | 1 |
|
| 36% (466/1288) | 2.76 (2.27–3.34) | 2.84 (2.33–3.46) | 82% (382/466) | 2.48 (1.69–3.62) | 4.30 (3.57–5.19) | |
| (p<0.0001) | (p<0.0001) | (p<0.0001) | (p<0.0001) | ||||
P-values are overall p-values for the association. * = Adjusted for other variables in the table. The order of the age categories has been reversed so that 50+ year olds are the reference category due to the small number of <20 year old men who were reported to have been offered a coupon
Associations with being reported to have been offered a coupon among RDS recruits (N = 917).
| Number reported to have been offered a coupon | % (p-value) | ||
|
|
| 1/4 | 25% |
|
| 63/122 | 52% | |
|
| 129/229 | 56% (p = 0.7) | |
|
| 123/220 | 56% | |
|
| 189/342 | 55% | |
|
|
| 93/164 | 57% |
|
| 130/222 | 59% | |
|
| 149/252 | 59% (p = 0.05) | |
|
| 118/244 | 48% | |
|
| 15/35 | 43% | |
|
|
| 123/238 | 52% (p = 0.2) |
|
| 382/679 | 56% |
P-values are overall p-values for the association. Recruits were reported to have been offered a coupon if one or more recruiters said that they had offered them a coupon.
Comparison of standard RDS-2 estimates and interview presentation weighted RDS-2 estimates (the latter adjusts for non-random presentation for interview if offered a coupon by age and socioeconomic status).
| Proportions | Estimates | Root mean squared errors | ||||||
| Population | RDS sample | Men reported to have been offered a coupon | RDS-2 | Interview presentation weighted RDS-2 | RDS-2 estimates | Interview presentation weighted RDS-2 estimates | ||
|
|
| 2.0% | 0.4% | 0.1% | 0.5% | 0.6% | ||
|
| 20.2% | 13.3% | 12.7% | 12.9% | 12.1% | |||
|
| 27.5% | 25.0% | 24.9% | 24.2% | 24.6% | 0.058 | 0.057 | |
|
| 20.7% | 24.0% | 25.6% | 23.0% | 24.7% | |||
|
| 29.7% | 37.3% | 36.6% | 39.5% | 37.9% | |||
|
|
| 25.7% | 17.9% | 21.3% | 17.0% | 20.5% | ||
|
| 24.9% | 24.2% | 26.1% | 23.8% | 26.1% | |||
|
| 22.9% | 27.5% | 28.6% | 26.9% | 28.0% | 0.056 | 0.034 | |
|
| 21.4% | 26.6% | 20.6% | 29.0% | 22.3% | |||
|
| 5.2% | 3.8% | 3.4% | 3.3% | 3.2% | |||
|
|
| 69.7% | 66.7% | 70.4% | 66.1% | 67.5% | ||
|
| 17.9% | 21.0% | 18.5% | 21.2% | 20.4% | |||
|
| 1.7% | 2.1% | 1.4% | 2.2% | 2.2% | 0.026 | 0.019 | |
|
| 4.7% | 6.1% | 6.6% | 6.9% | 6.1% | |||
|
| 6.0% | 4.0% | 3.2% | 3.6% | 3.8% | |||
|
|
| 59.8% | 62.4% | 61.1% | 64.5% | 63.6% | ||
|
| 17.0% | 17.1% | 17.8% | 15.3% | 15.1% | 0.029 | 0.023 | |
|
| 22.7% | 20.2% | 20.9% | 19.8% | 20.8% | |||
|
| 0.5% | 0.3% | 0.2% | 0.4% | 0.6% | |||
|
|
| 11.3% | 14.8% | 12.0% | 16.1% | 13.7% | ||
|
| 41.9% | 57.7% | 57.0% | 57.4% | 58.9% | |||
|
| 11.4% | 14.0% | 13.9% | 12.8% | 13.4% | 0.122 | 0.124 | |
|
| 3.7% | 3.5% | 5.3% | 4.0% | 4.0% | |||
|
| 31.6% | 10.0% | 11.7% | 9.8% | 10.0% | |||
|
|
| 6.3% | 7.9% | 7.2% | 7.4% | 7.0% | ||
|
| 60.0% | 81.7% | 78.1% | 82.0% | 82.2% | 0.184 | 0.184 | |
|
| 33.7% | 10.5% | 14.8% | 10.6% | 10.8% | |||
Root mean squared errors calculated relative to population proportions
Logistic regression model for recruitment into the RDS study.
| Variable | Category | Adjusted* OR | 95% Confidence interval |
|
|
| 1 | |
|
| 2.81 | 1.03–7.65 | |
|
| 5.37 | 1.97–14.49 | |
|
| 8.84 | 3.23–24.02 | |
|
| 9.33 | 3.43–25.13 | |
|
|
| 1 | |
|
| 1.77 | 1.34–2.34 | |
|
| 2.32 | 1.74–3.07 | |
|
| 2.55 | 1.89–3.44 | |
|
| 1.52 | 0.92–2.52 | |
|
|
| 1 | |
|
| 1.12 | 0.87–1.44 | |
|
| 1.65 | 0.80–3.41 | |
|
| 1.43 | 0.91–2.24 | |
|
| 0.67 | 0.43–1.06 | |
|
|
| 1 | |
|
| 1.14 | 0.78–1.66 | |
|
| 0.28 | 0.21–0.37 | |
|
|
| 1 | |
|
| 1.75 | 1.27–2.40 | |
|
| 1.48 | 1.00–2.19 | |
|
| 1.01 | 0.58–1.75 | |
|
| 0.40 | 0.26–0.60 | |
|
|
| 1 | |
|
| 0.50 | 0.37–0.66 | |
|
| 0.50 | 0.37–0.67 | |
|
| 0.42 | 0.28–0.64 | |
|
|
| 1 | |
|
| 1.22 | 0.93–1.59 | |
|
| 0.46 | 0.22–1.01 |
N = 2396 due to missing data for the variable distance to the nearest interview site. * = Adjusted for other variables in model. ** Transformed network size = .
Figure 2Relationship between network size and recruitment probability in the target population.
The relationship is shown for three men with (A) low, (B) medium, and (C) high probabilities of recruitment based on the other variables in the logistic regression model (age group, socioeconomic status, tribe, hiv status, sexual activity group, distance to the nearest interview site and participation in the last cohort round. The figure shows that the relationship is non-linear, and that network size has little effect on recruitment probability apart from for men with small network sizes.