| Literature DB >> 25656226 |
Mark E McGovern1,2, Till Bärnighausen3,4, Joshua A Salomon5, David Canning6,7.
Abstract
BACKGROUND: Selection bias in HIV prevalence estimates occurs if non-participation in testing is correlated with HIV status. Longitudinal data suggests that individuals who know or suspect they are HIV positive are less likely to participate in testing in HIV surveys, in which case methods to correct for missing data which are based on imputation and observed characteristics will produce biased results.Entities:
Mesh:
Year: 2015 PMID: 25656226 PMCID: PMC4429465 DOI: 10.1186/1471-2288-15-8
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Figure 1Posterior probability distribution for the correlation between HIV Testing and HIV Status in Zambia 2007 (Men). Graph shows the posterior probability distribution for the correlation between testing and HIV status ρ = corr(u , ϵ ), calculated using a selection model with a flat prior probability distribution over the interval [-1,1], and interviewer random effects as the exclusion restriction. The standard maximum likelihood (ML) estimate is shown, as well as the bias corrected estimate which is the mean of the posterior probability distribution. Also shown is the 95% bootstrap confidence interval for the bias corrected estimate, based on 1,000 replications. The bootstrap confidence interval is calculated using the empirical distribution of bootstrap estimates. Details of the statistical procedure are outlined in the appendix (see Additional file 1). Source: DHS Zambia 2007 (men).
Figure 2Posterior probability distribution for the correlation between HIV Testing and HIV Status in Ghana 2003 (Men). Graph shows the posterior probability distribution for the correlation between testing and HIV status ρ = corr(u , ϵ ), calculated using a selection model with a flat prior probability distribution over the interval [-1,1], and interviewer random effects as the exclusion restriction. The standard maximum likelihood (ML) estimate is shown, as well as the bias corrected estimate which is the mean of the posterior probability distribution. Also shown is the 95% bootstrap confidence interval for the bias corrected estimate, based on 1,000 replications. The bootstrap confidence interval is calculated using the empirical distribution of bootstrap estimates. Details of the statistical procedure are outlined in the appendix (see Additional file 1). The fact that the maximum likelihood estimate lies outside the bootstrap confidence interval for the bias corrected estimate reflects the fact that the posterior distribution has a long left hand tail which is not accounted for by the standard maximum likelihood estimator, and that we use the empirical distribution of the bootstrap estimates to allow for asymmetry when calculating the confidence interval. Source: DHS Ghana 2003 (men).
Sample size for HIV prevalence estimation among Men in Zambia (2007) and Ghana (2003)
| Zambia | Ghana | |||
|---|---|---|---|---|
| N | % | N | % | |
| Observed HIV Status | 5,163 | 72% | 4,271 | 80% |
| Missing HIV Status (Consent Refused) | 1,318 | 18% | 743 | 14% |
| Missing HIV Status (No Contact) | 653 | 9% | 320 | 6% |
| Total | 7,134 | 100% | 5,334 | 100% |
Source: Demographic and Health Surveys.
Estimates of HIV prevalence among Men in Zambia (2007)
| Model | HIV prevalence | Analytic 95% CI | Bootstrap 95% CI | ||
|---|---|---|---|---|---|
| All Men - Fixed Effects Selection Model | 20.1% | 19.0% | 21.3% | ||
| All Men - Random Effects Selection Model | 16.3% | 15.3% | 17.3% | 11.0% | 18.4% |
| All Men – Random Effects Bias Correction Selection Model | 15.5% | 14.5% | 16.5% | 10.2% | 17.9% |
| Men with Valid HIV Tests | 12.1% | 11.0% | 13.3% | ||
| Men with No Contact - Imputation Model | 15.3% | 14.2% | 16.3% | ||
| All Men - Imputation Model | 12.3% | 11.4% | 13.2% |
In the Heckman-type selection models (rows 1-3), consent to test and HIV status are jointly estimated using a bivariate probit with the following covariates: education, household wealth quintile, type of location, marital status, had a sexually transmitted disease, age at first intercourse, had high risk sex, number of partners, condom use, would care for an HIV-infected relative, knows someone who died of AIDS, previously tested for HIV, smokes, drinks alcohol, language, age group, region, ethnicity and religion. The selection variable which predicts consent but not HIV status is interviewer identity. Full parameter estimates are presented in tables A4-A6 in the appendix (Additional file 1). Analytic standard errors are shown for the fixed effects and random effects models, with bootstrap errors for random effects and random effects bias correction models based on 1,000 replications. Our cluster bootstrap takes account of survey design by drawing a fixed number of clusters (the same as in the original data) from each stratum in each sample. Results from an imputation model are also shown in rows 5–6, along with estimates only using those without missing data (respondents with a valid HIV test). HIV prevalence estimates are weighted. Source: DHS Zambia 2007 (men).
Estimates of HIV Prevalence among Men in Ghana (2003)
| Model | HIV prevalence | Analytic 95% CI | Bootstrap 95% CI | ||
|---|---|---|---|---|---|
| All Men - Fixed Effects Selection Model | 1.4% | 1.1% | 1.7% | ||
| All Men - Random Effects Selection Model | 1.4% | 1.1% | 1.7% | 1.2% | 1.6% |
| All Men - Random Effects Bias Correction Selection Model | 1.4% | 1.1% | 1.7% | 1.2% | 1.6% |
| Men with Non-Missing Data (Valid HIV Test) | 1.6% | 1.2% | 2.0% | ||
| Men with No Contact - Imputation Model | 1.6% | 1.4% | 1.8% | ||
| All Men - Imputation Model | 1.6% | 1.3% | 2.0% |
Consent to test and HIV status are jointly estimated using a bivariate probit with the following covariates: education, wealth quintile, marital status, had a sexually transmitted disease, age at first intercourse, had high risk sex, number of partners, condom use, would care for an HIV-infected relative, knows someone who died of AIDS, previously tested for HIV, smokes, language, age group, region, ethnicity and religion. The selection variable which predicts consent but not HIV status is interviewer identity. Full parameter estimates are presented in tables A8-A10 in the appendix (see Additional file 1). Analytic standard errors are shown for the fixed effects and random effects models, with bootstrap errors for random effects and random effects bias correction models based on 1,000 replications. Our cluster bootstrap takes account of survey design by drawing a fixed number of clusters (the same as in the original data) from each stratum in each sample. Results from an imputation model are also shown in rows 5–6, along with estimates only using those without missing data (respondents with a valid HIV test). HIV prevalence estimates are weighted. Source: DHS Ghana 2003 (men).