| Literature DB >> 35929226 |
Anton M Palma1,2, Giampiero Marra3, Rachel Bray1, Suzue Saito1, Anna Colletar Awor4, Mohamed F Jalloh5, Alexander Kailembo6, Wilford Kirungi7, George S Mgomella5,8, Prosper Njau5, Andrew C Voetsch9, Jennifer A Ward4, Till Bärnighausen10,11,12,13, Guy Harling10,11,12,14.
Abstract
INTRODUCTION: Population-based biomarker surveys are the gold standard for estimating HIV prevalence but are susceptible to substantial non-participation (up to 30%). Analytical missing data methods, including inverse-probability weighting (IPW) and multiple imputation (MI), are biased when data are missing-not-at-random, for example when people living with HIV more frequently decline participation. Heckman-type selection models can, under certain assumptions, recover unbiased prevalence estimates in such scenarios.Entities:
Keywords: HIV prevalence; missing data; non-participation; population surveys; selection models; surveillance
Mesh:
Year: 2022 PMID: 35929226 PMCID: PMC9353488 DOI: 10.1002/jia2.25954
Source DB: PubMed Journal: J Int AIDS Soc ISSN: 1758-2652 Impact factor: 6.707
Figure 1Blood test participation by country, sex and 5‐year age group. Each point indicates the blood test participation rate (% of all eligible adult household members) for each country by sex and 5‐year age group, and lines indicate the trends across age. Bolded points and lines indicate the pooled blood test participation rate for all countries by sex and 5‐year age group.
Pooled sample characteristics
| Men | Women | |||||
|---|---|---|---|---|---|---|
| All eligible household members | Interview participants | Blood test participants | All eligible household members | Interview participants | Blood test participants | |
|
| % of eligible | % of eligible | % of total sample | % of eligible | % of eligible | |
| Total sample | 64,049 (100%) | 85.7% (54,900) | 79.0% (50,584) | 78,657 (100%) | 94.2% (74,133) | 88.2% (69,393) |
| Country | ||||||
| Tanzania | 13,059 (20.4%) | 88.3% | 84.0% | 16,057 (20.4%) | 94.9% | 91.1% |
| Uganda | 11,788 (18.4%) | 93.5% | 92.1% | 15,200 (19.3%) | 97.8% | 96.8% |
| Malawi | 9,030 (14.1%) | 80.7% | 69.8% | 11,006 (14.0%) | 92.7% | 81.3% |
| Zambia | 10,358 (16.2%) | 80.2% | 71.0% | 12,192 (15.5%) | 90.6% | 82.1% |
| Zimbabwe | 9,702 (15.1%) | 82.7% | 74.6% | 11,806 (15.0%) | 93.9% | 86.6% |
| Lesotho | 5,473 (8.5%) | 86.8% | 76.7% | 6,871 (8.7%) | 95.1% | 87.2% |
| Eswatini | 4,639 (7.2%) | 86.0% | 78.8% | 5,525 (7.0%) | 93.4% | 88.3% |
| Urbanicity | ||||||
| Urban | 22,684 (35.4%) | 80.5% | 72.1% | 28,978 (36.8%) | 93.3% | 86.3% |
| Rural | 41,365 (64.6%) | 88.6% | 82.7% | 49,679 (63.2%) | 94.8% | 89.3% |
| Age group | ||||||
| 15–19 | 15,574 (24.3%) | 85.8% | 80.7% | 16,654 (21.2%) | 91.1% | 86.2% |
| 20–24 | 11,520 (18.0%) | 87.0% | 80.0% | 15,619 (19.9%) | 94.7% | 88.3% |
| 25–29 | 9,801 (15.3%) | 84.9% | 77.1% | 13,222 (16.8%) | 95.1% | 88.2% |
| 30–34 | 8,849 (13.8%) | 83.8% | 76.1% | 11,423 (14.5%) | 95.4% | 88.8% |
| 35–39 | 7499 (11.7%) | 84.6% | 77.4% | 9,183 (11.7%) | 95.5% | 89.0% |
| 40–44 | 6155 (9.6%) | 86.3% | 79.3% | 7,203 (9.2%) | 95.2% | 89.7% |
| 45–49 | 4651 (7.3%) | 88.4% | 82.3% | 5,353 (6.8%) | 95.1% | 89.5% |
Note: Numbers of men and women in the pooled sample, by country, urban/rural and 5‐year age groups. Percentages of total sample represent the proportion of the total number of eligible household members in the total pooled sample (column percent). Percent of eligible represents the row percent of eligible household members who participated in the interview and blood test, respectively. aWhile most countries had two categories for urban and rural, in Lesotho, peri‐urban was treated as urban for analysis.
Figure 2HIV prevalence estimates among adults aged 15–49 using different missing data methods. Abbreviations: IPW, inverse probability weighting; MI, multiple imputation. Points and error bars represent HIV prevalence (%) and 95% CIs obtained from models under various missing data treatments using data from all rostered household members. Maximal bounds are the theoretical minimum and maximum of possible HIV prevalence estimates in the sample assuming that all non‐participants were HIV negative or HIV positive. CIs from other models may exceed the maximal bounds due to Taylor series variance approximation.
HIV prevalence estimates using different missingness assumptions, among all adults aged 15–49 who were eligible household members
| Country | |||||||
|---|---|---|---|---|---|---|---|
| Model | Tanzania | Uganda | Malawi | Zambia | Zimbabwe | Lesotho | Eswatini |
| Male | |||||||
|
|
|
|
|
|
|
| |
| Naive | 3.3 (2.9, 3.8) | 4.7 (4.3, 5.2) | 8.2 (7.5, 9.1) | 8.8 (8.1, 9.6) | 11.5 (10.6, 12.4) | 19.7 (18.4, 21.0) | 19.9 (18.2, 21.7) |
| IPW | 3.8 (3.4, 4.2) | 4.2 (3.8, 4.6) | 8.9 (8.2, 9.6) | 8.5 (7.9, 9.2) | 12.0 (11.2, 12.9) | 19.3 (18.1, 20.6) | 19.9 (18.5, 21.5) |
| MI | 5.4 (3.7, 7.2) | 4.6 (3.9, 5.3) | 16.5 (13.4, 19.6) | 14.2 (10.5, 17.9) | 14.9 (12.6, 17.3) | 19.1 (17.7, 20.6) | 18.9 (17.0, 20.8) |
| Selection | 7.9 (6.0, 10.6) | 4.7 (4.4, 5.3) | 9.2 (7.1, 13.0) | 9.7 (8.3, 12.5) | 13.5 (12.3, 14.6) | 24.3 (22.6, 26.6) | 20.8 (19.9, 22.5) |
| Maximal | (2.7, 20.1) | (4.4, 12.1) | (5.9, 34.8) | (6.2, 35.7) | (8.4, 35.1) | (14.9, 39.1) | (15.7, 37.0) |
| Female | |||||||
|
|
|
|
|
|
|
| |
| Naive | 6.5 (6.0, 7.1) | 7.8 (7.2, 8.5) | 13.4 (12.5, 14.4) | 15.2 (14.4, 16.1) | 17.2 (16.4, 18.1) | 30.9 (29.7, 32.2) | 34.8 (32.9, 36.6) |
| IPW | 6.2 (5.7, 6.8) | 7.5 (6.9, 8.1) | 12.1 (11.3, 13.1) | 14.3 (13.4, 15.1) | 15.9 (15.1, 16.7) | 29.7 (28.5, 30.9) | 34.3 (32.5, 36.2) |
| MI | 7.0 (6.4, 7.6) | 7.7 (6.9, 8.6) | 13.3 (12.3, 14.3) | 15.7 (15.0, 16.5) | 18.1 (16.6, 19.6) | 30.2 (29.6, 30.9) | 38.6 (35.3, 41.9) |
| Selection | 7.9 (6.7, 9.6) | 7.6 (7.1, 8.1) | 11.3 (10.2, 12.7) | 15.5 (14.9, 16.4) | 18.1 (17.6, 19.1) | 34.9 (33.5, 37.0) | 37.5 (36.3, 38.9) |
| Maximal | (5.9, 15.7) | (7.5, 10.7) | (10.8, 30.2) | (12.5, 30.7) | (14.8, 28.7) | (26.7, 40.3) | (30.6, 42.6) |
| Female‐to‐male prevalence ratio | |||||||
| Naive | 1.97 | 1.66 | 1.63 | 1.73 | 1.50 | 1.57 | 1.75 |
| IPW | 1.63 | 1.79 | 1.36 | 1.68 | 1.33 | 1.54 | 1.72 |
| MI | 1.30 | 1.67 | 0.81 | 1.11 | 1.21 | 1.58 | 2.04 |
| Selection | 1.00 | 1.62 | 1.23 | 1.60 | 1.34 | 1.44 | 1.80 |
Note: N's indicate the number of blood test participants over the number of eligible persons. The eligible population includes all de facto household members at the top and all individuals who participated in the individual interview at the bottom.
Abbreviations: IPW, inverse probability weight; MI, multiple imputation.
Figure 3Comparison of HIV prevalence estimates from selection models under various copulae versus the IPW model. Points and error bars show selection model results by sex for each bivariate copula, AIC (x‐axis) versus HIV prevalence (%) and 95% CI (y‐axis). Horizontal black line and gray bar indicate the IPW estimate and 95% CI. Points further left exhibit better model fit. Point labels denote which copula was used: “N” = Normal, “F” = Frank, “T” = Student's‐t, “FGM” = Farlie–Gumbel–Morgenstern, “AMH” = Ali–Mikhail–Haq, “PL” = Placket, “HO” = Hougaard, “J” = Joe, “C” = Clayton, “G” = Gumbel. For Joe, Clayton and Gumbel copulae, numbers indicate rotation, either 0, 90, 180 or 270 degrees.