| Literature DB >> 35001515 |
Katherine M Jia1, Hallie Eilerts2, Olanrewaju Edun1, Kevin Lam1, Adam Howes3, Matthew L Thomas4, Jeffrey W Eaton1.
Abstract
INTRODUCTION: Several HIV risk scores have been developed to identify individuals for prioritized HIV prevention in sub-Saharan Africa. We systematically reviewed HIV risk scores to: (1) identify factors that consistently predicted incident HIV infection, (2) review inclusion of community-level HIV risk in predictive models and (3) examine predictive performance.Entities:
Keywords: HIV incidence; adolescent girls and young women; risk factors for HIV incidence; risk scores; sub-Saharan Africa
Mesh:
Year: 2022 PMID: 35001515 PMCID: PMC8743366 DOI: 10.1002/jia2.25861
Source DB: PubMed Journal: J Int AIDS Soc ISSN: 1758-2652 Impact factor: 5.396
Figure 1HIV risk score study selection. Abbreviations: AGYW, adolescent girls and young women; n, number.
Characteristics of the cohorts from which HIV risk scores were developed and/or validated
| First author (year) | Cohort (study design) | Develop/validate | Year of study | Sites | Study population | Sex | Age | Settings | N (total PYs) | Incidence (per 100 PYs) |
|---|---|---|---|---|---|---|---|---|---|---|
| Wand (2012) | MIRA (RCT) | Develop | 2003–2006 | Durban, KwaZulu‐Natal, South Africa (two sites) | 18–49 yrs old, sexually active, non‐pregnant, willing to use contraception | F |
Mean: 27 IQR: 22–34 | Family‐planning clinics and other community‐based organization |
1485 | 6.85 |
| Wand (2018) |
Multiple | Develop | 2002–2012 | KwaZulu‐Natal, South Africa (multiple sites) | 16+ yrs old, sexually active, non‐pregnant, willing to use contraception | F |
Median: 27 IQR: 22–33 | Various clinical study sites |
8982 | 7.03 |
| Balkus (2016) | VOICE (RCT) | Develop | 2009–2011 | South Africa, Uganda and Zimbabwe | 18–45 yrs old, sexually active, non‐pregnant, willing to use contraception | F |
Median: 24 IQR: 21–29 | STI clinics, family planning clinics and postnatal clinics, community‐based locations |
4834 (4348) | 6.05 |
| HPTN 035 (RCT) | Validate | 2005–2009 | Malawi, South Africa, Zimbabwe and Zambia | 18+ yrs old, sexually active, not (intended to be) pregnant | F |
Median: 25 IQR: 22–29 | STI clinics, family planning clinics and postnatal clinics, community‐based locations |
2848 (2903) | 3.38 | |
| FEM‐PrEP (RCT) | Validate | 2009–2011 | Kenya, South Africa and Tanzania | Sexually active, non‐pregnant, 18–35 yrs old women at high risk | F |
Median: 23 IQE: 20–27 |
Community outreach, recruitment sites, community partners, health centres, STI clinics, HIV voluntary testing and counselling centres |
1804 (1231) | 4.79 | |
| Balkus (2018) | ASPIRE (RCT) | Validate | 2012–2015 | Malawi, South Africa, Uganda and Zimbabwe | 18–45 yrs old, sexually active, not (intended to be) pregnant | F | <25 (39%) | STIs, family‐planning clinics |
2539 (2566) | 3.70 |
| Burgess (2018) |
CAPRISA 004 (RCT) | Develop | 2007–2010 | KwaZulu‐Natal, South Africa (two sites) | 18–40 yrs old, sexually active women, non‐pregnant, willing to use contraception | F | <25 (68%) | An urban and a rural CAPRISA research clinic | 431 (660.7) | 9.08 |
| Burgess (2017) | FACTS 001 (RCT) | Validate | 2011–2014 | South Africa (nine sites) | 18–30 yrs old, sexually active, non‐pregnant, willing to use contraception | F |
Median: 23 IQR: 20–25 | Nine community‐based clinical trial sites |
1115 (1876) | 4.32 |
| Peebles (2020) |
ECHO (RCT) | Develop | 2015–2018 | South Africa (nine sites) | 18–35 yrs old, sexually active, seeking effective contraception | F | <25 (62.1%) | Nine clinics over five provinces | 5670 (5573) |
5.4 (<25) 3.4 (25+) |
| Giovenco (2019) |
HPTN 068 (RCT) | Validate | 2011–2012 | South Africa (single site) | School‐attending 13–20 yrs old AGYW | F |
Median: 15 IQR: 14–17 | Random sample in the rural Bushbuckridge in Mpumalanga province | 2178 (2455) | 1.34 |
| Rosenberg (2020) | Girl Power (quasi‐experi‐mental) | Develop | 2016–2017 | Malawi (four sites) | Sexually active, 15–24 yrs old AGYW | F | <20 (58.7%) | Four public‐sector health centres in Lilongwe, Malawi |
795 (672) | 2.08 |
| Ayton (2020) | CAPRISA 007 (RCT) | Validate | 2010–2012 | South Africa (14 sites) | 14–25 yrs old school‐attending AGYW | F |
Median: 17 IQR: 16–18 | Grade 9 and 10 students in 14 schools in Vulindlela |
971 (971) | 1.85 |
| Kagaayi (2014) | RCCS (cohort) | Develop | 2003–2011 | Uganda (∼50 communities) | Sexually active, 15–49 yrs old | F |
Mean: 27.0 SD: 7.8 | Communities in Rakai district |
7497 (30,811) | 1.11 |
| M |
Mean: 28.3 SD: 8.0 |
5783 (22,959) | 0.98 | |||||||
| Balzer (2020) | SEARCH (RCT) | Develop | 2018 | Kenya and Uganda (16 communities) | 15+ years old residents | F M | <25 (39%) | 16 communities in rural Uganda and Kenya |
75,558 (166,723) | 0.27–0.37 |
| Roberts (2021) | ACDIS (cohort) | Develop | 2012–2019 | Umkanyakude district of KwaZulu‐Natal, South Africa (single site) | 15+ years old residents | F | – | Rural and peri‐urban communities in Umkhanyakude district |
11,933 |
4.20 (2012–2015) 3.11 (2016–2019) |
| M | – |
7623 |
1.80 (2012–2015) 1.16 (2016–2019) |
Abbreviations: AGYW, adolescent girls and young women; F, female; M, male; N, number; PYs, person‐years; RCT, randomized‐controlled trial; STI, sexually transmitted infection; yrs, years.
Develop: model development (marked with
if the authors did re‐sampling procedures through cross‐validation or bootstrapping to obtain the AUC‐ROC for internal validation of their score(s),
if they split the sample into training and testing sets as part of their internal validation); Validate: external validation of the risk score in a study population different from which the score was originally developed. Peebles [9], Burgess (2018) [28] and Rosenberg [30] developed their own risk scores while also using their samples to externally validate the VOICE score developed by Balkus [8]; AUC‐ROCs are provided in Table 3.
Multiple cohort studies included: MIRA, MDP 301, NCT00213083, VOICE and HPTN035.
Mean/median (IQR).
Total sample size is provided here, whereas the authors split the sample into training and testing sets.
Cumulative incidence after follow‐up in each study.
Predictive performance from the development and validation of included risk scores
| First author (year) |
| Cohort (study design) | HIV incident cases | Prediction horizon | Score developed | AUC‐ROC(95% CI) |
| Predictors available | Score externally validated | AUC‐ROC(95% CI) |
|---|---|---|---|---|---|---|---|---|---|---|
| Wand (2012) | 1 | MIRA (RCT) | 148 | 1–2 yrs | Wand‐2012 | 0.79 (0.70, 0.81)c | – | – | – | – |
| Wand (2018) | 1 |
Multiple | 776 | 1–2+ yrs | Wand‐2018 | 0.71 (–) | 1 | – | – | – |
| Balkus (2016) | 1 | VOICE (RCT) | 263 | 1 yr | VOICE | 0.69 (0.66, 0.72) | – | – | – | – |
| VOICE (modified) | 0.67 (0.64, 0.70) | 1 | – | – | – | |||||
| HPTN 035 (RCT) | 98 | 1 yr | – | 6/7 | VOICE | 0.70 (0.65, 0.75) | ||||
| FEM‐PrEP (RCT) | 59 | 1 yr | – | 6/7 | VOICE | 0.63 (–) | ||||
| – | 4/5 | VOICE (modified) | 0.58 (0.51, 0.65) | |||||||
| Balkus (2018) | – | ASPIRE (RCT) | 95 | 1 yr | 1 | 5/7 | VOICE | 0.69 (0.64, 0.74) | ||
| Burgess (2018) | 1 |
CAPRISA 004 (RCT) |
47 (<25) 13 (25+) | 18 mo |
Burgess (under 25) Burgess (under 25) (modified) |
0.70 (0.60, 0.79) 0.62 (0.56, 0.68) | 1 | All |
VOICE VOICE (under 25) VOICE (≥25) |
0.66 (0.54, 0.74) 0.69 (0.60, 0.78) 0.49 (0.30, 0.63) |
| Burgess (2017) | – | FACTS 001 (RCT) | 81 | 2.2 yrs | 1 | All | VOICE | 0.56 (0.50, 0.62) | ||
| Peebles (2020) | 2 | ECHO (RCT) |
188 (<25) 72 (25+) | 1 yr | 1 | All | VOICE | 0.61 (0.58, 0.65) | ||
| – | All | VOICE (modified) | 0.59 (0.56, 0.62) | |||||||
|
Peebles (under‐25) Peebles (under‐25) (modified) |
0.62 (0.58, 0.64) 0.59 (0.55, 0.61) | – | – | – | – | |||||
|
Peebles (25+) Peebles (25+) (modified) |
0.64 (0.59, 0.69) 0.62 (0.58, 0.67) | – | – | – | – | |||||
| Giovenco (2019) |
HPTN 068 (RCT) | 33 | 1 yr | 1 | 6/7 | VOICE | 0.55 (0.44, 0.65) | |||
| Rosenberg (2020) | 1b | Girl Power (quasi‐experi‐mental) | 14 | 1 yr | Rosenberg | 0.79 (0.69, 0.89) | 1 | – | – | – |
| – | All | VOICE | 0.64 (0.52, 0.75) | |||||||
| Ayton (2020) | – | CAPRISA 007 (RCT) | 18 | 1 yr | 1 | 4/7 | VOICE | – | ||
| Kagaayi (2014) | 2b | RCCS (cohort) | 342 | 1 yr | Kagaayi (female) | 0.67 (0.64, 0.70) | – | – | – | – |
| RCCS (cohort) | 225 | 1 yr | Kagaayi (male) | 0.69 (0.66, 0.73) | – | – | – | – | ||
| Balzer (2020) | 3b | SEARCH (RCT) | 519 | 1 yr |
Balzer (risk group based) Balzer (model based) Balzer (machine learning) |
0.59 (0.55, 0.62) 0.70 (0.68, 0.73) 0.73 (0.71, 0.76) | – | – | – | – |
| Roberts (2021) | 2 | ACDIS (cohort) | 1160 | 1 yr |
Roberts (female) Roberts (female) (age + geographic covariates only) |
0.68 0.65c | – | – | – | – |
| 248 | 1 yr |
Roberts (male) Roberts (male) (age + geographic covariates only) |
0.72 0.71 | – | – | – | – |
Abbreviations: AUC‐ROC, area under receiver operating characteristic curve; 95% CI, 95% confidence interval; Dev, developed; mo, months; N, number; RCT, randomized‐controlled trial; Val, validated; yr(s), year(s).
Number of risk score(s) developed (“N Dev”)/externally validated (“N Val”) by the study.
AUC‐ROC was based on a “testing” set different from the “training” set for which the score was derived. Wand (2012) [24] and Wand (2018) [25] did a random split of data sets, while Roberts [34] split the data by time periods (2012–2015 vs. 2016–2019).
AUC‐ROC was obtained through cross‐validation or bootstrapping of the original derivation set for the internal validation.
Self‐reported STI histories or symptoms were used as proxies for curable STIs and HSV‐2 status in Burgess (2017) [27] and Rosenberg [30], and for curable STIs in Burgess (2018) [28].
Multiple cohort studies included: MIRA, MDP 301, NCT00213083, VOICE and HPTN035.
Risk factors retained in the final HIV risk score models and adjusted effect estimates
| Women only, all ages (RCTs) | AGYW only (RCT/quasi‐experimental trials) | General population | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| First author (year) | Retainment ratio | Wand (2012) | Wand (2018) | Balkus (2016) | Peebles (2020) | Retainment ratio | Peebles (2020) | Burgess (2018) | Rosenberg (2020) | Kagaayi (2014) | Kagaayi (2014) | |
| Cohort | MIRA | Multiple | VOICE | ECHO 25+ yrs old | ECHO <25 yrs old | CAPRISA 004 <25 yrs old | Girls Power | RCCS female | RCCS male | |||
| Demographic | Age (younger) | 3/4 | N.I. | 1.75–2.93 | 1.70 | 2.12 | 0/3 | N.I. | 1.03 | 0.81 | ||
| Age of sexual debut (<16 yrs) | 1/1 | 1.38 | ||||||||||
| Not married/living with partners | 4/4 | 2.63 | 3.62 | 1.80 | 1.85 | 0/3 | N.I. | N.I. | N.I. | 1.72–2.08 | N.I. | |
| Pregnancy history | 1/2 | N.I. | 4.55 | |||||||||
| Parity (less) | 1/3 | 2.62 | N.I. | N.I. | 0/1 | N.I. | ||||||
| Education (higher) | 0/4 | N.I. | N.I. | N.I. | N.I. | 0/2 | N.I. | N.I. | 0.83 | 0.56 | ||
| High‐risk occupation | 1.32 | |||||||||||
| Perceived HIV risk (high) | 1.49 | |||||||||||
| Alcohol use | 1/3 | N.I. | 1.41 | N.I. | 1/2 | 1.45 | N.I. | |||||
| Partnership factor | Partner >5 yrs older | 1/1 | 2.42 | |||||||||
| Partner in high‐risk occupation | 1.89 | |||||||||||
| Fin. support from partner | 1/3 | N.I. | 0.72 | N.I. | 0/3 | N.I. | N.I. | N.I. | ||||
| Partner has other partners: | ||||||||||||
| Don't know versus no | 1/2 | 1.81 | N.I. | 1/1 | 7.56 | |||||||
| Yes versus no | 1/3 | N.I. | 1.63 | N.I. | 2/3 | 7.86 | N.I. | |||||
| Yes/don't know versus no | 1.31 | |||||||||||
| Partner has unknown HIV status | 1.82 | |||||||||||
| No. of partners | 2/4 | N.I. | N.I. | 2/3 | 1.61 | 2.19 | N.I. | 1 | 1 | |||
| 2 (ref: 1) | 1.34 | 1.59 | 1.21 | |||||||||
| 3 (ref: 1) | 1.93 | 1.90 | ||||||||||
| 3+ (ref: <2) | 1.61 | |||||||||||
| New sex partner | 1.45 | |||||||||||
| Concurrent relationships | 1.50 | |||||||||||
| Transmission probability | Condom use (yes) | 1/3 | N.I. | N.I. | N.I. | 1/1 | 1.34 | |||||
| Oral contraceptive (yes) | 0/1 | N.I. | ||||||||||
| Injectable contraceptive (yes) | 1/1 | 1.43 | ||||||||||
| High‐risk sex | 1/1 | 1.96 | – | 1.44 | 1.28 | |||||||
| MMC of primary partners/MMC (male) | 0/3 | N.I. | N.I. | N.I. | 0/2 | N.I. | N.I. | 0.61 | ||||
| HSV‐2 | 2/4 | 1.63 | 1.88 | 3/3 | 1.51 | 2.54 | 1.94 | |||||
| STIs/genital infection | 4/4 | 1.60 | 1.57 | 1.49 | 2.22 | 2/3 | 2.07 | N.I. | 2.61 | 1.75 | 1.78 | |
| Community | Community type | 1.67 | ||||||||||
| Community prevalence | 0/1 | N.I. | 1/1 | 1.64 | 1.03 | 1.03 | ||||||
| Province | 1/1 | 9.05 | ||||||||||
| Adjusted for sites (Y/N) | N | N | Y | Y | Y | Y | Y | |||||
| AUC‐ROC | 0.79 | 0.71 | 0.69 | 0.64 | 0.62 | 0.7 | 0.79 | 0.67 | 0.69 | |||
| 95% CI | 0.70–0.81 | – | 0.66–0.72 | 0.59–0.69 | 0.58–0.64 | 0.60–0.79 | 0.69–0.89 | 0.64–0.70 | 0.66–0.73 | |||
Note: N.I. indicates that risk factor was measured but not selected for inclusion in final model; dark grey shaded cells indicate that risk factor was not considered as a candidate predictor in the study; light grey shaded cells in the retainment ratio column indicate that risk factor was not considered by any models.
Abbreviations: AGYW, adolescent girls and young women; AUC‐ROC, area under receiver operating characteristic curve; 95% CI, 95% confidence interval; HSV‐2, herpes simplex virus type 2; MMC, medical male circumcision; N, no; N.I., not included; RCT, randomized‐controlled trial; ref., reference category; STI, sexually transmitted infection; Y, yes; yrs, years.
Women recruited from RCTs were sexually active and willing to use effective contraception during the study period.
Balzer and Roberts also developed risk scores in their studies, but the final models were not available and thus details are not shown in table. Risk factors considered by these analyses are reported in Table S2.
Retainment ratio reports the number of times a risk factor was retained in the final score relative to the number of times it was considered as a “candidate” predictor, tabulated separately for risk scores for women of all ages and for AGYW only study populations.
Detailed list of high‐risk occupations is provided by Kagaayi (2014).
Wand (2012) considered lifetime number of sex partners, while all others considered recent number of sex partners (Table S1 contains details on temporality of sexual partners for each study). Casual partnership (none vs. at least one) among sexually active women was considered as a proxy for multiple partners in Burgess (2018) study.
High‐risk sex was defined by multiple behavioural risk factors in Wand (2012), while defined as using alcohol before sex by Kagaayi (2014).
Methods for STIs status assessment varied by studies; details are provided in Table S2.
Although partners’ financial support was not included in the final model, all HIV cases (n = 14) in Rosenberg (2020) engaged in transactional sex.
Pregnancy history referred to self‐reported past pregnancy event(s) in Rosenberg (2020).
AUC‐ROC was obtained through cross‐validation or bootstrapping of the original derivation set for the internal validation.
AUC‐ROC was based on a “testing” set different from the “training” set for which the score was derived.
(p<0.05).
(p<0.01).
(p<0.0001) significant and included in the final risk score.
(p>0.05) not significant but included.
Figure 2Forest plots of risk factor estimates among women in general. Adjusted (a) and unadjusted (b) effects were pooled together for: (i) marital/cohabiting status, (ii) age, (iii) number of sexual partners, (iv) partners having other partners, (v) curable sexually transmitted infection (STIs) and (vi) HSV‐2. Abbreviations: aHR, adjusted hazard ratio; 95% CI, 95% confidence interval; HR, hazard ratio; HSV‐2, herpes simplex virus type 2; Ref, reference category; STIs, sexually transmitted infections.
Figure 3Forest plots of risk factor estimates among adolescent girls and young women (AGYW). Adjusted (a) and unadjusted (b) effects were pooled together for (i) marital/cohabiting status, (ii) number of sexual partners, (iii) partners having other partners, (iv) curable sexually transmitted infection (STIs) and (v) HSV‐2. Abbreviations: aHR, adjusted hazard ratio; 95% CI, 95% confidence interval; HR, hazard ratio; HSV‐2, herpes simplex virus type 2; Ref, reference category; STIs, sexually transmitted infections.
Figure 4Forest plot (a) and funnel plot (b) for the area‐under‐curve of the receiver operating characteristic curves (AUC‐ROCs) from external validation studies for the VOICE score. Studies with ^ did not collect all the predictors intended by the model (details are in Table 3). Those with * used self‐reported STIs history, syndromic management or self‐reported symptoms in place of laboratory diagnosed STIs status at baseline as intended by the original VOICE score (details are in Table 3 and Table S2). Abbreviations: AUC, area under curve; 95% CI, 95% confidence interval; ROC, receiver‐operating characteristic.
Figure 5Percentage of individuals identified as high risk among incident cases versus proportion classified as high risk for: women enrolled in clinical trials (a), AGYW (b) and the general population (c). When the highest score was used as the threshold, few or none were classified as high risk and they took up a small fraction of all HIV incident cases (indicated by the origin). When the lowest score was used, all were classified as high risk and all incident cases were among them. Abbreviations: AGYW, adolescent girls and young women; Dev, development set; Val, validation set.