| Literature DB >> 35183203 |
Chen Xu1,2,3, Fengshi Jing3,4, Ying Lu3, Yuxin Ni3, Joseph Tucker3,5,6, Dan Wu3,5,7, Yi Zhou8,9, Jason Ong5,10,11, Qingpeng Zhang12, Weiming Tang13,14,15.
Abstract
BACKGROUND: Estimating the population sizes of key populations(people who inject drugs, men who have sex with men, transgender persons, and commercial sex workers) is critical for understanding the overall Human Immunodeficiency Virus burden. This scoping review aims to synthesize existing methods for population size estimation among key populations, and provide recommendations for future application of the existing methods.Entities:
Keywords: HIV; Key population; Population size estimation; Scoping review
Mesh:
Year: 2022 PMID: 35183203 PMCID: PMC8858560 DOI: 10.1186/s12981-022-00434-7
Source DB: PubMed Journal: AIDS Res Ther ISSN: 1742-6405 Impact factor: 2.250
Fig. 1Flowchart of the review
Fig. 2Relation graph of important words from full-text mining
Fig. 3Research trends of important words from 2001 to 2020 from full-text mining
Fig. 4Type of methods for population size estimation
Summary of size estimation methods. The continuity of this table is across four pages
| Sampling method | Description | Assumption | Strength | Weakness |
|---|---|---|---|---|
| Capture-recapture [ | Assesses the overlaps between incomplete case lists from multiple independent data sets | 1) the selected sampled population is a good representation of the whole population 2) the sample is a closed population 3) able to match individuals in both datasets; 4) individuals have an equal likelihood of being captured | Simple and easy to use for researchers | Capture biases: not everyone has an equal chance of being captured; Estimates would be too high if matches were not identified or too low if recaptures were matched incorrectly |
| Multiplier [ | Two independent sources of data are used to make the estimation, including an authentic count or list of the population whose size is being estimated and a survey of the populations whose size is being estimated | There is accurate demographic and geographic information of the key population | Simple and easy to use | The quality of the data can cause bias; the resulting survey samples may not be fully representative of the key population |
| Delphi [ | Estimating the size of key populations by the individual judgment of several experts | The estimation from an expert team could accurately reflect the reality | Low cost with high efficiency | The estimation may be subjective and not reliable because of the quality of the expert team; Lack of strategies to deal with the disparity between the experts |
| Mapping [ | The locations of the key population are systematically identified and mapped to estimate the size of the key population | The quality of the data can be guaranteed by the full involvement of the key populations | The estimate is made with transparency | The missing of some geographical locations may underestimate the size of key populations; overestimation may happen if the key population frequently attend multiple locations |
| Workbook [ | The key population is identified first and then the estimates are combined with the total population to calculate the proportion of the key population in a specific region | Typically used in countries or regions where the epidemic is low and concentrated | The estimate is made with transparency; errors can be prevented by automatic consistency and audit check | In some countries, data may be limited because of stigma and discrimination among the key populations and legal issues, which may make data unreliable or of poor quality |
| Network scale-up [ | Respondents are asked about the behaviors of acquaintances from their social network to estimate the number of key populations from the social network of each respondent | The average size of personal networks of key populations and the population as a whole are the same; People can accurately report the behaviors of acquaintances from their social networks | The privacy of the key populations is protected because the researchers do not directly contact them | The respondents may ignore key populations among their acquaintances (transmission error); Obtaining a representative sample is challenging because of stigma and discrimination |
| Respondent Driven Sampling [ | A sample from the key population is selected purposively and then these selected individuals are given coupons to recruit other key populations from their social network | Recruiters randomly pass coupons to their social network members who are members of the key populations; Every participant has only one chance to receive the coupon and is equally likely to be recruited; | The Respondent-Driven Sampling method is an effective sampling method for estimating hard-to-reach networked populations with no sampling frames | Limited recruitment within the key populations may lead to biased estimates |
| Bayesian Estimation [ | The key population size is estimated following Bayes' theorem, which is based on a prior probability distribution | If there exists some prior knowledge, like prior probability, the Bayesian method is suitable | It can solve the problem when there is no direct data to estimate the population size for a specified geographical area through survey sampling studies by utilizing empirical data | Bayesian methods might be subjective, due to different researchers with different prior beliefs |
| Stochastic Simulation [ | Estimating the size of a certain population (e.g., HIV-positive) using epidemiologic data using the Monte Carlo method | Parameters are based on the data from representative clinical trials or observational cohort studies | Stochastic simulation makes it possible to naturally produce plausibility intervals for estimates in the face of uncertainty | First, some complex simulation process is quite time-consuming. Second, thanks to different kinds of parameters setting and the unknown quality of observed data, the robustness of some simulation model estimates is not stable |
| Laska-Meisner-Siegel Estimation [ | Based on a single sample and in a single venue, it is an unbiased estimator for the size of a population | This method assumes that we only have a one-time sampling | This estimation method is time- and resources- saving, when comparing with capture-recapture | This method only requires one single sample, thus its estimation accuracy might be lower than other several times sampling estimation methods |
Fig. 5The researchers need to think about when choosing methods for population size estimation
Characteristics of individual studies included in the scoping review.
| No. | ID | Estimation method | Key populations | Study settings |
|---|---|---|---|---|
| 1 | E B Hook et al. | Capture-recapture | The key populations | The United States |
| 2 | M C Buster et al. | Capture-recapture | PWID | Amsterdam |
| 3 | Ruiz MS et al. | Capture-recapture | PWID | Washington DC |
| 4 | Apodaca K et al. | Capture-recapture | MSM | Uganda |
| 5 | Karami M et al. | Capture-recapture | CSW | Tehran, Iran |
| 6 | Doshi RH et al. | Capture-recapture | PWID, MSM, and CSW | Kampala, Uganda |
| 7 | Li G et al. | Capture-recapture | MSM | Beijing, China |
| 8 | Sulaberidze L et al. | Multiplier | MSM | Tbilisi, Georgia |
| 9 | Okal J et al. | Multiplier | PWID, MSM, and CSW | Nairobi, Kenya |
| 10 | Paz-Bailey G et al. | Multiplier | MSM and CSW | El Salvador |
| 11 | Burrell ER et al. | Multiplier | MSM | the United States |
| 12 | Rich AJ et al. | Multiplier | MSM | Metro Vancouver, Canada |
| 13 | Hiebert L et al. | Multiplier | PWID | Malaysia |
| 14 | Khalid FJ et al. | Delphi | PWID, MSM, and CSW | Unguja Island, Zanzibar |
| 15 | Okal J et al. | Delphi | PWID, MSM and CSW | Nairobi, Kenya |
| 16 | Bunjaku DG et al. | Mapping | PWID, MSM and CSW | Kosovo |
| 17 | Odek WO et al. | Mapping | CSW | Kenya |
| 18 | Wambura M et al. | Mapping | MSM and CSW | Tanzania |
| 19 | Lu F, Wang N et al. | Workbook | The key populations | China |
| 20 | Ha NTT et al. | Workbook | PWID | Son La, Vietnam |
| 21 | Lansky A et al. | Workbook | heterosexual persons | the United States |
| 22 | Scholz SM et al. | Network scale-up | MSM | Germany |
| 23 | Baral S et al. | Network scale-up | MSM | Multiple countries |
| 24 | Guo J et al. | Network scale-up | MSM | Beijing, China |
| 25 | Ezoe S et al. | Network scale-up | MSM | Japan |
| 26 | Maghsoudi A et al. | Network scale-up | PWID and CSW | Iran |
| 27 | Wang J et al. | Network scale-up | MSM | Shanghai, China |
| 28 | Bengtsson L et al. | Respondent-driven sampling | MSM | Vietnam |
| 29 | Holland CE et al. | Respondent-driven sampling | MSM and CSW | Burkina Faso and Togo |
| 30 | Johnston LG et al. | Respondent-driven sampling | males who inject drugs | Myanmar |
| 31 | Carballo-Diéguez A et al. | Respondent-driven sampling | MSM | Buenos Aires |
| 32 | Buchanan R et al. | Respondent-driven sampling | PWID | Multiple countries |
| 33 | Lachowsky NJ et al. | Respondent-driven sampling | MSM | Vancouver, Canada |
| 34 | Overall AM et al. | Bayesian estimation | PWID | Scotland |
| 35 | Datta A et al. | Bayesian estimation | MSM | Côte d'Ivoire |
| 36 | Bao L et al. | Bayesian estimation | PWID | Bangladesh |
| 37 | Nakagawa F et al. | Stochastic simulation | HIV-positive MSM | UK |
| 38 | Chen H (2013) et al. | Laska-Meisner-Siegel estimation | MSM | Changsha, China |
| 39 | Chen H (2011) et al. | Laska-Meisner-Siegel estimation | MSM | One city in China |
The summary of population size estimation methods categories
| Categories | Definition of categories | Methods |
|---|---|---|
| Methods based on independent samples | Multiple independent sources of data are used to make the estimation | Capture-recapture method |
| Multiplier method | ||
| Methods based on population counting | Traditional geolocation and counting measures are used to estimate the size of the key population | Delphi method |
| Mapping method | ||
| Methods based on the official report | The estimates are made by combining them with the official report | Workbook method |
| Methods based on social network | The key population is recruited from their social network | Respondent-driven sampling method |
| Network scale-up method | ||
| Methods based on data-driven technologies | The key population size is estimated by data-driven technologies | Bayesian estimation method |
| Stochastic simulation method | ||
| LMS estimation method |