Literature DB >> 35637769

Optimal group testing strategy for the mass screening of SARS-CoV-2.

Fengfeng Huang1, Pengfei Guo2, Yulan Wang3.   

Abstract

We analyze the group testing strategy that maximizes the efficiency of the SARS-CoV-2 screening test while ensuring its effectiveness, where the effectiveness of group testing guarantees that negative results from pooled samples can be considered presumptive negative. Two aspects of test efficiency are considered, one concerning the maximization of the welfare throughput and the other concerning the maximization of the identification rate (namely, identifying as many infected individuals as possible). We show that compared with individual testing, group testing leads to a higher probability of false negative results but a lower probability of false positive results. To ensure the test effectiveness, both the group size and the prevalence of SARS-CoV-2 must be below certain respective thresholds. To achieve test efficiency that concerns either the welfare throughput maximization or the identification rate maximization, the optimal group size is jointly determined by the test accuracy parameters, the infection prevalence rate, and the relative importance of identifying infected subjects. We also show that the optimal group size that maximizes the welfare throughput is weakly smaller than the one that maximizes the identification rate.
© 2022 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Group testing; Mass screening; Test sensitivity; Test specificity

Year:  2022        PMID: 35637769      PMCID: PMC9124587          DOI: 10.1016/j.omega.2022.102689

Source DB:  PubMed          Journal:  Omega        ISSN: 0305-0483            Impact factor:   8.673


Introduction

Mass screening is an essential means to curb the spread of the coronavirus disease 2019 (COVID-19) due to the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. However, due to the limited test capacity, the extent of screening work usually falls short of our expectation. In order to reconcile the tight testing capacity and the urgent need for large-scale tests, “group testing” protocols have been put forward (see, e.g. [1], [13], [27]) and into practice in several places, including Wuhan city of China [24] and Germany [8]. With group testing, samples from multiple subjects are pooled together and tested as one to check the presence of the virus. The pooled sample shall be tested negative if none of the sampled individuals is infected and positive otherwise. When a pooled sample is tested positive, each sample of the pool is then tested individually. Group testing was developed by Dorfman [7] as a cost efficient technique to eliminate syphilis men in army recruits and has been used afterwards in screening a variety of infections, including HIV, H1N1 influenza, malaria, Zika virus (see, e.g. [6], [10], [11], [20], respectively). In these studies, the group testing strategy aims to minimize the expected number of tests. However, they usually overlook how group testing affects the overall test accuracy. Virtually, no test is 100% accurate, because the distribution of the true positive result and that of the true negative result usually overlap and there does not exist a single decision criterion that could separate the samples cleanly [16], [26]. When choosing the decision criteria for a test (which determine test specificity and sensitivity), several clinical factors shall be taken into consideration, including the disease prevalence and the relative importance of identifying infected subjects [16]. Due to that pooling samples together alters the disease prevalence (among the pooled samples), the frequency of errors and the overall test accuracy of group testing are different from those of individual testing. In particular, for communicable diseases, the consequences associated with false positive results are more severe than those associated with false negative results, as false positives only lead to further testing whereas false negatives lead to further spread of the disease. Thus, it is needed to take into account the impacts of false positives and false negatives when developing a group testing strategy. We develop a stylized model to investigate the optimal group testing strategy, i.e., the number of individual samples to be mixed together as one, so that the efficiency of the test is maximized. Two aspects of test efficiency are considered, one concerning the maximization of the identification rate (namely, identifying as many infected individuals as possible) and the other concerning the maximization of the welfare throughput. The first aspect is in essence the same as minimizing the expected number of tests, which, as has been stated above, is the primary concern of the group testing strategy in the extant literature. The second aspect takes account of the consequences associated with false positive and false negative results, which, despite not having drawn enough attention in the group testing literature, have been widely recognized by the medical community; see, e.g. [16], [26]. When deciding the optimal group size, both aspects of test efficiency shall be subject to the group testing effectiveness requirement. For instance, the Food and Drug Administration of the United State suggests that group testing is effective only if each individual subject in a pooled sample can be considered presumptive negative whenever the pooled sample is tested negative [23]. We show that compared with individual testing, group testing leads to an increased probability of false negative results and a decreased probability of false positive results. The effectiveness of group testing requires both the infection prevalence rate and the group size of individual samples below certain thresholds. These thresholds are jointly determined by the test accuracy, which includes test specificity (the probability of correctly identifying an uninfected subject) and test sensitivity (the probability of correctly identifying an infected subject), and the relative importance of identifying infected subjects. We also show that the optimal group size hinges on the test accuracy, the infection prevalence rate, and the relative importance of identifying an infected subject. Moreover, the optimal group size under the welfare throughput maximization is weakly smaller than that under the identification rate maximization. When the agent aims to maximize the welfare throughput and test specificity is 100% (i.e., no false positive results), the optimal group size is the same as the one when the agent aims to maximize the identification rate, and it equals the minimum of two values with one corresponding to the group size that ensures the effectiveness of group testing and the other corresponding to the group size that minimizes the required testing times per sample. We further conduct the extensive numerical studies by taking into account that (1) test outcomes are usually not dichotomous and (2) it is possible to alter the decision criteria to align with the purpose of the test in practice; see, e.g. [16], [26], where decreasing (increasing) test specificity is coupled with increasing (decreasing) test sensitivity. Our numerical experiments reveal that a decision criterion associated with a slightly lower test specificity and a higher test sensitivity can improve the test efficiency of group-testing. Moreover, the relative advantage of group testing over individual testing in improving test efficiency diminishes when the infection prevalence rate is high. The remainder of this paper is organized as follows. The related literature is reviewed in Section 2. Section 3 discusses the model setup. We analyze the optimal group sizes in Section 4. Concluding remarks are provided in Section 5.

Literature review

Ever since group testing was developed by Dorfman in 1943 in eliminating syphilis man in army [7], the literature on group testing has being flourishing. Researchers have investigated the feasibility of group testing in a wide range of infections through experimental studies and clinic practice, including HIV, H1N1 influenza, malaria, and Zika virus; see [6], [10], [11], [20]. In response to the recent COVID-19 outbreak, group testing protocols for SARS-CoV-2 has surged [1], [13], [27]. Group testing pools samples collected from multiple subjects into a single pool and tests the pooled sample for the presence of the virus; the pool shall be tested negative if none of the sampled individuals is infected and positive otherwise. Based on the group testing method, a protocol called the square array method was proposed by [19] to further improve test efficiency. Under the square array method, samples are placed into a square array where each row and column forms a pool, and samples whose row and column pools both tested positive shall be deemed positive. The group testing strategy and its square-array derivative increase testing throughput, and have been analyzed by researchers from multiple disciplines by considering various perspectives, including search theories, learning theories, information theories, and control theories; see, e.g. [2], [12], [14], [15], respectively. The aforementioned studies overlook the following two facts: One, the group testing strategy affects the frequencies of false positive and false negative results. Two, the consequences associated with false positive results are more severe than those associated with false negative results for communicable diseases [4], [26]. We note that [4] consider a certain type of consequence incurred by the false positive and false negative results. Their group testing strategy aims to minimize a weighted sum of the expected number of false negative and false positive results, and the weights are assigned by the decision maker. Different from [4], the weights associated with false negative and false positive results in our welfare throughput maximization objective are jointly determined by the disease prevalence and the disease features. Our work is also related to the studies on balancing the system performance (such as social/patient welfare and system congestion) and diagnosis accuracy, including [3], [9], [25]. Alizamir et al. [3] consider a diagnostic process that consists of a sequence of imperfect tests. They find that the provider shall continue to test the patients until its belief that the patient is of a given type falls into a certain interval[25]. consider a multi-server queueing system and demonstrate that with dual concerns over accuracy and congestion, increasing capacity might increase congestion. [9] consider a similar diagnostic service but focus on examining how patients make their respective decisions based on their personal believes and how their decisions affect the social outcome. Other related studies include [12], [22], [28], which evaluate the effectiveness of group testing by using statistical or stylized models. Both Tu et al. [22] and Zenios and Wein [28] estimate the prevalence of HIV by adopting the group testing method[22] show that group testing not only reduces the probability of false positive results but also improves the accuracy of the estimator [28] take into account that HIV test outcomes are not dichotomous but continuous. They develop a parametric procedure and a hierarchical model to derive the decision criteria that yield accurate estimates. Lin et al. [12] propose a group testing strategy to reduce the prevalence rate of COVID-19 in a closed community.

Model setup

Consider a representative testing center (the agent) that provides SARS-CoV-2 (which causes COVID-19) mass screening tests to the public. The test capacity of the agent (namely, the number of tests the agent can perform at a time) is fixed, and his objective is to maximize the efficiency of group testing by choosing the appropriate group size. We consider two aspects of test efficiency, one concerning maximizing of the welfare throughput, and the other concerning maximizing the identification rate, that is, identifying as many infected individuals as possible. Let denote the infection prevalence rate; it measures the probability of an asymptomatic subject being infected. As of March 2022, the percentage of tested individuals being tested positive is no more than in most countries [18], and such percentage shall be lower when mass screening is considered. Given such evidence, we consider that in this study. Denote a subject’s true type by , where indicates the subject (she) is infected and uninfected. The screening test categorizes subjects as positive () and negative (). The screening test is not perfect. Depending on the subject’s true type (), the probability of is as follows:The parameters and are known as test specificity (the probability of correctly identifying an uninfected subject) and test sensitivity (the probability of correctly identifying an infected subject), respectively. It is reported that the accuracy of SARS-CoV-2 screening testing usually has an average sensitivity of 84.3% and a specificity of between 96.6% and 99.7%; see, e.g., the systematic review of [5]. Based on [5] and for the sake of analytical tractability, we restrict the parameter values to the case . Given a subject’s true type (), she obtains a reward if she is correctly identified () and suffers a loss if misidentified (). is the potential benefit associated with either being assured of the health status when or receiving the proper treatment when , and is the disutility/loss incurred from either not taking the proper treatment (or the potential pandemic containment failure) when or the unnecessary stress suffered by the concerned individual (or the health care system) when . Following [9], we assume that , which indicates that even though the screening information is imperfect, it is valuable to all individuals. Definewhich is an indicator of the relative importance of identifying an infected subject. As far as the communicable diseases are concerned, identifying positive cases are extremely important [26], and hence, the value of shall be large. The group test is composed of two stages. In Stage 1, the agent pools samples into equal-sized groups and performs tests on the groups. If a group is tested negative, every sample in the group is considered to be uninfected; otherwise, it goes to Stage 2, in which every sample in the group is tested individually. Let denote the prevalence of SARS-CoV-2 among the groups when the group size is (). Then, we have Test Effectiveness.When maximizing the efficiency of group testing, the agent is often subject to the effectiveness requirement. For instance, one important concern of the Food and Drug Administration of the United State with group testing is whether negative results from the pooled samples can be presumptive negative [23]. Group testing can be regarded effective only when the expected reward of considering negative pooling test results as presumptive negative is greater than that as presumptive positive; that is,whereand is the posterior probability that a group of size contains at least one infected subject given that the test result is negative. When the condition stated in (1) is violated, the groups with negative results still need to proceed to Stage 2, and hence, group testing cannot reduce the required number of tests. Test Efficiency.Given the group size , let denote the probability of identifying a type () subject as (), the average number of tests required for the group, and the average number of tests conducted on each individual sample. Obviously, when samples are tested individually, , , and . As the agent’s test capacity is fixed, maximizing the welfare throughput (the identification rate, respectively) shall be equivalent to maximizing the welfare throughput (the identification rate, respectively) per test. Denote the welfare throughput per test and the identification rate per test as and , respectively. Then, both of which shall be subject to the effectiveness requirement indicated by (1). In the welfare throughput maximization objective function (2), the numerator is the expected reward of a representative subject, in which and are her expected rewards if she is uninfected () and infected (), respectively. In the identification rate maximization objective function (3), is the expected number of infected individuals found in each group as is the expected number of infected individuals in a group.

Analysis

In this section, we first examine how group testing affects test outcomes and the average number of tests required per individual sample. Next, we derive the condition that ensures the test effectiveness. After that, we derive the optimal group sizes that ensure test efficiency in two aspects, welfare throughput maximization and identification rate maximization. Last, we conduct numerical experiments to strengthen our understanding of the implications of group testing. First, we provide the following result. Suppose that the group size is. Then, the average number of tests required per individual sample is; the false negative probability, i.e., the probability that an infected subject is tested negative, is; the false positive probability, i.e., the probability that an uninfected subject is tested positive, is. Lemma 1 shows that given test specificity and test sensitivity , group testing leads to a higher false negative probability (because ) and a lower false positive probability (because ) compared to individual testing. Moreover, the false negative probability is independent of the group size whereas the false positive probability increases with the group size. With regard to the effectiveness of the test, we obtain the following result. To ensure the effectiveness of group testing, the infection prevalence rate shall satisfyThe group sizeshall not exceed an upper bound, whereBothanddecrease with(the relative importance of identifying an infected subject) and increase with test specificityand test sensitivity. Lemma 2 shows that to ensure the effectiveness of group testing, the infection prevalence rate shall not exceed an upper threshold that is jointly determined by the relative importance of identifying an infected subject and the test accuracy parameters and . It also suggests that government health agencies shall provide operational guidelines on the implementation of group testing such as the maximum pool size that depends on , , , and . The results stated in Lemma 2 are also consistent with the observed practice. For example, regarding SARS-CoV-2 screening tests, the U.S. Food and Drug Administration authorized several public health laboratories to conduct group testing with 5 to 8 samples, depending on their respective test technology and accuracy [21], and a group size of 10 to 25 is allowed in New York State [17]. We further investigate how the infection prevalence rate affects the group size upper bound via a numerical experiment; see Fig. 1 . It shows that is decreasing with . The relative importance of identifying infected subjects also plays a critical role, where a higher leads to a smaller . Fig. 1 shows that given and , if , group testing is effective only when the infection prevalence rate is below , while if , it is effective as long as is no higher than .
Fig. 1

Group Size and Infection Prevalence Thresholds for Conducting Group Testing: and .

Group Size and Infection Prevalence Thresholds for Conducting Group Testing: and . Based on Lemmas 1 and 2, we can obtain the following result regarding the optimal group size. To ensure the test effectiveness and efficiency, depending on whether the agent is concerned with the maximization of the welfare throughput or the identification rate, we have that: When the agent aims to maximize the welfare throughput, the optimal group size is When the agent aims to maximize the identification rate, the optimal group size is, where,takes one of the following two values,or, whichever yields a larger value of the objective function, and() satisfies Proposition 1 shows that under both test efficiency objectives—the welfare throughput maximization and the identification rate maximization—the magnitude of the optimal group size highly depends on the prevalence of virus infection , test accuracy parameters and , and the relative importance of identifying an infected subject . We can easily conclude from Proposition 1 that the optimal group size under the welfare throughput maximization is weakly smaller than that under the identification rate maximization. Interestingly, the optimal group size that maximizes the welfare throughput when is the same as that maximizes the identification rate, and it equals the minimum of , the largest allowable group size that ensures the effectiveness of group testing (see Lemma 2), and , the group size that minimizes the required testing times per sample. Numerical Study. Before proceeding to the numerical experiments, we shall introduce the background knowledge about choosing the decision criteria that determine the magnitude of test specificity and test sensitivity in the clinical environment. First, it is worth pointing out that the results of most existing tests are not dichotomy, because the distributions of the true positive test result and the true negative test result usually overlap and there does not exist any decision criteria that can separate the true positives and true negatives cleanly. Fig. 2 excerpted from the theoretical guidelines of Metz [16] illustrates this gist. According to [16], in Fig. 2, any vertical line across the overlapped area of the two distributions can be a decision criterion that specifies a unique combination of test specificity and test sensitivity . Take the red line in Fig. 2 As an example. The dark gray area formed by the red line and the distribution curve of true negative cases represents the false positive fraction and determines a unique value of , while the light gray area formed by the red line and the distribution curve of true positive cases represents the false negative fraction and determines a unique value of . If we move the red line towards the left, test specificity decreases while test sensitivity increases; vice versa. By varying the decision criteria via moving the red line, multiple combinations of test specificity and test sensitivity are obtained, based on which the medical community usually graphically plot a receiver operating characteristic (ROC) curve [16]. Note that the ROC curve plots the pair rather than the pair.
Fig. 2

An Illustrative Example from [16].

An Illustrative Example from [16]. By adopting the theoretical guidelines of Metz [16] along with the observed accuracy of SARS-CoV-2 tests in practice, we set the distributions of true negative and true positive results to be normally distributed with mean 5 and 10 and variance 1 and 1.2, respectively. We then explore a series of possible decision criteria and obtain the corresponding ROC curve as shown in Fig. 3 . We then select the following four test specificity and sensitivity combinations from the ROC curve— , , , and — to conduct our numerical experiments, which are depicted as the red dots in Fig. 3.
Fig. 3

The Receiver Operating Characteristic Curve: True Negative and True Positive .

The Receiver Operating Characteristic Curve: True Negative and True Positive . We now investigate how test specificity and test sensitivity affect the optimal group size and test efficiency. Our numerical experiments reveal that due to the high importance of identifying infected subjects for COVID-19 (i.e., is close to 1), the test effectiveness requirement characterized by the threshold group size (given in Lemma 2) is usually the binding factor, leading to that the two aspects of test efficiency—maximizing the welfare throughput and maximizing the identification rate—yield the same optimal group size. Hence, for illustration purpose, we only present the numerical outcomes under the welfare throughput maximization; see Fig. 4 . Fig. 4a shows that the optimal group size decreases with the infection prevalence rate , and that is weakly larger when test specificity decreases and test sensitivity increases. Fig. 4b depicts the relative efficiency improvement of group testing over individual testing. It shows that when , the test specificity and sensitivity pair leads to the highest test efficiency; when , the test specificity and sensitivity pair leads to the highest test efficiency; and when , the test specificity and sensitivity pair leads to the highest test efficiency. This indicates that as the infection prevalence rate increases, to facilitate group testing, a decision criterion associated with a slightly lower test specificity and a higher test sensitivity can improve test efficiency. Moreover, Fig. 4b shows that for any combination of and , the relative advantage of group testing over individual testing diminishes as increases.
Fig. 4

Optimal Group Testing Strategy That Maximizes the Welfare Throughput: , , .

Optimal Group Testing Strategy That Maximizes the Welfare Throughput: , , .

Conclusion

We consider an agent that provides SARS-CoV-2 screening tests to the public. The agent’s objective is to maximize the efficiency of group testing. Two aspects of test efficiency are considered with one concerning the welfare throughput maximization and the other concerning the identification rate maximization. Apart from test efficiency, a group testing strategy must be “effective” in the sense that whenever a negative test result is obtained for a sample group, the group can be considered as presumptive negative. We show that compared with individual testing, group testing leads to a higher probability of false negative results but a lower probability of false positive results. To ensure the effectiveness of group testing, it requires that both the group size and the prevalence of SARS-CoV-2 are below certain thresholds. Moreover, the optimal group size that maximizes the welfare throughput is weakly smaller than the one that maximizes the identification rate, and the optimal group sizes under both objectives—the welfare throughput maximization and the identification rate maximization—are jointly determined by test accuracy parameters, the infection prevalence rate, and the importance of identifying an infected subject. Our numerical study shows that the efficiency of group testing can be improved if we increase test sensitivity (the probability of correctly identifying infected subjects) by sacrificing test specificity (the probability of correctly identifying uninfected subjects). Moreover, the relative advantage of group testing over individual testing in improving test efficiency diminishes—or even vanishes—when the infection prevalence rate is high for all combinations of test specificity and test sensitivity.

Credit Author Statement

All authors contributed equally to the work.
  11 in total

1.  Basic principles of ROC analysis.

Authors:  C E Metz
Journal:  Semin Nucl Med       Date:  1978-10       Impact factor: 4.446

2.  PCR-based pooling of dried blood spots for detection of malaria parasites: optimization and application to a cohort of Ugandan children.

Authors:  Michelle S Hsiang; Michael Lin; Christian Dokomajilar; Jordan Kemere; Christopher D Pilcher; Grant Dorsey; Bryan Greenhouse
Journal:  J Clin Microbiol       Date:  2010-08-04       Impact factor: 5.948

3.  Pooled testing for HIV prevalence estimation: exploiting the dilution effect.

Authors:  S A Zenios; L M Wein
Journal:  Stat Med       Date:  1998-07-15       Impact factor: 2.373

4.  The use of a square array scheme in blood testing.

Authors:  R M Phatarfod; A Sudbury
Journal:  Stat Med       Date:  1994-11-30       Impact factor: 2.373

5.  Real-time, universal screening for acute HIV infection in a routine HIV counseling and testing population.

Authors:  Christopher D Pilcher; J Todd McPherson; Peter A Leone; Marlene Smurzynski; Judy Owen-O'Dowd; Amy L Peace-Brewer; Juanita Harris; Charles B Hicks; Joseph J Eron; Susan A Fiscus
Journal:  JAMA       Date:  2002-07-10       Impact factor: 56.272

6.  Screening of Blood Donations for Zika Virus Infection - Puerto Rico, April 3-June 11, 2016.

Authors:  Matthew J Kuehnert; Sridhar V Basavaraju; Robin R Moseley; Lisa L Pate; Susan A Galel; Phillip C Williamson; Michael P Busch; Jose O Alsina; Consuelo Climent-Peris; Peter W Marks; Jay S Epstein; Hira L Nakhasi; J Peyton Hobson; David A Leiby; Pradip N Akolkar; Lyle R Petersen; Brenda Rivera-Garcia
Journal:  MMWR Morb Mortal Wkly Rep       Date:  2016-06-24       Impact factor: 17.586

7.  Entry screening to delay local transmission of 2009 pandemic influenza A (H1N1).

Authors:  Benjamin J Cowling; Lincoln L H Lau; Peng Wu; Helen W C Wong; Vicky J Fang; Steven Riley; Hiroshi Nishiura
Journal:  BMC Infect Dis       Date:  2010-03-30       Impact factor: 3.090

8.  Diagnostic accuracy of serological tests for covid-19: systematic review and meta-analysis.

Authors:  Mayara Lisboa Bastos; Gamuchirai Tavaziva; Syed Kunal Abidi; Jonathon R Campbell; Louis-Patrick Haraoui; James C Johnston; Zhiyi Lan; Stephanie Law; Emily MacLean; Anete Trajman; Dick Menzies; Andrea Benedetti; Faiz Ahmad Khan
Journal:  BMJ       Date:  2020-07-01

9.  Efficient high-throughput SARS-CoV-2 testing to detect asymptomatic carriers.

Authors:  Noam Shental; Shlomia Levy; Vered Wuvshet; Shosh Skorniakov; Bar Shalem; Aner Ottolenghi; Yariv Greenshpan; Rachel Steinberg; Avishay Edri; Roni Gillis; Michal Goldhirsh; Khen Moscovici; Sinai Sachren; Lilach M Friedman; Lior Nesher; Yonat Shemer-Avni; Angel Porgador; Tomer Hertz
Journal:  Sci Adv       Date:  2020-09-11       Impact factor: 14.136

10.  Pooling of samples for testing for SARS-CoV-2 in asymptomatic people.

Authors:  Stefan Lohse; Thorsten Pfuhl; Barbara Berkó-Göttel; Jürgen Rissland; Tobias Geißler; Barbara Gärtner; Sören L Becker; Sophie Schneitler; Sigrun Smola
Journal:  Lancet Infect Dis       Date:  2020-04-28       Impact factor: 71.421

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.