Literature DB >> 20011037

Cluster designs to assess the prevalence of acute malnutrition by lot quality assurance sampling: a validation study by computer simulation.

Casey Olives, Marcello Pagano, Megan Deitchler, Bethany L Hedt, Kari Egge, Joseph J Valadez.   

Abstract

Traditional lot quality assurance sampling (LQAS) methods require simple random sampling to guarantee valid results. However, cluster sampling has been proposed to reduce the number of random starting points. This study uses simulations to examine the classification error of two such designs, a 67x3 (67 clusters of three observations) and a 33x6 (33 clusters of six observations) sampling scheme to assess the prevalence of global acute malnutrition (GAM). Further, we explore the use of a 67x3 sequential sampling scheme for LQAS classification of GAM prevalence. Results indicate that, for independent clusters with moderate intracluster correlation for the GAM outcome, the three sampling designs maintain approximate validity for LQAS analysis. Sequential sampling can substantially reduce the average sample size that is required for data collection. The presence of intercluster correlation can impact dramatically the classification error that is associated with LQAS analysis.

Entities:  

Year:  2009        PMID: 20011037      PMCID: PMC2784900          DOI: 10.1111/j.1467-985X.2008.00572.x

Source DB:  PubMed          Journal:  J R Stat Soc Ser A Stat Soc        ISSN: 0964-1998            Impact factor:   2.483


1. Introduction

In the last 20 years, development organizations working in international health have increasingly adopted lot quality assurance sampling (LQAS) to assess health care parameters. Nearly all of the 805 studies that were identified in a recent review of LQAS implemented between January 1984 and December 2004 employed traditional LQAS sampling methods (Robertson, 2006), in which simple random sampling (SRS) is used for data collection. The exceptions are studies in which a two-stage LQAS design was combined with cluster sampling to assess neonatal tetanus eradication (World Health Organization, 2001, 2002, 2004), and a study in which small clusters instead of SRS were used to assess the prevalence of gobal acute malnutrition (GAM) by LQAS analysis methods (Deitchler ). In the international health setting, small sample sizes (e.g. n=19) have often been used for LQAS assessment of service provision indicators (Valadez, ). The small samples sizes have meant that LQAS has been feasible for use by local managers (Valadez, 1991). However, use of LQAS for assessment of anthropometric indicators requires large sample sizes due to the increased precision that is needed for hypothesis testing. To use SRS with large sample sizes means an increase in time and cost, as data collection for each observation in the sample can require travel to a different site. Sampling observations in batches, or clusters, is an alternative method which reduces the number of site visits that are needed to complete data collection. However, if the observations within each cluster are highly correlated with respect to the outcome being assessed, cluster sampling leads to increased misclassification with the LQAS analysis method. In contrast, cluster sampling could be a viable option if it does not undermine the validity of the independence assumption for hypothesis testing, as required by LQAS. Deitchler , 2008) field tested both a 67×3 and a 33×6 cluster design (67 clusters of size 3 and 33 clusters of size 6 respectively) for LQAS assessment of GAM prevalence in the Siraro woreda of Ethiopia in 2003 and in the administrative units of Fur Baranga and Habila in West Darfur in 2005. The use of a 67×3 sequential sampling design was also investigated in the Ethiopia study. In comparison with the 67×3 and 33×6 design, the sequential design allowed for a reduction in the total sample size that was required to assess the prevalence of GAM by LQAS analysis methods (Deitchler ). Similar sequential designs have been used for categorizing resistance of human immunodeficiency virus to drugs (Bennett ). However, those designs relied on SRS for validity. The current study uses computer simulations to assess the validity of the small cluster approach that was used to assess the prevalence of GAM. The principal sampling strategy uses a cluster model to minimize the number of random sites to visit. We focus on a 67×3 and a 33×6 cluster design as these were the designs that were tested in Ethiopia and Sudan. Additionally, we develop and investigate a second strategy which applies a sequential sampling scheme to the 67×3 cluster design. Here, we use more robust statistical assumptions for the sequential design than had been applied to the work in Ethiopia, to improve the design.

2. Methods

2.1. Traditional lot quality assurance sampling methods

LQAS inference uses the binomial approximation to the hypergeometric distribution to test whether the prevalence of a parameter of interest is exhibited at a proportion that is greater than or equal to some prespecified threshold P0. This is equivalent to the hypothesis test where P is the true prevalence in the population and P0, the upper threshold, is the prevalence level that the data are tested against. In the case of GAM, P0 represents an unacceptable level of acute malnutrition in the population. It is chosen to reflect the prevalence at which a population would be considered a priority for humanitarian intervention. The null hypothesis is rejected if the number of individuals in the sample exhibiting acute malnutrition, s, is less than or equal to an a priori defined critical value d (s≤d). This critical value is often referred to as the decision rule in LQAS literature (Valadez, 1991). In addition, LQAS requires that we define a lower threshold Pa. The lower threshold reflects the prevalence of GAM at which the population would not be considered a priority intervention. As with any hypothesis test, an α- and β-error are associated with LQAS. The α-error is the highest probability that the null hypothesis is incorrectly rejected. In the case of GAM, this would mean concluding that the assessment area does not have a high level of acute malnutrition when in fact it does. This probability is controlled for at the upper threshold: The β-error is the highest probability that we incorrectly fail to reject the null hypothesis. This would mean concluding that the assessment area does have a high level of acute malnutrition when in fact it does not. The β-error is controlled for at the lower threshold: The critical value is chosen to approximate the desired α and β given the upper and lower thresholds, and the sample size. In practice, it is difficult to attain the α- and β-errors exactly owing to the discrete nature of the binomial distribution. Further, more than one critical value can achieve the specified constraints. The actual error probabilities for a specific sample size, and upper and lower thresholds, therefore depend on the critical value d that is chosen. In this study, we investigate the upper and lower thresholds that were field tested in Ethiopia and Sudan (Deitchler , 2008). Three couplets (i.e. upper–lower threshold pairs) are investigated: the upper thresholds of 10%, 15% and 20%, and the respective lower thresholds of 5%, 10% and 15%. The 10%–5% and 15%–10% couplet are of primary concern as these are the benchmarks that are most commonly used by humanitarian agencies to assess the severity of GAM prevalence (Food and Agriculture Organization and Food Security Analysis Unit, 2006). The 20%–15% couplet is of secondary consideration as prevalences of GAM above 20% are fairly rare, even in emergency settings (Médecins sans Frontières, 1995). For each upper and lower threshold couplet, we determined the critical value subject to the constraints of an α-error of approximately 0.10 and a β-error of approximately 0.20 for samples of sizes 198 (33×6) and 201 (67×3). Table 1 gives the sample size, critical value and associated α- and β-errors for each upper and lower threshold couplet when traditional SRS is used for data collection. For the 10%–5% couplet, a critical value of 13 meets the constraints of α≤0.10 and β≤0.20 for both sample sizes. For the 15%–10% couplet, the desired error limits are approximately maintained for a critical value of 23. For the 20%–15% couplet, no critical value attains or closely approximates the desired α- and β-constraints for samples of size 198 and 201. The critical value 33 minimizes the total error for a sample of size 198 and the critical value 34 minimizes the total error for a sample of size 201. We chose to use the critical value 33 for this couplet, with a corresponding α of 0.138 and β of 0.221.
Table 1

LQAS α- and β-errors, and critical values for samples sizes 198 and 201 for three upper and lower threshold couplets assuming SRS

Sample sizeResults for the following threshold pairs:
10%–5%
15%–10%
20%–15%
dαβdαβdαβ
201120.0310.208220.0610.279320.0850.315
130.0540.134230.0910.209330.1170.250
140.0890.081240.1310.151340.1570.193
198120.0350.194220.0720.255320.1010.283
130.0620.123230.1060.188330.1380.221
140.1010.073240.1500.134340.1830.169
LQAS α- and β-errors, and critical values for samples sizes 198 and 201 for three upper and lower threshold couplets assuming SRS

2.2. Lot quality assurance sampling methods for sequential cluster designs

In this section we investigate a sequential cluster design to test the same three null hypotheses as above. The sequential cluster design differs from traditional LQAS as a decision can be made to reject or accept the null hypothesis after each individual cluster has been observed. In a k×m sequential sampling design, there are at most k stages of sampling. At each stage, m sampling elements are observed for a maximum of n possible observations. At the ith stage of sampling, we define a rejection rule r, an acceptance rule a and the cumulative number of outcomes, s (in our application, an outcome is a child exhibiting GAM). If s≥a, then we conclude that the prevalence of GAM is greater than or equal to P0, and sampling stops. Likewise, if s≤r, then we conclude that the prevalence is less than P0, and sampling stops. Otherwise, if r(a+r)/2. Wald outlined the calculation of LQAS critical values at each stage of a sequential design applied to observations that are selected by SRS (Wald, 1947). These critical values are linear in the individual observations. We adapt this theory to accommodate clusters of size m (m>1), under the assumption that observations within each cluster are independent. Namely, define where α and β refer to the target classification errors. These critical values are linear in the sampling stage and thus reflect a cluster sampling design. One of the benefits of sequential designs is the potential for reduction of the overall sample size that is required for data collection. With respect to the outcome of acute malnutrition, a reduction in sample size could lead to a more rapid response to an emergency situation. The average sample number ASN, or the average number of clusters that are sampled to reject or accept the null hypothesis, characterizes this reduction. The average sample size is equal to the number of sampling elements per cluster times ASN (m ASN) and is given by the formula where f(x)=⌈(·)⌉ is the next largest integer function (Aroian, 1976). The Wald critical values rely on the assumption that the number of possible observations is unbounded. However, in virtually all applications, this is not so. When the number of possible observations is bounded, the design is said to be truncated. The use of Wald critical values in truncated sequential designs does not generally yield the appropriate α and β (Wald, 1947). Aroian (1965, 1976) suggested treating a sequential sample as a random walk to calculate the classification error for a truncated design directly. We used Aroian's direct method to calculate the true classification error for a range of sequential designs varied over the parameter space of α and β to arrive within the desired targets of classification error. Here we investigate a 67×3 sequential sampling design with application to the three upper–lower threshold couplets of interest. In terms of the above notation, k=67, m=3, n=201 and the upper bound for ASN is 67. For each upper and lower threshold couplet, we determine the acceptance and rejection rules by using Wald theory. We calculated critical values for a range of α- and β-errors around the target levels of 0.10 and 0.20 respectively. The final critical values that are chosen are those that yield the true α and β nearest to the desired levels as calculated by using the direct method. For both the 15%–10% and 20%–15% couplets, we could not find a design that yielded the desired α- and β-targets. For these couplets we selected the design that jointly minimized the α- and β-errors. For the 10%–5% couplet we expect an α of 0.10 and a β of 0.16. For the 15%–10% couplet, we expect an α of 0.10 and a β of 0.24. And, for the 20%–15% couplet, we expect an α of 0.17 and a β of 0.22. The critical values for each couplet are given in Table 2.
Table 2

Rejection (r) and acceptance (a) rules for the 67×3 sequential design for three upper and lower threshold couplets assuming complete independence†

StageResults for the following couplets (P0/Pa):
10%–5%
15%–10%
20%–15%
rarara
1ND3ND4ND5
2ND3ND5ND6
3ND3ND5ND7
4ND3ND5ND7
5ND4ND6ND8
6ND4ND6ND8
7ND4ND6ND9
8ND4ND7ND9
9ND4ND7ND10
10ND5ND8ND10
11ND5ND8ND11
12ND5ND8011
13ND5ND9012
14ND6ND9112
1506ND9113
1606ND10213
1706010214
1806011314
1917111315
2017111415
2117112416
2217212516
2318212617
2428213617
2528313718
2628314719
2728414819
2829414820
2939415920
3039515921
31395151021
32395161022
334106161122
344106161123
354106171223
364107171224
374117181324
385118181325
395118181425
405118191426
415119191526
426129191527
436129201627
4461210201628
4561210211728
4661311211829
4771311211830
4871311221930
4971312221931
5071312222031
5171412232032
5281413232132
5381413242133
5481414242233
5581414242234
5691514252334
5791515252335
5891515252435
5991515262436
6091616262536
61101616262537
62101616272637
63101617272638
64101617282738
65111718282739
66111718282839
67141523243435

ND signifies that no decision is made and sampling continues.

Rejection (r) and acceptance (a) rules for the 67×3 sequential design for three upper and lower threshold couplets assuming complete independence† ND signifies that no decision is made and sampling continues.

2.3. Simulation validation of cluster designs for lot quality assurance sampling analysis

One key assumption in LQAS theory is that SRS is used for data collection of binary outcomes (Hoshaw-Woodard, 2001; Valadez, 1991). Cluster sampling often results in an intracluster correlation (correlation between subjects within the same cluster with respect to the outcome of interest). For the cluster designs that are of concern here, intracluster correlation could result from within-household correlation (i.e. correlation of GAM between multiple children sampled in one household) or as correlation of GAM between multiple households sampled within the same cluster (Deitchler ). Intercluster correlation (correlation between subjects in different clusters) is also possible although this is likely to be minimal for acute malnutrition and can be assumed to be less than or equal to the intracluster correlation (Fenn ; Reed, 2000). Validation of the 67×3,33×6 and sequential cluster design requires assessing the effect of these potential correlations on the α- and β-errors that are associated with LQAS hypothesis testing. For the cluster sampling techniques that are investigated here, we assume that intracluster correlation is homogeneous and non-negative. Intercluster correlation is also assumed to be homogeneous and non-negative, and less than or equal to the intracluster correlation. This study confines the investigation to the intercluster and intracluster correlations of 0.00, 0.05, 0.10, 0.15, 0.20 and 0.25, because these provide a broad set of acceptable alternatives. Kalton's work on cluster sampling suggests that intracluster correlation is usually less than 0.15 for most indicators (Kalton, 1983). The well-documented multiple causes of malnutrition along with the age dependence vulnerability of children to acute malnutrition (Shrimpton ; United Nations Children's Fund, 1990) further suggest that a low intracluster correlation is likely. Moreover, a review of demographic and health surveys that were conducted in 46 developing countries reported intracluster correlations of less than 0.10 for acute malnutrition in 90% of the countries that were studied (Fenn ) and intracluster correlations of less than 0.05 were reported for GAM in field applications of the 67×3 and 33×6 designs in Sudan (Deitchler ). With these considerations in mind, we expect intracluster correlations using the three cluster sampling schemes used here to be less than 0.05 in most field settings. Intracluster correlation levels equal to and above 0.05 for GAM, although unlikely, are investigated in this study to understand the effect of unusually high levels of intracluster correlation on LQAS classification error for these designs.

2.4. Simulation methods

To reproduce the correlation structure arising from the 67×3 and 33×6 sampling schemes and the 67×3 sequential sampling scheme, it is necessary to generate correlated binary vectors D such that D∼(P,Σ) where P is the n×1 mean vector of Ps and Σ is the n×n variance–covariance matrix describing the correlation structure. For each couplet, samples of size 201 and 198 were generated under the various intercluster and intracluster correlation constraints. This procedure was repeated 10000 times for each couplet and intercluster–intracluster correlation pair for each design. All simulations were performed by using the statistical package R version 2.6.0 (R Development Core Team, 2007). The simulation methodology is described in detail in Appendix A.

3. Results

3.1. Cluster sampling strategy: the 67×3 and 33×6 designs

Tables 3–5 contain the results of the simulations for the 67×3 and 33×6 designs along with the estimated standard errors. As expected, those simulations with an intercluster and intracluster correlation equal to 0 for GAM demonstrate α- and β-errors that are approximately equal to the binomial α- and β-errors that are presented in Table 1, as this situation corresponds to SRS.
Table 3

Simulation results for the 67×3 and 33×6 designs: α- and β-errors for the 10%–5% couplet with varied intercluster and intracluster correlation and d= 13†

Correlation
Results for the 67×3 design
Results for the 33×6 design
InterclusterIntraclusterαβαβ
0.000.000.054 (0.002)0.136 (0.003)0.061 (0.002)0.124 (0.003)
0.000.050.064 (0.002)0.144 (0.004)0.083 (0.003)0.147 (0.004)
0.050.389 (0.005)0.247 (0.004)0.402 (0.005)0.253 (0.004)
0.000.100.071 (0.003)0.159 (0.004)0.103 (0.003)0.161 (0.004)
0.050.390 (0.005)0.263 (0.004)0.393 (0.005)0.246 (0.004)
0.100.491 (0.005)0.234 (0.004)0.488 (0.005)0.245 (0.004)
0.000.150.077 (0.003)0.162 (0.004)0.123 (0.003)0.188 (0.004)
0.050.389 (0.005)0.246 (0.004)0.394 (0.005)0.246 (0.004)
0.100.473 (0.005)0.239 (0.004)0.495 (0.005)0.237 (0.004)
0.150.552 (0.005)0.220 (0.004)0.551 (0.005)0.216 (0.004)
0.000.200.086 (0.003)0.172 (0.004)0.141 (0.003)0.197 (0.004)
0.050.389 (0.005)0.258 (0.004)0.407 (0.005)0.245 (0.004)
0.100.487 (0.005)0.236 (0.004)0.491 (0.005)0.231 (0.004)
0.150.550 (0.005)0.230 (0.004)0.550 (0.005)0.221 (0.004)
0.200.592 (0.005)0.208 (0.004)0.599 (0.005)0.205 (0.004)
0.000.250.097 (0.003)0.179 (0.004)0.163 (0.004)0.205 (0.004)
0.050.400 (0.005)0.256 (0.004)0.409 (0.005)0.255 (0.004)
0.100.478 (0.005)0.236 (0.004)0.491 (0.005)0.239 (0.004)
0.150.552 (0.005)0.225 (0.004)0.548 (0.005)0.215 (0.004)
0.200.592 (0.005)0.213 (0.004)0.595 (0.005)0.198 (0.004)
0.250.628 (0.005)0.195 (0.004)0.635 (0.005)0.187 (0.004)

Standard errors are given in parentheses.

Table 5

Simulation results for the 67×3 and 33×6 designs: α- and β-errors for the 20%–15% couplet with varied intercluster and intracluster correlation and d=33†

Correlation
Results for the 67×3 design
Results for the 33×6 design
InterclusterIntraclusterαβαβ
0.000.000.118 (0.003)0.248 (0.004)0.135 (0.003)0.227 (0.004)
0.000.050.129 (0.003)0.256 (0.004)0.165 (0.004)0.244 (0.004)
0.050.401 (0.005)0.361 (0.005)0.423 (0.005)0.342 (0.005)
0.000.100.138 (0.003)0.266 (0.004)0.190 (0.004)0.262 (0.004)
0.050.409 (0.005)0.357 (0.005)0.422 (0.005)0.343 (0.005)
0.100.475 (0.005)0.358 (0.005)0.481 (0.005)0.339 (0.005)
0.000.150.154 (0.004)0.276 (0.004)0.210 (0.004)0.281 (0.004)
0.050.415 (0.005)0.366 (0.005)0.425 (0.005)0.344 (0.005)
0.100.474 (0.005)0.350 (0.005)0.487 (0.005)0.338 (0.005)
0.150.510 (0.005)0.339 (0.005)0.525 (0.005)0.336 (0.005)
0.000.200.159 (0.004)0.274 (0.004)0.223 (0.004)0.283 (0.005)
0.050.418 (0.005)0.364 (0.005)0.421 (0.005)0.352 (0.005)
0.100.481 (0.005)0.348 (0.005)0.484 (0.005)0.334 (0.005)
0.150.511 (0.005)0.328 (0.005)0.527 (0.005)0.334 (0.005)
0.200.547 (0.005)0.329 (0.005)0.544 (0.005)0.323 (0.005)
0.000.250.168 (0.004)0.282 (0.004)0.239 (0.004)0.291 (0.005)
0.050.408 (0.005)0.365 (0.005)0.435 (0.005)0.353 (0.005)
0.100.481 (0.005)0.353 (0.005)0.477 (0.005)0.353 (0.005)
0.150.511 (0.005)0.340 (0.005)0.523 (0.005)0.337 (0.005)
0.200.540 (0.005)0.335 (0.005)0.553 (0.005)0.322 (0.005)
0.250.561 (0.005)0.314 (0.005)0.574 (0.005)0.313 (0.005)

Standard errors are given in parentheses.

Simulation results for the 67×3 and 33×6 designs: α- and β-errors for the 10%–5% couplet with varied intercluster and intracluster correlation and d= 13† Standard errors are given in parentheses. In the correlated samples, the least effect on α- and β-error occurs when the intercluster correlation equals 0. For example, in the case of the 67×3 design, if the intercluster correlation is equal to 0 and the intracluster correlation is less than or equal to 0.25, the 10%–5% couplet maintains the desired error limits of α≤0.10 and β≤0.20 (Table 3). With intracluster correlations less than 0.10 the 15%–10% couplet performs approximately within the desired error limits (Table 4). Although the 20%–15% couplet has errors that are slightly above the desired limits at this correlation level, these were expected from the outset as the targets were untenable under SRS (Table 5).
Table 4

Simulation results for the 67×3 and 33×6 designs: α- and β-errors for the 15%–10% couplet with varied intercluster and intracluster correlation and d= 23†

Correlation
Results for the 67×3 design
Results for the 33×6 design
InterclusterIntraclusterαβαβ
0.000.000.095 (0.003)0.211 (0.004)0.107 (0.003)0.185 (0.004)
0.000.050.096 (0.003)0.215 (0.004)0.137 (0.003)0.210 (0.004)
0.050.407 (0.005)0.317 (0.005)0.420 (0.005)0.310 (0.005)
0.000.100.111 (0.003)0.220 (0.004)0.155 (0.004)0.230 (0.004)
0.050.410 (0.005)0.324 (0.005)0.426 (0.005)0.314 (0.005)
0.100.479 (0.005)0.312 (0.005)0.488 (0.005)0.304 (0.005)
0.000.150.115 (0.003)0.230 (0.004)0.173 (0.004)0.241 (0.004)
0.050.393 (0.005)0.327 (0.005)0.418 (0.005)0.317 (0.005)
0.100.489 (0.005)0.315 (0.005)0.497 (0.005)0.307 (0.005)
0.150.538 (0.005)0.301 (0.005)0.525 (0.005)0.288 (0.005)
0.000.200.135 (0.003)0.248 (0.005)0.192 (0.004)0.264 (0.004)
0.050.407 (0.005)0.319 (0.005)0.415 (0.005)0.316 (0.005)
0.100.485 (0.005)0.308 (0.005)0.491 (0.005)0.298 (0.005)
0.150.527 (0.005)0.305 (0.005)0.537 (0.005)0.292 (0.005)
0.200.562 (0.005)0.283 (0.005)0.577 (0.005)0.279 (0.004)
0.000.250.138 (0.003)0.247 (0.004)0.205 (0.004)0.270 (0.004)
0.050.403 (0.005)0.328 (0.005)0.425 (0.005)0.322 (0.005)
0.100.481 (0.005)0.304 (0.005)0.488 (0.005)0.306 (0.005)
0.150.525 (0.005)0.301 (0.005)0.536 (0.005)0.290 (0.005)
0.200.557 (0.005)0.285 (0.005)0.575 (0.005)0.279 (0.004)
0.250.594 (0.005)0.278 (0.004)0.597 (0.005)0.261 (0.004)

Standard errors are given in parentheses.

Simulation results for the 67×3 and 33×6 designs: α- and β-errors for the 20%–15% couplet with varied intercluster and intracluster correlation and d=33† Standard errors are given in parentheses. Simulation results for the 67×3 and 33×6 designs: α- and β-errors for the 15%–10% couplet with varied intercluster and intracluster correlation and d= 23† Standard errors are given in parentheses. In the case of the 33×6 design, assuming an intercluster correlation equal to 0, the 10%–5% couplet conforms to the desired error limits of α≤0.10 and β≤0.20 for intracluster correlations up to 0.10 (Table 3); the 15%–10% couplet conforms approximately to the desired error limits when the intracluster correlation equals 0, and, as expected, the 20%–15% couplet does not attain the desired performance (Tables 4 and 5). In cases where both the intercluster and the intracluster correlation are greater than 0, there is a substantial increase in the α-error for both the 67×3 and the 33×6 designs, though the β-error is less affected. This result suggests that, when intercluster correlation is greater than 0, larger samples may be required to attain the desired α- and β-levels. On use of random methods for selection of clusters to sample, it is, however, reasonable to assume an intercluster correlation equal to 0 for LQAS assessment of GAM prevalence with the 67×3 or 33×6 design.

3.2. Sequential sampling strategy: the 67×3 sequential design

Table 6 shows the simulation results for the 67×3 sequential design. As expected, when intercluster and intracluster correlations are equal to 0, the results closely approximate the α- and β-errors that were calculated under SRS. Additionally, the least effect on the α- and β-errors occurs in simulations where the intercluster correlation is equal to 0. Assuming an intercluster correlation equal to 0 and an intracluster correlation as high as 0.25, the α-error is 0.16 or less and the β-error is 0.25 or less for the 10%–5% couplet. For the 15%–10% couplet, the α- and β-errors are 0.14 or less and 0.30 or less respectively. The errors for the 20%–15% couplet are slightly higher with the α-error 0.211 or less and the β-error 0.284 or less.
Table 6

Simulation results for the 67×3 sequential design: α- and β-errors with ASN for three upper and lower threshold couplets with varied intercluster and intracluster correlation†

Correlation
Results for the 10%–5% couplet
Results for the 15%–10% couplet
Results for the 20%–15% couplet
InterclusterIntraclusterαβASN0ASNaαβASN0ASNaαβASN0ASNa
0.000.000.0950.16022.93333.0480.0870.24034.27349.0940.1500.21739.70746.955
(0.003)(0.004)(17.914)(17.452)(0.003)(0.004)(21.493)(18.291)(0.004)(0.004)(21.124)(18.685)
0.000.050.1030.17822.56731.5430.0960.25934.01547.4940.1620.23538.43645.529
(0.003)(0.004)(17.808)(17.186)(0.003)(0.004)(21.773)(18.828)(0.004)(0.004)(21.057)(19.059)
0.050.4010.25122.57623.3090.3920.33530.50932.4140.4350.34530.34830.596
(0.005)(0.004)(18.066)(14.715)(0.005)(0.005)(21.187)(18.853)(0.005)(0.005)(20.214)(18.905)
0.000.100.1250.19722.51930.1090.1070.26833.9546.1580.1820.24737.93944.01
(0.003)(0.004)(17.856)(16.807)(0.003)(0.004)(21.782)(19.059)(0.004)(0.004)(21.241)(19.359)
0.050.4010.25721.94622.8310.3960.33030.30532.3220.4270.33529.85830.289
(0.005)(0.004)(17.558)(14.678)(0.005)(0.005)(21.075)(18.692)(0.005)(0.005)(20.108)(18.494)
0.100.4860.24520.3720.750.4790.30827.63828.160.4850.34525.80925.617
(0.005)(0.004)(16.199)(13.411)(0.005)(0.005)(19.953)(17.646)(0.005)(0.005)(18.87)(17.821)
0.000.150.1350.20621.74728.990.1190.29033.10444.2760.1860.25836.30242.393
(0.003)(0.004)(17.632)(16.394)(0.003)(0.005)(21.656)(19.614)(0.004)(0.004)(21.235)(19.326)
0.050.3940.25721.38122.30.3980.32929.67231.6120.4290.34429.24529.591
(0.005)(0.004)(17.219)(14.059)(0.005)(0.005)(20.891)(18.447)(0.005)(0.005)(19.832)(18.446)
0.100.4930.23920.3220.1820.4700.31427.10127.320.4950.34925.77925.253
(0.005)(0.004)(15.945)(12.808)(0.005)(0.005)(19.904)(17.547)(0.005)(0.005)(18.913)(17.567)
0.150.5610.22719.20119.0460.5170.29625.14325.3990.5270.34023.26523
(0.005)(0.004)(15.149)(12.248)(0.005)(0.005)(18.754)(16.745)(0.005)(0.005)(18.173)(16.886)
0.000.200.1490.21421.40628.0030.1310.29832.5243.0290.1990.26935.98941.115
(0.004)(0.004)(17.343)(16.382)(0.003)(0.005)(21.729)(19.681)(0.004)(0.004)(21.293)(19.583)
0.050.3960.25920.64121.9450.3970.34328.92630.6740.4250.35028.69329.18
(0.005)(0.004)(16.63)(13.911)(0.005)(0.005)(20.722)(18.336)(0.005)(0.005)(19.632)(18.41)
0.100.4940.24819.92719.9480.4750.31926.89426.9820.4890.34225.35225.06
(0.005)(0.004)(15.853)(12.778)(0.005)(0.005)(19.822)(17.272)(0.005)(0.005)(18.715)(17.169)
0.150.5530.22519.0418.9890.5160.30424.95924.750.5190.32722.86722.871
(0.005)(0.004)(15.084)(12.159)(0.005)(0.005)(18.712)(16.283)(0.005)(0.005)(17.792)(16.67)
0.200.5900.21018.43218.0340.5540.28123.45223.6210.5530.31921.60121.155
(0.005)(0.004)(14.329)(11.208)(0.005)(0.004)(17.862)(15.817)(0.005)(0.005)(17.335)(16.099)
0.000.250.1580.24320.87226.8060.1400.30031.89941.2760.2110.28434.93639.666
(0.004)(0.004)(16.861)(15.91)(0.003)(0.005)(21.734)(19.831)(0.004)(0.005)(21.099)(19.669)
0.050.4000.26120.29421.5740.3960.34628.72430.2470.4280.35227.9628.537
(0.005)(0.004)(16.454)(13.355)(0.005)(0.005)(20.627)(18.269)(0.005)(0.005)(19.367)(17.946)
0.100.4790.24719.05419.6460.4690.32026.37527.0950.4790.35124.80824.777
(0.005)(0.004)(15.346)(12.419)(0.005)(0.005)(19.544)(17.22)(0.005)(0.005)(18.412)(17.161)
0.150.5450.23118.60918.7520.5160.29924.3624.7410.5280.33322.45722.523
(0.005)(0.004)(14.461)(11.812)(0.005)(0.005)(18.493)(16.105)(0.005)(0.005)(17.425)(16.403)
0.200.5930.20818.09717.8290.5570.28723.24523.0940.5460.32220.93520.758
(0.005)(0.004)(14.004)(10.931)(0.005)(0.005)(17.661)(15.263)(0.005)(0.005)(16.909)(15.829)
0.250.6220.19217.63417.4790.5940.26822.2322.1770.5750.31519.74419.707
(0.005)(0.004)(13.438)(10.792)(0.005)(0.004)(17.026)(15.051)(0.005)(0.005)(16.143)(15.4)

Standard errors are given in parentheses.

Simulation results for the 67×3 sequential design: α- and β-errors with ASN for three upper and lower threshold couplets with varied intercluster and intracluster correlation† Standard errors are given in parentheses. For all simulated sequential samples, ASN is substantially less than the maximum of 67. For the 10%–5% couplet, the maximum ASN is approximately 23 under the null hypothesis and 34 under the alternative (n=69 and n=102 respectively). For the 15%–10% couplet, the maximum ASN is approximately 35 under the null hypothesis and 50 under the alternative (n=105 and n=150 respectively) and, for the 20%–15% couplet, the maximum ASN is 40 under the null hypothesis and 47 under the alternative (n=120 and n=141 respectively). This result suggests that the 67×3 sequential design could be utilized to decrease the total number of clusters sampled, and thus the overall sample size that is required for data collection. A slightly elevated level of misclassification, beyond α≤0.10 and β≤0.20, would need to be acceptable for the 15%–10% and 20%–15% couplets but, in cases where uncorrelated clusters and a low intracluster correlation can be assumed for GAM, the design may be appropriate to use.

4. Discussion

This study uses computer simulations to assess three cluster sampling schemes that were field tested in Ethiopia to assess the prevalence of GAM by LQAS analysis methods (Deitchler ). The simulation results show that the 67×3 and 33×6 cluster designs conform to the desired error limits of α≤0.10 and β≤0.20 for the 10%–5% and 15%–10% couplet at numerous intracluster correlation levels when the intercluster correlation is equal to 0. It stands to reason that the 67×3 design conforms to the desired α- and β-limits at higher intracluster correlation levels than the 33×6 design for both the 10%–5% and 15%–10% couplet. For the 10%–5% couplet, the 67×3 design maintains the desired error limits when the intercluster correlation is 0 and the intracluster correlation is as high as 0.25. For the 15%–10% couplet, the 67×3 design maintains α and β approximately equal to 0.10 and 0.20 when the intercluster correlation is equal to 0 and the intracluster correlation is less than 0.10. Therefore, when clusters can be assumed independent and correlation within the clusters can be assumed to be less than 0.10, the 67×3 design can be an effective method to reduce the number of sites that would otherwise need to be visited by SRS of the same size. In cases where the clusters can be assumed independent and correlation within the clusters less than 0.15, the 33×6 design can also be an effective method for assessing the prevalence of GAM, allowing for LQAS inference within the desired error limits for the 10%–5% couplet. To maintain the same error limits for the 15%–10% couplet with the 33×6 design, there can be no intracluster correlation. Intuitively, we expect the 67×3 design to perform within the desired error limits at higher levels of intracluster correlation than the 33×6 design, as smaller clusters would suffer less from intracluster correlation. The simulation results for the 67×3 sequential design indicate a potential time advantage over the 67×3 and 33×6 cluster designs because the total sample required for data collection is likely to be smaller. However, notwithstanding two exceptions, the simulation results indicate that the α- and β-errors for all intercluster and intracluster correlation levels, for each threshold couplet, exceed the desired α- and β-limits of 0.10 and 0.20 respectively. Use of the sequential design with these maximal sample sizes would therefore be recommended only when it is acceptable to deviate slightly from the above-stated limits of α and β. The results of this simulation study demonstrate that information about the intracluster correlation of GAM is needed to use the 67×3, 33×6 and 67×3 sequential sampling designs reliably for LQAS assessment of the prevalence of GAM. The review of demographic and health surveys by Fenn suggests that most field settings will have an acute malnutrition intracluster correlation of less than 0.10, whereas the field application of the 67×3 and 33×6 designs in Sudan of Deitchler suggests that an intracluster correlation of less than 0.05 is likely. These studies provide useful information about the plausible upper limit of intracluster correlation for acute malnutrition. However, investigators rarely know in advance the exact intracluster correlation that exists in a field setting where a malnutrition assessment will be conducted. Until there is more clarity about the conditions in which the upper levels of 0.05–0.10 intracluster correlation of GAM would be expected, or possibly exceeded, investigators desiring strict adherence to the stated LQAS error limits of α≤0.10 and β≤0.20 may prefer to err on the side of caution by using the better performing 67×3 design, whereas investigators who require data rapidly may prefer instead to use the 67×3 sequential design. Finally, those investigators seeking a balance between limited classification error and potential expediency of data collection may find that the 33×6 design meets their data requirements best. The results of this study support use of the cluster designs that were used in Ethiopia and Sudan (Deitchler , 2008) for detecting threshold levels of GAM prevalence by LQAS analysis methods. Further, the findings from this study provide useful information to investigators who need to decide which design (i.e. a 67×3, 33×6 or 67×3 sequential design) best suits their analytic needs, with respect to expediency of data collection, and desired limits of classification error. The cluster sampling schemes that were analysed here offer both time efficient and statistically valid alternatives to the conventional methodology for assessment of acute malnutrition in emergency settings.
Table 7

Values of τ needed to simulate binary outcomes with correlation ρ and mean P

Correlation ρValues of τ for the following prevalences P:
0.050.100.150.20
0.050.1770.1310.1100.098
0.100.3050.2420.2100.191
0.150.4070.3390.3010.277
0.200.4930.4240.3850.359
0.250.5680.5010.4620.435
  8 in total

1.  Eliminating bias in randomized cluster trials with correlated binomial outcomes.

Authors:  J F Reed
Journal:  Comput Methods Programs Biomed       Date:  2000-02       Impact factor: 5.428

2.  Validation of neonatal tetanus elimination, Morocco, 2002.

Authors: 
Journal:  Wkly Epidemiol Rec       Date:  2002-09-27

3.  Assessment of neonatal tetanus elimination in Eritrea.

Authors: 
Journal:  Wkly Epidemiol Rec       Date:  2004-06-11

4.  Do childhood growth indicators in developing countries cluster? Implications for intervention strategies.

Authors:  Bridget Fenn; Saul S Morris; Chris Frost
Journal:  Public Health Nutr       Date:  2004-10       Impact factor: 4.022

Review 5.  Global review of health care surveys using lot quality assurance sampling (LQAS), 1984-2004.

Authors:  Susan E Robertson; Joseph J Valadez
Journal:  Soc Sci Med       Date:  2006-06-09       Impact factor: 4.634

6.  Worldwide timing of growth faltering: implications for nutritional interventions.

Authors:  R Shrimpton; C G Victora; M de Onis; R C Lima; M Blössner; G Clugston
Journal:  Pediatrics       Date:  2001-05       Impact factor: 7.124

7.  A field test of three LQAS designs to assess the prevalence of acute malnutrition.

Authors:  Megan Deitchler; Joseph J Valadez; Kari Egge; Soledad Fernandez; Mary Hennigan
Journal:  Int J Epidemiol       Date:  2007-05-21       Impact factor: 7.196

8.  Precision, time, and cost: a comparison of three sampling designs in an emergency setting.

Authors:  Megan Deitchler; Hedwig Deconinck; Gilles Bergeron
Journal:  Emerg Themes Epidemiol       Date:  2008-05-02
  8 in total
  5 in total

1.  Extending cluster lot quality assurance sampling designs for surveillance programs.

Authors:  Lauren Hund; Marcello Pagano
Journal:  Stat Med       Date:  2014-03-17       Impact factor: 2.373

2.  Intervene before leaving: clustered lot quality assurance sampling to monitor vaccination coverage at health district level before the end of a yellow fever and measles vaccination campaign in Sierra Leone in 2009.

Authors:  Lorenzo Pezzoli; Ishata Conteh; Wogba Kamara; Marta Gacic-Dobo; Olivier Ronveaux; William A Perea; Rosamund F Lewis
Journal:  BMC Public Health       Date:  2012-06-07       Impact factor: 3.295

3.  Choosing a Cluster Sampling Design for Lot Quality Assurance Sampling Surveys.

Authors:  Lauren Hund; Edward J Bedrick; Marcello Pagano
Journal:  PLoS One       Date:  2015-06-30       Impact factor: 3.240

4.  Using lot quality assurance sampling to assess access to water, sanitation and hygiene services in a refugee camp setting in South Sudan: a feasibility study.

Authors:  Elizabeth Harding; Colin Beckworth; Jean-Francois Fesselet; Annick Lenglet; Richard Lako; Joseph J Valadez
Journal:  BMC Public Health       Date:  2017-08-08       Impact factor: 3.295

5.  The effect of clustering on lot quality assurance sampling: a probabilistic model to calculate sample sizes for quality assessments.

Authors:  Bethany L Hedt-Gauthier; Tisha Mitsunaga; Lauren Hund; Casey Olives; Marcello Pagano
Journal:  Emerg Themes Epidemiol       Date:  2013-10-26
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.