| Literature DB >> 24160725 |
Bethany L Hedt-Gauthier1, Tisha Mitsunaga, Lauren Hund, Casey Olives, Marcello Pagano.
Abstract
BACKGROUND: Traditional Lot Quality Assurance Sampling (LQAS) designs assume observations are collected using simple random sampling. Alternatively, randomly sampling clusters of observations and then individuals within clusters reduces costs but decreases the precision of the classifications. In this paper, we develop a general framework for designing the cluster(C)-LQAS system and illustrate the method with the design of data quality assessments for the community health worker program in Rwanda.Entities:
Year: 2013 PMID: 24160725 PMCID: PMC3819670 DOI: 10.1186/1742-7622-10-11
Source DB: PubMed Journal: Emerg Themes Epidemiol ISSN: 1742-7622
Recommended sample sizes and decision rules for numbers of clusters up to n for the community health worker data quality assessment example
| ICC = 0.01 | samples per cluster, k | 13 | 7 | 5 | 4 | 5 | 3 | 4 | 3 | 2 | 2 | 3 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 1 |
| | total sample size, n | 26 | 21 | 20 | 20 | 30 | 21 | 32 | 27 | 20 | 22 | 36 | 26 | 28 | 30 | 32 | 34 | 36 | 38 | 20 |
| | decision rule, d | 4 | 3 | 3 | 3 | 4 | 3 | 5 | 4 | 3 | 3 | 5 | 4 | 4 | 4 | 5 | 5 | 5 | 5 | 3 |
| | α | 0.093 | 0.081 | 0.096 | 0.095 | 0.040 | 0.077 | 0.073 | 0.069 | 0.092 | 0.062 | 0.036 | 0.081 | 0.056 | 0.038 | 0.071 | 0.050 | 0.035 | 0.024 | 0.091 |
| | β | 0.048 | 0.091 | 0.079 | 0.078 | 0.065 | 0.087 | 0.022 | 0.045 | 0.076 | 0.096 | 0.034 | 0.040 | 0.050 | 0.062 | 0.021 | 0.027 | 0.033 | 0.041 | 0.075 |
| | Expected Costs, Scenario 1 † | $1,260 | $1,710 | $2,200 | $2,700 | $3,300 | $3,710 | $4,320 | $4,770 | $5,200 | $5,720 | $6,360 | $6,760 | $7,280 | $7,800 | $8,320 | $8,840 | $9,360 | $9,880 | $10,200 |
| | Expected Costs, Scenario 2 ‡ | $1,900 | $1,950 | $2,200 | $2,500 | $3,300 | $3,150 | $4,000 | $4,050 | $4,000 | $4,400 | $5,400 | $5,200 | $5,600 | $6,000 | $6,400 | $6,800 | $7,200 | $7,600 | $7,000 |
| ICC = 0.025 | samples per cluster, k | 14 | 7 | 7 | 4 | 5 | 3 | 4 | 3 | 2 | 2 | 3 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 1 |
| | total sample size, n | 28 | 21 | 28 | 20 | 30 | 21 | 32 | 27 | 20 | 22 | 36 | 26 | 28 | 30 | 32 | 34 | 36 | 38 | 20 |
| | decision rule, d | 4 | 3 | 4 | 3 | 4 | 3 | 5 | 4 | 3 | 3 | 5 | 4 | 4 | 4 | 5 | 5 | 5 | 5 | 3 |
| | α | 0.084 | 0.090 | 0.069 | 0.100 | 0.045 | 0.080 | 0.077 | 0.072 | 0.094 | 0.063 | 0.038 | 0.083 | 0.057 | 0.039 | 0.072 | 0.051 | 0.036 | 0.025 | 0.091 |
| | β | 0.074 | 0.098 | 0.062 | 0.083 | 0.070 | 0.090 | 0.025 | 0.048 | 0.078 | 0.097 | 0.036 | 0.041 | 0.051 | 0.063 | 0.022 | 0.028 | 0.034 | 0.042 | 0.075 |
| | Expected Costs, Scenario 1 | $1,280 | $1,710 | $2,280 | $2,700 | $3,300 | $3,710 | $4,320 | $4,770 | $5,200 | $5,720 | $6,360 | $6,760 | $7,280 | $7,800 | $8,320 | $8,840 | $9,360 | $9,880 | $10,200 |
| | Expected Costs, Scenario 2 | $2,000 | $1,950 | $2,600 | $2,500 | $3,300 | $3,150 | $4,000 | $4,050 | $4,000 | $4,400 | $5,400 | $5,200 | $5,600 | $6,000 | $6,400 | $6,800 | $7,200 | $7,600 | $7,000 |
| ICC = 0.05 | samples per cluster, k | 18 | 10 | 7 | 6 | 5 | 3 | 4 | 3 | 2 | 2 | 3 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 1 |
| | total sample size, n | 36 | 30 | 28 | 30 | 30 | 21 | 4 | 27 | 20 | 22 | 36 | 26 | 28 | 30 | 32 | 34 | 36 | 38 | 20 |
| | decision rule, d | 5 | 4 | 4 | 4 | 4 | 3 | 5 | 4 | 3 | 3 | 5 | 4 | 4 | 4 | 5 | 5 | 5 | 5 | 20 |
| | α | 0.093 | 0.071 | 0.082 | 0.056 | 0.052 | 0.085 | 0.085 | 0.076 | 0.097 | 0.065 | 0.041 | 0.086 | 0.060 | 0.041 | 0.075 | 0.053 | 0.038 | 0.026 | 0.091 |
| | β | 0.083 | 0.095 | 0.073 | 0.082 | 0.078 | 0.094 | 0.029 | 0.052 | 0.080 | 0.100 | 0.040 | 0.043 | 0.054 | 0.065 | 0.023 | 0.029 | 0.036 | 0.044 | 0.075 |
| | Expected Costs, Scenario 1 | $1,360 | $1,800 | $2,280 | $2,800 | $3,300 | $3,710 | $4,320 | $4,770 | $5,200 | $5,720 | $6,360 | $6,760 | $7,280 | $7,800 | $8,320 | $8,840 | $9,360 | $9,880 | $10,200 |
| | Expected Costs, Scenario 2 | $2,400 | $2,400 | $2,600 | $3,000 | $3,300 | $3,150 | $4,000 | $4,050 | $4,000 | $4,400 | $5,400 | $5,200 | $5,600 | $6,000 | $6,400 | $6,800 | $7,200 | $7,600 | $7,000 |
| ICC = 0.1 | samples per cluster, k | 68‡‡ | 15 | 9 | 6 | 5 | 4 | 4 | 3 | 3 | 3 | 3 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 1 |
| | total sample size, n | 136 | 45 | 36 | 30 | 30 | 28 | 32 | 27 | 30 | 33 | 36 | 26 | 28 | 30 | 32 | 34 | 36 | 38 | 20 |
| | decision rule, d | 17 | 6 | 5 | 4 | 4 | 4 | 5 | 4 | 4 | 5 | 5 | 4 | 4 | 4 | 5 | 5 | 5 | 5 | 3 |
| | α | 0.096 | 0.092 | 0.090 | 0.075 | 0.067 | 0.082 | 0.099 | 0.086 | 0.052 | 0.077 | 0.048 | 0.091 | 0.064 | 0.045 | 0.080 | 0.058 | 0.041 | 0.029 | 0.091 |
| | β | 0.099 | 0.093 | 0.082 | 0.099 | 0.093 | 0.073 | 0.038 | 0.060 | 0.078 | 0.036 | 0.047 | 0.047 | 0.058 | 0.070 | 0.026 | 0.033 | 0.040 | 0.048 | 0.075 |
| | Expected Costs, Scenario 1 | $2,360 | $1,950 | $2,360 | $2,800 | $3,300 | $3,780 | $4,320 | $4,770 | $5,300 | $5,830 | $6,360 | $6,760 | $7,280 | $7,800 | $8,320 | $8,840 | $9,360 | $9,880 | $10,200 |
| | Expected Costs, Scenario 2 | $7,400 | $3,150 | $3,000 | $3,000 | $3,300 | $3,500 | $4,000 | $4,050 | $4,500 | $4,950 | $5,400 | $5,200 | $5,600 | $6,000 | $6,400 | $6,800 | $7,200 | $7,600 | $7,000 |
| ICC = 0.15 | samples per cluster, k | †† | 45‡‡ | 13 | 9 | 6 | 4 | 5 | 3 | 3 | 3 | 3 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 1 |
| | total sample size, n | | 135 | 52 | 45 | 36 | 28 | 40 | 27 | 30 | 33 | 36 | 26 | 28 | 30 | 32 | 34 | 36 | 38 | 20 |
| | decision rule, d | | 17 | 7 | 6 | 5 | 4 | 6 | 4 | 4 | 5 | 5 | 4 | 28 | 4 | 5 | 5 | 5 | 5 | 3 |
| | α | | 0.099 | 0.100 | 0.082 | 0.087 | 0.095 | 0.090 | 0.096 | 0.060 | 0.086 | 0.055 | 0.096 | 0.069 | 0.049 | 0.085 | 0.062 | 0.045 | 0.032 | 0.091 |
| | β | | 0.099 | 0.090 | 0.087 | 0.080 | 0.083 | 0.043 | 0.067 | 0.086 | 0.042 | 0.053 | 0.051 | 0.062 | 0.074 | 0.029 | 0.036 | 0.043 | 0.051 | 0.075 |
| | Expected Costs, Scenario 1 | | $2,850 | $2,520 | $2,950 | $3,360 | $3,780 | $4,400 | $4,770 | $5,300 | 5830.000 | $6,360 | $6,760 | $7,280 | $7,800 | $8,320 | $8,840 | $9,360 | $9,880 | $10,200 |
| | Expected Costs, Scenario 2 | | $7,650 | $3,800 | $3,750 | $3,600 | $3,500 | $4,400 | $4,050 | $4,500 | 4950.000 | $5,400 | $5,200 | $5,600 | $6,000 | $6,400 | $6,800 | $7,200 | $7,600 | $7,000 |
| ICC = 0.2 | samples per cluster, k | †† | †† | 40 | 12 | 9 | 6 | 6 | 4 | 3 | 3 | 3 | 3 | 2 | 2 | 2 | 2 | 2 | 2 | 1 |
| | total sample size, n | | | 160 | 60 | 54 | 42 | 48 | 36 | 30 | 33 | 36 | 39 | 28 | 30 | 32 | 34 | 36 | 38 | 20 |
| | decision rule, d | | | 20 | 8 | 7 | 6 | 7 | 5 | 4 | 5 | 5 | 5 | 4 | 4 | 5 | 5 | 5 | 5 | 3 |
| | α | | | 0.096 | 0.099 | 0.077 | 0.098 | 0.091 | 0.077 | 0.068 | 0.095 | 0.063 | 0.041 | 0.073 | 0.052 | 0.090 | 0.066 | 0.048 | 0.035 | 0.091 |
| | β | | | 0.100 | 0.090 | 0.091 | 0.067 | 0.050 | 0.071 | 0.093 | 0.047 | 0.060 | 0.074 | 0.066 | 0.078 | 0.032 | 0.039 | 0.047 | 0.055 | 0.075 |
| | Expected Costs, Scenario 1 | | | $3,600 | $3,100 | $3,540 | $3,920 | $4,480 | $4,860 | $5,300 | $5,830 | $6,360 | $6,890 | $7,280 | $7,800 | $8,320 | $8,840 | $9,360 | $9,880 | $10,200 |
| Expected Costs, Scenario 2 | $9,200 | $4,500 | $4,500 | $4,200 | $4,480 | $4,500 | $4,500 | $4,950 | $5,400 | $5,850 | $5,600 | $6,000 | $6,400 | $6,800 | $7,200 | $7,600 | $7,000 | |||
This table assumes p = 0.25, p = 0.05, α = β = 0.1.
† Assuming costs of $500 per cluster and $10 per individual sampled. ‡ Assuming costs of $300 per cluster and $50 per individual sampled. †† Too few clusters to determine a system. ‡‡ May not be applicable to CHW problem, as recommended sample size exceeds average number of households per cluster.
Figure 1Operating Characteristic Curve for the C-LQAS system a) {m = 4, k = 9, d = 5} and b) {m = 9, k = 3, d = 4} under varying values of intraclass correlation (ICC).
Observed misclassification errors and estimated intraclass correlation for varying values of intraclass correlation (ICC), assuming ICC = 0.01 in the design
| | | | |||||||
|---|---|---|---|---|---|---|---|---|---|
| m = 4, k = 5, d = 3 | |||||||||
| 0.025 | 0.101 | 0.089 | 0.000 | 0.102 | 0.632 | 0.004 | 0.166 | 0.997 | |
| 0.050 | 0.113 | 0.095 | 0.015 | 0.119 | 0.612 | 0.025 | 0.179 | 0.993 | |
| 0.100 | 0.132 | 0.111 | 0.043 | 0.144 | 0.578 | 0.065 | 0.195 | 0.991 | |
| 0.150 | 0.153 | 0.114 | 0.075 | 0.177 | 0.537 | 0.103 | 0.214 | 0.986 | |
| 0.200 | 0.173 | 0.127 | 0.100 | 0.200 | 0.524 | 0.148 | 0.236 | 0.981 | |
| m = 7, k = 3, d = 3 | |||||||||
| 0.025 | 0.083 | 0.094 | 0.098 | 0.141 | 0.648 | 0.095 | 0.231 | 0.999 | |
| 0.050 | 0.088 | 0.099 | 0.113 | 0.158 | 0.639 | 0.122 | 0.236 | 0.996 | |
| 0.100 | 0.097 | 0.110 | 0.142 | 0.189 | 0.623 | 0.167 | 0.246 | 0.995 | |
| 0.150 | 0.107 | 0.110 | 0.178 | 0.221 | 0.603 | 0.210 | 0.258 | 0.994 | |
| 0.200 | 0.117 | 0.117 | 0.211 | 0.250 | 0.591 | 0.256 | 0.266 | 0.993 | |
| m = 9, k = 3, d = 4 | |||||||||
| 0.025 | 0.071 | 0.047 | 0.112 | 0.136 | 0.739 | 0.114 | 0.208 | 0.999 | |
| 0.050 | 0.076 | 0.053 | 0.127 | 0.151 | 0.738 | 0.135 | 0.210 | 0.999 | |
| 0.100 | 0.083 | 0.061 | 0.160 | 0.188 | 0.725 | 0.184 | 0.224 | 0.999 | |
| 0.150 | 0.098 | 0.067 | 0.193 | 0.217 | 0.691 | 0.234 | 0.234 | 0.999 | |
| 0.200 | 0.108 | 0.073 | 0.228 | 0.242 | 0.681 | 0.278 | 0.239 | 0.998 | |
Intraclass correlation, † At least one or more events in at least one cluster. Italic data corresponds to the ICC used in the design.
Observed misclassification errors and intraclass correlation for varying values of intraclass correlation (ICC), assuming ICC = 0.10 in the design
| | | | |||||||
|---|---|---|---|---|---|---|---|---|---|
| m = 4, k = 9, d = 5 | |||||||||
| 0.010 | 0.041 | 0.040 | -0.011 | 0.060 | 0.830 | -0.009 | 0.085 | 1.000 | |
| 0.025 | 0.049 | 0.051 | -0.004 | 0.065 | 0.820 | 0.003 | 0.094 | 1.000 | |
| 0.050 | 0.058 | 0.063 | 0.011 | 0.079 | 0.781 | 0.022 | 0.106 | 1.000 | |
| 0.150 | 0.114 | 0.097 | 0.063 | 0.133 | 0.694 | 0.102 | 0.155 | 0.999 | |
| 0.200 | 0.137 | 0.117 | 0.091 | 0.157 | 0.651 | 0.145 | 0.178 | 0.997 | |
| m = 7, k = 4, d = 4 | |||||||||
| 0.010 | 0.059 | 0.052 | 0.039 | 0.106 | 0.754 | 0.036 | 0.166 | 1.000 | |
| 0.025 | 0.060 | 0.053 | 0.047 | 0.115 | 0.746 | 0.055 | 0.173 | 0.999 | |
| 0.050 | 0.064 | 0.064 | 0.065 | 0.133 | 0.744 | 0.079 | 0.181 | 0.999 | |
| 0.150 | 0.095 | 0.089 | 0.129 | 0.196 | 0.687 | 0.173 | 0.210 | 0.998 | |
| 0.200 | 0.111 | 0.092 | 0.164 | 0.219 | 0.663 | 0.215 | 0.225 | 0.998 | |
| m = 9, k = 3, d = 4 | |||||||||
| 0.010 | 0.070 | 0.048 | 0.102 | 0.122 | 0.748 | 0.100 | 0.206 | 1.000 | |
| 0.025 | 0.074 | 0.047 | 0.111 | 0.136 | 0.746 | 0.112 | 0.207 | 0.999 | |
| 0.050 | 0.073 | 0.056 | 0.128 | 0.157 | 0.735 | 0.137 | 0.213 | 1.000 | |
| 0.150 | 0.094 | 0.067 | 0.192 | 0.213 | 0.695 | 0.229 | 0.232 | 0.999 | |
| 0.200 | 0.107 | 0.072 | 0.225 | 0.240 | 0.687 | 0.275 | 0.236 | 0.998 | |
Intraclass correlation, † At least one or more events in at least one cluster. Italic data corresponds to the ICC used in the design.
Observed misclassification errors and estimated intraclass correlation for varying values of intraclass correlation (ICC), assuming ICC = 0.2 in the design
| | | | |||||||
|---|---|---|---|---|---|---|---|---|---|
| m = 4, k = 40, d = 20 | |||||||||
| 0.010 | 0.000 | 0.002 | 0.001 | 0.020 | 1.000 | 0.002 | 0.022 | 1.000 | |
| 0.025 | 0.002 | 0.008 | 0.010 | 0.027 | 0.997 | 0.013 | 0.030 | 1.000 | |
| 0.050 | 0.008 | 0.023 | 0.024 | 0.041 | 0.989 | 0.032 | 0.044 | 1.000 | |
| 0.100 | 0.029 | 0.056 | 0.047 | 0.064 | 0.960 | 0.070 | 0.070 | 1.000 | |
| 0.150 | 0.065 | 0.082 | 0.067 | 0.090 | 0.918 | 0.109 | 0.099 | 1.000 | |
| m = 7, k = 6, d = 6 | |||||||||
| 0.010 | 0.033 | 0.020 | 0.010 | 0.078 | 0.884 | 0.014 | 0.107 | 1.000 | |
| 0.025 | 0.041 | 0.026 | 0.021 | 0.087 | 0.864 | 0.027 | 0.111 | 1.000 | |
| 0.050 | 0.045 | 0.032 | 0.037 | 0.101 | 0.854 | 0.053 | 0.124 | 1.000 | |
| 0.100 | 0.066 | 0.046 | 0.071 | 0.132 | 0.825 | 0.098 | 0.141 | 1.000 | |
| 0.150 | 0.079 | 0.056 | 0.101 | 0.157 | 0.783 | 0.144 | 0.158 | 1.000 | |
| m = 9, k = 4, d = 5 | |||||||||
| 0.010 | 0.035 | 0.035 | 0.049 | 0.103 | 0.842 | 0.049 | 0.149 | 1.000 | |
| 0.025 | 0.040 | 0.040 | 0.058 | 0.111 | 0.829 | 0.068 | 0.156 | 1.000 | |
| 0.050 | 0.046 | 0.046 | 0.077 | 0.133 | 0.815 | 0.091 | 0.163 | 1.000 | |
| 0.100 | 0.058 | 0.052 | 0.113 | 0.163 | 0.800 | 0.139 | 0.177 | 1.000 | |
| 0.150 | 0.064 | 0.065 | 0.145 | 0.191 | 0.773 | 0.188 | 0.191 | 1.000 | |
ICC: Intraclass correlation, † At least one or more events in at least one cluster. Italic data corresponds to the ICC used in the design.