| Literature DB >> 21841351 |
Nayu Ikeda1, Kenji Shibuya, Hideki Hashimoto.
Abstract
BACKGROUND: The Comprehensive Survey of Living Conditions of the People on Health and Welfare (CSLC) is a major source of health data in Japan. The CSLC is not strictly based on probabilistic sampling, but instead uses an equal allocation of sample clusters to yield equal standard errors of estimates across prefectures. This study compared the performance of this sample design in measuring population health with that of an alternative probabilistic sampling approach.Entities:
Mesh:
Year: 2011 PMID: 21841351 PMCID: PMC3899438 DOI: 10.2188/jea.JE20100102
Source DB: PubMed Journal: J Epidemiol ISSN: 0917-5040 Impact factor: 3.211
Population size and basic statistics of a continuous variable X in a hypothetical population by strata
| Stratum ID | Clusters | Households | Individuals | Mean of | |
| 1 | 22 708 | 1 135 425 | 3 969 109 | 131.0 | 24.8 |
| 2 | 6043 | 302 308 | 1 058 277 | 126.0 | 14.4 |
| 3 | 31 176 | 1 558 708 | 5 455 087 | 128.3 | 18.9 |
| 4 | 4161 | 208 094 | 726 722 | 127.4 | 16.9 |
| 5 | 18 121 | 905 860 | 3 172 896 | 131.8 | 26.6 |
| 6 | 18 841 | 942 105 | 3 296 412 | 133.5 | 31.1 |
| 7 | 21 151 | 1 057 710 | 3 701 249 | 130.0 | 22.4 |
| 8 | 32 143 | 1 607 112 | 5 623 977 | 126.2 | 15.0 |
| 9 | 4538 | 226 915 | 794 826 | 129.1 | 20.4 |
| 10 | 36 939 | 1 846 871 | 6 464 310 | 131.6 | 26.2 |
Average size of 1000 sample datasets by strata and sample design
| Stratum ID | Method 1 | Method 2 (by number of sample clusters) | ||||
| 1000 | 2000 | 3000 | 4000 | 5000 | ||
| Clusters | ||||||
| 1 | 100 | 116 | 232 | 348 | 464 | 580 |
| 2 | 100 | 31 | 62 | 93 | 123 | 154 |
| 3 | 100 | 159 | 318 | 478 | 637 | 796 |
| 4 | 99 | 21 | 43 | 64 | 85 | 106 |
| 5 | 100 | 93 | 185 | 278 | 370 | 463 |
| 6 | 100 | 96 | 192 | 289 | 385 | 481 |
| 7 | 100 | 108 | 216 | 324 | 432 | 540 |
| 8 | 100 | 164 | 328 | 492 | 657 | 821 |
| 9 | 100 | 23 | 46 | 70 | 93 | 116 |
| 10 | 100 | 189 | 377 | 566 | 754 | 943 |
| Total | 999 | 1000 | 1999 | 3002 | 4000 | 5000 |
| Households | ||||||
| 1 | 4994 | 5799 | 5830 | 5860 | 5958 | 5799 |
| 2 | 4999 | 1546 | 1554 | 1560 | 1587 | 1544 |
| 3 | 4995 | 7960 | 8003 | 8044 | 8179 | 7961 |
| 4 | 4937 | 1063 | 1070 | 1074 | 1092 | 1063 |
| 5 | 5002 | 4627 | 4651 | 4674 | 4752 | 4626 |
| 6 | 4993 | 4812 | 4838 | 4862 | 4944 | 4812 |
| 7 | 5001 | 5404 | 5432 | 5460 | 5552 | 5402 |
| 8 | 5001 | 8209 | 8253 | 8293 | 8431 | 8208 |
| 9 | 5001 | 1159 | 1165 | 1171 | 1191 | 1159 |
| 10 | 4986 | 9433 | 9483 | 9531 | 9691 | 9427 |
| Total | 49 909 | 50 012 | 50 279 | 50 529 | 51 377 | 50 001 |
| Individuals | ||||||
| 1 | 17 434 | 20 272 | 20 377 | 20 490 | 20 825 | 20 269 |
| 2 | 17 409 | 5411 | 5439 | 5464 | 5559 | 5402 |
| 3 | 17 675 | 27 860 | 28 011 | 28 155 | 28 622 | 27 868 |
| 4 | 17 236 | 3712 | 3737 | 3750 | 3811 | 3714 |
| 5 | 17 417 | 16 207 | 16 288 | 16 372 | 16 639 | 16 200 |
| 6 | 17 496 | 16 833 | 16 928 | 17 018 | 17 300 | 16 837 |
| 7 | 17 556 | 18 906 | 19 002 | 19 109 | 19 429 | 18 901 |
| 8 | 17 672 | 28 727 | 28 883 | 29 019 | 29 508 | 28 732 |
| 9 | 17 383 | 4061 | 4081 | 4104 | 4171 | 4061 |
| 10 | 17 435 | 33 001 | 33 193 | 33 357 | 33 925 | 32 997 |
| Total | 174 713 | 174 990 | 175 939 | 176 838 | 179 789 | 174 981 |
Method 1, stratified sampling of a constant number of clusters; Method 2, two-stage cluster sampling of households.
Root mean squared error of 1000 estimates by strata and sample design
| Stratum ID | Method 1 | Method 2 (by number of sample clusters) | ||||
| 1000 | 2000 | 3000 | 4000 | 5000 | ||
| Mean of continuous | ||||||
| 1 | 0.522 | 0.504 | 0.320 | 0.282 | 0.237 | 0.232 |
| 2 | 0.498 | 0.931 | 0.708 | 0.593 | 0.542 | 0.541 |
| 3 | 0.540 | 0.418 | 0.297 | 0.272 | 0.224 | 0.186 |
| 4 | 0.653 | 1.067 | 0.756 | 0.684 | 0.546 | 0.557 |
| 5 | 0.502 | 0.569 | 0.438 | 0.375 | 0.330 | 0.282 |
| 6 | 0.486 | 0.534 | 0.400 | 0.322 | 0.301 | 0.276 |
| 7 | 0.511 | 0.459 | 0.342 | 0.285 | 0.250 | 0.214 |
| 8 | 0.526 | 0.406 | 0.327 | 0.258 | 0.221 | 0.204 |
| 9 | 0.554 | 1.050 | 0.830 | 0.679 | 0.556 | 0.563 |
| 10 | 0.475 | 0.374 | 0.264 | 0.246 | 0.202 | 0.172 |
| Total | 0.190 | 0.168 | 0.119 | 0.107 | 0.083 | 0.082 |
| Proportion of | ||||||
| 1 | 0.013 | 0.013 | 0.008 | 0.007 | 0.006 | 0.006 |
| 2 | 0.008 | 0.017 | 0.013 | 0.010 | 0.010 | 0.010 |
| 3 | 0.012 | 0.009 | 0.006 | 0.006 | 0.005 | 0.004 |
| 4 | 0.014 | 0.021 | 0.016 | 0.014 | 0.012 | 0.012 |
| 5 | 0.013 | 0.015 | 0.011 | 0.010 | 0.009 | 0.007 |
| 6 | 0.013 | 0.015 | 0.011 | 0.009 | 0.009 | 0.008 |
| 7 | 0.012 | 0.011 | 0.008 | 0.007 | 0.006 | 0.006 |
| 8 | 0.010 | 0.007 | 0.006 | 0.005 | 0.004 | 0.004 |
| 9 | 0.013 | 0.023 | 0.018 | 0.015 | 0.013 | 0.014 |
| 10 | 0.012 | 0.010 | 0.007 | 0.007 | 0.005 | 0.005 |
| Total | 0.004 | 0.004 | 0.003 | 0.003 | 0.002 | 0.002 |
Method 1, stratified sampling of a constant number of clusters; Method 2, two-stage cluster sampling of households.