Michael R Elliott1. 1. Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor 48109, USA. mrelliot@umich.edu
Abstract
PURPOSE: Statistical efficiency and cost efficiency can be achieved in population-based samples through stratification and/or clustering. Strata typically combine subgroups of the population that are similar with respect to an outcome. Clusters are often taken from preexisting units, but may be formed to minimize between-cluster variance, or to equalize exposure to a treatment or risk factor. Area probability sample design procedures for the National Children's Study required contiguous strata and clusters that maximized within-stratum and within-cluster homogeneity while maintaining approximately equal size of the strata or clusters. However, there were few methods that allowed such strata or clusters to be constructed under these contiguity and equal size constraints. METHODS: A search algorithm generates equal-size cluster sets that approximately span the space of all possible clusters of equal size. An optimal cluster set is chosen based on analysis of variance and convexity criteria. RESULTS: The proposed algorithm is used to construct 10 strata based on demographics and air pollution measures in Kent County, MI, following census tract boundaries. A brief simulation study is also conducted. CONCLUSIONS: The proposed algorithm is effective at uncovering underlying clusters from noisy data. It can be used in multi-stage sampling where equal-size strata or clusters are desired.
PURPOSE: Statistical efficiency and cost efficiency can be achieved in population-based samples through stratification and/or clustering. Strata typically combine subgroups of the population that are similar with respect to an outcome. Clusters are often taken from preexisting units, but may be formed to minimize between-cluster variance, or to equalize exposure to a treatment or risk factor. Area probability sample design procedures for the National Children's Study required contiguous strata and clusters that maximized within-stratum and within-cluster homogeneity while maintaining approximately equal size of the strata or clusters. However, there were few methods that allowed such strata or clusters to be constructed under these contiguity and equal size constraints. METHODS: A search algorithm generates equal-size cluster sets that approximately span the space of all possible clusters of equal size. An optimal cluster set is chosen based on analysis of variance and convexity criteria. RESULTS: The proposed algorithm is used to construct 10 strata based on demographics and air pollution measures in Kent County, MI, following census tract boundaries. A brief simulation study is also conducted. CONCLUSIONS: The proposed algorithm is effective at uncovering underlying clusters from noisy data. It can be used in multi-stage sampling where equal-size strata or clusters are desired.
Authors: Philip J Landrigan; Leonardo Trasande; Lorna E Thorpe; Charon Gwynn; Paul J Lioy; Mary E D'Alton; Heather S Lipkind; James Swanson; Pathik D Wadhwa; Edward B Clark; Virginia A Rauh; Frederica P Perera; Ezra Susser Journal: Pediatrics Date: 2006-11 Impact factor: 7.124
Authors: Jean M Kerver; Michael R Elliott; Gwendolyn S Norman; Robert J Sokol; Daniel P Keating; Glenn E Copeland; Christine C Johnson; Kendall K Cislo; Kirsten H Alcser; Shonda R Kruger-Ndiaye; Beth-Ellen Pennell; Shobha Mehta; Christine L M Joseph; Nigel Paneth Journal: Paediatr Perinat Epidemiol Date: 2013-05 Impact factor: 3.980
Authors: Nitin B Jain; Laurence D Higgins; Elena Losina; Jamie Collins; Philip E Blazar; Jeffrey N Katz Journal: BMC Musculoskelet Disord Date: 2014-01-08 Impact factor: 2.362