| Literature DB >> 26956373 |
Christian Bottomley1, Matthew J Kirby2, Steve W Lindsay3, Neal Alexander4.
Abstract
BACKGROUND: Clustering commonly affects the uncertainty of parameter estimates in epidemiological studies. Cluster-robust variance estimates (CRVE) are used to construct confidence intervals that account for single-level clustering, and are easily implemented in standard software. When data are clustered at more than one level (e.g. village and household) the level for the CRVE must be chosen. CRVE are consistent when used at the higher level of clustering (village), but since there are fewer clusters at the higher level, and consistency is an asymptotic property, there may be circumstances under which coverage is better from lower- rather than higher-level CRVE. Here we assess the relative importance of adjusting for clustering at the higher and lower level in a logistic regression model.Entities:
Mesh:
Year: 2016 PMID: 26956373 PMCID: PMC4784323 DOI: 10.1186/s12874-016-0127-1
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Parameter values
| Parameter | Description | Values or range |
|---|---|---|
|
| Log odds when | log(0.1/0.9), log(0.2/0.8) |
|
| Conditional log odds ratios | log(1.1), log(2), log(5) |
|
| SD of household effect | log(1.1), log(2), log(5) |
|
| SD of village effect | log(1.05)–log(5) |
|
| No. individuals per household | 5, 20 |
|
| No. households per village | 20, 100 |
|
| No. of villages | 5, 20 |
Distribution of predictors (x 1- x 7) across villages (V1-V5) and the resulting intra-cluster correlation of the predictor
| Proportion of individuals positive | Intra-cluster correlation | |||||
|---|---|---|---|---|---|---|
| Predictor | V1 | V2 | V3 | V4 | V5 | |
|
| 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | –0.01 |
|
| 0 | 0.1 | 0.1 | 0.3 | 0.5 | 0.19 |
|
| 0 | 0.05 | 0.1 | 0.1 | 0.75 | 0.48 |
|
| 0 | 0 | 0 | 0 | 1 | 1 |
N.B. In the simulation with 20 villages we created 4 identical sets of villages using the proportions for V1-V5
Fig. 1Coverage of 95 % confidence intervals for the log odds ratio of a household-level predictor. The lines correspond to coverage of confidence intervals that adjust for clustering at the village-level, household-level or do not adjust for clustering. Coverage is presented as a function of the degree of village-level clustering in the outcome as measured by the intra-cluster correlation (ICC). The intra-cluster correlation of the predictor ranges between 0 (top) and 1 (bottom), for K=5 (left) or K=20 (right) villages. The remaining parameter values are α 0=log(0.1/0.9), α 1=σ =log(2), I=5, J=20