| Literature DB >> 29892693 |
Juan Merlo1,2, Philippe Wagner1,3, Peter C Austin4, S V Subramanian5, George Leckie1,6.
Abstract
To be relevant for public health, a context (e.g., neighborhood, school, hospital) should influence or affect the health status of the individuals included in it. The greater the influence of the shared context, the higher the correlation of subject outcomes within that context is likely to be. This intra-context or intra-class correlation is of substantive interest and has been used to quantify the magnitude of the general contextual effect (GCE). Furthermore, ignoring the intra-class correlation in a regression analysis results in spuriously narrow 95% confidence intervals around the estimated regression coefficients of the specific contextual variables entered as covariates and, thereby, overestimates the precision of the estimated specific contextual effects (SCEs). Multilevel regression analysis is an appropriate methodology for investigating both GCEs and SCEs. However, frequently researchers only report SCEs and disregard the study of the GCE, unaware that small GCEs lead to more precise estimates of SCEs so, paradoxically, the less relevant the context is, the easier it is to detect (and publish) small but "statistically significant" SCEs. We describe this paradoxical situation and encourage researchers performing multilevel regression analysis to consider simultaneously both the GCE and SCEs when interpreting contextual influences on individual health.Entities:
Year: 2018 PMID: 29892693 PMCID: PMC5993177 DOI: 10.1016/j.ssmph.2018.05.006
Source DB: PubMed Journal: SSM Popul Health ISSN: 2352-8273
Numerical representation of three different Scenarios (A, B and C) obtained from a Ready Reckoner (Available In The Appendix With Extended Information). The fictitious example analyzes neighborhoods’ observational effects on systolic blood pressure in a sample of 1250 individuals residing in 50 neighborhoods with 25 residents in each. The example represents a multilevel linear regression analysis. Model 1 is a two-level variance-components model. Model 2 extends Model 1 by entering the cluster-level variable neighborhood socioeconomic deprivation. The outcome and covariate are standard normalized.
| Scenario A | Scenario B | Scenario C | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Model 1 | Model 2 | Model 1 | Model 2 | Model 1 | Model 2 | ||||
| Neighborhood socioeconomic deprivation | |||||||||
| Regression coefficient (1 SD increase) | 0.671 | 0.097 | 0.134 | ||||||
| 95% Confidence interval | 0.629 | 0.712 | 0.042 | 0.153 | -0.053 | 0.320 | |||
| 0.000 | 0.000 | 0.161 | |||||||
| Intraclass correlation coefficient (as a %) | 45.00% | 0.05% | 1.00% | 0.05% | 45.00% | 44.00% | |||
| Neighborhood-level variance | 0.450 | 0.0003 | 0.010 | 0.0005 | 0.450 | 0.432 | |||
| Individual level variance | 0.550 | 0.550 | 0.990 | 0.990 | 0.550 | 0.550 | |||
| Total residual variance | 1.000 | 0.550 | 1.000 | 0.990 | 1.000 | 0.990 | |||
| 99.9% | 95.0% | 4.0% | |||||||
| 106 | 1235 | 1008 | 1235 | 106 | 108 | ||||
| 1.000 | 0.931 | 0.288 | |||||||