| Literature DB >> 32617366 |
Diane Hindmarsh1,2, David Steel2.
Abstract
Regular health surveys can produce reliable estimates at higher geographic levels but not for small areas. Alternatives are to aggregate data over several years or use model-based methods. We created and evaluated model-based estimates for four health-related outcomes by gender, for 153 Local Government Areas using data from the New South Wales Population Health Survey. The evaluation examined evidence on bias and determined the covariates available and appropriate for each outcome variable. The evaluation considered the likely precision of the resulting estimates. The bias and precision of results for single years (2006-2008) for each outcome variable using six covariate specifications were compared with direct survey estimates based on a single year's data and those obtained by aggregating over seven years. A practical issue is how to choose covariates to include in the models as the best covariate specification varies between outcome variables. Model-based results had median root mean squared errors between 3.3% and 5.5% (max 5.2% and 11.3% respectively) and median relative root mean squared errors between 6.8% and 24.5% (max 11.7% and 41.5% respectively). The model-based estimates were unbiased compared with direct estimates based on one or seven years of data and when aggregated to a point where direct estimates were reliable. The bias and reliability assessment process provides a way for policymakers to have confidence in model-based estimates.Entities:
Keywords: SAE; health; population health; risk alcohol drinking; risk factors; small area estimation; smoking rates; survey
Year: 2020 PMID: 32617366 PMCID: PMC7327397 DOI: 10.3934/publichealth.2020034
Source DB: PubMed Journal: AIMS Public Health ISSN: 2327-8994
Covariate specifications studied in this analysis.
| Abbreviation | Covariates included | Comments |
| Covariates consistent between outcome variables | ||
| Null | Intercept only model | |
| Age | 10-year age group only | |
| ONS | 10-year age group, AHS and quintile of IRSD | Global model that includes contextual effects. Based on model used for small area estimates by the Office of National Statistics (ONS) in the UK |
| Global | All covariates included in any of the outcome-specific Comm models (see below for Comm model) | Consistent Global model includes unit-level and contextual effects (quintile of IRSD and AHS) |
| Covariates in model differ between outcome variables | ||
| Specific (Spec) | Outcome and model-specific. | Theoretically this is the best possible model, but it is inconsistent over time, so not particularly useful when developing methods for routine reporting. Model selection was undertaken without inclusion of the random effect |
| Common (Comm) | Covariates that are significant in the majority of years for that sex/outcome | Outcome-specific model; i.e. same within each outcome by sex and year |
Descriptive statistics for response numbers at LGA level, by sex. Overall numbers of responses and sample size applicable for BMI outcome variable, 2002–2008 combined.
| Gender | Min | Q1 | Median | Q3 | Max | Number of LGAs with | ||
| GT 150 responses | GT 300 responses | |||||||
| Number of respondents to surveys | Female | 16 | 91 | 201 | 392 | 1178 | 88 | 53 |
| Male | 16 | 69 | 126 | 283 | 777 | 70 | 38 | |
| Sample size for BMI question | Female | 14 | 79 | 170 | 334 | 1047 | 81 | 45 |
| Male | 13 | 62 | 112 | 253 | 694 | 66 | 31 | |
Summary statistics for direct estimates, SMK, by sex, 2002–2008 combined.
| Gender | Type | Min | Median | Mean | Max |
| Male | Sample size | 15 | 113 | 182.4 | 709 |
| DE | 6.4% | 19.2% | 20.1% | 38.4% | |
| SE | 1.5% | 4.0% | 4.9% | 13.7% | |
| RSE | 7.9% | 23.1% | 24.3% | 96.4% | |
| Female | Sample size | 15 | 181 | 263.5 | 1080 |
| DE | 3.3% | 17.0% | 17.0% | 49.6% | |
| SE | 1.3% | 3.3% | 3.8% | 13.2% | |
| RSE | 7.4% | 20.8% | 23.7% | 86.6% |
Variables that were included in the common model, by outcome and sex.
| Covariate | Categories | Reference Category | SMK | ALC | HDIFF | BMI | ||||
| M | F | M | F | M | F | M | F | |||
| 10yr age grp | 7 | 75+ | Y | Y | Y | Y | Y | Y | Y | Y |
| Health Area | 8 | Greater Western AHS | Y | Y | Y | Y | Y | Y | ||
| Marital status | 4 | Single | Y | Y | Y | Y | Y | Y | ||
| Education | 4 | Higher education | Y | Y | Y | Y | Y | Y | ||
| Country of birth | 2 | Australian-born | Y | Y | Y | Y | Y | |||
| Quintile of IRSD | 5 | Most disadvantaged | Y | Y | Y | Y | Y | |||
| Private health | 2 | No private health cover | Y | Y | Y | Y | Y | |||
| Employment status | 2 | No job | Y | Y | Y | Y | ||||
| Language spoken at home | 2 | English | Y | Y | ||||||
| Remoteness | 3 | Outer regional and remote | Y | Y | ||||||
| Pension | 2 | No pension | Y | Y | ||||||
| Household size | 5 | Single person household | Y | |||||||
Maximum and Median estimated RMSE of EBP and logistic synthetic estimates, by sex, covariate specification group and outcome variable.
| Sex | Outcome variable | Covariate specification group* | Median | Maximum | ||
| EBP | Synth | EBP | Synth | |||
| Male | BMI | null, age | 3.8% | 1.0% | 4.1% | 1.2% |
| Other | 3.7% | 2.9% | 4.5% | 3.8% | ||
| ALC | null, age | 5.5% | 1.1% | 6.2% | 1.3% | |
| Other | 4.0% | 3.1% | 4.6% | 3.7% | ||
| HDIFF | null, age | 6.1% | 0.9% | 11.5% | 1.0% | |
| Other | 5.5% | 3.1% | 9.1% | 4.5% | ||
| SMK | null, age | 4.4% | 1.0% | 5.4% | 1.1% | |
| Other | 3.2% | 2.2% | 4.7% | 3.2% | ||
| Female | BMI | null, age | 5.2% | 1.0% | 5.9% | 1.1% |
| Other | 3.6% | 2.7% | 4.2% | 3.3% | ||
| ALC | null, age | 5.2% | 0.9% | 6.4% | 1.1% | |
| Other | 3.9% | 2.6% | 5.0% | 3.5% | ||
| HDIFF | null, age | 6.8% | 1.1% | 14.6% | 1.1% | |
| Other | 6.7% | 3.4% | 10.9% | 4.4% | ||
| SMK | null, age | 3.9% | 0.7% | 5.3% | 0.9% | |
| Other | 3.5% | 1.9% | 5.4% | 2.8% | ||
Note: * Null, age—averaged over the null and age-only models; Other—averaged over ONC, Specific, Global and common models.
Estimated Approximate Intraclass correlation (ICC) of logistic mixed models, by covariate specification, outcome variable and sex, 2008.
| Outcome variable | Sex | null | age | glob | ONS | comm | spec |
| BMI | Male | 1.3% | 1.2% | 0.6% | 1.2% | 0.4% | 0.1% |
| Female | 2.1% | 2.5% | 0.6% | 0.9% | 0.6% | 0.8% | |
| ALC | Male | 1.6% | 1.9% | 0.0% | 0.8% | 0.1% | 0.1% |
| Female | 2.0% | 2.5% | 0.2% | 1.0% | 0.3% | 0.3% | |
| HDIFF | Male | 10.9% | 11.1% | 2.2% | 3.3% | 1.9% | 1.9% |
| Female | 10.9% | 12.3% | 4.5% | 5.3% | 4.5% | 4.5% | |
| SMK | Male | 1.5% | 1.2% | 0.4% | 0.4% | 0 | 0 |
| Female | 3.9% | 4.0% | 2.1% | 2.7% | 2.3% | 2.3% |
Note: 0 indicates that the random error term was zero for these models; See [15] for similar results for 2006 and 2007.
Unit level adjusted R2 and area-level pseudo-R2 values, averaged over years, by sex and outcome variable, specific (Spec) model.
| Male | Female | |||
| Unit level R2 (adj) | Area-level Pseudo R2 | Unit level R2 (adj) | Area-level Pseudo R2 | |
| SMK | 17% | 25% | 16% | 24% |
| ALC | 6% | 22% | 12% | 31% |
| HDIFF | 11% | 43% | 12% | 51% |
| BMI | 9% | 22% | 7% | 40% |
Figure 1.Difference between EBP and 1-year DE at LGA level, by outcome and gender, 2008.
Proportion of aggregated model-based estimates that lie within the 95% confidence interval around the direct estimate at health area level and by quintile of IRSD, by outcome and covariate specification.
| Null | Age | Global | ONS | Common | Specific | Average^ | |
| Health Area: approx * 48 observations per comparison | |||||||
| ALC | 81% | 79% | 95% | 100% | 96% | 96% | 97% |
| BMI | 77% | 77% | 55% | 96% | 94% | 96% | 95% |
| HDIFF | 87% | 90% | 100% | 100% | 100% | 100% | 100% |
| SMK | 92% | 90% | 100% | 100% | 98% | 98% | 99% |
| Quintile of the index of socioeconomic disadvantage (IRSD): approx* 30 observations per comparison | |||||||
| ALC | 93% | 93% | 84% | 97% | 100% | 100% | 95% |
| BMI | 67% | 70% | 88% | 93% | 90% | 93% | 91% |
| HDIFF | 77% | 73% | 63% | 80% | 72% | 70% | 71% |
| SMK | 73% | 70% | 83% | 87% | 80% | 77% | 82% |
Note: * Sample sizes differ due to non-convergence for three sex-year models; ˆAverage is the average of Global, ONS, Common and Specific models.
Information on final models for the four outcome variables.
| Outcome | Type of model | Variables included |
| ALC | Common | F: Age group, AHS, born in Australia, employment status, language spoken at home, aged pension status, household size, marital status, education level |
| M: Age group, AHS, born in Australia, employment status, language spoken at home | ||
| BMI | ONS | Age group, AHS and quintile of IRSD |
| HDIFF | ONS | Age group, AHS and quintile of IRSD |
| SMK | Global | Age group, AHS, born in Australia, employment status, language spoken at home, aged pension status, household size, marital status, education level, IRSD, private health status, ARIA |
Various statistics of LGA-level direct estimates based on aggregated data and model-based estimates for 2008, by outcome and sex.
| Outcome variable | Sex | Type of Estimate | Min | Q1 | Median | Q3 | Max |
| ALC | Male | DE0208 | 10.7% | 37.6% | 44.5% | 49.4% | 77.7% |
| Model-based | 22.7% | 38.2% | 43.8% | 45.7% | 49.2% | ||
| Female | DE0208 | 8.5% | 25.8% | 30.3% | 35.3% | 57.5% | |
| Model-based | 13.5% | 29.3% | 31.4% | 32.8% | 39.6% | ||
| BMI | Male | DE0208 | 36.4% | 57.7% | 63.4% | 71.7% | 94.0% |
| Model-based | 49.7% | 59.2% | 62.1% | 65.9% | 71.0% | ||
| Female | DE0208 | 19.6% | 44.4% | 50.6% | 55.7% | 72.1% | |
| Model-based | 34.4% | 45.0% | 50.7% | 53.7% | 60.4% | ||
| HDIFF | Male | DE0208 | 0.0% | 11.9% | 19.7% | 28.1% | 81.7% |
| Model-based | 5.9% | 11.9% | 22.6% | 26.2% | 36.6% | ||
| Female | DE0208 | 4.3% | 17.5% | 30.0% | 40.5% | 79.5% | |
| Model-based | 12.2% | 19.1% | 29.3% | 37.5% | 61.5% | ||
| SMK | Male | DE0208 | 6.5% | 16.2% | 19.7% | 24.8% | 39.4% |
| Model-based | 10.6% | 15.1% | 17.8% | 20.9% | 27.7% | ||
| Female | DE0208 | 3.5% | 14.1% | 18.0% | 21.3% | 52.7% | |
| Model-based | 9.5% | 14.8% | 17.0% | 19.5% | 29.2% |
Median and maximum SE and RSE of direct estimates based on data aggregated between 2002 and 2008 (DE0208), compared with estimated RRMSE and RMSE of final logistic EBP estimates, by outcome, summarised over 3 years of estimates.
| RMSE | RRMSE | |||||||
| Male | Female | Male | Female | |||||
| Med | Max | Med | Max | Med | Max | Med | Max | |
| ALC | ||||||||
| DE0208 | 5.5 | 15 | 4.4 | 15.9 | 13.9 | 44.8 | 14.9 | 50 |
| EBP | 3.0 | 5.5 | 3.6 | 6.4 | 7.8 | 19.8 | 12.8 | 25.8 |
| BMI | ||||||||
| DE0208 | 5.8 | 15.2 | 5.1 | 16.1 | 9.2 | 29.0 | 10.9 | 35.7 |
| EBP | 4.0 | 6.0 | 4.0 | 5.2 | 6.8 | 11.7 | 8.6 | 12.5 |
| HDIFF | ||||||||
| DE0208 | 4.2 | 16.7 | 4.0 | 17.7 | 25.1 | 102.1 | 17.9 | 51.5 |
| EBP | 5.4 | 9.9 | 6.5 | 11.3 | 27.9 | 40.1 | 24.5 | 41.5 |
| SMK | ||||||||
| DE0208 | 4.4 | 15.6 | 3.3 | 14 | 23.1 | 96.4 | 20.8 | 86.6 |
| EBP | 3.2 | 9.1 | 3.7 | 7.7 | 17.5 | 33.1 | 23.1 | 40.2 |
Note: EBP results are for final model, as indicated in Table 9.