| Literature DB >> 29937587 |
Ron Johnston1, Kelvyn Jones1, David Manley1.
Abstract
Many ecological- and individual-level analyses of voting behaviour use multiple regressions with a considerable number of independent variables but few discussions of their results pay any attention to the potential impact of inter-relationships among those independent variables-do they confound the regression parameters and hence their interpretation? Three empirical examples are deployed to address that question, with results which suggest considerable problems. Inter-relationships between variables, even if not approaching high collinearity, can have a substantial impact on regression model results and how they are interpreted in the light of prior expectations. Confounded relationships could be the norm and interpretations open to doubt, unless considerable care is applied in the analyses and an extended principal components method for doing that is introduced and exemplified.Entities:
Keywords: Collinearity; Confounding; Regression analysis; Voting behaviour
Year: 2017 PMID: 29937587 PMCID: PMC5993839 DOI: 10.1007/s11135-017-0584-6
Source DB: PubMed Journal: Qual Quant ISSN: 0033-5177
Ordinary least squares regressions of the percentage voting UKIP by constituency in England and Wales at the 2015 UK general election
| a | b1Qual | b2Age | b3Deprive | b4Students | r2/R2 | |
|---|---|---|---|---|---|---|
| S | ||||||
| Model 1a | − 5.53 | 0.55 | 0.52 | |||
| (0.81) | (0.02) | |||||
| Model 1b | 6.00 | 0.49 | 0.14 | |||
| (0.89) | (0.05) | |||||
| Model 1c | 8.48 | 0.23 | 0.07 | |||
| (0.93) | (0.04) | |||||
| Model 1d | 17.88 | − 0.64 | 0.17 | |||
| (0.39) | (0.06) | |||||
|
| ||||||
| Model 2a | − 9.29 | 0.51 | 0.30 | 0.57 | ||
| (0.90) | (0.04) | (0.02) | ||||
| Model 2b | − 4.08 | 0.79 | − 0.41 | 0.63 | ||
| (0.72) | (0.03) | (0.03) | ||||
| Model 2c | − 2.26 | 0.50 | − 0.27 | 0.55 | ||
| (0.97) | (0.02) | (0.05) | ||||
|
| ||||||
| Model 3a | − 7.40 | 0.50 | 0.26 | − 0.11 | 0.57 | |
| (1.27) | (0.02) | (0.04) | (0.05) | |||
| Model 3b | − 4.37 | 0.78 | 0.02 | − 0.40 | 0.63 | |
| (0.97) | (0.3) | (0.05) | (0.04) | |||
| Model 3c | − 5.48 | 0.85 | − 0.48 | 0.14 | 0.63 | |
| (0.91) | (0.04) | (0.04) | (0.06) | |||
|
| ||||||
| Model 4 | − 6.14 | 0.84 | 0.04 | − 0.46 | 0.14 | 0.63 |
| (1.19) | (0.40) | (0.05) | (0.05) | (0.06) | ||
Figures in brackets are the standard errors of the regression coefficients
Loadings on the principal components factor analyses of the four independent variables deployed in the regressions in Table 1
| Factor | Varimax rotated | |||
|---|---|---|---|---|
| Variable/Factor | 1 | 2 | 1 | 2 |
| Qualifications | 0.65 | − 0.60 | 0.87 | − 0.10 |
| Age | 0.82 | 0.53 | 0.34 | 0.91 |
| Deprivation | 0.31 | 0.92 | − 0.30 | 0.93 |
| Students | − 0.77 | 0.43 | − 0.87 | − 0.10 |
Logistic regressions of voting labour at the 2011 constituency-level elections to the National Assembly of Wales
| a | X1 | X2 | X3 | |
|---|---|---|---|---|
|
| ||||
| Coefficient | − 0.089 | 3.085 | ||
| SE | (0.065) | (0.129) | ||
| Exponent | 0.915 | 21.858 | ||
| r2 | 0.437 | |||
|
| ||||
| Coefficient | − 0.350 | 1.754 | ||
| SE | (0.053) | (0.106) | ||
| Exponent | 0.705 | 5.780 | ||
| r2 | 0.191 | |||
|
| ||||
| Coefficient | − 0.356 | 1.149 | ||
| SE | (0.054) | (0.107) | ||
| Exponent | 0.701 | 3.154 | ||
| r2 | 0.079 | |||
|
| ||||
| Coefficient | − 0.217 | 1.574 | 0.703 | |
| SE | (0.059) | (0.110) | (0.118) | |
| Exponent | 0.805 | 4.824 | 2.020 | |
| R2 | 0.212 | |||
|
| ||||
| Coefficient | 0.098 | 2.889 | 1.399 | |
| SE | (0.069) | (0.134) | (0.129) | |
| Exponent | 1.998 | 17.974 | 4.051 | |
| R2 | 0.492 | |||
|
| ||||
| Coefficient | 0.081 | 2.990 | 0.809 | |
| SE | (0.072) | (0.130) | (0.135) | |
| Exponent | 1.084 | 19.882 | 2.246 | |
| R2 | 0.454 | |||
|
| ||||
| Coefficient | 0.173 | 2.848 | 1.285 | 0.444 |
| SE | (0.074) | (0.134) | (0.134) | (0.143) |
| Exponent | 1.189 | 17.262 | 3.616 | 1.559 |
| R2 | 0.496 | |||
The independent variables are X 1 voted Labour in 2007, X 2 strongly likes the labour party X 3 strongly likes the Labour party leader, Carwyn Jones
Further logistic regressions of voting labour at the 2011 constituency-level elections to the National Assembly of Wales
| a | X4 | X5 | X6 | X7 | X8 | X9 | |
|---|---|---|---|---|---|---|---|
|
| |||||||
| Coefficient | − 0.471 | 2.035 | |||||
| SE | (0.053) | (0.106) | |||||
| Exponent | 0.625 | 7.649 | |||||
| r2 | 0.258 | ||||||
|
| |||||||
| Coefficient | − 0.538 | 1.371 | |||||
| SE | (0.050) | (0.100) | |||||
| Exponent | 0.584 | 3.941 | |||||
| r2 | 0.132 | ||||||
|
| |||||||
| Coefficient | − 0.361 | 1.220 | |||||
| SE | (0.053) | (0.106) | |||||
| Exponent | 0.697 | 3.387 | |||||
| r2 | 0.092 | ||||||
|
| |||||||
| Coefficient | − 0.630 | 1.326 | |||||
| SE | (0.050) | (0.100) | |||||
| Exponent | 0.532 | 3.768 | |||||
| r2 | 0.125 | ||||||
|
| |||||||
| Coefficient | − 0.372 | 1.189 | |||||
| SE | (0.053) | (0.105) | |||||
| Exponent | 0.698 | 3.284 | |||||
| r2 | 0.088 | ||||||
|
| |||||||
| Coefficient | − 0.599 | 1.244 | |||||
| SE | (0.049) | (0.099) | |||||
| Exponent | 0.549 | 3.470 | |||||
| r2 | 0.111 | ||||||
|
| |||||||
| Coefficient | − 0.499 | 0.633 | 0.224 | 0.671 | 0.191 | 0.353 | |
|
| |||||||
| SE | (0.060) | (0.135) | (0.141) | (0.124) | (0.141) | (0.136) | |
| Exponent | 0.607 | 1.884 | 1.251 | 1.955 | 1.211 | 1.423 | |
| R2 | 0.179 | ||||||
|
| |||||||
| Coefficient | − 0.497 | 1.663 | 0.258 | 0.088 | 0.538 | − 0.140 | 0.111 |
| SE | (0.063) | (0.129) | (0.148) | (0.150) | (0.132) | (0.153) | (0.146) |
| Exponent | 0.609 | 5.273 | 1.294 | 1.092 | 1.712 | 0.869 | 1.117 |
| R2 | 0.280 | ||||||
The independent variables: are X 1 voted Labour in 2007, X 2 strongly likes the Labour party, X 3 strongly likes the Labour party leader, Carwyn Jones, X 4 Labour at good running Wales, X 5 Welsh government handled NHS well, X 6 Welsh government handled schools well, X 7 Welsh government handled University tuition fees well, X 8 Welsh government handled economy well, X 9 Welsh government handled Welsh interests well
Loadings from the principal components factor analyses of the data analysed in Table 4, and the results of logistic regression analyses using the related factor scores as independent variables to predict voting labour at the 2011 constituency-level election to the National Assembly of Wales
| Variable | FIa | FIb | FIab | FIIab |
|---|---|---|---|---|
| X1 | 0.702 | 0.231 | 0.787 | |
| X2 | 0.775 | 0.315 | 0.673 | |
| X3 | 0.703 | 0.332 | 0.683 | |
| X4 | 0.734 | 0.690 | 0.617 | |
| X5 | 0.819 | 0.815 | 0.381 | |
| X6 | 0.771 | 0.787 | 0.250 | |
| X7 | 0.737 | 0.734 | 0.352 | |
| X8 | 0.784 | 0.798 | 0.260 | |
| X9 | 0.803 | 0.805 | 0.341 |
Y = − 0.763 + 1.429FIa r2 = 0.403
(0.058) (0.066)
Y = − 0.707 + 0.892FIb r2 = 0.211
(0.053) (0.052)
Y = − 0.820 + 1.244FIa + 0.518FIb R2 = 0.438
(0.061) (0.068) (0.061)
Y = − 0.833 + 0.464FIab + 1.328FIIab R2 = 0.449
(0.062) (0.060) (0.069)
Logistic regressions of the data in “Appendix”
| a | X1 | X21 | X22 | X23 | X24 | |
|---|---|---|---|---|---|---|
|
| ||||||
| Model 1 | ||||||
| Coefficient | − 0.458 | 1.492 | ||||
| SE | (0.058) | (0.116) | ||||
| Exponent | 0.632 | 4.444 | ||||
| Nagelkerke r2 | 0.149 | . | ||||
|
| ||||||
| Model 2a | ||||||
| Coefficient | − 0.549 | 0.523 | ||||
| SE | (0.056) | (0.112) | ||||
| Exponent | 0.577 | 1.687 | ||||
| Nagelkerke r2 | 0.020 | |||||
| Model 2b | ||||||
| Coefficient | − 0.347 | 4.277 | ||||
| SE | (0.087) | (0.173) | ||||
| Exponent | 0.707 | 72.000 | ||||
| Nagelkerke r2 | 0.662 | |||||
| Model 2c | ||||||
| Coefficient | − 0.347 | 4.277 | ||||
| SE | (0.087) | (0.173) | ||||
| Exponent | 0.707 | 72.000 | ||||
| Nagelkerke r2 | 0.662 | |||||
| Model 2d | ||||||
| Coefficient | − 0.394 | 2.621 | ||||
| SE | (0.066) | (0.131) | ||||
| Exponent | 0.674 | 13.750 | ||||
| Nagelkerke r2 | 0.375 | |||||
|
| ||||||
| Model 3a | ||||||
| Coefficient | − 0.409 | 1.456 | 0.392 | |||
| SE | (0.060) | (0.117) | (0.119) | |||
| Exponent | 0.664 | 4.288 | 1.480 | |||
| Nagelkerke R2 | 0.158 | |||||
| Model 3b | ||||||
| Coefficient | − 0.261 | 0.846 | 4.110 | |||
| SE | (0.089) | (0.179) | (0.175) | |||
| Exponent | 0.771 | 2.330 | 60.957 | |||
| Nagelkerke R2 | 0.672 | |||||
| Model 3c | ||||||
| Coefficient | − 0.458 | − 19.433 | 23.122 | |||
| SE | (0.089) | (2289.293) | (2289.293) | |||
| Exponent | (0.632) | 0.000 | 0.000 | |||
| Nagelkerke R2 | 0.700 | |||||
| Model 3d | ||||||
| Coefficient | − 0.458 | − 20.121 | 22.423 | |||
| SE | (0.067) | (3229.065) | (3229.065) | |||
| Exponent | 0.632 | 0.000 | 5,474,103,965.0 | |||
| Nagelkerke R2 | 0.428 | |||||
| Qualifications | 2.03 | Age | 3.53 |
| Deprivation | 1.89 | Students | − 2.61 |
The constructed data set
| Observation |
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| 1 | 1 | 1 | 0 | 1 | 1 | 1 |
| 2 | 1 | 1 | 1 | 1 | 1 | 1 |
| 3 | 1 | 1 | 0 | 1 | 1 | 1 |
| 4 | 1 | 1 | 0 | 1 | 1 | 1 |
| 5 | 1 | 0 | 1 | 0 | 1 | 1 |
| 6 | 1 | 0 | 0 | 0 | 0 | 1 |
| 7 | 1 | 0 | 1 | 0 | 0 | 0 |
| 8 | 0 | 1 | 0 | 1 | 1 | 1 |
| 9 | 0 | 1 | 1 | 1 | 1 | 0 |
| 10 | 0 | 1 | 1 | 0 | 0 | 0 |
| 11 | 0 | 0 | 1 | 1 | 0 | 0 |
| 12 | 0 | 0 | 1 | 0 | 0 | 0 |
| 13 | 0 | 0 | 0 | 0 | 0 | 0 |
| 14 | 0 | 0 | 0 | 0 | 0 | 0 |
| 15 | 0 | 0 | 0 | 0 | 0 | 0 |
| 16 | 0 | 0 | 0 | 0 | 0 | 0 |
| 17 | 0 | 0 | 0 | 0 | 0 | 0 |
| 18 | 0 | 0 | 0 | 0 | 0 | 0 |
| 19 | 0 | 0 | 0 | 0 | 0 | 0 |
| 20 | 0 | 0 | 0 | 0 | 0 | 0 |