| Literature DB >> 31304696 |
Abstract
Multicollinearity represents a high degree of linear intercorrelation between explanatory variables in a multiple regression model and leads to incorrect results of regression analyses. Diagnostic tools of multicollinearity include the variance inflation factor (VIF), condition index and condition number, and variance decomposition proportion (VDP). The multicollinearity can be expressed by the coefficient of determination (Rh2) of a multiple regression model with one explanatory variable (Xh) as the model's response variable and the others (Xi [i ≠ h]) as its explanatory variables. The variance (σh2) of the regression coefficients constituting the final regression model are proportional to the VIF. Hence, an increase in Rh2 (strong multicollinearity) increases σh2. The larger σh2 produces unreliable probability values and confidence intervals of the regression coefficients. The square root of the ratio of the maximum eigenvalue to each eigenvalue from the correlation matrix of standardized explanatory variables is referred to as the condition index. The condition number is the maximum condition index. Multicollinearity is present when the VIF is higher than 5 to 10 or the condition indices are higher than 10 to 30. However, they cannot indicate multicollinear explanatory variables. VDPs obtained from the eigenvectors can identify the multicollinear variables by showing the extent of the inflation of σh2 according to each condition index. When two or more VDPs, which correspond to a common condition index higher than 10 to 30, are higher than 0.8 to 0.9, their associated explanatory variables are multicollinear. Excluding multicollinear explanatory variables leads to statistically stable multiple regression models.Entities:
Keywords: Biomedical research; Biostatistics; Multivariable analysis; Regression; Statistical bias; Statistical data analysis
Year: 2019 PMID: 31304696 PMCID: PMC6900425 DOI: 10.4097/kja.19087
Source DB: PubMed Journal: Korean J Anesthesiol ISSN: 2005-6419
Fig. 1.The effects of the coefficient of determination (Rj 2 ) from a regression model on the variance of a regression coefficient of interest [Var(β)]. The presence of multicollinearity (an increase in R2) inflates Var(β). X: jth explanatory variable of a regression model [Y)].
Raw Data from Reference [4]
| Serial number | PVV/GW (cm/s/100 g) | PSV/GW (cm/s/100 g) | EDV/GW (cm/s/100 g) | HVV/GW (cm/s/100 g) | GW/SLV (%) | GRWR (%) | Regeneration rate (%) |
|---|---|---|---|---|---|---|---|
| 1 | 16.36 | 8.9 | 3.47 | 6.02 | 57.42 | 1.11 | 158.76 |
| 2 | 26.68 | 21.22 | 3.53 | 12.07 | 61.38 | 1.36 | 197.19 |
| 3 | 12.49 | 16.62 | 2 | 8.88 | 67.42 | 1.47 | 144.73 |
| 4 | 8.45 | 22.86 | 6.71 | 7.46 | 69.94 | 1.31 | 140.06 |
| 5 | 10.19 | 14.23 | 4.75 | 2.06 | 65.68 | 1.25 | 129.71 |
| 6 | 19.53 | 17.35 | 1.95 | 7.54 | 59.63 | 1.14 | 162.59 |
| 7 | 20.65 | 10.48 | 2.21 | 4.88 | 59.42 | 1.07 | 178.48 |
| 8 | 22.96 | 14.23 | 4.25 | 3.69 | 75.08 | 1.73 | 120.9 |
| 9 | 21.22 | 21.64 | 4.1 | 11.94 | 43.42 | 0.87 | 191.24 |
| 10 | 8.11 | 3.16 | 0.78 | 8.82 | 75.12 | 1.47 | 150.03 |
| 11 | 24.74 | 7.84 | 1.68 | 3.68 | 57.65 | 1.08 | 173.44 |
| 12 | 11.38 | 15.71 | 3.56 | 7.2 | 39.93 | 0.74 | 211.98 |
| 13 | 15.82 | 15.04 | 2.4 | 9.89 | 51.27 | 1.02 | 193.49 |
| 14 | 8.36 | 9.01 | 2.01 | 3.4 | 50.52 | 0.94 | 164.04 |
| 15 | 12.04 | 9.72 | 2.27 | 6.03 | 51.6 | 1.05 | 156.97 |
| 16 | 10.97 | 4.58 | 1.73 | 5.55 | 56.63 | 1.03 | 208.36 |
| 17 | 7.97 | 9.33 | 0.57 | 4.17 | 79.09 | 1.61 | 154.62 |
| 18 | 7.46 | 6.11 | 1.73 | 2.99 | 57.2 | 1.07 | 137.38 |
| 19 | 29.09 | 15.71 | 3.41 | 9.35 | 56.44 | 1.1 | 180.15 |
| 20 | 10.3 | 8.54 | 2.32 | 10.78 | 60.43 | 1.17 | 228.47 |
| 21 | 7.82 | 4.41 | 1.07 | 4.19 | 59.52 | 1 | 153.62 |
| 22 | 14.71 | 6.29 | 1.77 | 6.16 | 65.05 | 1.3 | 121.31 |
| 23 | 8.54 | 6.73 | 1.27 | 5.52 | 65.65 | 1.17 | 157.37 |
| 24 | 23.05 | 11.34 | 5.39 | 3 | 33.57 | 0.63 | 211.27 |
| 25 | 13.12 | 5.86 | 1.89 | 10.92 | 52.93 | 0.9 | 178.16 |
| 26 | 7.41 | 9.11 | 2.05 | 5.5 | 53.72 | 0.91 | 174.89 |
| 27 | 14.59 | 5.59 | 1.26 | 3.75 | 58.62 | 1.14 | 142.98 |
| 28 | 8.52 | 6.52 | 1 | 6.92 | 56.61 | 1.11 | 165.59 |
| 29 | 18.97 | 6.35 | 2.94 | 5.61 | 56.41 | 1.07 | 141.54 |
| 30 | 35.41 | 36.36 | 14.23 | 15 | 41.52 | 0.89 | 238.22 |
| 31 | 4.55 | 1.27 | 3.13 | 2.83 | 70.91 | 1.27 | 138.42 |
| 32 | 22.59 | 28.7 | 10.51 | 10.35 | 32.74 | 0.66 | 247.45 |
| 33 | 9.21 | 4.55 | 1.19 | 7.92 | 72.2 | 1.34 | 140.27 |
| 34 | 18.32 | 11.61 | 2.91 | 8.07 | 52.23 | 1.02 | 216.06 |
| 35 | 5.69 | 6.88 | 1.18 | 2.78 | 72.12 | 1.39 | 144.18 |
| 36 | 11.21 | 11.92 | 3.31 | 10.29 | 60.65 | 1.69 | 156.22 |
PVV/GW: peak portal venous flow velocity per 100 g of the initial graft weight, PSV/GW: peak systolic velocity of the hepatic artery per 100 g of the initial graft weight, EDV/GW: end diastolic velocity of the hepatic artery per 100 g of the initial graft weight, HVV/GW: peak hepatic venous flow velocity per 100 g of the initial graft weight, GW/SLV: graft-to-standard liver volume ratio, GRWR: graft-to-recipient weight ratio.
Correlation Matrix between Explanatory Variables
| PVV/GW (cm/s/100 g) | PSV/GW (cm/s/100 g) | EDV/GW (cm/s/100 g) | HVV/GW (cm/s/100 g) | GW/SLV (%) | GRWR (%) | ||
|---|---|---|---|---|---|---|---|
| PVV/GW | Pearson’s correlation coefficient | 1 | 0.649[ | 0.591[ | 0.456[ | −0.459[ | −0.262 |
| Two-tailed P value | < 0.001 | < 0.001 | 0.005 | 0.005 | 0.122 | ||
| PSV/GW | Pearson’s correlation coefficient | 1 | 0.841[ | 0.610[ | −0.442[ | −0.217 | |
| Two-tailed P value | < 0.001 | < 0.001 | 0.007 | 0.203 | |||
| EDV/GW | Pearson’s correlation coefficient | 1 | 0.450[ | −0.504[ | −0.330[ | ||
| Two-tailed P value | 0.006 | 0.002 | 0.049 | ||||
| HVV/GW | Pearson’s correlation coefficient | 1 | −0.310 | −0.109 | |||
| Two-tailed P value | 0.066 | 0.528 | |||||
| GW/SLV | Pearson’s correlation coefficient | 1 | 0.886[ | ||||
| Two-tailed P value | < 0.001 | ||||||
| GRWR | Pearson’s correlation coefficient | 1 | |||||
| Two-tailed P value |
PVV/GW: peak portal venous flow velocity per 100 g of the initial graft weight, PSV/GW: peak systolic velocity of the hepatic artery per 100 g of the initial graft weight, EDV/GW: end diastolic velocity of the hepatic artery per 100 g of the initial graft weight, HVV/GW: peak hepatic venous flow velocity per 100 g of the initial graft weight, GW/SLV: graft-to-standard liver volume ratio, GRWR: graft-to-recipient weight ratio.
P < 0.05,
P < 0.01.
Regression Coefficients of Multiple Linear Regression Model for Six Explanatory Variables
| Unstandardized coefficients | Standard error | Standardized coefficients | t-statistic | P | 95% Confidence interval for the unstandardized coefficients | Collinearity statistics | |||
|---|---|---|---|---|---|---|---|---|---|
| Lower bound | Upper bound | Tolerance | inflation factor | ||||||
| Intercept | 232.797 | 29.542 | 7.880 | < 0.001 | 172.376 | 293.217 | |||
| PVV/GW | 0.221 | 0.642 | 0.050 | 0.344 | 0.733 | −1.093 | 1.534 | 0.525 | 1.905 |
| PSV/GW | −0.050[ | 1.026 | −0.011 | −0.048 | 0.962[ | −2.148 | 2.049 | 0.202 | 4.948[ |
| EDV/GW | 0.690 | 2.506 | 0.056 | 0.275 | 0.785 | −4.435 | 5.816 | 0.261 | 3.834 |
| HVV/GW | 4.083 | 1.411 | 0.396 | 2.893 | 0.007 | 1.197 | 6.970 | 0.585 | 1.709 |
| GW/SLV | −0.905 | 0.845 | −0.305 | −1.071 | 0.293 | −2.633 | 0.823 | 0.135 | 7.387[ |
| GRWR | −37.594[ | 32.665[ | −0.295 | −1.151 | 0.259[ | −104.400[ | 29.213[ | 0.166 | 6.011[ |
Refer to the main text for details.
PVV/GW: peak portal venous flow velocity per 100 g of the initial graft weight, PSV/GW: peak systolic velocity of the hepatic artery per 100 g of the initial graft weight, EDV/GW: end diastolic velocity of the hepatic artery per 100 g of the initial graft weight, HVV/GW: peak hepatic venous flow velocity per 100 g of the initial graft weight, GW/SLV: graft-to-standard liver volume ratio, GRWR: graft-to-recipient weight ratio.
Collinearity Diagnostics
| Eigenvalue | Condition Index | Variance decomposition proportions | ||||||
|---|---|---|---|---|---|---|---|---|
| Intercept | PVV/GW | PSV/GW | EDV/GW | HVV/GW | GW/SLV | GRWR | ||
| 6.164 | 1.000 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 0.555 | 3.332 | 0.00 | 0.00 | 0.02 | 0.07 | 0.00 | 0.00 | 0.00 |
| 0.119 | 7.209 | 0.00 | 0.09 | 0.00 | 0.25 | 0.45 | 0.00 | 0.00 |
| 0.099 | 7.883 | 0.00 | 0.74 | 0.02 | 0.02 | 0.24 | 0.00 | 0.00 |
| 0.043 | 11.938[ | 0.01 | 0.01 | 0.87[ | 0.55 | 0.22 | 0.00 | 0.00 |
| 0.017 | 18.975[ | 0.47 | 0.09 | 0.08 | 0.11 | 0.04 | 0.00 | 0.15 |
| 0.003 | 47.323[ | 0.51 | 0.06 | 0.02 | 0.00 | 0.04 | 0.99[ | 0.84[ |
Refer to the main text for details,
Refer to the main text for details.
PVV/GW: peak portal venous flow velocity per 100 g of the initial graft weight, PSV/GW: peak systolic velocity of the hepatic artery per 100 g of the initial graft weight, EDV/GW: end diastolic velocity of the hepatic artery per 100 g of the initial graft weight, HVV/GW: peak hepatic venous flow velocity per 100 g of the initial graft weight, GW/SLV: graft-to-standard liver volume ratio, GRWR: graft-to-recipient weight ratio.
Regression Coefficients of Multiple Linear Regression Model for Four Explanatory Variables Following the Exclusion of Two Variables
| Unstandardized coefficients | Standard error | Standardized coefficients | t-statistic | P | 95% Confidence interval for the unstandardized coefficients | Collinearity statistics | |||
|---|---|---|---|---|---|---|---|---|---|
| Lower bound | Upper bound | Tolerance | Variance inflation factor | ||||||
| Intercept | 209.393 | 19.653 | 10.655 | < 0.001 | 169.311 | 249.476 | |||
| PVV/GW | 0.392 | 0.593 | 0.088 | 0.661 | 0.514 | −0.817 | 1.601 | 0.599 | 1.670 |
| EDV/GW | 1.006 | 1.664 | 0.082 | 0.605 | 0.550 | −2.388 | 4.401 | 0.575 | 1.738 |
| HVV/GW | 4.410 | 1.239 | 0.428 | 3.559 | 0.001 | 1.882 | 6.937 | 0.738 | 1.355 |
| GRWR | −68.832 | 14.014[ | −0.541 | −4.912 | < 0.001[ | −97.413[ | −40.251[ | 0.879 | 1.137[ |
Refer to the main text for details.
PVV/GW: peak portal venous flow velocity per 100 g of the initial graft weight, EDV/GW: end diastolic velocity of the hepatic artery per 100 g of the initial graft weight, HVV/GW: peak hepatic venous flow velocity per 100 g of the initial graft weight, GRWR: graftto-recipient weight ratio.
Collinearity Diagnostics
| Eigenvalue | Condition Index | Variance decomposition proportions | ||||
|---|---|---|---|---|---|---|
| Intercept | PVV/GW | EDV/GW | HVV/GW | GRWR | ||
| 4.409 | 1.000 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 |
| 0.369 | 3.456 | 0.01 | 0.01 | 0.39 | 0.00 | 0.03 |
| 0.107 | 6.410 | 0.02 | 0.07 | 0.38 | 0.69 | 0.05 |
| 0.096 | 6.766 | 0.00 | 0.85 | 0.18 | 0.30 | 0.00 |
| 0.018 | 15.697[ | 0.97 | 0.06 | 0.03 | 0.01 | 0.91[ |
Refer to the main text for details,
Refer to the main text for details.
PVV/GW: peak portal venous flow velocity per 100 g of the initial graft weight, EDV/GW: end diastolic velocity of the hepatic artery per 100 g of the initial graft weight, HVV/GW: peak hepatic venous flow velocity per 100 g of the initial graft weight, GRWR: graft-to-recipient weight ratio.