| Literature DB >> 24681553 |
Sara Gustavsson1, Björn Fagerberg2, Gerd Sallsten3, Eva M Andersson4.
Abstract
We compared six methods for regression on log-normal heteroscedastic data with respect to the estimated associations with explanatory factors (bias and standard error) and the estimated expected outcome (bias and confidence interval). Method comparisons were based on results from a simulation study, and also the estimation of the association between abdominal adiposity and two biomarkers; C-Reactive Protein (CRP) (inflammation marker,) and Insulin Resistance (HOMA-IR) (marker of insulin resistance). Five of the methods provide unbiased estimates of the associations and the expected outcome; two of them provide confidence intervals with correct coverage.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24681553 PMCID: PMC4024993 DOI: 10.3390/ijerph110403521
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Estimates of the regression coefficients; expected value of the estimate, E[*], true standard deviation of the estimated coefficient, SD[*], and expected sample specific standard error, E[se(*)]. The true coefficient values are β0 = 1.564, β1 = 0.122, β2 = 0.075, σZ = 0.383. Results of the simulation study for sample size n = 108 (r = 10,000 replicates).
| LSlin | WLS | MLLN | GLMG | GLMN1 | LSexp 2 | ||
|---|---|---|---|---|---|---|---|
| Intercept | |||||||
| E[ | 1.566 | 1.560 | 1.563 | 1.565 | 1.567 | 0.487 | |
| SD[ | 0.226 | 0.190 | 0.183 | 0.187 | 0.180 | 0.083 | |
| E[ | 0.269 | 0.187 | 0.180 | 0.178 | 0.179 | 0.084 | |
| Parameter for X1 | |||||||
| E[ | 0.121 | 0.122 | 0.122 | 0.122 | 0.121 | 0.042 | |
| SD[ | 0.021 | 0.019 | 0.019 | 0.020 | 0.019 | 0.006 | |
| E[ | 0.021 | 0.019 | 0.018 | 0.018 | 0.018 | 0.006 | |
| Parameter for X2 | |||||||
| E[ | 0.075 | 0.075 | 0.075 | 0.075 | 0.075 | 0.027 | |
| SD[ | 0.024 | 0.021 | 0.021 | 0.021 | 0.02 | 0.008 | |
| E[ | 0.024 | 0.021 | 0.020 | 0.020 | 0.02 | 0.008 | |
| E[ | 1.229 | ||||||
| SD[ | 0.143 | ||||||
| Scale parameter | 7.330 | 0.377 | |||||
| SD[scale parameter] | 1.015 | 0.026 | |||||
| E[ | 0.379 | 0.376 | 0.358 3 | 0.377 | 0.384 | ||
| SD[ | 0.031 | 0.026 | - | 0.026 | 0.026 | ||
1 After transformation of the coefficients in eq (3): and ;
2 Coefficients estimated under assumption of a log-linear model;
3 After transformation: .
Estimated expected value and expected length of 95% confidence interval for , for a sample of n = 108 observations (results from simulation with r = 10,000 replicates).
| Expected value | E[ | E[length] | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| LSlin | WLS | MLLN | GLMG | GLMN | LSexp | LSlin | WLS | MLLN | GLMG | GLMN | LSexp |
| 1.714 | 1.72 | 1.71 | 1.71 | 1.72 | 1.72 | 1.85 | 0.927 | 0.631 | 0.609 | 0.594 | 0.6 | 0.544 |
| 2.164 | 2.17 | 2.16 | 2.16 | 2.17 | 2.17 | 2.17 | 0.733 | 0.533 | 0.518 | 0.501 | 0.507 | 0.506 |
| 2.614 | 2.62 | 2.61 | 2.61 | 2.62 | 2.62 | 2.55 | 0.927 | 0.825 | 0.797 | 0.774 | 0.783 | 0.749 |
| 2.568 | 2.57 | 2.57 | 2.57 | 2.57 | 2.57 | 2.49 | 0.733 | 0.605 | 0.588 | 0.567 | 0.574 | 0.58 |
| 3.018 | 3.02 | 3.02 | 3.02 | 3.02 | 3.02 | 2.91 | 0.464 | 0.467 | 0.462 | 0.437 | 0.443 | 0.439 |
| 3.468 | 3.47 | 3.47 | 3.47 | 3.47 | 3.47 | 3.42 | 0.733 | 0.763 | 0.743 | 0.715 | 0.723 | 0.798 |
| 3.422 | 3.42 | 3.42 | 3.42 | 3.42 | 3.42 | 3.34 | 0.927 | 0.950 | 0.920 | 0.89 | 0.9 | 0.982 |
| 3.872 | 3.87 | 3.87 | 3.87 | 3.87 | 3.87 | 3.92 | 0.733 | 0.850 | 0.827 | 0.796 | 0.804 | 0.914 |
| 4.322 | 4.32 | 4.32 | 4.32 | 4.32 | 4.32 | 4.60 | 0.927 | 1.026 | 0.997 | 0.962 | 0.972 | 1.351 |
True standard deviation and sample-specific standard error for the -values; SD[] = and se() = . Results from simulation with n = 108 observations, r = 10,000 replicates.
| Expected value | SD[ | E[ | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| LSlin | WLS | MLLN | GLMG | GLMN | LSexp | LSlin | WLS | MLLN | GLMG | GLMN | LSexp |
| 1.714 | 0.191 | 0.161 | 0.156 | 0.159 | 0.154 | 0.136 | 0.238 | 0.159 | 0.154 | 0.152 | 0.154 | - |
| 2.164 | 0.145 | 0.135 | 0.132 | 0.135 | 0.132 | 0.128 | 0.188 | 0.135 | 0.131 | 0.128 | 0.13 | - |
| 2.614 | 0.220 | 0.209 | 0.202 | 0.211 | 0.205 | 0.190 | 0.238 | 0.208 | 0.201 | 0.198 | 0.201 | - |
| 2.568 | 0.167 | 0.153 | 0.150 | 0.154 | 0.151 | 0.147 | 0.188 | 0.153 | 0.148 | 0.145 | 0.147 | - |
| 3.018 | 0.121 | 0.118 | 0.118 | 0.120 | 0.120 | 0.112 | 0.119 | 0.118 | 0.117 | 0.112 | 0.113 | - |
| 3.468 | 0.210 | 0.195 | 0.190 | 0.196 | 0.192 | 0.204 | 0.188 | 0.192 | 0.187 | 0.183 | 0.185 | - |
| 3.422 | 0.251 | 0.241 | 0.234 | 0.244 | 0.238 | 0.251 | 0.238 | 0.240 | 0.232 | 0.228 | 0.231 | - |
| 3.872 | 0.228 | 0.217 | 0.212 | 0.219 | 0.215 | 0.235 | 0.188 | 0.214 | 0.209 | 0.204 | 0.206 | - |
| 4.322 | 0.290 | 0.263 | 0.256 | 0.264 | 0.258 | 0.345 | 0.238 | 0.259 | 0.251 | 0.246 | 0.249 | - |
Actual coverage of the 95% confidence interval for μY based on the sample-specific standard error (results from simulation with n = 108 observations and r = 10,000 replicates).
| Expected value | Coverage 1 | |||||
|---|---|---|---|---|---|---|
|
| LSlin | WLS | MLLN | GLMG | GLMN | LSexp |
| 1.714 | 0.98 | 0.94 | 0.95 | 0.93 | 0.94 | 0.83 |
| 2.164 | 0.99 | 0.95 | 0.95 | 0.93 | 0.94 | 0.95 |
| 2.614 | 0.96 | 0.95 | 0.95 | 0.93 | 0.94 | 0.93 |
| 2.568 | 0.97 | 0.95 | 0.95 | 0.93 | 0.94 | 0.90 |
| 3.018 | 0.94 | 0.95 | 0.95 | 0.93 | 0.93 | 0.83 |
| 3.468 | 0.92 | 0.95 | 0.95 | 0.93 | 0.94 | 0.94 |
| 3.422 | 0.93 | 0.95 | 0.95 | 0.92 | 0.93 | 0.93 |
| 3.872 | 0.89 | 0.94 | 0.95 | 0.93 | 0.94 | 0.95 |
| 4.322 | 0.89 | 0.94 | 0.94 | 0.93 | 0.94 | 0.87 |
1 Proportion of replicates where 95% confidence interval covers true expected value μY.
Descriptive statistics for C-reactive protein (CRP), insulin resistance (HOMA-IR) and waist circumference.
|
|
|
|
| |||||||
|
| Mean | Median | SD | Mean | Median | SD | Mean | Median | SD | |
| NGT 1 | 185 | 2.107 | 1.184 | 2.550 | 1.141 | 0.960 | 0.647 | 88.295 | 88.50 | 8.948 |
| IGT 1 | 195 | 2.583 | 1.380 | 3.783 | 1.816 | 1.430 | 1.268 | 92.677 | 92.50 | 11.882 |
| DM 1 | 218 | 4.468 | 1.856 | 10.255 | 4.677 | 2.835 | 5.842 | 98.083 | 98.00 | 12.631 |
1 Results for women with normal glucose tolerance (NGT), impaired glucose tolerance (IGT) and diabetes mellitus (DM).
Figure 1The parameter estimates and 95% confidence intervals for the different regression methods, when estimating CRP as a function of waist circumference (WC) and HOMA-IR, using n = 598 observations.
Figure 2The parameter estimates and 95% confidence intervals for the different regression methods, when estimating HOMA-IR as a function of waist circumference (WC) and the interaction between WC and glucose tolerance group (normal glucose tolerance, impaired glucose tolerance and diabetes mellitus), using n = 598 observations.
The σZ-estimates and mean length of 95% confidence intervals for μY, for CRP and HOMA-IR, n = 598.
|
|
|
| |||
| Length CI (mean, SD) | Length CI (mean, SD) | ||||
| LSlin | - | 1.61 (0.89) | - | 1.10 (0.19) | |
| WLS | 1.22 | 1.51 (2.07) | 0.73 | 0.64 (0.35) | |
| MLLN | 1.04 | 0.82 (0.86) | 0.61 | 0.43 (0.19) | |
| GLMG | 0.71 (0.974 1) | 0.85 (1.26) | 0.33 (2.52 1) | 0.47 (0.26) | |
| GLMN | 1.04 | 0.43 (0.23) | 0.61 | 0.23 (0.06) | |
| LSexp | 1.04 | 1.19 (5.40) | 0.60 | 0.50 (0.45) | |
1 Estimated scale parameter