| Literature DB >> 35689190 |
Tasneem Fatima Alam1, M Shafiqur Rahman2, Wasimul Bari3.
Abstract
BACKGROUND: Separation or monotone likelihood may exist in fitting process of the accelerated failure time (AFT) model using maximum likelihood approach when sample size is small and/or rate of censoring is high (rare event) or there is at least one strong covariate in the model, resulting in infinite estimates of at least one regression coefficient.Entities:
Keywords: Bias reduction; Jeffreys prior; Log-location-scale family; Monotone likelihood
Mesh:
Year: 2022 PMID: 35689190 PMCID: PMC9188212 DOI: 10.1186/s12874-022-01638-1
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.612
Results of both MLE and Firth’s Penalized Likelihood Estimation for both β and β under Weibull Distribution. Each cell represents mean and standard deviation of estimates over number of valid cases (removing the simulations that were failed to achieve convergence) out of 1000 simulations. The maximum number of convergence failure for MLE is 60 when sample sizze is 50 and censoring rate 80%
| MLE | Firth | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Sample Size (n) | Cens.% | True Coefficients | Estimate | SE | Sim.SE | Width | Estimates | SE | Sim.SE | Width |
| (95% CI) | (95% CI) | |||||||||
| 30 | 20 | 1.198 | 0.155 | 0.171 | 0.606 | 1.190 | 0.142 | 0.170 | 0.557 | |
| 40 | 1.344 | 0.193 | 0.219 | 0.758 | 1.185 | 0.174 | 0.216 | 0.681 | ||
| 60 | 1.528 | 0.266 | 0.334 | 1.04 | 1.174 | 0.225 | 0.325 | 0.880 | ||
| 20 | 0.689 | 0.272 | 0.307 | 1.067 | 0.681 | 0.245 | 0.304 | 0.960 | ||
| 40 | 0.686 | 0.321 | 0.362 | 1.259 | 0.663 | 0.279 | 0.350 | 1.095 | ||
| 60 | 0.948 | 56.262 | 1.203 | 220.325 | 0.639 | 0.340 | 0.461 | 1.332 | ||
| 50 | 20 | 1.210 | 0.120 | 0.121 | 0.469 | 1.205 | 0.114 | 0.120 | 0.446 | |
| 50 | 1.222 | 0.170 | 0.180 | 0.665 | 1.202 | 0.157 | 0.176 | 0.616 | ||
| 80 | 1.781 | 0.311 | 7.080 | 1.218 | 1.190 | 0.268 | 0.640 | 1.049 | ||
| 20 | 0.695 | 0.212 | 0.224 | 0.832 | 0.690 | 0.199 | 0.223 | 0.780 | ||
| 50 | 0.704 | 0.276 | 0.291 | 1.082 | 0.684 | 0.250 | 0.284 | 0.979 | ||
| 80 | 3.026 | 124.560 | 26.619 | 486.810 | 0.983 | 0.378 | 8.814 | 1.482 | ||
| 100 | 20 | 1.200 | 0.083 | 0.083 | 0.325 | 1.197 | 0.081 | 0.082 | 0.317 | |
| 50 | 1.209 | 0.117 | 0.120 | 0.458 | 1.199 | 0.112 | 0.119 | 0.440 | ||
| 80 | 4.807 | 0.210 | 48.926 | 0.824 | 1.175 | 0.193 | 0.217 | 0.756 | ||
| 20 | 0.703 | 0.150 | 0.150 | 0.587 | 0.701 | 0.145 | 0.150 | 0.568 | ||
| 50 | 0.708 | 0.193 | 0.197 | 0.758 | 0.698 | 0.184 | 0.194 | 0.720 | ||
| 80 | 3.855 | 0.327 | 65.946 | 1.274 | 0.668 | 0.290 | 0.305 | 1.137 | ||
β= Coefficient of continuous covariate and β= Coefficient of binary covariate
Fig. 1Bias associated with the estimates of regression coefficients (β for continuous covariates and β for binary covariates) obtained from both MLE and Firth procedure for Weibull AFT model
Fig. 2MSE associated with the estimates of regression coefficients (β for continuous covariates and β for binary covariates) obtained from both MLE and Firth procedure for Weibull AFT model
Results of both β0 and b from Maximum Likelihood Estimation and Firth’s Penalized Likelihood Estimation under Weibull Distribution. Each cell represents mean and standard deviation of estimates from valid cases out of 1000 simulations. The maximum number of convergence failure for MLE is 60 when sample sizze is 50 and censoring rate 80%
| MLE | Firth | |||||||
|---|---|---|---|---|---|---|---|---|
| Sample Size (n) | Cens.% | True Coefficients | Estimates | SE | Sim.SE | Estimates | SE | Sim.SE |
| 30 | 20 | 2.989 | 0.187 | 0.213 | 2.989 | 0.129 | 0.212 | |
| 40 | 3.452 | 0.216 | 0.246 | 2.984 | 0.144 | 0.242 | ||
| 60 | 3.726 | 0.292 | 0.354 | 2.976 | 0.176 | 0.333 | ||
| 20 | 0.618 | 0.100 | 0.107 | 0.616 | 0.096 | 0.107 | ||
| 40 | 0.607 | 0.113 | 0.123 | 0.603 | 0.108 | 0.122 | ||
| 60 | 0.597 | 0.136 | 0.159 | 0.585 | 0.123 | 0.153 | ||
| 50 | 20 | 2.987 | 0.146 | 0.154 | 2.988 | 0.104 | 0.153 | |
| 50 | 2.987 | 0.189 | 0.197 | 2.983 | 0.127 | 0.194 | ||
| 80 | 5.893 | 6.859 | 22.755 | 3.649 | 0.207 | 11.591 | ||
| 20 | 0.642 | 0.079 | 0.083 | 0.641 | 0.077 | 0.083 | ||
| 50 | 0.635 | 0.097 | 0.103 | 0.631 | 0.093 | 0.102 | ||
| 80 | 0.598 | 0.146 | 0.177 | 0.584 | 0.169 | 0.348 | ||
| 100 | 20 | 2.988 | 0.104 | 0.107 | 2.989 | 0.075 | 0.107 | |
| 50 | 2.988 | 0.134 | 0.139 | 2.986 | 0.092 | 0.137 | ||
| 80 | 21.583 | 0.276 | 385.896 | 3.512 | 0.152 | 12.415 | ||
| 20 | 0.654 | 0.057 | 0.058 | 0.653 | 0.055 | 0.058 | ||
| 50 | 0.649 | 0.070 | 0.072 | 0.647 | 0.067 | 0.071 | ||
| 80 | 0.633 | 0.106 | 0.122 | 0.626 | 0.115 | 0.112 | ||
β0= Intercept and b= Scale parameter of the location-scale distribution
Estimates, standard error (SE) and simulation standard error (Sim.SE) of β0 and b from Maximum Likelihood Estimation and Firth’s Penalized Likelihood Estimation under Weibull Distribution in case of separation and near-to separation. Maximum convergence failure by MLE is 26.6% when separation occur over 1000 simulations in case of 80% censoring
| MLE | Firth | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Sample Size (n) | Cens.% | True Coefficients | Estimates | Bias | SE | Sim.SE | Estimates | Bias | SE | Sim.SE |
| Separation | ||||||||||
| 50 | 20 | 0.481 | -0.019 | 0.109 | 0.115 | 0.479 | -0.021 | 0.103 | 0.115 | |
| 50 | 0.587 | 0.087 | 0.151 | 0.195 | 0.575 | 0.075 | 0.138 | 0.184 | ||
| 80 | 0.568 | 0.068 | 0.259 | 0.831 | 0.492 | -0.008 | 0.223 | 0.397 | ||
| 20 | 1.943 | 0.043 | 0.211 | 0.197 | 1.928 | 0.028 | 0.198 | 0.195 | ||
| 50 | 14.187 | 12.287 | 6,559.531 | 1.566 | 3.071 | 1.171 | 0.879 | 0.175 | ||
| 80 | 104.371 | 102.471 | 7020.086 | 700.557 | 2.764 | 0.864 | 0.813 | 9.344 | ||
| Near-to-Separation | ||||||||||
| 50 | 20 | 0.503 | 0.003 | 0.111 | 0.117 | 0.500 | 0.000 | 0.105 | 0.117 | |
| 50 | 0.494 | -0.006 | 0.146 | 0.149 | 0.485 | -0.015 | 0.134 | 0.146 | ||
| 80 | 1.682 | 1.182 | 0.244 | 23.939 | 0.484 | -0.016 | 0.203 | 0.264 | ||
| 20 | 1.904 | 0.004 | 0.216 | 0.229 | 1.890 | -0.010 | 0.202 | 0.227 | ||
| 50 | 2.030 | 0.130 | 0.354 | 0.348 | 1.934 | 0.034 | 0.309 | 0.315 | ||
| 80 | 2.855 | 0.955 | 0.636 | 21.649 | 1.433 | -0.467 | 0.468 | 0.459 | ||
β= Coefficient of continuous covariate and β= Coefficient of binary covariate
Estimates of survival probabilities (mean over 500 simulations) at the 1, 2 and 3 quantile of survival times of Weibull distribution with different values of binary covariates and the mean value of continous covariate for sample size 50 and censoring 50%
| Binary covariate | Quartiles | True | MLE | Firth |
|---|---|---|---|---|
| 1st | 0.750 | 0.780 | 0.769 | |
| 2nd | 0.500 | 0.543 | 0.529 | |
| 3rd | 0.250 | 0.294 | 0.278 | |
| 1st | 0.750 | 0.757 | 0.751 | |
| 2nd | 0.500 | 0.498 | 0.499 | |
| 3rd | 0.250 | 0.247 | 0.249 |
Fig. 3Estimated mean survival probabilities over 500 simulations by both MLE and Firth procedures under Weibull distribution for sample size 30 with censoring percentage C=50%
Fig. 4MSE associated with the estimates of regression coefficients (β for continuous covariates and β for binary covariates) obtained from both MLE and Firth procedure for Log-logistic AFT model
Fig. 5MSE associated with the estimates of regression coefficients (β for continuous covariates and β for binary covariates) obtained from both MLE and Firth procedure for Log-normal AFT model
Contingency tables between dichotomous covariates (treatment) and response (prostate cancer status) showing separation and near-to-separation
| Separation | Near-to-Separation | |||
| Status | Status | |||
| Treatment | alive(0) | dead(1) | alive(0) | dead(1) |
| low-dose(0) | 12 | 8 | 9 | 6 |
| high - dose (1) | 10 | 0 | 12 | 3 |
| Status | Status | |||
| Age | alive(0) | dead(1) | alive(0) | dead(1) |
| ≤ 75 years(0) | 17 | 8 | 14 | 7 |
| 75-80g/ 100 ml(1) | 5 | 0 | 6 | 2 |
| ≥ 80g/ 100 ml(2) | −− | −− | 1 | 0 |
| Status | Status | |||
| Serum haemoglobin (HG) | alive(0) | dead(1) | alive(0) | dead(1) |
| ≥ 12g/ 100 ml(0) | 20 | 7 | 18 | 9 |
| 9-12g/ 100 ml(1) | 2 | 0 | 3 | 0 |
| < 9g/ 100 ml(2) | 0 | 1 | −− | −− |
Estimates of regression parameters and their standard error obtained from MLE and Firth’s procedure by fitting Weibull AFT model for prostate cancer data under separation and near-to-separation
| Separation | Near-to-Separation | |||||||
|---|---|---|---|---|---|---|---|---|
| MLE | Firth | MLE | Firth | |||||
| Predictors | Estimates | SE | Estimates | SE | Estimates | SE | Estimates | SE |
| Treatment | 11.842 | 9,561.118 | 1.129 | 0.409 | 1.146 | 0.600 | 0.846 | 0.271 |
| Age | 11.475 | 0.00 | 1.013 | 0.428 | 0.380 | 0.804 | 0.132 | 0.565 |
| WT | 0.309 | 0.502 | − 0.112 | 0.199 | − 1.316 | 0.597 | − 0.598 | 0.213 |
| PF | -0.895 | 0.888 | -0.981 | 0.388 | -1.138 | 1.417 | -0.854 | 0.720 |
| HX | 0.534 | 0.615 | 0.503 | 0.251 | 1.547 | 1.057 | 0.905 | 0.616 |
| HG | -1.141 | 0.731 | -1.063 | 0.313 | 13.101 | 7,825.785 | 1.587 | 0.684 |
| SZ | -0.537 | 0.778 | -0.054 | 0.259 | 0.506 | 0.918 | 0.389 | 0.591 |
| SG | -1.966 | 0.872 | -0.321 | 0.092 | -2.071 | 0.743 | -0.345 | 0.080 |
| Intercept | 5.218 | 0.728 | 7.671 | 0.181 | 5.924 | 1.006 | 8.171 | 0.252 |
| scale (b) | 0.518 | 0.182 | 0.398 | 0.096 | 0.694 | 0.219 | 0.608 | 0.145 |
WT = weight index, PF= performance rating, HX= history of cardiovascular disease, HG= serum haemoglobin, SZ= size of primary lesion, SG= Gleason stage/grade category
Contingency tables between dichotomous covariates (treatment) and response (respiratory disease and pulmonary embolus status)
| Respiratory disease | Pulmonary embolus | |||
| Status | Status | |||
| Treatment | alive(0) | dead(1) | alive(0) | dead(1) |
| low-dose(0) | 61 | 10 | 61 | 4 |
| high - dose (1) | 87 | 6 | 87 | 10 |
| Status | Status | |||
| Performance rating (PF) | alive(0) | dead(1) | alive(0) | dead(1) |
| normal(0) | 142 | 16 | 142 | 12 |
| limitation of activity(1) | 6 | 0 | 6 | 2 |
| Status | Status | |||
| Serum haemoglobin (HG) | alive(0) | dead(1) | alive(0) | dead(1) |
| ≥ 12g/ 100 ml(0) | 131 | 15 | 131 | 11 |
| 9-12g/ 100 ml(1) | 16 | 1 | 16 | 3 |
| < 9g/ 100 ml(2) | 1 | 0 | 1 | 0 |
| Status | Status | |||
| Size of primary lesion (SZ) | alive(0) | dead(1) | alive(0) | dead(1) |
| < 30 cm2(0) | 141 | 15 | 141 | 12 |
| ≥ 30 cm2(1) | 5 | 1 | 5 | 2 |
Estimates of regression parameters and their standard error obtained from MLE and Firth’s procedure by fitting Weibull AFT model for time-to-event data with outcome both respiratory disease and pulmonary embolus
| Respiratory disease | Pulmonary embolus | |||||||
|---|---|---|---|---|---|---|---|---|
| MLE | Firth | MLE | Firth | |||||
| Predictors | Estimates | SE | Estimates | SE | Estimates | SE | Estimates | SE |
| Treatment | 0.204 | 0.478 | 0.124 | 0.284 | -0.588 | 1.086 | -0.292 | 0.575 |
| Age | -1.131 | 0.419 | -0.755 | 0.243 | -1.481 | 0.900 | -0.913 | 0.471 |
| WT | -0.535 | 0.351 | -0.307 | 0.202 | 0.218 | 0.821 | 0.070 | 0.445 |
| PF | 14.193 | -0.259 | 0.750 | 26.677 | -0.676 | 1.518 | ||
| HX | 0.001 | 0.488 | − 0.037 | 0.289 | -0.646 | 1.121 | − 0.463 | 0.625 |
| HG | 14.733 | 6,113.595 | 0.857 | 0.751 | − 0.672 | 1.284 | − 0.585 | 0.672 |
| SZ | 14.605 | − 0.309 | 0.731 | − 3.777 | 1.697 | − 2.472 | 0.895 | |
| SG | 0.083 | 0.463 | 0.025 | 0.076 | -0.503 | 1.077 | − 0.029 | 0.146 |
| Intercept | 6.772 | 0.904 | 5.931 | 0.336 | 10.031 | 2.075 | 8.942 | 0.969 |
| scale (b) | 0.779 | 0.262 | 0.646 | 0.073 | 1.657 | 3.438 | 1.423 | 0.159 |
AG = age, WT = weight index, PF= performance rating, HX= history of cardiovascular disease, HG= serum haemoglobin, SZ= size of primary lesion, SG= Gleason stage/grade category