| Literature DB >> 26290569 |
Osval A Montesinos-López1, Abelardo Montesinos-López2, José Crossa3, Juan Burgueño4, Kent Eskridge5.
Abstract
Most genomic-enabled prediction models developed so far assume that the response variable is continuous and normally distributed. The exception is the probit model, developed for ordered categorical phenotypes. In statistical applications, because of the easy implementation of the Bayesian probit ordinal regression (BPOR) model, Bayesian logistic ordinal regression (BLOR) is implemented rarely in the context of genomic-enabled prediction [sample size (n) is much smaller than the number of parameters (p)]. For this reason, in this paper we propose a BLOR model using the Pólya-Gamma data augmentation approach that produces a Gibbs sampler with similar full conditional distributions of the BPOR model and with the advantage that the BPOR model is a particular case of the BLOR model. We evaluated the proposed model by using simulation and two real data sets. Results indicate that our BLOR model is a good alternative for analyzing ordinal data in the context of genomic-enabled prediction with the probit or logit link.Entities:
Keywords: Bayesian ordinal regression; GenPred; Gibbs sampler; genomic selection; logit; probit; shared data resource
Mesh:
Year: 2015 PMID: 26290569 PMCID: PMC4592994 DOI: 10.1534/g3.115.021154
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Simulated data set 1: Average values (Mean) and SD of MLEs and the Bayesian estimators, with four sample sizes (n)
|
| Parameter | True Value | BLOR | BLOR* | MLLOR | MLLOR* | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | Mean | SD | Mean | SD | |||
| −6 | 1.935 | −6.711 | 2.117 | −6.380 | 2.489 | −6.826 | 2.668 | |||
| −5 | 2.262 | −5.546 | 2.726 | −5.596 | 2.731 | −5.927 | 2.872 | |||
| 7 | 7.550 | 2.746 | 7.815 | 3.112 | 6.659 | 2.698 | 2.885 | |||
| 5 | −0.842 | 0.190 | −0.937 | 0.152 | −0.883 | 0.177 | −0.942 | 0.185 | ||
| −0.253 | 0.154 | −0.271 | 0.142 | −0.277 | 0.167 | −0.298 | 0.179 | |||
| 0.253 | 0.274 | 0.170 | 0.328 | 0.171 | 0.203 | 0.281 | 0.218 | |||
| 0.842 | 0.878 | 0.171 | 0.967 | 0.151 | 0.211 | 0.920 | 0.220 | |||
| −6 | −6.224 | 1.562 | −6.534 | 1.650 | 1.673 | −6.480 | 1.767 | |||
| −5 | 1.825 | −5.433 | 1.901 | −4.717 | 1.500 | −5.022 | 1.619 | |||
| 7 | 7.306 | 1.971 | 7.762 | 1.825 | 6.606 | 1.836 | 1.939 | |||
| 10 | −0.842 | 0.100 | −0.926 | 0.147 | −0.847 | 0.127 | −0.907 | 0.135 | ||
| −0.253 | 0.097 | −0.284 | 0.131 | −0.273 | 0.110 | −0.296 | 0.119 | |||
| 0.253 | 0.276 | 0.113 | 0.272 | 0.123 | 0.233 | 0.110 | 0.120 | |||
| 0.842 | 0.861 | 0.116 | 0.920 | 0.124 | 0.115 | 0.897 | 0.123 | |||
| −6 | −6.122 | 1.063 | −6.278 | 1.390 | 0.936 | −6.422 | 1.017 | |||
| −5 | 1.262 | −5.538 | 1.103 | −5.181 | 0.962 | −5.488 | 1.019 | |||
| 7 | 7.271 | 1.114 | 7.479 | 1.394 | 1.118 | 7.669 | 1.183 | |||
| 20 | −0.842 | −0.849 | 0.108 | −0.917 | 0.100 | 0.073 | −0.905 | 0.077 | ||
| −0.253 | 0.106 | −0.291 | 0.094 | −0.250 | 0.069 | −0.270 | 0.074 | |||
| 0.253 | 0.259 | 0.091 | 0.261 | 0.099 | 0.068 | 0.272 | 0.073 | |||
| 0.842 | 0.860 | 0.097 | 0.883 | 0.106 | 0.079 | 0.907 | 0.084 | |||
| −6 | 0.804 | −6.467 | 0.924 | −6.199 | 0.719 | −6.563 | 0.783 | |||
| −5 | −5.175 | 0.817 | −5.231 | 0.879 | −4.804 | 0.791 | 0.856 | |||
| 7 | 7.163 | 0.815 | 7.674 | 0.958 | 0.867 | 7.528 | 0.904 | |||
| 40 | −0.842 | −0.844 | 0.065 | −0.911 | 0.069 | 0.051 | −0.899 | 0.055 | ||
| −0.253 | 0.053 | −0.278 | 0.064 | −0.248 | 0.05 | −0.267 | 0.056 | |||
| 0.253 | 0.255 | 0.052 | 0.267 | 0.060 | 0.044 | 0.271 | 0.048 | |||
| 0.842 | 0.856 | 0.054 | 0.900 | 0.071 |
| 0.047 | 0.893 | 0.050 | ||
BLOR* and MLLOR* use the parameter estimates of BPOR and MLLOR and approximate BLOR and MLLOR with . The best model has the value for the parameter closer to the true value; these are presented in bold. MLEs, maximum likelihood estimators; BLOR, Bayesian logistic ordinal regression; MLLOR, maximum likelihood logistic ordinal regression.
Simulated data set 2: average values (Mean) and SD of MLEs and Bayesian estimators, with four POs
| PO | Parameter | True Value | BLOR | BLOR* | MLLOR | MLLOR* | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | Mean | SD | Mean | SD | |||
| −6 | 0.824 | −6.525 | 0.915 | −6.230 | 0.723 | −6.608 | 0.766 | |||
| −5 | −5.088 | 0.880 | −5.678 | 0.864 | 0.763 | −5.324 | 0.815 | |||
| 7 | 7.183 | 0.802 | 7.607 | 0.893 | 0.651 | 7.535 | 0.689 | |||
| 5 | −0.842 | −0.862 | 0.066 | −0.923 | 0.067 | 0.050 | −0.910 | 0.053 | ||
| −0.253 | −0.263 | 0.071 | −0.281 | 0.059 | 0.048 | −0.276 | 0.051 | |||
| 0.253 | 0.068 | 0.285 | 0.051 | 0.261 | 0.040 | 0.281 | 0.043 | |||
| 0.842 | 0.861 | 0.067 | 0.928 | 0.058 | 0.041 | 0.914 | 0.043 | |||
| −6 | 0.854 | −6.670 | 0.979 | 0.607 | −6.504 | 0.647 | ||||
| −5 | −5.359 | 0.899 | −5.544 | 1.094 | 0.610 | −5.459 | 0.639 | |||
| 7 | 7.349 | 0.830 | 7.739 | 0.856 | 0.645 | 7.536 | 0.682 | |||
| 10 | −0.842 | −0.883 | 0.063 | −0.925 | 0.076 | 0.048 | −0.924 | 0.050 | ||
| −0.253 | −0.266 | 0.070 | −0.267 | 0.072 | 0.052 | −0.285 | 0.056 | |||
| 0.253 | 0.071 | 0.313 | 0.065 | 0.264 | 0.051 | 0.284 | 0.055 | |||
| 0.842 | 0.868 | 0.068 | 0.964 | 0.082 | 0.052 | 0.918 | 0.055 | |||
| −6 | −6.529 | 0.730 | −6.828 | 0.903 | 0.645 | −6.709 | 0.689 | |||
| −5 | 0.759 | −5.722 | 0.877 | −5.275 | 0.629 | −5.589 | 0.671 | |||
| 7 | 7.525 | 0.784 | 7.860 | 0.758 | 0.735 | 7.780 | 0.772 | |||
| 20 | −0.842 | −0.915 | 0.065 | −0.972 | 0.060 | 0.049 | −0.942 | 0.053 | ||
| −0.253 | −0.275 | 0.059 | −0.295 | 0.057 | 0.044 | −0.280 | 0.048 | |||
| 0.253 | 0.279 | 0.063 | 0.297 | 0.059 | 0.047 | 0.291 | 0.051 | |||
| 0.842 | 0.922 | 0.060 | 0.977 | 0.060 | 0.053 | 0.954 | 0.057 | |||
| −6 | −6.794 | 0.803 | −7.011 | 1.013 | 0.654 | −6.916 | 0.691 | |||
| −5 | −5.652 | 0.754 | −5.827 | 0.754 | 0.634 | −5.590 | 0.666 | |||
| 7 | 8.065 | 0.894 | 8.351 | 0.881 | 0.752 | 7.925 | 0.798 | |||
| 30 | −0.842 | −0.972 | 0.071 | −1.004 | 0.075 | 0.041 | −0.956 | 0.044 | ||
| −0.253 | −0.301 | 0.060 | −0.296 | 0.072 | 0.044 | −0.289 | 0.047 | |||
| 0.253 | 0.067 | 0.318 | 0.069 | 0.286 | 0.044 | 0.306 | 0.047 | |||
| 0.842 | 0.944 | 0.068 | 1.023 | 0.060 | 0.044 | 0.981 | 0.046 | |||
The outliers were generated with a student's t distribution with four degrees of freedom. BLOR* and MLLOR* use the parameter estimates of BPOR and MLLOR and approximate BLOR and MLLOR with . The best model has the value for the parameter closer to the true value; these are presented in bold. MLEs, maximum likelihood estimators; POs, percentages of outliers; BLOR, Bayesian logistic ordinal regression; MLLOR, maximum likelihood logistic ordinal regression.
Real data sets: GLS and Septoria data sets
| Model | Set (DIC) | Statistic | Probability of Each Category | BS | ||||
|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | ||||
| Mean | 0.025 | 0.256 | 0.392 | 0.196 | 0.132 | 0.363 | ||
| Zimbabwe | L | 0.018 | 0.236 | 0.368 | 0.173 | 0.112 | 0.363 | |
| (4150.29) | U | 0.034 | 0.278 | 0.415 | 0.216 | 0.149 | 0.363 | |
| Mean | 0.054 | 0.442 | 0.267 | 0.169 | 0.069 | 0.350 | ||
| BLOR | México | L | 0.035 | 0.399 | 0.228 | 0.139 | 0.050 | 0.349 |
| (1313.82) | U | 0.075 | 0.480 | 0.308 | 0.200 | 0.092 | 0.352 | |
| Mean | 0.204 | 0.248 | 0.259 | 0.212 | 0.077 | 0.390 | ||
| Colombia | L | 0.179 | 0.219 | 0.232 | 0.187 | 0.058 | 0.389 | |
| (2577.92) | U | 0.233 | 0.279 | 0.287 | 0.239 | 0.095 | 0.391 | |
| Mean | 0.083 | 0.284 | 0.332 | 0.198 | 0.104 | 0.377 | ||
| Pooled | L | 0.069 | 0.269 | 0.315 | 0.184 | 0.093 | 0.377 | |
| (8327.36) | U | 0.093 | 0.299 | 0.350 | 0.211 | 0.119 | 0.377 | |
| Mean | 0.077 | 0.150 | 0.432 | 0.341 | − | 0.335 | ||
| Septoria | L | 0.049 | 0.112 | 0.376 | 0.294 | − | 0.333 | |
| (654.33) | U | 0.110 | 0.195 | 0.493 | 0.388 | − | 0.338 | |
| Mean | 0.032 | 0.237 | 0.416 | 0.190 | 0.124 | 0.363 | ||
| Zimbabwe | L | 0.025 | 0.219 | 0.389 | 0.171 | 0.110 | 0.363 | |
| (4156.86) | U | 0.039 | 0.258 | 0.443 | 0.211 | 0.140 | 0.365 | |
| Mean | 0.057 | 0.436 | 0.280 | 0.156 | 0.070 | 0.350 | ||
| BLOR* | México | L | 0.041 | 0.392 | 0.242 | 0.128 | 0.054 | 0.350 |
| (1315.21) | U | 0.075 | 0.479 | 0.323 | 0.187 | 0.089 | 0.353 | |
| Mean | 0.193 | 0.260 | 0.279 | 0.194 | 0.074 | 0.390 | ||
| Colombia | L | 0.168 | 0.226 | 0.248 | 0.168 | 0.060 | 0.389 | |
| (2581.66) | U | 0.220 | 0.292 | 0.310 | 0.223 | 0.089 | 0.392 | |
| Mean | 0.082 | 0.277 | 0.358 | 0.184 | 0.100 | 0.377 | ||
| Pooled | L | 0.074 | 0.259 | 0.341 | 0.168 | 0.090 | 0.377 | |
| (8339.54) | U | 0.091 | 0.294 | 0.375 | 0.200 | 0.109 | 0.378 | |
| Mean | 0.075 | 0.137 | 0.457 | 0.332 | − | 0.334 | ||
| Septoria | L | 0.051 | 0.098 | 0.392 | 0.268 | − | 0.330 | |
| (652.50) | U | 0.104 | 0.176 | 0.527 | 0.393 | − | 0.341 | |
| Mean | 0.025 | 0.253 | 0.392 | 0.199 | 0.132 | 0.363 | ||
| Zimbabwe | L | 0.017 | 0.233 | 0.366 | 0.179 | 0.113 | 0.363 | |
| (4150.18) | U | 0.033 | 0.274 | 0.416 | 0.219 | 0.148 | 0.363 | |
| Mean | 0.054 | 0.440 | 0.265 | 0.171 | 0.070 | 0.350 | ||
| BPOR | México | L | 0.036 | 0.399 | 0.229 | 0.142 | 0.051 | 0.350 |
| (1314.70) | U | 0.076 | 0.479 | 0.304 | 0.203 | 0.092 | 0.352 | |
| Mean | 0.206 | 0.249 | 0.261 | 0.209 | 0.075 | 0.390 | ||
| Colombia | L | 0.176 | 0.221 | 0.230 | 0.183 | 0.058 | 0.389 | |
| (2578.42) | U | 0.233 | 0.277 | 0.293 | 0.233 | 0.095 | 0.391 | |
| Mean | 0.071 | 0.256 | 0.331 | 0.218 | 0.123 | 0.377 | ||
| Pooled | L | 0.042 | 0.183 | 0.313 | 0.191 | 0.104 | 0.374 | |
| (8329.84) | U | 0.086 | 0.287 | 0.350 | 0.286 | 0.171 | 0.389 | |
| Mean | 0.075 | 0.150 | 0.430 | 0.345 | − | 0.334 | ||
| Septoria | L | 0.047 | 0.110 | 0.369 | 0.282 | − | 0.330 | |
| (651.79) | U | 0.109 | 0.191 | 0.500 | 0.402 | − | 0.339 | |
BLOR* use the parameter estimates of the BPOR and approximate the BLOR with . Point probability estimates, credible sets for each category, DIC, and BS for the threshold Bayesian ridge regression. L and U denote lower and upper confidence sets, respectively. GLS, Gray leaf spot; DIC, deviance information criterion; BS, Brier scores; BLOR, Bayesian logistic ordinal regression; BPOR, Bayesian probit ordinal regression.
GLS data set
| Model | Brier Scores | ||
|---|---|---|---|
| Mean | Min | Max | |
| BLOR | 0.373 | 0.365 | 0.381 |
| BLOR* | 0.374 | 0.364 | 0.382 |
| BPOR | 0.373 | 0.365 | 0.381 |
BLOR* uses the parameter estimates of BPOR and approximates BLOR with . Brier scores (mean, minimum and maximum; lower scores indicate better prediction) evaluated for validation samples from the pooled data. GLS, Gray leaf spot; BLOR, Bayesian logistic ordinal regression; BPOR, Bayesian probit ordinal regression.