| Literature DB >> 24312135 |
Abstract
Regression models are introduced into the receiver operating characteristic (ROC) analysis to accommodate effects of covariates, such as genes. If many covariates are available, the variable selection issue arises. The traditional induced methodology separately models outcomes of diseased and nondiseased groups; thus, separate application of variable selections to two models will bring barriers in interpretation, due to differences in selected models. Furthermore, in the ROC regression, the accuracy of area under the curve (AUC) should be the focus instead of aiming at the consistency of model selection or the good prediction performance. In this paper, we obtain one single objective function with the group SCAD to select grouped variables, which adapts to popular criteria of model selection, and propose a two-stage framework to apply the focused information criterion (FIC). Some asymptotic properties of the proposed methods are derived. Simulation studies show that the grouped variable selection is superior to separate model selections. Furthermore, the FIC improves the accuracy of the estimated AUC compared with other criteria.Entities:
Mesh:
Year: 2013 PMID: 24312135 PMCID: PMC3838845 DOI: 10.1155/2013/436493
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Model selection performance for group SCAD.
| Setting | Method | F-measure (%) |
|---|---|---|
| 1 | CV | 71.3 |
| GCV | 72.8 | |
| AIC | 70.9 | |
| BIC | 77.4 | |
|
| ||
| 2 | CV | 66.0 |
| GCV | 66.0 | |
| AIC | 66.2 | |
| BIC | 67.8 | |
Prediction of AUC at z 0 with group SCAD. Size means the number of selected factors, where each factor contains two variables.
| Setting | Methods |
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| MSE | MAE | Size | MSE | MAE | Size | MSE | MAE | Size | ||
| 1 | CV | 0.00345 | 0.0467 | 4.72 | 0.00295 | 0.0437 | 4.72 | 0.00114 | 0.0255 | 4.72 |
| GCV | 0.00345 | 0.0467 | 4.49 | 0.00294 | 0.0432 | 4.49 | 0.00114 | 0.0251 | 4.49 | |
| AIC | 0.00349 | 0.0468 | 4.90 | 0.00291 | 0.0428 | 4.90 | 0.00110 | 0.0246 | 4.90 | |
| BIC | 0.00335 | 0.0461 | 3.62 | 0.00317 | 0.0450 | 3.62 | 0.00147 | 0.0278 | 3.62 | |
| FIC | 0.00339 | 0.0464 | 4.23 | 0.00283 | 0.0426 | 4.23 | 0.00108 | 0.0247 | 4.23 | |
|
| ||||||||||
| 2 | CV | 0.00328 | 0.0461 | 8.31 | 0.00279 | 0.0428 | 8.31 | 0.00073 | 0.0209 | 8.31 |
| GCV | 0.00339 | 0.0470 | 9.43 | 0.00281 | 0.0433 | 9.43 | 0.00066 | 0.0206 | 9.43 | |
| AIC | 0.00344 | 0.0472 | 12.05 | 0.00285 | 0.0434 | 12.05 | 0.00064 | 0.0204 | 12.05 | |
| BIC | 0.00324 | 0.0458 | 6.14 | 0.00328 | 0.0462 | 6.14 | 0.00129 | 0.0259 | 6.14 | |
| FIC | 0.00327 | 0.0459 | 7.97 | 0.00290 | 0.0440 | 7.97 | 0.00081 | 0.0224 | 7.97 | |
|
| ||||||||||
| 3 | CV | 0.00369 | 0.0483 | 6.67 | 0.00439 | 0.0535 | 6.67 | 0.00199 | 0.0317 | 6.67 |
| GCV | 0.00367 | 0.0483 | 6.14 | 0.00436 | 0.0533 | 6.14 | 0.00197 | 0.0316 | 6.14 | |
| AIC | 0.00369 | 0.0484 | 6.36 | 0.00441 | 0.0534 | 6.36 | 0.00201 | 0.0318 | 6.36 | |
| BIC | 0.00367 | 0.0482 | 5.14 | 0.00473 | 0.0549 | 5.14 | 0.00247 | 0.0345 | 5.14 | |
| FIC | 0.00368 | 0.0483 | 5.46 | 0.00451 | 0.0532 | 5.49 | 0.00219 | 0.0324 | 5.49 | |
Prediction of AUC at z 0 with models on diseased and healthy groups separately. Size means the sum of numbers of selected variables in diseased and non-diseased groups.
| Setting | Methods | Size |
|
|
| |||
|---|---|---|---|---|---|---|---|---|
| MSE | MAE | MSE | MAE | MSE | MAE | |||
| 1 | CV | 8.78 | 0.00384 | 0.0490 | 0.00303 | 0.0439 | 0.00107 | 0.0247 |
| GCV | 8.23 | 0.00383 | 0.0488 | 0.00298 | 0.0432 | 0.00103 | 0.0239 | |
| AIC | 8.18 | 0.00383 | 0.0488 | 0.00397 | 0.0432 | 0.00102 | 0.0239 | |
| BIC | 6.96 | 0.00383 | 0.0490 | 0.00313 | 0.0447 | 0.00114 | 0.0254 | |
|
| ||||||||
| 2 | CV | 14.47 | 0.00483 | 0.0566 | 0.00351 | 0.0481 | 0.00079 | 0.0218 |
| GCV | 16.31 | 0.00553 | 0.0609 | 0.00390 | 0.0515 | 0.00069 | 0.0218 | |
| AIC | 15.46 | 0.00545 | 0.0606 | 0.00388 | 0.0516 | 0.00069 | 0.0218 | |
| BIC | 12.67 | 0.00514 | 0.0590 | 0.00384 | 0.0502 | 0.00092 | 0.0225 | |
|
| ||||||||
| 3 | CV | 12.29 | 0.00405 | 0.0494 | 0.00481 | 0.0558 | 0.00225 | 0.0332 |
| GCV | 10.55 | 0.00403 | 0.0497 | 0.00461 | 0.0541 | 0.00208 | 0.0320 | |
| AIC | 10.55 | 0.00402 | 0.0497 | 0.00461 | 0.0541 | 0.00209 | 0.0320 | |
| BIC | 9.53 | 0.00403 | 0.0499 | 0.00468 | 0.0550 | 0.00206 | 0.0325 | |
Estimated AUC at three test points. Size means the number of selected factors.
| Methods | Test point 1 | Test point 2 | Test point 3 | |||
|---|---|---|---|---|---|---|
| AUC | Size | AUC | Size | AUC | Size | |
| CV | 0.971 | 6 | 0.916 | 6 | 0.982 | 6 |
| AIC | 0.971 | 6 | 0.916 | 6 | 0.982 | 6 |
| GCV | 0.971 | 6 | 0.916 | 6 | 0.982 | 6 |
| BIC | 0.949 | 1 | 0.957 | 1 | 0.944 | 1 |
| FIC | 0.963 | 3 | 0.957 | 2 | 0.944 | 1 |