| Literature DB >> 24982876 |
Omid Hamidi1, Lily Tapak2, Aarefeh Jafarzadeh Kohneloo3, Majid Sadeghifar4.
Abstract
Microarray technology results in high-dimensional and low-sample size data sets. Therefore, fitting sparse models is substantial because only a small number of influential genes can reliably be identified. A number of variable selection approaches have been proposed for high-dimensional time-to-event data based on Cox proportional hazards where censoring is present. The present study applied three sparse variable selection techniques of Lasso, smoothly clipped absolute deviation and the smooth integration of counting, and absolute deviation for gene expression survival time data using the additive risk model which is adopted when the absolute effects of multiple predictors on the hazard function are of interest. The performances of used techniques were evaluated by time dependent ROC curve and bootstrap .632+ prediction error curves. The selected genes by all methods were highly significant (P < 0.001). The Lasso showed maximum median of area under ROC curve over time (0.95) and smoothly clipped absolute deviation showed the lowest prediction error (0.105). It was observed that the selected genes by all methods improved the prediction of purely clinical model indicating the valuable information containing in the microarray features. So it was concluded that used approaches can satisfactorily predict survival based on selected gene expression measurements.Entities:
Mesh:
Year: 2014 PMID: 24982876 PMCID: PMC4055233 DOI: 10.1155/2014/393280
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Influential genes on OSCC patients' survival based on additive hazards model using Lasso, SCAD and SICA. Values are frequency of occurrences of the genes, means of coefficients (standard errors) over 100 replicates.
| Probeset ID | Lasso | SCAD | SICA | |||
|---|---|---|---|---|---|---|
| Frequency | Coefficient (SE) | Frequency | Coefficient (SE) | Frequency | Coefficient (SE) | |
| 7897663 | 100 | 0.141 (0.006) | ||||
| 7905589 | 100 | −0.122 (0.032) | 97 | −0.182 (0.086) | 88 | −0.324 (0.132) |
| 7908407 | 100 | −0.639 (0.054) | 98 | −1.239 (0.645) | 92 | −1.367 (0.460) |
| 7916489 | 99 | −0.012 (0.009) | 56 | −0.022 (0.021) | ||
| 7918825 | 99 | 0.112 (0.041) | ||||
| 7919157 | 81 | 0.081 (0.049) | ||||
| 7922793 | 99 | −0.069 (0.035) | 48 | −0.012 (0.017) | ||
| 7925161 | 1 | −0.002 (0.017) | ||||
| 7946565 | 44 | 0.009 (0.010) | ||||
| 7964627 | 91 | 0.102 (0.056) | 12 | 0.030 (0.084) | ||
| 7965467 | 99 | −0.100 (0.050) | 56 | −0.034 (0.051) | ||
| 7971191 | 91 | 0.037 (0.019) | 30 | 0.003 (0.007) | ||
| 7978754 | 100 | −0.136 (0.015) | 50 | −0.005 (0.008) | ||
| 7981968 | 100 | −0.086 (0.021) | ||||
| 7982129 | 99 | −0.012 (0.005) | 56 | −0.002 (0.003) | ||
| 8002247 | 91 | 0.100 (0.052) | 2 | 0.002 (0.017) | ||
| 8018097 | 100 | −0.140 (0.008) | 98 | −0.060 (0.018) | ||
| 8020844 | 36 | −0.011 (0.019) | 94 | −0.024 (0.012) | ||
| 8035398 | 75 | −0.018 (0.012) | ||||
| 8035829 | 64 | 0.018 (0.014) | ||||
| 8040338 | 91 | −0.016 (0.007) | ||||
| 8044733 | 91 | −0.039 (0.019) | ||||
| 8047690 | 56 | 0.023 (0.028) | ||||
| 8048595 | 91 | −0.055 (0.030) | 30 | −0.005 (0.013) | ||
| 8065392 | 91 | −0.168 (0.086) | 1 | −0.003 (0.029) | ||
| 8076511 | 49 | 0.011 (0.011) | ||||
| 8075691 | 26 | −0.034 (0.058) | ||||
| 8093764 | 56 | −0.018 (0.020) | ||||
| 8095441 | 56 | −0.006 (0.006) | ||||
| 8103368 | 100 | −0.011 (0.003) | 56 | −0.009 (0.013) | ||
| 8106814 | 75 | 0.040 (0.029) | 56 | 0.024 (0.033) | ||
| 8106919 | 81 | 0.070 (0.046) | 97 | −0.092 (0.028) | 1 | 0.002 (0.020) |
| 8109828 | 100 | −0.128 (0.003) | 47 | −0.068 (0.082) | ||
| 8110880 | 56 | 0.047 (0.041) | ||||
| 8112916 | 68 | 0.005 (0.009) | ||||
| 8120206 | 91 | 0.030 (0.015) | ||||
| 8123338 | 75 | −0.065 (0.045) | 1 | −0.002 (0.018) | ||
| 8126931 | 100 | 0.157 (0.025) | 84 | 0.022 (0.017) | 2 | 0.002 (0.014) |
| 8138531 | 90 | 0.873 (0.338) | ||||
| 8139808 | 99 | −0.059 (0.016) | ||||
| 8127993 | 30 | −0.008 (0.014) | ||||
| 8158952 | 84 | 0.900 (0.676) | ||||
| 8161169 | 100 | 0.281 (0.029) | 97 | 0.154 (0.041) | 70 | 0.144 (0.175) |
| 8174710 | 82 | −0.023 (0.013) | 24 | −0.007 (0.012) | ||
| 8174970 | 100 | 0.360 (0.030) | 94 | 0.060 (0.046) | 35 | 0.101 (0.140) |
| 8180388 | 100 | −0.403 (0.039) | 98 | −0.139 (0.077) | 12 | −0.072 (0.199) |
Figure 1Comparison of predictive performance (area under the ROC curve, over time) for the OPL patients.
Figure 2Model comparison using prediction error curves. Clinical model used age, histology, and podoplanin and deltaNp63 expression as predictors. The SICA, SCAD, and Lasso used selected microarray data as well as age, histology, and podoplanin and deltaNp63 expression as predictors.