| Literature DB >> 27114989 |
Leili Tapak1, Hossein Mahjub2, Majid Sadeghifar3, Massoud Saidijam4, Jalal Poorolajal5.
Abstract
BACKGROUND: One substantial part of microarray studies is to predict patients' survival based on their gene expression profile. Variable selection techniques are powerful tools to handle high dimensionality in analysis of microarray data. However, these techniques have not been investigated in competing risks setting. This study aimed to investigate the performance of four sparse variable selection methods in estimating the survival time.Entities:
Keywords: Additive hazards model; Bladder cancer; Microarray data; Survival analysis; Variable selection
Year: 2016 PMID: 27114989 PMCID: PMC4841879
Source DB: PubMed Journal: Iran J Public Health ISSN: 2251-6085 Impact factor: 1.429
Selected microarray features by using four variable selection approaches for progression or death from bladder cancer event in Dyrskjøt data set. Values shown are frequency of selected genes, means of coefficients (standard errors) over 100 replicates
| SEQ1082 | 84 | 5.2(0.3) | 88 | 4.9(0.3) | 87 | 3.9(0.3) | - | - |
| SEQ1197 | 90 | 4.5(0.2) | 91 | 4.3(0.2) | 95 | 3.6(0.2) | - | - |
| SEQ1226 | 100 | −5.5(0.2) | 100 | −5.1(0.2) | 100 | −4.4(0.2) | 100 | −19(0.2) |
| SEQ1262 | 99 | −8.9(0.3) | 99 | −9.0(0.3) | 99 | −7.9(0.3) | 100 | −19(0.2) |
| SEQ1284 | - | - | - | - | - | - | 56 | 1.5(0.2) |
| SEQ1295 | - | - | - | - | - | - | 83 | 2.3(0.2) |
| SEQ1330 | 54 | 0.8(0.1) | 43 | 0.6(0.1) | 40 | 0.4(0.1) | - | - |
| SEQ1384 | 76 | −5.1(0.4) | 75 | −4.9(0.4) | 74 | −3.4(0.3) | 100 | −48(0.3) |
| SEQ162 | 98 | −1.6(0.1) | 96 | −1.6(0.1) | 98 | −1.5(0.1) | - | - |
| SEQ213 | 63 | 1.6(0.2) | 43 | 1.2(0.2) | 50 | 0.8(0.1) | 83 | 3.8(0.3) |
| SEQ240 | 32 | −0.1(0.3) | - | - | - | - | - | - |
| SEQ265 | 100 | 9.4(0.1) | 100 | 9.5(0.1) | 100 | 9.4(0.1) | 97 | 2.6(0.1) |
| SEQ279 | 84 | −5.4(0.3) | 88 | −5.2(0.3) | 87 | −4.1(0.3) | 83 | −1.7(0.1) |
| SEQ287 | 76 | 1.0(0.1) | 69 | 0.8(0.1) | 74 | 0.6(0.1) | - | - |
| SEQ34 | 100 | 23(0.3) | 100 | 24(0.3) | 100 | 23(0.3) | 100 | 31(0.1) |
| SEQ377 | 99 | 8.8(0.4) | 97 | 8.1(0.3) | 99 | 7.2(0.3) | 100 | 8.2(0.2) |
| SEQ408 | - | - | - | - | - | - | 13 | 0.4(0.1) |
| SEQ410 | - | - | - | - | - | - | 100 | 6.9(0.2) |
| SEQ542 | - | - | - | - | - | - | 56 | 0.9(0.1) |
| SEQ820 | 100 | 11(0.1) | 100 | 11(0.1) | 100 | 12(0.1) | - | - |
| SEQ833 | 100 | 7.1(0.2) | 100 | 7.1(0.2) | 100 | 6.6(0.2) | - | - |
| SEQ843 | - | - | - | - | - | - | 97 | −3.3(0.1) |
| SEQ940 | 90 | −5.2(0.3) | 91 | −5.1(0.2) | 95 | −4.2(0.2) | - | - |
| SEQ948 | - | - | - | - | - | - | 13 | −0.3(0.1) |
Coefficients and standard errors (SE) must be multiplied by 10−4
Results of various methods applied to the Bladder cancer microarray data
| Elastic net | 0.137±0.07 | 0.803±0.06 | 0.779±0.13 |
| Lasso | 0.153±0.09 | 0.741±0.11 | 0.693±0.19 |
| SCAD | 0.144±0.06 | 0.763±0.07 | 0.722±0.16 |
| SICA | 0.145±0.07 | 0.761±0.07 | 0.717±0.12 |
AUC is the area under ROC curve
Fig. 1:Bootstrap 0.632+ prediction error curve estimates for prediction of the conditional probability function from bladder cancer microarray data
Influential genes on bladder cancer patient’s survival based on additive hazards model from selected genes by elastic net
| SEQ1082 | NM_207521.1 | Homo sapiens reticulon 4 (RTN4) | 0.0025 (0.0009) | 0.005 |
| SEQ1197 | NM_003103.5 | Homo sapiens (human) SON DNA binding protein (SON) | 0.0034 (0.0014) | 0.016 |
| SEQ1262 | NM_000875.2 | Homo sapiens insulin-like growth factor 1 receptor (IGF1R), mRNA | 0.0029 (0.0014) | 0.039 |
| SEQ833 | NM_001255.1 | Homo sapiens CDC20 cell division cycle 20 homolog (S. cerevisiae) (CDC20), mRNA | 0.0018 (0007) | 0.009 |
| SEQ940 | NM_020159.1 | Homo sapiens SWI/SNF-related, matrix-associated actin-dependent regulator of chromatin, subfamily a, containing DEAD/H box 1 (SMARCAD1), transcript variant 3, mRNA | −0.0023 (0.0010) | 0.025 |