| Literature DB >> 23173013 |
Mehri Khoshhali1, Hossein Mahjub, Massoud Saidijam, Jalal Poorolajal, Ali Reza Soltanian.
Abstract
The present study was conducted to predict survival time in patients with diffuse large B-cell lymphoma, DLBCL, based on microarray data using Cox regression model combined with seven dimension reduction methods. This historical cohort included 2042 gene expression measurements from 40 patients with DLBCL. In order to predict survival, a combination of Cox regression model was used with seven methods for dimension reduction or shrinkage including univariate selection, forward stepwise selection, principal component regression, supervised principal component regression, partial least squares regression, ridge regression and Losso. The capacity of predictions was examined by three different criteria including log rank test, prognostic index and deviance. MATLAB r2008a and RKWard software were used for data analysis. Based on our findings, performance of ridge regression was better than other methods. Based on ridge regression coefficients and a given cut point value, 16 genes were selected. By using forward stepwise selection method in Cox regression model, it was indicated that the expression of genes GENE3555X and GENE3807X decreased the survival time (P=0.008 and P=0.003, respectively), whereas the genes GENE3228X and GENE1551X increased survival time (P=0.002 and P<0.001, respectively). This study indicated that ridge regression method had higher capacity than other dimension reduction methods for the prediction of survival time in patients with DLBCL. Furthermore, a combination of statistical methods and microarray data could help to detect influential genes in survival.Entities:
Keywords: Lymphoma; dimension reduction; gene expression; microarray; ridge regression; survival analysis
Year: 2012 PMID: 23173013 PMCID: PMC3410377 DOI: 10.4172/1747-0862.1000051
Source DB: PubMed Journal: J Mol Genet Med ISSN: 1747-0862
Figure 1.Box plots of p-value resulting from three performance criteria for seven prediction methods. The horizontal lines represent the null model with no gene information included. The smaller the value of each criterion, the better the performance of prediction will be. Uni: Univariate selection, FS: Forward stepwise selection, PCR: Principal component regression, SPCR: Supervised PCR, PLS: Partial least square.
Influential genes on DLBCL survival based on the cut point value of 0.06 using ridge regression model
| Gene code | Gene name |
|---|---|
| OSE: (2’-5’) oligoadenylate synthetase E; Clone=276483 | |
| GENE2536X | BCL-2; Clone=342181 |
| GENE3831X | Lymphotoxin-Beta=Tumor necrosis factor C; Clone=712066 |
| LIgGFc: Low-affinity IgG Fc receptor II-B and C isoforms (multiple orthologous genes); Clone=292524 | |
| GENE3554X | Low-affinity IgG Fc receptor II-B and C isoforms (multiple orthologous genes); Clone=1233864 |
| GENE2387X | Unknown UG Hs.181297 ESTs; Clone=1336563 |
| JNK3: Stress-activated protein kinase; Clone=23173 | |
| GENE3317X | CD10=CALLA=Neprilysin=enkepalinase; Clone=701606 |
| GENE3318X | CD10=CALLA=Neprilysin=enkepalinase; Clone=1286850 |
| GENE3391X | CD21=B-lymphocyte CR2-receptor (for complement factor C3d and Epstein-Barr virus); Clone=824695 |
| GENE1190X | SLAM=signaling lymphocytic activation molecule; Clone=814251 |
| GENE1214X | Unknown UG Hs.89104 ESTs; Clone=713158 |
| GENE1161X | Unknown UG Hs.136858 EST; Clone=1317052 |
| GENE62X | p16-INK4a=Cyclin-dependent kinase 4 inhibitor A=Multiple tumor suppressor 1=MTS1; Clone=1174836 |
| GENE1819X | Unknown UG Hs.221250 ESTs; Clone=686150 |
| IL-2 receptor beta chain; Clone=1372713 |
Effective genes on DLBCL survival
Estimated parameters using Cox regression model
| Gene code | ( | SE | Wald | df | Exp( | 95% CI for Exp(βi) | ||
|---|---|---|---|---|---|---|---|---|
| Lower | Upper | |||||||
| GENE3807X | 1.024 | 0.340 | 9.086 | 1 | 0.003 | 2.785 | 1.431 | 5.421 |
| GENE3555X | 0.671 | 0.255 | 6.932 | 1 | 0.008 | 1.957 | 1.187 | 3.225 |
| GENE3228X | -1.298 | 0.418 | 9.623 | 1 | 0.002 | 0.273 | 0.120 | 0.620 |
| GENE1551X | -1.299 | 0.326 | 15.881 | 1 | 0.000 | 0.273 | 0.144 | 0.517 |
Standard error
Degree of freedom