| Literature DB >> 27609023 |
Nurul Ainin Abdul Aziz1, Norfilza M Mokhtar2, Roslan Harun1, Md Manir Hossain Mollah1, Isa Mohamed Rose3, Ismail Sagap4, Azmi Mohd Tamil5, Wan Zurinah Wan Ngah1, Rahman Jamal6.
Abstract
BACKGROUND: Histopathological assessment has a low potential to predict clinical outcome in patients with the same stage of colorectal cancer. More specific and sensitive biomarkers to determine patients' survival are needed. We aimed to determine gene expression signatures as reliable prognostic marker that could predict survival of colorectal cancer patients with Dukes' B and C.Entities:
Keywords: Colorectal cancer; Microarray analysis; Real-time PCR; Survivalm
Mesh:
Year: 2016 PMID: 27609023 PMCID: PMC5016995 DOI: 10.1186/s12920-016-0218-1
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Clinical and pathological features
| Good survival | Poor survival | |||
|---|---|---|---|---|
| No (%) | No (%) |
| ||
| Dukes’ | B | 22 (56.41) | 15 (38.46) | 0.173 ** |
| C | 17 (43.59) | 24 (61.54) | ||
| Gender | Male | 20 (51.28) | 20 (51.28) | 1.000 ** |
| Female | 19 (48.72) | 19 (48.72) | ||
| Age (year) | ≤50 | 4 (10.26) | 5 (12.82) | 0.235 ** |
| >50 | 35 (89.74) | 34 (87.18) | ||
| Race | Chinese | 29 (74.36) | 24 (61.54) | 0.226 * |
| Malay | 9 (23.08) | 15 (38.46) | ||
| Indian | 1 (2.56) | 0 | ||
| Tumor differentiation | Well | 26 (66.67) | 15 (38.46) | 0.051 * |
| Moderately | 5 (12.82) | 15 (38.46) | ||
| Poorly | 1 (2.56) | 2 (5.13) | ||
| Mucinous | 2 (5.13) | 4 (10.26) | ||
| No record | 5 (12.82) | 3 (7.69) | ||
| Clinical outcome | Alive | 34 (87.18) | 0 | 0.000 ** |
| Dead | 5 (12.82) | 39 (100.00) | ||
| Organ | Colon | 25 (64.10) | 21 (53.85) | 0.357 ** |
| Rectum | 14 (35.90) | 18 (46.15) | ||
* = p value was calculated using Pearson Chi-Square
** = p value was calculated using Fisher’s Exact Test
[Relevant location: Page 13]
Fig. 1a Cancerous tissue section of patients Dukes' B well-differentiated adenocarcinoma. Hematoxylin (purple) stains chromatin in the nucleus and eosin (pink orangish) gives color to the protein that resides in the cytoplasm of muscle cells. Tumor cells appear to thicken and be seen spreading muscular propia but did not penetrate serous layer. b. Well differentiated adenocarcinoma Dukes’ C tissue section invaded into muscular propia and involved lymph nodes
Microarray-based changes in gene expression of the 19 genes
| Probe ID | Gene symbol | aFold change | Gene name | Expression in poor survival group (Up-regulated/Down-regulated) |
|---|---|---|---|---|
| 7000692 |
| -4.32 (-2.59) | mitochondrial ribosomal protein L52 | Down-regulated |
| 5700373 |
| -3.49 (-3.22) | thyroid hormone receptor interactor 13 | Down-regulated |
| 2690324 |
| 1.36 (1.23) | inositol 1,4,5-triphosphate receptor interacting protein | Up-regulated |
| 7000184 |
| -3.89 (-3.08) | solute carrier family 38, member 9 | Down-regulated |
| 5420070 |
| 3.65 (2.63) | FERM domain containing 6 | Up-regulated |
| 4230739 |
| 2.96 (3.58) | sortilin-related VPS10 domain containing receptor 2 | Up-regulated |
| 6040070 |
| 3.68 (2.59) | EGF, latrophilin and seven transmembrane domain containing 1 | Up-regulated |
| 1190176 |
| 3.37 (2.62) | Notch homolog 2 | Up-regulated |
| 2570196 |
| -2.62 (-2.18) | CPX chromosome region, candidate 1 | Down-regulated |
| 840367 |
| -2.20 (-1.93) | olfactory receptor, family 10, subfamily H, member 5 | Down-regulated |
| 3450575 |
| -3.06 (-1.75) | phosducin | Down-regulated |
| 2710564 |
| 2.22 (3.01) | dual oxidase 2 | Up-regulated |
| 4560474 |
| -2.91 (-1.63) | GDNF family receptor alpha 4 | Down-regulated |
| 5690064 |
| -2.69 (-2.45) | LAG1 homolog, ceramide synthase 6 | Down-regulated |
| 3780725 |
| -2.04 (-2.45) | oxysterol binding protein-like 9 | Down-regulated |
| 5090025 |
| -1.15 (-1.11) | chromosome 12 open reading frame 66 | Down-regulated |
| 5870121 |
| 3.64 (4.57) | spastic paraplegia 7 (pure and complicated autosomal recessive) | Up-regulated |
| 620398 |
| -3.53 (-2.92) | dual specificity phosphatase 21 | Down-regulated |
| 540411 |
| -3.88 (-3.07) | breast cancer 1, early onset | Down-regulated |
This table shows the probe ID, gene symbols and expression of the 19 genes in the poor survival group compared to the good survival group. aFold change: training set (test set)
[Relevant location: Page 13]
Fig. 2Hierarchical clustering of gene expression datasets. Hierarchical clustering of 78 CRC samples in training and test sets which graphically displays the intensity of the gene expression for each gene. Samples were clustered based on the 19 significant genes. The color of each square boxes represents the ratio of gene expression. Red boxes indicates up regulated genes while green boxes represents down regulated genes. The column represent individual tissue samples while rows represent individual of genes
Fig. 3Survival analysis. Kaplan–Meier survival analysis using six different microarray datasets (Training and test sets (Illumina-based), dataset from Denmark (Affymetrix), dataset from the USA (Affymetrix) and dataset from Australia (Affymetrix)). The 19-gene signature segregates patients into two risk groups (red, high risk; black, low risk). The p values correspond to the likelihood ratio test comparing the survival curves
Univariate and multivariate cox proportional hazard regression analyses
| Genes | Univariate | Multivariate | ||
|---|---|---|---|---|
| Hazard ratio (95 % CI) |
| Hazard ratio (95 % CI) |
| |
|
| 1.356 (1.154 – 1.592) | 0.000 *** | 1.56 (1.034 – 1.913) | 0.009 ** |
|
| 1.063 (0.919 – 1.23) | 0.408 | 0.852 (0.614 – 1.183) | 0.340 |
|
| 1.259 (1.092 – 1.452) | 0.001 ** | 1.066 (0.850 – 1.338) | 0.575 |
|
| 0.860 (0.728 – 1.016) | 0.005 ** | 0.871 (0.547 – 1.005) | 0.010 ** |
|
| 0.803 (0.671 – 0.962) | 0.017 ** | 0.818 (0.640 – 1.045) | 0.047 ** |
|
| 0.918 (0.811 – 1.041) | 0.183’ | 1.030 (0.766 – 1.387) | 0.840 |
|
| 1.159 (1.041 – 1.290) | 0.006 ** | 0.936 (0.752 – 1.166) | 0.558 |
|
| 0.812 (0.679 – 0.970) | 0.022 ** | 1.068 (0.796 – 1.432) | 0.659 |
|
| 0.902 (0.808 – 1.008) | 0.069 * | 1.007 (0.729 – 1.390) | 0.965 |
|
| 0.919 (0.845 – 1.000) | 0.050 * | 1.073 (0.894 – 1.289) | 0.443 |
|
| 0.846 (0.704 – 1.018) | 0.075 * | 1.078 (0.826 – 1.407) | 0.579 |
|
| 0.881 (0.802 – 0.968) | 0.008 ** | 0.985 (0.780 – 1.244) | 0.901 |
|
| 0.848 (0.767 – 0.939) | 0.001 ** | 0.865 (0.697 – 1.075) | 0.019 ** |
|
| 0.903 (0.807 – 1.011) | 0.077 * | 0.928 (0.697 – 1.235) | 0.609 |
|
| 0.836 (0.764 – 0.915) | 0.000 *** | 0.884 (0.713 – 1.097) | 0.264 |
|
| 1.155 (1.021 – 1.307) | 0.021 ** | 0.969 (0.772 – 1.216) | 0.787 |
|
| 1.141 (1.035 – 1.259) | 0.008 ** | 1.026 (0.840 – 1.255) | 0.797 |
|
| 0.867 (0.782 – 0.961) | 0.006 ** | 0.788 (0.631 – 0.984) | 0.035 ** |
|
| 1.076 (0.975 – 1.187) | 0.141’ | 1.039 (0.844 – 1.280) | 0.715 |
| Age (>60) | 0.154 (0.01953 – 1.225) | 0.045 ** | 0.562 (0.0354 – 2.392) | 0.049 ** |
| Gender | 1.048 (0.554 – 1.981) | 0.886 | 1.729 (0.704 – 4.245) | 0.231 |
| Stage | 1.644 (0.8567 – 3.156) | 0.135’ | 1.274 (0.494 – 3.286) | 0.615 |
This table shows the univariate and multivariate cox proportional hazard regression analyses of 19 gene signatures and other clinical variables associated with overall survival of CRC patients. [Relevant location: Page 14]
Comparison the LASSO and Ridge regression methods with Elastic Net regression
| Univariate | Multivariate | ||||
|---|---|---|---|---|---|
| HR (95 % CI) |
| HR (95 % CI) |
| ||
| Datasets | Methods | ||||
| Lasso | 0.106 (0.030 – 0.370) | 0.000 | 0.063 (0.016 – 0.252) | 0.000 | |
| Our dataset | Ridge | 0.812 (0.303 – 2.173) | 0.000 | 0.083 (0.022 – 0.321) | 0.000 |
| Elastic net | 0.065 (0.014 – 0.287) | 0.000 | 0.040 (0.008 – 0.198) | 0.000 | |
| Denmark dataset | Lasso | 0.055 (0.007 – 0.453) | 0.007 | 0.044 (0.005 – 0.396) | 0.005 |
| Ridge | 0.112 (0.024 – 0.519) | 0.005 | 0.080 (0.014 – 0.467) | 0.005 | |
| Elastic net | 0.057 (0.007 – 0.464) | 0.007 | 0.038 (0.004 – 0.389) | 0.005 | |
| Australian dataset | Lasso | 0.565 (0.396 – 0.805) | 0.002 | 0.549 (0.384 – 0.784) | 0.001 |
| Ridge | 0.447 (0.312 – 0.641) | 0.000 | 0.454 (0.316 – 0.651) | 0.000 | |
| Elastic net | 0.523 (0.370 – 0.739) | 0.000 | 0.529 (0.373 – 0.748) | 0.000 | |
| USA dataset | Lasso | 0.105 (0.010 – 1.068) | 0.056 | 0.104 (0.010 – 1.052) | 0.055 |
| Ridge | 0.130 (0.013 – 1.294) | 0.082 | 0.129 (0.013 – 1.283) | 0.081 | |
| Elastic net | 0.120 (0. 012 – 1.195) | 0.071 | 0. 122 (0. 012 – 1.214) | 0. 072 | |
| Norway dataset | Lasso | --- | --- | --- | --- |
| Ridge | --- | --- | --- | --- | |
| Elastic net | 0.536 (0.300 – 0.957) | 0.035 | 0.569 (0.318 – 1.018) | 0.057 | |
This table shows the comparison with the LASSO, Ridge regression and Elastic Net methods for 19 gene signatures based on our dataset and other external datasets from different countries. Univariate and multivariate Cox’s proportional hazard model analysis of prognostic factor (prognostic index or risk score) for overall survival
[Relevant location: Page 16]
Fig. 4Validation of detected genes using qPCR. The normalized gene expression ratio for six genes including FRMD6, ELTD1, ITPRIP, MRPL52, TRIP13 and SLC38A9 which was determined using qPCR (p < 0.05). (*) represents the significant genes