Literature DB >> 29328363

Support vector machine classifier for prediction of the metastasis of colorectal cancer.

Jiajun Zhi1, Jiwei Sun1, Zhongchuan Wang1, Wenjun Ding1.   

Abstract

Colorectal cancer (CRC) is one of the most common cancers and a major cause of mortality. The present study aimed to identify potential biomarkers for CRC metastasis and uncover the mechanisms underlying the etiology of the disease. The five datasets GSE68468, GSE62321, GSE22834, GSE14297 and GSE6988 were utilized in the study, all of which contained metastatic and non-metastatic CRC samples. Among them, three datasets were integrated via meta-analysis to identify the differentially expressed genes (DEGs) between the two types of samples. A protein-protein interaction (PPI) network was constructed for these DEGs. Candidate genes were then selected by the support vector machine (SVM) classifier based on the betweenness centrality (BC) algorithm. A CRC dataset from The Cancer Genome Atlas database was used to evaluate the accuracy of the SVM classifier. Pathway enrichment analysis was carried out for the SVM-classified gene signatures. In total, 358 DEGs were identified by meta‑analysis. The top ten nodes in the PPI network with the highest BC values were selected, including cAMP responsive element binding protein 1 (CREB1), cullin 7 (CUL7) and signal sequence receptor 3 (SSR3). The optimal SVM classification model was established, which was able to precisely distinguish between the metastatic and non-metastatic samples. Based on this SVM classifier, 40 signature genes were identified, which were mainly enriched in protein processing in endoplasmic reticulum (e.g., SSR3), AMPK signaling pathway (e.g., CREB1) and ubiquitin mediated proteolysis (e.g., FBXO2, CUL7 and UBE2D3) pathways. In conclusion, the SVM-classified genes, including CREB1, CUL7 and SSR3, precisely distinguished the metastatic CRC samples from the non-metastatic ones. These genes have the potential to be used as biomarkers for the prognosis of metastatic CRC.

Entities:  

Mesh:

Year:  2018        PMID: 29328363      PMCID: PMC5819940          DOI: 10.3892/ijmm.2018.3359

Source DB:  PubMed          Journal:  Int J Mol Med        ISSN: 1107-3756            Impact factor:   4.101


Introduction

Colorectal cancer (CRC) is one of the most lethal diseases worldwide, and it is estimated to account for >9% of all cases of cancer. The incidence of CRC varies according to geographical location, and the majority of cases occur in developed countries (1). Based on epidemiological data, it was estimated that 136,830 individuals would be diagnosed with CRC and 50,310 would succumb to the disease in the USA in 2014 (2). Genetic factors have been established as major regulators that affect CRC pathogenesis. Germline mutations of susceptibility genes, such as adenomatous polyposis coli, MutL homolog 1, MutL homolog 2 and the three loci recently identified near to the genes paired like homeodomain 1, cyclin D2 and hydroxyacid oxidase 1, are considered to be tightly associated with CRC risk (3). SRY-box containing gene 17 is a transcription factor (TF) that functions as an inhibitor in the Wnt pathway, and its abnormal expression caused by promoter hypermethylation may influence CRC development (4). The activation of nuclear factor-κB signaling and its regulated genes also serve important roles in the promotion of CRC progression (5). Metastasis is the most common cause of cancer-associated mortality, and accounts for ~90% of all cancer deaths (6). Patients with metastatic CRC have a poor 5-year survival rate of <10% (7). A number of studies have investigated the molecular mechanisms of metastatic CRC. For instance, the overexpression of AKT serine/threonine kinase 2 has been indicated to be a causative factor for CRC metastasis (8). Another study identified several metastasis-associated genes in CRC, which mainly participate in extracellular matrix interactions and cell signaling functions, and include integrin subunit β1, integrin subunit β5, collagen type Vα1 and secreted phosphoprotein 1 (9). A further study indicated that metastatic gene signatures, such as chemokine (C-X-C Motif) receptor 7, adenylate kinase 1 and early growth response 1 are able to predict the risk of recurrence and mortality in patients with CRC (10). Despite these profound findings, the etiology of CRC metastasis remains obscure. The support vector machine (SVM) classifier is a kernel algorithm that bases its analysis on data obtained only through dot-products. The SVM classifier is widely applied in bioinformatics due to its high accuracy, and has the ability to identify the multivariate statistical properties of data that distinguish between two different groups (11,12). Henneges et al (13) demonstrated that the SVM classifier, in combination with liquid chromatography ion trap mass spectrometry, is a promising tool for crucial gene predictions in non-invasive breast cancer. In addition, another study using SVM established a model that was able to discriminate normal samples from those of CRC patients; via this classification method, several biomarkers were predicted, including cadherin 3, claudin 1 and interleukin-8 (14). However, to the best of our knowledge, there have been no previous reports regarding the application of the SVM classifier to CRC metastasis. Therefore, the present study was performed using the SVM method to classify metastatic and non-metastatic CRC samples. Three datasets were integrated using meta-analysis and an additional dataset from The Cancer Genome Atlas (TCGA) database was utilized to validate the precision of the SVM classifier. Several bioinformatic methods were then carried out to reveal function and pathway information of the identified SVM-classified signature genes, on the basis of which a comprehensive evaluation of the metastatic mechanisms in CRC was conducted and novel biomarkers identified.

Materials and methods

Data resources and pretreatment

The Gene Expression omnibus (GEO; http://www.ncbi.nlm.nih.gov/geo) database was searched for all eligible open public datasets with the key search terms of 'colon cancer' and 'homo sapiens'. Datasets that satisfied the following criteria were included in the study: i) The data comprised gene expression profiles; ii) the data were associated with CRC and metastasis; iii) information on samples from patients with CRC and controls was elaborated. Based on these selection criteria, five microarray datasets, GSE68468 (15), GSE62321 (16), GSE22834 (17), GSE14297 (18) and GSE6988 (19) were included in the present study. Among these datasets, GSE68468 and GSE62321 were from the same platform, Affymetrix HG-U133 arrays (Thermo Fisher Scientific, Inc., Waltham, MA, USA). GSE68468 consisted of 240 CRC samples, of which 47 were metastatic and 193 were non-metastatic. GSE62321 comprised a total of 39 CRC samples, including 19 metastatic and 20 non-metastatic samples. For these two datasets, raw data in the CEL format was downloaded from the GEO database, followed by background correction and normalization using the Microarray Suite and quantiles, respectively (20,21). The median method was used for the supplementation of missing values. These pretreatments were performed using the Affy package in R version 1.42.3 (http://www.bioconductor.org/packages/release/bioc/html/affy.html). Regarding the remaining three datasets, GSE22834 was obtained from the Stanford Microarray Database print platform (Stanford University, Stanford, CA, USA), and consisted of 63 CRC samples (32 metastatic and 31 non-metastatic); GSE14297 was derived from the Illumina human-6 v2.0 expression beadchip (extended) (Illumina, Inc., San Diego, CA, USA), and included 36 CRC samples (18 metastatic and 18 non-metastatic); and GSE6988 was from the human 17K cDNA-GeneTrack platform (Genomic Tree, Inc., Daegeon, Korea), and comprised 53 CRC samples (33 metastatic and 20 non-metastatic). For these three datasets, raw data in the txt format was downloaded in the respective platform. In each annotation platform, the probe identification number was transformed into gene expression symbols. Probes that had a vacancy were deleted, and multiple probes that corresponded to a single gene were averaged to obtain the gene expression value. The Linear Models for Microarray Analysis (limma; http://www.biocon-ductor.org/packages/release/bioc/html/limma.html) package version 3.22.1 was then used to normalize the data (22).

Selection of differentially expressed genes (DEGs) using meta-analysis

To eliminate the bias from different platforms, the MetaQC package version 0.1.13 was utilized to perform quality control of the different datasets, in combination with principal component analysis and standardized mean rank (23). The standards in MetaQC included: i) Internal quality control, which was used to determine the structural homogeneity of gene expression values among different datasets; ii) external quality control, which was used for the consistency testing of gene expression in a pathway database; iii) accuracy quality control, which was used to determine the accuracy of a differentially DEG or recognition of a pathway; iv) consistency quality control of a DEG and pathway. Following quality control, MetaDE.ES in the MetaDE package [(23) https://cran.r-project.org/web/packages/MetaQC/index.html] was utilized to identify DEGs in the integrated dataset. First, the heterogeneity of the expression of each gene in different platforms was detected based on parameters including τ2, the Q value and Qpval (τ2=0 indicates homogeneity and a lack of bias; a Q statistic obeying the χ2 test with a freedom of K-1 and Qpval >0.05 indicate homogeneity and a lack of bias). DEGs between the different groups in this integrated dataset were then selected, and the P-value and false discovery rate (FDR) were obtained. FDR <0.05 indicated a significant difference. Thresholds for DEGs among different groups in the present study were τ2=0, Qpval >0.05 and FDR <0.05. Thereafter, these DEGs were subjected to bi-directional hierarchical clustering analysis using the pheatmap R package version 1.0.2 (http://cran.r-project.org/web/packages/pheatmap/index.html).

Construction of a protein-protein interaction (PPI) network and its topological properties

Information in the human protein reference database (HPRD; http://www.hprd.org/) was integrated with that in the Biological General Repository for Interaction Datasets (BioGRID; http://www.thebiogrid.org) (24,25) to construct a PPI network for the identified DEGs. The network was visualized using Cytoscape software version 3.6.0 (http://cytoscape.org/). The betweenness centrality (BC or CB) algorithm was used to reflect the topological property of each gene in this network and to optimize candidate genes (26). The BC value of each DEG was calculated based on the following formula: In the formula, v, s and t denote three nodes (protein production of DEGs) in the PPI network, σst is the number of shortest paths from 's' to 't', and σst (v) reflects the number of σst that pass the node 'v'. The BC value varies from 0–1, and the greatest value indicates the highest centrality of a node in the PPI network.

Training of the optimal SVM classification model and performance evaluation

The DEGs were sorted in descending order based on their BC values, and for those ranked at 10 to 100, the dataset that conformed to the quality control and had the largest sample number was set as the training dataset to perform training of the optimal SVM classification model, until it could absolutely distinguish one sample from another (27). DEGs obtained by this SVM classifier were then further investigated using bi-directional hierarchical clustering analysis, with visualization using the aforementioned heatmap software. Afterwards, the remaining datasets were taken as the validation datasets to evaluate the accuracy of the optimal SVM classifier.

Validation via an individual dataset

A CRC dataset that was downloaded from the TCGA database (https://cancergenome.nih.gov/), with the accession number TCGA_COAD_G4502A_07_3-2015-02-24 (level 3), was used for the validation. The dataset included a total of 193 specimens, and 90 of them had available clinical information, including 14 and 76 cases with and without the appearance of additional tumors, respectively.

Enrichment analysis

Kyoto Encyclopedia of Genes and Genomes (KEGG; http://www.genome.jp/kegg/pathway.html) pathway enrichment analysis was carried out for these DEGs to identify their potential pathways, using Fisher's exact test based on the following formula: In the formula, N represents total gene counts in the whole genome, M indicates gene counts in the pathways, K denotes DGE counts, and p represents the probability of ≥x of the K DEGs being enriched in the pathway.

Results

DEGs detected by meta-analysis

Quality control conducted using MetaQC indicated that the GSE22834 and GSE6988 datasets had relatively low quality, compared with the others (Table I). In addition, GSE22834 markedly deviated from the other four datasets, and GSE6988 also exhibited evident bias (Fig. 1). For these reasons, these two datasets were excluded. The remaining three datasets were selected for data integration via meta-analysis. The parameters pval, FDR, τ2, Qpval and Qval were calculated using MetaDE. Based on the aforementioned selection criteria, a total of 358 DEGs were identified by integrating the three datasets, and the top 10 DEGs are listed in Table II. A heat map of the gene expression of the 358 genes is presented in Fig. 2.
Table I

Quality control results of the five datasets.

DatasetIQCEQCCQCgCQCpAQCgAQCpSMR
GSE684685.193.2869.15103.5927.4656.312.13
GSE623213.763.1556.7148.6633.7847.613.59
GSE228340.210.670.010.270.831.9813.87
GSE142977.654.321.9259.6221.192.396.02
GSE69880.031.190.860.531.731.968.62

IQC, internal quality control; EQC, external quality control; CQCg, consistency quality control of gene; CQCp, consistency quality control of pathway; AQCg, accuracy quality control of gene; AQCp, accuracy quality control of pathway; SMR, standardized mean rank.

Figure 1

Quality control of the five datasets via MetaQC. Numbers 1–5 denote the five datasets. IQC, internal quality control; EQC, external quality control; CQCg, consistency quality control of gene; CQCp, consistency quality control of pathway; AQCg, accuracy quality control of gene; AQCp, accuracy quality control of pathway.

Table II

Top 10 differentially expressed genes identified via meta-analysis of the three integrated datasets.

GeneP-valueFDRQQpτ2
MCF2L1.00×10−203.45×10−181.71040.42520
TCF211.00×10−203.45×10−180.94100.62470
FGD61.00×10−203.45×10−180.93750.62580
MED281.00×10−203.45×10−180.74980.68740
PRDM11.00×10−203.45×10−180.73720.69170
TMED101.00×10−203.45×10−180.69720.70570
F51.00×10−203.45×10−180.43270.80540
NUMA11.00×10−203.45×10−180.27510.87150
ELOVL63.62×10−67.69×10−41.99480.36880
DLD3.62×10−67.69×10−41.80350.40590

FDR, false discovery rate; MCF2L, MCF.2 cell line derived transforming sequence like; TCF21, transcription factor 21; FGD6, FYVE, RhoGEF and PH domain containing 6; MED28, mediator complex subunit 28; PRDM1, PR/SET domain 1; TMED10, transmembrane p24 trafficking protein 10; F5, coagulation factor 5; NUMA1, nuclear mitotic apparatus protein 1; ELOVL6, ELOVL fatty acid elongase 6; DLD, dihydrolipoamide dehydrogenase.

Figure 2

Heat map of the gene expression of the 358 differentially expressed genes in metastatic and non-metastatic colon cancer samples. Red indicates high expression and green indicates low expression, yellow represents metastatic samples and blue represents non-metastatic samples.

PPI network of the DEGs

By integrating protein information in the HPRD database with that in BioGRID, interactions among the 358 DEGs were extracted to form a PPI network, which contained 162 nodes and 193 interactions (Fig. 3).
Figure 3

Protein-protein interaction network of the differentially expressed genes. Orange indicates upregulated genes and blue represents downregulated genes in metastatic compared with non-metastatic colon cancer samples. Lines between two nodes represent interactions between them.

DEGs optimized by BC of the network

Based on the BC algorithm, the BC value of each node was obtained, and the top 10 nodes were: BCL6 corepressor; coatomer protein complex subunit β 2; cAMP responsive element binding protein 1 (CREB1); myosin heavy chain 11; family with sequence similarity 3 member C; InaD-like (also known as PATJ, crumbs cell polarity complex component); RAB32, member RAS oncogene family; translocation of outer mitochondrial membrane 22; cullin 7 (CUL7); and signal sequence receptor 3 (SSR3). Detailed information is listed in Table III.
Table III

Top 10 differentially expressed genes ranked by their betweenness centrality value.

GeneBCExpDegreeP-valueFDRQQpτ2
BCOR1121.41×10−20.13370.11980.94180
COPB21026.28×10−30.08450.82270.66270
CREB11042.44×10−20.18120.65220.72170
MYH111027.17×10−40.02360.86180.64990
FAM3C0.7033.82×10−20.22790.07200.96460
INADL0.6667123.26×10−50.00301.69940.42750
RAB320.6667033.02×10−20.19900.43740.80360
TOMM220.6022.54×10−50.00281.69780.42790
CUL70.45951166.92×10−40.02341.03300.59660
SSR30.4021.04×10−30.02911.50030.47230

BC, betweenness centrality; FDR, false discovery rate; BCOR, BCL6 corepressor; COPB2, coatomer protein complex subunit β 2; CREB1, cAMP responsive element binding protein 1; MYH11, myosin heavy chain 11; FAM3C, family with sequence similarity 3 member C; INADL, InaD-like; RAB32, RAB32, member RAS oncogene family; TOMM22, translocation of outer mitochondrial membrane 22; CUL7, culin 7; SSR3, signal sequence receptor 3.

Optimal SVM classification model and performance evaluation results

GSE68468, which includes 47 metastatic and 193 non-metastatic CRC samples, was used as the training dataset to perform SVM classification training, until the SVM classification was able to completely distinguish the two types of sample. In the training process, as the number of DEGs was increased from the top 10 to the top 100, the precision of the SVM classification increased from 98 to 100%. Notably, the precision remained at 100% as the number of DEGs increased from the top 40 to the top 100. Therefore, the DEGs whose BC value ranked within the top 40 were selected to build the SVM classification model with a strong ability to distinguish metastatic samples from non-metastatic ones. The scattergram is shown in Fig. 4.
Figure 4

SVM classification and the performance evaluation result. (A) Accurate and error ratios of different training SVM classifications based on different signature genes. Red denotes error ratio and blue represents accurate ratio. (B) Scattergram based on SVM classification on different kinds of samples. Orange represents non-metastatic samples and blue represents metastatic samples. SVM, support vector machine.

To determine whether the SVM classification constructed using the top 40 genes was repeatable, two other datasets, GSE62321 and GSE14297 were used to test the precision of the classification. As shown in Fig. 5, this SVM classification was clearly able to distinguish between metastatic and non-metastatic samples in these two datasets.
Figure 5

Scattergram based on support vector machine classification of different samples in two datasets. (A) GSE62321 and (B) GSE14297 datasets. Orange represents non-metastatic samples and blue represents metastatic samples.

Validation results

The CRC dataset downloaded from TCGA was set as the individual validation dataset, which was used to testify the performance evaluation of the SVM classification. As a result, 13 metastatic and 76 non-metastatic samples were correctly identified. Only 1 sample was wrongly classified and the area under the curve was 0.997 (Fig. 6).
Figure 6

Receiver operating characteristic curve of support vector machine classification on individual validation datasets. AUC, area under the curve.

Pathways of the 40 DEGs

On the basis of Fisher's exact test, five pathways were identified for these 40 DEGs (Table IV), namely protein processing in endoplasmic reticulum (ER) [e.g., F-box protein 2 (FBXO2), DnaJ heat shock protein family (Hsp40) member C10 (DNAJC10) and SSR3], AMP-activated protein kinase (AMPK) signaling pathway [e.g., protein kinase AMP-activated non-catalytic subunit β 2 (PRKAB2), phosphofructokinase, platelet (PFKP) and CREB1], dorso-ventral axis formation [e.g., mitogen-activated protein kinase 1 (MAPK1) and notch 1 (NOTCH1)], ubiquitin mediated proteolysis [e.g., FBXO2, CUL7 and ubiquitin conjugating enzyme E2 D3 (UBE2D3)] and prion diseases (e.g., MAPK1 and NOTCH1).
Table IV

Pathway enrichment results of the crucial 40 genes.

TermIDCountP-valueGenes
Protein processing in ERhsa0414150.0089FBXO2, DNAJC10, SSR3, CUL1, UBE2D3
AMPK signaling pathwayhsa0415240.0144PRKAB2, PFKP, PRKAA1, CREB1
Dorso-ventral axis formationhsa0432020.0188MAPK1, NOTCH1
Ubiquitin mediated proteolysishsa0412040.0199FBXO2, CUL1, CUL7, UBE2D3
Prion diseaseshsa0502020.0313MAPK1, NOTCH1

ER, endoplasmic reticulum; AMPK, AMP-activated protein kinase; FBXO2, F-box protein 2; DNAJC10, DnaJ heat shock protein family (Hsp40) member C10; SSR3, signal sequence receptor 3; CUL1, cullin 1; UBE2D3, ubiquitin conjugating enzyme E2 D3; PRKAB2, protein kinase AMP-activated non-catalytic subunit β 2; PFKP, phosphofructokinase, platelet; PRKAA1, protein kinase AMP-activated catalytic subunit α 1; CREB1, cAMP responsive element binding protein 1; MAPK1, mitogen-activated protein kinase 1; NOTCH1, notch 1; CUL7, cullin 7:

Discussion

The present study identified 40 SVM-classified signature genes in metastatic CRC, including CREB1, CUL7 and SSR3, which were significantly enriched in protein processing in ER, AMPK signaling pathway and ubiquitin mediated proteolysis functions. The precision of the SVM-classified 40 gene signatures was as high as 100%, and the validation using a dataset from TCGA indicated that the majority of the metastatic and non-metastatic samples could be clearly distinguished from each other using these 40 genes. CREB1 is a TF that belongs to the leucine zipper family. The CREB1 gene is reported to increase the proliferation of CRC cells, while the knockdown of CREB1 inhibits this process (28). In addition, multiple microRNAs (miRs) function as tumor suppressors in CRC development through targeting this gene, including miR-9, miR-34b and miR-200b (29). A soluble resistance-related calcium binding protein, sorcin, has been demonstrated to increase the metastasis of CRC (30). Notably, the overexpression of sorcin activates the CREB pathway by increasing the phosphorylation of CREB1 (30), which implicates the expression of CREB1 in CRC metastasis, as predicted in the present study. AMPK is a heterotrimeric protein kinase that serves as a metabolic master switch. AMPK induces apoptosis in the development of CRC, and resveratrol is reported to exert therapeutic effects via inhibition of the AMPK signaling pathway (31). In the present study, CREB1 was significantly enriched in the AMPK signaling pathway, suggesting that the alteration of this gene may affect the AMPK signaling pathway, whereby it may contribute to the metastasis of CRC. Based on this finding, it may be inferred that the CREB1-mediated AMPK signaling pathway has the potential to serve as a therapeutic marker for the diagnosis of CRC metastasis. As a major component of the ubiquitin proteasome system, E3 ubiquitin ligases serve an important function in orchestrating the substrate ubiquitination in the cullin, Skp and F-box-containing complex (32). Disruption of their roles is the primary cause of the occurrence of various types of cancer (33). The CUL7 protein is a complex of the E3 ubiquitin-protein ligase that also comprises the S-phase kinase-associated protein 1, F-box/WD repeat-containing protein 8 (FBXW8) and E3 ubiquitin-protein ligase RBX1 proteins. Reportedly, the CUL7/FBXW8 complex inhibits cell growth in gastric cancer by inducing the expression of insulin receptor substrate 1 (34). The overexpression of CUL7 has been detected in hepatocellular carcinoma (HCC) tissues, particularly in metastatic HCC, and in vitro experiments have demonstrated that the knockdown of this gene pronouncedly decreases the metastatic capacity of HCC (32). In addition, the expression of CUL7 has been observed to be increased in non-small cell lung cancer cells, with its high expression potentially promoting the invasion and metastasis of these cells (35). Cyclin D1 is a vital protein for cell proliferation in various types of cancer. Its activation is controlled via the degradation caused by ubiquitin-mediated proteolysis (36). In the present study, CUL7 and its family member CUL1 were enriched in the ubiquitin-mediated proteolysis pathway, suggesting that their activation through this pathway may also regulate cell proliferation in CRC. However, there is insufficient evidence supporting the involvement of CUL7 in metastatic CRC. The results of the present study indicate that CUL7 is a signature gene that is able to distinguish between metastatic and non-metastatic CRC. In combination with the previously reported findings that CUL7 is involved in the metastasis of other cancers, it is speculated that this gene may also be implicated in metastatic CRC, and is activated via the ubiquitin-mediated proteolysis pathway. SSR is a glycosylated membrane receptor responsible for protein entry into the ER (37). As one of the four SSR family members, SSR3 is a non-glycosylated subunit that mediates the translocation of nascent polypeptide through the ER membrane (38). Reportedly, in pancreatic cancer (PAC) and prostate cancer, the expression of SSR3 is elevated, and the inhibition of this gene may weaken the potential tumor growth of PAC (39,40). Furthermore, in a study using an RNA sequencing method, it was predicted that SSR3 is a target of a long noncoding RNA, RP5-890E16.4, that may have crucial roles in esophageal squamous cell carcinoma (41). In murine breast tumor, SSR3 has been identified as a DEG associated with metastasis, using an exon-based clustering method (42). However, to the best of our knowledge, no further information is available concerning the role of this gene in CRC progression, particularly in metastasis. In the present study, SSR3 was demonstrated to be a critical gene signature of metastatic CRC that was enriched in the protein processing in ER pathway. Together, the aforementioned information suggests that SSR3 may serve an important function in the metastasis of CRC via involvement in the protein processing in ER pathway, and could be used as a novel therapeutic target for the treatment of metastatic CRC. Despite these comprehensive analyses and the precision of the SVM classifier, the present study has the limitation that all the predicted results lack experimental validation. Nevertheless, the findings are valuable as they provide novel insights into the regulatory mechanisms of the metastasis of CRC and identify novel biomarkers for the prognosis of this disease. In conclusion, the SVM-classified gene signatures in the present study precisely distinguished metastatic CRC samples from non-metastatic ones, using genes including CREB1, CUL7 and SSR3. The genes could be used as biomarkers for the prognosis of metastatic CRC. However, substantial additional experiments are required to validate the predicted expression levels and functions.
  36 in total

1.  Colorectal cancer epidemiology: incidence, mortality, survival, and risk factors.

Authors:  Fatima A Haggar; Robin P Boushey
Journal:  Clin Colon Rectal Surg       Date:  2009-11

2.  Expression of an ASCL2 related stem cell signature and IGF2 in colorectal cancer liver metastases with 11p15.5 gain.

Authors:  D E Stange; F Engel; T Longerich; B K Koo; M Koch; N Delhomme; M Aigner; G Toedt; P Schirmacher; P Lichter; J Weitz; B Radlwimmer
Journal:  Gut       Date:  2010-05-17       Impact factor: 23.059

Review 3.  A perspective on cancer cell metastasis.

Authors:  Christine L Chaffer; Robert A Weinberg
Journal:  Science       Date:  2011-03-25       Impact factor: 47.728

Review 4.  A portrayal of E3 ubiquitin ligases and deubiquitylases in cancer.

Authors:  Yatendra Kumar Satija; Abhishek Bhardwaj; Sanjeev Das
Journal:  Int J Cancer       Date:  2013-03-25       Impact factor: 7.396

5.  Sorcin Enhances Metastasis and Promotes Epithelial-to-Mesenchymal Transition of Colorectal Cancer.

Authors:  Weihua Tong; Donghui Sun; Quan Wang; Jian Suo
Journal:  Cell Biochem Biophys       Date:  2015-06       Impact factor: 2.194

6.  Colorectal cancer statistics, 2014.

Authors:  Rebecca Siegel; Carol Desantis; Ahmedin Jemal
Journal:  CA Cancer J Clin       Date:  2014-03-17       Impact factor: 508.702

Review 7.  The role of cullin proteins in gastric cancer.

Authors:  Peng Chen; Guo-Dong Yao
Journal:  Tumour Biol       Date:  2015-10-15

8.  MicroRNA-205 functions as a tumor suppressor in colorectal cancer by targeting cAMP responsive element binding protein 1 (CREB1).

Authors:  Peng Li; Wan-Jiang Xue; Ying Feng; Qin-Sheng Mao
Journal:  Am J Transl Res       Date:  2015-10-15       Impact factor: 4.060

9.  Inhibition of Liver Carcinoma Cell Invasion and Metastasis by Knockdown of Cullin7 In Vitro and In Vivo.

Authors:  Donghui Zhang; Genling Yang; Xidong Li; Cheng Xu; Honglei Ge
Journal:  Oncol Res       Date:  2016       Impact factor: 5.574

10.  Whole genome analysis for liver metastasis gene signatures in colorectal cancer.

Authors:  Dong Hyuk Ki; Hei-Cheul Jeung; Chan Hee Park; Seung Hee Kang; Gui Youn Lee; Won Suk Lee; Nam Kyu Kim; Hyun Chul Chung; Sun Young Rha
Journal:  Int J Cancer       Date:  2007-11-01       Impact factor: 7.396

View more
  16 in total

1.  Distinguishing Rectal Cancer from Colon Cancer Based on the Support Vector Machine Method and RNA-sequencing Data.

Authors:  Yan Zhang; Yuan Wu; Zi-Ying Gong; Hai-Dan Ye; Xiao-Kai Zhao; Jie-Yi Li; Xiao-Mei Zhang; Sheng Li; Wei Zhu; Mei Wang; Ge-Yu Liang; Yun Liu; Xin Guan; Dao-Yun Zhang; Bo Shen
Journal:  Curr Med Sci       Date:  2021-04-20

2.  Artificial intelligence predicts lymph node metastasis or risk of lymph node metastasis in T1 colorectal cancer.

Authors:  Kenta Kasahara; Kenji Katsumata; Akira Saito; Tetsuo Ishizaki; Masanobu Enomoto; Junichi Mazaki; Tomoya Tago; Yuichi Nagakawa; Jun Matsubayashi; Toshitaka Nagao; Hiroshi Hirano; Masahiko Kuroda; Akihiko Tsuchida
Journal:  Int J Clin Oncol       Date:  2022-07-31       Impact factor: 3.850

3.  IL-8, MSPa, MIF, FGF-9, ANG-2 and AgRP collection were identified for the diagnosis of colorectal cancer based on the support vector machine model.

Authors:  Mingfu Cui; Yanan Zhao; Zuocong Zhang; Yang Zhao; Songyun Han; Ruijie Wang; Dayong Ding; Xuedong Fang
Journal:  Cell Cycle       Date:  2021-03-28       Impact factor: 4.534

4.  Discovering Panel of Autoantibodies for Early Detection of Lung Cancer Based on Focused Protein Array.

Authors:  Di Jiang; Xue Zhang; Man Liu; Yulin Wang; Tingting Wang; Lu Pei; Peng Wang; Hua Ye; Jianxiang Shi; Chunhua Song; Kaijuan Wang; Xiao Wang; Liping Dai; Jianying Zhang
Journal:  Front Immunol       Date:  2021-04-23       Impact factor: 7.561

5.  Biochemical pathway analysis of gastric atrophy.

Authors:  Mostafa Rezaei Tavirani; Sina Rezaei Tavirani; Fatemeh Tajik Rostami
Journal:  Gastroenterol Hepatol Bed Bench       Date:  2018

6.  Gene screening of colorectal cancers via network analysis.

Authors:  Vahid Mansouri; Mostafa Rezaei Tavirani; Sina Rezaei Tavirani
Journal:  Gastroenterol Hepatol Bed Bench       Date:  2019

7.  ANXA2, PRKCE, and OXT are critical differentially genes in Nonalcoholic fatty liver disease.

Authors:  Mostafa Rezaei Tavirani; Majid Rezaei Tavirani; Mona Zamanian Azodi
Journal:  Gastroenterol Hepatol Bed Bench       Date:  2019

8.  Barrett's esophagus network analysis revealed that arginine, alanine, aspartate, glutamate, valine, leucine and isoleucine can be biomarkers.

Authors:  Mohammad Reza Zali; Mohammad-Mehdi Zadeh-Esmaeel; Majid Rezaei Tavirani; Sina Rezaei Tavirani; Mohsen Norouzinia; Mostafa Rezaei-Tavirani
Journal:  Gastroenterol Hepatol Bed Bench       Date:  2018

9.  Linkage analysis revealed risk loci on 6p21 and 18p11.2-q11.2 in familial colon and rectal cancer, respectively.

Authors:  Susanna von Holst; Xiang Jiao; Wen Liu; Vinaykumar Kontham; Jessada Thutkawkorapin; Jenny Ringdahl; Patrick Bryant; Annika Lindblom
Journal:  Eur J Hum Genet       Date:  2019-04-05       Impact factor: 4.246

10.  Gene expression analysis of colon high-grade dysplasia revealed new molecular mechanism of disease.

Authors:  Habib Malekpour; Mohammad Hossein Heidari; Reza Vafaee; Hamideh Moravvej Farshi; Mahsa Khodadoostan
Journal:  Gastroenterol Hepatol Bed Bench       Date:  2018
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.