| Literature DB >> 30008861 |
Tian-Ming Zhang1, Tao Huang2, Rong-Fei Wang3.
Abstract
Colorectal cancer is a severe cancer associated with a high prevalence and fatality rate. There are three major mechanisms for colorectal cancer: (1) Chromosome instability (CIN), (2) CpG island methylator phenotype (CIMP) and (3) mismatch repair (MMR), of which CIN is the most common type. However, these subtypes are not exclusive and overlap. To investigate their biological mechanisms and cross talk, the gene expression profiles of 585 colorectal cancer patients with CIN, CIMP and MMR status records were collected. By comparing the CIN+ and CIN-samples, CIMP+ and CIMP-samples, MMR+ and MMR-samples with minimal redundancy maximal relevance (mRMR) and incremental feature selection (IFS) methods, the CIN, CIMP and MMR associated genes were selected. Unfortunately, there was little direct overlap among them. To investigate their indirect interactions, downstream genes of CIN, CIMP and MMR were identified using the random walk with restart (RWR) method and a greater overlap of downstream genes was indicated. The common downstream genes were involved in biosynthetic and metabolic pathways. These findings were consistent with the clinical observation of wide range metabolite aberrations in colorectal cancer. To conclude, the present study gave a gene level explanation of CIN, CIMP and MMR, but also showed the network level cross talk of CIN, CIMP and MMR. The common genes of CIN, CIMP and MMR may be useful for cross-subtype general colorectal cancer drug development.Entities:
Keywords: CpG island methylator phenotype; chromosome instability; incremental feature selection; minimal redundancy maximal relevance; mismatch repair; random walk with restart
Year: 2018 PMID: 30008861 PMCID: PMC6036478 DOI: 10.3892/ol.2018.8860
Source DB: PubMed Journal: Oncol Lett ISSN: 1792-1074 Impact factor: 2.967
Figure 1.The IFS curves of CIN, CIMP and MMR. (A) The IFS curve of CIN. The top 34 mRMR genes were chosen and their LOOCV sensitivity, specificity, accuracy and MCC were 0.932, 0.696, 0.877 and 0.648, respectively. (B) The IFS curve of CIMP. The top 19 mRMR genes were chosen and their leave one out cross validation sensitivity, specificity, accuracy and MCC were 0.710, 0.976, 0.928 and 0.744, respectively. (C) The IFS curve of MMR. The top 18 mRMR genes were chosen and their leave one out cross validation sensitivity, specificity, accuracy and MCC were 0.922, 0.985, 0.976 and 0.902, respectively. IFS, incremental feature selection; CIN, chromosome instability; CIMP, CpG island methylator phenotype; MMR, mismatch repair; mRMR, minimal redundancy maximal relevance; LOOCV, leave-one-out cross validation.
The 34 chromosome instability-associated genes.
| Order | Symbol | Name | Entrez gene | mRMR score |
|---|---|---|---|---|
| 1 | STX16 | Syntaxin 16 | 8675 | 0.161 |
| 2 | NCBP3 | Nuclear cap binding subunit 3 | 55421 | 0.062 |
| 3 | IVD | Isovaleryl-CoA dehydrogenase | 3712 | 0.061 |
| 4 | DYNLRB1 | Dynein light chain roadblock-type 1 | 83658 | 0.067 |
| 5 | EXOSC9 | Exosome component 9 | 5393 | 0.044 |
| 6 | ATP9B | ATPase phospholipid transporting 9B (putative) | 374868 | 0.043 |
| 7 | KBTBD2 | Kelch repeat and BTB domain containing 2 | 25948 | 0.042 |
| 8 | EMC6 | ER membrane protein complex subunit 6 | 83460 | 0.043 |
| 9 | ADGRG6 | Adhesion G protein-coupled receptor G6 | 57211 | 0.046 |
| 10 | OIP5-AS1 | OIP5 antisense RNA 1 | 729082 | 0.044 |
| 11 | RNF145 | Ring finger protein 145 | 153830 | 0.043 |
| 12 | COMMD7 | COMM domain containing 7 | 149951 | 0.046 |
| 13 | TUFT1 | Tuftelin 1 | 7286 | 0.038 |
| 14 | NLRX1 | NLR family member X1 | 79671 | 0.036 |
| 15 | CYB5D1 | Cytochrome b5 domain containing 1 | 124637 | 0.038 |
| 16 | RTFDC1 | Replication termination factor 2 domain containing 1 | 51507 | 0.037 |
| 17 | RPL22L1 | Ribosomal protein L22 like 1 | 200916 | 0.034 |
| 18 | TMEM102 | Transmembrane protein 102 | 284114 | 0.032 |
| 19 | TM9SF4 | Transmembrane 9 superfamily member 4 | 9777 | 0.035 |
| 20 | HERPUD2 | HERPUD family member 2 | 64224 | 0.033 |
| 21 | RHEB | Ras homolog enriched in brain | 6009 | 0.033 |
| 22 | NDUFAF1 | NADH:ubiquinone oxidoreductase complex assembly factor 1 | 51103 | 0.033 |
| 23 | TGFBR2 | Transforming growth factor β receptor 2 | 7048 | 0.034 |
| 24 | TRIM7 | Tripartite motif containing 7 | 81786 | 0.032 |
| 25 | PRPSAP2 | Phosphoribosyl pyrophosphate synthetase associated protein 2 | 5636 | 0.032 |
| 26 | HEATR4 | HEAT repeat containing 4 | 399671 | 0.032 |
| 27 | SERINC3 | Serine incorporator 3 | 10955 | 0.034 |
| 28 | HSPA4L | Heat shock protein family A (Hsp70) member 4 like | 22824 | 0.03 |
| 29 | RALBP1 | RalA binding protein 1 | 10928 | 0.029 |
| 30 | RRNAD1 | Ribosomal RNA adenine dimethylase domain containing 1 | 51093 | 0.029 |
| 31 | CTC1 | CST telomere replication complex component 1 | 80169 | 0.03 |
| 32 | EIF6 | Eukaryotic translation initiation factor 6 | 3692 | 0.031 |
| 33 | TMEM176A | Transmembrane protein 176A | 55365 | 0.031 |
| 34 | ROCK2 | Rho associated coiled-coil containing protein kinase 2 | 9475 | 0.03 |
Figure 2.The heatmaps of CIN, CIMP and MMR. (A) The heatmap of CIN. The 34 CIN associated genes were used to cluster the CIN+ and CIN-patients. (B) The heatmap of CIMP. The 19 CIMP associated genes were used to cluster the CIMP+ and CIMP- patients. (C) The heatmap of MMR. The 18 MMR associated genes were used to cluster the MMR+ and MMR- patients. CIN, chromosome instability; CIMP, CpG island methylator phenotype; MMR, mismatch repair.
The 19 CpG island methylator phenotype-associated genes.
| Order | Name | Gene name | Entrez gene | mRMR score |
|---|---|---|---|---|
| 1 | MLH1 | mutL homolog 1 | 4292 | 0.193 |
| 2 | HUNK | Hormonally up-regulated Neu-associated kinase | 30811 | 0.069 |
| 3 | ZNF141 | Zinc finger protein 141 | 7700 | 0.063 |
| 4 | DUSP18 | Dual specificity phosphatase 18 | 150290 | 0.058 |
| 5 | ADGRG6 | Adhesion G protein-coupled receptor G6 | 57211 | 0.053 |
| 6 | FOXD1 | Forkhead box D1 | 2297 | 0.052 |
| 7 | FAM84A | Family with sequence similarity 84 member A | 151354 | 0.049 |
| 8 | AFAP1-AS1 | AFAP1 antisense RNA 1 | 84740 | 0.047 |
| 9 | ZBTB38 | Zinc finger and BTB domain containing 38 | 253461 | 0.052 |
| 10 | VANGL2 | VANGL planar cell polarity protein 2 | 57216 | 0.054 |
| 11 | PRDX5 | Peroxiredoxin 5 | 25824 | 0.049 |
| 12 | MTERF1 | Mitochondrial transcription termination factor 1 | 7978 | 0.05 |
| 13 | QPRT | Quinolinate phosphoribosyltransferase | 23475 | 0.05 |
| 14 | HOXC6 | Homeobox C6 | 3223 | 0.045 |
| 15 | HS3ST1 | Heparan sulfate-glucosamine 3-sulfotransferase 1 | 9957 | 0.044 |
| 16 | PIWIL1 | Piwi like RNA-mediated gene silencing 1 | 9271 | 0.046 |
| 17 | JUN | Jun proto-oncogene, AP-1 transcription factor subunit | 3725 | 0.047 |
| 18 | GRM8 | Glutamate metabotropic receptor 8 | 2918 | 0.045 |
| 19 | ZNF665 | Zinc finger protein 665 | 79788 | 0.046 |
The 18 mismatch repair-associated genes.
| Order | Name | Gene name | Entrez gene | mRMR score |
|---|---|---|---|---|
| 1 | HNRNPL | Heterogeneous nuclear ribonucleoprotein L | 3191 | 0.285 |
| 2 | HPSE | Heparanase | 10855 | 0.097 |
| 3 | CAB39L | Calcium binding protein 39 like | 81617 | 0.081 |
| 4 | MTA2 | Metastasis associated 1 family member 2 | 9219 | 0.093 |
| 5 | RAPGEF6 | Rap guanine nucleotide exchange factor 6 | 51735 | 0.086 |
| 6 | LYG1 | Lysozyme g1 | 129530 | 0.081 |
| 7 | SEC22B | SEC22 homolog B, vesicle trafficking protein (gene/pseudogene) | 9554 | 0.081 |
| 8 | BRD3 | Bromodomain containing 3 | 8019 | 0.076 |
| 9 | H2AFJ | H2A histone family member J | 55766 | 0.079 |
| 10 | RAB12 | RAB12, member RAS oncogene family | 201475 | 0.072 |
| 11 | TGFBR2 | Transforming growth factor β receptor 2 | 7048 | 0.078 |
| 12 | STRN3 | Striatin 3 | 29966 | 0.076 |
| 13 | INO80D | INO80 complex subunit D | 54891 | 0.076 |
| 14 | MLH1 | MutL homolog 1 | 4292 | 0.079 |
| 15 | EIF5A | Eukaryotic translation initiation factor 5A | 1984 | 0.072 |
| 16 | MIR3916 | microRNA 3916 | 100500849 | 0.069 |
| 17 | FOXO3 | Forkhead box O3 | 2309 | 0.069 |
| 18 | FBXO21 | F-box protein 21 | 23014 | 0.069 |
Figure 3.The Venn diagram of CIN associated genes, CIMP associated genes and MMR associated genes. None genes were common in these three gene lists. The overlap between CIN and CIMP was ADGRG6, the common gene between CIN and MMR was TGFBR2 and the overlap between CIMP and MMR was MLH1. CIN, chromosome instability; CIMP, CpG island methylator phenotype; MMR, mismatch repair.
Figure 4.The workflow to investigate the cross talk among CIN, CIMP and MMR. (A) The CIN associated genes, CIMP associated genes and MMR associated genes were identified using mRMR and IFS methods. (B) The direct overlap between CIN genes, CIMP genes and MMR genes were little. (C) The genes that the CIN genes, CIMP genes and MMR genes affect were identified using RWR method. (D) When both the CIN genes, CIMP genes and MMR genes and their RWR genes were considered, the overlap among CIN, CIMP and MMR was significantly increased. (E) The biological functions of the common genes (the red star) were studied. CIN, chromosome instability; CIMP, CpG island methylator phenotype; MMR, mismatch repair; mRMR, minimal redundancy maximal relevance; IFS, incremental feature selection; RWR, random walk with restart.
Figure 5.The Venn diagram of CIN downstream genes, CIMP downstream genes and MMR downstream genes. There were 236 common downstream genes of CIN, CIMP and MMR. CIN, chromosome instability; CIMP, CpG island methylator phenotype; MMR, mismatch repair; RWR, random walk with restart.
Common downstream genes of chromosome instability, CpG island methylator phenotype and mismatch repair.
| List of common genes | |||||||
|---|---|---|---|---|---|---|---|
| A1BG | CD248 | DEFB131 | HECA | LCE1A | NAIP | SEMA4C | TRMU |
| A1CF | CDKAL1 | DEFB134 | HES2 | LCE1B | NCDN | SERINC3 | TSEN15 |
| ABCC5 | CEP120 | DEFB135 | HES3 | LCE1D | NCR3 | SERINC5 | TSEN2 |
| ABHD12 | CFAP58 | DNAJC9 | HGFAC | LCE1E | NCR3LG1 | SETDB2 | TSEN34 |
| ABHD6 | CGREF1 | DYSF | HHLA2 | LCE3B | NSUN4 | SLC16A7 | TSEN54 |
| ABI3 | CGRRF1 | EMB | HHLA3 | LCE3C | NTNG1 | SLC30A8 | UBAP2 |
| ABI3BP | CLASRP | ENAM | HLA-DOA | LCN1 | NTNG2 | SLC36A2 | UNKL |
| ACOT13 | CLEC2A | ETV7 | HLA-DOB | LETM1 | OR10H1 | SLC3A1 | UPK1A |
| ADAT1 | CLK2 | FAM149B1 | HMGN3 | LIAS | ORAOV1 | SLC51A | UPK1B |
| ADAT2 | CLK3 | FAM3C | HOXC13 | LIPT2 | PANK1 | SLC51B | UPK2 |
| ADAT3 | CLK4 | FAT4 | HPCA | LMBR1L | PANK2 | SLC6A18 | UPK3A |
| AGR2 | CLN6 | FBXO38 | IGLL1 | LRRC4 | PANK3 | SLC6A19 | UPK3B |
| AGR3 | CLN8 | FJX1 | IGSF3 | LRRC4C | PANK4 | SLC6A20 | VEZT |
| AMBN | CNBD1 | FLCN | IGSF6 | LXN | PCTP | SLC6A9 | VN1R1 |
| AMICA1 | COASY | FNIP2 | IGSF9B | LYPD3 | PHF11 | SLC7A9 | VNN2 |
| ANO5 | COMMD10 | FOXQ1 | IKBIP | MARCO | PIP | SMDT1 | VPREB1 |
| APOBEC1 | COMMD7 | FUZ | INTU | MCU | PLXDC1 | SP8 | XAGE1B |
| AZGP1 | COMMD8 | GABRR1 | KBTBD6 | MDGA1 | PPCDC | SPICE1 | XAGE2 |
| BCS1L | CPA1 | GABRR2 | KBTBD7 | METTL9 | PPCS | SPINK9 | YAE1D1 |
| BFSP1 | CPA4 | GNPTAB | KCNK10 | MFSD10 | PRLH | SPINT1 | YIPF3 |
| BFSP2 | CPN1 | GNPTG | KCNK2 | MICU1 | PRLHR | ST14 | YIPF4 |
| BSCL2 | CPN2 | GP2 | KCNK4 | MICU2 | PRSS8 | STYX | YRDC |
| CARHSP1 | CRISP3 | GRID2 | KIAA0319 | MMS22L | PTCD3 | SUGP2 | ZCCHC17 |
| CCDC109B | CRYBA1 | GRID2IP | KLF7 | MSRA | RASD2 | TM2D1 | ZFR |
| CCDC179 | CRYBB1 | GRXCR1 | KLK5 | MSRB2 | RBBP9 | TM2D2 | ZNF461 |
| CCDC68 | CTAGE5 | GSX2 | KLRF2 | MSRB3 | RPUSD4 | TMEM126B | ZNF772 |
| CCIN | CXADR | GTPBP1 | KRTAP24-1 | MTERF4 | RSRP1 | TMEM19 | |
| CD101 | CYLC1 | GTPBP3 | KRTAP25-1 | MTO1 | SCGB2A2 | TMEM27 | |
| CD200 | DCDC2 | HAS1 | KRTAP27-1 | MUCL1 | SCGB3A2 | TONSL | |
| CD200R1 | DEFB110 | HAS3 | L3MBTL1 | MYO7A | SDCBP2 | TOX2 | |
Figure 6.The significance of overlap among CIN downstream genes, CIMP downstream genes and MMR downstream genes. To statistically evaluate the significance of overlap, we calculated the odds ratio and p value using R package Super Exact Test. The odds ratio of overlap was 60.3 and the P-value was smaller than 1e-320. CIN, chromosome instability; CIMP, CpG island methylator phenotype; MMR, mismatch repair.
Kyoto Encyclopedia of Genes and Genomes and Gene Ontology enrichments of common downstream genes of chromosome instability, CpG island methylator phenotype and mismatch repair.
| Type | Gene set | FDR |
|---|---|---|
| KEGG | hsa00770 Pantothenate and CoA biosynthesis | 4.35E-11 |
| hsa00785 Lipoic acid metabolism | 0.0226 | |
| hsa04514 Cell adhesion molecules (CAMs) | 0.0476 | |
| GO BP | GO:0015937 coenzyme A biosynthetic process | 9.66E-08 |
| GO:0015936 coenzyme A metabolic process | 1.32E-06 | |
| GO:0033866 nucleoside bisphosphate biosynthetic process | 1.72E-06 | |
| GO:0034030 ribonucleoside bisphosphate biosynthetic process | 1.72E-06 | |
| GO:0034033 purine nucleoside bisphosphate biosynthetic process | 1.72E-06 | |
| GO:0008033 tRNA processing | 6.32E-05 | |
| GO:0009451 RNA modification | 0.000240 | |
| GO:0033865 nucleoside bisphosphate metabolic process | 0.000267 | |
| GO:0033875 ribonucleoside bisphosphate metabolic process | 0.000267 | |
| GO:0034032 purine nucleoside bisphosphate metabolic process | 0.000267 | |
| GO:0015804 neutral amino acid transport | 0.000561 | |
| GO:0006865 amino acid transport | 0.00215 | |
| GO:0015807 L-amino acid transport | 0.00215 | |
| GO:0046942 carboxylic acid transport | 0.00218 | |
| GO:0000379 tRNA-type intron splice site recognition and cleavage | 0.00218 | |
| GO:0006399 tRNA metabolic process | 0.00233 | |
| GO:0015849 organic acid transport | 0.00254 | |
| GO:0036444 mitochondrial calcium uptake | 0.00277 | |
| GO:0015711 organic anion transport | 0.00408 | |
| GO:0008544 epidermis development | 0.00408 | |
| GO:0031424 keratinization | 0.00458 | |
| GO:0030855 epithelial cell differentiation | 0.00458 | |
| GO:0006820 anion transport | 0.00458 | |
| GO:1905039 carboxylic acid transmembrane transport | 0.00473 | |
| GO MF | GO:0000213 tRNA-intron endonuclease activity | 5.22E-05 |
| GO:0004594 pantothenate kinase activity | 5.22E-05 | |
| GO:0015171 amino acid transmembrane transporter activity | 0.000748 | |
| GO:0008514 organic anion transmembrane transporter activity | 0.000748 | |
| GO:0046943 carboxylic acid transmembrane transporter activity | 0.000760 | |
| GO:0008509 anion transmembrane transporter activity | 0.00112 | |
| GO:0005342 organic acid transmembrane transporter activity | 0.00112 | |
| GO:0015175 neutral amino acid transmembrane transporter activity | 0.00162 | |
| GO:0016892 endoribonuclease activity, producing 3′-phosphomonoesters | 0.00230 | |
| GO:0004549 tRNA-specific ribonuclease activity | 0.0128 | |
| GO:0015179 L-amino acid transmembrane transporter activity | 0.0221 | |
| GO:0005328 neurotransmitter:sodium symporter activity | 0.0291 | |
| GO:0005212 structural constituent of eye lens | 0.0333 | |
| GO:0016894 endonuclease activity, active with either ribo- or deoxyribonucleic acids and producing 3′-phosphomonoesters | 0.0458 | |
| GO:0008251 tRNA-specific adenosine deaminase activity | 0.0462 | |
| GO CC | GO:1990246 uniplex complex | 0.000199 |
| GO:0000214 tRNA-intron endonuclease complex | 0.00661 | |
| GO:0005886 plasma membrane | 0.0114 | |
| GO:0071944 cell periphery | 0.0114 | |
| GO:0031526 brush border membrane | 0.0125 | |
| GO:0098590 plasma membrane region | 0.0125 | |
| GO:0098862 cluster of actin-based cell projections | 0.0148 | |
| GO:0044459 plasma membrane part | 0.0168 | |
| GO:0001533 cornified envelope | 0.0242 |