| Literature DB >> 27243824 |
Ashok Palaniappan1, Karthick Ramar1, Satish Ramalingam1.
Abstract
It is well-known that the conversion of normal colon epithelium to adenoma and then to carcinoma stems from acquired molecular changes in the genome. The genetic basis of colorectal cancer has been elucidated to a certain extent, and much remains to be known about the identity of specific cancer genes that are associated with the advancement of colorectal cancer from one stage to the next. Here in this study we attempted to identify novel cancer genes that could underlie the stage-specific progression and metastasis of colorectal cancer. We conducted a stage-based meta-analysis of the voluminous tumor genome-sequencing data and mined using multiple approaches for novel genes driving the progression to stage-II, stage-III and stage-IV colorectal cancer. The consensus of these driver genes seeded the construction of stage-specific networks, which were then analyzed for the centrality of genes, clustering of subnetworks, and enrichment of gene-ontology processes. Our study identified three novel driver genes as hubs for stage-II progression: DYNC1H1, GRIN2A, GRM1. Four novel driver genes were identified as hubs for stage-III progression: IGF1R, CPS1, SPTA1, DSP. Three novel driver genes were identified as hubs for stage-IV progression: GSK3B, GGT1, EIF2B5. We also identified several non-driver genes that appeared to underscore the progression of colorectal cancer. Our study yielded potential diagnostic biomarkers for colorectal cancer as well as novel stage-specific drug targets for rational intervention. Our methodology is extendable to the analysis of other types of cancer to fill the gaps in our knowledge.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27243824 PMCID: PMC4887059 DOI: 10.1371/journal.pone.0156665
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Staging of colon cancer.
The American Joint Committee on Cancer (AJCC) has staged the colorectal cancer based on the anatomical extent of the disease. Stage I: Tumor that is limited to the mucosal layer (T1) or muscularis propria (T2), without involvement of any lymph node or distant metastatic organs. Stage II: Tumor that penetrates the muscularis propria (T3) or invades nearby organs or structures (T4), without involvement of any lymph node or distant metastatic organs. Stage III: Tumor stages with lymph node metastasis but without distant metastasis. Stage IV: Any tumor stage and lymph node status with distant organ metastasis.
Fig 2Identification of hub driver genes.
Consensus novel driver genes were identified for each stage from driverDB data. Consensus central genes (‘hubs’) were identified from each stage-specific network. The overlap between these two sets of genes yields ‘hub driver’ genes for each stage.
From tumor genome-sequencing data to network reconstruction.
| Stage | No. of Samples | Consensus driver genes | Subtraction of previous stage | Screening Against background | Screening against Cancer Gene Census | Network reconstruction #node; #edge |
|---|---|---|---|---|---|---|
| II | 108 | 52 | 47 | 34 | 27 | 109; 396 |
| III | 75 | 56 | 43 | 35 | 31 | 109; 199 |
| IV | 28 | 49 | 43 | 32 | 30 | 115; 297 |
The number of driver genes at each step leading to the network reconstruction.
Centrality analysis of stage-II network.
| S.No. | Betweenness centrality | Closeness centrality | Bottleneck centrality | Consensus Centrality (hubs) | Hub+driver genes |
|---|---|---|---|---|---|
| 1 | DLG4 | DLG4 | DLG4 | DLG4 | DYNC1H1 |
| 2 | SMC2 | GRIN2B | DYNC1H1 | SMC2 | GRIN2A |
| 3 | DYNC1H1 | GRIN2A | SMC2 | DYNC1H1 | GRM1 |
| 4 | PLCG1 | GRIN1 | PLCG1 | PLCG1 | |
| 5 | GRIN2B | CAMK2A | CAMK2A | GRIN2B | |
| 6 | GRIN2A | PLCG1 | RHOG | GRIN2A | |
| 7 | RHOG | DYNC1H1 | NF2 | RHOG | |
| 8 | SMC1A | RHOG | SMC1A | SMC1A | |
| 9 | GRM1 | LRRC7 | GRIN2A | GRM1 | |
| 10 | NF2 | GRIA1 | GRM1 | GRIN1 | |
| 11 | GRIN1 | GRM1 | HRSP12 | CAMK2A | |
| 12 | NCAPH | DLG2 | GRIN2B | ||
| 13 | CAMK2A | DYNLL1 | LRRC7 | ||
| 14 | SOX9 | SMC2 | RAD21 | ||
| 15 | CHEK1 | SMC1A | GRIN1 |
The top 15 genes obtained by applying each centrality metric are given. The genes at the intersection of all the three metrics are designated as ‘hubs’. Genes that are both hub and driver are shown.
Moduland decomposition of stage-II network.
| S. No | Module centre | Eff. size of module | Driver gene members | Hub gene members |
|---|---|---|---|---|
| 1 | DLG4 | 43 | GRIN2A, GRM1, LRRC7 | CAMK2A, DLG4, GRIN1. GRIN2A, GRIN2B, GRM1, PLCG1, RHOG |
| 2 | SMC2 | 17 | NCAPD2, STAG1 | SMC1A, SMC2 |
| 3 | DYNC1H1 | 6 | DYNC1H1 | DYNC1H1 |
| 4 | PPP1R12B | 9 | PPP1R12B, KCNQ5 | |
| 5 | UBE4B | 6 | UBE4B | |
| 6 | CLSPN | 4 | CLSPN | |
| 7 | SOX9 | 5 | SOX9 |
Each module is represented by its centre and effective size. Hub genes identified by centrality analysis and driver genes are indicated by their module membership. Modules with the maximum number of hub genes could function as driver subnetworks in the pathogenesis of disease progression.
Fig 3Network of cluster centres of stage-II network.
Clustering was done using Moduland.
GO enrichment analysis of Stage-II network.
| GO- ID | q-value | %size > 20 | Description | Hub genes | Other genes in the network |
|---|---|---|---|---|---|
| 7059 | 3.10E-12 | N | chromosome segregation | SMC2, SMC1A | PDS5B, SMC3,SMC4,NCAPD2,NCAPH,RAD21, NDEL1, NCAPG, STAG2,NEK6,STAG1 |
| 7215 | 1.62E-10 | Y | glutamate signaling pathway | GRIN1,GRIN2A,GRIN2B | HOMER3, GRIN3B, GRIA4,HOMER1, HOMER2 |
| 45202 | 1.62E-10 | N | Synapse | GRIN1,GRIN2A,GRIN2B, GRM1, DLG4, CAMK2A | GABBR1,NLGN2,GRIN3B,GRIA4,CDH2,HOMER1,ADORA1,HOMER2,SLC17A7,SLC32A1,GRIA1,HOMER3, DLG2 |
| 279 | 5.55E-10 | N | M phase | DYNC1H1, SMC1A, SMC2 | SSSCA1,FZR1,PDS5B,PDS5A,CHEK1,SMC3,SMC4,NCAPD2,NCAPH,RAD21,NCAPG,STAG2,NEK6,NUDC,STAG1 |
| 8328 | 7.06E-09 | Y | ionotropic glutamate receptor complex | GRIN1,GRIN2A,GRIN2B | GRIA1, GRIN3B, GRIA4 |
| 8066 | 2.31E-08 | Y | glutamate receptor activity | GRM1, GRIN1,GRIN2A,GRIN2B | GRIA1, GABBR1, GRIN3B, GRIA4 |
| 7216 | 1.60E-04 | Y | metabotropic glutamate receptor signaling pathway | HOMER3, HOMER1, HOMER2 | |
| 50839 | 1.18E-02 | N | cell adhesion molecule binding | GRIN2A,GRIN2B | PTPRT |
Significant GO terms enriched in stage II network are given. If the ratio of the genes in the network for a given GO term to the total number of genes in that GO term is greater than 20%, ‘Y’ is indicated, otherwise ‘N’ is indicated. Hub genes identified in our analysis are indicated for each GO term.
Centrality analysis of stage-III network.
| S.No | Betweenness centrality | Closeness centrality | Bottleneck centrality | Consensus Centrality (hubs) | Hub+driver genes |
|---|---|---|---|---|---|
| 1 | HEATR1 | MACF1 | ARAF | HEATR1 | IGF1R |
| 2 | IGF1R | MAPK9 | MAPK9 | IGF1R | CPS1 |
| 3 | MAPK9 | IGF1R | PLEC | MAPK9 | SPTA1 |
| 4 | ARAF | SPTA1 | MSN | ARAF | DSP |
| 5 | XRN1 | PRKCE | HEATR1 | PRKCE | |
| 6 | PRKCE | XRN1 | IGF1R | CPS1 | |
| 7 | CPS1 | DSP | CPS1 | PLEC | |
| 8 | PLEC | ARAF | SPTA1 | MSN | |
| 9 | MSN | PLEC | UTP20 | SPTA1 | |
| 10 | MTOR | HEATR1 | RICTOR | DSP | |
| 11 | MACF1 | CPS1 | MTOR | ||
| 12 | UTP20 | MSN | PIK3R3 | ||
| 13 | SPTA1 | NRXN2 | DSP | ||
| 14 | DSP | COL17A1 | COL17A1 | ||
| 15 | MAPK6 | MAPK6 | PRKCE |
The top 15 genes obtained by applying each centrality metric are given. The genes at the intersection of all the three metrics are designated as ‘hubs’. Genes that are both hub and driver are shown.
Moduland decomposition of stage-III network.
| S. No | Module centre | Eff. size of module | Driver gene members | Hub gene members |
|---|---|---|---|---|
| 1 | MTOR | 15 | RICTOR, UTP20, MAGEC1 | HEATR1 |
| 2 | SPTA1 | 11 | SPTA1, MACF1 | SPTA1, PLEC |
| 3 | DSP | 12 | DSP, ADAM28, PKD2L1 | DSP, PRKCE |
| 4 | SPTB | 7 | MACF1 | |
| 5 | NRXN2 | 8 | NRXN3 | |
| 6 | XRN1 | 16 | XRN1, CPS1 | CPS1 |
| 7 | TEK | 7 | TEK | |
| 8 | MAPK9 | 14 | MAPK6, ANK3 | MAPK9, ARAF |
| 9 | IGF1R | 7 | IGF1R | IGF1R, MSN |
| 10 | FBN2 | 6 | FBN2 | |
| 11 | MYLK | 7 | MYLK | |
| 12 | CHRM2 | 4 | CHRM2 | |
| 13 | AIFM1 | 5 | AIFM1 |
Each module is represented by its centre and effective size. Hub genes identified by centrality analysis and driver genes are indicated by their module membership. Modules with the maximum number of hub genes could function as driver subnetworks in the pathogenesis of disease progression.
Fig 4Network of cluster centres of stage-III network.
Clustering was done using Moduland.
GO enrichment analysis of stage-III network.
| GO- ID | q-value | %size > 20 | Description | Hub genes | Other genes in the network |
|---|---|---|---|---|---|
| 7059 | 3.10E-12 | N | chromosome segregation | SMC2, SMC1A | PDS5B,SMC3,SMC4,NCAPD2,NCAPH,RAD21,NDEL1,NCAPG,STAG2,NEK6,STAG1 |
| 7215 | 1.62E-10 | Y | glutamate signaling pathway | GRIN1,GRIN2A,GRIN2B | HOMER3, GRIN3B,GRIA4,HOMER1, HOMER2 |
| 45202 | 1.62E-10 | N | Synapse | GRIN1,GRIN2A,GRIN2B, GRM1, DLG4,CAMK2A | GABBR1,NLGN2,GRIN3B,GRIA4,CDH2,HOMER1, ADORA1,HOMER2,SLC17A7,SLC32A1, GRIA1,HOMER3, DLG2 |
| 279 | 5.55E-10 | N | M phase | DYNC1H1, SMC1A, SMC2 | SSSCA1,FZR1,PDS5B,PDS5A,CHEK1,SMC3,SMC4,NCAPD2,NCAPH,RAD21,NCAPG,STAG2,NEK6,NUDC,STAG1 |
| 8328 | 7.06E-09 | Y | ionotropic glutamate receptor complex | GRIN1,GRIN2A,GRIN2B | GRIA1, GRIN3B, GRIA4 |
| 8066 | 2.31E-08 | Y | glutamate receptor activity | GRM1,GRIN1,GRIN2A,GRIN2B | GRIA1, GABBR1, GRIN3B, GRIA4 |
| 7216 | 1.60E-04 | Y | metabotropic glutamate receptor signaling pathway | HOMER3, HOMER1, HOMER2 | |
| 50839 | 1.18E-02 | N | cell adhesion molecule binding | GRIN2A,GRIN2B | PTPRT |
Significant GO terms enriched in stage III network are given. If the ratio of the genes in the network for a given GO term to the total number of genes in that GO term is greater than 20%, ‘Y’ is indicated, otherwise ‘N’ is indicated. Hub genes identified in our analysis are indicated for each GO term.
Centrality analysis of stage-IV network.
| S.No. | Betweenness centrality | Closeness centrality | Bottleneck centrality | Consensus Centrality (hubs) | Hub+driver genes |
|---|---|---|---|---|---|
| 1 | GSK3B | GSK3B | GSK3B | GSK3B | GSK3B |
| 2 | AKT1 | PXN | AKT1 | AKT1 | GGT1 |
| 3 | PXN | AKT1 | HSPA5 | PXN | EIF2B5 |
| 4 | GGT1 | VEGFA | CHKB | GGT1 | |
| 5 | SFN | PLCG1 | GGT1 | SFN | |
| 6 | PPIF | EIF2B5 | PXN | GNAI2 | |
| 7 | GNAI2 | CHKB | SFN | CHKB | |
| 8 | CHKB | GNAI2 | PLCG1 | HSPA5 | |
| 9 | HSPA5 | FLT1 | EIF2B5 | EIF2B5 | |
| 10 | EIF2B5 | HSPA5 | QARS | PLCG1 | |
| 11 | PLCG1 | SFN | SSR4 | ||
| 12 | SF3B3 | GGT1 | GNAI2 | ||
| 13 | IARS | SF3B1 | HECW1 | ||
| 14 | NTRK2 | ITGAV | NTRK2 | ||
| 15 | MAGI1 | ITGA9 | SF3B3 |
The top 15 genes obtained by applying each centrality metric are given. The genes at the intersection of all the three metrics are designated as ‘hubs’. Genes that are both hub and driver are shown.
Moduland decomposition of stage-IV network.
| S. No | Module centre | Eff. size of module | Driver gene members | Hub gene members |
|---|---|---|---|---|
| 1 | EIF2B5 | 14 | EIF2B5 | EIF2B5, CHKB, GGT1, GSK3B |
| 2 | FLT1 | 29 | FLT1, TNC, ADCY8 | GNAI2, PXN, PLCG |
| 3 | GSK3B | 34 | GSK3B, MAGI1, LCT | GSK3B, AKT1, HSPA5, SFN |
| 4 | NTRK2 | 7 | NTRK2, SLAMF7 | |
| 5 | EPRS | 8 | EPRS | |
| 6 | SF3B3 | 12 | SF3B3, GGT1, SYT14 | GGT1 |
| 7 | HLA-C | 5 | ||
| 8 | ROR2 | 4 | ||
| 9 | FGG | 3 | F13B | |
| 10 | GABRG2 | 4 | GABRG2 |
Each module is represented by its centre and effective size. Hub genes identified by centrality analysis and driver genes are indicated by their module membership. Modules with the maximum number of hub genes could function as driver subnetworks in the pathogenesis of disease progression.
Fig 5Network of cluster centres of stage-IV network.
Clustering was done using Moduland.
GO enrichment analysis of stage-IV network.
| GO- ID | q-value | %size > 20 | Description | Hub genes | Other genes in the network |
|---|---|---|---|---|---|
| 5851 | 7.11E-09 | Y | eukaryotic translation initiation factor 2B complex | EIF2B5 | EIF2B2, EIF2B1,EIF2B, EIF2B4 |
| 22008 | 1.23E-05 | N | Neurogenesis | EIF2B5, AKT1 | WNT5A,NTF3,PTPRZ1,NF1,DLL1,FZD2,EIF2B1,BDNF,DYNLL2,TNR,VEGFA,ROR2,EIF2B2,EIF2B3,EIF2B4,DISC1 |
| 30817 | 3.76E-05 | N | regulation of cAMP biosynthetic process | GNIA2 | ADCY1,ADCY2,ADCY8,DRD5,ADCY5,NF1,NTRK2 |
| 30554 | 1.04E-03 | N | adenyl nucleotide binding | GSK3B, HSPA5 | ADCY1,FLT1,ADCY2,SGK3,MAGI1,TAOK1,ADCY8,ADCY5,CHKB,EPRS,QARS,CLPX,MARK1,IARS,AKT1,MUSK,CBWD1,RARS,NTRK2,ROR2,EIF2B2,DUS2L |
| 43409 | 1.10E-03 | N | negative regulation of MAPKKK cascade | AKT1 | SPRY2, MAGI1,NF1 |
| 3840 | 1.27E-02 | Y | gamma-glutamyl transferase activity | GGT1 | GGT2 |
Significant GO terms enriched in stage IV network are given. If the ratio of the genes in the network for a given GO term to the total number of genes in that GO term is greater than 20%, ‘Y’ is indicated, otherwise ‘N’ is indicated. Hub genes identified in our analysis are indicated for each GO hit.