| Literature DB >> 22784616 |
Jianxin Wang1, Gang Chen, Min Li, Yi Pan.
Abstract
BACKGROUND: Various gene-expression signatures for breast cancer are available for the prediction of clinical outcome. However due to small overlap between different signatures, it is challenging to integrate existing disjoint signatures to provide a unified insight on the association between gene expression and clinical outcome.Entities:
Mesh:
Year: 2011 PMID: 22784616 PMCID: PMC3287565 DOI: 10.1186/1752-0509-5-S3-S10
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1Schematic overview of graph centrality based integration of gene signatures. Schematic overview of graph centrality based integration of distinct breast cancer gene signatures.
Overlap among 94 breast cancer gene signatures
| Frequency | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 14 | 15 | 16 | 17 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4143 | 1608 | 687 | 323 | 148 | 56 | 40 | 23 | 15 | 13 | 4 | 3 | 1 | 1 | 1 | 1 |
There is very small overlap among distinct breast cancer signatures. Most genes exist in only one signature, and only 24 genes are included in 10 or more signatures.
Graph centrality based breast cancer signatures
| BC | CC | DC | EC | IC | SC |
|---|---|---|---|---|---|
| TP53 | TP53 | TP53 | TP53 | TP53 | TP53 |
| EGFR | ESR1 | EGFR | ESR1 | EGFR | ESR1 |
| CTNNB1 | AR | EP300 | EP300 | EP300 | EP300 |
| SMAD3 | EGFR | ESR1 | AR | ESR1 | AR |
| ESR1 | EP300 | BRCA1 | BRCA1 | BRCA1 | BRCA1 |
| EP300 | SMAD3 | CREBBP | CREBBP | CREBBP | CREBBP |
| SRC | BRCA1 | SMAD3 | SMAD3 | AR | SMAD3 |
| BRCA1 | CTNNB1 | AR | EGFR | SMAD3 | EGFR |
| CREBBP | SRC | CTNNB1 | HDAC1 | SRC | HDAC1 |
| AR | CREBBP | SRC | STAT3 | CTNNB1 | STAT3 |
| UBE2I | AKT1 | HDAC1 | RB1 | HDAC1 | RB1 |
| DYNLL1 | HDAC1 | CASP3 | CTNNB1 | RB1 | CTNNB1 |
| CASP3 | STAT3 | RB1 | JUN | PIK3R1 | JUN |
| ACTB | STAT1 | PIK3R1 | SRC | CASP3 | SRC |
| AKT1 | XRCC6 | STAT3 | AKT1 | STAT3 | AKT1 |
| HDAC1 | RB1 | UBE2I | SMARCA4 | AKT1 | SMARCA4 |
| PIK3R1 | PIK3R1 | AKT1 | CDKN1A | SHC1 | CDKN1A |
| ZBTB16 | PML | CDK2 | PML | CDK2 | PML |
| ACVR1 | UBE2I | PCNA | STAT1 | JUN | STAT1 |
| RB1 | HSPA8 | SHC1 | CDK2 | STAT1 | CDK2 |
| STAT1 | CASP3 | JUN | NCOA6 | UBE2I | NCOA6 |
| HGS | CDK2 | DYNLL1 | HDAC2 | CDKN1A | HDAC2 |
| CDK2 | SMARCA4 | STAT1 | PIK3R1 | PCNA | PIK3R1 |
| YWHAZ | CDKN1A | ACTB | UBE2I | HDAC2 | UBE2I |
| BCL2 | JUN | CDKN1A | XRCC6 | SMARCA4 | XRCC6 |
| XRCC6 | CDH1 | MAPK14 | HIF1A | NFKB1 | HIF1A |
| STAT3 | YWHAZ | YWHAZ | NFKB1 | CDH1 | NFKB1 |
| PLK1 | MAPK14 | BCL2 | CASP3 | ACTB | CASP3 |
| PCNA | PRKDC | HDAC2 | E2F1 | MAPK14 | E2F1 |
| HSPA8 | HIF1A | ATM | CEBPB | LYN | CEBPB |
| FN1 | SHC1 | LYN | PRKDC | XRCC6 | PRKDC |
| VIM | STUB1 | NFKB1 | SHC1 | ATM | SHC1 |
| CD44 | HDAC2 | XRCC6 | NCOA2 | ZBTB16 | NCOA2 |
| MAPK14 | ERBB2 | CDH1 | CCND1 | E2F1 | CCND1 |
| GNB2L1 | HSPA4 | SMARCA4 | PCNA | YWHAZ | PCNA |
| FANCA | BCL2 | EZR | TDG | BCL2 | TDG |
| EEF1A1 | ZBTB16 | HGS | ERBB2 | PML | ERBB2 |
| SKIL | LYN | ZBTB16 | HSPA4 | JAK1 | HSPA4 |
| SHC1 | NCOA6 | PLK1 | ZBTB16 | EZR | ZBTB16 |
| USP7 | PAK1 | CAV1 | RXRA | PRKDC | RXRA |
| CAV1 | MDM4 | E2F1 | MAPK14 | JAK2 | MAPK14 |
| JUN | CASP8 | JAK1 | JAK2 | CASP8 | JAK2 |
| RXRA | CEBPB | RXRA | STUB1 | RXRA | STUB1 |
| LYN | NFKB1 | FN1 | CDK7 | ERBB2 | CDK7 |
| CDKN1A | JAK2 | HSPA8 | PTPN6 | HIF1A | PTPN6 |
| HIF1A | PTPN6 | VAV1 | ATM | CAV1 | ATM |
| ATM | HGS | ACVR1 | BCL3 | HSPA8 | BCL3 |
| EZR | EZR | CASP8 | DDX5 | VAV1 | DDX5 |
| PPP1CA | CAV1 | HIF1A | RBL1 | CDKN2A | RBL1 |
| PAK1 | PIAS4 | JAK2 | FOS | NCOA6 | FOS |
| PLSCR1 | TDG | PML | CDH1 | PTPN6 | CDH1 |
| TGFBR2 | IGF1R | CDKN2A | MDM4 | DYNLL1 | MDM4 |
| AURKA | DDX5 | EEF1A1 | ING1 | HGS | ING1 |
| KPNB1 | MET | ERBB2 | SIN3A | SIN3A | SIN3A |
54 genes with highest graph centrality in the context-constrained PIN are selected to consist new breast cancer gene signatures. The genes in the six signatures identified by six graph centrality measurements are listed in this table.
Topological size of context-constrained PINs.
| Proteins | Interactions | |
|---|---|---|
| BC | 54 | 238 |
| CC | 54 | 330 |
| DC | 54 | 279 |
| EC | 54 | 352 |
| IC | 54 | 312 |
| SC | 54 | 352 |
| Chen | 54(35) | 70 |
To investigate the differences between graph centrality based gene signatures and that reported in previous literature, topological size of sub-network consisted by the genes in these gene signatures are calculated and presented in this table. It should be note that the sub-network consisted by the genes included in Chen's gene signature is not a connected graph. The number in parenthesis is the number of proteins of its biggest connected component.
Figure 2Sub-networks consisted by the genes of different signatures. The first six sub-networks are consisted by the signature genes identified by six graph centrality measurements, repectively. The last one is consisted by the signature genes identified by Chen. Compared with Chen's result, all the sub-networks that consisted by the genes of the signatures that identified by graph centrality are connected graph and denser.
KEGG pathways enrichment analysis results.
| Method | KEGG Pathway | Description | Annotated Genes | Corrected |
|---|---|---|---|---|
| BC | hsa05200 | pathways in cancer | 23 | 13.09 |
| CC | hsa05200 | pathways in cancer | 29 | 18.92 |
| DC | hsa05200 | pathways in cancer | 31 | 21.59 |
| EC | hsa05200 | pathways in cancer | 28 | 25.74 |
| IC | hsa05200 | pathways in cancer | 30 | 20.22 |
| SC | hsa05200 | pathways in cancer | 28 | 25.74 |
| Chen | hsa04110 | Cell cycle | 25 | 26.80 |
| Chen | hsa05200 | pathways in cancer | 11 | 2.59 |
To investigate the relationship between the gene signatures and cancer-related pathways, KEGG pathway enrichment analysis is preformed on these signatures.
Gene Ontology enrichment analysis results.
| Method | GO Term | Description | |
|---|---|---|---|
| BC | GO:0010033 | response to organic substance | 21.44 |
| CC | GO:0010604 | positive regulation of macromolecule metabolic process | 21.44 |
| DC | GO:0010604 | positive regulation of macromolecule metabolic process | 17.50 |
| EC | GO:0010604 | positive regulation of macromolecule metabolic process | 25.74 |
| IC | GO:0010033 | response to organic substance | 19.60 |
| SC | GO:0010604 | positive regulation of macromolecule metabolic process | 25.74 |
| Chen | GO:0007049 | cell cycle | 19.94 |
To investigate the relationship between the gene signatures and cancer-related biological processes, Gene Ontology enrichment analysis is preformed on these signatures.
Figure 3Relationship among most significant GO terms annotate to significant breast cancer genes identified by different methods. Relationship among most significant GO terms annotate to significant breast cancer genes identified by different methods. Green boxes indicates the GO terms annotate to genes identified by centrality, and red box indicate the GO term annotate to Chen's results.
Clinical outcome of two main clusters
| 118 samples cluster | 80 samples cluster | |
|---|---|---|
| Patient Age | 47 | 45 |
| Tumor Size(mm) | 2.06 | 2.36 |
| Disease-free survival (days) | 3498 | 3252 |
| Overall survival (days) | 4391 | 3792 |
| Distant metastasis-free survival (days) | 4148 | 3667 |
| Time to distant metastasis (days) | 4148 | 3667 |
| NPI Score | 3.4 | 4.17 |
| 10-year overall survival probability | 83.38 | 74.98 |
Mean value of each clinical outcome recorded in the microarray dataset of the samples in each cluster is presented in this table.