| Literature DB >> 27150055 |
Zhenyu Yue1,2, Hai-Tao Li3, Yabing Yang1, Sajid Hussain1, Chun-Hou Zheng3, Junfeng Xia2, Yan Chen1.
Abstract
Breast cancer (BC) is one of the most common malignancies that could threaten female health. As the molecular mechanism of BC has not yet been completely discovered, identification of related genes of this disease is an important area of research that could provide new insights into gene function as well as potential treatment targets. Here we used subnetwork extraction algorithms to identify novel BC related genes based on the known BC genes (seed genes), gene co-expression profiles and protein-protein interaction network. We computationally predicted seven key genes (EPHX2, GHRH, PPYR1, ALPP, KNG1, GSK3A and TRIT1) as putative genes of BC. Further analysis shows that six of these have been reported as breast cancer associated genes, and one (PPYR1) as cancer associated gene. Lastly, we developed an expression signature using these seven key genes which significantly stratified 1660 BC patients according to relapse free survival (hazard ratio [HR], 0.55; 95% confidence interval [CI], 0.46-0.65; Logrank p = 5.5e-13). The 7-genes signature could be established as a useful predictor of disease prognosis in BC patients. Overall, the identified seven genes might be useful prognostic and predictive molecular markers to predict the clinical outcome of BC patients.Entities:
Keywords: breast cancer; candidate gene; gene co-expression; protein-protein interaction; subnetwork extraction algorithm
Mesh:
Substances:
Year: 2016 PMID: 27150055 PMCID: PMC5094985 DOI: 10.18632/oncotarget.9132
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Summary workflow for identifying BC related candidate genes
The approach was based on three steps: (1) We obtained a PPI network derived from STRING (Search Tool for the Retrieval of Interacting Genes/Proteins) and constructed a co-expression network using gene expression data from The Cancer Genome Atlas (TCGA). A common network was generated through comparing the two networks. (2) Seed genes and the common network were imported into GenRev which was used to extract three subnetworks with different extraction algorithms. (3) 7 common genes were found in the obtained subnetworks which were considered as key candidate genes related to BC.
Numbers of nodes and edges in PPI network, co-expression network and their common network were illustrated
| PPI Network | Co-expression Network | Common Network | |
|---|---|---|---|
| Nodes | 17460 | 17325 | 9534 |
| Edges | 4850628 | 2303648 | 148182 |
Seed genes were collected from the breast cancer gene database (BCGD)
| Gene Names | |||
|---|---|---|---|
| NCOA3 | COL18A1 | FGF4 | PLAU |
| AKT2 | TSG101 | TP53 | VIM |
| TFAP2C | FGFR4 | PHB | APC |
| ATM | NME1 | PLAT | BCL2 |
| ESR1 | GRB7 | PRL | THRA |
| BRCA1 | PTPRF | SRC | EGFR |
| BRCA2 | HRAS | TGFA | ERBB3 |
| CCND1 | IGF1R | PTPN1 | NF2 |
| CDKN2A | FGF3 | PGR | FGFR1 |
| CDKN2B | KRAS | RB1 | KIT |
| CDKN2C | MYCL | CTTN | MDM2 |
| CDKN2D | IGF2R | SSTR2 | MET |
| CTSD | MCC | SSTR3 | MYC |
| PLG | MLH1 | SSTR5 | ERBB2 |
| MSH2 | |||
Numbers of edges, seeds and linkers in the subnetworks obtained from different methods
| Algorithm | Edges | Seeds | Linkers |
|---|---|---|---|
| Steiner | 136 | 57 | 22 |
| Kwalk (not-weighted) | 268 | 57 | 87 |
| Kwalk (edge-weighted) | 348 | 57 | 83 |
Linkers refer to the novel genes. Steiner and Kwalk refer to the Klein-Ravi algorithm and the limited k-walk algorithm, respectively.
Figure 2The subnetwork extracted by not-weighted Klein-Ravi algorithm
Figure 3Kaplan–Meier plot of relapse free survival using the 7-genes signature
Figure 4The smallest subnetwork connecting seven key genes derived from the common network