| Literature DB >> 31213036 |
Jia-Nuo Li1, Rui Zhong2, Xiong-Hui Zhou3.
Abstract
Bone is the most frequent organ for breast cancer metastasis, and thus it is essential to predict the bone metastasis of breast cancer. In our work, we constructed a gene dependency network based on the hypothesis that the relation between one gene and the risk of bone metastasis might be affected by another gene. Then, based on the structure controllability theory, we mined the driver gene set which can control the whole network in the gene dependency network, and the signature genes were selected from them. Survival analysis showed that the signature could distinguish the bone metastasis risks of cancer patients in the test data set and independent data set. Besides, we used the signature genes to construct a centroid classifier. The results showed that our method is effective and performed better than published methods.Entities:
Keywords: bone metastasis; breast cancer; driver gene set; gene dependency network; structure controllability theory
Mesh:
Year: 2019 PMID: 31213036 PMCID: PMC6627827 DOI: 10.3390/genes10060466
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Functional annotation of the driver gene set.
| KEGG Gene Set Name | FDR q-Value | |
|---|---|---|
| MAPK signaling pathway | 7.31 × 10−19 | 1.36 × 10−16 |
| Neuroactive ligand-receptor interaction | 1.12 × 10−15 | 1.04 × 10−13 |
| Pathways in cancer | 2.82 × 10−13 | 1.75 × 10−11 |
| Focal adhesion | 4.05 × 10−11 | 1.88 × 10−9 |
| Cytokine-cytokine receptor interaction | 8.22 × 10−11 | 3.06 × 10−9 |
| Regulation of actin cytoskeleton | 2.72 × 10−10 | 8.43 × 10−9 |
| SNARE (SNAP Receptor) interactions in vesicular transport | 1.62 × 10−9 | 4.29 × 10−8 |
| Complement and coagulation cascades | 1.26 × 10−8 | 2.92 × 10−7 |
| Purine metabolism | 3.08 × 10−8 | 6.36 × 10−7 |
| Spliceosome | 5.07 × 10−8 | 9.42 × 10−7 |
KEGG: Kyoto Encyclopedia of Genes and Genomes; FDR: False discovery rate.
The signature genes in our work.
| Gene Id | Gene Name | Frequency | |
|---|---|---|---|
| 85458 | DIXDC1 | 500 | 1.68192 × 10−6 |
| 29068 | ZBTB44 | 500 | 3.29096 × 10−6 |
| 51232 | CRIM1 | 500 | 1.46030e × 10−5 |
| 9986 | RCE1 | 500 | 2.12898 × 10−5 |
| 56888 | KCMF1 | 500 | 4.00030 × 10−5 |
| 1456 | CSNK1G3 | 500 | 4.58237 × 10−5 |
| 6256 | RXRA | 500 | 7.63144 × 10−5 |
| 55343 | SLC35C1 | 500 | 0.00012 |
| 55520 | ELAC1 | 500 | 0.00012 |
| 55081 | IFT57 | 500 | 0.00012 |
| 57610 | RANBP10 | 500 | 0.00018 |
| 5877 | RABIF | 500 | 0.00018 |
| 25839 | COG4 | 500 | 0.00019 |
| 23261 | CAMTA1 | 500 | 0.00020 |
| 3009 | HIST1H1B | 500 | 0.00020 |
| 3092 | HIP1 | 500 | 0.00026 |
| 246243 | RNASEH1 | 500 | 0.00030 |
| 3104 | ZBTB48 | 500 | 0.00031 |
| 10342 | TFG | 500 | 0.00032 |
| 6282 | S100A11 | 500 | 0.00033 |
| 10462 | CLEC10A | 500 | 0.00033 |
| 51199 | NIN | 500 | 0.00041 |
| 10531 | PITRM1 | 500 | 0.00048 |
| 9856 | KIAA0319 | 500 | 0.00049 |
| 11167 | FSTL1 | 500 | 0.00049 |
| 3993 | LLGL2 | 500 | 0.00052 |
| 56729 | RETN | 500 | 0.00054 |
| 51514 | DTL | 500 | 0.00054 |
| 9202 | ZMYM4 | 500 | 0.00058 |
| 51302 | CYP39A1 | 500 | 0.00065 |
| 9971 | NR1H4 | 500 | 0.00067 |
| 79083 | MLPH | 500 | 0.00073 |
| 65082 | VPS33A | 500 | 0.00075 |
| 10179 | RBM7 | 500 | 0.00078 |
| 55794 | DDX28 | 500 | 0.00082 |
| 57405 | SPC25 | 500 | 0.00089 |
| 51659 | GINS2 | 500 | 0.00089 |
| 1852 | DUSP9 | 500 | 0.00092 |
| 57017 | COQ9 | 500 | 0.00096 |
| 10397 | NDRG1 | 500 | 0.00098 |
| 9911 | TMCC2 | 500 | 0.00128 |
| 55095 | SAMD4B | 500 | 0.00137 |
| 23649 | POLA2 | 500 | 0.00143 |
| 10615 | SPAG5 | 500 | 0.00143 |
| 7134 | TNNC1 | 500 | 0.00145 |
| 7083 | TK1 | 500 | 0.00146 |
| 9442 | MED27 | 500 | 0.00151 |
| 8449 | DHX16 | 500 | 0.00171 |
| 8817 | FGF18 | 500 | 0.00176 |
| 483 | ATP1B3 | 500 | 0.00179 |
| 2175 | FANCA | 500 | 0.00190 |
The frequency of a gene is the times involved in the minimal driver gene sets and the p-values were calculated based on t-test.
Figure 1Survival analysis of test data set.
Figure 2Survival analysis of the independent data set.
Figure 3Comparing the performance of our method with published methods. AUC: area under the curve; DPBM: dysregulated pathway-based prediction model. SVM: Support vector machine; SCC: Shrunken centroids classifier.
Figure 4Comparing the performance of bone metastasis with other organ metastases.
Comparing results with estrogen receptor (ER) status.
| Training Data Set | Test Data Set | Independent Data Set | |
|---|---|---|---|
| Centroids classifier | 0.22 | 0.15 | 0.20 |
| ER status | 0.18 | 0.16 | 0.07 |