| Literature DB >> 36117847 |
Jili Hu1,2, Weiqiang Yu3,4,5, Yuting Dai3,4, Can Liu1, Yongkang Wang1,2, Qingfa Wu3,4,5.
Abstract
Background: Gastric cancer (GC) is one of the deadliest cancers in the world, with a 5-year overall survival rate of lower than 20% for patients with advanced GC. Genomic information is now frequently employed for precision cancer treatment due to the rapid advancements of high-throughput sequencing technologies. As a result, integrating multiomics data to construct predictive models for the GC patient prognosis is critical for tailored medical care.Entities:
Year: 2022 PMID: 36117847 PMCID: PMC9481367 DOI: 10.1155/2022/2965166
Source DB: PubMed Journal: J Oncol ISSN: 1687-8450 Impact factor: 4.501
Figure 1(a) The number of nodes in each layer of the network is constructed based on the Reactome pathway. The first layer has 26 nodes and the last layer has 2 nodes. (b). The parent-child relationship network layer constructed based on the Reactome pathway has a total of 9 layers, each node represents a pathway, and the node corresponds to the corresponding pathway gene.
Figure 2The architecture of the GCS-Net proposed to integrate multiomics data for the GC prognosis prediction. The structure of the GCS-Net consists of a feature layer (multiomics data), a layer of genes, and four layers of biological pathways based on Reactome, and the layers are directly sparsely connected.
GCS-NET network model parameters.
| Layer (type) | Output shape | Param | Connected to |
|---|---|---|---|
| Inputs (InputLayer) | (none, 34380) | 0 | |
| h0 (Diagonal) | (none, 11460) | 45840 | Inputs[0][0] |
| dropout_0 (Dropout) | (none, 11460) | 0 | h0[0][0] |
| h1 (SparseTF) | (none, 1061) | 22081 | dropout_0[0][0] |
| dropout_1 (Dropout) | (none, 1061) | 0 | h1[0][0] |
| h2 (SparseTF) | (none, 447) | 1512 | dropout_1[0][0] |
| dropout_2 (Dropout) | (none, 447) | 0 | h2[0][0] |
| h3 (SparseTF) | (none, 147) | 594 | dropout_2[0][0] |
| dropout_3 (Dropout) | (none, 147) | 0 | h3[0][0] |
| h4 (SparseTF) | (none, 26) | 174 | dropout_3[0][0] |
| o_linear1 (Dense) | (none, 1) | 11461 | h0[0][0] |
| o_linear2 (Dense) | (none, 1) | 1062 | h1[0][0] |
| o_linear3 (Dense) | (none, 1) | 448 | h2[0][0] |
| o_linear4 (Dense) | (none, 1) | 148 | h3[0][0] |
| o_linear5 (Dense) | (none, 1) | 27 | h4[0][0] |
| Total | 83347 |
The GCS-Net and other classic machine learning method model's scores.
| Model | Accuracy | auc | aupr | f1 | Precision | Recall |
|---|---|---|---|---|---|---|
| GCS-Net | 0.844 | 0.807 | 0.949 | 0.913 | 0.840 | 1 |
| L2 LogisticRegression | 0.800 | 0.751 | 0.907 | 0.886 | 0.833 | 0.945 |
| RBF support vector machine | 0.733 | 0.628 | 0.916 | 0.846 | 0.804 | 0.891 |
| Linear support vector machine | 0.777 | 0.743 | 0.943 | 0.871 | 0.829 | 0.918 |
| Random forest | 0.800 | 0.785 | 0.946 | 0.886 | 0.833 | 0.945 |
| Decision tree | 0.755 | 0.692 | 0.893 | 0.857 | 0.825 | 0.891 |
Figure 3Prediction performance of the GCS-Net. (a) The AUPRC value of the GCS-Net outperforms other classical machine learning models on the test set. (b) The GCS-Net has a true negative rate (TN) of 75% and a true positive rate (TP) of 100% in the test set.
Figure 4GCS-Net model pathway Sankey diagram. The Sankey diagram visualization shows the node importance and mutual drive of each layer of the GCS-Net model, and the nodes with darker colors are more important. The left most node represents the input feature data type; the nodes of the second layer represent the last layer of genes constructed according to the Reactome pathway; each subsequent layer represents a higher-level biological pathway; the last layer represents the prediction result.
The top genes for survival prediction in GC by the GCS-Net.
| Gene name | Reference |
|---|---|
| UBE2C | [ |
| JAK2 | [ |
| RAD21 | [ |
| NUP210 | [ |
| PTPN1 | [ |
| CDC27 | [ |
| NUP188 | [ |
| PLK4 | [ |
The top pathway for survival prediction in GC by the GCS-Net.
| Pathway name | Reference |
|---|---|
| Mitotic anaphase | [ |
| Antigen processing | [ |
| Recruitment of NuMA to mitotic centrosomes | [ |
| Neddylation | [ |
| Centrosome maturation | [ |
| SUMO E3 ligases | [ |
| G2/M transition | [ |
| M phase | [ |
| SUMOylation | [ |
| Cell cycle | [ |