| Literature DB >> 29717769 |
Xiaosheng Hang1, Dapeng Li2, Jianping Wang3, Ge Wang4.
Abstract
The aim of the present study was to reveal the potential molecular mechanisms of microsatellite instability (MSI) on the prognosis of gastric cancer (GC). The investigation was performed based on an RNAseq expression profiling dataset downloaded from The Cancer Genome Atlas, including 64 high‑level MSI (MSI‑H) GC samples, 44 low‑level MSI (MSI‑L) GC samples and 187 stable microsatellite (MSI‑S) GC samples. Differentially expressed genes (DEGs) were identified between the MSI‑H, MSI‑L and MSI‑S samples. Pathway enrichment analysis was performed for the identified DEGs and the pathway deviation scores of the significant enrichment pathways were calculated. A Multi‑Layer Perceptron (MLP) classifier, based on the different pathways associated with the MSI statuses was constructed for predicting the outcome of patients with GC, which was validated in another independent dataset. A total of 190 DEGs were selected between the MSI‑H, MSI‑L and MSI‑S samples. The MLP classifier was established based on the deviation scores of 10 significant pathways, among which antigen processing and presentation, and inflammatory bowel disease pathways were significantly enriched with HLA‑DRB5, HLA‑DMA, HLA‑DQA1 and HLA‑DRA; the measles, toxoplasmosis and herpes simplex infection pathways were significantly enriched with Janus kinase 2 (JAK2), caspase‑8 (CASP8) and Fas. The classifier performed well on an independent validation set with 100 GC samples. Taken together, the results indicated that MSI status may affect GC prognosis, partly through the antigen processing and presentation, inflammatory bowel disease, measles, toxoplasmosis and herpes simplex infection pathways. HLA‑DRB5, HLA‑DMA, HLA‑DQA1, HLA‑DRA, JAK2, CASP8 and Fas may be predictive factors for prognosis in GC.Entities:
Mesh:
Year: 2018 PMID: 29717769 PMCID: PMC5979886 DOI: 10.3892/ijmm.2018.3643
Source DB: PubMed Journal: Int J Mol Med ISSN: 1107-3756 Impact factor: 4.101
Demographic and clinical information of the training set and validation set.
| Dataset | Age (years) | Sex (M/F) | MSS/MSI-L/MSI-H | Survival status | OS (months) |
|---|---|---|---|---|---|
| Training | 65±5 | 182/113 | 187/44/64 | 230 alive/57 deceased | 12.7±13.4 |
| Validation | 67±6 | 61/39 | 90 | NA | 10.1±12.6 |
MSS+MSIL; NA, survival times were unavailable; M, male; F, female; MSI, microsatellite instability; MSI-L, low-level MSI; MSI-H, high-level MSI; MSS, stable MSI; OS, overall survival.
Figure 1DEG screening. (A) P-value distribution. The black vertical line represents the threshold of P=0.05 [−log (0.05)=1.3]. (B) CV distribution. Two black vertical lines represents 10% CV and 90% CV, respectively (−35.1, 33.8). (C) Ratio of DEGs in all genes. (D) DEG distribution; green points represent DEGs and grey points represent other genes. DEGs, differentially expressed genes; CV, coefficient of variation.
Significantly differentially expressed genes at the protein level.
| Gene | P-value |
|---|---|
| TLDC1 | 0.0001 |
| CRABP2 | 0.0019 |
| C1ORF116 | 0.0075 |
| C6ORF132 | 0.0121 |
| CAPNS2 | 0.0213 |
| HSD3B2 | 0.0272 |
| GPR157 | 0.0290 |
| HEPN1 | 0.0306 |
| KLK12 | 0.0315 |
| HOXB2 | 0.0329 |
| FOXI2 | 0.0371 |
| C12ORF54 | 0.0479 |
Figure 2Heatmap of top 30 co-expressed genes. Genes are shown on the horizontal and vertical axes. The red lattice represents a positive correlation, the blue lattice represents a negative correlation. The color bar indicates the R value.
Figure 3Gene co-expression networks of gastric cancer samples of different MSI status. (A) MSI-S samples; (B) MSI-L samples. (C) MSI-H samples. Red rectangles represent upregulated genes, green rectangles represent downregulated genes. MSI, microsatellite instability; MSI-S, stable MSI; MSI-L, low-level MSI; MSI-H, high-level MSI.
Figure 4Topological properties of 3 gene co-expressed networks. (A) Degree of genes in the three networks; (B) Average shortest path length of the three networks. MSI, microsatellite instability; MSS, stable MSI; MSI-L, low-level MSI; MSI-H, high-level MSI.
Figure 5Heatmap of unsupervised hierarchical cluster analysis of DEGs. Samples are on the vertical axis; genes are on the horizontal axis. Among samples on the vertical axis, the samples with a good outcome are in green, and the samples with a poor outcome are in red. In the heatmap, upregulated genes are shown in red and downregulated genes are shown in green.
Outcome of patients in the training set.
| MSI | Prognosis
| |
|---|---|---|
| Good (n) | Poor (n) | |
| MSI-H | 57 | 2 |
| MSI-L | 11 | 25 |
| MSS | 121 | 47 |
Good indicates patient survival at 12 months following diagnosis. Poor indicates patient succumbed to mortality 12 months following diagnosis. MSI, microsatellite instability; MSI-L, low-level MSI; MSI-H, high-level MSI; MSS, stable MSI.
Significant pathways enriched with differentially expressed genes.
| Term | Count | P-value | Genes |
|---|---|---|---|
| Graft-vs.-host disease | 8 | 2.52E-08 | CD86, CD80, HLA-DRB5, FAS, HLA-E, HLA-DMA, HLA-DQA1, HLA-DRA |
| Allograft rejection | 8 | 5.89E-08 | CD86, CD80, HLA-DRB5, FAS, HLA-E, HLA-DMA, HLA-DQA1, HLA-DRA |
| Type I diabetes mellitus | 8 | 1.48E-07 | CD86, CD80, HLA-DRB5, FAS, HLA-E, HLA-DMA, HLA-DQA1, HLA-DRA |
| Autoimmune thyroid disease | 8 | 6.80E-07 | CD86, CD80, HLA-DRB5, FAS, HLA-E, HLA-DMA, HLA-DQA1, HLA-DRA |
| Viral myocarditis | 8 | 1.29E-06 | CD86, CD80, CASP8, HLA-DRB5, HLA-E, HLA-DMA, HLA-DQA1, HLA-DRA |
| Herpes simplex infection | 12 | 1.59E-06 | DDX58, HMGN1, IFIH1, GTF2IRD1, CASP8, HLA-DRB5, JAK2, FAS, HLA-E, HLA-DMA, HLA-DQA1, HLA-DRA |
| Tuberculosis | 10 | 5.85E-05 | FCGR1A, CASP8, HLA-DRB5, FCER1G, ATP6V1H, JAK2, CLEC7A, HLA-DMA, HLA-DQA1, HLA-DRA |
| Cell adhesion molecules | 9 | 7.58E-05 | CLDN16, CD86, CD80, HLA-DRB5, L1CAM, HLA-E, HLA-DMA, HLA-DQA1, HLA-DRA |
| Intestinal immune network for IgA production | 6 | 9.50E-05 | CD86, CD80, HLA-DRB5, HLA-DMA, HLA-DQA1, HLA-DRA |
| Phagosome | 9 | 1.28E-04 | FCGR1A, HLA-DRB5, ITGB5, ATP6V1H, CLEC7A, HLA-E, HLA-DMA, HLA-DQA1, HLA-DRA |
| Asthma | 5 | 2.05E-04 | HLA-DRB5, FCER1G, HLA-DMA, HLA-DQA1, HLA-DRA |
| Rheumatoid arthritis | 7 | 2.26E-04 | CD86, CD80, HLA-DRB5, ATP6V1H, HLA-DMA, HLA-DQA1, HLA-DRA |
| Influenza A | 9 | 3.11E-04 | DDX58, IFIH1, HLA-DRB5, JAK2, CPSF4, FAS, HLA-DMA, HLA-DQA1, HLA-DRA |
| Systemic lupus erythematosus | 8 | 3.50E-04 | HIST1H2AC, CD86, CD80, FCGR1A, HLA-DRB5, HLA-DMA, HLA-DQA1, HLA-DRA |
| Leishmaniasis | 6 | 6.70E-04 | FCGR1A, HLA-DRB5, JAK2, HLA-DMA, HLA-DQA1, HLA-DRA |
| Antigen processing and presentation | 6 | 9.15E-04 | KLRC4, HLA-DRB5, HLA-E, HLA-DMA, HLA-DQA1, HLA-DRA |
| Toxoplasmosis | 7 | 1.09E-03 | CASP8, HLA-DRB5, JAK2, BIRC3, HLA-DMA, HLA-DQA1, HLA-DRA |
| 5 | 1.98E-03 | FCGR1A, HLA-DRB5, HLA-DMA, HLA-DQA1, HLA-DRA | |
| Inflammatory bowel disease | 5 | 3.69E-03 | IL18RAP, HLA-DRB5, HLA-DMA, HLA-DQA1, HLA-DRA |
| HTLV-I infection | 7 | 4.18E-02 | IL2RB, HLA-DRB5, HLA-E, HLA-DMA, HLA-DQA1, HLA-DRA, APC |
| Measles | 5 | 4.34E-02 | DDX58, IL2RB, IFIH1, JAK2, FAS |
Term, pathway identity; count, number of genes enriched in a pathway.
Figure 6Pathway deviation score of measles pathway. (A) Pathway deviation score of measles pathway in MSI-H, MSI-L and MSS samples; (B) pathway deviation score of measles pathway in good outcome samples and poor outcome samples. MSI, microsatellite instability; MSI-L, low-level MSI; MSI-H, high-level MSI; MSS, stable MSI.
Figure 7Pathway deviation score of leishmaniasis pathway. (A) Pathway deviation score of leishmaniasis pathway in MSI-H, MSI-L and MSS samples; (B) pathway deviation score of leishmaniasis pathway in good outcome samples and poor outcome samples. MSI, microsatellite instability; MSI-L, low-level MSI; MSI-H, high-level MSI; MSIS, stable MSI.
Analysis of pathway deviation scores.
| Pathway | P-value prognosis | P-value MSS |
|---|---|---|
| Measles | 2.03E-29 | 3.32E-03 |
| Antigen processing and presentation | 7.42E-19 | 4.80E-02 |
| Rheumatoid arthritis | 1.70E-13 | 1.25E-03 |
| Phagosome | 5.84E-12 | 2.48E-02 |
| Systemic lupus erythematosus | 1.31E-11 | 6.02E-03 |
| Herpes simplex infection | 1.18E-06 | 1.67E-02 |
| Inflammatory bowel disease | 1.14E-05 | 4.63E-02 |
| Tuberculosis | 3.57E-04 | 3.98E-04 |
| Type I diabetes mellitus | 1.28E-03 | 7.13E-03 |
| Toxoplasmosis | 1.64E-03 | 4.82E-03 |
| Cell adhesion molecules | 7.99E-03 | 9.63E-02 |
| Viral myocarditis | 1.11E-02 | 9.75E-03 |
| Asthma | 1.21E-02 | 2.01E-02 |
| HTLV I infection | 4.21E-02 | 3.54E-02 |
| Autoimmune thyroid disease | 8.06E-02 | 7.13E-03 |
| Allograft rejection | 8.86E-02 | 7.13E-03 |
| Staphylococcus aureus infection | 1.50E-01 | 4.63E-02 |
| Graft versus host disease | 2.26E-01 | 7.13E-03 |
| Leishmaniasis | 2.52E-01 | 1.03E-02 |
| Influenza A | 2.99E-01 | 1.79E-02 |
| Intestinal immune network for IgA production | 3.91E-01 | 7.66E-03 |
MSS, microsatellite stability.
Figure 8Multi-Layer Perceptron neural network. A black edge linking any two neural nodes (circles) represents the correlation between the two nodes, with the bias term indicated by the black number. Blue edges and numbers represent the weight value of the correlation between any two neural nodes.
Figure 9ROC curve of the MLP classifier. (A) Curve for training set; (B) curve for validation set. The vertical axis represents sensitivity and the horizontal axis represents specificity. ROC, Receiver Operating Characteristic; MLP, Multi-Layer Perceptron; AUC, area under the curve; LR, logistic regression.
Figure 10Survival analysis of patients using the log-rank test in the validation set. The vertical axis represents ratio of patient survival; the horizontal axis represents the survival time.