| Literature DB >> 33364806 |
Ran Xie1, Suwei Dong2, Jie Jiang3, Conghui Yang1, Lanjiang Li4, Sheng Zhao1, Yunlei Li3, Chun Wang1, Shujuan Li1, Yanbin Xiao2, Long Chen1.
Abstract
INTRODUCTION: Skin cutaneous melanoma (SKCM) is a common skin malignancy worldwide, and its metastasis and mortality rates are high. The molecular characteristics exhibited by tumor-immune interactions have drawn the attention from researchers. Therefore, increased knowledge and new strategies to identify effective immune-related biomarkers may improve the clinical management of SKCM by providing more accurate prognostic information. PATIENTS AND METHODS: In this study, we established a prognostic immune-related gene pair (IRGP) signature for predicting the survival of SKCM patients. The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases provided gene expression profiles together with clinical information, and the samples were randomly divided into three groups including the training, testing, and validation datasets. The regression model of least absolute shrinkage and selection operator (LASSO) helped to identify a 13-IRGP signature with a significant relation to the survival of SKCM patients.Entities:
Keywords: TCGA; bioinformatic; immune-related gene pair; signature; skin cutaneous melanoma
Year: 2020 PMID: 33364806 PMCID: PMC7751297 DOI: 10.2147/CCID.S281364
Source DB: PubMed Journal: Clin Cosmet Investig Dermatol ISSN: 1178-7015
The Clinical Information for Three Datasets
| TCGA | Independent Validation Dataset | GSE65904 | ||
|---|---|---|---|---|
| Alive | 229 | 35 | 106 | |
| Dead | 213 | 78 | 100 | |
| T0 | 23 | General | 21 | |
| T1 | 41 | In-transit | 15 | |
| T2 | 76 | Local | 11 | |
| T3 | 89 | Primary | 14 | |
| T4 | 143 | Regional | 138 | |
| TX | 74 | X | 7 | |
| N0 | 222 | |||
| N1 | 73 | |||
| N2 | 48 | |||
| N3 | 54 | |||
| NX | 49 | |||
| M0 | 401 | |||
| M1 | 21 | |||
| MX | 24 | |||
| I | 76 | 24 | ||
| II | 133 | 25 | ||
| III | 168 | 20 | ||
| IV | 20 | 43 | ||
| X | 49 | 1 | ||
| Female | 168 | 85 | ||
| Male | 278 | 121 | ||
| Extremities | 188 | |||
| Extremities&Trunk | 4 | |||
| Head&Neck | 32 | |||
| Trunk | 162 | |||
| Unknown | 60 | |||
| 0–40 | 60 | 10 | 20 | |
| 40–50 | 75 | 15 | 18 | |
| 50–60 | 102 | 33 | 40 | |
| 60–70 | 88 | 31 | 54 | |
| 70–100 | 121 | 24 | 74 | |
The Clinical Information of the Training and Testing Datasets
| Clinical Features | Training Set | Testing Set |
|---|---|---|
| Alive | 115 | 114 |
| Dead | 107 | 110 |
| T0 | 13 | 10 |
| T1 | 21 | 20 |
| T2 | 40 | 36 |
| T3 | 43 | 46 |
| T4 | 72 | 71 |
| TX | 33 | 41 |
| N0 | 110 | 112 |
| N1 | 35 | 38 |
| N2 | 23 | 25 |
| N3 | 31 | 23 |
| NX | 23 | 26 |
| M0 | 201 | 200 |
| M1 | 11 | 10 |
| MX | 10 | 14 |
| I | 42 | 34 |
| II | 60 | 73 |
| III | 86 | 82 |
| IV | 11 | 9 |
| X | 23 | 26 |
| Female | 84 | 84 |
| Male | 138 | 140 |
| Extremities | 84 | 104 |
| Extremities&Trunk | 3 | 1 |
| Head&Neck | 19 | 13 |
| Trunk | 87 | 75 |
| Unknown | 29 | 31 |
| 0–40 | 23 | 37 |
| 40–50 | 44 | 31 |
| 50–60 | 49 | 53 |
| 60–70 | 46 | 42 |
| 70–100 | 60 | 61 |
Figure 1The relationships between p-values and HRs. The relationships between the p-values and HRs of 1614 significant prognostic IRGPs. Red indicates the IRGPs with p-value < 0.05.
Figure 2Construction of the IRGP signature in SKCM. (A) The risk model ROC curve of 13 IRGPs in the training set after LASSO regression. The statistics of Risk-H and Risk-L samples under different OS outcomes and the proportion of Risk-L samples in the total sample vary with OS. (B) The risk model ROC curve of 13 IRGPs in TCGA after LASSO regression. The statistics of Risk-H and Risk-L samples under different OS outcomes and the proportion of Risk-L samples in the total sample vary with OS. (C) The risk model ROC curve of 13 IRGPs in the independent set after LASSO regression. The statistics of Risk-H and Risk-L samples under different OS outcomes and the proportion of Risk-L samples in the total sample vary with OS. (D) The risk model ROC curves of 13 IRGPs in the GSE65904 set after LASSO regression. The statistics of Risk-H and Risk-L samples under different OS outcomes and the proportion of Risk-L samples in the total sample vary with OS.
Figure 3IRGP signature evaluation and validation for survival prediction. (A) The K-M survival plot of the training set. (B) The K-M survival plot of the TCGA dataset. (C) The K-M survival plot of the independent dataset. (D) The K-M survival plot of the GSE65904 dataset.
The Gene Family Enrichment Analysis of IRGPs Signature
| Gene Family | Genes | p-value | p-adj |
|---|---|---|---|
| Repulsive guidance molecule family | 0.00430607 | 0.107651741 | |
| Erb-b2 receptor tyrosine kinases | 0.005379802 | 0.134495062 | |
| Tetratricopeptide repeat domain containing | 0.007020208 | 0.175505211 | |
| Integrin beta subunits | 0.010731814 | 0.268295359 | |
| Toll like receptors | 0.011798894 | 0.29497235 | |
| V-set domain containing | 0.01356257 | 0.339064249 | |
| Apolipoprotein B mRNA editing enzyme catalytic subunits | 0.013929739 | 0.348243483 | |
| Caspases | 0.014993507 | 0.374837679 | |
| Potassium voltage-gated channel subfamily J | 0.018178205 | 0.454455119 | |
| FA complementation groups | 0.024517971 | 0.612949276 | |
| Pyrin domain containing | 0.027673098 | 0.691827462 | |
| Scavenger receptors | 0.029771072 | 0.74427679 | |
| Blood group antigens | ACHE | 0.044335676 | 1 |
| Chemokine ligands | CXCL14 | 0.048458289 | 1 |
| PHD finger proteins | ING4 | 0.093676864 | 1 |
| Tripartite motif containing | TRIM45 | 0.098571875 | 1 |
| RNA binding motif containing | CNOT4 | 0.206994163 | 1 |
| Endogenous ligands | CXCL10 | 0.22069409 | 1 |
| Ankyrin repeat domain containing | NFKBIE | 0.231657695 | 1 |
| CD molecules | CD1D | 0.349347048 | 1 |
| Unknown | IFI27:RSAD2:IDO1: | NA | NA |
Figure 4Biological functions related to the IRGP signature. (A) The results of GO enrichment of the 13-IRGPs. (B) The KEGG pathway enrichment results of the 13 IRGPs.
Figure 5The relationships between risk scores and clinical features. (A) The difference distribution of the risk score according to T stage. (B) The difference distribution of the risk score according to N stage. (C) The difference distribution of the risk score according to tumor stage. (D) The difference distribution of the risk score according to age.
Figure 6The C-index results of the prognostic risk models.
Figure 7The nomogram of risk score and clinical features. (A) The nomogram constructed based on clinical features including T stage, N stage, M stage, age and the risk score. (B) The nomogram constructed by clinical features including tumor stage, age and the risk score.
Figure 8The forest plots of the risk score and clinical features. (A) The forest plots of the risk score and T stage, N stage, M stage and age. (B) The forest plots of the risk score, age and tumor stage.