| Literature DB >> 33200655 |
Weiguo Huang1, Jianhui Chen1, Wanqing Weng1, Yukai Xiang1, Hongqi Shi1, Yunfeng Shan1.
Abstract
Utilizing genomic data to predict cancer prognosis was insufficient. Proteomics can improve our understanding of the etiology and progression of cancer and improve the assessment of cancer prognosis. And the Clinical Proteomic Tumor Analysis Consortium (CPTAC) has generated extensive proteomics data of the vast majority of tumors. Based on CPTAC, we can perform a proteomic pan-carcinoma analysis. We collected the proteomics data and clinical features of cancer patients from CPTAC. Then, we screened 69 differentially expressed proteins (DEPs) with R software in five cancers: hepatocellular carcinoma (HCC), children's brain tumor tissue consortium (CBTTC), clear cell renal cell carcinoma (CCRC), lung adenocarcinoma (LUAD) and uterine corpus endometrial carcinoma (UCEC). GO and KEGG analysis were performed to clarify the function of these proteins. We also identified their interactions. The DEPs-based prognostic model for predicting over survival was identified by least absolute shrinkage and selection operator (LASSO)-Cox regression model in training cohort. Then, we used the time-dependent receiver operating characteristics analysis to evaluate the ability of the prognostic model to predict overall survival and validated it in validation cohort. The results showed that the DEPs-based prognostic model could accurately and effectively predict the survival rate of most cancers.Entities:
Keywords: Proteomics; biomarker; differentially expressed proteins; pan-cancer; prognosis
Year: 2020 PMID: 33200655 PMCID: PMC8291886 DOI: 10.1080/21655979.2020.1847398
Source DB: PubMed Journal: Bioengineered ISSN: 2165-5979 Impact factor: 3.269
Figure 1.Identification of DEPs in five cancers. DEPs were defined with P-value < 0.05 and |log2(Fold Change)|>1. (a) Volcano plots of proteins with normalized expression alteration in all five cancers; (b) Heatmap of the DEPs (n = 69) in all five cancers. DEPs, differentially expressed proteins
Figure 2.GO analysis and KEGG analysis of DEPs. (a) The functions of the 69 DEPs identified cover three main categories: BP, CC, MF; (b) based on KEGG pathway, 11 enriched pathways with lowest P-value were displayed; (c) (d) GO cluster diagram and GO chord diagram of the 69 DEPs. DEPs, differentially expressed proteins; BP, biological processes; CC, cellular contents; MF, molecular functions; GO, gene ontology
Figure 3.PPI network. (a) Interactions among 69 DEPs were detected after removing isolated proteins; (b) the number of interactions between each protein and other proteins. PPI, protein-protein interaction; DEPs, differentially expressed proteins
Figure 4.The survival-predictor model based on twenty-four-DEPs. (a) Univariate Cox analyses showed that 33 DEPs contributed to the OS in the training cohort; (b)(c) the LASSO regression model identified the 24 most accurate predictive DEPs in the training cohort; (d) (e) the expression relationship of the 24 DEPs was displayed. DEPs, differentially expressed proteins; OS, overall survival
The detailed information of differentially expressed proteins for constructing the prognostic signature
| Protein name | Gene name | β |
|---|---|---|
| alpha-aminoadipate aminotransferase | AADAT | −0.051658113 |
| Alcohol dehydrogenase 1 C | ADH1C | −0.165116243 |
| Aflatoxin B1 aldehyde reductase member 3 | AKR7A3 | 0.13190529 |
| Calmodulin-like protein 3 | CALML3 | 0.046728326 |
| Cyclin-dependent kinase 1 | CDK1 | 0.120550067 |
| Desmin | DES | 0.002519638 |
| EH domain-containing protein 3 | EHD3 | −0.388025694 |
| Beta-enolase | ENO3 | −0.26539137 |
| Fructose-1,6-bisphosphatase isozyme 2 | FBP2 | −0.245540744 |
| Glycerol-3-phosphate dehydrogenase [NAD(+)] | GPD1 | −0.041717955 |
| Interleukin-33 | IL33 | 0.1638571 |
| Lysyl oxidase homolog 2 | LOXL2 | 0.259559228 |
| Midkine | MDK | 0.153385186 |
| Proliferation marker protein Ki-67 | MKI67 | 0.30323553 |
| Prolyl 4-hydroxylase subunit alpha-1 | P4HA1 | 0.046806691 |
| Prolyl 4-hydroxylase subunit alpha-2 | P4HA2 | 0.144674736 |
| Phosphoglucomutase-like protein 5 | PGM5 | −0.07403554 |
| Phytanoyl-CoA dioxygenase domain-containing protein 1 | PHYHD1 | −0.02187378 |
| PI-PLC X domain-containing protein 3 | PLCXD3 | 0.121054348 |
| Perilipin-4 | PLIN4 | 0.216349151 |
| cAMP-dependent protein kinase type II-beta regulatory subunit | PRKAR2B | 0.063303645 |
| Peripherin | PRPH | 0.069267063 |
| 14-3-3 protein sigma | SFN | 0.034228846 |
| Sushi repeat-containing protein SRPX | SRPX | 0.077423785 |
Figure 5.Time-dependent ROC curves and the survival analysis for the DEPs-based classifiers for OS in the training cohort and the validation cohort. (a,d) Cancer patients were divided into two groups by the median of risk score in the training cohort: High risk and Low risk; (b) Kaplan-Meier Survival analysis results indicated that the two groups had significantly different survival rates (p = 2.309e−09); (c) tdROC were applied to assess predictive accuracy for overall survival; (d) according to the same cutoff point cancer patients were also divided into two groups in the validation cohort; (e) Kaplan-Meier Survival analysis results indicated that the two groups had significantly different survival rates in the validation cohort (p = 1.113e−04); (f) tdROC were applied to assess predictive accuracy for overall survival. DEPs, differentially expressed proteins; OS, overall survival; tdROC, Time-dependent ROC
Univariate and multivariate COX analyses of the DEPs-based classifier for OS
| Univariate analysis | Multivariate analysis | |||||
|---|---|---|---|---|---|---|
| Prognostic parameter | HR | 95% CI | P value | HR | 95% CI | P value |
| Age (> 65 vs. ≤ 65) | 0.724 | 0.218–2.409 | 0.599 | |||
| Gender (male vs. female) | 1.099 | 0.297–4.063 | 0.887 | |||
| Grade (G3&4 vs. G1&2) | 1.901 | 0.603–5.993 | 0.273 | |||
| Tumor stage | 2.031 | 1.224–3.369 | ||||
| T classification | 5.479 | 1.483–20.240 | ||||
| N classification | 1.269 | 0.132–12.221 | 0.837 | 6.593 | 1.670–26.026 | |
| DEPs-based classifier | 4.047 | 2.081–7.871 | 4.438 | 1.704–11.561 | ||
| Age (> 65 vs. ≤ 65) | 0.589 | 0.233–1.488 | 0.263 | |||
| Gender (male vs. female) | 0.843 | 0.420–1.692 | 0.630 | |||
| Number of Tumors | 1.042 | 0.467–2.323 | 0.920 | |||
| Tumor thrombus | 2.118 | 1.157–3.879 | 0.780 | 0.399–1.525 | 0.468 | |
| DEPs-based classifier | 3.892 | 2.563–5.909 | 3.114 | 1.911–5.073 | ||
| Age (> 65 vs. ≤ 65) | 6.184 | 0.803–47.630 | 0.080 | |||
| Gender (male vs. female) | 1.176 | 0.394–3.509 | 0.771 | |||
| Grade (G3&4 vs. G1&2) | 0.259 | 0.026–2.540 | 0.246 | |||
| Tumor stage | 7.518 | 2.428–23.283 | 2.721 | 0.347–21.351 | 0.341 | |
| T classification | 4.499 | 1.357–14.913 | 2.571 | 0.730–9.057 | 0.142 | |
| N classification | 5.400 | 1.658–17.585 | 1.973 | 0.257–15.160 | 0.514 | |
| Smoking history | 2.310 | 0.754–7.070 | 0.143 | |||
| DEPs-based classifier | 3.867 | 1.573–9.502 | 2.666 | 1.123–6.331 | 0.026 | |
| Age (> 65 vs. ≤ 65) | 131.682 | 0.007–2,628,475 | 0.334 | |||
| Grade (G3&4 vs. G1&2) | 3.042 | 0.190–48.629 | 0.432 | |||
| Tumor stage | 5355.564 | 0–5.000E+18 | 0.626 | |||
| T classification | 18.422 | 1.654–205.237 | 25,773.5 | 0–4.44094E+23 | 0.424 | |
| N classification | 0.039 | 0–1,243,745,686 | 0.793 | 0.000 | 0.000–1,102,092.5 | 0.424 |
| DEPs-based classifier | 13.430 | 1.983–183.437 | 216.159 | 0.003–14,902,987 | 0.344 | |
| Age (> 65 vs. ≤ 65) | 0.945 | 0.895–0.998 | 0.044 | 0.971 | 0.926–1.019 | 0.228 |
| Gender (male vs. female) | 0.708 | 0.387–1.293 | 0.261 | |||
| Surgery | 0.102 | 0.047–0.222 | 0.165 | 0.070–0.387 | ||
| DEPs-based classifier | 13.430 | 1.983–183.437 | 2.173 | 1.229–3.843 | ||
HR, Hazard ratio; CI, confidence interval; DEPs, differentially expressed proteins; hepatocellular carcinoma, HCC; children’s brain tumor tissue consortium, CBTTC; clear cell renal cell carcinoma, CCRC; lung adenocarcinoma, LUAD; uterine corpus endometrial carcinoma, UCEC.
Correlations between risk score of the DEPs-based classifier with overall survival and clinicopathological characteristics in five types of cancers
| Clinicopathological variables | Number of patients | High Risk | Low Risk | P value |
|---|---|---|---|---|
| CCRC | ||||
| <65 (n, %) | 67 (59.3%) | 34 (30.1%) | 33 (29.2%) | |
| ≥65 (n, %) | 46 (40.7%) | 19 (16.8%) | 27 (23.9%) | 0.323 |
| Gender | ||||
| Male (n, %) | 30 (26.5%) | 11 (9.7%) | 19 (16.8%) | |
| Female (n, %) | 83 (73.5%) | 42 (37.2%) | 41 (36.3%) | 0.190 |
| Histologic Grade | ||||
| G1+ G2 (n, %) | 69 (61.1%) | 27 (23.9%) | 42 (37.2%) | |
| G3+ G4 (n, %) | 44 (38.9%) | 26 (23.0%) | 18 (15.9%) | 0.038 |
| TNM staging system | ||||
| T1+ T2 (n, %) | 72 (63.7%) | 29 (25.7%) | 43 (38.1%) | |
| T3+ T4 (n, %) | 41 (36.3%) | 24 (21.2%) | 17 (15.0%) | 0.061 |
| N0 (n, %) | 14 (77.8%) | 9 (50.0%) | 5 (27.8%) | |
| N1 (n, %) | 4 (22.2%) | 3(16.7%) | 1(5.6%) | 0.688 |
| HCC | ||||
| Age | ||||
| <65 (n, %) | 120 (85.1%) | 85 (60.3%) | 35 (24.8%) | |
| ≥65 (n, %) | 21 (14.9%) | 16 (11.3%) | 5 (3.5%) | 0.615 |
| Gender | ||||
| Male (n, %) | 26 (18.4%) | 19 (13.5%) | 7 (5.0%) | |
| Female (n, %) | 115 (81.6%) | 82 (58.2%) | 33 (23.4%) | 0.856 |
| Number of Tumors | ||||
| Single (n, %) | 121(85.8%) | 87 (61.7%) | 34 (24.1%) | |
| Couple (n, %) | 20 (14.2%) | 14 (9.9%) | 6 (4.3%) | 0.861 |
| LUAD | ||||
| Age | ||||
| <65 (n, %) | 59 (57.8%) | 29 (28.4%) | 30 (29.4%) | |
| ≥65 (n, %) | 43 (42.2%) | 18 (17.6%) | 25 (24.5%) | 0.466 |
| Gender | ||||
| Male (n, %) | 32 (31.4%) | 12 (11.8%) | 20 (19.6%) | |
| Female (n, %) | 70 (68.6%) | 35 (34.3%) | 35 (34.3%) | 0.240 |
| Histologic Grade | ||||
| G1+ G2 (n, %) | 62 (63.9%) | 26 (26.8%) | 36 (37.1%) | |
| G3+ G4 (n, %) | 35 (36.1%) | 18 (18.6%) | 17 (17.5%) | 0.367 |
| TNM staging system | ||||
| T1+ T2 (n, %) | 91 (89.2%) | 39 (38.2%) | 52 (51.0%) | |
| T3+ T4 (n, %) | 11 (10.8%) | 8 (7.8%) | 3 (2.9%) | 0.060 |
| N0 (n, %) | 70 (68.6%) | 29 (28.4%) | 41 (40.2%) | |
| N1 (n, %) | 32 (31.4%) | 18 (17.6%) | 14 (13.7%) | 0.163 |
| M0 (n, %) | 85 (97.7%) | 42 (48.3%) | 43 (49.4%) | |
| M1 (n, %) | 2 (2.3%) | 0 (.0%) | 2 (2.3%) | 0.167 |
| Pathological stage | ||||
| I+ II (n, %) | 81 (79.4%) | 34 (33.3%) | 47 (46.1%) | |
| III+IV (n, %) | 21 (20.6%) | 13 (12.7%) | 8 (7.8%) | 0.103 |
| Smoking history | ||||
| Present (n, %) | 56 (56.6%) | 26 (26.3%) | 30 (30.3%) | |
| Absent (n, %) | 43 (43.4%) | 20 (20.2%) | 23 (23.2%) | 0.993 |
| UCEC | ||||
| Age | ||||
| <65 (n, %) | 56 (56.6%) | 16 (16.2%) | 40 (40.4%) | |
| ≥65 (n, %) | 43 (43.4%) | 8 (8.1%) | 35 (35.4%) | 0.251 |
| Histologic Grade | ||||
| G1+ G2 (n, %) | 73 (75.3%) | 13 (13.4%) | 60 (61.9%) | |
| G3+ G4 (n, %) | 24 (24.7%) | 11 (11.3%) | 13 (13.4%) | 0.006 |
| TNM staging system | ||||
| T1+ T2 (n, %) | 88 (88.9%) | 20 (20.2%) | 68 (68.7%) | |
| T3+ T4 (n, %) | 11 (11.1%) | 4 (4.0%) | 7 (7.1%) | 0.320 |
| N0 (n, %) | 47 (85.5%) | 9 (16.4%) | 38 (69.1%) | |
| N1 (n, %) | 8 (14.5%) | 4 (7.3%) | 4 (7.3%) | 0.058 |
| M0 (n, %) | 71 (97.3%) | 17 (23.3%) | 54 (74.0%) | |
| M1 (n, %) | 2 (2.7%) | 1 (1.44%) | 1 (1.4%) | 0.399 |
| FIGO stage | ||||
| I+ II (n, %) | 82 (82.8%) | 18 (18.2%) | 64 (64.6%) | |
| III+IV (n, %) | 17 (17.2%) | 6 (6.1%) | 11 (11.1%) | 0.243 |
| CBTTC | ||||
| Age | ||||
| <65 (n, %) | 195 (99.5%) | 100 (51.0%) | 95 (48.5%) | |
| ≥65 (n, %) | 1 (0.5%) | 1 (0.5%) | 0 (0%) | 0.331 |
| Gender | ||||
| Male (n, %) | 86 (43.9%) | 44 (22.4%) | 42 (21.4%) | |
| Female (n, %) | 110 (56.1%) | 57 (29.1%) | 53 (27.0%) | 0.927 |
| Surgery | ||||
| Present (n, %) | 163 (93.1%) | 82 (46.7%) | 81 (46.3%) | |
| Absent (n, %) | 12 (6.9%) | 10 (5.7%) | 2 (1.1%) | 0.061 |
Hepatocellular carcinoma, HCC; children’s brain tumor tissue consortium, CBTTC; clear cell renal cell carcinoma, CCRC; lung adenocarcinoma, LUAD; uterine corpus endometrial carcinoma, UCEC.