| Literature DB >> 32938429 |
Abstract
BACKGROUND: There is growing evidence that pseudogenes may serve as prognostic biomarkers in several cancers. The present study was designed to develop and validate an accurate and robust pseudogene pairs-based signature for the prognosis of hepatocellular carcinoma (HCC).Entities:
Keywords: Hepatocellular carcinoma; Pseudogene pairs; Signature; Survival
Mesh:
Year: 2020 PMID: 32938429 PMCID: PMC7493157 DOI: 10.1186/s12885-020-07391-2
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Clinical data of patients in the TCGA and the ICGC validation cohort
| Variables | Subgroups | TCGA ( | ICGC( |
|---|---|---|---|
| Age | < 60 | 169 | 44 |
| > = 60 | 201 | 187 | |
| Sex | Male | 249 | 179 |
| Female | 121 | 62 | |
| Stage | I | 171 | 36 |
| II | 85 | 104 | |
| III | 85 | 72 | |
| IV | 5 | 19 | |
| NA | 24 | 0 | |
| Grade | I | 55 | – |
| II | 177 | – | |
| III | 121 | – | |
| IV | 12 | – | |
| NA | 5 | – | |
| Survival status | Dead | 130 | 42 |
| Living | 240 | 189 | |
| Vascular invasion | Positive | 108 | – |
| Negative | 206 | – | |
| NA | 56 | – | |
| Family history | Positive | 112 | 73 |
| Negative | 207 | 143 | |
| NA | 51 | 15 | |
| Prior malignancy | Positive | – | 29 |
| Negative | – | 202 | |
| NA | – | 0 |
Fig. 1Predictor selection by LASSO algorithm. a: Parameter filter by LASSO regress algorithm used five-fold cross-validation by through minimum criteria; b: Optimal feature selection based on LASSO coefficient profile plot of 19 pseudogene pairs
Information on the 19 pseudogene pairs and the coefficient obtained from the least absolute shrinkage and selection operator (LASSO) regression analysis
| Genepair1 | Full name | Genepair2 | Full name | Coef |
|---|---|---|---|---|
| ABCC6P2 | ATP binding cassette subfamily C member 6 pseudogene 2 | DSTNP2 | DSTN pseudogene 2 | −0.133577486 |
| ANXA2P2 | annexin A2 pseudogene 2 | AZGP1P1 | AZGP1 pseudogene 1 | 0.06815618 |
| ANXA2P2 | annexin A2 pseudogene 2 | HLA-J | major histocompatibility complex, class I, J | 0.337854755 |
| AQP7P1 | aquaporin 7 pseudogene 1 | HLA-J | major histocompatibility complex, class I, J | 0.433464122 |
| AQP7P1 | aquaporin 7 pseudogene 1 | MT1DP | metallothionein 1D, pseudogene | 0.220401079 |
| AZGP1P1 | AZGP1 pseudogene 1 | CYP21A1P | cytochrome P450 family 21 subfamily A member 1, pseudogene | −0.171662304 |
| AZGP1P1 | AZGP1 pseudogene 1 | GGTA1P | glycoprotein alpha-galactosyltransferase 1, pseudogene | −0.330772998 |
| C3P1 | complement component 3 precursor pseudogene | MT1L | metallothionein 1 L, pseudogene | −0.211202632 |
| CA5BP1 | carbonic anhydrase 5B pseudogene 1 | LPAL2 | lipoprotein(a) like 2, pseudogene | 0.140891921 |
| DSTNP2 | DSTN pseudogene 2 | PLGLA | plasminogen like A | 0.139199981 |
| DSTNP2 | DSTN pseudogene 2 | WASH3P | WASP family homolog 3, pseudogene | 0.332685477 |
| HLA-J | major histocompatibility complex, class I, J | MSTO2P | misato family member 2, pseudogene | −0.356768111 |
| HLA-J | major histocompatibility complex, class I, J | RP9P | RP9 pseudogene | −0.035991571 |
| HSPA7 | heat shock protein family A (Hsp70) member 7 (pseudogene) | NAPSB | napsin B aspartic peptidase, pseudogene | 0.384325838 |
| LPAL2 | lipoprotein(a) like 2, pseudogene | PLGLA | plasminogen like A | 0.092279424 |
| NAPSB | napsin B aspartic peptidase, pseudogene | NSUN5P1 | NSUN5 pseudogene 1 | −0.339252375 |
| NUDT16P1 | nudix hydrolase 16 pseudogene 1 | PLGLA | plasminogen like A | 0.20989673 |
| PLGLA | plasminogen like A | RP9P | RP9 pseudogene | −0.137033874 |
| RP9P | RP9 pseudogene | WASH3P | WASP family homolog 3, pseudogene | 0.424813675 |
Fig. 2Association between the pseudogene pair-based signature risk score and clinical parameters in the TCGA cohort
Fig. 3Time-dependent ROC curve analysis of the risk score. A cutoff point of risk score was identified as 0.509 to divide patients into two distinct groups in the TCGA cohort
Fig. 4Kaplan-Meier survival curves for patients with HCC in two distinct groups. Survival cures in the TCGA cohort (a), ICGC dataset (b), and subgroup analysis with respect to age (c, d), gender (e, f), histological grade (g, h), American Joint Committee on Cancer stage (i, j), family history (k, l), and vascular invasion (m, n)
Univariate and multivariate analyses identified independent prognostic factors for overall survival of HCC in the TCGA and the ICGC cohorts
| Univariate analysis | Multivariate analysis | |||||
|---|---|---|---|---|---|---|
| HR | 95%CI | HR | 95%CI | |||
| Age | 1.01 | 0.996–1.025 | 0.174 | 1.01 | 0.996–1.024 | 0.168 |
| Sex | 0.776 | 0.531–1.132 | 0.188 | 0.912 | 0.614–1.353 | 0.646 |
| Grade | 1.133 | 0.881–1.456 | 0.33 | 0.927 | 0.706–1.219 | 0.588 |
| Stage | 1.68 | 1.369–2.062 | < 0.0001 | 1.33 | 1.070–1.654 | 0.01 |
| riskScore | 3.583 | 2.726–4.709 | < 0.0001 | 3.416 | 2.551–4.576 | < 0.0001 |
| Sex | 0.515 | 0.270–0.982 | 0.044 | 0.42 | 0.215–0.819 | 0.011 |
| Age | 0.998 | 0.966–1.032 | 0.917 | 0.989 | 0.955–1.025 | 0.558 |
| Stage | 2.238 | 1.532–3.269 | < 0.0001 | 2.16 | 1.459–3.198 | 0.0001 |
| Prior malignancy | 1.658 | 0.692–3.975 | 0.257 | 2.287 | 0.912–5.734 | 0.078 |
| Cancer history | 0.794 | 0.404–1.563 | 0.505 | 0.706 | 0.351–1.421 | 0.329 |
| riskScore | 2.337 | 1.490–3.664 | 0.0002 | 1.902 | 1.201–3.014 | 0.006 |
Fig. 5The ROC curve for 1-, 3- and 5-year overall survival prediction using the pseudogene pair-based prognostic. a TCGA cohort; b ICGC cohort
Fig. 6Diagnosis value of pseudogene pair-based signature risk score in HCC and normal controls. ROC in normal tissues and HCC samples in the TCGA cohort (a) and ICGC cohort (b). ROC for stage I samples and normal tissues in the TCGA cohort (c) and ICGC cohort (d)
Fig. 7Comparison of C-index among the novel model, previously established prognostic signatures, and clinical features (age, sex, stage, grade, and their combination)
GO functional and KEGG pathway enrichment analysis of pseudogenes-related protein-coding genes
| ID | Description | P adjust | |
|---|---|---|---|
| GO:0004896 | cytokine receptor activity | 1.64E-11 | 6.31E-09 |
| GO:0001637 | G protein-coupled chemoattractant receptor activity | 4.05E-08 | 3.23E-06 |
| GO:0004950 | chemokine receptor activity | 4.05E-08 | 3.23E-06 |
| GO:0019955 | cytokine binding | 4.21E-08 | 3.23E-06 |
| GO:0016493 | C-C chemokine receptor activity | 1.65E-07 | 9.05E-06 |
| GO:0019957 | C-C chemokine binding | 2.54E-07 | 1.30E-05 |
| GO:0019956 | chemokine binding | 3.92E-07 | 1.88E-05 |
| GO:0023023 | MHC protein complex binding | 6.72E-07 | 3.04E-05 |
| GO:0042287 | MHC protein binding | 1.33E-06 | 5.67E-05 |
| GO:0032395 | MHC class II receptor activity | 2.54E-05 | 0.001027582 |
| GO:0030246 | carbohydrate binding | 0.000143695 | 0.004598232 |
| GO:0001608 | G protein-coupled nucleotide receptor activity | 0.000175825 | 0.005193614 |
| GO:0045028 | G protein-coupled purinergic nucleotide receptor activity | 0.000175825 | 0.005193614 |
| GO:0030695 | GTPase regulator activity | 0.000508907 | 0.012607749 |
| KEGG:hsa04662 | B cell receptor signaling pathway | 4.67E-12 | 1.85E-10 |
| KEGG:hsa04062 | Chemokine signaling pathway | 2.79E-09 | 4.07E-08 |
| KEGG:hsa04660 | T cell receptor signaling pathway | 1.36E-07 | 1.45E-06 |
| KEGG:hsa04650 | Natural killer cell mediated cytotoxicity | 3.92E-07 | 4.02E-06 |
| KEGG:hsa04060 | Cytokine-cytokine receptor interaction | 1.07E-06 | 1.03E-05 |
| KEGG:hsa04064 | NF-kappa B signaling pathway | 0.000577911 | 0.00390442 |
| KEGG:hsa05235 | PD-L1 expression and PD-1 checkpoint pathway in cancer | 0.001387015 | 0.008934958 |
| KEGG:hsa05231 | Choline metabolism in cancer | 0.008939182 | 0.047618336 |