| Literature DB >> 36123629 |
Peter Larsson1,2, Daniella Pettersson3,4, Hanna Engqvist3,4, Elisabeth Werner Rönnerman3,4,5, Eva Forssell-Aronsson4,6,7, Anikó Kovács5, Per Karlsson3,8, Khalil Helou3,4, Toshima Z Parris3,4.
Abstract
BACKGROUND: The human proteasome gene family (PSM) consists of 49 genes that play a crucial role in cancer proteostasis. However, little is known about the effect of PSM gene expression and genetic alterations on clinical outcome in different cancer forms.Entities:
Keywords: Cancer; Gene expression profiling; Prognosis; Prognostic biomarkers; Proteasome gene family
Mesh:
Substances:
Year: 2022 PMID: 36123629 PMCID: PMC9484138 DOI: 10.1186/s12885-022-10079-4
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.638
The 49 human proteasome gene family members (proteasome subunits and proteasome-interacting proteins)
| Proteasome 20S subunit alpha 1 | α6 | 11p15.2 | HC2, NU, PROS30, PSC2 | P25786 | 263 | 29,556 | |
| Proteasome 20S subunit alpha 2 | α2 | 7p14.1 | HC3, PSC3 | P25787 | 234 | 25,899 | |
| Proteasome 20S subunit alpha 3 | α7 | 14q23.1 | HC8, PSC8 | P25788 | 255 | 28,433 | |
| Proteasome 20S subunit alpha 4 | α3 | 15q25.1 | HC9, PSC9 | P25789 | 261 | 29,484 | |
| Proteasome 20S subunit alpha 5 | α5 | 1p13.3 | ZETA | P28066 | 241 | 26,411 | |
| Proteasome 20S subunit alpha 6 | α1 | 14q13.2 | PROS27 | P60900 | 246 | 27,399 | |
| Proteasome 20S subunit alpha 7 | α4 | 20q13.33 | HSPC | O14818 | 248 | 27,887 | |
| Proteasome 20S subunit alpha 8 | - | 18q11.2 | PSMA7L | Q8TAA3 | 256 | 28,530 | |
| Proteasome 20S subunit beta 1 | β6 | 6q27 | PSC5 | P20618 | 241 | 26,489 | |
| Proteasome 20S subunit beta 2 | β4 | 1p34.3 | HC7-I | P49721 | 201 | 22,836 | |
| Proteasome 20S subunit beta 3 | β3 | 17q12 | HC10-II, MGC4147 | P49720 | 205 | 22,949 | |
| Proteasome 20S subunit beta 4 | β7 | 1q21.3 | PROS26 | P28070 | 264 | 29,204 | |
| Proteasome 20S subunit beta 5 | β5 | 14q11.2 | LMPX, MB1, X | P28074 | 263 | 28,480 | |
| Proteasome 20S subunit beta 6 | β1 | 17p13.2 | LMPY, Y | P28072 | 239 | 25,358 | |
| Proteasome 20S subunit beta 7 | β2 | 9q33.3 | Z | Q99436 | 277 | 29,965 | |
| Proteasome 20S subunit beta 8 | β5i | 6p21.32 | LMP7, PSMB5i, RING10, Y2 | P28062 | 276 | 30,354 | |
| Proteasome 20S subunit beta 9 | β1i | 6p21.32 | LMP2, PSMB6i, RING12 | P28065 | 219 | 23,264 | |
| Proteasome 20S subunit beta 10 | β2i | 16q22.1 | LMP10, MECL1 | P40306 | 273 | 28,936 | |
| Proteasome 20S subunit beta 11 | β5t | 14q11.2 | A5LHX3 | 300 | 32,530 | ||
| Proteasome 26S subunit, ATPase 1 | Rpt2 | 14q32.11 | S4, p56 | P62191 | 440 | 49,185 | |
| Proteasome 26S subunit, ATPase 2 | Rpt1 | 7q22.1 | MSS1 | P35998 | 433 | 48,634 | |
| PSMC3 Interacting Protein | 17q21.2 | HOP2, TBPIP | Q9P2W1 | 217 | 24,906 | ||
| Proteasome 26S subunit, ATPase 3 | Rpt5 | 11p11.2 | TBP1 | P17980 | 439 | 49,204 | |
| Proteasome 26S subunit, ATPase 4 | Rpt3 | 19q13.2 | TBP-7 | P43686 | 418 | 47,366 | |
| Proteasome 26S subunit, ATPase 5 | Rpt6 | 17q23.3 | SUG1 | P62195 | 406 | 45,626 | |
| Proteasome 26S subunit, ATPase 6 | Rpt4 | 14q22.1 | SUG2 | P62333 | 389 | 44,173 | |
| Proteasome 26S subunit, non-ATPase 1 | Rpn2 | 2q37.1 | S1, P112, Rpn2 | Q99460 | 953 | 105,836 | |
| Proteasome 26S subunit, non-ATPase 2 | Rpn1 | 3q27.1 | TRAP2 | Q13200 | 908 | 100,200 | |
| Proteasome 26S subunit, non-ATPase 3 | Rpn3 | 17q21.1 | S3, P58, Rpn3 | O43242 | 534 | 60,978 | |
| Proteasome 26S subunit, non-ATPase 4 | Rpn10 | 1q21.3 | MCB1 | P55036 | 377 | 40,737 | |
| Proteasome 26S subunit, non-ATPase 5 | - | 9q33.2 | KIAA0072 | Q16401 | 504 | 56,196 | |
| Proteasome 26S subunit, non-ATPase 6 | Rpn7 | 3p14.1 | KIAA0107, PFAAP4 | Q15008 | 389 | 45,531 | |
| Proteasome 26S subunit, non-ATPase 7 | Rpn8 | 16q23.1 | MOV34L | P51665 | 324 | 37,025 | |
| Proteasome 26S subunit, non-ATPase 8 | Rpn12 | 19q13.2 | S14, Nin1p, p31, HIP6, HYPF, Rpn12 | P48556 | 350 | 39,612 | |
| Proteasome 26S subunit, non-ATPase 9 | - | 12q24.31 | p27, Rpn4 | O00233 | 223 | 24,682 | |
| Proteasome 26S subunit, non-ATPase 10 | Gankyrin | Xq22.3 | p28 | O75832 | 226 | 24,428 | |
| Proteasome 26S subunit, non-ATPase 11 | Rpn6 | 17q11.2 | S9, p44.5, MGC3844, Rpn6 | O00231 | 422 | 47,464 | |
| Proteasome 26S subunit, non-ATPase 12 | Rpn5 | 17q24.2 | p55, Rpn5 | O00232 | 456 | 52,904 | |
| Proteasome 26S subunit, non-ATPase 13 | Rpn9 | 11p15.5 | p40.5, Rpn9 | Q9UNM6 | 376 | 42,945 | |
| Proteasome 26S subunit, non-ATPase 14 | Rpn11 | 2q24.2 | POH1 | O00487 | 310 | 34,577 | |
| Proteasome activator subunit 1 | PA28α | 14q12 | IFI5111 | Q06323 | 249 | 28,723 | |
| Proteasome activator subunit 2 | PA28β | 14q12 | PA28beta | Q9UL46 | 239 | 27,402 | |
| Proteasome activator subunit 3 | PA28γ | 17q21.31 | Ki, PA28-gamma, REG-GAMMA, PA28G | P61289 | 254 | 29,506 | |
| Proteasome activator subunit 4 | PA200 | 2p16.2 | KIAA0077 | Q14997 | 1,843 | 211,334 | |
| Proteasome inhibitor subunit 1 | PI31 | 20p13 | PI31 | Q92530 | 271 | 29,817 | |
| Proteasome Assembly Chaperone 1 | 21q22.2 | C21LRP, DSCR2, PAC1 | O95456 | 288 | 32,854 | ||
| Proteasome Assembly Chaperone 2 | 18p11.21 | HCCA3, PAC2, TNFSF5IP1 | Q969U7 | 264 | 29,396 | ||
| Proteasome Assembly Chaperone 3 | 7p22.3 | C7orf48, PAC3 | Q9BT73 | 122 | 13,104 | ||
| Proteasome Assembly Chaperone 4 | 6p25.2 | C6orf86, PAC4 | Q5JS54 | 123 | 13,775 | ||
Data obtained from a Gomes AV et al., b Genome Reference Consortium Human GRCh38.p12/hg38, c UniProtKB
TCGA cancer types and corresponding pan-cancer organ systems
| Disease name and pan-organ system | Cohort | RNA-seq dataa | Survival analysisb | cBioPortalc | KM plotterd | |
|---|---|---|---|---|---|---|
| Glioblastoma multiforme | GBM | 166 | 5 | 167 | 592 | |
| Brain lower grade glioma | LGG | 530 | 0 | 528 | 514 | |
| Adrenocortical carcinoma | ACC | 79 | 0 | 79 | 92 | |
| Thyroid carcinoma | THCA | 496 | 58 | 510 | 500 | |
| Cholangiocarcinoma | CHOL | 36 | 9 | 36 | 36 | |
| Colon adenocarcinoma | COAD | 191 | 0 | 469 | ||
| Esophageal carcinoma | ESCAe | 185 | 11 | 162 | 182 | |
| Liver hepatocellular carcinoma | LIHC | 147 | 50 | 374 | 372 | 364 |
| Pancreatic adenocarcinoma | PAAD | 56 | 0 | 178 | 184 | |
| Rectum adenocarcinoma | READ | 72 | 0 | 166 | ||
| Colorectal adenocarcinoma/Rectum adenocarcinoma | COADREADf | 0 | 594 | |||
| Stomach adenocarcinoma | STAD | 415 | 35 | 375 | 440 | 875 |
| Breast invasive carcinoma | BRCA | 1026 | 108 | 1103 | 1084 | 1879 |
| Cervical and endocervical cancers | CESC | 159 | 0 | 306 | 297 | |
| Ovarian serous cystadenocarcinoma | OV | 265 | 0 | 379 | 585 | 1656 |
| Uterine corpus endometrial carcinoma | UCEC | 369 | 0 | 548 | 529 | |
| Head and neck squamous cell carcinoma | HNSC | 425 | 42 | 502 | 523 | |
| Lymphoid neoplasm diffuse large B-cell lymphoma | DLBC | 48 | 0 | 48 | 48 | |
| Acute myeloid leukemia | LAML | 173 | 0 | 151 | 200 | |
| Thymoma | THYM | 120 | 0 | 119 | 123 | |
| Skin cutaneous melanoma | SKCM | 472 | 0 | 471 | 448 | |
| Uveal melanoma | UVM | 80 | 0 | 80 | 80 | |
| Pheochromocytoma and paraganglioma | PCPG | 184 | 3 | 183 | 178 | |
| Sarcoma | SARC | 105 | 0 | 263 | 255 | |
| Uterine carcinosarcoma | UCS | 57 | 0 | 56 | 57 | |
| Lung adenocarcinoma | LUAD | 490 | 58 | 526 | 566 | 1925 |
| Lung squamous cell carcinoma | LUSC | 482 | 50 | 501 | 487 | |
| Mesothelioma | MESO | 87 | 0 | 86 | 87 | |
| Bladder urothelial carcinoma | BLCA | 223 | 19 | 411 | 411 | |
| Kidney chromophobe | KICH | 66 | 25 | 65 | 65 | |
| Kidney renal clear cell carcinoma | KIRC | 507 | 72 | 535 | 512 | |
| Kidney renal papillary cell carcinoma | KIRP | 161 | 30 | 289 | 283 | |
| Prostate adenocarcinoma | PRAD | 498 | 52 | 499 | 494 | |
| Testicular germ cell tumors | TGCT | 156 | 0 | 139 | 149 | |
| Total | 8526 | 627 | 10,304 | 10,967 | 6699 | |
a UNC RNASeqV2 level 3 expression (normalized RSEM) data were retrieved from Broad GDAC Firehose (https://gdac.broadinstitute.org/)
b Survival analysis was performed using the dataset https://gdc.cancer.gov/about-data/publications/PanCan-Clinical-2018
c Mutational profiling data (mutated genes, CNA genes, and fusion genes) was retrieved from cBioPortal for Cancer Genomics, http://www.cbioportal.org/study/summary?id=laml_tcga_pan_can_atlas_2018%2Cacc_tcga_pan_can_atlas_2018%2Cblca_tcga_pan_can_atlas_2018%2Clgg_tcga_pan_can_atlas_2018%2Cbrca_tcga_pan_can_atlas_2018%2Ccesc_tcga_pan_can_atlas_2018%2Cchol_tcga_pan_can_atlas_2018%2Ccoadread_tcga_pan_can_atlas_2018%2Cdlbc_tcga_pan_can_atlas_2018%2Cesca_tcga_pan_can_atlas_2018%2Cgbm_tcga_pan_can_atlas_2018%2Chnsc_tcga_pan_can_atlas_2018%2Ckich_tcga_pan_can_atlas_2018%2Ckirc_tcga_pan_can_atlas_2018%2Ckirp_tcga_pan_can_atlas_2018%2Clihc_tcga_pan_can_atlas_2018%2Cluad_tcga_pan_can_atlas_2018%2Clusc_tcga_pan_can_atlas_2018%2Cmeso_tcga_pan_can_atlas_2018%2Cov_tcga_pan_can_atlas_2018%2Cpaad_tcga_pan_can_atlas_2018%2Cpcpg_tcga_pan_can_atlas_2018%2Cprad_tcga_pan_can_atlas_2018%2Csarc_tcga_pan_can_atlas_2018%2Cskcm_tcga_pan_can_atlas_2018%2Cstad_tcga_pan_can_atlas_2018%2Ctgct_tcga_pan_can_atlas_2018%2Cthym_tcga_pan_can_atlas_2018%2Cthca_tcga_pan_can_atlas_2018%2Cucs_tcga_pan_can_atlas_2018%2Cucec_tcga_pan_can_atlas_2018%2Cuvm_tcga_pan_can_atlas_2018
d Survival analysis using KM plotter, https://kmplot.com/analysis/
e Esophageal adenocarcinoma and Esophageal squamous carcinoma was merged into one as Esophageal carcinoma
f COAD and READ was merged in cBioPortal dataset
Fig. 1Flowchart depicting the study design and workflow. A Genomic and transcriptomic data were collected from multiple sources and validated with a breast cancer cohort and KM plotter. B Both interactive tools and collected data were used to determine genomic alterations and its effect on gene expression. Furthermore, differential expression between cancer and normal tissue, co-expressed genes was determined, and how this affects cancer patient survival. C We used the statistical tool R to perform statistical calculations and to generate figures
Fig. 2Bar charts depicting alteration frequency for the 49 PSM genes by cancer type using the interactive web-based online tool cBioPortal (cbioportal.org). A DNA amplification was shown to be prevalent in most cancer types, with ESCA and THCA showing the highest och lowest alteration frequencies, respectively. Box plots visualizing DNA amplification of (B) PSMB3, (C) PSMB4, (D) PSMD4 and their effect on expression (RSEM). Wilcoxon test was used to calculate statistical significance (Benjamini–Hochberg adjusted p-values), ns = not significant (P ≥ 0.05); *P < 0.05; **P ≤ 0.01; ***P ≤ 0.001; ****P ≤ 0.0001. E PSME4 gene was the most mutated of all PSM genes. Most PSME4 mutations was found in the UCEC cancer type, where missense mutations were prevalent. F Beeswarm plot visualizing copy number alterations (CNA) and other types of mutations, and their effect on expression was generated in cBioPortal. Deep deletions in PSME4 resulted in significantly lower expression. G Lollipop plot depicting the number of mutations across the PSME4 gene. Missense mutations were prevalent (243 of 312 mutations), with a domain with unknown function containing 14 mutations (10 frameshift deletions in T1805Pfs*69, three frameshift insertions in T1805Nfs*11, and one missense in T1805P)
Fig. 3Human proteasome genes frequently displayed overexpression in cancer compared with normal tissue. Heatmap showing relative log2 RSEM gene expression (cancer vs mean normal samples) for the 49 PSM genes in 5,507 TCGA cancer samples representing 16 pan-cancer diseases. Hierarchical clustering was performed with the pheatmap R package (version 1.0.12) using the Manhattan distance metric and Ward’s minimum variance method (Ward.D2)
Fig. 4Differentially expressed PSMs between 16 cancer types and corresponding normal tissue. A Bar chart visualizing the number of differentially expressed PSM genes between cancer and normal tissue. BRCA and LUSC showed the highest number of cancer-related PSMs (n = 45), whereas only 17 differentially expressed PSMs were identified in PCPG. B Bar chart depicting differential PSM gene expression patterns in various cancer types. Overexpression strongly dominated across all cancer types. C-D Box plot depicting differentially expressed PSMs in cancer and normal tissue. PSMB11 was found to be differentially expressed in 2/16 cancer types, while PSME3 was differentially expressed in all except one of the 16 cancer types. The Wilcoxon test was used to calculate statistical significance (Benjamini–Hochberg adjusted p-values) differences in expression (RSEM) between cancer and normal tissue. ns = not significant (P > 0.05); *P < 0.05; **P ≤ 0.01; ***P ≤ 0.001; ****P ≤ 0.0001
Fig. 5Pairwise Pearson correlation between PSM gene expression in 33 pan-cancer diseases. Correlation matrices for compiled gene expression patterns for (A) the 33 pan-cancer diseases and (B) BRCA, with genes ordered using hierarchical clustering with Ward’s minimum variance (Ward.D2). Red and blue dots represent negative and positive correlation patterns, respectively. The strength of color and circle size defines correlation pattern between gene pairs using correlation coefficients (P < 0.05); blank squares were not statistically significant (P > 0.05). PSM genes showing recurrent positive correlation are outlined in red
Fig. 6The prognostic relevance of PSM gene expression in different cancer types using overall survival (OS) and progression-free interval (PFI) as clinical endpoints in multivariable Cox regression analysis (adjusted for age and/or tumor grade). A-B Dot plots displaying the –log10(p-value) for the multivariable Cox regression analysis between PSM gene expression and OS (A) and PFI (B). Blue dots indicate a hazardous role for PSM gene expression, while red dots indicate a protective role. NS = not significant (P > 0.05). Dot sizes denote –log10(p-value); P < 0.001 is shown as –log10(p-value) = 3. Due to a lack of clinical data, PFI could not be performed for acute myeloid leukemia (LAML). C-D Bar charts illustrating the number of cancer types associated with different expression levels for each prognostic PSM gene. PSM gene expression (high [blue bars, higher than median expression] and low [yellow bars, lower than median expression]) associated with OS (C) and PFI (D) in different cancer types
Fig. 7The number of prognostic PSMs associated with high or low expression per cancer type using overall survival (OS) and progression-free interval (PFI) as clinical endpoints in multivariable Cox regression analysis (adjusted for age and/or tumor grade). A-B Forest plots visualizing the Hazard ratio (HR) for the multivariable Cox regression analysis between high PSMB5 expression and OS (A) and PFI (B). HR < 1 shows reduced risk at high PSMB5 expression (higher than median expression) and HR > 1 illustrates increased risk at high PSMB5 expression. C-D Bar charts visualizing the number of prognostic PSMs associated with each cancer type at high (blue bars, higher than median expression) or low (yellow bars, lower than median expression) expression for OS (C) and PFI (D)