| Literature DB >> 34150665 |
Costas Koufaris1, Antonis Kirmizis1.
Abstract
Liver hepatocellular carcinoma (LIHC) is a leading cause of cancer-related mortality. In this study we initially interrogated the Cancer Genome Atlas (TCGA) dataset to determine the implication of N-terminal acetyltransferases (NATs), a family of enzymes that modify the N-terminus of the majority of eukaryotic proteins, in LIHC. This examination unveiled NAA40 as the NAT family member with the most prominent upregulation and significant disease prognosis for this cancer. Focusing on this enzyme, which selectively targets histone proteins, we show that its upregulation occurs from early stages of LIHC and is not specifically correlated with any established risk factors such as viral infection, obesity or alcoholic disease. Notably, in silico analysis of TCGA and other LIHC datasets found that expression of this epigenetic enzyme is associated with high proliferating, poorly differentiating and more aggressive LIHC subtypes. In particular, NAA40 upregulation was preferentially linked to mutational or non-mutational P53 functional inactivation. Accordingly, we observed that high NAA40 expression was associated with worse survival specifically in liver cancer patients with inactivated P53. These findings define NAA40 as a NAT with potentially oncogenic functions in LIHC and uncover its prognostic value for aggressive LIHC subtypes.Entities:
Keywords: NAA40; P53 signature; hepatocellular carcinoma; histone modification; protein N-terminal acetylation; the Cancer Genome Atlas (TCGA)
Year: 2021 PMID: 34150665 PMCID: PMC8208081 DOI: 10.3389/fonc.2021.691950
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Information on studies examined as part of this study.
| Analysis | Cohort(s) | Cohort size |
|---|---|---|
| Comparison of NAT expression in tumour vs adjacent normal liver tissue and association with survival | TCGA | 371 tumour and 50 normal samples |
| GSEA comparison of top and bottom quartiles of NAA40 expression in LIHC tumours | TCGA; GSE112790. | TCGA 90 samples for top and bottom quartiles; GSE112790 46 samples for top and bottom quartiles |
| LIHC disease stage | TCGA | 50 samples for the adjacent normal group, Stage I consisted of 150 samples, Stage II of 98 samples, and Stage III of 97 samples |
| Obese and non-obese LIHC patients | TCGA | 127 obese patients and 37 non-obese |
| LIHC patients with cirrhosis vs no cirrhosis | TCGA | 47 cirrhotic patients and 140 non-cirrhotic patients |
| LIHC patients with stages of liver inflammation | TCGA | 117 were defined as non-inflamed, 99 as mild inflammation, and 18 as severely inflamed |
| LIHC patients at different Fibrosis stages | TCGA | 97 were classified as stage 0, for stages 1&2 35 samples, stages 3&4 31, stage 5 10 samples, and for stage 6 76 samples |
| LIHC patients with serological data for Hepatitis B or C infection | TCGA | 207 serologically negative for both hepatitis B and C; 84 serologically positive for both hepatitis B and C; 60 positive only for hepatitis B and 19 only for hepatitis C |
| iCLuster classification | TCGA | 65 iCluster 1, 55 iCluster 2, and 63 for iCluster 3 |
|
| TCGA |
|
| Association of NAA40 with survival and P53 activity | GSE14520; | GSE14520 221 samples with survival and transcriptomic data; GSE54236 78 samples with survival and transcriptomic data |
| Hepatic differentiation | GSE23034; GSE61287; GSE19044; GSE73617 | GSE23034 compared hepatocytes (N=3) and hepatocyte derived induced pluripotent stem cells (iPSC) (N=3); GSE61287 comparison of hepatocytes, iPSC derived from hepatocytes, and hepatocyte-like cells (HLC) differentiated from the iPSC (N=1 for all groups); GSE19044 Germline cell-derived pluripotent stem cells (GPSCs) and Embryonic stem cells (ES) in naïve state or differentiated into hepatocytes (N=2 all groups); GSE73617 Mesenchymal stromal cells (MSCs), HLC derived from MSC, and HLC induced to dedifferentiate (N=3 each group |
| P53 manipulation | GSE34760; | GSE34760 compared liver tumours derived from p53 KO mice and from p53 wildtype liver tumours induced by N-nitrosodiethylamine (DEN) treatment (N=4-6 per group); GSE30137 HepG2 cells treated with DMSO, shP53, Nutlin, Nutlin+shP53 (N=2 per group); GSE30137 acute knockout of P53 using adenoviral construct (N=3 for GFP control and CRE treatment respectively) |
| Cancer cell lines | DepMap | Cell lines originating from ten different tissues of origin were examined, ranging from 22-107 cell lines per tissue. |
Comparison of NAT expression levels between adjacent normal and primary tumour tissue in TCGA LIHC dataset.
| Gene | Adjacent Normal | Primary Tumour | Fold Change | p. value |
|---|---|---|---|---|
| NAA40 | 6.9 ± 0.4 | 8.0 ± 0.8 | 2.1 | p<0.01 |
| NAA10 | 9.6 ± 0.3 | 10.4 ± 0.7 | 1.7 | p<0.01 |
| NAA20 | 10.2 ± 0.2 | 10.8 ± 0.6 | 1.5 | p<0.01 |
| NAA80 | 7.3 ± 0.6 | 7.8 ± 0.8 | 1.4 | p<0.01 |
| NAA25 | 8.0 ± 0.4 | 8.4 ± 0.6 | 1.3 | p<0.01 |
| NAA60 | 10.9 ± 0.2 | 11.2 ± 0.4 | 1.2 | p<0.01 |
| HYPK | 10.6 ± 0.4 | 10.7 ± 0.7 | 1.1 | N.S. |
| NAA15 | 9.3 ± 0.5 | 9.3 ± 0.5 | 1.0 | N.S. |
| NAA35 | 7.9 ± 0.6 | 7.9 ± 0.4 | 1.0 | N.S. |
| NAA50 | 11.1 ± 0.2 | 11.1 ± 0.4 | 1.0 | N.S. |
| NAA30 | 9.4 ± 0.3 | 9.0 ± 0.5 | 0.7 | p<0.01 |
Genes are arranged in descending order according to Fold-change between adjacent normal and tumour tissue. Values shown are Pan-cancer normalised values+1 and standard deviation. Statistical significance was tested by Student’s t-test with Bonferroni multiple testing correction.
Figure 1Association of NAA40 expression with LIHC tumour stages and exposure to specific risk factors (A) NAA40 transcript levels in adjacent normal and stages I-III of LIHC derived from the TCGA cohort. Stage IV is not shown due to small sample size; (B–G) Association between NAA40 transcript levels and various etiological factors of LIHC. In the scatterplots each circle represents an individual sample and orange line depicts the median value. NAA40 expression values shown are pan-cancer normalised log2(RSEM+1). For panel (G) Adjacent normal (Adj.norm) refers to non-cancer tissue sampled adjacent to the tumour tissue, none refers to LIHC tumour samples with no serological evidence of viral infection, any refers to LIHC samples with serological evidence with infection with either one or both hepatitis B and C Statistical significance was tested by Student’s t-test or one-way ANOVA ***p < 0.01. n.s., non significant.
Figure 2NAA40 transcript levels inversely correlated with hepatocyte differentiation (A) NAA40 levels in primary hepatocytes before and after de-defferentiation into iPSC; (B) NAA40 transcript levels in another study similar to (A) but including expression data from iPSCs that are differentiated again into hepatic-like cells (HLC); (C) Expression of NAA40 in Embryonic Stem Cells (ES) and Germ line cell-derived pluripotent stem cells (GPSC) in untreated state and after 28 day treatment to induce differentiation into hepatic-like cells; (D) NAA40 expression in resting mesenchymal stromal cells, mesenchymal stem cells induced to differentiate into HLC and after these are dedifferentiated again; (E) NAA40 expression across cancer cell lines originating from various human organs. Each circle represents a different cell line. Orange line indicated the median expression for cells of a given tissue of origin. Arbitrary Units (A.U) refers to normalised microarray values without any further processing. ***p < 0.001.
Figure 3High NAA40 expression in LIHC is a characteristic of more proliferating and aggressive cancer subtypes. (A) Diagram depicting the major molecular subclasses of LIHC adapted from (11); (B) Comparison of NAA40 expression levels in adjacent normal tissue, bottom and top quartiles of TCGA LIHC dataset; (C) GSEA plots displaying enrichment of Hoshida_S3, Boyault_G123_DOWN and Chiang_proliferation_DOWN genesets in low NAA40 expressing samples and Chiang-proliferation_UP in the high expressing NAA40 samples; (D) Heatmap displaying expression of 125 cell cycle-associated genes in High and Low NAA40 samples, with red colour signifying high expression (>value) and blue colour low expression (
Selected enriched genesets according to NAA40 levels relating to molecular classification of HCC.
| Gene Set Name | NES | Nom. p.val | FDR q Val |
|---|---|---|---|
| HOSHIDA_LIVER_CANCER_SUBCLASS_S3 | 1.98 | >0.0001 | 0.013 |
| BOYAULT_LIVER_CANCER_SUBCLASS_G1_DN | 1.79 | 0.008 | 0.017 |
| ANDERSEN_LIVER_CANCER_KRT19_DN | 1.83 | >0.0001 | 0.019 |
| BOYAULT_LIVER_CANCER_SUBCLASS_G123_DN | 1.75 | >0.0001 | 0.02 |
| CHIANG_LIVER_CANCER_SUBCLASS_PROLIFERATION_DN | 1.72 | >0.0001 | 0.022 |
| CHIANG_LIVER_CANCER_SUBCLASS_CTNNB1_UP | 1.72 | 0.02 | 0.023 |
| BOYAULT_LIVER_CANCER_SUBCLASS_G6_UP | 1.67 | 0.041 | 0.031 |
| CHIANG_LIVER_CANCER_SUBCLASS_POLYSOMY7_UP | 1.57 | 0.048 | 0.057 |
| BOYAULT_LIVER_CANCER_SUBCLASS_G3_DN | 1.87 | >0.0001 | 0.03 |
| BOYAULT_LIVER_CANCER_SUBCLASS_G12_DN | 1.68 | >0.0001 | 0.03 |
| YAMASHITA_LIVER_CANCER_STEM_CELL_DN | 1.65 | 0.008 | 0.032 |
| YAMASHITA_LIVER_CANCER_WITH_EPCAM_DN | 1.60 | 0.01 | 0.051 |
| CHIANG_LIVER_CANCER_SUBCLASS_POLYSOMY7_UP | 1.57 | 0.048 | 0.057 |
| CHIANG_LIVER_CANCER_SUBCLASS_PROLIFERATION_UP | -1.69 | >0.0001 | 0.228 |
| VILLANUEVA_LIVER_CANCER_KRT19_UP | -1.72 | 0.002 | 0.249 |
Figure 4NAA40 upregulation coincides with inactivation of P53 in LIHC (A) Stacked bar charts displaying frequencies of TP53 and CTNNB1 mutations in High and Low NAA40 LIHC groups; (B) Expression of NAA40 in TCGA LIHC tumours with mutations of TP53, CTNNB1, or neither of the two genes; (C) Heatmap comparing expression of NAA40 with ten p53-repressed genes which define a signature indicative of P53 activity; (D) Comparison of expression of NAA40 in tumours with P53 mutations, CTNNB1 mutations, and in tumours with no mutations in either of these two genes but divided into top and bottom quartiles of p53 activity according to p53-repressed gene signature; (E) Comparison of NAA40 levels in an independent LIHC cohort (GSE112790) demarcated into top and bottom quartiles based on signature of P53 repressed genes; In the scatterplots each circle represents an individual sample and orange line depicts the median value ***p < 0.001 Student’s t-test.
Figure 5High NAA40 expression predicts worse survival for LIHC patients with inactivated P53 (A–C) Kaplan-Meier survival curves for LIHC patients in the TCGA study carrying P53 mutations, CTNNB1 mutations or no mutations on either of the two genes divided according to median NAA40 expression; (D, E) Kaplan-Meier plots for LIHC samples with no registered P53 mutations that were divided into two groups according to P53 activity using a previously described 10-gene signature. Hazard ratio (HR) and p values were calculated using the log-rank method.