| Literature DB >> 26887054 |
Davide Degli Esposti1, Hector Hernandez-Vargas1, Catherine Voegele2, Nora Fernandez-Jimenez1, Nathalie Forey2, Brigitte Bancel3,4, Florence Le Calvez-Kelm2, James McKay2, Philippe Merle3,5, Zdenko Herceg1.
Abstract
Functional characterization of long non-coding RNAs (lncRNAs) and their pathological relevance is still a challenging task. Abnormal expression of a few long non-coding RNAs have been found associated with hepatocellular carcinoma, with potential implications to both improve our understanding of molecular mechanism of liver carcinogenesis and to discover biomarkers for early diagnosis or therapy. However, the understanding of the global role of lncRNAs during HCC development is still in its infancy. In this study, we produced RNA-Seq data from 23 liver tissues (controls, cirrhotic and HCCs) and applied statistical and gene network analysis approaches to identify and characterize expressed lncRNAs. We detected 5,525 lncRNAs across different tissue types and identified 57 differentially expressed lncRNAs in HCC compared with adjacent non-tumour tissues using stringent criteria (FDR<0.05, Fold Change>2). Using weighted gene co-expression network analysis (WGCNA), we found that differentially expressed lncRNAs are co-expressed with genes involved in cell cycle regulation, TGF-β signalling and liver metabolism. Furthermore, we found that more than 20% of differentially expressed lncRNAs are associated to actively transcribed enhancers and that the co-expression patterns with their closest genes change dramatically during HCC development. Our study provides the most comprehensive compendium of lncRNAs expressed in HCC, as well as in control or cirrhotic livers. Our results identified both known oncogenic lncRNAs (such as H19 and CRNDE) and novel lncRNAs involved in cell cycle deregulation and liver metabolism deficits occurring during HCC development.Entities:
Keywords: RNA-sequencing; enhancer-associated RNAs; gene networks; hepatocellular carcinoma; long non-coding RNA
Mesh:
Substances:
Year: 2016 PMID: 26887054 PMCID: PMC5077982 DOI: 10.18632/oncotarget.7364
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
RNA-Sequencing general data after alignment and mapping
| Sample | Total mapped reads | Total mapped reads | % of lncRNAs reads |
|---|---|---|---|
| 112N | 30.2 | 1.5 | 4.9 |
| 112T | 27.7 | 1.8 | 6.4 |
| 122N | 11.2 | 0.7 | 5.9 |
| 122T | 16.1 | 1.0 | 6.2 |
| 127N | 21.8 | 1.0 | 4.7 |
| 127T | 21.9 | 1.2 | 5.3 |
| 1N | 33.6 | 2.2 | 6.4 |
| 1T | 25.6 | 1.6 | 6.3 |
| 2N | 75.3 | 4.2 | 5.5 |
| 2T | 46.9 | 3.0 | 6.5 |
| 8N | 46.4 | 2.8 | 6.0 |
| 8T | 72.1 | 4.1 | 5.7 |
| 20N | 61.6 | 3.5 | 5.7 |
| 20T | 52.7 | 4.2 | 8.0 |
| 13N | 49.0 | 3.4 | 7.0 |
| 13T | 55.6 | 4.0 | 7.2 |
| 14N | 42.0 | 2.2 | 5.2 |
| 14T | 37.2 | 3.0 | 8.1 |
| 15N | 29.2 | 1.5 | 5.1 |
| 15T | 27.7 | 1.8 | 6.5 |
| C1 | 24.9 | 1.4 | 5.8 |
| C5 | 29.8 | 1.7 | 5.8 |
| C8 | 19.4 | 1.2 | 6.2 |
| TOTAL | 857.7 | 53.0 | 6.2 |
| Average | 37.3 | 2.3 | 6.2 |
Mapped and annotated using Genecode V17 annotation
Number of reads corresponding to lncRNAs annotated in Genecode V17 annotation
Figure 1A. Number of genes expressed in control livers (3 cases), cirrhotic livers adjacent to HCCs and HCCs (10 cases) for each different gene category. B. Expression variability of each gene category in control livers, cirrhotic livers adjacent to HCCs and HCCs. C. Uniquely expressed genes for each category in control livers, cirrhotic livers adjacent to HCCs and HCCs.
Uniquely expressed genes in control, cirrhotic livers or hepatocellular carcinomas (HCC)
| Histology | Uniquely expressed genes | ||||||
|---|---|---|---|---|---|---|---|
| Protein coding | Long non coding RNAs | Pseudogenes | TOTAL | ||||
| N° | % | N° | % | N° | % | ||
| Control livers | 721 | 3.9 | 206 | 1.1 | 131 | 0.7 | 18,586 |
| Cirrhotic livers | 767 | 4.7 | 55 | 0.3 | 54 | 0.3 | 16,445 |
| HCC | 747 | 4.9 | 58 | 0.4 | 52 | 0.3 | 15,172 |
χ2 test:
p-value<0.001 compared with controls
Figure 2A. Heatmaps showing unsupervised clustering of HCC and adjacent cirrhotic tissues based on differentially expressed coding genes (up) and long non-coding genes (down). B. Distribution of differentially expressed genes by gene category. C. Overlap between differentially expressed genes in IARC-Croix Rousse cohort and TCGA paired matched cases, annotated with RefSeqGene. D. RT-qPCR results on 5 different lncRNAs in 20 cases (HCC and cirrhotic adjacent tissues) and 10 control livers.
Differentially expressed long non-coding RNAs in HCC compared with adjacent cirrhotic tissues
| Genes | Gene symbol | Expression levels in cirrhosis (FPKM) | Expression levels in HCC (FPKM) | FDR |
|---|---|---|---|---|
| ENSG00000228651.1 | RP11-556E13.1 | 0.32 | 6.21 | 0.00003 |
| ENSG00000253819.1 | LINC01151 | 0.65 | 2.77 | 0.04545 |
| ENSG00000257579.1 | - | 0.26 | 0.90 | 0.02398 |
| ENSG00000250696.1 | RP11-704M14.1 | 1.19 | 3.16 | 0.00097 |
| ENSG00000233396.1 | - | 0.21 | 0.72 | 0.00567 |
| ENSG00000228709.1 | - | 0.58 | 1.79 | 0.01721 |
| ENSG00000267583.1 | - | 0.33 | 0.78 | 0.00640 |
| ENSG00000230325.1 | - | 0.54 | 1.32 | 0.03730 |
| ENSG00000228918.1 | LINC01344 | 0.40 | 0.85 | 0.00056 |
| ENSG00000261716.1 | - | 0.79 | 1.73 | 0.00150 |
| ENSG00000243533.1 | - | 2.17 | 4.23 | 0.01998 |
| ENSG00000230850.3 | - | 0.10 | 0.24 | 0.01509 |
| ENSG00000232995.2 | RGS5 | 1.09 | 2.39 | 0.00016 |
| ENSG00000245694.4 | CRNDE | 0.74 | 1.54 | 0.00577 |
| ENSG00000259347.1 | - | 2.48 | 5.45 | 0.02980 |
| ENSG00000237036.3 | - | 0.47 | 1.04 | 0.00976 |
| ENSG00000243628.2 | - | 0.56 | 1.22 | 0.01902 |
| ENSG00000270933.1 | - | 0.99 | 1.75 | 0.01110 |
| ENSG00000230333.2 | - | 1.26 | 0.59 | 0.01619 |
| ENSG00000225431.1 | LOC101928233 | 0.77 | 0.38 | 0.02902 |
| ENSG00000233421.2 | - | 8.44 | 2.80 | 0.04678 |
| ENSG00000233695.1 | - | 1.55 | 0.72 | 0.00446 |
| ENSG00000258867.1 | LINC01146 | 5.71 | 3.52 | 0.01950 |
| ENSG00000249201.2 | CTD-3080P12.3 | 1.78 | 0.77 | 0.00345 |
| ENSG00000237517.4 | DGCR5 | 1.37 | 0.57 | 0.01325 |
| ENSG00000228826.1 | - | 2.56 | 1.05 | 0.01722 |
| ENSG00000236341.1 | - | 2.17 | 0.84 | 0.00005 |
| ENSG00000233554.1 | - | 1.92 | 0.76 | 0.00266 |
| ENSG00000233392.1 | LOC200772 | 0.90 | 0.34 | 0.00177 |
| ENSG00000232310.1 | - | 0.81 | 0.30 | 0.00008 |
| ENSG00000254192.1 | - | 0.73 | 0.24 | 0.01924 |
| ENSG00000229005.1 | HNF4A-AS1 | 5.41 | 2.34 | 0.01208 |
| ENSG00000249364.1 | LOC101928858 | 2.23 | 0.79 | 0.03451 |
| ENSG00000223956.1 | - | 3.32 | 1.41 | 0.00688 |
| ENSG00000239685.1 | - | 30.93 | 12.76 | 0.00245 |
| ENSG00000224652.1 | LINC00885 | 1.04 | 0.36 | 0.00044 |
| ENSG00000248740.1 | LOC101929448 | 2.20 | 0.59 | 0.00004 |
| ENSG00000230613.1 | HM13-AS1 | 0.98 | 0.36 | 0.02403 |
| ENSG00000261058.1 | - | 1.81 | 0.58 | 0.00428 |
| ENSG00000253364.1 | - | 0.89 | 0.27 | 0.00038 |
| ENSG00000229740.1 | - | 2.73 | 0.84 | 0.00516 |
| ENSG00000257878.1 | - | 2.85 | 0.82 | 0.00000 |
| ENSG00000250463.1 | - | 1.50 | 0.41 | 0.00005 |
| ENSG00000264066.2 | - | 38.03 | 8.67 | 0.00219 |
| ENSG00000221857.3 | - | 53.23 | 15.66 | 0.00007 |
| ENSG00000255071.1 | SAA2-SAA4 | 143.42 | 55.29 | 0.01047 |
| ENSG00000049319.2 | - | 9.72 | 4.00 | 0.01193 |
| ENSG00000205865.4 | FAM99B | 0.69 | 0.18 | 0.00370 |
| ENSG00000258442.1 | - | 2.23 | 0.44 | 0.00000 |
| ENSG00000237949.1 | - | 13.00 | 5.52 | 0.02010 |
| ENSG00000243694.2 | - | 2.77 | 0.41 | 0.00000 |
| ENSG00000262588.1 | - | 7.64 | 2.71 | 0.00021 |
| ENSG00000240801.1 | - | 339.18 | 43.36 | 0.00727 |
| ENSG00000249173.1 | - | 6.95 | 0.61 | 0.00000 |
| ENSG00000130600.10 | H19 | 222.04 | 123.02 | 0.00004 |
| ENSG00000231486.3 | - | 147.04 | 52.01 | 0.00014 |
| ENSG00000205866.2 | FAM99A | 5.34 | 0.91 | 0.00002 |
False Discovery Rate. The FDRs reflect statistical significance from the differential analysis performed using read counts as expression units (see Materials and Methods section)
Figure 3Co-expression analysis of 3 different enhancer-associated lncRNA (eRNAs), representing the 3 main trends observed in the co-expression patterns of differentially expressed eRNAs
Each small square represents the P value for the correlation of the expression level in a specific gene pair. White, light gray, dark gray and black indicate Pearson's correlation P values of >0.05, <0.05, <0.01 and <0.001, respectively. A. Co-expression analysis for FAM99A. Loss of co-expression is observed in cirrhotic and HCC samples compared with control livers (χ2 test, p-value<0.05). B. Co-expression analysis for ENSG00000228709) Gain of co-expression in cirrhotic tissues compared with control livers or HCCs (χ2 test, p-value<0.05). C. Co-expression analysis for LINC00885. Gain of co-expression is observed in HCC compared with cirrhotic or control livers (χ2 test, p-value<0.05).
Figure 4Gene co-expression network analysis of differentially expressed genes
A. Gene clustering and module identification. Each co-expression cluster has a different colour. B. LncRNAs distribution in the different modules. C. Co-expression matrix showing relations across the different modules. D. Eigengene heatmap showing correlation among the different eigengene vectors of each module.
Patients characteristics
| Patient No. | Age | Gender | Status | Tissue | Sequencing code | Primary aetiology | Secondary aetiology | EGSc |
|---|---|---|---|---|---|---|---|---|
| 1 | 70 | Male | Case | T+NT | 1T, 1NT | NASH | - | 3 |
| 2 | 75 | Male | Case | T+NT | 2T, 2NT | NASH | - | 2 |
| 3 | 68 | Female | Case | T+NT | N.A | NASH | - | 3 |
| 4 | 70 | Male | Case | T+NT | N.A | NASH | - | 3 |
| 5 | 68 | Male | Case | T+NT | N.A | NASH | - | 2 |
| 6 | 70 | Male | Case | T+NT | N.A | Alcohol | - | 2 |
| 7 | 63 | Male | Case | T+NT | N.A | Alcohol | Obesity | 2 |
| 8 | 74 | Male | Case | T+NT | 8T, 8NT | Alcohol | - | 2 |
| 9 | 60 | Male | Case | T+NT | N.A | Alcohol | - | 3 |
| 10 | 63 | Male | Case | T+NT | N.A | Alcohol | - | 4 |
| 11 | 55 | Female | Case | T+NT | N.A | Alcohol | - | 3 |
| 12 | 57 | Male | Case | T+NT | N.A | Alcohol | - | 3 |
| 13 | 74 | Male | Case | T+NT | 13T, 13NT | HCV | - | 2 |
| 14 | 52 | Male | Case | T+NT | 14T, 14NT | HCV | NASH | 3 |
| 15 | 44 | Male | Case | T+NT | 15T, 15NT | HCV | Alcohol | 3 |
| 16 | 38 | Male | Case | T+NT | N.A | HBV | - | 3 |
| 17 | 67 | Male | Case | T+NT | N.A | HBV | - | 3 |
| 18 | 56 | Male | Case | T+NT | N.A | HBV | - | 3 |
| 19 | 21 | Male | Case | T+NT | N.A | Cryptogenic | - | 4 |
| 20 | 57 | Female | Case | T+NT | 20T, 20NT | Glycogenosis | - | 3 |
| 21 | 77 | Male | Case | T+NT | 112T, 112NT | Alcohol | - | N.A |
| 22 | 67 | Male | Case | T+NT | 122T, 122NT | Alcohol | - | N.A |
| 23 | 79 | Female | Case | T+NT | 127T, 127NT | HCV | - | N.A |
| 24 | 78 | Female | Control | N | C1 | Ovarian cancer | Liver metastasis | N.A |
| 25 | 72 | Male | Control | N | N.A | Leiomyosarcoma | Liver metastasis | N.A |
| 26 | 80 | Female | Control | N | N.A | Colon cancer | Liver metastasis | N.A |
| 27 | 63 | Male | Control | N | N.A | Colon cancer | Liver metastasis | N.A |
| 28 | 79 | Male | Control | N | C5 | Colon cancer | Liver metastasis | N.A |
| 29 | 38 | Female | Control | N | N.A | Breast cancer | Liver metastasis | N.A |
| 30 | 63 | Female | Control | N | N.A | Colon cancer | Liver metastasis | N.A |
| 31 | 63 | Male | Control | N | C8 | Colon cancer | Liver metastasis | N.A |
| 32 | 59 | Male | Control | N | N.A | Colon cancer | Liver metastasis | N.A |
| 33 | 55 | Female | Control | N | N.A | Pancreatic cancer | Liver metastasis | N.A |
Pair-Matched HCC with adjacent non tumour tissues. All NT were cirrhotic (F4).
Edmondson-Steiner Grading system.
Primers used for RT-qPCR analysis on liver tissues
| Gene | F-primer | R-primer |
|---|---|---|
| 5-CCTTGTGACACTGAGATCAGCTA-3 | 5-ATCTCCCAGTCGGGTTTCCT-3 | |
| 5-ACACGGCTTTCCGGAGTAGA-3 | 5-GCCAACATTTGGAGGAACCC-3 | |
| 5-GTCCCTTGCCCTCTCTTGTC-3 | 5-ACACGCATCACAAAACAGCC-3 | |
| 5′-ATCGGTGCCTCAGCGTTC-3′ | 5′-AGAAACAGACCCGCTTCTTG-3′ | |
| 5′-ACCTCCAGAACTGTGATCCAAAATG-3′ | 5′-TCTTGCTTGATGCTTTGGTCTG-3′ | |
| 5′-GGCTACGGGAAGATCCTGGA-3′ | 5′-TGCATCACGCAGATCATCAA-3′ |