| Literature DB >> 28901494 |
Yao Liu1, Zhe Yang1, Feng Du1, Qiao Yang1, Jie Hou1, Xiaohong Yan1, Yi Geng1, Yaning Zhao1, Hua Wang1.
Abstract
The present study aimed to explore the underlying molecular mechanisms of hepatocellular carcinoma (HCC). RNA‑sequencing profiles GSM629264 and GSM629265, from the GSE25599 data set, were downloaded from the Gene Expression Omnibus database and processed by quality evaluation. GSM629264 and GSM629265 were from HCC and adjacent non‑cancerous tissues, respectively. TopHat software was used for alignment analysis, followed by the detection of novel splicing sites. In addition, the Cufflinks software package was used to analyze gene expressions, and the Cuffdiff program was used to screen for differently expressed genes (DEGs) and differentially expressed splicing variants. Gene ontology functional enrichment and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses of DEGs were also performed. Transcription factors (TFs) and microRNAs (miRNAs) that regulate DEGs were identified, and a protein‑protein interaction (PPI) network was constructed. The hub node in the PPI network was obtained, and the TFs and miRNAs that regulated the hub node were further predicted. The quality of the sequencing data met the standards for analysis, and the clean reads were ~65%. Most sequencing reads mapped into coding sequence exons (CDS_exons), whereas other reads mapped into exon 3' untranslated regions (UTR_Exons), 5'UTR_Exons and Introns. Upregulated and downregulated DEGs between HCC and adjacent non‑cancerous tissues were screened. Genes of differentially expressed splicing variants were identified, including vesicle‑associated membrane protein 4, phosphatidylinositol glycan anchor biosynthesis class C, protein disulfide isomerase family A member 4 and growth arrest specific 5. Screened DEGs were enriched in the complement pathway. In the PPI network, ubiquitin C (UBC) was the hub node. UBC was predicted to be regulated by several TFs, including specificity protein 1 (SP1), FBJ murine osteosarcoma viral oncogene homolog (FOS), proto‑oncogene c‑JUN (JUN), FOS‑like antigen 2 (FOSL2) and SWI/SNF‑related, matrix‑associated, actin‑dependent regulator of chromatin, subfamily A, member 4 (SMARCA4), and several miRNAs, including miR‑30 and miR‑181. Results from the present study demonstrated that UBC, SP1, FOS, JUN, FOSL2, SMARCA4, miR‑30 and miR‑181 may participate in the development of HCC.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28901494 PMCID: PMC5865798 DOI: 10.3892/mmr.2017.7457
Source DB: PubMed Journal: Mol Med Rep ISSN: 1791-2997 Impact factor: 2.952
Quality evaluation chart for sequencing data.
| Sample | Raw reads[ | Clean reads[ | Clean bases[ | sQ20 (%)[ | GC (%)[ | Duplication (%)[ |
|---|---|---|---|---|---|---|
| SRR074999 | 21,944,622 | 14,292,579 | 571M | 94.87 | 45.31 | 40.13 |
| SRR075000 | 21,328,051 | 13,254,517 | 530M | 94.04 | 45.53 | 38.28 |
| SRR075001 | 21,532,717 | 13,304,142 | 532M | 93.95 | 45.46 | 38.98 |
| SRR075002 | 20,950,756 | 13,615,048 | 544M | 94.81 | 45.36 | 45.40 |
| SRR075003 | 21,959,501 | 13,835,204 | 553M | 94.17 | 45.37 | 45.11 |
| SRR075004 | 22,011,164 | 13,372,744 | 534M | 93.80 | 45.01 | 42.09 |
Original reads transformed from original sequencing images.
Reads filtered from raw reads.
Total number of bases that were filtered.
Percentage of clean bases with sQ ≥20 in all clean bases.
Percentage of GC content in the sequence.
Percentage of repeated reads in whole reads. GC, guanine and cytosine; M, Megabase.
Top 10 genes list - differentially expressed genes.
| Gene | Locus | Control | Case | Log2(FC) | Regulation |
|---|---|---|---|---|---|
| AFP | 4:74296854–74321891 | 10.953 | 6573.840 | 9.230 | Up |
| THBS4 | 5:79287133–79379477 | 0.177 | 77.892 | 8.780 | Up |
| NTS | 12:86268072–86276767 | 0.866 | 315.600 | 8.509 | Up |
| PRAME | 22:22890122–2290900 | 0.0255 | 4.051 | 7.311 | Up |
| SULT1C2 | 2:108905094–108926371 | 0.465 | 69.951 | 7.234 | Up |
| PEG10 | 7:94285636–94299007 | 1.427 | 213.887 | 7.227 | Up |
| NQO1 | 16:69740898–69760854 | 1.418 | 206.101 | 7.183 | Up |
| AGR2 | 7:16831434–16873057 | 0.997 | 113.695 | 6.834 | Up |
| GPC3 | X:132669772–133119922 | 5.825 | 566.919 | 6.605 | Up |
| NLRP1 | 17:5402747–5487832 | 3.232 | 289.985 | 6.488 | Up |
| CLEC4M | 19:7804878–7834490 | 271.072 | 1.483 | −7.514 | Down |
| ADH4 | 4:100010007–100274184 | 612.703 | 3.122 | −7.617 | Down |
| HPD | 12:122277432–122326517 | 833.459 | 4.560 | −7.514 | Down |
| RP11-7M8.2.1 | 12:122277432–122326517 | 186.722 | 1.029 | −7.503 | Down |
| SYT9 | 11:7260098–7490273 | 4.716 | 0.027 | −7.450 | Down |
| CYP2E1 | 10:135192694–135383462 | 4030.920 | 22.780 | −7.467 | Down |
| CTD-2195M18.1.1 | 5:6582248–6588612 | 27.817 | 0.159 | −7.455 | Down |
| CPS1 | 2:211342405–211543831 | 2827.010 | 17.092 | −7.367 | Down |
| GLYAT | 11:58476536–58499447 | 150.288 | 0.955 | −7.298 | Down |
| HSD11B1 | 1:209834708–209908295 | 520.703 | 3.393 | −7.262 | Down |
Paired t-test (P-value) was used to identify the splicing variants that were differentially expressed between HCC and adjacent non-cancerous tissues. HCC, hepatocellular carcinoma; FC, fold change.
Top 10 genes list - genes with significant differentially expressed splicing variants.
| Gene | Locus | Sqrt (JS) | P-value | q-value | Significant |
|---|---|---|---|---|---|
| VAMP4 | 1:171669299–171711387 | 0.328 | 5.05×10−3 | 4.79×10−2 | Yes |
| PIGC | 1:171810620–172437971 | 0.198 | 2.50×10−4 | 3.93×10−3 | Yes |
| PDIA4 | 7:148700153–148725733 | 0.833 | 5.00×10−5 | 9.03×10−4 | Yes |
| GAS5 | 1:173831289–173866494 | 0.244 | 5.00×10−5 | 9.03×10−4 | Yes |
| RARRES2 | 7:150035407–150038763 | 0.287 | 5.00×10−5 | 9.03×10−4 | Yes |
| ABCF2 | 7:150904922–150924316 | 0.160 | 1.00×10−4 | 1.72×10−3 | Yes |
| MRPS14 | 1:174968299–174992561 | 0.174 | 5.00×10−5 | 9.03×10−4 | Yes |
| SLC7A2 | 8:17354596–17428082 | 0.436 | 5.00×10−5 | 9.03×10−4 | Yes |
| INTS10 | 8:19674650–19709594 | 0.494 | 4.40×10−3 | 4.34×10−2 | Yes |
Paired t-test (P-value) was used to identify the splicing variants that were differentially expressed between HCC and adjacent non-cancerous tissues. HCC, hepatocellular carcinoma; Sqrt, square root. JS, JavaScript.
Top 5 significant GO terms.
| Category | GO ID | Term | Count | Ratio | P-value | q-value | Regulation |
|---|---|---|---|---|---|---|---|
| GOTERM_CC_FAT | GO:0031974 | Membrane-enclosed lumen | 201 | 17.12 | 2.13×10−13 | 1.18×10−10 | Up |
| GOTERM_CC_FAT | GO:0043233 | Organelle lumen | 197 | 16.78 | 4.37×10−13 | 1.21×10−10 | Up |
| GOTERM_CC_FAT | GO:0070013 | Intracellular organelle lumen | 193 | 16.44 | 6.82×10−13 | 1.26×10−10 | Up |
| GOTERM_CC_FAT | GO:0005829 | Cytosol | 151 | 12.86 | 1.70×10−11 | 2.35×10−9 | Up |
| GOTERM_CC_FAT | GO:0031981 | Nuclear lumen | 159 | 13.54 | 6.46×10−11 | 7.13×10−9 | Up |
| GOTERM_CC_FAT | GO:0044421 | Extracellular region part | 158 | 11.84 | 6.91×10−21 | 3.08×10−18 | Down |
| GOTERM_BP_FAT | GO:0055114 | Oxidation reduction | 117 | 8.77 | 6.75×10−20 | 2.36×10−16 | Down |
| GOTERM_BP_FAT | GO:0009611 | Response to wounding | 102 | 7.65 | 6.01×10−19 | 1.05×10−15 | Down |
| GOTERM_CC_FAT | GO:0005615 | Extracellular space | 120 | 9.00 | 5.67×10−18 | 1.26×10−15 | Down |
| GOTERM_CC_FAT | GO:0005576 | Extracellular region | 251 | 18.82 | 3.79×10−16 | 4.95×10−14 | Down |
The Fisher's exact test was used to calculate statistical significance (P-values) of enriched annotation terms. The q value is the Benjamini-Hochberg adjusted P-value. GO, gene ontology.
Figure 1.Regulatory networks of miRNAs, TFs and target genes. (A) Predicted TF-target gene network. (B) Predicted miRNA-target gene network. The red nodes represent the miRNAs or TFs, and the blue nodes represent their target genes. A link represents an interaction between a TF or miRNA and its target gene, whereas the size of a node corresponds to the number of interactions that a TF or miRNA has. miRNA, microRNA; TF, transcription factor.
Figure 2.Protein-protein interaction network with ubiquitin C as the hub node. In the network, a node represents a protein and a link represents each pairwise protein interaction. The red shaded nodes represent upregulated genes, and the blue shaded nodes represent downregulated genes. The size of a node corresponds to its degree (that is, the number of interactions one protein has). Green-framed nodes are those with a degree >50.
Figure 3.Regulatory network of UBC. The UBC gene may be regulated by transcription factors, represented by the square nodes, and microRNAs, represented by the diamond nodes. Green square framed nodes were also identified as DEGs, with the red shaded nodes representing the upregulated DEGs, and the blue shaded nodes representing the downregulated DEGs. UBC, ubiquitin C; DEGs, differently expressed genes.
Top 5 significant KEGG pathways.
| Category | KEGG ID | Term | Count | Ratio | P-value | q-value | Regulation |
|---|---|---|---|---|---|---|---|
| KEGG_PATHWAY | hsa04110: | Cell cycle | 28 | 2.385 | 7.10×10−7 | 1.16×10−4 | Up |
| KEGG_PATHWAY | hsa03030 | DNA replication | 14 | 1.193 | 1.41×10−6 | 1.15×10−4 | Up |
| KEGG_PATHWAY | hsa00480 | Glutathione metabolism | 13 | 1.107 | 3.36×10−4 | 1.81×10−2 | Up |
| KEGG_PATHWAY | hsa00970 | Aminoacyl-tRNA biosynthesis | 11 | 0.937 | 9.14×10−4 | 3.66×10−2 | Up |
| KEGG_PATHWAY | hsa04610 | Complement and coagulation cascades | 32 | 2.399 | 1.87×10−14 | 3.43×10−12 | Down |
| KEGG_PATHWAY | hsa00071 | Fatty acid metabolism | 20 | 1.499 | 7.52×10−10 | 6.92×10−8 | Down |
| KEGG_PATHWAY | hsa03320 | PPAR signaling pathway | 25 | 1.874 | 9.89×10−9 | 6.07×10−7 | Down |
| KEGG_PATHWAY | hsa00982 | Drug metabolism | 21 | 1.574 | 7.00×10−7 | 3.22×10−5 | Down |
| KEGG_PATHWAY | hsa00830 | Retinol metabolism | 19 | 1.424 | 1.48×10−6 | 5.45×10−5 | Down |
The Fisher's exact test was used to calculate statistical significance (P-values) of enriched annotation terms. The q value is the Benjamini-Hochberg adjusted P-value. KEGG, Kyoto Encyclopedia of Genes and Genomes.