| Literature DB >> 25917195 |
Yu-Chiao Chiu, Tzu-Hung Hsiao, Yidong Chen, Eric Y Chuang.
Abstract
BACKGROUND: In addition to direct targeting and repressing mRNAs, recent studies reported that microRNAs (miRNAs) can bridge up an alternative layer of post-transcriptional gene regulatory networks. The competing endogenous RNA (ceRNA) regulation depicts the scenario where pairs of genes (ceRNAs) sharing, fully or partially, common binding miRNAs (miRNA program) can establish coexpression through competition for a limited pool of the miRNA program. While the dynamics of ceRNA regulation among cellular conditions have been verified based on in silico and in vitro experiments, comprehensive investigation into the strength of ceRNA regulation in human datasets remains largely unexplored. Furthermore, pan-cancer analysis of ceRNA regulation, to our knowledge, has not been systematically investigated.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25917195 PMCID: PMC4416191 DOI: 10.1186/1471-2164-16-S4-S1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Analysis flowchart of this study. The present study is aimed to systematically explore optimal conditions and related biological functions of ceRNA regulation in GBM, and confer cancer type specific and independent effects. (A) First we defined 47,451,423 putative ceRNA pairs as pairs of genes (i.e., pairs of ceRNAs) sharing any number of predicted targeting miRNAs in the TargetScan database. Pairwise correlation coefficients of putative ceRNA pairs were computed in 481-sample TCGA GBM datasets. (B) Correlation coefficients were partitioned into groups based on the states of each essential factor of putative ceRNA pairs, followed by inter-group goodness-of-fit tests (K-S tests) that pinpointed the essential factors and optimal conditions for ceRNA regulation. (C) 551,175 pairs of ceRNAs fulfilling all of the identified optimal conditions were defined as optimal ceRNA pairs. In order to address differential and constitutive functions, we then included TCGA OV, LUSC, and LAML datasets and performed pan-cancer analysis.
Figure 2Effects of size of miRNA programs and number of miRNA program binding sites on ceRNA regulation. (A, B) Density functions and cumulative distribution functions of correlation coefficients of putative ceRNA pairs. The putative ceRNA pairs were divided into four groups by the quartiles of miRNA program sizes. (C, D) Density functions and cumulative distribution functions of correlation coefficients of putative ceRNA pairs, which were partitioned based on number of miRNA program binding sites.
Figure 3Effects of expression levels of miRNA programs and ceRNAs on ceRNA regulation. (A, B) Density functions and cumulative distribution functions of correlation coefficients of putative ceRNA pairs. The putative ceRNA pairs were split into groups by quartiles of miRNA programs expression levels. (C) Density functions of correlation coefficients of putative ceRNA pairs. Here the putative ceRNA pairs were categorized based on expression states (i.e., H, M, and L) of their composed ceRNAs. (D) Cumulative distribution functions of (C), focused on comparison of ceRNA pairs composed of two highly expressed genes (H-H) to other ceRNA pairs and non-ceRNA pairs.
Number of ceRNA pairs in groups categorized based on expression states of their composed ceRNAs.
| High-expression genes (H) | Medium-expression genes (M) | Low-expression genes (L) | |
|---|---|---|---|
| 4551383 (9.59%) | |||
| 8262660 (17.41%) | 3786914 (7.98%) | ||
| 10134010 (21.36%) | 9302868 (19.61%) | 5654651 (11.92%) | |
The percentages were calculated with respect to the number of all putative ceRNA pairs (47,451,423).
Figure 4The optimal ceRNA regulatory network. (A) Cumulative distribution functions of correlation coefficients of optimal ceRNA pairs satisfying four optimal conditions, other ceRNA pairs, and non-ceRNA pairs. (B) The optimal ceRNA regulatory network. The network is constructed by merging the identified 551,175 optimal ceRNA pairs comprising 2,405 ceRNAs. Nodes and edges denote ceRNAs and optimal regulatory relationship, respectively. (C) The subnetwork of intracellular transport (GO:0046907), generated by extracting 181 genes related to the function and corresponding ceRNA regulatory pairs from (B). Node size is proportional to the number of first-order neighbors and nodes accounting for more than 1% of all intra-function ceRNA regulating pairs are labeled with gene symbols. (D) The subnetwork of protein localization (GO:0008104).
Top 20 hub genes of the optimal ceRNA network.
| Hub genes | No. of first-order neighbors | Percentage of total optimal ceRNA pairs | Entrez gene name | Typea | Disease/Functiona |
|---|---|---|---|---|---|
| CDS2 | 1480 | 0.269% | CDP-diacylglycerol synthase (phosphatidate cytidylyltransferase) 2 | enzyme | |
| PARVA | 1470 | 0.267% | parvin, alpha | other | cancer; cell death and survival |
| SLC1A2 | 1466 | 0.266% | solute carrier family 1 (glial high affinity glutamate transporter), member 2 | transporter | neurological disease; hereditary disorder |
| NFIB | 1435 | 0.260% | nuclear factor I/B | transcription regulator | |
| SSR1 | 1421 | 0.258% | signal sequence receptor, alpha | other | |
| GTF2H5 | 1407 | 0.255% | general transcription factor IIH, polypeptide 5 | other | neurological disease; hereditary disorder |
| SAR1B | 1398 | 0.254% | SAR1 homolog B (S. cerevisiae) | enzyme | hereditary disorder |
| GSK3B | 1392 | 0.253% | glycogen synthase kinase 3 beta | kinase | cancer; cell death and survival; neurological disease; hereditary disorder |
| HEG1 | 1390 | 0.252% | heart development protein with EGF-like domains 1 | other | cancer |
| ZNF148 | 1390 | 0.252% | zinc finger protein 148 | transcription regulator | cell death and survival |
| EIF5 | 1387 | 0.252% | eukaryotic translation initiation factor 5 | translation regulator | cancer |
| SMAD4 | 1382 | 0.251% | SMAD family member 4 | transcription regulator | cancer; cell death and survival; prognosis biomarker; neurological disease; hereditary disorder |
| TCF4 | 1374 | 0.249% | transcription factor 4 | transcription regulator | cell death and survival; neurological disease; hereditary disorder |
| QKI | 1366 | 0.248% | QKI, KH domain containing, RNA binding | other | cancer; neurological disease; hereditary disorder |
| LSAMP | 1364 | 0.247% | limbic system-associated membrane protein | other | |
| ATXN1 | 1354 | 0.246% | ataxin 1 | transcription regulator | cancer; cell death and survival; neurological disease; hereditary disorder |
| DCP2 | 1354 | 0.246% | decapping mRNA 2 | enzyme | |
| PSD3 | 1346 | 0.244% | pleckstrin and Sec7 domain containing 3 | other | cancer; neurological disease |
| SLC38A1 | 1340 | 0.243% | solute carrier family 38, member 1 | transporter | |
| VAPB | 1340 | 0.243% | VAMP (vesicle-associated membrane protein)- associated protein B and C | other | cell death and survival; neurological disease; hereditary disorder |
a Annotations were obtained from Ingenuity Pathway Analysis (Qiagen Inc.).
Top 5 clusters of Gene Ontology terms enriched in the 2,405 optimal ceRNAs
| GO Term | No. of genes | Bonferroni adjusted | Total No. of optimal ceRNA pairs/ceRNAs | GBM corea | OV corea | LUSC corea | LAML corea | CV among cancersb |
|---|---|---|---|---|---|---|---|---|
| GO:0046907~intracellular transport | 184 | 9.26E-18 | ||||||
| GO:0008104~protein localization | 225 | 7.88E-17 | ||||||
| GO:0015031~protein transport | 200 | 4.67E-16 | ||||||
| GO:0045184~establishment of protein localization | 201 | 6.18E-16 | 8229/261 | 7152/261 | 6235/258 | 3961/254 | 2755/247 | 34.85%/2.06% |
| GO:0070727~cellular macromolecule localization | 126 | 2.75E-14 | ||||||
| GO:0034613~cellular protein localization | 124 | 1.09E-13 | ||||||
| GO:0006886~intracellular protein transport | 111 | 3.17E-11 | ||||||
| GO:0009057~macromolecule catabolic process | 180 | 2.95E-08 | ||||||
| GO:0044265~cellular macromolecule catabolic process | 168 | 1.04E-07 | ||||||
| GO:0030163~protein catabolic process | 143 | 6.75E-06 | ||||||
| GO:0043632~modification-dependent macromolecule catabolic process | 134 | 8.41E-06 | ||||||
| GO:0019941~modification-dependent protein catabolic process | 134 | 8.41E-06 | 4838/200 | 4418/200 | 3836/197 | 2379/193 | 2049/191 | 31.08%/1.79% |
| GO:0051603~proteolysis involved in cellular protein catabolic process | 138 | 1.31E-05 | ||||||
| GO:0044257~cellular protein catabolic process | 138 | 1.85E-05 | ||||||
| GO:0006511~ubiquitin-dependent protein catabolic process | 68 | 8.96E-05 | ||||||
| GO:0006508~proteolysis | 177 | 1 | ||||||
| GO:0016192~vesicle-mediated transport | 141 | 8.98E-08 | ||||||
| GO:0016044~membraneorganization | 93 | 3.70E-04 | 3462/157 | 2882/155 | 2519/156 | 1607/149 | 1208/147 | 32.82%/2.53% |
| GO:0010324~membrane invagination | 50 | 0.935246 | ||||||
| GO:0006897~endocytosis | 50 | 0.935246 | ||||||
| GO:0032446~protein modification by small protein conjugation | 45 | 6.85E-05 | ||||||
| GO:0070647~protein modification by small protein conjugation or removal | 50 | 2.36E-04 | ||||||
| GO:0016567~protein ubiquitination | 40 | 7.30E-04 | ||||||
| GO:0019787~small conjugating protein ligase activity | 45 | 0.012039 | 607/67 | 555/67 | 488/66 | 335/65 | 266/62 | 28.13%/2.88% |
| GO:0016881~acid-amino acid ligase activity | 51 | 0.020482 | ||||||
| GO:0004842~ubiquitin-protein ligase activity | 39 | 0.084550 | ||||||
| GO:0016879~ligase activity, forming carbon-nitrogen bonds | 53 | 0.213866 | ||||||
| GO:0010608~posttranscription al regulation of gene expression | 59 | 0.001017 | ||||||
| GO:0006417~regulation of translation | 40 | 0.036298 | 1312/105 | 1132/105 | 961/101 | 643/99 | 480/100 | 31.90%/2.25% |
| GO:0032268~regulation of cellular protein metabolic process | 96 | 0.576893 | ||||||
a Number of optimal ceRNA pairs/ceRNAs with significant positive correlation coefficients in corresponding cancer dataset, b Coefficients of variation of number of core ceRNA pairs/ceRNAs among four cancer datasets.
Figure 5Venn diagrams of core ceRNAs among four cancer types. (A) Comparison of core ceRNAs in four cancer datasets. Core ceRNAs are genes that comprise the core ceRNA pairs with significant positive correlation coefficients in a cancer dataset. (B) Comparison of top hub core ceRNAs which collectively account for 10% of the total core ceRNA regulating pairs in corresponding cancer datasets.