| Literature DB >> 31390792 |
Felipe Wendt Porto1, Swapna Vidhur Daulatabad1, Sarath Chandra Janga2,3,4.
Abstract
Recent developments in our understanding of the interactions between long non-coding RNAs (lncRNAs) and cellular components have improved treatment approaches for various human diseases including cancer, vascular diseases, and neurological diseases. Although investigation of specific lncRNAs revealed their role in the metabolism of cellular RNA, our understanding of their contribution to post-transcriptional regulation is relatively limited. In this study, we explore the role of lncRNAs in modulating alternative splicing and their impact on downstream protein-RNA interaction networks. Analysis of alternative splicing events across 39 lncRNA knockdown and wildtype RNA-sequencing datasets from three human cell lines-HeLa (cervical cancer), K562 (myeloid leukemia), and U87 (glioblastoma)-resulted in the high-confidence (false discovery rate (fdr) < 0.01) identification of 11,630 skipped exon events and 5895 retained intron events, implicating 759 genes to be impacted at the post-transcriptional level due to the loss of lncRNAs. We observed that a majority of the alternatively spliced genes in a lncRNA knockdown were specific to the cell type. In tandem, the functions annotated to the genes affected by alternative splicing across each lncRNA knockdown also displayed cell-type specificity. To understand the mechanism behind this cell-type-specific alternative splicing pattern, we analyzed RNA-binding protein (RBP)-RNA interaction profiles across the spliced regions in order to observe cell-type-specific alternative splice event RBP binding preference. Despite limited RBP binding data across cell lines, alternatively spliced events detected in lncRNA perturbation experiments were associated with RBPs binding in proximal intron-exon junctions in a cell-type-specific manner. The cellular functions affected by alternative splicing were also affected in a cell-type-specific manner. Based on the RBP binding profiles in HeLa and K562 cells, we hypothesize that several lncRNAs are likely to exhibit a sponge effect in disease contexts, resulting in the functional disruption of RBPs and their downstream functions. We propose that such lncRNA sponges can extensively rewire post-transcriptional gene regulatory networks by altering the protein-RNA interaction landscape in a cell-type-specific manner.Entities:
Keywords: RNA-binding proteins; alternative splicing; cancer; cell-type-specific; functional enrichment; long non-coding RNA; protein binding lncRNA sponges; secondary RNA structure
Mesh:
Substances:
Year: 2019 PMID: 31390792 PMCID: PMC6722645 DOI: 10.3390/genes10080593
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Ensembl annotations of the list of long non-coding RNAs (lncRNAs) and corresponding cell lines in which wildtype and knockout RNA-sequencing data were employed in this study.
| lncRNA Symbol | ENSG ID | Cell line/s Perturbed In | Length (bp) | Strand | Gene Type |
|---|---|---|---|---|---|
| AC016831.7 | ENSG00000285106.2 | U87 | 318,897 | + | LncRNA |
| CCAT1 | ENSG00000247844.1 | HeLa | 11,222 | - | LincRNA |
| CTC-428G20.6 | ENSG00000271797.1 | U87, HeLa | 943 | + | Antisense |
| EPB41L4A-AS1 | ENSG00000224032.7 | K562 | 4292 | + | LncRNA |
| GAL4 | ENSG00000282992.2 | HeLa | 11,263 | - | Protein_coding |
| KB-1471A8.1 | ENSG00000245330.1 | HeLa | 7069 | - | LincRNA |
| LAMTOR5-AS1 | ENSG00000224699.9 | HeLa | 96,701 | + | LncRNA |
| LINC00263 | ENSG00000235823.3 | K562, U87, HeLa | 80,944 | + | LncRNA |
| LINC00680 | ENSG00000215190.9 | K562, U87, HeLa | 15,427 | - | Transcribed_unprocessed_pseudogene |
| LINC00909 | ENSG00000264247.2 | K562, U87, HeLa | 8568 | - | LncRNA |
| LINC00910 | ENSG00000188825.14 | K562 | 50,458 | - | LncRNA |
| MIR142 | ENSG00000284353.1 | K562 | 86 | - | miRNA |
| MIR17HG | ENSG00000215417.13 | HeLa | 6759 | + | LncRNA |
| MIR210HG | ENSG00000247095.3 | U87 | 2797 | - | LncRNA |
| MIR29A | ENSG00000284032.1 | U87 | 63 | - | miRNA |
| PVT1 | ENSG00000249859.11 | U87, HeLa | 392,575 | + | LncRNA |
| RP11-1094M14.11 | ENSG00000267321.3 | U87, HeLa | 6824 | + | LncRNA |
| RP11-126L15.4 | ENSG00000236305.1 | U87 | 15,302 | - | LncRNA |
| RP11-347C12.10 | ENSG00000260219.1 | K562 | 1086 | + | LincRNA |
| RP11-392P7.6 | ENSG00000247498.10 | K562 | 56,919 | + | LncRNA |
| RP11-734K2.4 | ENSG00000270344.3 | U87 | 23,411 | + | LncRNA |
| RP11-96L14.7 | ENSG00000236782.1 | U87 | 1874 | - | Antisense |
| RP11-973D8.4 | ENSG00000258554.1 | U87 | 636 | - | Antisense |
| RP5-1148A21.3 | ENSG00000266680.1 | HeLa | 1403 | - | Antisense |
| SNHG1 | ENSG00000255717.7 | U87 | 3969 | - | LncRNA |
| SNHG12 | ENSG00000197989.14 | U87 | 4594 | - | LncRNA |
| TRAM2-AS1 | ENSG00000225791.7 | U87 | 66,271 | + | LncRNA |
| XLOC_010347 | ENSG00000254430.1 | U87 | 699 | - | Unprocessed pseudogene |
| XLOC_014806 | ENSG00000211813.2 | U87 | 607 | + | TR V gene |
| XLOC_015111 | ENSG00000258687.1 | U87 | 17,537 | - | Antisense |
| XLOC_017263 | ENSG00000259656.1 | HeLa | 1473 | + | LincRNA |
| XLOC_018520 | ENSG00000243007.1 | K562 | 495 | - | Processed pseudogene |
| XLOC_026118 | ENSG00000272396.1 | U87 | 8285 | - | Antisense |
| XLOC_029037 | ENSG00000200718.1 | U87 | 319 | + | LncRNA |
| XLOC_038702 | ENSG00000261519.2 | K562 | 1909 | + | LincRNA |
| XLOC_040566 | ENSG00000109686.18 | U87 | 222,881 | - | Protein_coding |
| XLOC_042889 | ENSG00000271862.1 | K562 | 1588 | - | LincRNA |
| XLOC_051509 | ENSG00000253372.5 | U87 | 8771 | + | LncRNA |
| XLOC_054068 | ENSG00000268674.1 | U87 | 509 | + | Protein_coding |
Figure 1Depiction of the workflow and analysis conducted in this study. Automated scripts were developed to download data, align reads to the corresponding reference, and predict the alternative splicing (AS) events across samples. From further processing of the predicted events, both alternative splicing event information and proximal RNA-binding protein (RBP) binding locations were extracted. Alternative splicing events were then analyzed by generating heatmaps and by performing functional enrichment analysis of the alternatively spliced genes. The overlapping proximal RBP binding locations were extracted by analyzing RBP binding profiles obtained from ENCODE.
Representation of Fisher’s exact test input contingency table for each iteration of RBP binding data. Conservative hypergeometric tests were performed in order to evaluate the importance of reported proximal RBP binding activity to alternatively spliced genes due to lncRNA knock downs. Each iteration of Fisher’s exact test required this input space, which evaluates if a proximal binding with RBP is occurring in significant frequencies or by chance.
| Genetic Regions | Bound | Unbound |
|---|---|---|
| Regions where an RBP may bind across genes in a cell line | X | Y |
| All proximal splicing regions generated from a lncRNA knockdown in a cell | A | B |
Figure 2Summary of cell-type-specific skipped exon (SE) events. A hierarchically clustered heatmap demonstrates that the genes spliced via skipping exon isolate themselves in a mostly cell-type-specific manner. The numbers in the Venn diagram show how many alternatively spliced genes are shared across cell lines. Across all the alternatively spliced events in the three different cell lines, no single genes displayed the same pattern.
Figure 3Summary of cell-type-specific GO functional annotation in skipped exon events. Functional annotation of cell-type-specific gene sets that were affected by SE events. The lines connect the corresponding gene sets across each cell type to respective functional annotation. The functions enriched in each cell type are exclusive.
Figure 4RNA-binding protein eCLIP-based binding profile analyses revealed possible lncRNAs acting as RBP sponges. The intensities are log-scaled false discovery rats (FDRs) of RBP protein interactions with genes alternatively spliced in corresponding lncRNA knockdowns. (A) and (B) showcase the lncRNAs knocked down in respective cell lines (A) K562 and (B) HeLa and the resulting proximal RBP interaction intensity. The grey values represent null values within the heatmaps. (C) An illustration of the proposed RBP sponge model, portraying lncRNA–RBP proposed behavior, in the presence and absence of a lncRNA. C(I) demonstrates when the lncRNA is present, and C(II) demonstrates the proliferated RBP interactions with mRNAs when the corresponding lncRNA is absent or less abundant. (D) Shows the extent of RBPs interacting with lncRNA LINC00909 (data from ENCODE), and supports the lncRNA–RBP sponge model.