| Literature DB >> 31732741 |
Dimitra Karagkouni1,2,3, Maria D Paraskevopoulou1,2, Spyros Tastsoglou1,2, Giorgos Skoufos1,2, Anna Karavangeli1,3, Vasilis Pierros1,2, Elissavet Zacharopoulou1,4, Artemis G Hatzigeorgiou1,2,3.
Abstract
DIANA-LncBase v3.0 (www.microrna.gr/LncBase) is a reference repository with experimentally supported miRNA targets on non-coding transcripts. Its third version provides approximately half a million entries, corresponding to ∼240 000 unique tissue and cell type specific miRNA-lncRNA pairs. This compilation of interactions is derived from the manual curation of publications and the analysis of >300 high-throughput datasets. miRNA targets are supported by 14 experimental methodologies, applied to 243 distinct cell types and tissues in human and mouse. The largest part of the database is highly confident, AGO-CLIP-derived miRNA-binding events. LncBase v3.0 is the first relevant database to employ a robust CLIP-Seq-guided algorithm, microCLIP framework, to analyze 236 AGO-CLIP-Seq libraries and catalogue ∼370 000 miRNA binding events. The database was redesigned from the ground up, providing new functionalities. Known short variant information, on >67,000 experimentally supported target sites and lncRNA expression profiles in different cellular compartments are catered to users. Interactive visualization plots, portraying correlations of miRNA-lncRNA pairs, as well as lncRNA expression profiles in a wide range of cell types and tissues, are presented for the first time through a dedicated page. LncBase v3.0 constitutes a valuable asset for ncRNA research, providing new insights to the understanding of the still widely unexplored lncRNA functions.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31732741 PMCID: PMC7145509 DOI: 10.1093/nar/gkz1036
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The table summarizes the LncBase v3.0 content in comparison with the experimental module of the previous database version
| LncBase v3.0 | LncBase v2.0 | ||
|---|---|---|---|
| Database | Total miRNA–lncRNA entries | >500 000 | ∼170 000 |
| Interactions from low-yield methods | 242 | 86 | |
| Interactions from high-throughput methods | ∼239 000 | ∼70 000 | |
| miRNAs in interactions | 1551 | 1419 | |
| Targeted LncRNAs | 24 618 | 8216 | |
| Cell types | 192 | 57 | |
| Tissues | 51 | 22 | |
| Publications | 236 | 59 | |
| lncRNA Resources | GENCODE v30, RefSeq 109–106, Cabili | GENCODE v21, RefSeq 106–104, Cabili | |
| Analyzed high-throughput experiments | Datasets | 322 | 153 |
| Conditions | 150 | 67 | |
| Publications | 79 | 22 | |
| Analysis of AGO-CLIP-Seq experiments | Framework | microCLIP CLIP-Seq guided model | Intersection of AGO clusters with MREs derived from an |
| Experimental Methods | Description of major classes | Reporter genes, northern blot, qPCR, | Reporter genes, northern blot, qPCR, biotin miRNA tagging, CLIP-Seq, AGO-IP, RNA-Seq, microarrays |
| LncRNA expression information | Datasets (cell) | 48 | 38 |
| Datasets (nucleus/cytoplasm) | 55 | - | |
| Interface | Data visualization, content mining and database inter-connection options |
| Support of specific queries, search by location, enhanced filtering options including cell type, tissue, species and method, detailed meta-data, cell-type/tissue specific indication of lncRNA expression, interconnection with DIANA-tools, UCSC graphical support |
Statistics regarding the total entries, miRNA–lncRNA interactions derived from low-/high-throughput methodologies, the number of miRNAs targeting lncRNA transcripts, the number of lncRNAs harboring MREs, distinct cell types/tissues, curated publications and the incorporated lncRNA resources are provided. The number of analyzed (i) high-throughput datasets and unique studied conditions, (ii) datasets to infer lncRNA expression profiles, is reported. The utilized framework for the analysis of AGO-CLIP-Seq data is mentioned. The incorporated experimental techniques, as well as interface improvements, are displayed and marked as bold in case they constitute additions to LncBase v3.0.
Figure 1.LncBase miRNA–lncRNA pairs. Values are plotted in log2 scale. (A) miRNA–lncRNA interactions derived from direct high-throughput techniques per tissue and miRNA species. 85 ± 10% of interactions is spatially classified into sense, antisense, lincRNAs and pseudogenes. (B) Comparison of LncBase v3.0 and LncBase v2.0 experimentally supported interactions.
Figure 2.Snapshot depicting the DIANA-LncBase v3.0 interface. Users can explore different database modules and interactive visualizations through a dedicated menu bar (1). They can retrieve interactions by querying with miRNA and/or gene names (2), genomic location (3), and/or by applying different filtering combinations (4). Interactions can be refined with a series of filtering options including cell type/tissue, experimental methodology, transcript category, species and lncRNA annotation source (4). Result statistics are promptly calculated (5). Interactions can be also sorted in ascending or descending order (6). Gene/miRNA details are complemented with active links to Ensembl, RefSeq, miRBase and RNAcentral (7). Additional details regarding the experimental procedures (8), variant information, where applicable (9), as well as miRNA confidence level indication are provided (10). Interactions are accompanied by miRNA-binding site details (11). Inter-connection with the lncRNA expression module is provided (12). miRNA binding events and MRE-overlapping variant genomic locations can be visualized in an interactive UCSC genome browser (13). Links to other DIANA-Tools are also available (14). Users can easily retrieve query results through a dedicated ‘Download’ button (15).
Figure 3.Snapshot depicting the interface of lncRNA expression profile dedicated page. Users can retrieve expression profiles of lncRNAs within the cell (A) and comparatively between the nuclear and cytoplasmic subcellular compartments (B). They can explore lncRNA abundance by performing queries with gene and/or transcript names (A-1, B-1), as well as combinations of tissues and cell types (A-2, B-2). Gene details, complemented with active links to Ensembl and RefSeq are provided (A-3, B-3). TPM values describing the expression of lncRNAs, accompanied with experimental details are catered to users (A-4). Links directing to the experimentally supported targets module are provided (A-5, B-5). In ‘Localization’ mode, expression TPM values, are provided separately in nucleus and cytoplasm and followed by the RCI value (B-4). Apparent inclination of the sub-localization of lncRNAs is indicated (B-8). Users can easily swap between ‘Localization’ and ‘Expression’ modes and retrieve lncRNA abundance without performing new queries (A-6, B-6). Easy retrieval of query results is also provided through a dedicated ‘Download’ button (A-7, B-7).
Figure 4.Screenshot depicting DIANA-LncBase v3.0 interactive correlation plot. Users may select more than one cell type and/or tissue to explore their relationship based on absence/presence of AGO-CLIP-Seq derived miRNA–lncRNA interactions. For each pair-wise combination, Pearson's r coefficient values are calculated. The current plot depicts correlations between all human cell types that present AGO-CLIP-Seq derived interactions in LncBase v3.0. Higher correlations are observed among some cell types such as lymphoblastoid, bone marrow-derived and mammary gland cell lines.
Figure 5.Screenshot depicting DIANA-LncBase v3.0 interactive bar-plots. The user can select an lncRNA, and optionally cell type(s), tissue(s), to view its expression profiles (A) and its abundance in nuclear cytoplasmic compartments (B). The current bar-plots portray MALAT1 expression. TPM values and RCI values are plotted in ‘LncRNA Expression’ plot (A) and ‘Nucleus/Cytoplasm lncRNA sub-localization’ plot (B), respectively. Negative and positive RCI values denote inclination of the lncRNA expression towards the nucleus and cytoplasm respectively. Users are able to couple RCI values with the nuclear/cytoplasmic lncRNA expression profiles through an inter-connected interactive table (B).