| Literature DB >> 30053237 |
Wen-Jing Wang1,2,3, Yu-Mei Wang1,2,3, Yi Hu1,2,3, Qin Lin1,2,3, Rou Chen1,2,3, Huan Liu1,2,3, Wen-Ze Cao1,2,3, Hui-Fang Zhu1,2,3, Chang Tong1,2, Li Li1,2,3, Lu-Ying Peng1,2,3.
Abstract
Heart diseases (HDs) represent a common group of diseases that involve the heart, a number of which are characterized by high morbidity and lethality. Recently, increasing evidence demonstrates diverse non-coding RNAs (ncRNAs) play critical roles in HDs. However, currently there lacks a systematic investigation of the association between HDs and ncRNAs. Here, we developed a Heart Disease-related Non-coding RNAs Database (HDncRNA), to curate the HDs-ncRNA associations from 3 different sources including 1904 published articles, 3 existing databases [the Human microRNA Disease Database (HMDD), miR2disease and lncRNAdisease] and 5 RNA-seq datasets. The HDs-ncRNA associations with experimental validations curated from these articles, HMDD, miR2disease and part of data from lncRNAdisease were 'direct evidence'. Relationships got from high-through data in lncRNAdisease and annotated differential expressed lncRNAs from RNA-seq data were defined as 'high-throughput associations'. Novel lncRNAs identified from RNA-seq data in HDs had least credibility and were defined as 'predicted associations'. Currently, the database contains 2304 HDs-ncRNA associations for 133 HDs in 6 species including human, mouse, rat, pig, calf and dog. The database also has the following features: (i) A user-friendly web interface for browsing and searching the data; (ii) a visualization tool to plot miRNA and lncRNA locations in the human and mouse genomes; (iii) information about neighboring genes of lncRNAs and (iv) links to some mainstream databases including miRbase, Ensemble and Fantom Cat for the annotated lncRNAs and miRNAs. In summary, HDncRNA provides an excellent platform for exploring HDs related ncRNAs.Database URL: http://hdncrna.cardiacdev.com.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30053237 PMCID: PMC6059085 DOI: 10.1093/database/bay067
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.The whole pipeline of database construction. The titles in blue mark the sources of data in HDncRNA, and the titles in green represent steps of database construction.
The five RNA-seq datasets used in HDncRNA
| Series | Species | Samples | Tissue | Conditions | Condition details (numbers) |
|---|---|---|---|---|---|
| GSE71613 | Human | 8 | Heart | 3 | Control ( |
| GSE46224 | Human | 8 | Heart | 2 | Non-falling control ( |
| GSE52313 | Mouse | 2 | Heart | 2 | Sham ( |
| GSE58455 | Mouse | 8 | Heart | 8 | Sham ( |
| GSE54132 | Rat | 6 | Heart | 2 | Control ( |
ICM: Ischemic cardiomyopathy; TAC: Transverse aortic constriction.
Figure 2.The workflow of lncRNA analysis and prediction. (a) Initial assembly. Raw reads were mapped to reference genome of corresponding species. When Cufflinks ran, −g parameter was dropped to find new transcripts. (b) Cufflinks toolkit usage for novel lncRNAs detection. During novel lncRNA prediction, Cuffcompare and Cuffdiff were used. (c) Predict potential novel lncRNAs from results got from Cufflinks. During the prediction process, threshold of log2FC of FPKM was set as 1.4, length threshold was set as 200 nt, threshold of ORF length by ORF predictor (29) was set as 300 nt and threshold score of CPC (coding potential calculator) (30) was set as 0.
The entries in the HDncRNA database
| Data source | miRNA | lncRNA | circRNA | Total | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Human | Mouse | Rat | Other | Human | Mouse | Rat | Other | Human | Mouse | Rat | ||
| Article | 438 | 162 | 128 | 11 | 53 | 35 | 15 | 2 | 3 | 4 | 1 | 852 |
| Database | 151 | 122 | 45 | 4 | 21 | 288 | 1 | 0 | 0 | 0 | 0 | 632 |
| GEO | 0 | 0 | 0 | 0 | 268 | 520 | 32 | 0 | 0 | 0 | 0 | 820 |
| Total | 1061 | 1235 | 8 | 2304 | ||||||||
Number of annotated and - novel lncRNAs associated with HDs in RNA-seq datasets
| Human | Mouse | Rat | |
|---|---|---|---|
| Annotated lncRNA | 226 | 228 | 10 |
| Unannotated lncRNA | 42 | 292 | 22 |
| Total | 268 | 520 | 32 |
Figure 3.The distribution of the HDs in the HDncRNA.
Figure 4.The workflow of HDncRNA. (a) Search page of HDncRNA, in which two search methods are provided. (b) Table contains search results. In the table, data can be ordered by every column, search in table is also supported. (c) Details webpage of the lncRNA MIAT. The top model is visualization of the MIAT, the genome locus and nearby biological molecules can be seen. In the middle model, detailed information of MIAT is provided. Neighbor genes of MIAT are shown in the bottom model. Relative position and distance between each two genes are provided as well.