| Literature DB >> 26043858 |
Soo Youn Lee, Je-Gun Joung, Chan Hee Park, Ji Hye Park, Ju Han Kim.
Abstract
The post-transcriptional sequence modification of transcripts through RNA editing is an important mechanism for regulating protein function and is associated with human disease phenotypes. The identification of RNA editing or RNA-DNA difference (RDD) sites is a fundamental step in the study of RNA editing. However, a substantial number of false-positive RDD sites have been identified recently. A major challenge in identifying RDD sites is to distinguish between the true RNA editing sites and the false positives. Furthermore, determining the location of condition-specific RDD sites and elucidating their functional roles will help toward understanding various biological phenomena that are mediated by RNA editing. The present study developed RNA-sequence comparison and annotation for RNA editing (RCARE) for searching, annotating, and visualizing RDD sites using thousands of previously known editing sites, which can be used for comparative analyses between multiple samples. RCARE also provides evidence for improving the reliability of identified RDD sites. RCARE is a web-based comparison, annotation, and visualization tool, which provides rich biological annotations and useful summary plots. The developers of previous tools that identify or annotate RNA-editing sites seldom mention the reliability of their respective tools. In order to address the issue, RCARE utilizes a number of scientific publications and databases to find specific documentations respective to a particular RNA-editing site, which generates evidence levels to convey the reliability of RCARE. Sequence-based alignment files can be converted into VCF files using a Python script and uploaded to the RCARE server for further analysis. RCARE is available for free at http://www.snubi.org/software/rcare/.Entities:
Mesh:
Year: 2015 PMID: 26043858 PMCID: PMC4460956 DOI: 10.1186/1755-8794-8-S2-S8
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Database resources for RNA editing, listing the sources of data currently included in RCARE, specific entries in the database, and the total unique RNA editing site covered by each data type.
| No. of tissue types | No. of RNA editing sites | No. of Alu sequences | No. of references | |
|---|---|---|---|---|
| DARNED | 29 | 314,880 | 15,783 | 34 |
| ENCODE | 27 | 6,840 | 347 | 18 |
| RADAR | 30 | 291,901 | 14,318 | 31 |
| Bahn et al. [ | 27 | 12,810 | 2,916 | 15 |
| Li et al. [ | 1 | 1 | 0 | 1 |
| Total | 114 | 626,432 | 33,364 | 99 |
Figure 1RCARE RNA-Seq data processing steps and annotation result visualization. A) Conversion utility RNA-Seq data processing pipeline. B) Web interface for annotation and comparison analyses. C) Summary plots of annotation results
Figure 2Data compositions. A) Venn diagram representing the number of RNA editing sites shared between DARNED, ENCODE, and Bahn et al. [5]. B) Number of RNA editing sites for each sample type. C) Bar plot representing the number of RNA editing sites for each editing type. D) Bar plot representing the number of RNA editing sites for each chromosome. E) Number of RNA editing sites for each genomic feature.
Format of RCARE annotation Results.
| Column | Column | Definition | References |
|---|---|---|---|
| 1 | Chr | Chromosome of the RNA editing site in the reference genome. | [ |
| 2 | Pos | Coordinate of the RNA editing site in the reference genome. | |
| 3 | In DNA | Base of the RNA editing site in the DNA reference sequence. | |
| 4 | In RNA | Base of the RNA editing site in the RNA sequence of sample. | |
| 5 | Gene | Gene name to which the RNA editing site belongs. | |
| 6 | The | [ | |
| 7 | Strand | + for positive strand; - for negative strand. | [ |
| 8 | Source | This field contains information regarding the tissue source from which the RNA editing instance was obtained. | |
| 9 | PubMed ID | This field provides the reference article from which the RNA editing data was extracted. | |
| 10 | Alu | This field provides information of Alu at the RNA editing site. | RepeatMasker |
| 11 | Data reference | Reference database. | Each database or reference |
| 12 | ENSG | Ensembl Gene ID. | GTF( |
| 13 | ENST | Ensembl Transcript ID. | |
| 14 | ENSE | Ensembl Exon ID. | GRCH37.17) in Ensembl |
| 15 | Genomic feature | Genomic feature of the RNA editing site. | [ |
| 16 | Synonymous or nonsynonymous | Synonymous or nonsynonymous substitutions at the RNA editing site. | [ |
| 17 | Noncoding RNA | This field indicates whether the location of an RNA editing site is in ncRNA. | GTF ( |
Evidence level classification scheme for RNA editing.
| Class | Composition of evidence | Count | |
|---|---|---|---|
| A | 5 | DARNED, ENCODE, Bahn et al. [ | 33 |
| B | 4 | DARNED, RADAR, Bahn et al. [ | 2,204 |
| DARNED, ENCODE, RADAR, Alu | 23 | ||
| DARNED, ENCODE, Bahn et al. [ | 106 | ||
| C | 3 | DARNED, Bahn et al. [ | 7,037 |
| DARNED, Bahn et al. [ | 679 | ||
| DARNED, ENCODE, RADAR | 550 | ||
| DARNED, RADAR, Alu | 12,038 | ||
| DARNED, RADAR, Li et al. [ | 1 | ||
| ENCODE, Bahn et al. [ | 1 | ||
| ENCODE, RADAR, Alu | 20 | ||
| D | 2 | DARNED, Alu | 806 |
| DARNED, RADAR | 269,547 | ||
| DARNED, Bahn et al. [ | 2,749 | ||
| ENCODE, Alu | 271 | ||
| ENCODE, RADAR | 341 | ||
| ENCODE, Bahn et al. [ | 1 | ||
| E | 1 | DARNED | 19,107 |
| ENCODE | 5,494 | ||
| Total | 321,008 |