Literature DB >> 30357418

Editome Disease Knowledgebase (EDK): a curated knowledgebase of editome-disease associations in human.

Guangyi Niu1,2,3, Dong Zou1,2, Mengwei Li1,2,3, Yuansheng Zhang1,2,3, Jian Sang1,2,3, Lin Xia1,2,3, Man Li1,2,3, Lin Liu1,2,3, Jiabao Cao1,2,3, Yang Zhang1,2,3, Pei Wang1,2,3, Songnian Hu2,3,4, Lili Hao1,2, Zhang Zhang1,2,3,4.   

Abstract

RNA editing, as an essential co-/post-transcriptional RNA modification type, plays critical roles in many biological processes and involves with a variety of human diseases. Although several databases have been developed to collect RNA editing data in both model and non-model animals, there still lacks a resource integrating associations between editome and human disease. In this study, we present Editome-Disease Knowledgebase (EDK; http://bigd.big.ac.cn/edk), an integrated knowledgebase of RNA editome-disease associations manually curated from published literatures. In the current version, EDK incorporates 61 diseases associated with 248 experimentally validated abnormal editing events located in 32 mRNAs, 16 miRNAs, 1 lncRNA and 11 viruses, and 44 aberrant activities involved with 6 editing enzymes, which together are curated from more than 200 publications. In addition, to facilitate standardization of editome-disease knowledge integration, we propose a data curation model in EDK, factoring an abundance of relevant information to fully capture the context of editome-disease associations. Taken together, EDK is a comprehensive collection of editome-disease associations and bears the great utility in aid of better understanding the RNA editing machinery and complex molecular mechanisms associated with human diseases.

Entities:  

Mesh:

Year:  2019        PMID: 30357418      PMCID: PMC6323952          DOI: 10.1093/nar/gky958

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

RNA editing, as an essential co-/post-transcriptional modification, alters the base pairing properties of RNA molecules by means of substitutions, insertions and/or deletions (1–3). The exponential growth of reported RNA editing events has revealed the pervasive nature of RNA editing in human transcriptome (4–6), indicating the functional implications in gene expression and regulation (7). Evidence has accumulated that RNA editing exerts critical roles in affecting mRNA stability (8), splicing (9), recoding protein sequence (10), regulating the biogenesis and base-pairing interaction of ncRNAs (11), as well as nuclear export and localization (12). More importantly, a growing body of research has shown that RNA editing is of broad physiologic significance in mammals (13–17). Aberrant editing enzyme activity and dysfunction of editing in either coding or noncoding region have been revealed to be correlated with a wide range of human diseases, including cardiovascular diseases (8), neurological and behavioral abnormalities (18), cancer (19), autoimmune disorders (20), viral infections (21), aging-related diseases (22), and metabolic diseases (23). Collectively, RNA editing associates closely with a variety of human diseases, leading to a demand for a comprehensive collection of the associations between RNA editing and human diseases. With the advancement of next-generation sequencing technologies, millions of high-confidence RNA editing sites (5,6,24) have been identified across a variety of tissues in human. Accordingly, efforts have been made to develop different databases that integrate various RNA editing data, including REDIportal (24), dbRES (25), DARNED (26), RADAR (27) and LNCediting (28). However, none of them is devoted to the comprehensive collection of editome-disease associations in human. Specifically, dbRES focuses on collection of experimentally validated editing sites for different organisms, DARNED and RADAR collect A-to-I editing events based on either EST data or a limited number of RNA-Seq data, REDIportal houses a large number of A-to-I editing sites across 30 human tissues identified from thousands of RNA-Seq experiments, and LNCediting characterizes and predicts the effects of all editing sites in lncRNAs across model animals. Clearly, all these resources provide valuable annotations for RNA editing events but ignore the associations between RNA editing and diseases. Considering the rapid accumulation of knowledge on RNA editome-disease associations, there is a great need to build a specialized database that integrates disease-associated RNA editing data. Toward this end, here we present Editome-Disease Knowledgebase (EDK; http://bigd.big.ac.cn/edk), a curated knowledgebase of RNA editome-disease associations. Unlike extant relevant data-oriented databases, EDK features comprehensive integration of knowledge on abnormal RNA editing events and aberrant RNA editing enzyme activities, which are associated with human diseases. Therefore, EDK would be of great help to provide a landscape of editome-disease associations and to explore the RNA editing machineries underlying different human diseases.

MATERIALS AND METHODS

Literatures were initially retrieved from NCBI PubMed by specifying keywords ‘RNA editing’ and ‘disease’ and then manually filtered and double-checked by expert curators to remove irrelevant publications. Considering that the reported diseases are mostly associated with either aberrant activities of editing enzymes or abnormal editing events occurring on transcribed coding/noncoding genes and viruses, a curation model was proposed to standardize the curation process, through literature curation of over 200 relevant publications. Based on this model, editome-disease associations were curated from four aspects, namely, diseases, editing enzymes, edited genes, and viruses. For each aspect, a variety of curated information were further collected, such as disease name, gene symbol, miRNA ID, virus name, editing enzyme, editing type, molecular effects, phenotype, etc. Then, control vocabularies and standardized descriptive terms were abstracted and categorized to depict each item. More detailed information can be found at http://bigd.big.ac.cn/edk/faq. To unify disease name and definition, terms or identifiers of Disease Ontology (DO), Experimental Factor Ontology (EFO), Online Mendelian Inheritance in Man (OMIM) and National Cancer Institute (NCI) were retrieved and mapped to the corresponding diseases. Consequently, all RNA editing associated diseases were grouped into predefined categories based on the modified primary and secondary classification systems of DO. In addition, external links to several related resources, such as REDIportal (24), RADAR (27), UniProt (29), GenBank (30), GeneCard (31), miRBase (32), GTEx (33), etc., were also provided for convenience. Furthermore, enrichment analysis was done to deduce the pathways affected by abnormal RNA editing using IPA version 2.0. The ‘Core Analysis’ function in IPA was used to interpret the edited genes in the context of biological processes, pathways and networks. EDK was implemented using Spring Boot as back-end framework. JSP, Bootstrap, SemanticUI and JQuery, which provide a series of templates for designing web pages with consistent interface components, were adopted as front-end framework. Curated data were stored in MySQL database and data visualization was rendered by HighCharts and DataTables. EDK organizes all curated editome-disease associations at different levels; it provides a whole picture of editome-disease associations with conclusion items, presents detailed association information in terms of edited mRNA/miRNA/lncRNA, editing enzyme and virus, and summarizes different editome-disease associations for a specific disease, edited gene, editing enzyme and virus.

DATABASE CONTENTS AND USAGE

EDK features comprehensive integration of editome-disease associations as well as their relevant data manually curated from published literatures. Collectively, the current version of EDK incorporates 61 diseases (grouped into 10 categories) associated with 248 experimentally validated abnormal editing events that occur in 32 mRNAs, 16 miRNAs, 1 lncRNA and 11 viruses, as well as 44 aberrant activities involved with 6 editing enzymes, which are all manually curated from over 200 publications (Figure 1). Therefore, EDK serves as a curated knowledgebase of editome-disease associations, providing a comprehensive landscape of RNA editing in human diseases.
Figure 1.

Statistics of curated editome-disease associations in EDK (as of August 2018).

Statistics of curated editome-disease associations in EDK (as of August 2018). In the current implementation, EDK integrates 61 editome-associated diseases, including 26 cancers (43%), 15 nervous system diseases (24%), 6 viral infectious diseases (10%), 3 gastrointestinal system diseases (5%), 2 cognitive disorders (3%), 1 diseases of metabolism (2%), 1 genetic diseases (2%), 2 cardiovascular system disease (3%), 2 skin disease (3%) and 3 other types (5%) (http://bigd.big.ac.cn/edk/browse/disease) (Figure 2A). Among these diseases, cancers and nervous system diseases are extensively studied, which together cover nearly 70% of all collected diseases in EDK. Interestingly, one disease may be associated with aberrant RNA editing mechanisms on multiple levels. Taking ‘Glioblastoma’ as an example, it is associated with the abnormal editing events that occur on the mRNA of CDC14B and the miRNA of hsa-miR-589-3p/hsa-miR-221/hsa-miR-222 as well as the aberrant activity of the editing enzyme ADAR2 (http://bigd.big.ac.cn/edk/disease/Glioblastoma).
Figure 2.

Screenshots of EDK webpages, including browse pages of (A) editome-associated human diseases, (B) endogenous edited genes, (C) editing enzymes and (D) exogenous viruses.

Screenshots of EDK webpages, including browse pages of (A) editome-associated human diseases, (B) endogenous edited genes, (C) editing enzymes and (D) exogenous viruses. To clarify the associations between aberrant edited genes and diseases, EDK categorizes disease-associated edited genes into 3 types, namely, mRNA, miRNA and lncRNA (http://bigd.big.ac.cn/edk/browse/RNA) (Figure 2B). To date, 153 aberrant editing sites in 32 mRNAs (AZIN1, CTSS, GRIA1, FLNB, BLCAP, etc.), 38 sites in 16 miRNAs (mir-214, miR-200b, miR-455-5p, etc.) and 1 site in 1 lncRNA (lncPCA3), are annotated to be associated with 46 human diseases. IPA analysis shows that the curated edited coding/noncoding genes and their target genes are significantly enriched in a number of networks and categories, which are relevant to cell-to-cell signalling and interaction, nervous system development, cell cycle and embryo development (http://bigd.big.ac.cn/edk/ipa). Regarding the mRNA type, enzyme families of ADARs (adenosine deaminase acting on RNA), APOBECs (apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like) and AID (activation-induced cytidine deaminase) are responsible for the characterized abnormal editing events (mainly A-to-I and C-to-U types). Most abnormal editing events, locating in the regions of CDS and 3′-UTR, induce several molecular consequences, such as nonsynonymous substitution, alternative splicing, mRNA stability change and mRNA expression change. One typical example is that the clustered RNA editing events within 3′-UTR region affect the stability of CTSS mRNA (a proinflammatory gene), which leads to cardiovascular disease (http://bigd.big.ac.cn/edk/mRNA?name=CTSS). Of note, both increased and decreased editing patterns can contribute to human diseases. For instance, decreased editing at 11 sites of CDC14B transcripts promotes malignant glioblastoma (http://bigd.big.ac.cn/edk/mRNA?name=CDC14B), and increased editing of AZIN1 (encoding antizyme inhibitor 1) promotes hepatocellular carcinoma (http://bigd.big.ac.cn/edk/mRNA?name=AZIN1). As to the miRNA type, both ADAR1 and ADAR2 are responsible for abnormal editing events, which may further affect diseases by retargeting miRNA or regulating the expression of miRNA itself or tumour-related target genes (34). One representative example is that hypo-editing within miR-455-5p promotes melanoma growth and metastasis by inhibiting the tumour suppressor gene CPEB1 (http://bigd.big.ac.cn/edk/miRNA/hsa-miR-455-5p). Moreover, it is noted that abnormal editing in long non-coding RNA has been identified to be associated with disease; there is only one example to date collected in EDK that the editing of the duplex of intronic long noncoding RNA lncPCA3 and pre-mRNA PRUNE2 augments the malignance of prostate cancer (35) (http://bigd.big.ac.cn/edk/lncRNA/PCA3). To delineate the associations between aberrant editing enzymes and diseases, EDK collects 6 types of editing enzymes, including ADAR1, ADAR2 and ADAR3 in the ADAR family, APOBEC1 and APOBEC3 in the APOBEC family, and AID, which together associate with 27 diseases based on 44 independent experimental evidences (http://bigd.big.ac.cn/edk/browse/enzyme) (Figure 2C). As depicted on the enzyme category page, genetic mutations occurring in editing enzymes as well as aberrant expression of editing enzymes are both shown to be associated with human diseases. Notably, the enzyme-mediated mutagenesis is highly active in human cancers because abnormal editing may induce activation and/or inactivation of various tumour-related genes. For instance, over-expression of ADAR1 promotes ESCC tumour migration and invasion by increasing the editing level of AZIN1 and FLNB (36) (http://bigd.big.ac.cn/edk/enzyme/ADAR1). In addition to cancer, individual cases of other diseases related to aberrant editing enzyme are also curated and listed in EDK. One example is that mutation in ADAR1 is associated with the hereditary autoimmune diseases Aicardi–Goutieres syndrome (AGS) (20). However, it should be noted that not all editing enzyme aberrances affect diseases dependant on RNA editing mechanism, which means that editing enzymes may also participate in other cellular pathways to regulate physiological and pathological processes (37). For example, mutation in ADAR1 promotes HTLV-1 infection by the inhibition of PKR phosphorylation, which eventually causes anxiety disorder (38). Additionally, RNA editing events occurring on exogenous viral dsRNA are also associated with some human diseases, especially the infectious diseases. In the current version, EDK also incorporates 13 editing events, residing in 11 edited viruses and associating with 9 diseases (among which 5 are infectious diseases) (http://bigd.big.ac.cn/edk/browse/virus) (Figure 2D). Based on the Baltimore classification, all collected viruses are classified into three types, namely, (–)ssRNA, dsDNA-RT and dsDNA virus. It is also found that ADARs, APOBECs and AID are responsible for the characterized editing events in these viruses. Different from the edited mRNA/miRNA/lncRNA, RNA editing events in virus includes not only substitution but also insertion of single or multiple nucleotides. Strikingly, most RNA editing events occur in viral structural genes, such as phosphoprotein gene and glycoproteins, which may change virus structure and affect virus function by recoding virus genes, inducing the generation of new transcripts or a new virus subclass, promoting or inhibiting virus replication, helping virus to escape from the immune resistance or inhibiting antiviral activity (37). For instance, editing in the phosphoprotein gene of measles virus may affect its pathogenicity (http://bigd.big.ac.cn/edk/virus/Measles%20virus).

DISCUSSION AND FUTURE DEVELOPMENTS

EDK is a comprehensive knowledgebase that provides the landscape of RNA editing events in a wide range of human diseases. Based on manual curation from more than 200 publications, the current implementation of EDK integrates 61 diseases that are associated with 248 experimentally validated abnormal editing events in 32 mRNAs, 16 miRNAs, 1 lncRNA and 11 viruses, as well as 44 aberrant enzyme activities involving with six editing enzymes. As a core resource of the BIG Data Center (http://bigd.big.ac.cn) (39,40), EDK is aimed at serving as an open access resource for dissecting the complex RNA editing mechanisms underlying human diseases. Considering the increasing volume of RNA editing studies in human diseases, we will frequently integrate experimentally validated RNA editome-disease associations based on literature curation from newly published literatures. Future directions of EDK also include development of community-curation interfaces to enable the scientific community in contributing their research data and/or expertise.
  40 in total

1.  RNA editing changes the lesion specificity for the DNA repair enzyme NEIL1.

Authors:  Jongchan Yeo; Rena A Goodman; Nicole T Schirle; Sheila S David; Peter A Beal
Journal:  Proc Natl Acad Sci U S A       Date:  2010-11-10       Impact factor: 11.205

Review 2.  A-to-I RNA editing and human disease.

Authors:  Stefan Maas; Yukio Kawahara; Kristen M Tamburro; Kazuko Nishikura
Journal:  RNA Biol       Date:  2006-01-12       Impact factor: 4.652

3.  RNA editing in brain controls a determinant of ion flow in glutamate-gated channels.

Authors:  B Sommer; M Köhler; R Sprengel; P H Seeburg
Journal:  Cell       Date:  1991-10-04       Impact factor: 41.582

4.  Genetic variability and mRNA editing frequencies of the phosphoprotein genes of wild-type measles viruses.

Authors:  B Bankamp; E N Lopareva; J R Kremer; Y Tian; M S Clemens; R Patel; A L Fowlkes; J R Kessler; C P Muller; W J Bellini; P A Rota
Journal:  Virus Res       Date:  2008-05-19       Impact factor: 3.303

Review 5.  Molecular diversity through RNA editing: a balancing act.

Authors:  Sanaz Farajollahi; Stefan Maas
Journal:  Trends Genet       Date:  2010-04-13       Impact factor: 11.639

6.  Contribution of major cardiovascular risk factors to familial premature coronary artery disease: the GENECARD project.

Authors:  Vincent Jomini; Séverine Oppliger-Pasquali; Vincent Wietlisbach; Nicolas Rodondi; Valérie Jotterand; Fred Paccaud; Roger Darioli; Pascal Nicod; Vincent Mooser
Journal:  J Am Coll Cardiol       Date:  2002-08-21       Impact factor: 24.094

7.  RNA editing: a driving force for adaptive evolution?

Authors:  Willemijn M Gommans; Sean P Mullen; Stefan Maas
Journal:  Bioessays       Date:  2009-10       Impact factor: 4.345

8.  Two novel transcripts for human endothelin B receptor produced by RNA editing/alternative splicing from a single gene.

Authors:  Akito Tanoue; Taka-Aki Koshimizu; Masako Tsuchiya; Kayano Ishii; Makiko Osawa; Morihiro Saeki; Gozoh Tsujimoto
Journal:  J Biol Chem       Date:  2002-06-21       Impact factor: 5.157

9.  miRBase: microRNA sequences, targets and gene nomenclature.

Authors:  Sam Griffiths-Jones; Russell J Grocock; Stijn van Dongen; Alex Bateman; Anton J Enright
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  dbRES: a web-oriented database for annotated RNA editing sites.

Authors:  Tao He; Pufeng Du; Yanda Li
Journal:  Nucleic Acids Res       Date:  2006-11-06       Impact factor: 16.971

View more
  10 in total

1.  Database Resources of the National Genomics Data Center in 2020.

Authors: 
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

2.  Databases for RNA Editing Collections.

Authors:  Claudio Lo Giudice; Luigi Mansi; Graziano Pesole; Ernesto Picardi
Journal:  Methods Mol Biol       Date:  2021

3.  Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2022.

Authors: 
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

4.  Editome landscape of CCM-derived endothelial cells.

Authors:  Concetta Scimone; Simona Alibrandi; Luigi Donato; Concetta Alafaci; Antonino Germanò; Sergio L Vinci; Rosalia D'Angelo; Antonina Sidoti
Journal:  RNA Biol       Date:  2022-01       Impact factor: 4.766

Review 5.  Challenges in the construction of knowledge bases for human microbiome-disease associations.

Authors:  Varsha Dave Badal; Dustin Wright; Yannis Katsis; Ho-Cheol Kim; Austin D Swafford; Rob Knight; Chun-Nan Hsu
Journal:  Microbiome       Date:  2019-09-05       Impact factor: 14.650

6.  BrainBase: a curated knowledgebase for brain diseases.

Authors:  Lin Liu; Yang Zhang; Guangyi Niu; Qianpeng Li; Zhao Li; Tongtong Zhu; Changrui Feng; Xiaonan Liu; Yuansheng Zhang; Tianyi Xu; Ruru Chen; Xufei Teng; Rongqin Zhang; Dong Zou; Lina Ma; Zhang Zhang
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

7.  Gene Expression Nebulas (GEN): a comprehensive data portal integrating transcriptomic profiles across multiple species at both bulk and single-cell levels.

Authors:  Yuansheng Zhang; Dong Zou; Tongtong Zhu; Tianyi Xu; Ming Chen; Guangyi Niu; Wenting Zong; Rong Pan; Wei Jing; Jian Sang; Chang Liu; Yujia Xiong; Yubin Sun; Shuang Zhai; Huanxin Chen; Wenming Zhao; Jingfa Xiao; Yiming Bao; Lili Hao; Zhang Zhang
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

8.  Clinical Applications of RNA Editing Technology for the Early Detection of Cancer and Future Directions.

Authors:  Mujib Ullah; Asma Akbar; Gustavo Yannarelli
Journal:  Technol Cancer Res Treat       Date:  2020 Jan-Dec

9.  Genetic variation and microRNA targeting of A-to-I RNA editing fine tune human tissue transcriptomes.

Authors:  Eddie Park; Yan Jiang; Lili Hao; Jingyi Hui; Yi Xing
Journal:  Genome Biol       Date:  2021-03-09       Impact factor: 13.583

10.  MiREDiBase, a manually curated database of validated and putative editing events in microRNAs.

Authors:  Gioacchino P Marceca; Rosario Distefano; Carlo M Croce; Giovanni Nigita; Luisa Tomasello; Alessandro Lagana; Francesco Russo; Federica Calore; Giulia Romano; Marina Bagnoli; Pierluigi Gasparini; Alfredo Ferro; Mario Acunzo; Qin Ma
Journal:  Sci Data       Date:  2021-08-04       Impact factor: 8.501

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.