Literature DB >> 27899613

NSDNA: a manually curated database of experimentally supported ncRNAs associated with nervous system diseases.

Jianjian Wang1, Yuze Cao1, Huixue Zhang1, Tianfeng Wang1, Qinghua Tian1, Xiaoyu Lu1, Xiaoyan Lu1, Xiaotong Kong1, Zhaojun Liu1, Ning Wang1, Shuai Zhang1, Heping Ma2, Shangwei Ning3, Lihua Wang4.   

Abstract

The Nervous System Disease NcRNAome Atlas (NSDNA) (http://www.bio-bigdata.net/nsdna/) is a manually curated database that provides comprehensive experimentally supported associations about nervous system diseases (NSDs) and noncoding RNAs (ncRNAs). NSDs represent a common group of disorders, some of which are characterized by high morbidity and disabilities. The pathogenesis of NSDs at the molecular level remains poorly understood. ncRNAs are a large family of functionally important RNA molecules. Increasing evidence shows that diverse ncRNAs play a critical role in various NSDs. Mining and summarizing NSD-ncRNA association data can help researchers discover useful information. Hence, we developed an NSDNA database that documents 24 713 associations between 142 NSDs and 8593 ncRNAs in 11 species, curated from more than 1300 articles. This database provides a user-friendly interface for browsing and searching and allows for data downloading flexibility. In addition, NSDNA offers a submission page for researchers to submit novel NSD-ncRNA associations. It represents an extremely useful and valuable resource for researchers who seek to understand the functions and molecular mechanisms of ncRNA involved in NSDs.
© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27899613      PMCID: PMC5210534          DOI: 10.1093/nar/gkw1038

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Nervous system diseases (NSDs) represent a common class of diseases, some of which are characterized by high morbidity and disabilities. For example, stroke causes serious long-term disabilities and has been the third leading cause of death in most developed countries for many decades (1). Other NSDs, such as Alzheimer's and Parkinson's diseases, cause progressive neuron damage in the brain, severely affecting quality of life and putting great burdens on families and society (2). NSDs are remarkably complex, and the underlying causes involve changes at multiple levels, including genome, transcriptome, epigenetic mechanisms and environmental factors (3). Therefore, one of the primary challenges in NSD research is to uncover the precise molecular mechanisms underlying the diseases. Non-coding RNAs (ncRNAs) are a family of functional RNA molecules that include microRNAs (miRNAs), long noncoding RNAs (lncRNA), small interfering RNA (siRNA), small nucleolar RNAs (snoRNA), piwi-interacting RNA (piRNA), among others (4). There is now overwhelming evidence that changes in ncRNA expression levels have been associated with NSDs (5). For example, miRNAs (∼22 nt small ncRNA molecules that repress mRNA target expression) are the hallmark of various NSDs, such as epilepsy (6), multiple sclerosis (7) and Huntington disease (8). lncRNAs (>200 nt RNA molecules) have been found in recent years to play a role in nervous system development, plasticity, evolution and disease (9). For example, the lncRNA HOTAIR acts as an endogenous ‘sponge’ of miR-141, resulting in significant reduction of glioma growth (10). To date, many ncRNA-related databases have been developed, such as HMDD (11), LncRNADisease (12) and Lnc2Cancer (13). However, a high quality and well-curated public resource specifically for NSDs–ncRNAs association data remains unavailable. Currently, a huge amount of ncRNAs and NSDs data have accumulated in a short period of time with the rapid development of this field. However, the data are distributed among many massive articles that makes it inconvenient for researchers who want to further explore the relationship between NSDs and ncRNAs. It is worth noting that researchers have made great efforts to develop databases to explore NSD-related information across several fields. For example, the MethylomeDB database collects DNA methylation profiles of the brain (14), the Allen Brain Atlas integrates gene expression data and neuroanatomical information (15) and the EpilepsyGene and MDPD databases present gene mutation information for epilepsy and Parkinson's disease, respectively (16,17). However, no resource is currently focused on collecting experimentally supported ncRNAs–NSDs associations across various species. To bridge this gap, we developed NSDNA, a manually curated database of experimentally supported NSD and ncRNA associations, which seeks to provide a high quality, comprehensive and specialist resource of NSD-related ncRNA deregulation across various species. The current version of NSDNA documents 24 713 entries of associations between 142 NSDs and 8593 ncRNAs in 11 species, curated from more than 1300 articles. We expect NSDNA to serve as a useful resource for researchers who wish to further investigate the relationship between ncRNAs and NSDs.

DATABASE CONSTRUCTION

To ensure a high quality database, we referred to the steps used to assemble the databases HMDD, Lnc2Cancer and ViRBase (11,13,18). We mainly applied the following three steps in the data collection process: collecting disease and ncRNA information, searching for relevant articles and extracting useful information from the selected articles. First, NSD terminologies were organized based upon the controlled vocabulary of the Medical Subject Heading disease categories that contains ∼600 NSDs (19). The ncRNA information was collected and integrated from various resources: miRNA symbols were taken from miRBase (20), lncRNA symbols were taken from NONCODE (21) and lncRNAdb (22), siRNA symbols were taken from siRecords (23), snoRNA symbols were taken from snoRNA-LBME-db (24) and piRNA symbols were taken from piRNABank (25). In total, we collected ∼600 NSDs and more than 600 000 ncRNA terms in the first step. We also applied the ncRNA category names for those and other ncRNAs, such as ribosomal RNA and transfer RNA. Second, we used a simple script to select all abstracts in the PubMed database using the following keyword combinations: ‘each nervous system disease’ and/or ‘each ncRNA symbol’ or ‘ncRNA category name’ (18). Then, we downloaded all published literature and available supplementary files describing the associations between NSDs and ncRNAs. Third, we manually extracted experimentally supported ncRNA–NSD associations from selected articles by at least two researchers. Finally, all NSDNA data were stored and managed using MySQL. The web interface was built using JSP. The scripts for the data processing programs were written in Java. The web service is running on an Apache Tomcat web server. The NSDNA database is freely available at http://www.bio-bigdata.net/nsdna/ and http://www.bio-bigdata.com/nsdna/.

DATABASE CONTENT

After systematically reviewing more than 1300 published articles, a total of 24 713 entries representing 142 NSDs and 8593 ncRNAs in 11 species were manually collected. The number of ncRNA-associated entries for each species is listed in Table 1. Each entry contains detailed information, including disease and ncRNA name, species, the ncRNA expression pattern (e.g. upregulated, downregulated), experimental methods (e.g. microarray, Real-time PCR), experimentally verified targets of ncRNA, detected tissue (e.g. brain, blood), associated treatment (e.g. levodopa, Interferon-Beta), detailed description and corresponding literature (PubMed ID and publication year). To verify altered ncRNA expression in NSDs, the researchers used various experimental methods, including high-throughput and low-throughput experiments. Therefore, we divided the data into high-throughput and low-throughput experiments, as previously described (26). High-throughput experiments include microarray and next-generation sequencing, whereas low-throughput experiments include real-time PCR, Northern blot, Western blot and luciferase reporter assays, and so on.
Table 1.

Statistics for the NSD-ncRNA entries in the NSDNA database

SpeciesmiRNAlncRNAsiRNAsnoRNApiRNATotal
Homo sapiens12 361422346-16 594
Mus musculus528415951025460
Rattus norvegicus21031204-1052332
Drosophila75----75
Macaca mulatta71----71
Pan troglodytes56----56
Sus scrofa48----48
Caenorhabditis elegans35----35
Marmoset25----25
Gallus gallus16----16
Aedes albopictuscells1----1
Total20 0754502131610724 713

USER INTERFACE

NSDNA provides a user-friendly web interface for an easy database query (Figure 1). In the ‘Browse’ page, users can browse by miRNAs, lncRNAs, siRNAs, snoRNAs, piRNAs, diseases, species and tissue name. By clicking on a particular node, corresponding results are displayed. In the ‘Search’ page, NSDNA allows users to search by miRNAs, lncRNAs, siRNAs, snoRNAs, piRNAs, disease and tissue name. NSDNA enables users to select all or particular species from the species pull-down menu. In addition, NSDNA provides an option that allows users to select low-throughput (Real-time PCR, Northern Blot, Luciferase Reporter Assay, etc.) or high-throughput (microarray, next-generation sequencing, etc.) experimental data or both. The search results are organized as tables in the web interface, and users can obtain more extensive information by clicking on the ‘Details’ hyperlink. NSDNA supports ‘fuzzy’ searching capabilities, where all possible results matching the keywords will be displayed. NSDNA allows users to download all the obtained data in the ‘Download’ page. In addition, NSDNA invites users to submit novel NSD-ncRNA associations for updating the database in the ‘Submit’ page. Once approved by the submission review committee, the submitted record is included in the database and made available to the public in the following release. We provide a detailed database use tutorial in the ‘Help’ page.
Figure 1.

A schematic workflow of the NSDNA database.

A schematic workflow of the NSDNA database. Moreover, NSDNA provides users a prioritization page that enables researchers to prioritize ncRNAs in a specific tissue and disease. For miRNAs, we used low-throughput experiments confirmed miRNA as the gold standard and prioritize the set of miRNAs supported by high-throughput experiments based on semantic similarities of functional genomic data as previously described (27). In brief, this prioritized method ranked candidate miRNAs according to their semantic similarity and distance proximity to the gold standard. For lncRNAs, we used low-throughput experiments confirmed lncRNA as the gold standard and prioritize the set of lncRNAs supported by high-throughput experiments based on functional similarity as previously described (28). Finally, we integrated the tool and the prioritized results into the NSDNA database.

DISCUSSION

NSDs are a common group of disorders, encompassing a broad range of clinical features, such as seizures, dementia and paralysis, among others. Researchers and neurologists have been trying to decipher the precise molecular and cellular regulatory mechanisms involved in these disorders. In recent years, emerging work has focused on exploring potential mechanisms involving ncRNAs in NSDs (29). Consequently, NSD–ncRNA association data have been accumulating rapidly in this decade. However, most data are dispersed in many independent studies. It has become increasingly clear that an integrated collection of NSD-associated ncRNA data are critical for a complete understanding of NSD processes. Thus, we developed NSDNA, an NSD-specific database that provides a comprehensive resource on diverse ncRNA–NSD associations across various species. With the advance of disease-related ncRNA research, several ncRNAs (mainly miRNAs and lncRNAs) associated databases were established, such as miR2Disease (30), HMDD (11) and LncRNADisease (12). These databases are crucial for studying ncRNA functions in human diseases. However, to the best of our knowledge, none of these resources were developed to specifically collect experimentally supported NSD–ncRNA association data. For example, miR2Disease only documents 450 NSD-related entries of associations between 179 miRNAs and 26 NSDs, while LncRNADisease only includes 196 NSD-related entries of associations between 74 lncRNAs and 37 NSDs, and only human data are available in these databases. NSDNA contains general NSDs, nearly 6 and 4 times the number of NSDs than the miR2Disease and LncRNADisease databases, respectively. Compared with miR2Disease, NSDNA also includes data regarding mutations in ncRNA and their association with NSDs. For example, in the NSDNA database, we also found that a SNP (rs11014002) located in miR-603 precursor exhibits a protective effect toward Alzheimer disease risk (31). Moreover, miR2Disease has not been updated since 2009. In addition to recording more NSD-ncRNA associations, NSDNA has several advantages compared to previous studies (Supplementary Table S1). First, NSDNA includes many ncRNAs categories (miRNAs, lncRNAs, siRNAs, snoRNAs and piRNAs), while the above-mentioned databases only include one ncRNA category. Second, NSDNA includes data for many species, and users can retrieve and download all species or particular species data. Third, data on NSD-associated circulating miRNA, treatment-associated miRNA and miRNA target were also added to the NSDNA. Thus, NSDNA is a specialized database that provides a comprehensive resource on ncRNA and NSD associations. We believe that NSDNA will be beneficial to researchers who wish to further dissect mechanisms involved in NSDs. We also counted the number of published articles each year in NSDNA. We found that the number of NSD-associated ncRNA articles have generally increased, suggesting that the research on ncRNAs may be one of the hottest topics in the NSD field in this decade (Figure 2A). Particularly, the number of miRNA- and lncRNA-related articles have been steadily increasing, and hundreds of miRNAs and lncRNAs have been identified due to the development of high-throughput technologies. In recent years, new evidence has shown that circulating miRNA levels appear stable in fluids and can be used as biomarkers for diagnosis and treatment of NSDs (32). Therefore, we focused on circulating miRNA studies and found that the number of circulating miRNAs publications has increased dramatically from 2013 to 2015, suggesting that circulating miRNAs have also become a hot topic in NSD research (Figure 2B).
Figure 2.

(A) Annual publication counts in NSDNA. (B) Annual publication counts of circulating miRNAs in NSDNA.

(A) Annual publication counts in NSDNA. (B) Annual publication counts of circulating miRNAs in NSDNA. Because more than half of the entries were human miRNA-associated data, we constructed a human miRNA–NSD bipartite network, where nodes represent miRNAs or NSDs and the lines denote experimentally supported relationships between miRNAs and NSDs (Supplementary Figure S1A). We listed the number of miRNAs associated with the NSDs, the number of NSDs associated with the miRNAs, and the five most highly connected nodes (Supplementary Figure S1B and S1C). For instance, the disease with the highest connectivity is glioblastoma that is associated with 839 miRNAs. These results indicate that glioblastoma has received wide attention in miRNA-related research and also has a complex regulatory mechanism mediated by miRNA. In addition, hsa-miR-146a is the most highly connected miRNA node, which is associated with 34 NSDs and might be widely associated with NSDs.

CONCLUSION AND FUTURE EXTENSIONS

In conclusion, NSDs are common complex disorders that researchers have been trying to understand for years. Recent studies have shed light on ncRNAs’ role in NSDs and the mechanisms involved. NSDNA not only provides a comprehensive NSD-specialized database with experimental support but also offers a more global perspective on ncRNA functions in NSDs. In the future, we will continue to manually collect newly validated ncRNA–NSD associations and update the database every two months. In addition, we plan to integrate more sources such as functional annotations and provide additional tools such as an ncRNA–NSD association prediction tool. NSDNA will serve as a valuable resource for deciphering ncRNA mechanisms and improving the diagnosis and treatment of NSDs.
  32 in total

1.  The microRNA miR-23b suppresses IL-17-associated autoimmune inflammation by targeting TAB2, TAB3 and IKK-α.

Authors:  Shu Zhu; Wen Pan; Xinyang Song; Yan Liu; Xinrui Shao; Yuanjia Tang; Dong Liang; Dongyi He; Honglin Wang; Wenjun Liu; Yufang Shi; John B Harley; Nan Shen; Youcun Qian
Journal:  Nat Med       Date:  2012-07       Impact factor: 53.440

2.  SM2miR: a database of the experimentally validated small molecules' effects on microRNA expression.

Authors:  Xinyi Liu; Shuyuan Wang; Fanlin Meng; Jizhe Wang; Yan Zhang; Enyu Dai; Xuexin Yu; Xia Li; Wei Jiang
Journal:  Bioinformatics       Date:  2012-12-05       Impact factor: 6.937

3.  Primate-specific miR-603 is implicated in the risk and pathogenesis of Alzheimer's disease.

Authors:  Chi Zhang; Jie Lu; Bing Liu; Qinghua Cui; Yun Wang
Journal:  Aging (Albany NY)       Date:  2016-02       Impact factor: 5.682

Review 4.  Modular regulatory principles of large non-coding RNAs.

Authors:  Mitchell Guttman; John L Rinn
Journal:  Nature       Date:  2012-02-15       Impact factor: 49.962

Review 5.  Circulating biomarkers for gliomas.

Authors:  Manfred Westphal; Katrin Lamszus
Journal:  Nat Rev Neurol       Date:  2015-09-15       Impact factor: 42.937

Review 6.  The Cellular and Molecular Landscapes of the Developing Human Central Nervous System.

Authors:  John C Silbereis; Sirisha Pochareddy; Ying Zhu; Mingfeng Li; Nenad Sestan
Journal:  Neuron       Date:  2016-01-20       Impact factor: 17.173

Review 7.  Mechanisms of Long Non-coding RNAs in Mammalian Nervous System Development, Plasticity, Disease, and Evolution.

Authors:  James A Briggs; Ernst J Wolvetang; John S Mattick; John L Rinn; Guy Barry
Journal:  Neuron       Date:  2015-12-02       Impact factor: 17.173

8.  ViRBase: a resource for virus-host ncRNA-associated interactions.

Authors:  Yanhui Li; Changliang Wang; Zhengqiang Miao; Xiaoman Bi; Deng Wu; Nana Jin; Liqiang Wang; Hao Wu; Kun Qian; Chunhua Li; Ting Zhang; Chunrui Zhang; Ying Yi; Hongyan Lai; Yongfei Hu; Lixin Cheng; Kwong-Sak Leung; Xiaobo Li; Fengmin Zhang; Kongning Li; Xia Li; Dong Wang
Journal:  Nucleic Acids Res       Date:  2014-10-01       Impact factor: 16.971

9.  mirTarPri: improved prioritization of microRNA targets through incorporation of functional genomics data.

Authors:  Peng Wang; Shangwei Ning; Qianghu Wang; Ronghong Li; Jingrun Ye; Zuxianglan Zhao; Yan Li; Teng Huang; Xia Li
Journal:  PLoS One       Date:  2013-01-09       Impact factor: 3.240

10.  HMDD v2.0: a database for experimentally supported human microRNA and disease associations.

Authors:  Yang Li; Chengxiang Qiu; Jian Tu; Bin Geng; Jichun Yang; Tianzi Jiang; Qinghua Cui
Journal:  Nucleic Acids Res       Date:  2013-11-04       Impact factor: 16.971

View more
  20 in total

Review 1.  Invited Review: Long non-coding RNAs: important regulators in the development, function and disorders of the central nervous system.

Authors:  R Cuevas-Diaz Duran; H Wei; D H Kim; J Q Wu
Journal:  Neuropathol Appl Neurobiol       Date:  2019-03-04       Impact factor: 8.090

2.  ncRPheno: a comprehensive database platform for identification and validation of disease related noncoding RNAs.

Authors:  Wenliang Zhang; Guocai Yao; Jianbo Wang; Minglei Yang; Jing Wang; Haiyue Zhang; Weizhong Li
Journal:  RNA Biol       Date:  2020-03-26       Impact factor: 4.652

3.  LncRNA LINC00680 Acts as a Competing Endogenous RNA and Is Associated With the Severity of Myasthennia Gravis.

Authors:  Li Liu; Huixue Zhang; Xiaoyu Lu; Lifang Li; Tianfeng Wang; Shuang Li; Xu Wang; Si Xu; Lei Li; Qian Li; Tingting Yi; Tao Wu; Zhimin Chen; Hongyu Gao; Jianjian Wang; Lihua Wang
Journal:  Front Neurol       Date:  2022-06-21       Impact factor: 4.086

4.  LncRNADisease 2.0: an updated database of long non-coding RNA-associated diseases.

Authors:  Zhenyu Bao; Zhen Yang; Zhou Huang; Yiran Zhou; Qinghua Cui; Dong Dong
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

5.  MNDR v2.0: an updated resource of ncRNA-disease associations in mammals.

Authors:  Tianyu Cui; Lin Zhang; Yan Huang; Ying Yi; Puwen Tan; Yue Zhao; Yongfei Hu; Liyan Xu; Enmin Li; Dong Wang
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

6.  Identifying cortical specific long noncoding RNAs modified by m6A RNA methylation in mouse brains.

Authors:  Yanzhen Nie; Geng G Tian; Longbin Zhang; Trevor Lee; Zhen Zhang; Jing Li; Tao Sun
Journal:  Epigenetics       Date:  2020-12-23       Impact factor: 4.528

7.  NONCODEV5: a comprehensive annotation database for long non-coding RNAs.

Authors:  ShuangSang Fang; LiLi Zhang; JinCheng Guo; YiWei Niu; Yang Wu; Hui Li; LianHe Zhao; XiYuan Li; XueYi Teng; XianHui Sun; Liang Sun; Michael Q Zhang; RunSheng Chen; Yi Zhao
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

8.  Computational Methods and Applications for Identifying Disease-Associated lncRNAs as Potential Biomarkers and Therapeutic Targets.

Authors:  Congcong Yan; Zicheng Zhang; Siqi Bao; Ping Hou; Meng Zhou; Chongyong Xu; Jie Sun
Journal:  Mol Ther Nucleic Acids       Date:  2020-05-21       Impact factor: 8.886

9.  Building the drug-GO function network to screen significant candidate drugs for myasthenia gravis.

Authors:  Shuang Li; Yuze Cao; Lei Li; Huixue Zhang; Xiaoyu Lu; Chunrui Bo; Xiaotong Kong; Zhaojun Liu; Lixia Chen; Peifang Liu; Yang Jiao; Jianjian Wang; Shangwei Ning; Lihua Wang
Journal:  PLoS One       Date:  2019-04-04       Impact factor: 3.240

10.  Arena-Idb: a platform to build human non-coding RNA interaction networks.

Authors:  Vincenzo Bonnici; Giorgio De Caro; Giorgio Constantino; Sabino Liuni; Domenica D'Elia; Nicola Bombieri; Flavio Licciulli; Rosalba Giugno
Journal:  BMC Bioinformatics       Date:  2018-10-15       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.