Literature DB >> 30380072

LncRNA2Target v2.0: a comprehensive database for target genes of lncRNAs in human and mouse.

Liang Cheng1, Pingping Wang2, Rui Tian2, Song Wang3, Qinghua Guo2, Meng Luo2, Wenyang Zhou2, Guiyou Liu2, Huijie Jiang4, Qinghua Jiang2.   

Abstract

Long non-coding RNAs (lncRNAs) play crucial roles in regulating gene expression, and a growing number of researchers have focused on the identification of target genes of lncRNAs. However, no online repository is available to collect the information on target genes regulated by lncRNAs. To make it convenient for researchers to know what genes are regulated by a lncRNA of interest, we developed a database named lncRNA2Target to provide a comprehensive resource of lncRNA target genes in 2015. To update the database this year, we retrieved all new lncRNA-target relationships from papers published from 1 August 2014 to 30 April 2018 and RNA-seq datasets before and after knockdown or overexpression of a specific lncRNA. LncRNA2Target database v2.0 provides a web interface through which its users can search for the targets of a particular lncRNA or for the lncRNAs that target a particular gene, and is freely accessible at http://123.59.132.21/lncrna2target.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30380072      PMCID: PMC6323902          DOI: 10.1093/nar/gky1051

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Development of RNA sequencing technology leads to recognition of large amount of novel transcripts (1,2). The corresponding large-scale genomic and transcriptomic analysis has shown that over 90% of the human genome is actively transcribed (2), while only ∼2% of the genome can encode proteins (3). In comparison, a large portion of the transcripts are non-coding RNAs (ncRNAs) (4), such as long ncRNAs (lncRNAs) lncRNA plays a crucial role in biological processes (5,6) and even in diseases (7–13). In 2014, Sun et al. presented that BANCR could be a biomarker for poor prognosis of non-small cell lung cancer (NSCLC) by investigating its expression in multiple NSCLC tissues (14). In 2016, Zhu et al. observed that lncRNA MEG3 may be a potential target and therapeutic strategy for diabetes since that the upregulation of MEG3 enhances hepatic insulin resistance (15). Many important databases such as lncRNADisease (16), lncRNASNP2 (17), LncRNAWiki (18), MNDR (19), EVLncRNAs (20), NONCODE (21), LncRNA2Function (22) and TF2LncRNA (23) were developed for collecting the sequence information and functional characteristics of lncRNAs (16–23). For example, LncRNA2Function (22) mapped lncRNAs to Gene Ontology (GO) terms and biological process, lncRNADisease (16) documented associations between lncRNAs and diseases. Accumulating evidence has shown that lncRNA exert its functions by regulating the expression of target genes. Researchers used to infer lncRNA–target relationships by examining whether a candidate gene is differentially expressed after knocking down or overexpressing a specific lncRNA (24). For example, in 2014, Bell et al. found the decreased expression of Myocardin and numerous smooth muscle contractile genes after knocking down the lncRNA SENCR (25). In 2015, LncRNA2Target v1.0 was initially released with these scattered lncRNA–target relationships (24), which was widely used for predicting potential lncRNA–target relationships, lncRNA disease associations, and so on (26–28). Whereas, the differentially expressed genes after knocking down or overexpressing a lncRNA are potential target genes with the lack of direct lncRNA–target interaction evidence. Recent years, some binding experiments such as immunoprecipitation assays, RNA pull-down assays and luciferase reporter assays was used to identify target genes of lncRNAs (29,30). Therefore, lncRNA2Target 2.0 not only collects all differentially expressed genes after knocking down or overexpressing a lncRNA, but also collects all lncRNA–target relationships confirmed by binding experimental technologies such as luciferase reporter assays, immunoprecipitation assays and pull down arrays.

DATA COLLECTION

To update LncRNA2Target, we collected all lncRNA papers and datasets published from 1 August 2014 to 30 April 2018 by searching the PubMed literature database and Gene Expression Omnibus (GEO) with keywords ‘lncRNA’, ‘lincRNA’ and ‘long non-coding RNA’, respectively. Then, over 1500 papers and 140 new available high-throughput microarray or RNA-seq datasets were retrieved. Subsequently, we downloaded all the published papers and manually extracted information about associations between lncRNAs and their target genes. To ensure data quality, lncRNAs were annotated by NCBI GenBank (31), Ensembl (32) and GENCODE (33), and each of these associations was double checked. Besides, we also downloaded all the high-throughput microarray and RNA-seq datasets before and after knocking down or overexpressing a lncRNA of interest and identified all differentially expressed genes. A unified analysis process was developed to reanalyze public microarray and RNA-sequencing datasets before and after knocking down or overexpressing a lncRNA. First, microarray raw data was preprocessed by oligo package (34). For RNA-seq data, NGS QC Toolkit (35) was used for quality filtering and trimming, TopHat (36) was used to map reads to reference genome, and HTSeq (37) was used to count how many reads map to each gene. Then, limma (38) was selected for gene expression normalization and differential expression analysis, due to it can be applied to both microarray and RNAseq data with very similar pipelines. At last, genes with significant expression change (adjusted P ≤ 0.05) were considered as targets of a specific lncRNA.

DATABASE CONTENT

LncRNA2Target v2.0 contains 152 137 lncRNA–target associations from 1047 papers and 224 datasets. The detailed statistics on lncRNA–target associations is shown in Table 1. The total number of lncRNA–target associations increased significantly compared previous version. For example, lncRNA2target v1.0 contains 278 human lncRNA–target associations between 68 lncRNAs and 216 target genes from low-throughput methods such as RT-qPCR and western blot. While lncRNA2target v2.0 contains 1465 human lncRNA–target associations between 356 lncRNAs and 689 target genes from low-throughput methods such as RT-qPCR, western blot, luciferase reporter assays, immunoprecipitation assays and RNA pull-down assays.
Table 1.

Comparison between v1.0 and v2.0 of LncRNA2Target database

SpeciesMethodsNo. of lncRNAs (v1.0, v2.0)No. of target genes (v1.0, v2.0)No. of lncRNA–target relationships (v1.0, v2.0)
HumanLow-throughputa(68, 356)(216, 689)(278, 1465)
High-throughputb(14, 61)(11 389, 28 865)(26 133, 72 102)
MouseLow-throughputa(26, 81)(95, 188)(118, 210)
High-throughputb(109, 134)(14 667, 19 973)(67 034, 78 360)

aLow-throughput: Immunoprecipitation assays, RNA pull down assays, Luciferase reporter assays and so on.

bHigh-throughput: Microarray, RNA-seq.

Comparison between v1.0 and v2.0 of LncRNA2Target database aLow-throughput: Immunoprecipitation assays, RNA pull down assays, Luciferase reporter assays and so on. bHigh-throughput: Microarray, RNA-seq. LncRNA2Target v2.0 contains 35 lncRNA–target associations from low-throughput experiments before 2010, and 33, 59, 59, 84, 133, 129, 296, 579 and 268 associations per year from 2010 to 2018, respectively. The number of lncRNA–target associations from low-throughput experiments has been annually increasing since 2010. Especially in 2017, up to 579 lncRNA–target associations were reported. Figure 1 shows that lncRNA knockdown and overexpression were the most common method to infer lncRNA–target relationships, and the number of lncRNA–target associations validated from luciferase reporter assays, immunoprecipitation assays and RNA pull down assays is also increasing rapidly.
Figure 1.

Distribution of lncRNA–target associations identified by low-throughput methods in the LncRNA2Target v2.0. (A) The number distribution of lncRNA–target associations by year. (B) The number distribution of lncRNA–target associations by identified method. Each node represents one low-throughput method. Node size indicates the number of lncRNA–target associations identified by the method. Thickness of the edge represents the number of lncRNA–target associations that is validated by both of the linked methods.

Distribution of lncRNA–target associations identified by low-throughput methods in the LncRNA2Target v2.0. (A) The number distribution of lncRNA–target associations by year. (B) The number distribution of lncRNA–target associations by identified method. Each node represents one low-throughput method. Node size indicates the number of lncRNA–target associations identified by the method. Thickness of the edge represents the number of lncRNA–target associations that is validated by both of the linked methods.

DATABASE ACCESS

LncRNA2Target v2.0 is publicly available at http://123.59.132.21/lncrna2target. Users can browse, search and download all lncRNA–target relationship data through our web interface. Figure 2 shows the schematic workflow of browsing lncRNA–target associations by lncRNA symbol. lncRNAs of human and mouse will be shown after clicking ‘Human’ and ‘Mouse’ button, respectively. The hyperlink of each lncRNA could be linked to the details of lncRNA information and its target genes. Furthermore, the description of association between lncRNA and gene from the literature could be shown by clicking the hyperlink of the gene. Figure 3 shows the schematic workflow of searching lncRNA–target associations by lncRNA Entrez ID/symbol, lncRNA Ensembl ID and target Entrez ID/symbol. Here, fuzzy search function was provided for retrieving lncRNAs and target genes. The figure gives the searching results of two examples CDKN1A and HOTAIR. In the download page, details of all the lncRNA–target associations could be accessed. In addition, LncRNA2Target web server also provides a submission page, which allows researchers to submit new experimentally verified lncRNA–target associations to the database.
Figure 2.

Schematic workflow of browsing lncRNA–target associations.

Figure 3.

Schematic workflow of searching lncRNA–target associations.

Schematic workflow of browsing lncRNA–target associations. Schematic workflow of searching lncRNA–target associations.

CONCLUSION

LncRNA2Target v2.0 provides a comprehensive resource of lncRNA–target relationships in human and mouse, which aims to facilitate users to browse, search and download all literature-based lncRNA–target associations. The associations based on low-throughput and low-throughput experiments were manually extracted from literature and reanalyzed using a unified analysis process, respectively. Now LncRNA2Target v2.0 contains 152 137 lncRNA–target associations from 1047 papers and 224 datasets. With the development of experimental technologies, more and more relationships between lncRNAs and target genes will be identified. Therefore, to make researchers convenient access the new lncRNA–target associations, we will update lncRNA2Target regularly. We believe that LncRNA2Target v2.0 will be of particular interest to lncRNA community.
  38 in total

1.  A framework for oligonucleotide microarray preprocessing.

Authors:  Benilton S Carvalho; Rafael A Irizarry
Journal:  Bioinformatics       Date:  2010-08-05       Impact factor: 6.937

2.  NGS QC Toolkit: a toolkit for quality control of next generation sequencing data.

Authors:  Ravi K Patel; Mukesh Jain
Journal:  PLoS One       Date:  2012-02-01       Impact factor: 3.240

3.  Association of a novel long non-coding RNA in 8q24 with prostate cancer susceptibility.

Authors:  Suyoun Chung; Hidewaki Nakagawa; Motohide Uemura; Lianhua Piao; Kyota Ashikawa; Naoya Hosono; Ryo Takata; Shusuke Akamatsu; Takahisa Kawaguchi; Takashi Morizono; Tatsuhiko Tsunoda; Yataro Daigo; Koichi Matsuda; Naoyuki Kamatani; Yusuke Nakamura; Michiaki Kubo
Journal:  Cancer Sci       Date:  2010-09-28       Impact factor: 6.716

4.  The prostate cancer-up-regulated long noncoding RNA PlncRNA-1 modulates apoptosis and proliferation through reciprocal regulation of androgen receptor.

Authors:  Zilian Cui; Shancheng Ren; Ji Lu; Fubo Wang; Weidong Xu; Yi Sun; Min Wei; Junyi Chen; Xu Gao; Chuanliang Xu; Jian-Hua Mao; Yinghao Sun
Journal:  Urol Oncol       Date:  2012-01-20       Impact factor: 3.498

Review 5.  Long non-coding RNAs in Huntington's disease neurodegeneration.

Authors:  Rory Johnson
Journal:  Neurobiol Dis       Date:  2011-12-14       Impact factor: 5.996

6.  Genetic variants at the 9p21 locus contribute to atherosclerosis through modulation of ANRIL and CDKN2A/B.

Authors:  Ada Congrains; Kei Kamide; Ryousuke Oguro; Osamu Yasuda; Keishi Miyata; Eiichiro Yamamoto; Tatsuo Kawai; Hiroshi Kusunoki; Hiroko Yamamoto; Yasushi Takeya; Koichi Yamamoto; Miyuki Onishi; Ken Sugimoto; Tomohiro Katsuya; Nobuhisa Awata; Kazunori Ikebe; Yasuyuki Gondo; Yuichi Oike; Mitsuru Ohishi; Hiromi Rakugi
Journal:  Atherosclerosis       Date:  2011-11-19       Impact factor: 5.162

7.  Expression of a noncoding RNA is elevated in Alzheimer's disease and drives rapid feed-forward regulation of beta-secretase.

Authors:  Mohammad Ali Faghihi; Farzaneh Modarresi; Ahmad M Khalil; Douglas E Wood; Barbara G Sahagan; Todd E Morgan; Caleb E Finch; Georges St Laurent; Paul J Kenny; Claes Wahlestedt
Journal:  Nat Med       Date:  2008-06-29       Impact factor: 53.440

8.  Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis.

Authors:  Rajnish A Gupta; Nilay Shah; Kevin C Wang; Jeewon Kim; Hugo M Horlings; David J Wong; Miao-Chih Tsai; Tiffany Hung; Pedram Argani; John L Rinn; Yulei Wang; Pius Brzoska; Benjamin Kong; Rui Li; Robert B West; Marc J van de Vijver; Saraswati Sukumar; Howard Y Chang
Journal:  Nature       Date:  2010-04-15       Impact factor: 49.962

9.  Functional characterization of the plasmacytoma variant translocation 1 gene (PVT1) in diabetic nephropathy.

Authors:  M Lucrecia Alvarez; Johanna K DiStefano
Journal:  PLoS One       Date:  2011-04-22       Impact factor: 3.240

10.  TopHat: discovering splice junctions with RNA-Seq.

Authors:  Cole Trapnell; Lior Pachter; Steven L Salzberg
Journal:  Bioinformatics       Date:  2009-03-16       Impact factor: 6.937

View more
  94 in total

1.  CNVxplorer: a web tool to assist clinical interpretation of CNVs in rare disease patients.

Authors:  Francisco Requena; Hamza Hadj Abdallah; Alejandro García; Patrick Nitschké; Sergi Romana; Valérie Malan; Antonio Rausell
Journal:  Nucleic Acids Res       Date:  2021-07-02       Impact factor: 16.971

Review 2.  DeepWalk based method to predict lncRNA-miRNA associations via lncRNA-miRNA-disease-protein-drug graph.

Authors:  Long Yang; Li-Ping Li; Hai-Cheng Yi
Journal:  BMC Bioinformatics       Date:  2022-02-25       Impact factor: 3.169

3.  The Magnitude of IFN-γ Responses Is Fine-Tuned by DNA Architecture and the Non-coding Transcript of Ifng-as1.

Authors:  Franziska Petermann; Aleksandra Pękowska; Catrina A Johnson; Dragana Jankovic; Han-Yu Shih; Kan Jiang; William H Hudson; Stephen R Brooks; Hong-Wei Sun; Alejandro V Villarino; Chen Yao; Kentner Singleton; Rama S Akondy; Yuka Kanno; Alan Sher; Rafael Casellas; Rafi Ahmed; John J O'Shea
Journal:  Mol Cell       Date:  2019-07-31       Impact factor: 17.970

4.  Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities.

Authors:  Zhen-Hao Guo; Zhu-Hong You; Yan-Bin Wang; De-Shuang Huang; Hai-Cheng Yi; Zhan-Heng Chen
Journal:  Gigascience       Date:  2020-06-01       Impact factor: 6.524

5.  DF-MDA: An effective diffusion-based computational model for predicting miRNA-disease association.

Authors:  Hao-Yuan Li; Zhu-Hong You; Lei Wang; Xin Yan; Zheng-Wei Li
Journal:  Mol Ther       Date:  2021-01-09       Impact factor: 11.454

6.  Identification of key transcriptome biomarkers based on a vital gene module associated with pathological changes in Alzheimer's disease.

Authors:  Tong Zhang; Yang Shen; Yiqing Guo; Junyan Yao
Journal:  Aging (Albany NY)       Date:  2021-05-24       Impact factor: 5.682

7.  Introduction to Bioinformatics Resources for Post-transcriptional Regulation of Gene Expression.

Authors:  Eliana Destefanis; Erik Dassi
Journal:  Methods Mol Biol       Date:  2022

8.  Web tools to perform long non-coding RNAs analysis in oncology research.

Authors:  Shixing Gu; Guangjie Zhang; Qin Si; Jiawen Dai; Zhen Song; Yingshuang Wang
Journal:  Database (Oxford)       Date:  2021-07-23       Impact factor: 3.451

9.  A Non-Coding RNA Network Involved in KSHV Tumorigenesis.

Authors:  Julián Naipauer; Martín E García Solá; Daria Salyakina; Santas Rosario; Sion Williams; Omar Coso; Martín C Abba; Enrique A Mesri; Ezequiel Lacunza
Journal:  Front Oncol       Date:  2021-06-16       Impact factor: 6.244

10.  A structural deep network embedding model for predicting associations between miRNA and disease based on molecular association network.

Authors:  Hao-Yuan Li; Hai-Yan Chen; Lei Wang; Shen-Jian Song; Zhu-Hong You; Xin Yan; Jin-Qian Yu
Journal:  Sci Rep       Date:  2021-06-16       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.