Xiaowei Chen1,2, Yajing Hao1,3, Ya Cui1,3, Zhen Fan1,2, Shunmin He4, Jianjun Luo1, Runsheng Chen1,5. 1. CAS Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences. 2. Core Facility for Protein Research, Institute of Biophysics, Chinese Academy of Sciences. 3. University of Chinese Academy of Sciences, Beijing 100049, China. 4. Key Laboratory of the Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China. 5. Research Network of Computational Biology, RNCB.
Abstract
MOTIVATION: Long non-coding RNAs (lncRNAs) are essential in many molecular pathways, and are frequently associated with disease but the mechanisms of most lncRNAs have not yet been characterized. Genetic variations, including single nucleotide polymorphisms (SNPs) and structural variations, are widely distributed in the genome, including lncRNA gene regions. As the number of studies on lncRNAs grows rapidly, it is necessary to evaluate the effects of genetic variations on lncRNAs. RESULTS: Here, we present LncVar, a database of genetic variation associated with long non-coding genes in six species. We collected lncRNAs from the NONCODE database, and evaluated their conservation. We systematically integrated transcription factor binding sites and m6A modification sites of lncRNAs and provided comprehensive effects of SNPs on transcription and modification of lncRNAs. We collected putatively translated open reading frames (ORFs) in lncRNAs, and identified both synonymous and non-synonymous SNPs in ORFs. We also collected expression quantitative trait loci of lncRNAs from the literature. Furthermore, we identified lncRNAs in CNV regions as prognostic biomarker candidates of cancers and predicted lncRNA gene fusion events from RNA-seq data from cell lines. The LncVar database can be used as a resource to evaluate the effects of the variations on the biological function of lncRNAs. AVAILABILITY AND IMPLEMENTATION: LncVar is available at http://bioinfo.ibp.ac.cn/LncVar CONTACT: rschen@ibp.ac.cnSupplementary information: Supplementary materials are available at Bioinformatics online.
MOTIVATION: Long non-coding RNAs (lncRNAs) are essential in many molecular pathways, and are frequently associated with disease but the mechanisms of most lncRNAs have not yet been characterized. Genetic variations, including single nucleotide polymorphisms (SNPs) and structural variations, are widely distributed in the genome, including lncRNA gene regions. As the number of studies on lncRNAs grows rapidly, it is necessary to evaluate the effects of genetic variations on lncRNAs. RESULTS: Here, we present LncVar, a database of genetic variation associated with long non-coding genes in six species. We collected lncRNAs from the NONCODE database, and evaluated their conservation. We systematically integrated transcription factor binding sites and m6A modification sites of lncRNAs and provided comprehensive effects of SNPs on transcription and modification of lncRNAs. We collected putatively translated open reading frames (ORFs) in lncRNAs, and identified both synonymous and non-synonymous SNPs in ORFs. We also collected expression quantitative trait loci of lncRNAs from the literature. Furthermore, we identified lncRNAs in CNV regions as prognostic biomarker candidates of cancers and predicted lncRNA gene fusion events from RNA-seq data from cell lines. The LncVar database can be used as a resource to evaluate the effects of the variations on the biological function of lncRNAs. AVAILABILITY AND IMPLEMENTATION: LncVar is available at http://bioinfo.ibp.ac.cn/LncVar CONTACT: rschen@ibp.ac.cnSupplementary information: Supplementary materials are available at Bioinformatics online.
Authors: Carla Liaci; Lucia Prandi; Lisa Pavinato; Alfredo Brusco; Mara Maldotti; Ivan Molineris; Salvatore Oliviero; Giorgio R Merlo Journal: Int J Mol Sci Date: 2022-05-30 Impact factor: 6.208