| Literature DB >> 33121433 |
Abstract
BACKGROUND: Circular RNAs (circRNAs) play important roles in regulating gene expression through binding miRNAs and RNA binding proteins. Genetic variation of circRNAs may affect complex traits/diseases by changing their binding efficiency to target miRNAs and proteins. There is a growing demand for investigations of the functions of genetic changes using large-scale experimental evidence. However, there is no online genetic resource for circRNA genes.Entities:
Keywords: Bioinformatics database; Circular RNA; Genetic variant; Systems biology
Mesh:
Substances:
Year: 2020 PMID: 33121433 PMCID: PMC7599103 DOI: 10.1186/s12864-020-07172-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Web interface of the circVAR database. In this version, we focused on single nucleotide variations and small insertions and deletions (INDELs). The page implemented data downloading, browsing and searching. For each circRNA in the database, the gene information is shown on the top right. In total, we integrated 295,526 circRNAs based on the GRCH 37 coordinates from circBase, circNet, and circRNAdb. By mapping to GRCH 38 coordinates, we identified 295,073 circRNAs. The basic information for each circRNA includes the circRNA ID in our database, data source, genomic locations for both GRCH 37 and GRCH 38, mapped genes and pre-computed genetic variants from GWASCatalog, ClinVAR, COSMIC, and the 1000 Genomes Project. Due to the huge volume of data, we provided bulk downloading for all pre-computed genetic variants from the four genetic variant databases
Fig. 2Data integration and chromosome distribution of human circRNAs. a The overlapping relationship of the three integrated human circRNA datasets based on the exact chromosome location. b The chromosome distribution of all unique circRNAs. The number of circRNAs in each chromosome is shown in the blue bar chart. The orange line is the ratio between the number of circRNAs and the length of the chromosome
Fig. 3circRNAs mapped to the 1000 Genomes Project and the clinVAR database. a The chromosome distribution of common variants from the 1000 Genomes Project. SNP: single nucleotide polymorphism; MNP: multiple nucleotide polymorphism; INDEL: insertion and deletion; SV: structure variant. The orange line is the ratio between the number of circRNA variants from the 1000 Genomes Project and the length of the chromosome. b The chromosome distribution of circRNA variants with phenotypes from the clinVAR database. The orange line is the ratio between the number of circRNA variants from the ClinVAR database and the length of the chromosome
Fig. 4The circRNA variants from GWAS and cancer genomics studies. a The circRNAs were mapped to the GWASCatalog to annotate their potential association with phenotypes. The Z-scores were transformed from the numbers for a specific variant type across different chromosomes. The polarity of the Z-score reflects whether the result is higher or lower than the average. b The circRNAs were mapped to the COSMIC database to annotate their potential association with various cancers. The Z-scores were transformed from the numbers for a specific variant type across different chromosomes. The polarity of the Z-score reflects whether the results is higher or lower than the average