| Literature DB >> 26578589 |
Robert Lesurf1, Kelsy C Cotto1, Grace Wang1, Malachi Griffith2, Katayoon Kasaian3, Steven J M Jones4, Stephen B Montgomery5, Obi L Griffith6.
Abstract
The Open Regulatory Annotation database (ORegAnno) is a resource for curated regulatory annotation. It contains information about regulatory regions, transcription factor binding sites, RNA binding sites, regulatory variants, haplotypes, and other regulatory elements. ORegAnno differentiates itself from other regulatory resources by facilitating crowd-sourced interpretation and annotation of regulatory observations from the literature and highly curated resources. It contains a comprehensive annotation scheme that aims to describe both the elements and outcomes of regulatory events. Moreover, ORegAnno assembles these disparate data sources and annotations into a single, high quality catalogue of curated regulatory information. The current release is an update of the database previously featured in the NAR Database Issue, and now contains 1 948 307 records, across 18 species, with a combined coverage of 334 215 080 bp. Complete records, annotation, and other associated data are available for browsing and download at http://www.oreganno.org/.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26578589 PMCID: PMC4702855 DOI: 10.1093/nar/gkv1203
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Current content of the ORegAnno database. Content statistics are divided by species (A and B), regulatory type (C and D), and data source (E and F).
Figure 2.Comparison of the genomic coverage captured by ORegAnno and the ENSEMBL Regulatory Track. A Venn diagram demonstrates coverage overlaps for human genome assembly version GRCh38/hg38, with sets sized to scale. The ENSEMBL Regulatory Track is divided into two main sets, a track overview set and the transcription factor binding site (TFBS) set.
Figure 3.Capture reagent using ORegAnno sites improves coverage of regulatory regions in human hepatocellular carcinoma. (A) Coverage across the entire ‘regulome’ is visualized as a heatmap for each of the ten HCC cases. WRS samples have greater sequencing read depth across the targeted region, compared with WGS samples. (B) An illustrative IGV (51) screenshot is shown of the TERT promoter for one HCC case. A canonical C228T somatic mutation is observed in the WRS data, but cannot be reliably called in the WGS data.