| Literature DB >> 29912458 |
Nicholas Ceglia1,2, Yu Liu1,2, Siwei Chen1,2, Forest Agostinelli1,2, Kristin Eckel-Mahan3, Paolo Sassone-Corsi2,4,5, Pierre Baldi1,2,4,5.
Abstract
Circadian rhythms play a fundamental role at all levels of biological organization. Understanding the mechanisms and implications of circadian oscillations continues to be the focus of intense research. However, there has been no comprehensive and integrated way for accessing and mining all circadian omic datasets. The latest release of CircadiOmics (http://circadiomics.ics.uci.edu) fills this gap for providing the most comprehensive web server for studying circadian data. The newly updated version contains high-throughput 227 omic datasets corresponding to over 74 million measurements sampled over 24 h cycles. Users can visualize and compare oscillatory trajectories across species, tissues and conditions. Periodicity statistics (e.g. period, amplitude, phase, P-value, q-value etc.) obtained from BIO_CYCLE and other methods are provided for all samples in the repository and can easily be downloaded in the form of publication-ready figures and tables. New features and substantial improvements in performance and data volume make CircadiOmics a powerful web portal for integrated analysis of circadian omic data.Entities:
Mesh:
Year: 2018 PMID: 29912458 PMCID: PMC6030824 DOI: 10.1093/nar/gky441
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Data volumes of publicly available circadian omic databases
| Source | Experiments | Tissues | Species | Total data pts. (est.) |
|---|---|---|---|---|
| CircadiOmics | 227 | 23 | 8 | ≈74 600 000 |
| CircaDB | 30 | 15 | 2 | <1 800 000 |
| DIURNAL | 11 | 3 | 3 | ≈3 009 600 |
| BIOCLOCK | 2 | 2 | 2 | ≈3 600 000 |
| CirGRDB | 50 | <20 | 2 | ≈9 000 000 |
Comparison of CircadiOmics with other circadian repositories. Experiments refers to the total number of experimental level datasets from each source. An experimental level dataset should contain at least two time points, more than one replicate at each time point, and time series data for a substantial number of molecular species–at least 1000 for transcriptome and acetylome, and at least 100 for metabolome and proteome–and each replicate. Total data points provide an estimate of the total number of individual measurements taken across different time points, replicates and molecular species. Numbers are collected from internal statistics for CircadiOmics and from publications, or official websites, for the other sources. Details are provided in Supplementary Material.
Figure 1.Dataset collection by species, tissues, experimental conditions and omic categories.
Figure 2.Three-tier Model-View-Controller architecture of the CircadiOmics web portal. Intelligent data discovery supplies candidate datasets for inclusion in the repository using a machine learning filter applied to key word features derived from web crawling published abstracts. BIO_CYCLE results are obtained and stored for all datasets. The user interface sends requests and displays results from the web server allowing for interactive hypothesis generation and scientific discovery.
Figure 3.Visualization of queries for ARNTL, PER1 and CRY1 in a control mouse dataset. Any number of queries, across any number of datasets, can be displayed simultaneously.
Figure 4.Selected examples of the impact of CircadiOmics. (A) CircadiOmics was used to link a multitude of circadian metabolites with functionally related circadian transcripts. Figure taken from Figure 5A of (17). (B) CircadiOmics was used to discover reprogrammed circadian transcripts and metabolites related to inflammatory and energy pathways. Figure taken from Figures 2E, 4B and 5D of (27). (C) Exogenous MYOD1, during MEF myogenic reprogramming, entrains oscillation in MYOG and related targets in absence of oscillation of the core clock (https://www.biorxiv.org/content/early/2017/06/18/151555). (D) Bar heights show the ordered number of oscillating protein coding transcripts with a P ≤ 0.05 in each mouse transcriptomic experiment in the repository. The trend is the cumulative union of oscillating transcripts. Over 93% of possible protein coding transcripts are found to oscillate in at least one tissue or condition across all mouse datasets.