CircaDB: a database of mammalian circadian gene expression profiles.

Angel Pizarro, Katharina Hayer, Nicholas F Lahens, John B Hogenesch.

Abstract

CircaDB (http://circadb.org) is a new database of circadian transcriptional profiles from time course expression experiments from mice and humans. Each transcript's expression was evaluated by three separate algorithms: JTK_CYCLE, Lomb Scargle and de Lichtenberg. Users can query the gene annotations using simple and powerful full text search terms, restrict results to specific data sets and provide probability thresholds for each algorithm. Visualizations of the data are intuitive charts that convey profile information more effectively than a table of probabilities. The CircaDB web application is open source and available at http://github.com/itmat/circadb.

Year: 2012 PMID: 23180795 PMCID: PMC3531170 DOI: 10.1093/nar/gks1161

Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971

INTRODUCTION

Circadian rhythms are biological rhythms of ∼24 h in many physiological and behavioral processes (1,2). These rhythms are generated by a cell autonomous circadian clock, present in most cells in mammals. This circadian clock is composed of interlocked transcriptional, translational feedback loops, where transactivators activate repressors that later feedback on the activators (3). Components of the required E-box loop include Bmal1, Bmal2, Clock and Npas2, bHLH-PAS transactivators, Per1, Per2 and Per3, PAS domain containing repressors and Cry1 and Cry2 (4), transcriptional repressors related to cryptochromes from plants and insects. An important secondary loop also exists, the ROR loop, which comprises Rev-erb-alpha, Rev-erb-beta, transcriptional repressors, as well as Rorα, Rorβ and Rorγ, transcriptional activators (5–7). Factors in this loop regulate transcript levels of several of the E-box components including Bmal1, Cry1, Npas2 and Per2. The cAMP Responsive Element Binding Protein (CREB) pathway (8,9) and D-box binding factors, Dbp, Hlf, Tef, Nfil3, also regulate clock function (10,11). Thus, transcription factors play a major role in the functioning of the core clock. In addition to regulating transcription of each other, clock factors also impart circadian rhythms in expression of many ‘output’ genes. First-order clock-control genes are those directly regulated by clock factors (e.g. Clock/Bmal1), while second-order output genes could be regulated by a first-order clock-control gene, but not clock components (12–14). Because of this, the research community has spent more than a decade cataloging genes under clock control (12,13,15–17). Historically, these include many disease genes, drug targets and important components of various biological pathways (1,18–20). For example, HMG-CoA reductase, the rate limiting enzyme of cholesterol biosynthesis and target of statins, is under clock control in liver (21).
Several factors have catalysed a more complete description of circadian rhythms, including the advent of DNA arrays (16) and now RNA sequencing (22), powerful statistical approaches to find rhythmic genes (23) and appropriate experimental design. The goal of CircaDB is to systematically collect, analyse and visualize circadian expression profiles for bench researchers in a simple and straightforward fashion. Common queries are supported, from direct look-ups of expression profiles to compound queries that search keywords in the gene annotation across multiple tissues and restrict results by probability of cycling.

MATERIALS AND METHODS

Various publicly available microarray time course studies (23–26) were collected (Table 1). References and links to download the expression data sets are outlined on the website. Data from each study were re-analysed using three circadian rhythm detection algorithms: JTK_CYCLE, Lomb Scargle and de Lichtenberg (23,27,28). Table 2 lists the runtime parameters of the algorithms on each data set. The reported expression values from each study were not filtered, as each algorithm accounts for technical replicates. The significance calls and other results reported by each algorithm were entered into a MySQL database.

Table 1.

Expression data sets in CircaDB

Name	Time points	Species/tissue
Panda 2002	12	Mouse suprachiasmatic nuclei (SCN) of the hypothalamus, and liver
Hughes 2009	48	Mouse liver, NIH3T3 cells, pituitary gland and human U2OS cells
Miller 2007 and Andrews 2010	12 (WT)	Wild type mouse liver, SCN and skeletal muscle
Miller 2007 and Andrews 2010	7 (KO)	Clock mutant mouse liver, SCN and skeletal muscle
Rudic 2004	12	Mouse aorta, kidney

Table 2.

Runtime parameters for each data set and algorithm

Data set	JTK_CYCLE	Lomb Scargle	De Lichtenberg
Panda 2002	Periods: 16–32 h	minFrequency = 1/32, maxFrequency = 1/18 (periods = 18–32 h); #test frequencies: 4*N	Period = 24 h; #Permutations = 10 000
Hughes 2009 (mouse)	Periods: 6–42 h	minFrequency = 1/42, maxFrequency = 1/6 (periods = 6–42 h); #test frequencies: 4*N	Period = 24 h; #Permutations = 10 000
Hughes 2009 (human)	Periods: 6–42 h	minFrequency = 1/42, maxFrequency = 1/6 (periods = 6–42 h); #test frequencies: 4*N	Period = 24 h; #Permutations = 10 000
Miller 2007	Periods: 16–32 h	minFrequency = 1/32, maxFrequency = 1/18 (periods = 18–32 h); #test frequencies: 4*N	Period = 24 h; #Permutations = 10 000
Andrews 2010	Periods: 20–28 h	minFrequency = 1/42, maxFrequency = 1/6 (periods = 6–42 h); #test frequencies: 4*N	Period = 24 h; #Permutations = 10 000
Rudic 2004	Periods: 16–32 h	minFrequency = 1/32, maxFrequency = 1/18 (periods = 18–32 h); #test frequencies: 4*N	Period = 24 h; #Permutations = 10 000

Data sets are located in Table 1.

N = number of time points in the series.
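To make the Lomb Scargle settings in Table 2 concrete, the following is a minimal sketch of how a grid of 4*N test frequencies between a minimum and maximum frequency could be scanned with the classical Lomb Scargle periodogram. It is not the code used for CircaDB: the function names, the even spacing of the grid and the synthetic 24 h profile are assumptions made for illustration only.

```python
import math


def frequency_grid(min_freq, max_freq, n_timepoints, oversample=4):
    """Evenly spaced test frequencies in cycles/h; count = oversample * N (assumed spacing)."""
    count = oversample * n_timepoints
    step = (max_freq - min_freq) / (count - 1)
    return [min_freq + i * step for i in range(count)]


def lomb_scargle_power(times, values, freq):
    """Classical normalized Lomb Scargle power at a single frequency (cycles/h)."""
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values) / (n - 1)
    omega = 2.0 * math.pi * freq
    # The time offset tau makes the sine and cosine terms orthogonal.
    s2 = sum(math.sin(2.0 * omega * t) for t in times)
    c2 = sum(math.cos(2.0 * omega * t) for t in times)
    tau = math.atan2(s2, c2) / (2.0 * omega)
    cos_terms = [math.cos(omega * (t - tau)) for t in times]
    sin_terms = [math.sin(omega * (t - tau)) for t in times]
    yc = sum((v - mean) * c for v, c in zip(values, cos_terms))
    ys = sum((v - mean) * s for v, s in zip(values, sin_terms))
    cc = sum(c * c for c in cos_terms)
    ss = sum(s * s for s in sin_terms)
    return (yc * yc / cc + ys * ys / ss) / (2.0 * var)


# Synthetic, noise-free profile (not CircaDB data): a 24 h rhythm sampled every 2 h for 48 h.
times = [2.0 * i for i in range(24)]
values = [10.0 + 3.0 * math.cos(2.0 * math.pi * (t - 6.0) / 24.0) for t in times]

# Frequency limits taken from the Panda 2002 row of Table 2.
freqs = frequency_grid(1 / 32, 1 / 18, len(times))
powers = [lomb_scargle_power(times, values, f) for f in freqs]
best_period = 1.0 / freqs[powers.index(max(powers))]
print(f"{len(freqs)} test frequencies, strongest period ~ {best_period:.1f} h")
```

Because the grid is discrete, the strongest period is the grid point closest to the true rhythm rather than exactly 24 h; a denser grid narrows that gap at the cost of more evaluations.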

Gene annotation data were downloaded from the Affymetrix NetAffx resource (http://www.affymetrix.com/analysis/index.affx). Annotations were then entered into the database alongside the unfiltered experimental values and the results of the circadian rhythm detection algorithms. Transcript information was supplemented with links to the GeneWiki project (29,30) and Homologene (http://www.ncbi.nlm.nih.gov/homologene). The data model for the database is described in Figure 1.

Figure 1.

The database schema. Boxes represent tables, and edges represent foreign key relationships. Further documentation is available at http://github.com/itmat/circadb.

The transcript annotation and the statistical results were indexed with the Sphinx full text search system (http://sphinxsearch.com/). Visualizations of the data are created using pre-formatted URI requests to the Google Charts API (https://developers.google.com/chart/). The web application was coded using the Ruby on Rails framework (http://rubyonrails.org/). All source code for data loading and the web application is licensed under the GNU General Public License (GPL-2.0) and available at http://github.com/itmat/circadb.
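As an illustration of what a pre-formatted chart URI might look like, the sketch below builds a line-chart request in the style of the Google Image Charts interface, using its `cht`, `chs`, `chd` and `chds` query parameters. The endpoint, the helper name `expression_chart_url` and the expression values are assumptions for this example; the CircaDB application itself is written in Ruby on Rails, and the Image Charts endpoint may no longer be served.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Assumed endpoint for image-style chart requests; not taken from the CircaDB source.
CHART_ENDPOINT = "https://chart.googleapis.com/chart"


def expression_chart_url(values, width=400, height=200):
    """Encode one expression profile as a line-chart request URI."""
    lo, hi = min(values), max(values)
    params = {
        "cht": "lc",                                          # chart type: line chart
        "chs": f"{width}x{height}",                           # image size in pixels
        "chd": "t:" + ",".join(f"{v:.1f}" for v in values),   # data in text format
        "chds": f"{lo:.1f},{hi:.1f}",                         # scale the axis to the data range
    }
    return CHART_ENDPOINT + "?" + urlencode(params)


# Made-up expression values for one probe across six time points.
url = expression_chart_url([120.0, 340.5, 510.2, 330.0, 118.4, 95.0])
query = parse_qs(urlsplit(url).query)
print(url)
```

Encoding the whole chart in the URI keeps the server stateless: the page only has to emit an image tag, and no chart files need to be generated or stored.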

RESULTS AND DISCUSSION

In creating CircaDB, we have provided the research community with a clear, concise and powerful interface for querying genes within the context of circadian expression profile data. Another circadian expression database, Diurnal 2.0 (31), provides a similar resource to CircaDB but focuses on plant data. It also restricts its initial search to transcript accessions, whereas CircaDB supports advanced keyword searches of the gene annotation, including searches by phrase, boolean conditions and combinations thereof. Queries can also be restricted by a given experiment’s data set, phase of expression and significance of a particular algorithm (Figure 2).
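The kind of compound query described above (a keyword in the annotation, restricted to one data set and to a significance threshold for one algorithm) can be sketched as follows. This is only an illustration: it uses an in-memory SQLite table and a simple `LIKE` match as stand-ins for the MySQL database and Sphinx index, and the table name, column names, gene labels and probability values are all hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical flattened table; the real schema is the one shown in Figure 1.
conn.execute(
    """CREATE TABLE probe_stats (
           gene_symbol TEXT, description TEXT, dataset TEXT,
           jtk_p REAL, lomb_scargle_p REAL, de_lichtenberg_p REAL)"""
)
# Placeholder rows with invented probabilities, for illustration only.
rows = [
    ("GeneA", "nuclear receptor, transcriptional repressor", "liver", 0.001, 0.004, 0.010),
    ("GeneB", "nuclear receptor, transcriptional activator", "liver", 0.200, 0.350, 0.400),
    ("GeneC", "nuclear receptor, transcriptional repressor", "kidney", 0.002, 0.003, 0.020),
    ("GeneD", "rate-limiting enzyme", "liver", 0.010, 0.020, 0.030),
]
conn.executemany("INSERT INTO probe_stats VALUES (?, ?, ?, ?, ?, ?)", rows)

P_VALUE_COLUMNS = {"jtk_p", "lomb_scargle_p", "de_lichtenberg_p"}


def search(conn, keyword, dataset, algorithm, max_p):
    """Return gene symbols matching a keyword, a data set and a p-value cutoff."""
    if algorithm not in P_VALUE_COLUMNS:
        # Column names cannot be bound as parameters, so they are whitelisted instead.
        raise ValueError(f"unknown algorithm column: {algorithm}")
    sql = (
        "SELECT gene_symbol FROM probe_stats "
        f"WHERE description LIKE ? AND dataset = ? AND {algorithm} <= ? "
        f"ORDER BY {algorithm}"
    )
    return [row[0] for row in conn.execute(sql, (f"%{keyword}%", dataset, max_p))]


liver_hits = search(conn, "nuclear receptor", "liver", "jtk_p", 0.05)
print(liver_hits)
```

Keeping one probability column per algorithm lets the same query be repeated with a different `algorithm` argument, which is one way to compare how the three methods classify the same probes.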

Figure 2.

(a) The query interface for CircaDB. The interface consists of a simple and powerful full-text search capability, with possible restrictions on the data sets, phase information and a significance threshold for a given algorithm. (b) The set of available threshold categories for the circadian classification algorithms.

The Database of Circadian Gene Expression (24), part of the Gene Atlas Project (32), contains a subset of the data sets in CircaDB, but uses a single circadian expression algorithm. CircaDB contains all of these data and re-analyses them with a newer and more robust set of algorithms (23,27,28). Three algorithms were used to allow inspection of the differences between each algorithm’s results (Figure 3). CircaDB is actively maintained, and new features and data sets will continue to be added as they become available. Requests to integrate data sets are handled by submitting a request through the project site on GitHub. CircaDB also provides expression profiles for integration within BioGPS (33).

Figure 3.

Expression profile report. A simple visualization of the data accompanies the main annotation of the gene probe, probability values from various circadian rhythm detection algorithms and other circadian information.

Finally, to facilitate use of this database framework by other research groups, we have made the source code for the application freely available under the GPL 2.0 open source license. The project has recently been used to visualize circadian experiments for Anopheles gambiae (34). All of these together make CircaDB a unique and valuable resource for the circadian research community.

FUNDING

The National Institutes of Health, the National Center for Advancing Translational Sciences [8UL1TR000003] (to Garret FitzGerald, University of Pennsylvania); National Heart, Lung, and Blood Institute [1R01HL097800-04 to J.B.H.]; the Defense Advanced Research Projects Agency [BAA-11-65] (to John Harer, Duke University). Funding for open access charge: Departmental Funds. Conflict of interest statement. None declared.

REFERENCES (10 of 34 listed)

Review 1. Central and peripheral clocks in cardiovascular and metabolic function.

Authors: Anne M Curtis; Garret A Fitzgerald
Journal: Ann Med Date: 2006 Impact factor: 4.709

Review 2. Circadian clock control of the cellular response to DNA damage.

Authors: Aziz Sancar; Laura A Lindsey-Boltz; Tae-Hong Kang; Joyce T Reardon; Jin Hyup Lee; Nuri Ozturk
Journal: FEBS Lett Date: 2010-03-15 Impact factor: 4.124

3. The Gene Wiki: community intelligence applied to human gene annotation.

Authors: Jon W Huss; Pierre Lindenbaum; Michael Martone; Donabel Roberts; Angel Pizarro; Faramarz Valafar; John B Hogenesch; Andrew I Su
Journal: Nucleic Acids Res Date: 2009-09-15 Impact factor: 16.971

4. The DIURNAL project: DIURNAL and circadian expression profiling, model-based pattern matching, and promoter analysis.

Authors: T C Mockler; T P Michael; H D Priest; R Shen; C M Sullivan; S A Givan; C McEntee; S A Kay; J Chory
Journal: Cold Spring Harb Symp Quant Biol Date: 2007

Review 5. The genetics of mammalian circadian order and disorder: implications for physiology and disease.

Authors: Joseph S Takahashi; Hee-Kyung Hong; Caroline H Ko; Erin L McDearmon
Journal: Nat Rev Genet Date: 2008-10 Impact factor: 53.242

Review 6. The meter of metabolism.

Authors: Carla B Green; Joseph S Takahashi; Joseph Bass
Journal: Cell Date: 2008-09-05 Impact factor: 41.582

7. REV-ERBalpha participates in circadian SREBP signaling and bile acid homeostasis.

Authors: Gwendal Le Martelot; Thierry Claudel; David Gatfield; Olivier Schaad; Benoît Kornmann; Giuseppe Lo Sasso; Antonio Moschetta; Ueli Schibler
Journal: PLoS Biol Date: 2009-09-01 Impact factor: 8.029

8. Harmonics of circadian gene transcription in mammals.

Authors: Michael E Hughes; Luciano DiTacchio; Kevin R Hayes; Christopher Vollmers; S Pulivarthy; Julie E Baggs; Satchidananda Panda; John B Hogenesch
Journal: PLoS Genet Date: 2009-04-03 Impact factor: 5.917

9. BioGPS: an extensible and customizable portal for querying and organizing gene annotation resources.

Authors: Chunlei Wu; Camilo Orozco; Jason Boyer; Marc Leglise; James Goodale; Serge Batalov; Christopher L Hodge; James Haase; Jeff Janes; Jon W Huss; Andrew I Su
Journal: Genome Biol Date: 2009-11-17 Impact factor: 13.583

10. A gene wiki for community annotation of gene function.

Authors: Jon W Huss; Camilo Orozco; James Goodale; Chunlei Wu; Serge Batalov; Tim J Vickers; Faramarz Valafar; Andrew I Su
Journal: PLoS Biol Date: 2008-07-08 Impact factor: 8.029
