| Literature DB >> 26576534 |
Anuprabha Bhargava1, Hanspeter Herzel2, Bharath Ananthasubramaniam3.
Abstract
BACKGROUND: Most physiological processes in mammals are temporally regulated by means of a master circadian clock in the brain and peripheral oscillators in most other tissues. A transcriptional-translation feedback network of clock genes produces near 24 h oscillations in clock gene and protein expression. Here, we aim to identify novel additions to the clock network using a meta-analysis of public chromatin immunoprecipitation sequencing (ChIP-seq), proteomics and protein-protein interaction data starting from a published list of 1000 genes with robust transcriptional rhythms and circadian phenotypes of knockdowns.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26576534 PMCID: PMC4650315 DOI: 10.1186/s12918-015-0227-2
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Data sources used in this study
| Type | Source | Characterized | Total | Circadian | Hits |
|---|---|---|---|---|---|
| Koike et al. [ | BMAL1 | 3495 | 3004 | 359ns | |
| PER1 | 2984 | 138 | 15ns | ||
| PER2 | 4255 | 3495 | 384ns | ||
| CRY1 | 6768 | 2923 | 356*** | ||
| CRY2 | 5230 | 2717 | 318** | ||
| CLOCK | 2831 | 1204 | 170*** | ||
| ChIP-seq | NPAS | 1597 | 808 | 121ns | |
| Rey et al. [ | BMAL1 | 1273 | 439 | 228* | |
| Cho et al. [ | REV-ERB | 3849 | - | 412 | |
| REV-ERB | 3849 | - | 412 | ||
| Bugge et al. [ | REV-ERB | 6256 | - | 636 | |
| Feng et al. [ | REV-ERB | 6444 | - | 635 | |
| Fang et al. [ | ROR | 8457 | - | 529 | |
| E4BP4 (NFIL3) | 6147 | - | 437 | ||
| Proteomics | Robles et al. [ | 2877 | 185 | 34** | |
| Mauvoisin et al. [ | 5610 | 193 | 35*** | ||
| Chiang et al. [ | 1881 | 47 | 6ns | ||
| Protein-protein interaction | Wallach et al. [ | 123 | - | 25 | |
| PINA mouse database [ | 467 | - | 33 |
The source of the published data, the characterized protein or transcription factor, the total number of binding sites for the ChIP-seq data and the number of genes corresponding to the quantified proteins in the case of proteomics or protein-protein interaction data are provided. When data could be filtered to include only circadian components, the number of genes with circadian ChIP-seq or proteomic evidence is also listed. Finally, the total number of hits from each data set among the 1000 gene long master list is given. The statistically overrepresented circadian hits are marked by significance (see “Method” section) ∗:p<0.05,∗∗:p<0.01,∗∗∗:p<0.001,:not significant
Fig. 1Data mining approach to find novel clock candidate genes. The schematic outlines the approach along with the types of data sources used to filter the master list of 1000 genes from Anafi et al. [9] to the list of 11 novel candidate clock genes. Some key properties of the novel candidates genes are also indicated. The entire list of robust and non-robust candidate genes are given in Table S1 in Additional file 2
Fig. 2Scores of the significant and robust candidate genes and their relationship to the original ranking in Anafi et al. a The table of individual hits within the ChIP-sequencing, proteomics and protein-protein interaction data sets (in blue, orange and green, respectively) for the 20 significant candidate genes (at the 0.001 level) based on our meta-analysis that are also robust to the choice of weighting schemes for the different data sets. The nine known clock-associated genes are marked in bold on the y-axis. b The distribution of the evidence-based ranks from Anafi et al. [9] for the 20 significant and robust candidate genes identified in a. The candidate genes that obtained evidence-based ranks in the top and bottom 200 are also listed
Fig. 3Transcript expression of candidate genes. a The distributions of the number of different tissues and expression phase in those tissues compared across candidate genes (red) and the rest of the master list (green) based on [33]. b The circadian expression profiles in fourteen different tissues (data from [33]) of genes in the candidate list that were expressed in a circadian manner in at least four tissues
Fig. 4Phase regulation of circadian transcripts. a The distribution of the expression phase of genes that are significantly circadian in different tissues (FDRJTK<0.05) [33] classified according to the combinations of transcription factors (TFs) binding their promoter based on ChIP-seq data. E-box: BMAL1 or NPAS2 or CLOCK, D-box: E4BP4, RRE: REV-ERB α or REV-ERB β or ROR α. The grey vertical lines represent the median phase of expression within each group of transcripts and the line width increases with increasing significance of the mean-direction (Rayleigh test). The number of transcripts in each group is also provided. b The estimated distribution of phases for each combination of TFs from the generalized linear model fit of the transcript phases to the normalized scores for each TF group within the ChIP-seq data in our meta-analysis. Under the model, the phase of expression within each group is assumed to follow a von Mises distribution, whose mean is linearly dependent on the normalized scores (see “Method” section)