| Literature DB >> 25945798 |
Robert Lehmann1, Liam Childs2, Philippe Thomas2, Monica Abreu3, Luise Fuhr3, Hanspeter Herzel1, Ulf Leser2, Angela Relógio3.
Abstract
By regulating the timing of cellular processes, the circadian clock provides a way to adapt physiology and behaviour to the geophysical time. In mammals, a light-entrainable master clock located in the suprachiasmatic nucleus (SCN) controls peripheral clocks that are present in virtually every body cell. Defective circadian timing is associated with several pathologies such as cancer and metabolic and sleep disorders. To better understand the circadian regulation of cellular processes, we developed a bioinformatics pipeline encompassing the analysis of high-throughput data sets and the exploitation of published knowledge by text-mining. We identified 118 novel potential clock-regulated genes and integrated them into an existing high-quality circadian network, generating the to-date most comprehensive network of circadian regulated genes (NCRG). To validate particular elements in our network, we assessed publicly available ChIP-seq data for BMAL1, REV-ERBα/β and RORα/γ proteins and found strong evidence for circadian regulation of Elavl1, Nme1, Dhx6, Med1 and Rbbp7 all of which are involved in the regulation of tumourigenesis. Furthermore, we identified Ncl and Ddx6, as targets of RORγ and REV-ERBα, β, respectively. Most interestingly, these genes were also reported to be involved in miRNA regulation; in particular, NCL regulates several miRNAs, all involved in cancer aggressiveness. Thus, NCL represents a novel potential link via which the circadian clock, and specifically RORγ, regulates the expression of miRNAs, with particular consequences in breast cancer progression. Our findings bring us one step forward towards a mechanistic understanding of mammalian circadian regulation, and provide further evidence of the influence of circadian deregulation in cancer.Entities:
Mesh:
Year: 2015 PMID: 25945798 PMCID: PMC4422523 DOI: 10.1371/journal.pone.0126283
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Work flow used to establish a network of circadian regulated genes (NCRG).
Two independent data types were used to predict genes which interact with the human core clock network (CCN, orange) and the extended core clock network (ECCN, green). Co-expression data was used to find sets of genes with strongest (anti-) correlating expression with the 43 ECCN genes across a large number of independent experiments. A total of 2357 genes were found to interact with more than 2 ECCN genes. The GeneView text-mining pipeline was used to analyse published knowledge (approximately 22 million citations) about interacting genes. A total of 961 text-mining-predicted genes were found. The intersection of both methodologies resulted in 118 new genes, which together with the ECCN form a new network of circadian regulated genes (NCRG, purple).
Fig 2The human core clock network (CCN) and the extended core clock network (ECCN).
The CCN (orange) contains the known core-clock elements (Per1,2,3, Cry1,2, Rev-Erbα,β, Rorα,β,γ, Bmal1,2, Clock and Npas). The ECCN (green) was obtained after an extensive collection of CCN-interacting genes followed by a detailed curation for direct interactions [21] and a further update to the recent literature. Activation (green lines), inhibition (red lines) and other sort of interactions (grey lines) are represented. The resulting clock network contains 43 elements and more than 200 regulatory relationships.
Fig 3Correlation distributions for clock network gene pairs versus random gene pairs.
The cumulative Pearson ρ distributions of pairs of ECCN genes reported to interact but excluding CCN (ECCN, green), reported pairs of CCN genes (CCN, orange), and 43 randomly chosen genes versus all genes as background (BG, black) are shown for the Hsa2 data collection (A, B). Distributions are shown centred around 0 with the centred bin marked by the dashed red line. The Pearson ρ distributions of reported pairs of ECCN genes (green) is compared to not reported pairs (blue) for the data set Hsa2 (C, D). Comparison of Pearson ρ and mutual rank (E, F) between all possible pairs of ECCN genes (green) and all possible pairs of non-ECCN genes as background (black). All data were taken from the Hsa2 data collection.
Fig 4Variation in functional annotation enrichment with increasing tightness between the predicted targets and the ECCN.
The significance of 28 enriched GO terms (A) and 16 KEGG terms (B) for genes connected to the CCN steadily decreases for most terms as the minimum number connections is increased (this number of connections between a gene and a gene set is here defined as "tightness"). As the minimum tightness between the predicted targets and the ECCN increases, the enrichment and rank of functional annotation changes. We observe an overall decrease in enrichment but little in rank. The greatest changes in rank occur between a tightness of 2 and 3. At a tightness of two and above, the rank of the majority of significant GO terms such as "mitotic cell cycle" and "nuclear mRNA splicing, via spliceosome", and KEGG terms such as "Spliceosome", "Ubiquitin mediated proteolysis" and "RNA degradation" remain largely stable suggesting a natural threshold on tightness at this point.
Enrichment analysis of the co-expression-predicted ECCN interacting genes for GO term annotations.
| GO Term | Annotations in Total | Annotations in Predicted Set | Expected | FDR |
|---|---|---|---|---|
|
| 200 | 88 | 27 | 1.9e-10 |
|
| 443 | 114 | 59.81 | 1.6e-09 |
|
| 105 | 54 | 14.18 | 6.7e-08 |
|
| 34 | 21 | 4.59 | 1.2e-06 |
|
| 347 | 104 | 46.85 | 1.7e-06 |
|
| 130 | 44 | 17.55 | 1.8e-06 |
|
| 69 | 24 | 9.32 | 1.9e-06 |
|
| 84 | 35 | 11.34 | 1.9e-06 |
|
| 76 | 32 | 10.26 | 8.4e-06 |
|
| 378 | 152 | 51.03 | 1.9e-05 |
|
| 4347 | 874 | 586.86 | 0.00027 |
|
| 234 | 70 | 31.59 | 0.00031 |
|
| 28 | 18 | 3.78 | 0.00033 |
|
| 26 | 16 | 3.51 | 0.00038 |
|
| 59 | 28 | 7.97 | 0.00042 |
|
| 1154 | 222 | 155.79 | 0.00047 |
|
| 28 | 16 | 3.78 | 0.00079 |
|
| 369 | 109 | 49.82 | 0.00120 |
|
| 44 | 20 | 5.94 | 0.00285 |
|
| 2735 | 506 | 369.23 | 0.00518 |
|
| 308 | 121 | 41.58 | 0.00989 |
|
| 6831 | 1116 | 889.85 | 2.6e-29 |
|
| 792 | 248 | 103.17 | 7.0e-24 |
|
| 2240 | 454 | 291.8 | 5.8e-14 |
|
| 1439 | 292 | 187.45 | 9.4e-13 |
|
| 2294 | 448 | 298.83 | 3.3e-07 |
|
| 64 | 27 | 8.34 | 2.6e-05 |
|
| 235 | 59 | 30.61 | 0.00026 |
|
| 241 | 63 | 31.39 | 0.00104 |
|
| 50 | 21 | 6.51 | 0.00134 |
|
| 43 | 19 | 5.6 | 0.00186 |
|
| 32 | 17 | 4.17 | 0.00547 |
|
| 87 | 28 | 11.33 | 0.00865 |
|
| 5640 | 1172 | 724.54 | 2.1e-30 |
|
| 1401 | 423 | 179.98 | 1.0e-27 |
|
| 144 | 61 | 18.5 | 1.3e-15 |
|
| 589 | 154 | 75.67 | 2.4e-14 |
|
| 78 | 40 | 10.02 | 3.7e-13 |
|
| 60 | 36 | 7.71 | 5.6e-10 |
|
| 2217 | 382 | 284.8 | 3.6e-07 |
|
| 19 | 14 | 2.44 | 2.5e-06 |
|
| 363 | 89 | 46.63 | 3.9e-05 |
|
| 44 | 20 | 5.65 | 0.00014 |
|
| 21 | 12 | 2.7 | 0.00239 |
|
| 137 | 60 | 17.6 | 0.00322 |
|
| 10 | 8 | 1.28 | 0.00330 |
|
| 280 | 72 | 35.97 | 0.00466 |
|
| 6 | 6 | 0.77 | 0.00568 |
|
| 6 | 6 | 0.77 | 0.00568 |
|
| 169 | 46 | 21.71 | 0.00685 |
Gene ontology annotation enrichment was performed for the molecular function, cellular component, and biological process ontologies. Only terms with q < 0.01 (false discovery rate after Benjamini-Hochberg) are shown.
Enrichment of KEGG pathway annotations amongst the co-expression-predicted ECCN interacting genes.
| KEGG ID | Pathway | p-value | FDR |
|---|---|---|---|
|
| RNA transport | 3.51E-40 | 8.00E-38 |
|
| Spliceosome | 6.64E-40 | 1.50E-37 |
|
| Cell cycle | 7.95E-23 | 1.80E-20 |
|
| Ribosome biogenesis in eukaryotes | 6.98E-21 | 1.60E-18 |
|
| Ubiquitin mediated proteolysis | 4.01E-16 | 9.00E-14 |
|
| RNA degradation | 1.95E-14 | 4.40E-12 |
|
| DNA replication | 6.08E-14 | 1.40E-11 |
|
| mRNA surveillance pathway | 8.00E-12 | 1.80E-09 |
|
| Protein processing in endoplasmic reticulum | 2.52E-11 | 5.60E-09 |
|
| Wnt signalling pathway | 1.26E-10 | 2.80E-08 |
|
| Pathways in cancer | 3.19E-09 | 7.00E-07 |
|
| Olfactory transduction | 6.69E-09 | 1.50E-06 |
|
| Mismatch repair | 8.92E-09 | 1.90E-06 |
|
| Purine metabolism | 1.24E-08 | 2.70E-06 |
|
| Pyrimidine metabolism | 1.24E-08 | 2.70E-06 |
|
| MAPK signalling pathway | 1.84E-08 | 3.90E-06 |
|
| Nucleotide excision repair | 6.15E-08 | 1.30E-05 |
|
| Chronic myeloid leukemia | 2.60E-07 | 5.50E-05 |
|
| Neurotrophin signalling pathway | 2.94E-07 | 6.20E-05 |
|
| Endocytosis | 5.12E-07 | 1.10E-04 |
|
| Oocyte meiosis | 7.00E-07 | 1.50E-04 |
|
| Colorectal cancer | 7.63E-07 | 1.60E-04 |
|
| T cell receptor signalling pathway | 1.16E-06 | 2.40E-04 |
|
| Endometrial cancer | 2.74E-06 | 5.70E-04 |
|
| Progesterone-mediated oocyte maturation | 3.32E-06 | 6.80E-04 |
|
| Long-term potentiation | 3.78E-06 | 7.70E-04 |
|
| Hepatitis C | 1.04E-05 | 2.10E-03 |
|
| Regulation of actin cytoskeleton | 1.17E-05 | 2.40E-03 |
|
| ErbB signalling pathway | 1.38E-05 | 2.80E-03 |
|
| Thyroid cancer | 1.42E-05 | 2.80E-03 |
|
| Insulin signalling pathway | 1.45E-05 | 2.90E-03 |
|
| Renal cell carcinoma | 1.76E-05 | 3.50E-03 |
|
| Proteasome | 1.95E-05 | 3.80E-03 |
|
| Non-small cell lung cancer | 2.30E-05 | 4.50E-03 |
|
| Non-homologous end-joining | 5.12E-05 | 1.00E-02 |
Only terms with q < 0.01 (false discovery rate after Benjamini-Hochberg) are shown.
Fig 5Homogenous functional spectrum of genes targeted by different ECCN genes.
The specific functions of genes interacting with each individual ECCN gene were counted and illustrated as heat map. Annotated KEGG pathways (A) and GO terms (B) found to be overrepresented amongst all predicted target genes in the preceding analysis were counted for each target gene, and these counts were then accumulated for each individual ECCN gene and represented as colours according to the legend. Rows and columns are ordered according to a hierarchical clustering.
Fig 6Functional analysis of the consensus predicted ECCN target gene set.
Overlap between 2357 new ECCN elements (orange) based on expression pattern correlation and 961 genes obtained with text-mining methods (green) (A). This resulted in 118 new genes found by both methods. KEGG pathway annotation enrichment of the 118 consensus predicted genes (B) and corresponding GO enrichment (C).
Fig 7Network representation of CCN/ECCN network together with the 118 predicted target genes (NCRG).
Boxes represent individual genes, which are connected by lines reflecting interactions that are known (grey), predicted by co-expression (blue), text-mining (green), or by both (red). The sub-networks are indicated by rectangles, the CCN (orange), the ECCN (green), and NCRG (purple).
Properties of the consensus predicted ECCN target genes.
| Chip-Seq Target Genes | Circadian Phenotype | Pathological phenotype | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| entrezID | Gene Symbol | RevErbα [ | RevErbβ [ | RevErbα/β [ | RORα [ | RORγ [ | RORα/γ [ | circadian expression | high-A | long-T | short-T | OMIM |
| 102 | ADAM10 | x | Reticulate acropigmentation of Kitamura, 615537 (3)~{Alzheimer disease 18, susceptibility to}, 615590 (3) | |||||||||
| 328 | APEX1 | x | ||||||||||
| 350 | APOH | x | x | x | x | x | x | x | ||||
| 466 | ATF1 | x | ||||||||||
| 471 |
| x | AICA-ribosiduria due to ATIC deficiency, 608688 (3) | |||||||||
| 551 | AVP | x | Diabetes insipidus, neurohypophyseal, 125700 (3) | |||||||||
| 813 | CALU | x | x | |||||||||
| 885 | CCK | x | ||||||||||
| 996 | CDC27 | x | ||||||||||
| 1108 | CHD4 | x | x | |||||||||
| 1195 | CLK1 | x | 3MC syndrome 2, 265050 (3) | |||||||||
| 1386 | ATF2 | |||||||||||
| 1452 | CSNK1A1 | x | x | x | x | |||||||
| 1459 | CSNK2A2 | x | ||||||||||
| 1499 | CTNNB1 | x | x | Colorectal cancer, somatic, 114500 (3)~Hepatocellular carcinoma, somatic, 114550 (3)~Mental retardation, autosomal dominant 19, 615075 (3)~Ovarian cancer, somatic, 167000 (3)~Pilomatricoma, somatic, 132600 (3) | ||||||||
| 1642 |
| x | x | |||||||||
| 1656 | DDX6 | x | x | |||||||||
| 1660 | DHX9 | x | ||||||||||
| 1855 | DVL1 | x | ||||||||||
| 1859 | DYRK1A | x | x | x | Mental retardation, autosomal dominant 7, 614104 (3) | |||||||
| 1915 | EEF1A1 | x | x | x | ||||||||
| 1994 | ELAVL1 | |||||||||||
| 2177 | FANCD2 | x | Fanconi anemia, complementation group D2, 227646 (3) | |||||||||
| 2547 | XRCC6 | x | ||||||||||
| 2875 | GPT | x | x | x | ||||||||
| 2905 | GRIN2C | x | x | x | ||||||||
| 3308 | HSPA4 | x | ||||||||||
| 3454 | IFNAR1 | x | x | x | x | |||||||
| 4089 | SMAD4 | x | x | Juvenile polyposis/hereditary hemorrhagic telangiectasia syndrome, 175050 (3)~Myhre syndrome, 139210 (3)~Pancreatic cancer, somatic, 260350 (3)~Polyposis, juvenile intestinal, 174900 (3) | ||||||||
| 4297 | MLL | x | x | x | Leukemia, myeloid/lymphoid or mixed-lineage (2)~Wiedemann-Steiner syndrome, 605130 (3) | |||||||
| 4299 | AFF1 | x | x | x | ||||||||
| 4670 | HNRNPM | x | ||||||||||
| 4691 | NCL | x | x | x | ||||||||
| 4830 | NME1 | x | x | Neuroblastoma, 256700 (3) | ||||||||
| 4836 | NMT1 | x | x | x | ||||||||
| 5430 | POLR2A | x | ||||||||||
| 5469 | MED1 | x | x | |||||||||
| 5478 | PPIA | x | ||||||||||
| 5599 | MAPK8 | x | x | x | ||||||||
| 5663 | PSEN1 | x | x | Acne inversa, familial, 3, 613737 (3)~Alzheimer disease, type 3, 607822 (3)~Alzheimer disease, type 3, with spastic paraparesis and apraxia, 607822 (3)~Alzheimer disease, type 3, with spastic paraparesis and unusual plaques, 607822 (3)~Cardiomyopathy, dilated, 1U, 613694 (3)~Dementia, frontotemporal, 600274 (3)~Pick disease, 172700 (3) | ||||||||
| 5725 | PTBP1 | x | x | |||||||||
| 5931 | RBBP7 | |||||||||||
| 5980 | REV3L | x | ||||||||||
| 6125 | RPL5 | x | Diamond-Blackfan anemia 6, 612561 (3) | |||||||||
| 6667 | SP1 | x | x | x | ||||||||
| 6868 | ADAM17 | x | Inflammatory skin and bowel disease, neonatal, 614328 (3) | |||||||||
| 7248 | TSC1 | x | x | Focal cortical dysplasia, Taylor balloon cell type, 607341 (3)~Lymphangioleiomyomatosis, 606690 (3)~Tuberous sclerosis-1, 191100 (3) | ||||||||
| 7341 | SUMO1 | x | Orofacial cleft 10, 613705 (3) | |||||||||
| 7520 | XRCC5 | x | ||||||||||
| 7994 | KAT6A | x | x | |||||||||
| 8021 | NUP214 | x | Leukemia, T-cell acute lymphoblastic (3)~Leukemia, acute myeloid, 601626 (3) | |||||||||
| 8202 | NCOA3 | x | x | x | ||||||||
| 8491 | MAP4K3 | x | x | x | ||||||||
| 8615 | USO1 | x | x | x | ||||||||
| 8648 | NCOA1 | x | ||||||||||
| 9318 | COPS2 | x | ||||||||||
| 9611 | NCOR1 | x | x | x | ||||||||
| 9612 | NCOR2 | x | x | x | ||||||||
| 10059 | DNM1L | x | Encephalopahty, lethal, due to defective mitochondrial peroxisomal fission, 614388 (3) | |||||||||
| 10432 | RBM14 | x | x | x | x | x | ||||||
| 10499 | NCOA2 | x | ||||||||||
| 10615 | SPAG5 | x | x | x | x | |||||||
| 10664 | CTCF | x | Mental retardation, autosomal dominant 21, 615502 (3) | |||||||||
| 10725 | NFAT5 | x | ||||||||||
| 10728 | PTGES3 | x | x | |||||||||
| 11331 | PHB2 | x | ||||||||||
| 23013 | SPEN | x | x | Megakaryoblastic leukemia, acute (2) | ||||||||
| 26959 | HBP1 | x | ||||||||||
| 27113 | BBC3 | x | x | |||||||||
| 27327 | TNRC6A | x | ||||||||||
| 28996 | HIPK2 | x | x | |||||||||
| 51514 | DTL | x | x | x | ||||||||
| 54464 | XRN1 | x | ||||||||||
| 55031 | USP47 | x | x | |||||||||
| 57786 | RBAK | x | ||||||||||
| 79664 | NARG2 | x | x | x | x | |||||||
| 90480 | GADD45GIP1 | x | ||||||||||
Genes marked in bold were found to be BMAL1 targets.
All 62 genes (of 118) are shown which exhibit at least one of the following properties: regulated by REV-ERB or ROR, circadian expression pattern, causing a clock phenotype upon RNAi knockdown, predicted as similar to known clock genes [29], and featuring an OMIM annotation.
Fig 8Transcriptional regulation of the CCN/ECCN extension network of 118 genes by REV-ERB and ROR.
Regulatory interactions of REV-ERB α/β with NCRG genes (green lines) were derived from the locations of physical binding of these proteins in two ChIP-seq experiments [16, 17]. The ROR α/ γ interactions (purple lines) were adopted from the report of a third ChIP-seq experiment [22]. Genes with an observed phenotype in the genome-wide RNAi screen [28, 29] are shown with a coloured box, red indicating long period, blue a high amplitude, and green a short period.