| Literature DB >> 26635867 |
Marina Lizio1, Yuri Ishizu1, Masayoshi Itoh2, Timo Lassmann3, Akira Hasegawa1, Atsutaka Kubosaki4, Jessica Severin1, Hideya Kawaji2, Yukio Nakamura5, Harukazu Suzuki1, Yoshihide Hayashizaki6, Piero Carninci1, Alistair R R Forrest7.
Abstract
Mammals are composed of hundreds of different cell types with specialized functions. Each of these cellular phenotypes are controlled by different combinations of transcription factors. Using a human non islet cell insulinoma cell line (TC-YIK) which expresses insulin and the majority of known pancreatic beta cell specific genes as an example, we describe a general approach to identify key cell-type-specific transcription factors (TFs) and their direct and indirect targets. By ranking all human TFs by their level of enriched expression in TC-YIK relative to a broad collection of samples (FANTOM5), we confirmed known key regulators of pancreatic function and development. Systematic siRNA mediated perturbation of these TFs followed by qRT-PCR revealed their interconnections with NEUROD1 at the top of the regulation hierarchy and its depletion drastically reducing insulin levels. For 15 of the TF knock-downs (KD), we then used Cap Analysis of Gene Expression (CAGE) to identify thousands of their targets genome-wide (KD-CAGE). The data confirm NEUROD1 as a key positive regulator in the transcriptional regulatory network (TRN), and ISL1, and PROX1 as antagonists. As a complimentary approach we used ChIP-seq on four of these factors to identify NEUROD1, LMX1A, PAX6, and RFX6 binding sites in the human genome. Examining the overlap between genes perturbed in the KD-CAGE experiments and genes with a ChIP-seq peak within 50 kb of their promoter, we identified direct transcriptional targets of these TFs. Integration of KD-CAGE and ChIP-seq data shows that both NEUROD1 and LMX1A work as the main transcriptional activators. In the core TRN (i.e., TF-TF only), NEUROD1 directly transcriptionally activates the pancreatic TFs HSF4, INSM1, MLXIPL, MYT1, NKX6-3, ONECUT2, PAX4, PROX1, RFX6, ST18, DACH1, and SHOX2, while LMX1A directly transcriptionally activates DACH1, SHOX2, PAX6, and PDX1. Analysis of these complementary datasets suggests the need for caution in interpreting ChIP-seq datasets. (1) A large fraction of binding sites are at distal enhancer sites and cannot be directly associated to their targets, without chromatin conformation data. (2) Many peaks may be non-functional: even when there is a peak at a promoter, the expression of the gene may not be affected in the matching perturbation experiment.Entities:
Keywords: CAGE; ChIP-seq; FANTOM5; pancreas; perturbation; transcriptional regulatory network
Year: 2015 PMID: 26635867 PMCID: PMC4650373 DOI: 10.3389/fgene.2015.00331
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Diagram showing the workflow of the analyses. (A) The FANTOM5 data reveal the TC-YIK cell line expresses key pancreatic neuropeptides and pancreatic transcription factors. (B) New data is generated for this study including siRNA perturbation of TC-YIK enriched and non-enriched TFs. siRNA perturbed samples are profiled by qRTPCR in a matrix RNAi design and by CAGE to globally identify target promoters. ChIP-seq on 4 key TFs is also carried out to identify genome wide binding sites. (C) KD-CAGE and ChIP-seq data are integrated to identify directly regulated targets of NEUROD1 and LMX1A.
Neurosecretory peptide expression in TC-YIK.
| 6062.51 | 1 | 6062.51 | TC-YIK | |
| 1202.73 | 21 | 60441.3 | medulla oblongata, adult | |
| 1096.66 | 1 | 1096.66 | TC-YIK | |
| 50.13 | 4 | 5119.98 | Duodenum, fetal | |
| 15.37 | 5 | 54.13 | Eosinophils | |
| 7.81 | 93 | 3612.79 | Duodenum, fetal | |
| 0 | NA | 26.58 | Pancreas, adult | |
| 0 | NA | 3534.95 | Gastric cancer cell line AZ521 | |
TFs enriched in TC-YIK and their putative function in pancreas.
| 593 | 2.77 | Yes | Yes | Si, CA, CS | |
| 519 | 2.72 | Yes | Yes | – | |
| 296 | 2.47 | Yes | Yes | Si, CA, CS | |
| 239 | 2.38 | Yes | No | – | |
| 237 | 2.38 | Yes | Yes | Si | |
| 218 | 2.34 | Yes | Yes | Si, CA | |
| 146 | 2.17 | Yes | Yes | Si, CA, CS | |
| 151 | 2.14 | Yes | Yes | Si, CA | |
| 133 | 2.13 | Yes | Yes | Si, CA | |
| 127 | 2.11 | Yes | Yes | Si | |
| 269 | 2.05 | Yes | Yes | Si, CA | |
| 102 | 2.01 | Yes | Yes | Si, CA, CS | |
| 94 | 1.98 | Yes | No | Si | |
| 168 | 1.95 | Yes | Yes | Si, CA | |
| 88 | 1.95 | Yes | Yes | Si | |
| 78 | 1.90 | Yes | Yes | – | |
| 75 | 1.88 | Yes | Yes | – | |
| 106 | 1.84 | Yes | Yes | Si, CA | |
| 69 | 1.84 | Yes | Yes | Si | |
| 51 | 1.71 | Yes | Yes | Si | |
| 62 | 1.70 | Yes | No | Si, CA | |
| 55 | 1.63 | No | Yes | – | |
| 41 | 1.63 | Yes | Yes | – | |
| 40 | 1.61 | Yes | No | Si | |
| 38 | 1.59 | Yes | Yes | Si, CA | |
| 34 | 1.54 | No | Yes | – | |
| 97 | 1.53 | Yes | No | – | |
| 31 | 1.51 | No | Yes | Si | |
| 70 | 1.51 | Yes | Yes | – | |
| 30 | 1.49 | No | No | Si | |
| 62 | 1.49 | No | Yes | – | |
| 26 | 1.43 | Yes | Yes | Si | |
| 26 | 1.43 | Yes | No | Si | |
| 25 | 1.42 | Yes | Yes | Si, CA | |
| 24 | 1.41 | Yes | Yes | Si | |
| 24 | 1.40 | Yes | No | Si, CA, CS | |
| 27 | 1.33 | No | Yes | – | |
| 71 | 1.32 | Yes | Yes | – | |
| 23 | 1.31 | No | Yes | Si | |
| 23 | 1.30 | No | Yes | – | |
| 19 | 1.29 | No | No | Si | |
| 18 | 1.28 | Yes | Yes | Si, CA | |
| 290 | 0.73 | No | Yes | Si, CA | |
| 243 | 0.37 | No | Yes | Si, CA | |
| 213 | 0.36 | No | Yes | Si, CA | |
| 672 | 0.34 | Yes | Yes | Si, CA | |
| 29 | 0.15 | No | Yes | Si, CA | |
| 80 | 0.09 | No | Yes | Si, CA | |
| 90 | −0.10 | No | Yes | Si, CA | |
| 75 | −0.33 | No | Yes | Si, CA | |
An extended version of the table is provided as Supplementary Table .
Figure 2Influence of transcription factor knock-down on . Log2 expression fold changes for INS gene upon siRNA perturbation of 41 TFs. NEUROD1 knock-down caused the most down–regulation of insulin expression, while highest up-regulation was observed in TBP knock-down. Error bars indicate standard deviation over triplicate measurements. TFs in bold indicate those that were TC-YIK-enriched rather than ubiquitous.
Figure 3KD-CAGE analysis. (A) Up-regulated and down-regulated TSSs in KD-CAGE experiments. Bars indicate, the numbers of up-regulated, and down-regulated TSSs detected by edgeR (p < 0.05) after siRNA knockdown of each factor. Line indicates the log transformed ratio of up-regulated to down-regulated TSS (e.g., note NEUROD1 causes a much larger number of down-regulated TSS than up-regulated ones, while MNX1 shows the reverse). (B) Fractions of up or down-regulated promoters that are TC-YIK-enriched or non-enriched. (C) Comparison of the ratios of TC-YIK-enriched to non-enriched promoters for up and down-regulated TSS sets. Note, ISL1 and PROX1 appear antagonistic to the TC-YIK state. (D) Diagram summarizing the results of the state enrichment and gene ontology enrichment analyses. *Indicates at least 15% of the up or down-regulated promoters were TC-YIK enriched.
Figure 4ChIP-seq analysis of Comparison of the novel RFX6 motif to that of other RFX members. Note that it is unlikely that the antibody used (S-15, Santa Cruz) would cross react with any other RFX family member as the antibody is raised against a peptide in the unique extended C-terminus of the protein which is not present in any of the other RFX family members. (B) ZENBU genome browser (Severin et al., 2014) view showing combinatorial binding of LMX1A-NEUROD1 and NEUROD1-RFX6 in the first intron of the RERE locus. Red, LMX1A; Blue, NEUROD1; Green, PAX6; Yellow, RFX6; Gray, input chromatin. (C) Venn diagram showing the degree of overlap between the peaks called for the four factors, numbers correspond to count of peaks overlapping by at least 1 base. (D) Comparison of the TF ChIP-seq peaks to open chromatin sites identified in human islet cell material by Pasquali et al. (2014).
Figure 5Integration of KD-CAGE and ChIP-seq to identify direct edges. (A) Bar graph showing the fractions of up-regulated (orange), down-regulated (blue), and unaffected (gray) TSS in the knock-down of NEUROD1 or LMX1A. Bars correspond to different distance bins from a ChIP-seq peak for the same factor. (B) Example of putative non-functional binding of LMX1A at the EYS locus. Note the presence of multiple NEUROD1 and LMX1A ChIP-seq peaks and relevant motifs, but only the NEUROD1 knock-down affected EYS expression (more examples shown in Supplementary Figure 4). (C) Diagram showing TC-YIK enriched transcription factors (from Supplementary Table 4) that are directly regulated by NEUROD1 or LMX1A. To be called a direct target, we require at least one TSS of the target gene to be down-regulated 1.5-fold with a p-value of 0.05 and it must be within 50 kb of a ChIP-seq peak for the same factor.