Literature DB >> 24109597

Identification of cell cycle-regulated genes periodically expressed in U2OS cells and their regulation by FOXM1 and E2F transcription factors.

Gavin D Grant1, Lionel Brooks, Xiaoyang Zhang, J Matthew Mahoney, Viktor Martyanov, Tammara A Wood, Gavin Sherlock, Chao Cheng, Michael L Whitfield.   

Abstract

We identify the cell cycle-regulated mRNA transcripts genome-wide in the osteosarcoma-derived U2OS cell line. This results in 2140 transcripts mapping to 1871 unique cell cycle-regulated genes that show periodic oscillations across multiple synchronous cell cycles. We identify genomic loci bound by the G2/M transcription factor FOXM1 by chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) and associate these with cell cycle-regulated genes. FOXM1 is bound to cell cycle-regulated genes with peak expression in both S phase and G2/M phases. We show that ChIP-seq genomic loci are responsive to FOXM1 using a real-time luciferase assay in live cells, showing that FOXM1 strongly activates promoters of G2/M phase genes and weakly activates those induced in S phase. Analysis of ChIP-seq data from a panel of cell cycle transcription factors (E2F1, E2F4, E2F6, and GABPA) from the Encyclopedia of DNA Elements and ChIP-seq data for the DREAM complex finds that a set of core cell cycle genes regulated in both U2OS and HeLa cells are bound by multiple cell cycle transcription factors. These data identify the cell cycle-regulated genes in a second cancer-derived cell line and provide a comprehensive picture of the transcriptional regulatory systems controlling periodic gene expression in the human cell division cycle.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24109597      PMCID: PMC3842991          DOI: 10.1091/mbc.E13-05-0264

Source DB:  PubMed          Journal:  Mol Biol Cell        ISSN: 1059-1524            Impact factor:   4.138


INTRODUCTION

Examining the periodic expression patterns of the human cell cycle using genomic approaches can provide a complete picture of one of the most tightly regulated processes in the life of a cell. This knowledge allows, in turn, the examination of how different regulators of the cell cycle machinery interact and affect the timing of cell cycle progression. This is especially important, as perturbations in cell cycle progression can lead to apoptosis or cancer. The cell cycle has been studied extensively at the molecular level, and transcriptional programs have been measured and analyzed using microarray technology in budding yeast (Cho ; Spellman ), fission yeast (Rustici ; Oliva ; Peng ), bacteria (Laub ), primary human fibroblasts (Cho ; Iyer ; Bar-Joseph ), mouse fibroblasts (Ishida ), Arabidopsis (Menges , 2005), the human keratinocyte cell line HaCaT (Pena-Diaz ), and HeLa cells (Crawford and Piwnica-Worms, 2001; Whitfield ; Sadasivam ). The analysis of cell cycle–regulated gene expression in fibroblasts and HeLa cells suggests both commonality and cell-type specificity in the cell cycle–regulated gene expression programs of human cells. Here we provide the full complement of cell cycle–regulated genes in a fourth human cell line, the commonly used osteosarcoma cell line U2OS. In addition, to shed light on cell cycle regulation, we determine the genome-wide binding patterns of FOXM1 and analyze publicly available data for additional cell cycle transcriptional regulators, thus providing an interpretive framework for the periodic gene expression program in U2OS cells. DNA replication and mitosis are two major cellular events whose transcriptional programs must be precisely coordinated and executed in order for a cell to successfully divide. To ensure that these two events are successfully completed in an orderly manner, there are regulatory mechanisms, ranging from transcriptional control (e.g., RB and pocket protein sequestration of E2Fs) to protein degradation (e.g., anaphase-promoting complex), that assure that a cell progresses forward in the process and reversal does not occur. These processes, in turn, have their own systems of control and regulation. Transcription factors provide one aspect of this control, regulating the timing and levels of gene expression. One of these transcription factors, the G1/S-phase regulator E2F1, has been well studied, whereas the corresponding transcription factor for mitotic gene expression, FOXM1, has been less well studied (reviewed in Wierstra and Alves, 2007; Alvarez-Fernandez and Medema, 2013). At the beginning of and throughout S phase, the E2F family of transcription factors regulates the expression of genes involved in initiation and continued replication of DNA (Johnson ; Slansky and Farnham, 1996; Helin, 1998). As cells progress through S phase, the MuvB core complex (LIN9, LIN37, LIN52, LIN54, and RBBP4) dissociates from the DREAM complex (TFDP1, p130 [RBL2], E2F4, and the MuvB core complex) and binds to B-Myb (the B-Myb-MuvB complex). This complex then binds to the promoters of PLK1 and CCNB1, two genes expressed in G2 and mitosis, as well as a number of other mitotic cell cycle genes (Sadasivam ). In late S phase, the transcription factor FOXM1 is recruited to these promoters by the B-Myb-MuvB complex before B-Myb dissociates from the complex and is degraded by the proteasome (Litovchick ). FOXM1 activates expression of genes critical for the proper progression of mitosis, including CCNB1, PLK1, CDC25B, CDC25C, AURKB, and BIRC5 (Kalinichenko ; Kim ; Laoukili ; Wonsey and Follettie, 2005; Alvarez-Fernandez ; Nakamura ; Bonet ; Down ; Sadasivam ). In HeLa cells, FOXM1 mRNA is cell cycle regulated, with peak expression in G2 (Whitfield ). The FOXM1 protein is also phosphorylated in M phase. These include multiple phosphorylation events involving multiple cyclin-CDK complexes, including cyclin D1/Cdk4 (Wierstra and Alves, 2006b, 2008), cyclin E/Cdk2 (Wierstra and Alves, 2006a, 2008), cyclin A/Cdk2 (Wierstra and Alves, 2006a, 2008; Laoukili et al., 2008), cyclin A/Cdk1 (Wierstra and Alves, 2006a), and cyclin B1/Cdk1 (Major ; Chen ), as well as phosphorylation by PLK1 (Chen ). These modifications culminate in the destruction of FOXM1 protein during mitosis by the APC/C (Laoukili et al., 2008; Park ). FOXM1 expression is often increased in different types of cancer, such as squamous cell carcinoma (Calvisi ; Chen ; Gemenetzidis ; Waseem ; Hui ; Teh ), rhabdomyosarcoma (Wan ), hepatocellular carcinoma (Liu ; Xia ), pancreatic cancer (Huang ), glioblastoma (Liu ), breast cancer (Madureira ; Ahmad , 2011; Kwok ; Bergamaschi ; Wang and Gartel, 2011; Park ), and leukemia (Nakamura ). FOXM1 has also been shown to drive expression of MMP2 and MMP9, which are implicated in metastases and tumor cell invasion (Wang ; Ahmad ; Chen ; Lok ; Ahmed ; Lynch ; Xue ). Although FOXM1 is perhaps the best-studied forkhead box protein involved in cell cycle control, there are a number of others that have roles in cell cycle progression. FOXK1 and FOXJ3 are required for cell cycle progression in U2OS cells, as determined in both live-cell luminescence assays (Grant ) and genome-wide small interfering RNA (siRNA) screens (Mukherji ; Kittler ). In addition, there are a number of other FOX genes that result in cell cycle phenotypes when knocked down in siRNA screens, including FOXA2, FOXA1, and FOXL2 (Mukherji ; Kittler ). Here we identify the cell cycle–regulated genes in U2OS cells after synchronization using either a double-thymidine block or a thymidinenocodazole block. We identify 2140 transcripts mapping to 1871 unique cell cycle–regulated genes and analyze FOXM1 genomic loci by genome-wide chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq). We show that FOXM1 is bound to many genes involved in the G2- to M-phase transition, as well as a number of genes involved in DNA replication. Inclusion of ChIP-seq data for E2F1, 4, and 6 from the Encyclopedia of DNA Elements (ENCODE) project, FOXK1 (Grant ), and the DREAM complex (Sadasivam ) provides a comprehensive picture of the transcriptional regulatory system controlling periodic gene expression in the human cell division cycle.

RESULTS

We identified the full complement of genes periodically expressed across multiple synchronous cell division cycles in the commonly studied human U2OS osteosarcoma-derived cell line (Ponten and Saksela, 1967; Laoukili ). This extends and contrasts with periodic, cell cycle–regulated gene expression data from three additional human cell lines: HeLa cells (Whitfield ; Sadasivam ), primary foreskin fibroblasts (Cho ; Bar-Joseph ), and HaCaT cells (Pena-Diaz ).

Identification of periodically expressed transcripts

Two complementary methods were used to obtain synchronous populations of U2OS cells. The first is a thymidinethymidine (Thy-Thy) synchronization that arrests cells at the G1/S boundary, and the second is a thymidinenocodazole (Thy-Noc) synchronization that arrests cells in mitosis. Each of these methods provides the best synchrony in the cell cycle phases that immediately follow the arrest. Thy-Thy provides the best synchrony for the G2 and M phases, and Thy-Noc provides the best synchrony in G1/S and S phases. Three Thy-Thy synchronization time courses were performed, which sampled the population at 2-h intervals for 38 h (Thy-Thy 1) or 46 h (Thy-Thy 2 and Thy-Thy 3; Figure 1; all figures are available in blue/yellow color format in Supplemental Figure S10). The Thy-Thy time courses measured gene expression across 2.5 synchronous cell cycles, traversing two complete cycles of S phase and mitosis, and into a third round of DNA replication. A fourth time course was performed using the Thy-Noc synchronization protocol, and time points were collected at 2-h intervals for 44 h, which resulted in two full cycles of mitosis and S phase (Figure 1). Gene expression was measured genome-wide using these two complementary methods across four independent time courses, resulting in the analysis of multiple synchronous cell cycles.
FIGURE 1:

Periodic expression of well-characterized cell cycle genes. (A) Expression profiles of known periodically expressed genes for each of the four time courses. The purple bars indicate S phase, and black arrows indicate mitosis as estimated by flow cytometry or Western blots. (B) The idealized vector for each phase is the average expression of the known genes shown in A.

Periodic expression of well-characterized cell cycle genes. (A) Expression profiles of known periodically expressed genes for each of the four time courses. The purple bars indicate S phase, and black arrows indicate mitosis as estimated by flow cytometry or Western blots. (B) The idealized vector for each phase is the average expression of the known genes shown in A. The degree of synchrony in each cell cycle time course was monitored by flow cytometry analysis of DNA content by propidium iodide staining (Supplemental Figure S1). In addition, cell lysates were prepared and Western blots performed for well-characterized cell cycle–regulated proteins (CCNB1 and FOXM1; unpublished data). All data used in this study were from experiments that showed clear cell cycle synchrony using these two measures. Periodically expressed genes were identified as previously described (Whitfield ). We collected and isolated total RNA from synchronous U2OS cells every 2 h for a minimum of 38 h and determined the expression of all genes in the genome using Agilent 4 × 44,000–element DNA microarrays (41,000 probes representing 19,637 unique Entrez GeneIDs). Each time course was analyzed independently. To verify our ability to detect periodic cell cycle–regulated genes, we selected a set of known cell cycle–regulated genes from the literature previously shown to have peak expression in each of the five major phases of the cell cycle (G1/S, S, G2, G2/M, and M/G1; Figure 1). Expression of these genes in each time course is shown with DNA replication and mitosis indicated (Figure 1). For G1/S, we examined CCNE1, CCNE2, E2F2, PCNA, and MCM2. For S phase, we selected genes involved in DNA replication or the packaging of newly replicated DNA, including HIST1H4B, HIST1H4C, SLBP, RRM2, and RFC4. G2 was represented by TOP2A, CDC2, CCNA2, and CCNF. G2/M included five genes required for mitosis: CCNB1, CCNB2, BUB1, BUB1B, and CDC25B. Following on the observation from Whitfield that a set of genes showed peak expression during mitosis into G1, we selected three genes for the M/G1 transition: RAD21, CDKN3, and PTTG1. These genes were averaged to generate an idealized expression vector for each cell cycle phase (Figure 1B). A Fourier transform (Whitfield ) was used to generate a periodicity score for each gene. The periodicity score quantifies how periodic a gene is across each of the four time courses and is the magnitude of the sine and cosine components of the Fourier analysis (see Materials and Methods). The periodicity score for each gene was summed across the four time courses and then scaled by the highest correlation to one of five idealized vectors of the known cell cycle genes for each cell cycle phase (G1/S, S, G2, G2/M, M/G1; Figure 1; Spellman ; Whitfield ). Table 1 gives examples of genes and their periodicity score (Supplemental Table S1).
TABLE 1:

Sample periodicity scores from selected cell cycle–regulated genes.

RankGenePeriodicity scoreMaximum fold change
1SHISA327.07
2PIF126.54.5
4KIF20A23.56
7PLK122.53
11CCNB119.52.5
14CCNE119.05.5
23CENPE116.85
161PCNA9.153.5
964CIT4.901.5
1162BRCA14.502
1288PRIM14.282.5
2557E2F13.122

The maximum fold change is the largest difference in peak-to-trough ratio for each gene rounded down to the nearest 0.5 fold change.

Sample periodicity scores from selected cell cycle–regulated genes. The maximum fold change is the largest difference in peak-to-trough ratio for each gene rounded down to the nearest 0.5 fold change. To establish a cutoff for a cell cycle–regulated gene, we randomized the data either by rows or both rows and columns and repeated the analysis on the randomized data to estimate a false discovery rate (FDR). We selected a periodicity score of 2.65, which resulted in 3568 periodically expressed probes with estimated FDR of 1.29% when randomizing by rows and columns and 3.67% when randomizing by rows only. To estimate the false-negative rate (FNR), we used a list of known cell cycle–regulated genes (Whitfield , Table 2). We determined the FNR at this cutoff to be 11%.
TABLE 2:

Enrichment of transcription factor–binding loci by cell cycle phase.

Transcription factorG1/SSG2G2/MM/G1Minimum pMinimum p phase
E2F1 (HeLa)1.20E-060.160.700.411.001.20E-06G1/S
E2F6 (K562)0.00100.200.591.000.360.0010G1/S
E2F6 (K562b)0.00180.180.911.000.120.0018G1/S
E2F6 (HeLa)0.290.421.000.690.460.29G1/S
FOXM10.790.531.09E-408.79E-330.000541.09E-40G2
E2F4 (HeLa)1.43E-050.059.71E-165.51E-090.669.71E-16G2
E2F4 (K562b)9.73E-070.000762.78E-101.56E-070.662.78E-10G2
As in the prior analysis of HeLa cells, we found that a portion of the probes display a sinusoidal pattern in the first cell cycle but do not repeat in the second cell cycle (Whitfield ). These genes are likely induced by the synchronization procedure. To filter out these probes, we calculated an autocorrelation score for each gene in each time course. The autocorrelation determines whether the gene expression ratio at a given time is a good predictor of the ratio one cell cycle later. Therefore genes that oscillate and thus repeat their expression values over multiple cell cycles receive high autocorrelations, whereas genes that do not oscillate receive low or negative autocorrelations. Altogether, 690 probes with negative autocorrelations were removed, leaving 2878 probes. As a final filtering step, we removed 48 probes that showed systematic bias associated with date of hybridization. These probes had no clear periodic expression and are almost certainly systematic artifacts from microarray hybridization. To objectively remove genes with “date-biased” expression patterns, we projected the spectrum for each of the 2878 probes onto the first two principal components and organized by k-means clustering. Forty-eight probes that displayed expression patterns that correlated with date of hybridization clustered together (Supplemental Figure S2A). Those 48 probes that grouped together with alternating high and low expression levels, correlated with the hybridization batch (Supplemental Figure S2, A and B), were removed, leaving 2830 cell cycle–regulated probes corresponding to 2140 Entrez GeneIDs, with 1871 of these being unique (Supplemental Table S2).

Overview of periodic gene expression in U2OS cells

To display the selected genes in the order of their timing of peak expression within the cell division cycle, we organized genes by their phase of peak expression determined by Fourier analysis (Figure 2A). The alternating pattern of high and low gene expression levels shows persistent waves of gene expression across multiple synchronous cell cycles. The phase of peak expression of each gene was assigned based on peak correlation to the idealized phase expression profiles shown in Figure 1. This method assigns the probes to G1/S (1018 probes; 702 unique genes), S phase (546 probes; 355 unique genes), G2 phase (390 probes; 278 unique genes), G2/M (598 probes; 392 unique genes), and M/G1 (278 probes; 144 unique genes; Figure 2A).
FIGURE 2:

Periodically expressed genes in the U2OS cell cycle. (A) The 2830 probes that show periodic expression in U2OS cells, ordered by the point of their peak expression as calculated from their sine and cosine values in the Fourier transform. Phase assignments were performed using the correlation coefficients with the ideal vectors defined in Figure 1. The color bar on the right indicates the phase of peak expression (G1/S, green; S, blue; G2, red; G2/M, yellow; and M/G1, black); blended colors indicate interspersed phase assignments. (B) The 2830 periodically expressed probes from U2OS cells were ordered by average linkage hierarchical clustering. The enriched biological processes of select clusters is indicated.

Periodically expressed genes in the U2OS cell cycle. (A) The 2830 probes that show periodic expression in U2OS cells, ordered by the point of their peak expression as calculated from their sine and cosine values in the Fourier transform. Phase assignments were performed using the correlation coefficients with the ideal vectors defined in Figure 1. The color bar on the right indicates the phase of peak expression (G1/S, green; S, blue; G2, red; G2/M, yellow; and M/G1, black); blended colors indicate interspersed phase assignments. (B) The 2830 periodically expressed probes from U2OS cells were ordered by average linkage hierarchical clustering. The enriched biological processes of select clusters is indicated. The expression program in U2OS cells has two distinct waves of expression—one that occurs in G1 and S phases, and a second that occurs in G2 and M phases. We displayed the distribution of genes in the two phases by clustering the correlation values for each gene to the five idealized expression vectors (Supplemental Figure S3). Genes assigned to G1/S or S phase had similar correlation scores to the idealized G1/S and S-phase vectors, whereas genes that were classified as G2, G2/M, or M/G1 had high correlations to the G2 and G2/M vectors. Considerable overlap is observed among genes of the G1/S and S-phase transition and between the genes of the G2 and G2/M transition. Thus the boundaries between G1/S and S phase and between G2 and M phase are necessarily somewhat arbitrary, as genes assigned to G1/S can be immediately adjacent to genes assigned to S phase. In contrast, there is a distinct break between the G1- and S-phase genes and the G2- and M-phase genes. We analyzed the same distributions in HeLa cells, where we did not observe a distinct break between the two waves of expression (Supplemental Figure S3). This suggests that the distinct break observed here may be specific to U2OS cells and reflect specific mutations or biological changes in this cell line. To organize the genes in an unbiased manner, we hierarchically clustered all cell cycle–regulated probes (Eisen ). The Database for Annotation Visualization and Integrated Discovery (DAVID; Dennis ) was used to identify enriched cellular processes in each cluster. Genes involved in mitosis (M phase, p = 6.6 × 10−42), including cyclins A2, B1, B2, and F, primarily fell into one large cluster, whereas genes involved in DNA replication separated into three large clusters, each with weaker but still significant levels of enrichment for S-phase processes (DNA replication, p = 1.4 × 10−10; DNA metabolic process, 1.4 × 10−6). The first cluster of S-phase genes includes four minichromosome maintenance proteins (MCM 2, 3, 4, and 10), PCNA, CDT1, CHAF1A, CHAF1B, E2F2, and E2F8. The second cluster of DNA replication genes includes RMI1, DSCC1, and MCM6. The third and final DNA replication cluster includes two more E2F genes, E2F1 and E2F7, PLK3, RMI2, CDC45L, RBBP8, DHFR, BRIP1, PRIM1, RRM2, and RFC4. A small but distinct cluster was found completely comprising histone genes (nucleosome assembly, p = 4.9 × 10−23). There was also a small cluster of genes containing primarily heat shock proteins (labeled the HSP70 cluster), as well as the HSP70-binding protein BAG3, which has antiapoptotic properties (Takayama ; Doong ; Romano , b). The overall patterns of gene expression reflect the fundamental biological processes necessary to duplicate a cell, such as DNA replication, DNA packaging, formation of the mitotic spindle, and mitosis. It includes not only the proteins required to perform these processes (e.g., MCM proteins, the histones, proteins at the DNA replication fork, nucleotide biosynthesis, nucleosome assembly and mitotic genes), but also key regulators of those genes (e.g., E2F transcription factors, cyclins, and key kinases such as PLK). Each of these is coordinately regulated in a tightly controlled manner, with peak expression at their required points in the cell cycle and then reduced expression until they are needed the following cell cycle.

Cell cycle regulation by FOXM1

To understand the network of transcription factors controlling cell cycle–regulated gene expression and better understand the role of FOXM1 in the transcriptional program that regulates mitosis, we performed ChIP-seq for endogenous FOXM1 in asynchronous HeLa cells. IPs were performed in duplicate using an antibody against the endogenous FOXM1 (C20; Santa Cruz Biotechnology) and sequenced independently. As a conservative estimate of FOXM1 binding, we focused the analysis on those ChIP-seq regions found in both ChIP-seq assays. The first sequencing analysis (run A) gave 17.1 million sequence reads, with 8.4 million reads mapped (Hg 19), and the second sequencing analysis (run B) gave 17.0 million sequence reads, with 8.4 million mapped. Using the MACS module in Cistrome (Zhang ; Liu ; www.cistrome.com), we found 5727 peaks in A and 2849 peaks in B, with 2215 regions identified in both; these regions could be associated with 2367 unique genes by the Gene Centered Annotation (GCA) module in Cistrome (Shin ; Figure 4 and Supplemental Figure S4). We refer to these intersecting regions as FOXM1 genomic loci. Of the genomic loci, 36.8% were in promoter regions within 3000 base pairs upstream from the transcription start site, and 1.9% of the genomic loci were within 3000 base pairs downstream of the transcription start site. Of the genomic loci, 20.4% were in the 5′-untranslated region (5′UTR) of target genes and 1.1% were in the 3′UTR. Genomic loci in exonic regions accounted for 5.3%, and 17.8% of the loci were in intronic regions. Distal intergenic regions accounted for 16.6% of the ChIP-seq genomic loci (Figure 3A).
FIGURE 4:

The expression profiles of genes bound by FOXM1. The clustered U2OS and HeLa cell expression profiles of the 501 genes bound by FOXM1 that are cell cycle regulated in U2OS cells. FOXM1 transcription factor binding is shown as percentage coverage of the UCSC genome browser gene model as defined by GCA for each gene (see Supplemental Figure S6 for more detail).

FIGURE 3:

FOXM1 ChIP-seq indicates a role for FOXM1 in the cell cycle and translation. (A) CEAS analysis of the overlapping FOXM1 ChIP-seq analysis shows enrichment in promoters, downstream regions, 5′UTRs, and coding exons relative to background. (B) Venn diagram showing the overlapping gene targets of FOXM1, B-Myb, and LIN9 (b-Myb and LIN9 results from Sadasivam ). There is enrichment in cell cycle Gene Ontology (GO) terms for biological process (BP) and cellular component (CC) in the overlapping gene lists for FOXM1/B-Myb, FOXM1/B-Myb/LIN9, and FOXM1/LIN9. FOXM1 genes that do not overlap with B-Myb or LIN9 show an enrichment in the GO term translation. (C) Bar graph showing the percentage of FOXM1 targets per cell cycle phase. (D) Bar graph showing the percentage of genes in each phase that are bound by FOXM1.

FOXM1 ChIP-seq indicates a role for FOXM1 in the cell cycle and translation. (A) CEAS analysis of the overlapping FOXM1 ChIP-seq analysis shows enrichment in promoters, downstream regions, 5′UTRs, and coding exons relative to background. (B) Venn diagram showing the overlapping gene targets of FOXM1, B-Myb, and LIN9 (b-Myb and LIN9 results from Sadasivam ). There is enrichment in cell cycle Gene Ontology (GO) terms for biological process (BP) and cellular component (CC) in the overlapping gene lists for FOXM1/B-Myb, FOXM1/B-Myb/LIN9, and FOXM1/LIN9. FOXM1 genes that do not overlap with B-Myb or LIN9 show an enrichment in the GO term translation. (C) Bar graph showing the percentage of FOXM1 targets per cell cycle phase. (D) Bar graph showing the percentage of genes in each phase that are bound by FOXM1. The expression profiles of genes bound by FOXM1. The clustered U2OS and HeLa cell expression profiles of the 501 genes bound by FOXM1 that are cell cycle regulated in U2OS cells. FOXM1 transcription factor binding is shown as percentage coverage of the UCSC genome browser gene model as defined by GCA for each gene (see Supplemental Figure S6 for more detail). Recently it was shown using ChIP-reChIP that B-Myb and FOXM1 cooccupy the promoters of genes involved in mitosis (PLK1, CCNB1, and AURKA; Sadasivam ). Similar results were found using ChIP-reChIP of LIN9 followed by FOXM1. We compared our FOXM1 ChIP-seq results with the ChIP-seq results of Sadasivam Figure 3B). The gene targets found in all three ChIP-seq experiments were enriched for genes involved in mitosis (DAVID, M phase, p = 3.26 × 10−39). There was also enrichment for cell cycle–related processes for the FOXM1/B-Myb overlap (Figure 3B; cell cycle, p = 5.35 × 10−06) as well as for the FOXM1/LIN9 overlap (cell division, p = 6.36 × 10−05). Of interest, after removal of the FOXM1 target genes from the B-Myb/LIN9 target list (i.e., target genes of all three transcription factors vs. targets of B-Myb and LIN9 but not FOXM1), the most enriched biological process was actin cytoskeleton organization (p = 8.18 × 10−05). The list of FOXM1 target genes only was enriched for the biological process of translation (p = 3.49 × 10−46) and translation elongation (p = 1.32 × 10−27; Figure 3B). We display the expression of genes bound by FOXM1 in our ChIP-seq that were also cell cycle regulated (Figure 4), as well as those that were not cell cycle regulated (Supplemental Figure S4). To represent FOXM1 binding relative to gene models, we show the percentage coverage of different regions of each gene model as defined by GCA (Supplemental Figure S6). We then linked the genes for each FOXM1 ChIP-seq loci via Entrez GeneIDs to genes that are cell cycle regulated in U2OS cells. Of the 1871 unique cell cycle–regulated genes in U2OS cells, 287 showed evidence of FOXM1 occupancy at their promoters. Because FOXM1 is known to drive the expression of G2/M phase genes, we first examined the expression of known G2/M FOXM1 targets, AURKB, CCNB1, CCNB2, PLK1, and TOP2A, which all had FOXM1 bound in their promoters (Supplemental Figure S5). Of the 278 genes expressed in G2, 98 (35.2%, p < 0.001) were bound by FOXM1 in our ChIP-seq data. Of the 392 genes expressed in G2/M, 102 (26%, p < 0.001) were bound by FOXM1. Progressing through the cell cycle, there were 16 M/G1 genes bound by FOXM1 out of 144 (11.1%; not significantly enriched [NS]). Surprisingly, we found a number of G1- and S-phase genes that were bound by FOXM1, including TYMS, RMI1, and replication-dependent histone genes. FOXM1 binds to 6.1% of the genes expressed in G1/S (43 of 702; NS) and 7.8% of S-phase genes (28 of 355; NS). Many FOXM1 target genes in different cell types have been reported (reviewed in Wierstra and Alves, 2007). Given that FOXM1 has roles in cell cycle progression, metastasis (through MMP2 and MMP9; Wang ), and development in many cell types, we focused on FOXM1 targets in cell types that display epithelial morphology. In our ChIP-seq analysis we identified the promoters of all four previously published FOXM1 targets in HeLa cells (AURKA, PLK1, CCNB1, and RACGAP1; Sadasivam ). Looking at genes previously shown to be bound by FOXM1 by ChIP in U2OS cells, we found five of nine published FOXM1 targets (Dai ; Wang ; Chen ; Chetty ; Ahmed ). Of the previously reported G2- or M-phase FOXM1 target genes that were also cell cycle regulated in HeLa cells, 32 of 39 (∼82%) were found in our analysis of U2OS cells, whereas only five of 16 (∼31%) of the published FOXM1 targets expressed in G1 or S phases were identified in our analysis in U20S cells (Supplemental Figure S6 and Supplemental Table S4). Most FOXM1 ChIP-seq target genes were cell cycle regulated in U2OS cells (27 of the top 30) and HeLa cells (28 of the top 30; Whitfield ). Of the 12 published FOXM1 target genes not found in our ChIP-seq data, only three were cell cycle regulated: CDC25A in U2OS and HeLa, BRCA2 in U2OS only, and SKP2 in HeLa only (Supplemental Figure S7). To determine whether FOXM1 can induce luciferase expression from promoters found in our ChIP-seq data, we used the LumiCycle, a luminometer that measures luciferase expression from live cells in real time (Grant ). We cotransfected U2OS cells with either a Flag-tagged FOXM1C construct or the empty vector (pBABE-puro) and a promoter luciferase target. We tested the ability of FOXM1 to activate seven genes found among our ChIP-seq data. These included the well-characterized FOXM1 targets CENPE, PLK1, and TOP2A genes, with peak expression in G2 or G2/M phase. We also tested a subset of the novel FOXM1 targets identified in our screen: MCM8, RPS6KB1, and RMI1. As a negative control, we used the FOXK1 target ACAP3 (Grant ), which contains a forkhead box cis-element but is not bound by FOXM1, as determined by our ChIP-seq analysis. As expected, the ACAP3 promoter showed no induction after transfection with FOXM1, whereas all other promoters were activated by exogenous FOXM1C expression (Figure 5). Given that both MCM8 and RMI1 are expressed in S phase, this demonstrates that FOXM1C can activate expression of S-phase genes, albeit at a lower level of induction. RPS6KB1 is not cell cycle regulated in HeLa cells but is in U2OS cells and can be overexpressed in cancers (Sinclair ; Yamnik ). RPS6KB1, a downstream target of mammalian target of rapamycin (mTOR), plays multiple roles in translation, including phosphorylation of eIF-4B, as well as phosphorylation of the negative regulator of eIF4A, PDCD4, targeting PDCD4 for destruction (reviewed in Fenton and Gout, 2011).
FIGURE 5:

FOXM1 overexpression activates the expression of a number of target genes found via ChIP-seq. (A) Neither FOXM1 (light line) nor the empty vector control (dark line) activates the ACAP3 promoter. ACAP3 is a known FOXK1 target (Grant ) and is used as a negative control. (B–D) FOXM1 activates the promoters of PLK1, CENPE, and TOP2A, known FOXM1 targets (Figure 4 and Supplemental Figure S6). (E) FOXM1 activates the MCM8 promoter. (F) FOXM1 activates the promoter of RPS6KB1, which phosphorylates the 6S ribosomal protein. (G) The previously unknown FOXM1 target RMI1 promoter is activated by FOXM1 overexpression. Error bars, SEM. Time points were binned and averaged for every 24 h for a one-way analysis of variance. *p ≤ 0.05; **p ≤ 0.01, ***p ≤ 0.005.

FOXM1 overexpression activates the expression of a number of target genes found via ChIP-seq. (A) Neither FOXM1 (light line) nor the empty vector control (dark line) activates the ACAP3 promoter. ACAP3 is a known FOXK1 target (Grant ) and is used as a negative control. (B–D) FOXM1 activates the promoters of PLK1, CENPE, and TOP2A, known FOXM1 targets (Figure 4 and Supplemental Figure S6). (E) FOXM1 activates the MCM8 promoter. (F) FOXM1 activates the promoter of RPS6KB1, which phosphorylates the 6S ribosomal protein. (G) The previously unknown FOXM1 target RMI1 promoter is activated by FOXM1 overexpression. Error bars, SEM. Time points were binned and averaged for every 24 h for a one-way analysis of variance. *p ≤ 0.05; **p ≤ 0.01, ***p ≤ 0.005.

Transcriptional regulators of the cell cycle gene expression program

To identify the transcriptional regulators of the genome-wide cell cycle gene expression program, we obtained ChIP-seq data from the ENCODE project for transcription factors E2F1, E2F4, E2F6, and GA-binding protein transcription factor, α subunit 60 kDa (GABPA [GABP]), each implicated in cell cycle control. We reanalyzed the data to identify ChIP-seq loci using MACS and associated loci with gene regions. We also included ChIP-seq data for FOXM1 and FOXK1 (Grant ). Cell cycle gene expression data were collapsed to Entrez GeneIDs and merged with the Entrez GeneID–associated ChIP-seq loci. Gene expression and ChIP-seq data were coclustered and in the following are discussed in detail by biological process. The DNA replication cluster includes genes involved in the process of DNA synthesis, including components of the prereplication complex (MCM2, 3, 4, and 6), nucleotide biosynthesis (RRM2), DNA replication (PCNA, CLSPN, EXO1, POLD3, RFC4, and DCC1), and DNA packaging (CHAF1A, CHAF1B, and SLBP). We also find genes involved in S-phase regulation (E2F1, CDC6, CDC25A, CCNE1, and RBBP8) and DNA repair (UNG, BARD1, BRIP1, and RAD51). The G1/S-phase transcriptional regulator E2F1 is bound to 67% (68 of 101) of the unique genes in this DNA replication cluster. E2F4, which is a member of the DREAM complex that represses the expression of S-phase genes during G0 (Litovchick ), is bound to 60% of the genes in this cluster (61 of 101). Greater than half (56) of the genes in this cluster have both E2F1 and E2F4 bound (Figure 6A). Comparison with cell cycle–regulated genes in HeLa cells (Whitfield ) shows that 42% (39 of 93) of those bound by E2F1 or E2F4 were regulated there.
FIGURE 6:

Biological process–specific gene clusters have coordinated transcription factor binding. (A) After the inclusion of FOXM1 ChIP-seq data as well as ENCODE data, hierarchical clustering shows that the genes in the U2OS S phase cluster are predominantly bound by E2F1 and E2F4. (B) The histone cluster shows E2F4 bound to more gene models than E2F1 or FOXM1. (C) Genes in the mitosis gene cluster are primarily bound by FOXM1 or E2F4. Transcription factor binding is shown as percentage of coverage of each portion of the gene model as defined by GCA (see Supplemental Figure S6 for more details).

Biological process–specific gene clusters have coordinated transcription factor binding. (A) After the inclusion of FOXM1 ChIP-seq data as well as ENCODE data, hierarchical clustering shows that the genes in the U2OS S phase cluster are predominantly bound by E2F1 and E2F4. (B) The histone cluster shows E2F4 bound to more gene models than E2F1 or FOXM1. (C) Genes in the mitosis gene cluster are primarily bound by FOXM1 or E2F4. Transcription factor binding is shown as percentage of coverage of each portion of the gene model as defined by GCA (see Supplemental Figure S6 for more details). Histone genes clustered together in a tight group likely due to their unique mechanism of regulation, which is primarily by posttranscriptional mechanisms controlled by histone SLBP (Wang ; Whitfield ). Consistent with this, only two of the genes in this cluster are bound by E2F1, but, surprisingly, 53% of the histone genes are bound by E2F4 (9 of 17 genes in the cluster) and 47% by FOXM1 (8 of 17). Given that E2F4 is known to repress S-phase genes in G0, it may play a similar role for the histone genes, whose expression is tightly restricted to S phase. Of these genes, 18% (3 of 17) were bound by FOXK1, recently shown to be involved in G1/S-phase gene activation (Figure 6B). The mitosis cluster contains genes involved in the processes of DNA segregation and chromosome organization (KIF11, KIF2C, and KIF4A) and regulation of mitosis (PLK1, CCNB1, CCNB2, BUB1, STK6 [AURKA], CCNA2, CCNF, and CDC25C). Of the mitotic genes, 58% (68 of 118, p < 0.001) are bound by E2F4, likely as a component of the repressive DREAM complex; 76% (90 of 118, p < 0.001) are bound by FOXM1, and only 24% (28 of 118, NS) are bound by E2F1. Eleven percent (13 of 118, NS) are bound by E2F6 and 3% (4 of 118, NS) by FOXK1. GABPA binds to 35% (41 of 118, p < 0.001), albeit at a lower percentage of promoter coverage than FOXM1 or E2F4 (Figure 6C). Of the genes found in this cluster, 37% (44 of 118) are also cell cycle regulated in HeLa cells, corresponding to the “spindle assembly” and “mitotic surveillance” clusters (Whitfield ).

Transcription factor binding by cell cycle phase

Transcription factor binding across the cell cycle was displayed as a function of the significance of enrichment (Fisher's exact test) of a transcription factor–binding site over a sliding window of phase-ordered genes (Figure 7A). This demonstrates that E2F1 binding shows significant enrichment in late G1/S phase of the cell cycle, whereas FOXM1 binding is significantly enriched at G2/M phase of the cell cycle. As expected, both show low enrichment of binding in the opposite phase. E2F4 shows biphasic binding, with enrichment in G1/S and G2/M phases. We also examined gene density by cell cycle phase and found that the regions with peak binding of these three transcription factors correspond to the highest gene density (Figure 7B).
FIGURE 7:

Transcription factor binding as a function of cell cycle phase. (A) Enrichment of transcription factor targets using a sliding window across the cell cycle indicates enrichment of E2F1, E2F4, and FOXM1 binding to genes that show peak expression at specific times during the cell cycle. Phase angles were calculated from the arc tangent of the Fourier analysis. Transcription factor binding enrichment was calculated using Fischer's exact test over a sliding window of 30° with a 10° overlap between neighboring windows. Cell cycle phase is indicated (B) Gene density in each sliding window was calculated for the expression of all the cell cycle–regulated genes in U2OS cells. (C) Enrichment of transcription factor targets in each cell cycle phase was calculated using estimated phase boundaries rather than a sliding window. The significance of transcription factor binding enrichment was calculated using Fischer's exact test. Transcription factor binding data were from HeLa cells, except for E2F6, which was from K562 cells. The p values for each transcription factor at each phase are given in Table 2.

Transcription factor binding as a function of cell cycle phase. (A) Enrichment of transcription factor targets using a sliding window across the cell cycle indicates enrichment of E2F1, E2F4, and FOXM1 binding to genes that show peak expression at specific times during the cell cycle. Phase angles were calculated from the arc tangent of the Fourier analysis. Transcription factor binding enrichment was calculated using Fischer's exact test over a sliding window of 30° with a 10° overlap between neighboring windows. Cell cycle phase is indicated (B) Gene density in each sliding window was calculated for the expression of all the cell cycle–regulated genes in U2OS cells. (C) Enrichment of transcription factor targets in each cell cycle phase was calculated using estimated phase boundaries rather than a sliding window. The significance of transcription factor binding enrichment was calculated using Fischer's exact test. Transcription factor binding data were from HeLa cells, except for E2F6, which was from K562 cells. The p values for each transcription factor at each phase are given in Table 2. To determine the specific cell cycle phase in which each transcription factor is enriched, we calculated enrichment using the phase assignments for each gene that we assigned using our idealized vectors (Figure 7C and Table 2). We find that E2F1 binding is significantly enriched in G1/S (p = 1.20 × 10−6). E2F enrichment is not significant in other cell cycle phases but is bound to 47% (16 of 34) of S-phase genes, 26% (16 of 60) of G2 genes, 27% (20 of 74) of G2/M genes, and 10% (2 of 20) M/G1 genes. E2F4 shows biphasic enrichment, with significant interactions with genes that peak at G1/S (p = 1.43 × 10−5), as well as with those that show peak expression at G2/M phase (p = 8.79 × 10−33; Table 2 and Figure 7C). E2F4 is bound to ∼53% (35 of 65) of G1/S, ∼44% (15 of 34) of S phase, and ∼28 (17 of 60) of G2 genes and ∼48% (36 of 74) G2/M phase targets. FOXM1-binding sites show significant enrichment for genes that peak in G2 (p = 1.09 × 10−40), G2/M (p = 8.79 × 10−33), and M/G1 (p = 5.4 × 10−4). FOXM1 is bound to a 73% (44 of 60) of G2 genes and 70% (52 of 74) of G2/M-phase genes but only ∼15% (10 of 65) of G1/S and ∼20% (7 of 34) of S-phase genes. E2F6 showed only weak enrichment for G1/S (p = 0.0010 and 0.0018) and only when using binding sites defined in K562 or K562b cells (Table 2 and Figure 7C). These data demonstrate a complex interplay of regulation between G1/S- and G2/M-phase transcription factors, which includes a biphasic binding by E2F4. Enrichment of transcription factor–binding loci by cell cycle phase.

Identification of a common set of cell cycle–regulated genes

To identify the common cell cycle genes regulated across multiple cell types, we merged and compared the U2OS, HeLa, and ENCODE data (Figure 8). We compared the 1871 unique genes cell cycle regulated in U2OS cells to the 651 unique cell cycle-regulated genes in HeLa cells. We found that 253 of the 651 (38.8%) HeLa cell cycle–regulated genes were also cell cycle regulated in U2OS cells. These overlapping genes predominantly consisted of those involved in core processes of mitosis and S phase. Many of these genes are well known cell cycle genes, such as PLK1, CCNB1, E2F1, and CCNE2; however, there were a number of less well-characterized genes that are cell cycle regulated in all four cell types, including FLJ10156 (FAM64A), ARL6IP, DLG7 (DLGAP5), HMGB3, TROAP, SHCBP1, ANP32E, GPR126, HJURP, SLC38A2, and C14orf130 (UBR7), which may represent less well-studied proteins involved in essential biological processes required to duplicate a cell.
FIGURE 8:

Core cell cycle–regulated genes between HeLa cells and U2OS cells are bound by FOXM1 or E2F1. The 253 genes that show periodic regulation in both HeLa cells and U2OS cells have been clustered after the inclusion of the FOXM1 and FOXK1 ChIP-seq data and the ENCODE data for selected transcription factors. Transcription factor binding is shown as percentage of coverage of each portion of the gene model as defined by GCA (see Supplemental Figure S6 for more details).

Core cell cycle–regulated genes between HeLa cells and U2OS cells are bound by FOXM1 or E2F1. The 253 genes that show periodic regulation in both HeLa cells and U2OS cells have been clustered after the inclusion of the FOXM1 and FOXK1 ChIP-seq data and the ENCODE data for selected transcription factors. Transcription factor binding is shown as percentage of coverage of each portion of the gene model as defined by GCA (see Supplemental Figure S6 for more details).

DISCUSSION

We identified 1871 unique genes expressed in a cell cycle–dependent manner in the well-studied U2OS cancer cell line. This cell line was chosen because it is amenable to multiple synchronization methods, with a low percentage of noncycling cells. Each gene was assigned to a cell cycle phase: 56% had peak expression in G1 or S phase, whereas 35.8% had peak expression in G2 or M phase.

Comparison to other cell cycle expression data sets in mammalian cells

There have been a number of reports on the periodic expression of cell cycle genes in mammalian cells. These reports often used different cell lines/types, as well as different analysis methods. We focus on three studies that use human cells and either a double-thymidine or thymidinenocodazole block. Whitfield ) published the cell cycle–regulated genes in HeLa cells. They reported 651 unique genes that were cell cycle regulated. Here we report 1871 unique genes that are cell cycle regulated in U2OS cells. Of these genes, 253 (38.8%) are cell cycle regulated in both cell lines when analyzed using the same analysis pipeline. Genes cell cycle regulated in both studies include well-known genes that have peak expression in G1/S phase, such as E2F1, BARD1, CHAF1A, CHAF1B, as well as CCNE1 and CCNE2. This also holds true for genes involved in DNA replication with peak expression in S phase, including RRM2, PRIM1, RBBP8, and RFC2. U2OS genes peaking in mitosis have good overlap with mitosis genes in HeLa and foreskin fibroblasts, including PLK1, CCNB1, CDC25B, CDC25C, CENPA, CENPE, CENPF, and TOP2A. There are also overlapping genes that have peak expression during mitosis into G1, including CEP70, UBE2D3, BAIAP2, and PDGFA. Thus we identified a large number of previously known cell cycle–regulated genes, as well as a number of novel cell cycle–regulated genes in U2OS cells. Although the methods for determining which genes are cell cycle regulated are very similar for both HeLa and U2OS, there are experimental differences that may affect the results. These include different array platforms and differences in sample preparation, sample collection methodologies, and time-point collection. There are also differences in the lengths of each time course and how many cell cycles were contained in each. HeLa cells have a cell cycle length of ∼14–15 h, so a 48-h time course can contain three complete cell cycles. The cell cycle length of U2OS cells is ∼18–20 h, and so a 48-h time course would be expected to have only ∼2.5 cell cycles. Bar-Joseph ) published the cell cycle–regulated genes in normal foreskin fibroblasts. We compared the U2OS data set to the cell cycle–regulated genes in their experiments (Bar-Joseph ). Of the 480 genes identified as cell cycle regulated in foreskin fibroblasts, 228 (47.5%) were also regulated in U2OS cells. Like the U2OS data set, the samples in the foreskin fibroblast data set were collected every 2 h and arrayed on commercially available arrays (Affymetrix U133A 2.0 arrays; Bar-Joseph ). Thus there are a number of potential reasons for the amount of overlap between the U2OS and the foreskin fibroblast data sets: cell type, transformed versus normal cells, arrays, and analysis methods. This compares similarly to the analysis in Bar-Joseph , in which there was ∼40% overlap between the genes identified as regulated in HeLa cells and those cell cycle regulated in foreskin fibroblasts. Reanalysis of the HeLa data set with their analysis method shows that 362 of 481 genes (75.3%) overlapped (Bar-Joseph ). Overlap of the genes identified as cell cycle regulated in U2OS, HeLa (Whitfield ), and foreskin fibroblasts (Bar-Joseph ) shows 142 cell cycle–regulated genes in all three cell types (Supplemental Table S3). Despite the pairwise overlaps being in the 40% range, there are 142 genes that are cell cycle regulated in all three cell types. Many of these are involved in the core processes of either DNA replication or mitosis, which are tightly regulated and involve a discrete set of genes. These overlapping genes may be “core cell cycle” regulators that are critical for all cell types. Recently Pena-Diaz ) reported the cell cycle–regulated genes in HaCaT human keratinocytes. They found that of 1249 Entrez genes cell cycle regulated in this cell line, 125 genes also were cell cycle regulated in HeLa and foreskin fibroblasts. This is similar to the number we report (142 genes) as cell cycle regulated in three different cell types. The inclusion of HaCat cells in our comparison of all cell types reduces the number of common periodic genes to 96 (Supplemental Figure S8 and Supplemental Table S3). These 96 genes represent genes involved in core cell cycle processes. Many of the cell cycle–regulated genes in all three cell types have either E2F1 or FOXM1 bound at their promoters. This implies that these two transcription factors are responsible for the highly periodic expression patterns seen in either S phase (E2F1) or G2/M (FOXM1). Unfortunately, E2F1 does not bind the promoter of FOXM1, nor does FOXM1 bind the promoter of E2F1, so it appears that E2F1 does not directly induce FOXM1. This would imply that there is not a continuous transcription factor–based circuit that regulates the cell cycle; instead, there is an interplay of transcriptional activation, phosphorylation, degradation, and sequestration (among other methods) that controls cell cycle regulation. It is becoming apparent that there may be cell type–dependent, cell cycle–regulated genes. As cell cycle–regulated genes are catalogued for different cell lines (or types), it will be interesting to determine which genes are cell line dependent and which genes are cell type dependent, and to determine with more precision which genes are invariant across all cycling cells. These results suggest that it may be informative to determine the differences in cell cycle gene expression between different cell types/lines of the same lineage, embryonic stem cells, or induced pluripotent stem cells to begin to determine the changes through developmental lineages.

Regulation of the cell cycle gene expression program

Previous reports showed that FOXM1 is required for proper progression through mitosis due to a requirement for the FOXM1 activation of critical mitosis genes (e.g., PLK1, CDC25B, and CCNB1; Laoukili ; Wonsey and Follettie, 2005). Here, via FOXM1 ChIP-seq, we showed that not only are these genes activated by FOXM1, but in addition a number of other genes that are involved in proper mitotic progression are also activated by FOXM1. The number of genes that are direct FOXM1 targets implies that FOXM1 is a regulator of G2/M-phase transcription. In addition to binding a large number of G2/M-phase genes, FOXM1 binds a large number of targets that are involved with translation, a process that is ongoing throughout the cell cycle. Of interest, FOXM1 is not cell cycle regulated at the RNA level in U2OS cells, whereas it is regulated in HeLa cells. This is possibly due to cell type–specific methods of regulation or perhaps to U2OS cells having a relatively intact p53 pathway whereas HeLa cells do not, due to the presence of HPV E6 and E7 proteins (Gao ). There is 43% overlap between the three S-phase clusters in HeLa cells and the S-phase cluster in U2OS cells. Recently it was shown that FOXM1 binds to G2/M-phase gene promoters via CHR elements (Chen ), which are also important to both DREAM and B-Myb-MuvB complex binding (Sadasivam ). Similar to the results published here (Figure 3B), Chen show that FOXM1 binds to many of the same promoters as LIN9 and B-MYB and that both FOXM1 ChIP-seq data sets are enriched for genes involved in mitosis. However, due to experimental differences, the data set presented here is also enriched for genes involved in translation that are not cell cycle regulated. Our data support the possibility that FOXM1 is weakly bound to a subset of genes expressed during S phase (Figures 6, A and B, and 7). The genes cell cycle regulated in both HeLa and U2OS cells are generally bound by at least one of the cell cycle transcription factors. Of the 253 genes that are cell cycle regulated in U2OS and HeLa cells, 183 (∼72%) of the promoters are bound by at least one transcription factor, 70 (∼28%) of the promoters are bound by two transcription factors, and 63 (∼25%) of the promoters are bound by three or more. Thus each cell cycle gene may be regulated by the combinatorial effects of multiple cell cycle transcriptional factors. This would allow for very precise temporal regulation of cell cycle genes, as well as provide a high degree of redundancy in the system. These data provide a catalogue of the cell cycle–regulated genes in U2OS cells and, along with associated transcription factor binding data, are available to anyone for any purpose. This will provide a resource for the scientific community, and the full data set is available from Gene Expression Omnibus (www.ncbi.nlm.nih.gov/geo/query/acc.cgiat) at accession number GSE52100.

MATERIALS AND METHODS

Cell culture, synchronization, and RNA preparation

HeLa and U2OS cells were passaged in a 37°C humidified incubator in DMEM with 10% fetal bovine serum and 100 U of penicillinstreptomycin following standard protocols. U2OS cells were synchronized using a double-thymidine protocol or a thymidinenocodazole protocol. Briefly, 3.0 × 105 cells were plated in 16 ml of DMEM. After 24 h of growth, thymidine was added to a final concentration of 2.5 mM. After 18 h in thymidine media cells were washed twice with prewarmed CO2-equilibrated phosphate-buffered saline (PBS) and allowed to grow for 8 h in prewarmed CO2 equilibrated DMEM. Again thymidine was added to a final concentration of 2.5 mM for another 18 h. Cells were washed twice with PBS and released into DMEM. For the thymidinenocodazole synchronization, U2OS cells were plated (5.0 × 105 cells) and allowed to grow for 24 h. Thymidine (2.5 mM) was added for 18 h before cells were washed twice with prewarmed CO2-equilibrated PBS before treatment with DMEM supplemented with 100 ng/ml nocodazole for 12 h. Floating cells were collected and spun down, washed twice with prewarmed CO2-equilibrated PBS, and resuspended in prewarmed CO2-equilibrated DMEM. Nonfloating cells were washed twice with prewarmed CO2-equilibrated PBS and released into prewarmed CO2-equilibrated DMEM, and the resuspended floating cells were added back to each plate. Cells were collected every 2 h for a minimum of 36 h using RNeasy Plus Mini Kit (Qiagen, Valencia, CA). Zero-hour samples were collected while cells were still in arrest conditions. Synchrony was monitored via fluorescence-activated cell sorting (FACS) analysis of propidium iodide–labeled cells (DartLab, Geisel School of Medicine at Dartmouth College) and FOXM1 phosphorylation state or cyclin B1 expression via Western blots (see later description). Samples were collected for Western blot analysis using SDS–PAGE sample buffer. Reference RNA was isolated from asynchronously growing U2OS cells using an RNeasy Plus Mini Kit. The same reference was used for all hybridization experiments.

Microarray hybridization and analysis

Microarrays were run as described previously (Grant . Briefly, cellular RNA was amplified and Cy3 (asynchronous U2OS RNA) or Cy5 (sample) labeled using the Quick-Amp Labeling kit (Agilent Technologies, Santa Clara, CA) following the manufacturer's protocol, except that the reaction volumes were reduced by one-half. Labeled cRNA was hybridized to Agilent Whole Human Genome Oligonucleotide arrays (4 × 44k) following the manufacturer's protocol. Microarrays were scanned using a GenePix 4000B scanner (Molecular Devices, Sunnyvale, CA). Spot pixel intensities were determined using GenePix Pro 5.1 software. Poor-quality spots were identified and flagged by hand and excluded from subsequent analysis. Arrays were stored in the University of North Carolina Microarray Database (Chapel Hill, NC; UMD). The full raw microarray data are available from the GEO at accession number GSE50988 (part of SuperSeries GSE52100). Each time course was retrieved from the UMD independently from each other time course with the following conditions. Only spots with ratio of intensity over background of >1.5 were used. Genes missing >30% of their data were excluded from further analysis. Genes were normalized using Lowess normalization. Periodically expressed transcripts were identified using the same method as in Whitfield . Briefly, missing data were imputed using a k-nearest neighbors algorithm (Troyanskaya ) using k = 12. Then each time course was centered by removing the first eigengene (Alter ). Imputed data were removed from the data set as the last step of the analysis. Rough estimates of the U2OS cell cycle were initially obtained from Western blot analysis of cell cycle–regulated phosphorylation of FOXM1 and FACS analysis for each time course. This estimate was then refined by performing a Fourier transform on each gene in each time course (Whitfield , Eqs. 1–3) with equally spaced values of time (every 15 min) for the estimated cell cycle length ±two hours. An offset (φ; Whitfield , Eqs. 1 and 2) was determined for each time course relative to the first time course. The Fourier transform was then repeated for each time course using the following values of T and φ: Thy-Thy 1 (T = 17.65, φ = 0.0), Thy-Thy 2 (T = 18.6, φ = 0.0), Thy-Thy 3 (T = 18, φ = 0.0), and Thy-Noc (T = 23.95, φ = 2.3). The vectors for each data set were then summed and genes ranked by the magnitude of their combined vectors (C). To compensate for the imperfect match to sine or cosine curves, each gene was scaled by its correlation to an idealized vector. The ideal vector for each cell cycle phase (G1/S, S, G2, G2/M, and M/G1) was defined by the average expression profiles of the indicated genes in Figure 1. Using a standard Pearson correlation, each gene received a peak correlation score, which was its largest absolute value correlation with each of the ideal vectors. This peak correlation score was then used to scale each gene's C, generating a periodicity score for each gene. Randomized data were then used to set a cutoff value for the minimum periodicity score to be considered cell cycle regulated. The data were randomized 10 times either within rows only or in rows and columns. The full analysis pipeline was performed for each of these randomizations using the same parameters as for the unrandomized data. We chose a minimum periodicity score of 2.65, which gave us 3568 genes with an initial false-positive rate of 3.67% when randomizing by rows only. Inclusion of the Thy-Noc time course resulted in improved false-positive and false-negative rates, despite having a lower degree of synchrony than the Thy-Thy time courses (Supplemental Figure S9) To account for genes that received a high Fourier score but did not have a sinusoidal expression pattern throughout each time course, we calculated autocorrelation scores (Whitfield , Eq. 5). We calculated and summed the autocorrelation scores for each time course, leaving 2878 genes. To remove any genes that had an obvious date bias from technical issues during array hybridization, we found their power spectra using the Fourier transform of each time course. The date-biased genes were then removed by projecting the power spectra onto their first two principal components and clustered by k-means (k = 2; Supplemental Figure S2). Removing these genes gave us a final data set of 2830 probes. The 2830 probes correspond to 2140 Entrez GeneIDs with 1871 unique gene identifiers.

ChIP-seq and analysis

FOXM1 ChIP-seq was carried out as previously described (Lupien ; Grant ) using the FOXM1 antibody sc-502 (C20; Santa Cruz Biotechnology, Santa Cruz, CA). Briefly, asynchronous HeLa cells were fixed using 1% formaldehyde before sonication to produce DNA fragment lengths of 200–600 base pairs with a Bioruptor (Diagenode, Sparta, NJ). Anti-FOXM1 was bound to a mix of Protein A and Protein G Dynabeads (Life Technologies, Grand Island, NY) before an 18-h incubation at 4°C with the fragmented DNA. Bound DNA was washed and the cross-links reversed before DNA purification with a QIAquick PCR purification kit (Qiagen). DNA concentrations were measured using Quant-iT PicoGreen (Life Technologies). Library construction and sequencing for each ChIP-seq run were carried out independently at the High Throughput Sequencing Facility at the University of North Carolina (Chapel Hill, NC) using an Illumina Genome Analyzer II. Fastq files were mapped to the human genome using Bowtie (version 0.12) using the “best” flag to constrain alignments to those with the best read quality and fewest mismatches. The first ChIP-seq run resulted in 17.1 million sequence reads (8.4 mapped sequence reads), and the second ChIP-seq run resulted in 17.0 million reads (8.4 million mapped reads; human genome build Hg18). Enriched peaks were determined independently for each run using MACS, version 1.3 (run 1, mfold 32, p < 1.0 × 10−5; run 2, mfold 25, p < 1.0 × 10−5; Zhang ). This resulted in 5727 peaks for the first run and 2849 peaks for the second. As a conservative estimate of FOXM1 binding, we analyzed the intersection of the sequences under the peaks that were found in both ChIP-seq runs, resulting in 2215 shared FOXM1 genomic loci. We then determined the distribution of the shared FOXM1 genomic loci using the cis-Regulatory Element Annotation System (CEAS; http://liulab.dfci.harvard.edu/CEAS/; Zhang ; Shin ) implemented in Cistrome (www.cistrome.com/). Raw ChIP-seq data and BED files are available from GEO at accession number GSE52098 (part of SuperSeries GSE52100).

Real-time luciferase assays

U2OS cells were plated at ∼20–25% density in 30-mm dishes and allowed to grow for 24 h. After 24 h, the growth medium was replaced with assay medium (Phenol red–free L15 [Life Technologies], 10% fetal bovine serum, 1% penicillin/streptomycin, 10 mM 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid buffer, and 0.1 mM luciferin). Cells were then transfected with equal amounts of plasmid (typically 250 ng of each plasmid) using FuGENE 6 (Life Technologies) following the manufacturer's protocol. Tissue culture dishes were sealed with glass coverslips and silicone grease and transferred to the LumiCycle (Actimetrics, Wilmette, IL) at 36°C. Data analysis was performed with LumiCycle Analysis software (Actimetrics).

Western blots

Antibodies to FoxM1 C-20 (1:500) and cyclinB1 H-433 (1:2000) were purchased from Santa Cruz Biotechnology. Anti–glyceraldehyde-3-phosphate dehydrogenase was purchased from American Research Products (Belmont, MA). Western blots were run following standard protocols.

Plasmid construction

FOXM1 expression vectors, the ACAP3/CENTB5, and the RPS6KB1 promoter constructs have been described previously (Grant ). We obtained commercially available promoter constructs for PLK1 (S119035), CENPE (S118567), TOP2A (S118760), and RMI1 (S113323) from Switchgear Genomics (Menlo Park, CA). The FOXM1 target promoter construct, pGL3-MCM8, was cloned based on ChIP-seq loci as determined by MACS. Primers were designed using Primer 3 (Rozen and Skaletsky, 2000) to provide an amplicon length between 800 and 1000 base pairs. DNA fragments were amplified via PCR and cloned into Zero Blunt TOPO (Life Technologies) before being subcloned into pGL3-basic (Promega, Madison, WI) using standard methods. All plasmids were verified by sequencing (Molecular Biology and Proteomics Core Facility, Dartmouth College).

Functional annotation

Functional annotation of genes was performed using DAVID (Dennis ; Huang da ).

Cell cycle–wide binding profiles

We investigated the distribution of transcription factor target genes in the cell cycle. First, we identified a list of 2830 cell cycle probes in U2OS cells and sorted them according to their peak expression time in the cell cycle. Then we examined the enrichment of the target genes of a given transcription factor in each sliding window of the cell cycle. We used a window size of 30° with 10° overlap between neighboring windows. We used Fisher's exact test to determine the significance of enrichment of target genes for a transcription factor in each cell cycle window. The target genes for E2F1, E2F4, and E2F6 in HeLa cells were determined from ChIP-seq data generated by the ENCODE project (Gerstein ). The FOXM1 target genes were determined from the ChIP-seq presented here.
  91 in total

1.  FoxM1 is required for execution of the mitotic programme and chromosome stability.

Authors:  Jamila Laoukili; Matthijs R H Kooistra; Alexandra Brás; Jos Kauw; Ron M Kerkhoven; Ashby Morrison; Hans Clevers; René H Medema
Journal:  Nat Cell Biol       Date:  2005-01-16       Impact factor: 28.824

2.  FoxA1 translates epigenetic signatures into enhancer-driven lineage-specific transcription.

Authors:  Mathieu Lupien; Jérôme Eeckhoute; Clifford A Meyer; Qianben Wang; Yong Zhang; Wei Li; Jason S Carroll; X Shirley Liu; Myles Brown
Journal:  Cell       Date:  2008-03-21       Impact factor: 41.582

3.  CEAS: cis-regulatory element annotation system.

Authors:  Hyunjin Shin; Tao Liu; Arjun K Manrai; X Shirley Liu
Journal:  Bioinformatics       Date:  2009-08-18       Impact factor: 6.937

4.  The forkhead transcription factor FOXM1 controls cell cycle-dependent gene expression through an atypical chromatin binding mechanism.

Authors:  Xi Chen; Gerd A Müller; Marianne Quaas; Martin Fischer; Namshik Han; Benjamin Stutchbury; Andrew D Sharrocks; Kurt Engeland
Journal:  Mol Cell Biol       Date:  2012-10-29       Impact factor: 4.272

5.  The Forkhead box M1 protein regulates the transcription of the estrogen receptor alpha in breast cancer cells.

Authors:  Patricia A Madureira; Rana Varshochi; Demetra Constantinidou; Richard E Francis; R Charles Coombes; Kwok-Ming Yao; Eric W-F Lam
Journal:  J Biol Chem       Date:  2006-06-28       Impact factor: 5.157

6.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

7.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.

Authors:  P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher
Journal:  Mol Biol Cell       Date:  1998-12       Impact factor: 4.138

8.  S6 kinase 1 regulates estrogen receptor alpha in control of breast cancer cell proliferation.

Authors:  Rachel L Yamnik; Alla Digilova; Daphne C Davis; Z Nilly Brodt; Christopher J Murphy; Marina K Holz
Journal:  J Biol Chem       Date:  2008-12-27       Impact factor: 5.157

9.  Architecture of the human regulatory network derived from ENCODE data.

Authors:  Mark B Gerstein; Anshul Kundaje; Manoj Hariharan; Stephen G Landt; Koon-Kiu Yan; Chao Cheng; Xinmeng Jasmine Mu; Ekta Khurana; Joel Rozowsky; Roger Alexander; Renqiang Min; Pedro Alves; Alexej Abyzov; Nick Addleman; Nitin Bhardwaj; Alan P Boyle; Philip Cayting; Alexandra Charos; David Z Chen; Yong Cheng; Declan Clarke; Catharine Eastman; Ghia Euskirchen; Seth Frietze; Yao Fu; Jason Gertz; Fabian Grubert; Arif Harmanci; Preti Jain; Maya Kasowski; Phil Lacroute; Jing Jane Leng; Jin Lian; Hannah Monahan; Henriette O'Geen; Zhengqing Ouyang; E Christopher Partridge; Dorrelyn Patacsil; Florencia Pauli; Debasish Raha; Lucia Ramirez; Timothy E Reddy; Brian Reed; Minyi Shi; Teri Slifer; Jing Wang; Linfeng Wu; Xinqiong Yang; Kevin Y Yip; Gili Zilberman-Schapira; Serafim Batzoglou; Arend Sidow; Peggy J Farnham; Richard M Myers; Sherman M Weissman; Michael Snyder
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

10.  Reversal of endocrine resistance in breast cancer: interrelationships among 14-3-3ζ, FOXM1, and a gene signature associated with mitosis.

Authors:  Anna Bergamaschi; Barbara L Christensen; Benita S Katzenellenbogen
Journal:  Breast Cancer Res       Date:  2011-06-29       Impact factor: 6.466

View more
  100 in total

Review 1.  Create, activate, destroy, repeat: Cdk1 controls proliferation by limiting transcription factor activity.

Authors:  Jennifer A Benanti
Journal:  Curr Genet       Date:  2015-11-21       Impact factor: 3.886

Review 2.  Enigmatic MELK: The controversy surrounding its complex role in cancer.

Authors:  Ian M McDonald; Lee M Graves
Journal:  J Biol Chem       Date:  2020-04-29       Impact factor: 5.157

Review 3.  Preparation for DNA replication: the key to a successful S phase.

Authors:  Juanita C Limas; Jeanette Gowen Cook
Journal:  FEBS Lett       Date:  2019-10-15       Impact factor: 4.124

4.  Temporal separation of replication and transcription during S-phase progression.

Authors:  Matthieu Meryet-Figuiere; Babak Alaei-Mahabadi; Mohamad Moustafa Ali; Sanhita Mitra; Santhilal Subhash; Gaurav Kumar Pandey; Erik Larsson; Chandrasekhar Kanduri
Journal:  Cell Cycle       Date:  2014       Impact factor: 4.534

5.  Studying Cell Cycle-regulated Gene Expression by Two Complementary Cell Synchronization Protocols.

Authors:  Aintzane Apraiz; Jone Mitxelena; Ana Zubiaga
Journal:  J Vis Exp       Date:  2017-06-06       Impact factor: 1.355

6.  Silencing of CDK2, but not CDK1, separates mitogenic from anti-apoptotic signaling, sensitizing p53 defective cells for synthetic lethality.

Authors:  Tatyana S Nekova; Susanne Kneitz; Hermann Einsele; Ralf Bargou; Gernot Stuhler
Journal:  Cell Cycle       Date:  2016-11-10       Impact factor: 4.534

7.  Cell Cycle-Regulated Transcription of CENP-A by the MBF Complex Ensures Optimal Level of CENP-A for Centromere Formation.

Authors:  David Aristizabal-Corrales; Jinpu Yang; Fei Li
Journal:  Genetics       Date:  2019-01-11       Impact factor: 4.562

8.  Understanding the unimodal distributions of cancer occurrence rates: it takes two factors for a cancer to occur.

Authors:  Shuang Qiu; Zheng An; Renbo Tan; Ping-An He; Jingjing Jing; Hongxia Li; Shuang Wu; Ying Xu
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

9.  The forkhead box M1 (FOXM1) expression and antitumor effect of FOXM1 inhibition in malignant rhabdoid tumor.

Authors:  Yuichi Shibui; Kenichi Kohashi; Akihiko Tamaki; Izumi Kinoshita; Yuichi Yamada; Hidetaka Yamamoto; Tomoaki Taguchi; Yoshinao Oda
Journal:  J Cancer Res Clin Oncol       Date:  2020-11-21       Impact factor: 4.553

10.  Developmental Heterogeneity of Microglia and Brain Myeloid Cells Revealed by Deep Single-Cell RNA Sequencing.

Authors:  Qingyun Li; Zuolin Cheng; Lu Zhou; Spyros Darmanis; Norma F Neff; Jennifer Okamoto; Gunsagar Gulati; Mariko L Bennett; Lu O Sun; Laura E Clarke; Julia Marschallinger; Guoqiang Yu; Stephen R Quake; Tony Wyss-Coray; Ben A Barres
Journal:  Neuron       Date:  2018-12-31       Impact factor: 17.173

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.