Main

The classical cell cycle regulatory pRb/E2f pathway has emerged as an important effector of fate decisions in a number of cell types, including in the brain. Cell cycle dynamics strongly influence neural precursor cell (NPC) maintenance and neurogenesis,1, 2, 3, 4 and gain- or loss-of-function studies have demonstrated key roles for cell cycle proteins, including the E2f family, in NPC fate decisions.3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 E2f3 is required for proper cortical migration of neurons and to maintain the balance between NPC self-renewal, proliferation and differentiation, and its loss disrupts long-term neurogenesis and cortical function; E2f1 deficiency impairs NPC proliferation, and E2f4 deficiency leads to inhibition of NPC self-renewal and severe defects in telencephalic development.6, 8, 9, 10, 17

A pivotal question is whether cell fate control by the pRb/E2f pathway is largely a consequence of cell cycle regulation, or due to direct regulation of cell fate-associated genes. We recently found that loss of E2f3a and E2f3b leads to opposing defects in NPC maintenance and differentiation.8 The fact that this occurred without affecting cell cycle dynamics strongly suggests that fate control by E2fs is not secondary to cell cycle regulation. In addition, a number of key cell fate genes and pathways have been identified as E2f-regulated targets driving E2f-dependent fate decisions in NPCs. These include the neurogenesis and migration genes Dlx1/Dxl2 and Neo1 (Neogenin),18, 19 the growth factor fibroblast growth factor 2 (Fgf2),17 the pluripotency and self-renewal factor Sox2,8 and the Notch/Hes11 and Sonic Hedgehog pathways.10 Together, these findings demonstrate a direct role for pRb/E2f at cell fate-associated genes, but the extent of this interaction is unknown. Many of these studies focused on single pRb or E2f factor knock-out models, and because E2fs exhibit extensive redundancy in their biological functions and genomic binding patterns,20, 21 it is likely that more E2f target genes relevant to cell fate exist than those that have been reported. Understanding the full regulatory potential of the cell cycle machinery in the brain therefore necessitates an appreciation of the repertoire of E2f target genes in NPCs. Genome-wide targets have been reported for E2f4 (and E2f1) in a number of immortalized and transformed cell lines,20, 22, 23 and for E2f3 in immortal mouse myoblasts, myotubes and embryonic fibroblasts.24, 25 E2f targets have, however, not been identified on a genome-wide level in any neural cells, nor in any primary cell types.

Here we used an unbiased approach to show that E2f3a, E2f3b and E2f4 target a large and diverse network of cell fate regulatory genes, with substantial overlap in their binding sites. We further show that E2f3 isoforms demonstrate extensive tissue specificity in the genes that they target, positioning them as potential regulators of specialized, cell type-specific differentiation programs. Our results highlight the cell cycle machinery as a pervasive regulator of cell fate decisions, and provide a comprehensive tool to fuel further discovery.

Results

E2f3&4 promoter occupancy in neural precursor cells

We determined the genomic binding sites of E2f3 and E2f4 in NPCs derived from telencephalic tissue at embryonic day 14.5 (E14.5), a peak stage of cortical neurogenesis. As E2f binding sites are typically found within close proximity of a transcriptional start site (TSS)8, 17, 20, 24, 26 we focused our analysis on promoter regions by coupling chromatin immunoprecipitation (ChIP) using antibodies towards E2f3 and E2f4 (Figure 1a and b) with proximal promoter DNA microarrays (details in Supplementary Text S1). We have validated the specificity of these antibodies previously8 and here (Supplementary Figure S1 and S2). We first identified E2f3-binding sites in wild-type (WT) cells using an antibody that binds a C-terminal epitope common to both E2f3a&b (Figure 1c). Analysis of these sites based on the gene whose TSS is closest to each peak revealed 3444 peaks enriched for E2f3, corresponding to 3271 unique target genes (Figure1d,Supplementary Table S1). Importantly, E2f3 peak sequences were enriched compared with control promoter regions for the E2f consensus motif, with the highest ranked peaks exhibiting the greatest motif enrichment (discussed below), underscoring the validity of our approach. The data were similar for E2f4, with 3815 peaks and 3632 target genes (Figure1d,Supplementary Table S1). In total, we identified 5459 unique E2f-bound peaks in NPCs, and found that 28% of all proximal promoter regions are bound by at least one E2f3/4 factor. Furthermore, while we surveyed genomic regions encompassing −5 to +3 kb relative to all known TSSs (Supplementary Figure S3), 88–90% of enriched peaks fell within 600 bp upstream and 600 bp downstream of a TSS (Figure1e). Interestingly, while a large fraction of E2f3 and E2f4 binding sites overlap, approximately one-third of the binding sites for each protein are unique (Figure1d,Supplementary Table S2), suggesting the existence of factor-specific target genes.

Figure 1
figure 1

Specificity and redundancy among E2f3 and E2f4 genomic binding sites in neural precursor cells. (a) Quantitative ChIP (qChIP) analysis of E2f4 binding at the promoters of previously identified E2f targets (thymidine kinase (TK1) and p107) and a negative control gene (Chrna1) in WT neurospheres. Normal rabbit IgG is used as a non-specific antibody control. E2f4 is significantly enriched over rabbit IgG at positive control genes only. *P<0.05, **P<0.01. Graphed are the averages of four to six biological replicates, ±S.E.M. (b) qChIP analysis to assess E2f3 binding at TK1, p107 and Chrna1 promoters only, with similar enrichment in WT and knock-out cells. (*P<0.05, **P<0.01). Graphed are the averages of three to four biological replicates, ±S.E.M. (c) Schematic diagram of E2f3a and E2f3b protein structure. Both isoforms share common structural domains: DNA, DNA-binding domain; MB, marked box domain; NLS, nuclear localization sequence; TA, transcriptional activation domain, The region recognized by the α-E2f3 antibody is denoted by '*'. (d) Venn diagram demonstrating the degree of overlap between E2f3 and E2f4 peaks discovered by ChIP–chip. The number of peaks in each category is shown, with the percent of all E2f3 or E2f4 peaks in that category in parentheses. (e) Distribution of E2f3a-, E2f3b- and E2f4-bound sites relative to the nearest TSS. The three binding data sets were categorized into 200 bp bins, and the percent of bound sites in each category (indicated in upper right corner) within each bin range is plotted. (f) Venn diagram demonstrating the overlap between E2f3 peaks identified by ChIP–chip in WT, E2f3a−/− and E2f3b−/− neurospheres. Peaks identified as specific or common to E2f3a and E2f3b are indicated. (g) Analysis of the percent of E2f3 peaks in the indicated categories that overlap with E2f4 peaks in NPCs

Based on their binding profiles in muscle cells,24 we hypothesized that E2f3a&b may bind unique target genes in NPCs. Thus, we performed ChIP–chip for E2f3 on chromatin from E2f3a−/− and E2f3b−/− NPCs. This approach allowed us to identify genomic regions bound by E2f3a alone (present in both WT and E2f3b−/−, absent in E2f3a−/− cells), E2f3b alone (present in WT and E2f3a−/−, absent in E2f3b−/− cells) and by both E2f3a and E2f3b (present in all three conditions) with the use of a single antibody (Figure1f). Importantly, we have previously demonstrated that loss of either E2f3 isoform in NPCs does not result in altered expression or a perceivable change in DNA-binding capacity of the other isoform, nor of other E2f factors.8 We identified 3144 enriched peaks (2979 genes) in E2f3a-deficient cells, indicating that E2f3b is present at the majority of E2f3-bound sites in NPCs. In contrast, we found only 1595 peaks (1518 genes) in E2f3b−/− cells, suggesting that E2f3a is present at a smaller subset of E2f3-bound sites (Figure1f, Supplementary Table S3). The 779 peaks enriched for E2f3 in WT cells only (Figure1f, Supplementary Table S3) probably represent a class of loci that require a maximum amount of E2f3 protein for optimal ChIP detection. This claim is supported by the fact that 78% of these E2f3 WT ‘unique’ sites overlap with peaks in E2f3a−/− and E2f3b−/− cells for which an E2f3-binding signal was obtained but did not meet our FDR requirement (Supplementary Figure S4). At E2f-bound promoters, while E2f3a is enriched at only a subset of E2f3b-bound sites, E2f4 peaks overlap significantly with both E2f3a and E2f3b peaks (Supplementary Figure 1G). Thus, at least two E2f family members can bind the majority of our E2f-bound sites.

E2fs bind a network of genes involved in cell fate decisions

We performed gene ontology (GO) analysis to identify the biological processes with which E2f target genes are associated in NPCs. We found a high enrichment for both E2f3 and E2f4 targets in well-characterized E2f-regulated functions, including the DNA damage response, cell cycle regulation, chromatin organization, gene expression and cell death23, 24, 26, 27, 28, 29, 30, 31 (Figure 2a, Table 1). Previously established E2f target genes identified in NPCs, most common to both E2f3&4, include Pcna, Ccne, Top2a, Cdc2a, Cdc25a, Mcm4, Rad51, Brca1/2, Ezh2 and pRb and E2f family members24, 26, 32, 33, 34 (complete lists in Supplementary Tables S1 and S2). In agreement with pRb/E2f family loss-of-function studies, we also observed a strong enrichment in processes related to differentiation and development, including those specific to the nervous system (Figure 2a, Table 1). More surprisingly, however, identification of the target genes themselves (Figure 3a for example, Supplementary Tables S1 and S2) revealed that E2fs are bound to the promoters of hundreds of genes that control cell fate decisions (Table 1). Furthermore, these targets are functionally diverse, including genes associated with key growth factor and developmental signaling pathways, chromatin modifiers and transcription factors that together regulate stem cell identity, self-renewal and differentiation.

Figure 2
figure 2

E2f3 and E2f4 bind an overlapping set of gene promoters associated with fundamental NPC fate decisions. (a) Gene Ontology (GO) analysis of E2f3 and E2f4 target genes, expressed as the percentage of all target genes in each group (groups indicated to the right of the graph) with a particular GO annotation. Target genes were identified genes whose TSS is closest to each E2f peak. (b) qChIP analysis validates E2f3 and E2f4 enrichment at binding sites identified by ChIP–chip corresponding to a panel of genes involved in neural precursor cell fate decisions. IgG is used as a negative control. Both E2f3 and E2f4 are enriched at all analyzed sites, except for that corresponding to the gene Hes1, which is uniquely detected by anti-E2F3. *P<0.05, **P<0.01. The average of at least three biological replicates is shown, ±S.E.M. (c) Genomic binding profiles for E2f3a (E2f3 ChIP–chip in E2f3b−/− neurospheres), E2f3b (E2f3 ChIP–chip in E2f3a−/− neurospheres) and E2f4 (ChIP–chip in wild-type cells), at selected loci. At each locus shown, all three E2f family members were significantly enriched (E2f identity indicated to the left). The level of phylogenetic conservation across 17 species of mammals is shown along the bottom of each image (‘Mammal Cons’). The majority of E2f3 peaks are located within highly conserved regions, and most promoters are bound by multiple E2fs with highly overlapping peaks. Peaks are generally found overlapping or directly proximal to the TSS. The black arrows specify the location and direction of transcription initiation

Table 1 Quantification of select biological processes enriched among genes targeted by E2f3 and E2f4
Figure 3
figure 3

Genes involved in nervous system development and neurogenesis are targeted by both E2f3 isoforms. (a) The number of E2f3 target genes with the GO classifications of ‘nervous system development’ or ‘neurogenesis’ was determined, and the percentage of these genes that belong to specific groups of E2f3a and E2f3b targets are shown. Examples of genes belonging to each category of E2f3 targets are listed. (b) E2f3 isoforms control common cellular functions. GO analysis of E2f3a and E2f3b common and specific target genes, expressed as the percentage of all target genes in each group with a particular GO annotation (GO annotations indicated at the bottom of the graph). Target genes were identified genes whose TSS is closest to each E2f peak

We observed that E2f3&4 share a number of NPC cell fate genes as targets. Examples include genes or members of pathways that have previously been described as targets of the pRb-E2f family in the brain, such as Sox28, 35 and members of the Notch/Hes (Notch1, Hes5, Jag1, Rbpj and Numb)11, 14 and Fgf pathways (Fgfr2 and Fgfr3).17 Additional genes include components of the Wnt (Wnt6, Fzd5, Dll4, Gsk3b, Apc and Tcf3) and Tgf-β signaling pathways (Tgfbr1, Bmp7, Smad4&7 and Acvr2b), the transcription factors Ascl1, Pax6 and Lhx2, the chromatin regulators Dnmt1, Mbd1 and members of the Polycomb/Trithorax families (Ezh2, Eed, Epc1 and Mll1). We confirmed binding of E2f3 and E2f4 to the promoters of a group of target genes that are key cell fate regulators in NPCs, using quantitative ChIP (Figure 2b). We note that the majority of these genes show enrichment for both E2f4 and at least one E2f3 isoform, typically with tightly overlapping peaks, and that these binding sites largely correspond with evolutionarily conserved regions (Figure 2c).

Despite this overlap, quantitative GO analysis surprisingly revealed that the functional categories related to development and differentiation are more highly enriched among E2f3 than E2f4 targets (Figure 2a, Table 1). In fact, E2f3 binds 12-fold more differentiation-related genes (405 versus 34) and twice as many nervous system development and neurogenesis-related genes than does E2f4 (Table 1). In addition, migration-related genes, such as those involved in actin dynamics and axon guidance, are also more heavily targeted by E2f3 (Table 1Supplementary Table S4). Cell fate-related genes unique to E2f3 primarily include additional transcription factors, chromatin regulators and family members of the core pathways represented among E2f3/E2f4 common targets (such as Mecp2, Atrx, Wnt5a, Vegf, Hes1, Egfr and Fgfr1).

Genes targeted by E2f3a and E2f3b control common cellular processes in NPCs

GO analysis demonstrated that E2f3a and E2f3b target genes are enriched for similar functional categories in NPCs whether they are common or unique to one isoform (Figure 3b, Supplementary Table S5). E2f3a-specific targets are somewhat less enriched for certain functions, including ‘cell cycle’ and ‘embryo development’, but overall our analysis reveals that functions as wide-ranging as the DNA damage response, gene expression and neurogenesis are relatively equally enriched among E2f3a- and E2f3b-specific and common target genes.

To focus on the genes that are likely directly related to NPC fate decisions, we grouped E2f3 target genes belonging to the ‘neurogenesis’ and ‘nervous system development’ categories. About 70% of this group of genes is targeted either exclusively by E2f3b (39.2%) or by both E2f3a and E2f3b (31.2%), while a minority is bound specifically by E2f3a (3.5%) or by E2f3 in WT cells only (25.3%) (Figure 3a). Notably, among the genes identified as specific to E2f3b are members of the Fgf signaling pathway, which we have previously shown is regulated by E2f3.17 On the other hand, as was the case for E2f3&4 common targets, we found that many genes with established roles as regulators of NPC fate decisions are shared between E2f3a&b (see Figure 3a for examples of E2f3 isoform specific and common genes). These data, combined with those for E2f4 (Figures 1d and g), further demonstrates that the majority of E2f targets in NPCs are bound and likely regulated by multiple E2f proteins.

E2f3 and Ctcf are co-enriched at the promoters of neural specific target genes

We analyzed our E2f3-bound regions for the occurrence of binding sites of other transcription factors, to uncover combinatorial regulation partners of E2f3. As expected, we found a strong over-representation of sequence motifs corresponding to E2F factors, as well as other cell cycle-related factors, such as NF-Y and Sp1 (data not shown).36 Less expectedly, we uncovered a specific enrichment of CCCTC binding factor (Ctcf) binding motifs across E2f3 peaks (Figure 4a). Exploiting data sets of genomic Ctcf sites identified by ChIP-Seq in mouse neuronal tissues,29 we confirmed a remarkable overlap between binding sites of E2f3 and Ctcf in mouse brain at E14.5. E14.5 mouse brain is a tissue undergoing active neurogenesis and NPC expansion, and is therefore amenable to comparison with our NPCs.37 We found that 45% of E2f3 target promoters are indeed jointly bound by Ctcf, a significant enrichment (1.46-fold) compared with the overlap expected based on all promoters surveyed by our microarrays (Figure 4b, Supplementary Table S6). Furthermore, the enrichment of either E2f or Ctcf sequence motifs was strongest at peaks where both factors are bound (Table 2). Distinguishing between E2f3- and E2f4-specific targets revealed that the association with Ctcf is strongest for E2f3 (Figure 4b).

Figure 4
figure 4

E2f3 and Ctcf are co-enriched at nervous system-related genes. (a) Enrichment of E2f and Ctcf position weight matrices (PWM) within E2f3 peaks discovered in NPCs, as compared with the frequency of their occurrence in randomly selected surveyed loci. The genomic loci enriched for E2f3 were ranked in decreasing order of ChIP–chip enrichment and were binned into groups of 100 target loci. (b) The percentage of E2f3- and/ or E2f4-bound regions (E2f-bound category indicated along the x-axis) in NPCs at which Ctcf is also bound (in E14.5 brain) was calculated. Numerical values above each data set indicates the fold change of the observed over expected percentage overlap. The expected overlap (30.8%) is denoted by a dashed line, and represents the number of all surveyed promoters at which Ctcf is bound. (c) The percentage of promoter regions at which E2f3 is bound in NPCs that also contained Ctcf in different brain tissues (indicated along the x-axis) was quantified and is expressed as ‘observed overlap’. The ‘expected overlap’ is the percentage of all known promoter regions surveyed in our ChIP–chip experiments that contain a Ctcf peak. Numerical values above each data set indicates the fold change of the observed over expected percentage overlap. (d) The percentage of E2f3-bound promoters at which Ctcf is also bound was calculated separately for neural cells and myoblasts. We show, as the ‘expected overlap’, the percentage of E2f3-bound promoters containing Ctcf for (1) all E2f3-bound genes in the specified cell type, (2) E2f3-bound genes specific to that cell type versus the other, and 3) E2f3-bound genes common to both cell types. For neural cells, E2f3-binding sites were sourced from our data, and the Ctcf data were from E14.5 brain tissue;29 for myoblasts E2f3 data were from ref. 24 and the Ctcf data were reported by the Wold lab (ENCODE project). The ‘expected overlap’ represents the percentage of gene promoters surveyed in each cell type that contain Ctcf, for either all genes surveyed (the first data set for each cell type) or genes surveyed in both cell types (the last two data sets for each group). (e) Comparative GO analysis of E2f3-bound target genes that are bound by Ctcf versus those not bound by Ctcf in E14.5 brain. The percentage of E2f3 targets in each group with specific GO annotations (indicated on y-axis) was determined by GREAT. P-values are indicated (*P<1.0E−4, **P<1.0E−10 and ***P<1.0E−20). (f) The percent of surveyed genes whose promoter region is bound by Ctcf in E14.5 brain that belong to various GO categories (indicated along x-axis) were calculated separately for genes that are also bound or not bound by E2f3 in NPCs. (g) Distribution of phylogenetic conservation scores of E2f3- and/or Ctcf-bound gene promoters, compared with all surveyed loci. Box plots demonstrating increased phylogenetic conservation of promoters bound by both E2f3 and Ctcf compared with those bound by a single factor. Asterisks, P<0.05 by Wilcoxon’s rank-sum test. (h) Number of enhancer–promoter units (EPUs) for E2f3- and/or Ctcf-bound promoters. The average number of enhancers, as determined by ref. 29 in embryonic brain, predicted to engage in enhancer–promoter units is represented for various categories of promoters identified in our experiments

Table 2 Enrichment of TF motifs across promoters bound by E2f3, Ctcf or both E2f3 and Ctcf

Surveying our E2f3-bound promoters with Ctcf peaks from various brain tissues revealed that the E2f3–Ctcf association is strongest in tissues undergoing neurogenesis (E14.5 brain and olfactory bulb), and is substantially lower in the less neurogenic adult cortex and cerebellum (Figure 4c). In sharp contrast to NPCs, the E2f3–Ctcf association is not enriched in myoblasts (MBs) and is in fact slightly depleted at MB-specific E2f3 targets (Figure 4d). These observations suggest that genes bound by both E2f3 and Ctcf are more likely to be involved in neurodevelopmental processes. GO analysis confirmed that while E2f3 targets are equally represented for ‘cell cycle’, ‘chromatin organization’ and ‘gene expression’ whether or not Ctcf is also bound, targets at which both E2f3 and Ctcf are bound are uniquely enriched for functions and signaling pathways that impact cell fate-related processes, many with important roles during neurogenesis (Figure 4e). In addition, genes targeted by E2f3 have a stronger tendency to be targets of Ctcf than those not targeted by E2f3, and this association is most predominant for genes involved in functions related to brain development or neuron generation (Figure 4f). Finally, we observed that E2f3-bound promoters that are also bound by Ctcf are characterized by two additional properties associated with developmentally important genes: they are more highly phylogenetically conserved (Figure 4g) and are predicted to associate with a larger number of distal enhancers (Figure 4h, Supplementray Table S7). Together, these data suggest that a strong overlap between E2f3 and Ctcf may be a feature specific to neural tissue and may at least partially account for E2f3-dependent target gene binding in NPCs.

E2f3 is enriched at transcriptionally active genes in NPCs and exhibits tissue-specific binding

Generally, E2f3a is considered a transcriptional activator, while E2f3b and E2f4 are thought to function predominantly as repressors.38, 39, 40 We evaluated the relative mRNA levels of genes in E14.5 brain tissue whose promoters are bound by different combinations of E2f3a, E2f3b and E2f4,37 and surprisingly found that all categories of E2f-bound genes exhibit higher expression levels than average, compared with all genes surveyed on our microarrays. This observation suggests a transcriptional activation role for all three E2fs in NPCs (Figure 5a), which is corroborated by the strong association we observed between bound E2f3 and the presence of chromatin marks that correlate with gene activation (Supplementary Figure S5). Of note, we found that E2f3b-specific genes collectively showed significantly lower expression compared with both E2f3a&b-common and E2f3a-specific targets, demonstrating that the presence of E2f3a is associated with higher levels of gene expression compared with E2f3b (Figure 5a). Furthermore, genes whose promoters are bound by E2f3 in NPCs have higher-than-average expression patterns in many of the tissue types examined, but it is in E14.5 brain and other brain tissues that E2f3 binding is most associated with high gene expression (Figure 5b). Quantitative RT-PCR in E2f3b+/+ and E2f3b−/− NPCs validated that E2f3 is important for expression of cell fate genes (Figure 5c). While overall the effects of E2f3b loss on gene expression were modest and some genes were unaffected (likely due to redundant binding by other E2f factors), half of these genes showed altered expression below or close to a P-value of 0.05. In addition, the majority of these altered genes showed reduced as opposed to increased expression in the absence of E2f3b, further suggesting that E2f3b functions predominantly (although not exclusively) as a transcriptional activator. In contrast, the NPC target genes of E2f3 are expressed at levels much closer to the average in some tissues, including heart, liver and testes. By analyzing a genome-wide atlas of gene expression in mouse tissues, we identified a cluster of over 400 E2f3 targets that are highly expressed predominantly in neuronal tissues (Supplementary Figure S6,Supplementary Table S9). These observations suggest that E2f3-bound sites are differentially transcribed depending on the cell type, and/or that E2fs exhibit significant tissue specificity in their genomic binding patterns.

Figure 5
figure 5

In NPCs, E2fs bind the TSS of genes that are transcriptionally active in neurogenic tissues. (a) Box plots display ranked expression data for genes belonging to distinct classes of E2f3 and E2f4 targets (gene classes indicated along the x-axis). Asterisks, P<0.05 by Wilcoxon’s rank-sum test. In addition, all the E2f-bound categories are significantly more highly expressed than the group of surveyed genes. (b) Expression levels of E2f3-bound and ‘all surveyed’ genes as measured by RNA-seq in various tissues. To evaluate if E2f3-bound genes are differently expressed than the group of surveyed genes, the difference between the median percentile rank of these two groups is given for each tissue (Δ medians). (c) Gene deregulation in E2f3-deficient NPCs. mRNA from wild-type and E2f3b−/− neurospheres was quantified using real-time RT-PCR. Data are normalized using Gapdh as invariant control, and are reported as fold over the signal detected in wild-type samples. n=5 wild-type and 6 mutant embryos from two age-matched litters derived from the same father. The results of unpaired two-tailed t-tests are shown below each gene. Error bars indicated S.E.M.

To assess tissue-specific binding, we compared our E2f3 peaks from NPCs with those previously identified in MBs.24 In this analysis, we compared only those peaks that were discovered using the same antibody (sc-878) in each cell type and that fell within genomic regions that were surveyed in both experiments. We found a surprisingly low degree of overlap, with just 30% of MB genes also identifying as targets in NPCs (Figure 6a). Strikingly, GO analysis revealed a strong conservation of classical E2f processes between the two cell types (cell cycle, gene expression, DNA damage response, chromatin organization and cell death). In contrast, tissue-specific E2f3 targets were enriched for developmentally-related functions, and these categories are poorly represented within the group of genes targeted by E2f3 in both NPCs and MBs (Figure 6b, Supplementary Table S10). Specifically, E2f3 targets are uniquely enriched in NPCs for ‘cell differentiation’, ‘nervous system development’ and ‘neurogenesis’, while MB-specific targets are primarily enriched for ‘cell adhesion’, ‘response to wounding’ and ‘skeletal system development’. These findings demonstrate both a conservation of ‘classical’ E2F-dependent functions between cell types, which surprisingly occurs through a relatively small proportion of E2f3-binding sites, as well as extensive cell type specificity in terms of both target gene binding and cellular functions. Together, our comparative analyses reveal the surprising finding that E2f3 is positioned to control distinct differentiation and cell fate regulatory pathways in different mammalian cell types.

Figure 6
figure 6

E2f3 binds predominantly unique target genes in neural versus muscle precursors. (a) The number and overlap of peaks identified for E2f3, using a common pan-E2f3 antibody, in neural precursors (NPC) and myoblasts (MB) are shown. (b) GO analysis of NPC- and MB-common and specific genes. The percentage of genes from each category with specific functional annotations (indicated along the y-axis) is graphed. P-values are also indicated (*P<1.0E−4, **P<1.0E−10, ***P<1.0E−20). E2f3-specific peaks in NPCs are uniquely enriched in processes related to differentiation and development

Discussion

The cell cycle machinery is a pivotal regulator of brain development and function by influencing key cell fate decisions, typically via E2f transcription factor activity.5, 6, 8, 9, 17, 18, 41, 42, 43 Thus through our identification of E2f3&4-bound promoters in NPCs, we have significantly expanded our understanding of how cell cycle regulators can direct cell fate control. While this study provides a number of important insights, we offer two key conceptual findings: E2f transcription factors are poised as widespread regulators of cell fate-associated genes in NPCs, establishing a pervasive direct role for the cell cycle machinery in fate determination, and E2f3 is associated with specialized, tissue-specific differentiation programs.

We found that E2fs are poised to control diverse NPC fate choices, including self-renewal, precursor maintenance, cell death and neuron generation, by directly binding the promoters of hundreds of cell fate-associated genes. The majority of target genes we identified are bound by multiple E2f family members, all predominantly associated with gene activation in NPCs, suggesting extensive functional redundancy by E2fs across their target genes. This finding underscores the importance of identifying factor-specific targets in an unbiased manner to fully appreciate the genetic mechanisms driving biological phenomena.

Our discovery that E2f3&4 are enriched at the promoters of such a large number of cell fate genes suggests that fate control by the pRb/E2f pathway may be largely independent from cell cycle regulation. Indeed, we have reported that the reduced/enhanced neurogenesis caused by loss of E2f3a/ E2f3b occurs without any changes to the length of cell cycle phases in affected NPCs.8 Another recent study demonstrated that pRb inhibits the ability of murine fibroblasts to reprogram to a pluripotent state by inhibiting E2f transcriptional activity, and that this process is also unaccompanied by changes in cell cycle dynamics.44 Thus it appears that, at least in cells that are actively making stem cell fate decisions, pRb/E2f factors can direct these decisions independently from cell cycle regulation. Our data suggest that this is through direct regulation of networks of cell fate-associated genes.

We unexpectedly identified Ctcf as a potential regulatory co-factor for E2f3 at differentiation genes in NPCs. While Ctcf is a well-known insulator protein in mammals,45 genome-wide studies have recently demonstrated that 20–25% of Ctcf-bound sites are located within proximal promoter regions,46 and enrichment of E2f binding motifs has been identified among Ctcf-bound regions in multiple cell types.47, 48 Furthermore, the chromatin regulatory function of Ctcf has recently been established as a tumor suppressive mechanism,49 akin to pRb/E2f. These studies substantiate our discovery, and suggest the possibility that the E2f3–Ctcf connection we have uncovered may be relevant in other cell types. However, we foresee a certain degree of tissue specificity in this connection, since it seems inexistent in muscle cells. More functional analyses are required to determine if this association is unique to NPCs or is more widespread, and to clarify how Ctcf–E2f interactions impact transcription.

We also discovered that E2f3 is more strongly associated with differentiation-related genes than is E2f4. This observation is supported biologically by the fact that NPCs deficient in E2f3 but not E2f4 exhibit overt neuronal differentiation defects.8, 10 We were also surprised to observe that E2f3 binds target sites in a highly tissue-specific manner in NPCs compared with MBs. Genetic mouse models have highlighted E2f3 as an important regulator of differentiation in a number of tissues, including brain, skeletal and cardiac muscle, cartilage and adipocytes,8, 24, 50, 51, 52, 53 and our data now suggest that regulation of E2f3-specific target genes is likely to contribute to distinct differentiation programs in these and potentially other cell types. To our knowledge, ours is the first study to demonstrate such a substantial degree of tissue specificity among genomic binding sites for any E2f factor. This phenomenon does not appear to be the result of differential isoform usage by E2f3 in NPCs versus MBs, as the common antibody used in both studies pulled down binding sites of both E2f3a&b, with E2f3b-bound sites making up the majority in both cell types (our data and ref. 24). It will be important for future studies to examine E2f3a&b binding patterns in other tissue and cell types to reveal the extent by which E2f3 may function as a determinant of tissue-specific differentiation programs.

In conclusion, we have provided the first comprehensive, genome-wide view of the genes and associated cellular functions targeted by the pRb/E2f pathway in neural cells, and have thereby significantly enriched our understanding of how the cell cycle machinery impacts cell fate control and tissue-specific differentiation.

Materials and Methods

Mouse models and cell culture

Germline E2f3a- and E2f3b-deficient mice were generated originally by GL and were maintained on an FVB/N background.8, 54 All cells used in this study were cultured as neurospheres. Neural precursors were obtained by dissection of the ganglionic eminence from developing embryos at gestational age E14.5, and cells were cultured as neurospheres as previously described.11 E2f3a−/− and E2f3b−/− NPCs were obtained from crosses of either two heterozygous or two homozygous (knock-out) animals. Animal experiments were approved by the University of Ottawa’s Animal Care Committee, which abides by the guidelines of the Canadian Council on Animal Care. Mice were anesthetized using pentobarbital and killed by cervical dislocation.

Chromatin immunoprecipitation

ChIP assays were performed and quantified as previously described,8 using 2 μg of specific antibody and 20 μg of chromatin for each reaction. ChIP data from WT cells are from a combination of E2f3a+/+ and E2f3b+/+ experiments; we observed no obvious discrepancies in levels of chromatin enrichment with any primers pair assessed in WT cells from either colony. All chromatin preparations were made from animals between the ages of 2- and 4-months old. Statistical analysis of ChIP data was performed using an unpaired two-tailed t-test, with differences considered significant with a *P<0.05, **P<0.01 and ***P<0.001.

ChIP antibodies and primers

Identical E2f and IgG antibodies were used for ChIP–chip and conventional ChIP experiments. These include rabbit-specific antibodies against E2f3 (sc-878) and E2f4 (sc-866), and normal rabbit IgG as a control antibody for ChIP (all from Santa-Cruz, Dallas, TX, USA). The primer pairs used to amplify enriched chromatin at gene promoters are given in Supplementary Table S8.

Quantitative RT-PCR

Total RNA was isolated from neurospheres derived from WT and E2f3b−/− embryos using RNeasy columns (Qiagen, Toronto, ON, Canada) and DNase treatment (Invitrogen, Burlington, ON, Canada). The embryos were obtained from two age-matched litters from the same father, from E2f3b-heterozygous crosses. Reverse-transcription followed by real-type PCR quantitation with the SYBR Green method were performed as previously described55 using gene-specific primers spanning introns, whenever possible. Expression was normalized to the Gapdh housekeeping gene.

Electrophoretic mobility shift assay

Electrophoretic mobility shift assay was performed as previously described.9 The material was neurospheres at stage E14.5 cultured from either littermate WT and E2f3−/− embryos, or littermate WT and E2f3a−/− embryos. The antibodies used were sc-878, to detect E2f3, and sc-193 and sc-633 to detect E2f1 and E2f2.

ChIP–chip and assignment of E2f target genes

A custom microarray manufactured by Agilent and containing ~1 million probes was used to survey the proximal promoters of 24 654 mouse transcripts. Details are provided in Supplementary Text S1. Chromatin from ~50 million cells was fragmented and immunoprecipitated with the anti-E2F3 antibody. The immunopurified and input DNA (pre-IP) samples were amplified by LM-PCR, fluorescently labeled and co-hybridized on a DNA microarray custom made by Agilent Technologies (Mississauga, ON, Canada), essentially as described previously.55 Experimental details and data analysis methods are provided in Supplementary Text S1. The data have been deposited on the NCBI Gene Expression Omnibus under the accession number GSE54941. The complete set of E2f3-bound regions is given in Supplementary Table S1. Mapping is done on the mm9 (NCBI build 37) mouse genome assembly.

Comparisons between ChIP–chip and ChIP-Seq data sets

The overlap between various groups of bound regions was determined using the UCSC genome table browser and the intersection tool, requiring an overlap of at least 1 bp. The E2f3-binding data reported by Asp et al.24 in skeletal MBs were converted to mm9 coordinates using the liftOver tool. The regions bound by Ctcf and RNA pol II in embryonic brain used in our comparisons were obtained from the NCBI GEO, under accession GSE29218.37 The regions bearing certain histone modification marks in neural precursors were taken from GSE12241.56 Ctcf-bound regions in C2C12 MBs were those reported by the Wold lab (Caltech, University in Pasadena, Pasadena, CA, USA) as part of the ENCODE project. For most analyses of gene promoters, we defined proximal promoters as the region from 2 kb upstream to 2 kb downstream of the TSS. The promoter–enhancer connections were based on the ‘enhancer–promoter units’ predicted to exist in embryonic brain.37 E2F3- or Ctcf-bound proximal promoters that overlap with the promoter component of the E14.5 brain enhancer–promoter units were identified, and the number of enhancers predicted to link to these promoters was counted.

GO analysis

The enrichment of specific ontologies among the genes bound by E2f3 was assessed using the program GREAT, with the ‘single nearest gene’ within 1 kb association rule. As ‘background regions’, we used the entire set of genomic loci that have been surveyed by our microarrays. Significance values used were the hypergeometric test with correction for multiple hypothesis testing via the Benjamini algorithm. In this case, differences are considered significant with the following P-values: *P<1.0E−4, **P<1.0E−10, ***P<1.0E−20.

DNA sequence analyses

The E2f3-bound regions were analyzed for the presence, and enrichment, of certain transcription factor DNA-binding sequences using CisGenome58 and the position weight matrices from TRANSFAC, at a cut-off of log likelihood ratio of 500. The frequency of sequence motifs in the transcription factor-bound loci was compared with that in control regions, a group of randomly selected sequenced from the set of sequences surveyed by the microarray. For the E2F motif, we used TRANSFAC matrix 738. We used the Ctcf binding matrix reported by Schmidt et al.59 for murine Ctcf. The phylogenetic conservation (phastCons scores) of sequences was analyzed using CisGenome.

Gene expression profiling

Data from RNA-Seq performed in embryonic mouse brain at E14.5 were downloaded from NCBI GEO, under accession number GSM850906.37 The fragments per kilobase of exon per million fragments mapped (FPKM) expression values of genes were assigned to individual E2f3 target sites using GREAT and the ‘single nearest gene’ association rule. To obtain a non-parametric view of gene expression from this data set, we analyzed genes based on their rank within the RNA-Seq data set: the entire list of detected genes was sorted in order of increasing fragments per kilobase of exon per million values, and genes were assigned a rank (lowest rank number meaning lowest expression level). Box plots were made using R60; they represent the median value (thick horizontal line), the first and third quartile boundaries (boxes) and 1.5 times the interquartile distance (whiskers). Statistical significance was assessed using the Wilcoxon rank-sum test.

We also determined the expression profile of E2f3 target genes discovered in our ChIP–chip analysis across a panel of over 90 mouse tissues and cell lines. Gene expression data for all E2f3 target genes in NPCs were obtained from the mouse Gene Atlas V3,61 and was analyzed and normalized as previously described.55 K-means clustering was performed on the gene expression data using the Cluster 3 program,62 and was visualized using Java Treeview63 to identify tissues and cell types that demonstrate similar gene expression profiles.