Pioneer transcription factors establish new cell-fate competence by triggering chromatin remodeling. However, many features of pioneer action, such as their kinetics and stability, remain poorly defined. Here, we show that Pax7, by opening a unique repertoire of enhancers, is necessary and sufficient for specification of one pituitary lineage. Pax7 binds its targeted enhancers rapidly, but chromatin remodeling and gene activation are slower. Enhancers opened by Pax7 show a loss of DNA methylation and acquire stable epigenetic memory, as evidenced by binding of nonpioneer factors after Pax7 withdrawal. This work shows that transient Pax7 expression is sufficient for stable specification of cell identity.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
We are grateful to our colleagues N. Francis for critical comments on the manuscript; L. Budry and S. Nemec for profiling data and Pitx1 ChIP–seq, respectively; O. Neyret for NGS analyses; E. Massicotte for FACS sorting; A. Blanchet-Cohen for WGBS data analysis; and E. Joyal for expert secretarial assistance. A.M. was supported by an IRCM Challenge fellowship. This work was supported by grants to J.D. from the Canadian Institutes of Health Research.
Integrated supplementary information
Supplementary Figure 1 High degree of correlation between RNA-seq and ATAC–seq replicates of FACS-enriched pituitary cells
(a) Dispersion plots showing correlation between RNA-seq values for all expressed genes (FPKM >1 in at least one sample) of two independent primary cell replicates of pituitary corticotropes and melanotropes. (b) Density plots showing correlation between two independent primary cell replicates of ATAC-seq′s performed in corticotropes and melanotropes. Only 200-bp bins with more than 50 reads in one sample were used for calculating the Pearson coefficient. Bins with more than 200 reads are shown on the plot as 201 reads. (c) Bar graphs showing the proportion of ATAC-seq peaks with P values (derived from MACS) between 10−5 and 10−20 (grey) and less than 10−20 (green). (d) Cross comparisons of ATAC-seq peaks in corticotropes (blue) and melanotropes (red) between the different replicates with the indicated P-value (derived from MACS) thresholds.
(a,b) Expression levels of the 20 most highly expressed genes in corticotropes (a) and melanotropes (b). (c) Boxplot showing the distances between cortico- (n = 558) or melano- (n = 2,891) specific DARs and the closest differentially expressed genes. Center lines show medians; box limits indicate the 25th and 75th percentiles; whiskers extend 1.5 times the interquartile range from the 25th to 75th percentiles; outliers are represented by dots. P values are assessed by unpaired two-sided t test. (d) Boxplot showing the level of conservation (PhastCons) of lineage specific DARs compared to random control regions. Center lines show medians; box limits indicate the 25th and 75th percentiles; whiskers extend 1.5 times the interquartile range from the 25th to 75th percentiles; outliers are represented by dots. P values assessed by unpaired two-sided t test. (e, f) Genome browser views of lineage-specific genes for two melanotrope (e) and two corticotrope markers (f).
(a, b) Motif enrichments obtained using HOMER on corticotrope (a) and melanotrope (b) DARs within a 200-bp window around DARs summit. The top three de novo and known motifs are shown. The bZIP motif identified in the melano-specific DARs (b) appears unique to this subset. However, this bZIP motif is not unique to melanotrope DARs since it is also a major motif extracted from DARs shared between the two POMC lineages and it is also found in gonadotrope DARs (not shown). (c) Distribution of changes (P <0.05) in cortico- and melano-specific gene expression in Pax7-/- neuro-intermediate lobe3.
(a) Scheme for clustering subsets of Pax7 binding sites identified (P < 10−5, derived using MACS) by ChIP-seq (n = 89,206). Pax7 peaks were associated with the presence of H3K4me1 and p300 peaks (P < 10−5) before or after Pax7 expression (summit of H3K4me1 peak +/− 1 kb from Pax7 summit, +/− 500 bp for p300 peaks). The subsets with H3K4me1 before and after Pax7 were defined as Constitutively Active (p300 present before and after Pax7) and Pax7 Activated putative enhancers (p300 present only after Pax7). The subset that gained H3K4me1 after Pax7 binding was deemed to include putative Pioneer sites, and was further subdivided into Primed or Activated pioneer subsets depending on the accompanying gain of p300. Finally, pioneer targets being mostly at intergenic and intronic regions, we extracted for all subsets the intergenic and intronic peaks to define putative enhancers for further analyses. The number of peaks in each subset is indicated. (b) P value (derived from MACS) distribution of the four subsets (n indicated above in a) of Pax7 targets used for analyses. (c) Average DNAse hypersensitivity (GEO SRX034837) profiles for four subsets of peaks defined in a. (d) Average H3K4me1 profiles before (blue) or after (red) Pax7 at the four subsets described in Fig. 3. (e) Heatmaps of Tpit and Stat3 ChIP-seq signals at the four subsets of Pax7 peaks described in Fig. 3. (f) GR binding changes, measured by ChIP-seq (P < 10−5, derived from MACS), before or after Pax7 action at the indicated subsets (n indicated above in a) of Pax7 peaks. (g) Genome browser views of the four loci used for qPCR validation in h. H3K4me1 ChIP-seq data are shown in AtT-20 cells with/without Pax7 and replicate 1 of ATAC-seq data from corticotropes (C) and melanotropes (M). (h) ChIP-qPCR for H3K4me1 at two Pax7 pioneer targets in the Kif21b and Pcsk2 loci, one putative enhancer of the GR locus open in both lineages (C and M), and another GR locus putative enhancer specifically opened in corticotropes (C) as indicated. The data shown are means +/- s.e.m. of tissue triplicates assessed by duplicate qPCR measurements. The negative control sites (Neg1 and Neg2) do not show any H3K4me1 enrichment.
(a) De novo motif enrichments identified using HOMER at the indicated subsets (n indicated in Supplementary Fig. 4a) of Pax7 targets. The top three identified motifs are shown. (b) De novo motif enrichments identified using HOMER at the indicated subsets of Pax7 targets using the indicated subsets as background to identify sequences associated with specific Pax7 subsets. The top three identified motifs are shown.
(a) Heatmaps of indicated ChIP-seq data around 24,061 RefSeq TSS in AtT-20 cells before or after Pax7 ranked according to expression levels derived from RNA-seq data shown at right. (b) Genome-wide Pearson correlations of ChIP-seq signals of all chromatin marks used in this study. Genomic windows of 500 bp with ≥ 20 reads were used to calculate correlations; windows with more than 500 reads were downscaled to 501 reads. The repressive histone marks (H3K9me3, H3K9me2 and H3K27me3) cluster together, while the active mark H3K27ac correlates with both H3K4me1 and H3K4me3. In all cases, the strongest correlation is obtained when comparing each dataset with or without Pax7 (>0.89).
(a) Western blot showing Pax7 protein expression in Tam-induced ER-Pax7 AtT-20 cells compared to stable Pax7-expressing AtT-20 cells. A representative of two independent experiments is shown. (b) Gene induction curves of ER-Pax7 gene targets assessed by RNA-seq in cells treated with Tam for 12 h, 24 h and 96 h in comparison with non-treated cells. Gene targets were separated into two induction dynamics matching those observed in Fig. 5e. 100% corresponds to the maximum level of gene induction for each gene, and 0% representing the lowest level of the four time points measured. All target genes were induced at least 2-fold in stable Pax7-expressing and in ER-Pax7-expressing cells. Late targets show lower than 40% gene induction at 24 h, while early targets show more than 80% induction at 24 h.