The shift from maternal to embryonic control is a critical developmental milestone in preimplantation development. Widespread transcriptomic and epigenetic remodeling facilitate this transition from terminally differentiated gametes to totipotent blastomeres, but the identity of transcription factors (TF) and genomic elements regulating embryonic genome activation (EGA) are poorly defined. The timing of EGA is species-specific, e.g., the timing of murine and human EGA differ significantly. To deepen our understanding of mammalian EGA, here we profile changes in open chromatin during bovine preimplantation development. Before EGA, open chromatin is enriched for maternal TF binding, similar to that observed in humans and mice. During EGA, homeobox factor binding becomes more prevalent and requires embryonic transcription. A cross-species comparison of open chromatin during preimplantation development reveals strong similarity in the regulatory circuitry underlying bovine and human EGA compared to mouse. Moreover, TFs associated with murine EGA are not enriched in cattle or humans, indicating that cattle may be a more informative model for human preimplantation development than mice.
Preimplantation development encompasses several critical milestones as embryos progress from fertilization to implantation. Fusion of the transcriptionally quiescent oocyte and sperm results in a zygote with two haploid pronuclei. Gametic syngamy is completed after the first round of DNA replication, when pronuclear membranes dissolve, allowing maternal and paternal chromosomes to intermingle on the metaphase plate. Subsequent rounds of cleavage ultimately lead to formation of a blastocyst. However, the cleavage-stage embryo must first complete the maternal-to-zygotic transition (MZT), wherein the embryo assumes control over its own continued development by degrading oocyte-derived products and initiating its own transcriptional program. This dramatic change in gene expression proceeds gradually; minor embryonic genome activation (EGA) results in low levels of transcription in early cleavage-stage embryos1,2, and leads to major EGA, which involves wide-spread transcription of embryonic genes3. This shift from maternal dependence to self-sufficiency serves at least three functions: elimination of oocyte-specific messages, replenishment of transcripts that are common to both the oocyte and the embryo, and generation of novel embryonic-specific transcripts4.
While embryos undergo this massive shift in transcriptional program during the MZT, wide-spread changes in chromatin structure5 and epigenetic information are simultaneously restructuring the embryonic epigenome. Chromatin remodeling is necessary to erase gamete-specific signatures and establish an open chromatin landscape that supports embryonic transcriptional programs. Specifically, chromatin structure defines the genomic context within which transcriptional machinery can operate, thereby determining the cell-specific gene expression patterns that confer cell identity and function.
Following fertilization, the zygotic genome is globally demethylated6, and this loss of DNA methylation coincides with global decreases in repressive histone modifications5,7. Overall, epigenetic factors linked to relaxed chromatin are more abundant in mouse zygotes, whereas factors implicated in chromatin compaction become more prevalent during EGA8, pointing to a more permissive chromatin landscape in pre-EGA embryos. Indeed, mouse zygotes present increased histone mobility9, highly dispersed chromatin10, and lack chromocenters (congregations of pericentromeric heterochromatin)11,12. Moreover, in mouse, 3-D chromatin architecture is largely absent after fertilization, and is gradually established throughout preimplantation development, which facilitates long-distance chromatin interactions in later stage embryos13, and indicates increasing chromatin compaction and organization. The co-occurrence of genome activation and chromatin remodeling raises an interesting causality dilemma, namely, whether chromatin remodeling is necessary for transcription activation or whether transcription activation leads to chromatin remodeling. Several maternal products prompt transcription initiation by altering chromatin structure14,15,16, and some chromatin compaction occurs in the absence of embryonic transcription13. However, inhibiting embryonic transcription pervasively disrupts establishment of open chromatin during EGA17,18, suggesting that EGA and chromatin remodeling are likely interdependent.
In mouse and human preimplantation embryos, enrichment for chromatin accessible sites is gradually established as development progresses17,18,19,20,21,22; however, these accessible sites demonstrate different motif enrichment patterns, implicating distinct sets of transcription factors (TFs) in either murine (RARG, NR5A2, ESRRB)17, or human EGA (OTX2, GSC, POU5F1)18,20. Although some TFs appear to regulate EGA in multiple species, i.e., Krüppel-like factors (KLFs)17,18,23, double homeobox (DUX)24,25,26,27, ZSCAN428,29, and CTCF17,18, the level of conservation across mammals remains unclear. In fact, the timing of genome activation is highly species-specific. RNA-seq studies indicate that the major wave of EGA in mice occurs during the 2-cell stage30, in humans during the 4-to-8-cell stage31, and in cattle during the 4-to-8-cell stage32. The relative timing at which mice activate their genomes could indicate that the mechanism behind murine EGA differs significantly from other species’, which would have significant implications for modeling human preimplantation development. In particular, the timing of bovine EGA more closely resembles that of human EGA, as do global changes to histone post-translational modifications. For instance, the active mark trimethylation of lysine 4 on histone 3 (H3K4me3) decreases in global abundance during human33 and bovine EGA34, but increases during murine EGA35,36. However, the chromatin remodeling events that underscore bovine preimplantation development have yet to be cataloged, and it remains unclear whether the regulation and execution of the MZT in cattle resembles what has been observed in humans.
Here we report progressive gains in ATAC-seq peaks throughout bovine preimplantation development. Peaks established prior to major EGA are predominantly enriched for binding sites of maternally provided factors (CTCF, NFY, SP factors, KLFs), whereas homeobox TF binding (OTX1, GSC, DUX factors) becomes more prevalent during major EGA, and depends on embryonic transcription. Comparison of the open chromatin landscape during preimplantation development in cattle, human, and mice reveals that the regulatory circuitry underlying bovine and human EGA is very similar and distinct from mouse. Moreover, TFs that appear to regulate murine EGA (RARG, NR5A2, ESRRB) demonstrate no enrichment in cattle or humans, suggesting that cattle are a more informative model for human preimplantation development than mice.
Dynamics of open chromatin in bovine preimplantation embryos
For each developmental stage, ATAC-seq libraries were prepared from pools of cells derived from three separate oocyte collections. A subset of embryos from each collection was also cultured in the presence of the transcriptional inhibitor, α-amanitin, to interrogate the relationship between embryonic transcription and chromatin remodeling (Fig. 1a). Between 30 and 87 million nonmitochondrial monoclonal uniquely mapped reads were collectively obtained for each developmental stage and cell type: germinal vesicle-stage (GV) oocytes, 2-, 4-, 8-cell embryos, morula, inner cell masses (ICM), and embryonic stem cells (ESC)37 (Supplementary Table 1). Of note, because pools of embryos were used as input for ATAC-seq, the resulting chromatin accessibility profiles are representative of cell populations with some level of heterogeneity, especially for later stages of development. Genome-wide ATAC-seq coverage demonstrated a striking shift in the open chromatin landscape between 4- and 8-cell embryos (Fig. 1b), suggesting that large-scale chromatin remodeling coincides with major EGA (Supplementary Fig. 1). Strong correlation of genome-wide signal between biological replicates (Supplementary Table 2) indicated that both the technique and embryo production were robust, generating comparable chromatin accessibility profiles across different rounds of oocyte collection and embryo production (Fig. 1c). Reads from replicates were pooled together to obtain greater sequencing depth and maximize power for identifying regions of open chromatin. To gauge changes in chromatin accessibility throughout development, regions with enriched signal, or peaks, were called for each stage of development. Pooled reads from replicates were randomly subsampled to standard depths before calling peaks to minimize bias from sequencing depth. Peaks were called from either 30 million monoclonal uniquely mapped reads when comparing control embryos at different developmental stages, or 20 million reads when comparing transcription blocked embryos (TBE) with control embryos. Iterative random subsampling resulted in consistent ATAC-seq peak sets, indicating that subsampling did not introduce additional variation to peak calling (Supplementary Fig. 2).
As observed in humans and mice17,18,20,21,22, ATAC-seq peaks were progressively gained throughout bovine preimplantation development, with the lowest enrichment for accessible sites in 2-cell embryos (Fig. 1d). Considering that multiple studies in pre-EGA embryos indicate highly dispersed chromatin9,10, lack of chromocenters11,12, and lack of higher order chromatin organization13, this decreased enrichment for open chromatin in 2-cell embryos could reflect a highly permissive chromatin structure, resulting in similar levels of accessibility across the genome and lack of specific peak enrichment. Because assays that employ endonucleases—e.g., ATAC-seq and DNase-seq—depend on enrichment for cutting events at consistently accessible loci, genome-wide chromatin relaxation would lead to random cutting events, resulting in the observed high background and low enrichment in 2-cell embryos. Attempts to use endonuclease-based methods to profile open chromatin in mouse zygotes17,22 and human 2-cell embryos20,21 have encountered similar difficulties with low enrichment. On the other hand, a recent study that profiled chromatin accessibility with a technique that was not based on endonuclease activity found that genome-wide accessibility in human embryos actually decreased from the zygote stage onward19. Collectively, these data indicate that global chromatin relaxation is a shared characteristic of human, bovine, and murine pre-EGA embryos. It is tempting to speculate that this naïve chromatin structure acts as an epigenetic “blank slate”, which is then gradually compacted and structured to meet the needs of the developing embryo.
As chromatin becomes more structured, specific loci become more consistently accessible, and can thus be detected as enriched areas of open chromatin by ATAC-seq. Indeed, ATAC-seq peaks were progressively gained in 4-cell, 8-cell, and morula stage embryos (Fig. 1d). Intriguingly, many peaks were only apparent at a specific stage, whereas other peaks were maintained throughout later stages (Fig. 1e), suggesting that chromatin remodeling serves two functions: progressive establishment of a chromatin landscape that will support further development, and transient stage-specific regulation.
Stage-specific open chromatin has distinct functionality
To delve further into the potential function of ATAC-seq peaks that were gained or lost between consecutive developmental stages (Fig. 2a), computational footprinting was used to identify regions that were actively bound by TFs at each stage. These footprints were matched to known binding motifs38, and TF binding activities were compared between consecutive stages (Supplementary Fig. 3, Supplementary Data 1–7). Both motif enrichment and binding activity changed dramatically as development progressed, demonstrating that chromatin remodeling subjects each stage of development to a distinct regulatory circuitry (Fig. 2b). Sequence enrichment within intergenic ATAC-seq peaks revealed that the binding motifs of distinct groups of TFs gained accessibility at each developmental stage (Supplementary Table 3). These enriched sequences corresponded to the known binding motifs of several TFs that were either maternally provided or embryonically expressed (Supplementary Fig. 4).
Over 30,000 ATAC-seq peaks were unique to GV oocytes (Fig. 2a), and those which were intergenic (n = 20,959) were notably enriched for the binding motifs of RFX factors and folliculogenesis specific bHLH factor (FIGLA) (Fig. 2b, Supplementary Fig. 5). Although very few ATAC-seq peaks were detected at the 2-cell stage, nearly 70,000 peaks were gained during the 2- to 4-cell transition. Surprisingly, nearly half of these peaks were only transiently accessible (Fig. 1e). Four-cell-specific open chromatin was most significantly enriched for binding motifs of NFkB family members (Fig. 2c), which was particularly intriguing because NFkB activation is required in mouse 1-cell embryos for development to progress past EGA39. Although NFkB factors are maternally provided (Fig. 2d), they are initially sequestered in the cytoplasm until they translocate into the nucleus at the early 1-cell stage in mice39 and the 4-to-8 cell stage in cattle40. In particular, RELA – one of the NFkB subunits capable of activating expression – binds DNA with increased frequency in cattle embryos compared to oocytes40, suggesting that NFkB activation of target gene expression could be one of the first events in a cascade leading to major EGA. In fact, one of the few genes with upregulated expression in 4-cell embryos, as compared to MII oocytes, was TRIM8, a positive regulator of NFkB activity (Supplementary Fig. 6). Overall, NFkB footprints marked promoters of genes involved in negative regulation of cell differentiation at the 4-cell stage (Supplementary Table 4, Supplementary Data 8), including several potential regulators of EGA that were substantially upregulated between the 4- and 8-cell stages (Supplementary Fig. 7). Nevertheless, the contribution of NFkB activity to minor EGA in cattle has yet to be established, but certainly warrants further research.
In contrast to open chromatin in pre-EGA embryos, ATAC-seq peaks that were gained at the 8-cell stage were predominantly enriched for binding motifs of homeobox TFs, including OTX2, GSC, CRX, PHOX2A, PAX7, and PITX1 (Fig. 2e), although only OTX1, OTX2, and PITX2 were appreciably expressed during the 8-cell stage (Fig. 2f). Based on the presence of OTX and PITX footprints in promoters, these homeobox TFs appear to target genes involved in negative regulation of cell differentiation (Supplementary Table 4, Supplementary Data 8). De novo motif enrichment revealed that intergenic ATAC-seq peaks that were gained at the 8-cell stage were most significantly enriched for a motif that corresponded to the DUX family, which has been extensively implicated in EGA regulation24,25,26,27. More than a third of the accessible intergenic loci in 8-cell embryos harbored a sequence that corresponded to DUX binding motifs (Fig. 2g). DUX factors are increasingly recognized as potential developmental pioneer factors, participating in chromatin remodeling25,27, and expression of repetitive elements and cleavage-stage genes24,25,26. In cattle, DUXA is expressed transiently and strongly during the 8- and 16-cell stages (Fig. 2h), similar to DUX4 and Dux expression in human and mouse embryos, respectively25,26. Although Dux expression in mice can be induced by the maternal factors DPPA2 and DPPA441, only DPPA3 is maternally provided in cattle (Supplementary Fig. 8). Knockdown of DPPA3 impairs the developmental competency of mouse and bovine embryos42,43, highlighting its potential as a key regulator. Although the maternal factors that induce DUXA expression in cattle remain unclear, the synchronized upregulation of DUXA and increased accessibility of its binding sites during bovine development suggest that DUXA may play an important role in bovine EGA. This hypothesis is especially intriguing, considering that DUX family members are implicated in murine and human EGA, and are highly conserved and specific to placental mammals44. Further functional validation will be necessary to determine if DUXA is required for bovine embryogenesis, or if is an important but nonessential regulator of EGA, as in mice45,46,47.
The expression profile of DUXA in bovine preimplantation development closely mirrored that of ZSCAN4, another TF implicated in EGA (Supplementary Fig. 9)48. ZSCAN4 is a downstream target of DUX factors in humans25 and mice24,26, and the coordinated expression of these two factors in cattle certainly suggests that a similar mechanism may be at play. ZSCAN4 depletion results in developmental arrest at the time of EGA in mice28 and cattle29, supporting an important role for this TF. A recent report indicates that ZSCAN4 binds microsatellites in mouse embryos49. These binding sites would be problematic to identify in our analysis, given the difficulty of aligning short reads to repetitive sequences. Nevertheless, ZSCAN4 footprints were apparent in the promoters of 544 genes at the 8-cell stage, and these genes were functionally enriched for negative regulation of cell differentiation and apoptosis, as well as positive regulation of proliferation (Supplementary Table 4, Supplementary Data 8).
Extensive chromatin remodeling continued to take place after major EGA. From the 8-cell to the morula stage, over 100,000 ATAC-seq peaks appeared (Fig. 2a), and those that were intergenic were primarily enriched for KLF, SP, and GATA binding motifs (Fig. 2b; Supplementary Table 3). Differential analysis of TF binding activity in 8-cell embryos and morula, based on TF footprinting, revealed significantly increased KLF and SP binding activity in morula, although GATA binding activity did not significantly increase (Supplementary Fig. 4). Nonetheless, almost 20,000 GATA footprints were evident in open chromatin at the morula stage and occurred in promoters of genes involved in negative regulation of differentiation (Supplementary Table 4, Supplementary Data 8). Genes marked by KLF and SP footprints in 8-cell embryos and morula demonstrated substantial enrichment for a variety of biological processes, including cell-cell adherens junctions (Supplementary Table 4, Supplementary Data 8).
Comparatively fewer ATAC-seq peaks were gained during the morula to ICM transition (n = 23,954) (Fig. 2a). Intergenic peaks gained in ICM were also enriched for GATA, KLF, and SP motifs, but, in contrast to ATAC-seq peaks gained in morula, were significantly enriched for JUN factor motifs (Fig. 2b). Footprinting analysis corroborated this enrichment, showing that JUN factors exhibited stronger binding in ICM than in morula (Supplementary Fig. 4). Increased JUN binding may reflect the beginning of primitive endoderm specification, as c-JUN induces expression of endoderm markers50.
Gradually established open chromatin sets the stage for EGA
Although many ATAC-seq peaks were stage-specific, a stable open chromatin landscape was also progressively established after fertilization. As early as the 2-cell stage, ATAC-seq peaks appeared and were maintained post-EGA until at least the morula stage. Maintained ATAC-seq peaks were enriched for CTCF footprints (Fig. 3a; Supplementary Table 5), especially those that were first established during the 4-cell stage. CTCF binding delineates chromatin loop boundaries, thus determining the genomic space within which genes can interact with their regulatory elements51. Therefore, enrichment of CTCF binding in maintained peaks could indicate a gradual establishment of a stable 3-D chromatin architecture in preparation for major EGA. This interpretation is consistent with 3-D chromatin architecture in mouse embryos, which is greatly diminished after fertilization and then gradually established throughout preimplantation development, facilitating long-distance chromatin contacts, i.e., promoter-enhancer interactions, in later stage embryos13. Indeed, transcription during minor EGA in mouse is primarily driven by proximal promoters, whereas enhancers are dispensable for transcription until major EGA52. The global reorganization of 3-D chromatin architecture in the mouse has also been observed in bovine embryos: the nonradial establishment of chromosome territories, when gene-poor chromosomes tend to localize to the nuclear periphery more often than gene-rich regions, is first established at the 8-cell stage53. Collectively, these results suggest that gradual establishment of 3-D chromatin architecture may be a conserved feature of pre-EGA embryos, although the specific mechanisms and factors regulating this restructuring remain unknown.
Maintained peaks were also more enriched for KLF motifs than stage-specific peaks (Fig. 3b), and several KLFs were maternally provided (Fig. 3c), including KLF4, a master regulator of induced pluripotent stem cell reprogramming54. KLF4 contributes to reprogramming by mediating pluripotency-associated enhancer-promoter contacts55; thus, the concurrent establishment of open chromatin corresponding to CTCF and KLF binding motifs could indicate that pre-EGA embryos are priming a similar mechanism for use during major EGA, especially because about 50% of activated genes during bovine major EGA contain KLF motifs in their promoters23.
In fact, ATAC-seq peaks that were maintained from the 2- and 4-cell stages onward occurred in genic regions more often than stage-specific peaks (Fig. 3d) and marked the promoters of genes that were functionally enriched for housekeeping functions (Fig. 3e), including 33 of the 51 genes that were upregulated in 4-cell embryos compared to MII oocytes (Supplementary Fig. 6). Moreover, maintained ATAC-seq peaks were strongly enriched for NFY and SP1 binding sites (Supplementary Table 5), which is highly reminiscent of chromatin remodeling in mouse, as proximal promoters enriched for NFY motifs are some of the first regions to gain accessibility22. Of note, NFY enhances binding of the pluripotency factors POU5F1, SOX2, and KLF4 to their recognition motifs56, and is clearly involved in murine EGA, with NFY knockdown embryos presenting impaired open chromatin establishment and downregulation of gene expression22. Similarly, SP1 is essential for early mouse development, with knockout embryos arresting at day 11 of gestation57. Intriguingly, human zygotes and 4-cell embryos also demonstrate KLF, SP1, and NFY motif enrichment in open chromatin21, overall suggesting that a conserved set of maternal regulators participate in chromatin remodeling and transcription activation in pre-EGA embryos.
Maternal factors and transcription drive chromatin remodeling
Considering that embryonic transcription is limited before the 8-cell stage in cattle, the appearance of ATAC-seq peaks in pre-EGA embryos suggests that maternal factors actively participate in chromatin remodeling. To further dissect the maternal contribution to epigenetic reprogramming and EGA, embryos were cultured in the presence of α-amanitin to inhibit POLR2-dependent transcription elongation. Treatment with α-amanitin prevents development of bovine embryos past the 8-to-16-cell stage58. Loss of embryonic transcription dramatically repressed the appearance of ATAC-seq peaks (Fig. 4a); 64% of ATAC-seq peaks that should have been gained at the 4-cell stage failed to appear without embryonic transcription, and in 8-cell TBEs, 96% of ATAC-seq peaks that should have appeared were not present (Fig. 4b), disrupting the chromatin state of key genes, such as KLF4 (Fig. 4c).
Interestingly, transcription inhibition preferentially disrupted stage-specific ATAC-seq peaks, whereas maintained peaks that were established in 2- and 4-cell embryos appeared despite transcription inhibition (Fig. 4d). Transcription-independent maintained peaks marked the promoters of nearly 60% of embryonically-expressed genes (n = 1,038/1,784 genes identified from data from Bogliotti et al.23; Supplementary Fig. 10), suggesting that maternal factors actively remodel the local chromatin structure of target genes, possibly priming them for expression later in development. In fact, transcription inhibition did not affect the binding activity of maternal factors CTCF and KLFs (Supplementary Fig. 11), which were actively bound in 2- and 4-cell maintained open chromatin (Fig. 3a, b). On the other hand, transcription inhibition significantly reduced the binding activity of homeoboxes in 8-cell embryos (Fig. 5). Altogether, maternal and embryonic factors appear to work together to establish the appropriate chromatin landscape for EGA.
Repetitive element enrichment in embryonic open chromatin
Notably, the MZT is not just characterized by a shift in the transcriptome, but also in the repeatome. Over 40% of the bovine genome is comprised of repeats derived from retrotransposons59. These repeats propagate through a “copy and paste” mechanism, and their expression is generally suppressed to avoid deleterious integrations60. However, retrotransposons are actively transcribed in preimplantation embryos, and although this phenomenon was previously considered opportunistic expression due to an unusually permissive chromatin state, the activity of some retrotransposons is actually crucial for development61. Although the specific mechanisms behind this necessity are still being investigated, transposable elements have been implicated at several regulatory levels, as they can provide binding sites for TFs, behave as alternative promoters and enhancers62 and contribute to 3-D chromatin architecture63.
A complete catalog of repeat transcription throughout bovine preimplantation development was lacking. To address this gap in knowledge, available RNA-seq data32 were assessed for expression of repetitive elements. Importantly, these libraries were not subjected to polyA selection. As has been observed in mouse and human64,65, expression and accessibility of repetitive elements throughout bovine preimplantation development was highly stage-specific and dynamic (Fig. 6a; Supplementary Fig. 12).
Non-long terminal repeat retrotransposons—long interspersed elements (LINEs) and short interspersed elements —are increasingly transcribed during human preimplantation development19,64. In cattle, LINEs also demonstrated increasing expression (Fig. 6a) and accessibility (Fig. 6b) starting at the 8-cell stage. Although the function of LINE elements in bovine preimplantation embryos has yet to be established, their activation is crucial in mouse for development to proceed past the 2-cell stage66. Notably, LTR activation occurs in human64, mouse65, and bovine preimplantation development67. In particular, mammalian LTRs and ERVLs were increasingly enriched in open chromatin starting at the 8-cell stage (Fig. 6b), although their expression dropped throughout development (Fig. 6a), indicating that other mechanisms likely regulate repeat expression, e.g., DNA methylation or histone modifications68. Only endogenous retroviral K (ERVK) elements demonstrated both increasing expression (Fig. 6a) and increased prevalence in distal open chromatin at the 8-cell stage (Fig. 6b). These patterns could indicate that ERVK elements are co-opted as regulatory elements in early embryos, as observed in humans69, although the regulatory potential of these elements would need to be further explored and validated.
Mounting evidence suggests that specific LTR retrotransposons, especially intact elements, play pivotal roles in early development. During bovine preimplantation development, the most expressed retrotransposons were ERV1-1_BT and ERV1-2_BT (Fig. 7a). Upon further inspection, intact ERV1-1_BT elements often co-occurred with MER41_BT repeats in a specific sequence, which demonstrated reproducible patterns of chromatin accessibility (Fig. 7b, Supplementary Fig. 13a). Furthermore, MER41_BT elements that were accessible in 8-cell embryos were enriched for the binding motifs of several pluripotency factors, including POU5F1, NFY, KLF4, OTX2, and TEAD (Supplementary Fig. 13b), suggesting that pluripotency factors may drive transposon expression.
In human and mouse pre-EGA embryos, DUX drives the expression of intact ERVL21,24,25,26. Specifically, in human embryos DUX4 binds MLT2A1 elements21. Considering that intergenic open chromatin was especially enriched for DUX binding sites in 8-cell bovine embryos, it seemed likely that these sites would also correspond to retrotransposons. Indeed, several MLT elements were enriched in 8-cell open chromatin harboring DUX motifs (Supplementary Table 6), suggesting that bovine DUX could be regulating the expression of LTRs with sequence similarity to MLT2A1 elements. Nearly all LTRs that were enriched in 8-cell open chromatin with DUX binding sites demonstrated dynamic expression profiles throughout development (Supplementary Fig. 14). In particular, MLT1A0 elements with DUX motifs experienced sharp increases in accessibility and expression in 8-cell embryos (Fig. 7c, d).
Whether LTRs actively contribute to bovine development simply as a consequence from opportunistic expression remains to be established. However, evidence in other species suggests that retrotransposons are often co-opted as promoters and enhancers62: a phenomenon that appears to extend to bovine embryos, as ATAC-seq and RNA-seq suggest that MLT elements are co-opted as alternative promoters (Fig. 7e). An interesting balance appears to exist between repetitive elements and the embryo, wherein the repeatome leverages the embryo’s existing regulatory network to drive transposition, while simultaneously providing novel regulatory elements and TF binding sites that the embryo co-opts to drive the expression of its own transcriptional programs.
A model for mammalian genome activation
Recent evidence suggests that regulation of the MZT may differ between mammals. First, the timing of genome activation is species-specific. Second, expression patterns of repetitive elements are not only stage-specific, but also species-specific; primate-specific and murine ERVL elements are strongly expressed during preimplantation development21,24,25,26. Finally, the maternal programs in human and mice are divergent; human embryos conspicuously lack the murine maternal effect transcripts POU5F1, HSF1, and DICER170.
The relaxed chromatin structure in early preimplantation embryos provides a unique regulatory context for maternal factors, which are essential to support cleavage stage-embryos before EGA. Whereas the condensed chromatin structure in somatic cells generally restricts DNA-binding proteins to regions of open chromatin, the dispersed chromatin in 2-cell bovine embryos may allow maternal factors to opportunistically and pervasively bind their recognition motifs. Although the chromatin landscape changes markedly upon major EGA in bovine, human18,19,21, and mouse17,22 embryos, active regulators during this developmental stage appear to differ significantly between humans and mice18, which suggests that regulation of mammalian EGA is species-specific. Genomic regulatory elements tend to evolve rapidly, making it difficult to identify homologous regions and compare accessibility in different species. On the other hand, sequences contained in open chromatin can indicate which regulatory factors are active in a given cell type. Therefore, the open chromatin landscapes of cattle, human, and mice embryos and ESC were compared for enrichment of known TF binding motifs. This analysis clearly clustered cattle and human together, especially 8-cell embryos and ESC, whereas the mouse remained distinct until later stages, when ICM and ESC regulatory circuitry began to converge with that of cattle and human ICM and ESC (Fig. 8a). To identify putative regulators of mammalian EGA, the enrichment of actively expressed TF motifs was compared across species and developmental stages. The activity of several regulatory factors differed between species, clearly separating humans and cattle from mice, and again suggesting divergence in regulatory and transcriptional programs (Supplementary Fig. 15). Compared to mouse 2-cell embryos, open chromatin in bovine and human 8-cell embryos demonstrated remarkably similar patterns of sequence enrichment. In cattle and human embryos, SP1, OTX2, and NFY were particularly implicated during major EGA, whereas NR5A2, RARG, and ESRRB were solely implicated in mouse EGA (Fig. 8b).
Nevertheless, it remains unclear if these TFs are regulators of EGA, or simply products of it. For instance, although NR5A2 binding sites were enriched in 2-cell mouse embryos, NR5A2 is an early regulator of inner cell mass and trophectoderm programs and is not essential for genome activation17. Although several TFs appeared to only be important in either mouse, or in cattle and humans, KLFs were substantially enriched during EGA in all three species. Considering the well-established role of KLFs in somatic cell reprogramming and establishment of pluripotency54, KLFs may play a conserved role in the MZT.
Sweeping changes to chromatin structure during bovine preimplantation development suggest that 2-cell embryos are characterized by globally decondensed chromatin, which is gradually compacted as development progresses, echoing similar observations in humans and mice. In particular, it is tempting to speculate that a conserved set of maternal factors establish basal promoter accessibility and the necessary chromatin architecture for enhancer-promoter interactions that will drive gene expression during major EGA (Fig. 9). However, the open chromatin landscape during major EGA clearly distinguished mice from cattle and humans, suggesting that whereas maternal regulation of EGA may be conserved across mammals, the transcriptional programs that are subsequently activated have diverged substantially. The resemblance of cattle and human EGA, both with respect to timing and associated regulators, strongly suggests that cattle are a more appropriate model system for human preimplantation development than mice. Nevertheless, the factors that appear to regulate the MZT in cattle, humans, and mice certainly warrant further investigation and validation, which will provide invaluable insight into the regulatory framework that governs successful preimplantation development.
Oocyte collection and maturation
Ovaries were procured from a local abattoir and transported to the laboratory in a warm saline solution, complying with University of California Davis relevant ethical regulation for animal testing and research. Follicles measuring 2–10 mm were aspirated to obtain cumulus oocyte complexes (COCs). Only COCs with healthy layers of cumulus cells were selected for maturation. These were washed in collection medium 6:4 M199 (Sigma M7653): SOF-Hepes, supplemented with 2% fetal bovine serum (FBS; Hyclone/Thermo Scientific) and transferred to maturation medium modified M199 medium (Sigma M2154) supplemented with ALA-glutamine (0.1 mM), sodium pyruvate (0.2 mM), gentamicin (5 μg/ml), EGF (50 ng/ml), oFSH (50 ng/ml), bLH (3 μg/ml), cysteamine (0.1 mM), and 10% FBS.
In vitro fertilization and embryo culture
After COCs matured for 24 h, MII oocytes were washed in SOF-IVF medium (107.7 mM NaCl, 7.16 mM KCl, 1.19 mM KH2PO4, 0.49 mM, MgCl2, 1.17 mM CaCl2, 5.3 mM sodium lactate, 25.07 mM NaHCO3, 0.20 mM sodium pyruvate, 0.5 mM fructose, 1X nonessential amino acids, 5 μg/ml gentamicin, 10 μg/ml heparin, 6 mg/ml fatty acid-free BSA) and transferred to drops of SOF-IVF medium under mineral oil. Frozen semen from a Holstein bull was thawed, and 106 sperm/ml were added to drops with MII oocytes, which were incubated at 38.5 °C for 12–18 h. Zygotes were then removed from fertilization medium, and cumulus cells were removed by vortexing for 5 min in SOF-Hepes medium. Zygotes were then transferred to culture media (KSOMaa supplemented with 4 mg/mL BSA) under mineral oil, and incubated at 38.5 °C in 5% CO2, 5% O2, and 90% N2. If embryos were to be transcriptionally inhibited, the culture medium was supplemented with α-amanitin (50 μg/ml) on day-one. On day-three, the culture medium was supplemented with 5% stem-cell qualified FBS (Gemini Bio 100–525). Blastocyst development (minimum 30%) was evaluated at 7 days post-insemination (dpi).
Culture of bovine ESC (bESC) and preparation for ATAC-seq
CTFR-bESC lines derived from ICM through Wnt-pathway inhibition37 were cultured in CTFR medium consisting in custom mTeSR1 (devoid of TGFb and FGF2) supplemented with 20 ng/mL human FGF2 (100–18B, Peprotech), 2.5 μM IWR-1 (I0161, Sigma Aldrich), and 100 U/mL penicillin–100 μg/mL streptomycin (15140–122, Gibco) on irradiated mouse embryonic fibroblast (MEF) (A34180, Gibco) at 37 °C and 5% CO2. Cells were sub-cultured every 4–5 days at 1:10 split ratio using TrypLE Express (12604–013, Gibco). Culture medium was changed daily between passages. To increase cell survival, bESC culture medium was supplemented with 10 μM ROCK inhibitor Y-27632 (ALX-270-333-M005, Enzo) at seeding. Before cell lysis and library preparation CTFR-bESCs were trypsinized and separated from MEF feeders by plating the cell suspension in CTFR culture medium supplemented with 10 μM ROCK inhibitor Y-27632 for 1 h in a 60 mm cell culture dish (430196, Corning) at 37 °C and 5% CO2. After incubation, cells that did not adhere to the culture dish were recovered for downstream processing. For each library, 50,000 cells were centrifuged at 500 rcf, 4 °C, for 5 min. Pellets were then washed once with 50 µl cold 1x PBS buffer and centrifuged at 500 rcf, 4 °C, for 5 min. Pellets were then resuspended in 50 µl cold ATAC-seq lysis buffer (10 mM Tris-HCl, pH 7.4, 10 mM NaCl, 3 mM MgCl2 and 0.1% IGEPAL CA-630) and centrifuged at 500 rcf, 4 °C, for 10 min to isolate nuclear pellets.
Preparation of oocytes and embryos for ATAC-seq
Pools of oocytes and embryos were collected for ATAC-seq library preparation from three separate collections per developmental stage. Embryos intended for collection at the 2-cell, 4-cell, or 8-cell stages were divided into two groups, one of which was supplemented with α-amanitin, and cultured simultaneously. GV oocytes were collected for ATAC-seq prior to maturation. Preimplantation embryos were collected at the 2-cell (30–32 h post-insemination), 4-cell (2 dpi), 8-cell (3 dpi), and morula stages (5 dpi). For ICM collection, blastocysts were collected at 7 dpi and subjected to immunosurgery. Oocytes or embryos (a minimum of 500 cells) were treated with pronase (10 mg/ml) to completely remove the zona pellucida and washed with SOF-Hepes on a warming plate. Cells were then transferred to 1 ml cold SOF-Hepes, and centrifuged at 500 rcf, 4 °C, for 5 min. Morulae were vortexed for 3 min in cold ATAC-seq lysis buffer. Cell pellets were then resuspended in 1 ml cold ATAC-seq cell lysis buffer and centrifuged at 500 rcf, 4 °C, for 10 min, and supernatant was aspirated to isolate nuclear pellets. To isolate ICMs by immunosurgery, blastocysts were treated with pronase (10 mg/ml) and washed in SOF-Hepes medium, then washed in 20% antibovine whole serum (B8270-2ML SIGMA, dilute in SOF-Hepes medium), and transferred to individual drops of 20% antibovine whole serum solution under mineral oil (20 ul per drop). Blastocysts were incubated for 1 h at 38.5 °C, washed in SOF-Hepes medium, then washed in 20% guinea pig complement (Innovative Grade US origin, IGP-COMPL-21249) and transferred to individual drops of 20% guinea pig complement solution under mineral oil (20 μl drop). Blastocysts were again incubated for 1 h at 38.5 °C. ICM were isolated by pipetting, treated with trypsin for 2–3 min, and pipetted several times before adding 10% fetal bovine serum to stop trypsinization. ICM cells were then washed in 1 ml cold SOF-Hepes medium and centrifuged at 500 rcf, 4 °C, for 5 min. Cell pellets were then resuspended in 1 ml cold ATAC-seq cell lysis buffer and centrifuged at 500 rcf, 4 °C, for 10 min, and supernatant was aspirated to isolate nuclear pellets.
Transposition and ATAC-seq library construction
Nuclear pellets were then resuspended in 50 μl transposition reaction mix (25 μl TD buffer (Nextera DNA Library Prep Kit, Illumina), 2.5 μl TDE1 enzyme (Nextera DNA Library Prep Kit, Illumina), 22.5 μl nuclease-free H2O) and incubated for 60 min at 37 °C, shaking at 300 rpm. The transposase, which is loaded with Illumina sequencing adapters, cuts DNA where it is not sterically hindered and simultaneously ligates adapters, effectively producing a library in one incubation step. Transposed DNA was purified with the MinElute PCR purification kit (Qiagen, Hilden, Germany) and eluted in 10 μl buffer EB. Libraries were then PCR amplified: 50 μl reactions (25 μl SsoFast Evagreen supermix with low ROX (Bio-Rad, Hercules, CA), 0.6 μl 25 μM custom Nextera PCR primer 1, 0.6 μl 25 μM custom Nextera PCR primer 2 (containing the index used for demultiplexing; Supplementary Table 7), 13 μl nuclease-free H2O, and 10 μl eluted DNA) were cycled at 72 °C for 5 min, 98 °C for 30 s, and then thermocycling at 98 °C for 10 s, 63 °C for 30 s and 72 °C for 1 min. Libraries from GV oocytes, 2-cell, and 4-cell embryos were thermocycled for 13 cycles, and 8-cell and morulae libraries were thermocycled for 11 cycles. PCR-amplified libraries were again purified with the MinElute PCR purification kit (Qiagen, Hilden, Germany) and eluted in 10 μl buffer EB. Libraries were then evaluated for DNA concentration and nucleosomal laddering patterns using the Bioanalyzer 2100 DNA High Sensitivity chip (Agilent, Palo Alto, CA). Expected nucleosomal laddering was evidenced by the presence of both small fragments, corresponding to hyper-accessible DNA that was frequently transposed, and larger fragments, corresponding to DNA that was wrapped around one or more nucleosomes. This study focused on mapping open chromatin; therefore, the sub-nucleosomal length fraction of each library (150–250 bp) was size selected using the PippinHT system (Sage Science, Beverly, MA) with 3% agarose cassettes. Size-selected libraries were run on a Bioanalyzer DNA High Sensitivity chip to confirm size-selection and determine DNA concentration. Final libraries were then pooled for sequencing on the Illumina NextSeq platform to generate 40 bp paired end reads.
ATAC-seq read alignment and peak calling
Raw sequencing reads were trimmed with Trim_Galore, a wrapper around Cutadapt (v0.4.0)71, to remove residual Illumina adapter sequences and low quality (q < 20) ends, keeping unpaired reads and reads 10 bp or longer after trimming. Trimmed reads were then aligned to either the GRCm38 (mouse), GRCh38 (human), or ARS-UCD1.2 (cattle) assemblies using BWA aln (-q 15) and sampe72. PCR duplicates were removed with PicardTools, and mitochondrial and low-quality alignments (q < 15) were removed with SAMtools73. Alignments from biological replicates from each stage were merged and randomly subsampled to equivalent depth with SAMtools for detection of open chromatin. To determine which regions of the genome demonstrated significant enrichment of ATAC-seq signal, broad peaks were called with MACS274, using a q value cutoff of 0.05, and settings --nomodel --shift -100 --extsize 200.
RNA-seq alignment and gene expression quantification
Raw sequencing reads were trimmed with Trimmomatic (v0.33)75. Low-quality leading and trailing bases (3 bases) were clipped, and illumine adapter sequences were removed, allowing 2 seed mismatches, a palindrome clip threshold of 30, and a simple clip threshold of 10. Sliding window trimming was conducted with a window size of 4 bases, and a quality threshold of 15. Reads 36 bases or longer were retained after trimming. Trimmed reads were aligned to either the GRCm38 (mouse), GRCh38 (human), or ARS-UCD1.2 (cattle) assemblies with STAR (v2.7.2a) with options –outFilterScoreMinOverLread 0.85 and –seedSearchStartLmax 30. Low quality alignments (q < 5) were removed with SAMtools. Raw counts were calculated for genes in the Ensembl 96 annotations for each species with summarizeOverlaps, from the R package GenomicAlignments (v1.18.1)76, using ‘Union’ mode and allowing fragments for paired end data. The DESeq2 R package (v1.22.2)77 was used to identify differentially expressed genes, which were required to demonstrate a logFC >2 and an adjusted p value < 0.05. Variance stabilizing transformation (VST) was used to normalize expression counts for visualization.
Repeat expression quantification
Trimmed reads were aligned to the ARS-UCD1.2 genome assembly with STAR (v2.7.0)78 with options –outFilterMultimapNmax 100, --winAnchorMultimapNmax 100, and --twopassMode Basic. Raw expression values for individual repetitive elements were calculated for repeats in the RepeatMasker annotation for the ARS-UCD1.2 assembly (downloaded from the UCSC Genome Browser) using TEtoolkit79 in ‘multi’ mode, which improves quantification of transposable elements transcripts by including ambiguously mapped reads. Raw expression values were normalized with the variance stabilizing transformation included in the DESeq2 R package (v1.22.2).
Comparison of ATAC-seq and RNA-seq genome-wide signal
For both ATAC-seq and RNA-seq data, alignments were converted to bigwig format using bamCoverage from the DeepTools suite80, which binned the genome into windows (50 bp default size) and calculated normalized signal in each window (fragments per kilobase million (FPKM) for paired-end data, and reads per kilobase million (RPKM) for single-end data). These bigwig files were used for track visualization with the UCSC genome browser. To compare bigwig files from replicate libraries on a genome-wide scale, the multiBigWigSummary function from Deeptools was used in bins mode (bin size of 500 bp). The output was then piped to the plotPCA function from DeepTools to generate the PCA plot (options –transpose, --log2¸and –ntop 150000 for ATAC-seq libraries; options –transpose, --log2, and –ntop 100000 for RNA-seq libraries), and the plotCorrelation function to calculate the Pearson correlation coefficient between biological replicates. To assess average accessibility or expression at genomic intervals of interest, ATAC-seq normalized signal (FPKM in 50 bp bins) was visualized using the DeepTools computeMatrix (in scale-regions mode with option –skipZeros) and plotHeatmap functions. Regions of interest were scaled to 500 bp, and signal was also plotted for 1 kb up- and downstream.
Comparison and classification of ATAC-seq peaks
Peak sets from different stages were compared using the BEDtools intersect function81, requiring a minimum of 1 bp overlap to consider a peak shared by both sets. Similarly, peaks were classified as genic if they overlapped either the 2 kb region upstream of a transcription start site (TSS), exons, or introns by 1 bp. Otherwise, peaks were considered intergenic.
Genomic regions were evaluated for binding motif enrichment using the findMotifsGenome.pl script from HOMER (v4.8)82, using the exact sizes of the input genomic intervals (-size given). The most significant known or de novo motifs were reported, based on p value. Known motifs that matched significantly enriched de novo motifs were reported if their match score exceeded 0.6. To compare motif instance in open chromatin at different stages and in different species, sets of ATAC-seq peaks were evaluated for the presence of HOMER’s set of known TF motifs, measured as the percent of ATAC-seq peaks containing any given motif. To compare these data, PCA was performed with the prcomp function from the stats R package (v3.6.3), with options center = T and scale = F.
Genome-wide motif location prediction
Position-weight matrices were downloaded from the JASPAR database for TFs of interest38. Using the FIMO tool from the MEME suite (v5.0.4)83, TF motif locations (p < 1e–4) were predicted genome-wide in the ARS-UCD1.2 genome assembly.
Transcription factor footprinting analysis
The HINT framework from the Regulatory Genomics Toolbox84 was used to identify active TF binding sites from ATAC-seq data. When TFs are actively bound to DNA, transposase is sterically hindered, resulting in a small region, termed a “footprint,” where read coverage abruptly drops. To maximize ability to detect footprints, reads from biological replicates were pooled together for each developmental stage and treatment group, and broad peaks were called from pooled reads with MACS274 using a q value cutoff of 0.05, and settings --nomodel --shift -100 --extsize 200. For identifying footprints in an individual stage or treatment group, the rgt-hint command was used in footprinting mode to predict footprints within the corresponding ATAC-peaks, based on signal from pooled reads (commands –atac-seq and –paired-end). These footprints were matched to known motifs in JASPAR38 with the rgt-motifanalysis command in matching mode. For comparing TF binding activity between developmental stages, or between control and TBEs, ATAC-seq peaks called in each of the two stages or treatment groups were first combined with BEDtools merge. Footprints were then identified with rgt-hint in this merged set of ATAC-seq peaks based on signal from pooled reads in each of the two stages or treatment groups. Footprints were matched to known motifs in JASPAR38 with rgt-motifanalysis. Finally, the rgt-hint command in differential mode was used to compare the chromatin accessibility for each TF in each of the two stages or treatment groups. For each TF, the resulting binding activity score indicated whether the TF was more strongly bound in one group than the other, and the associated p-value indicated whether this change in binding activity between groups was significant (two-tailed t test; p < 0.05).
Repeat class enrichment in genomic intervals
To determine if repetitive elements (either individual elements, families, or classes) were enriched in open chromatin, the number of ATAC-seq peaks overlapping a set of repetitive elements was compared to randomized intervals (ATAC-seq peak locations shuffled with BEDtools shuffle function) overlapping the same set of repetitive elements, yielding a log ratio of random to observed.
Functional annotation enrichment analysis
For gene sets of interest, the Ensembl gene IDs of human homologs were fetched using biomaRt85. These Ensembl IDs were submitted to DAVID (v6.8)86,87 to identify enriched biological functions using medium stringency. Gene ontology terms (GOTERM_BP_DIRECT, GOTERM_CC_DIRECT, and GOTERM_MF_DIRECT) with a false discovery rate (FDR) <0.05 were reported.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
The following published NGS data sets were used, and accessed through the NCBI GEO repository: for bovine oocytes and in vitro produced embryos, raw RNA-seq data were downloaded from accession number GSE5241532. For mouse and human preimplantation embryo ATAC-seq and RNA-seq data, raw sequencing files were downloaded from accession numbers GSE6639017 and GSE10157118, respectively. The ATAC-seq data produced in this study are available through the NCBI GEO repository under accession number GSE143658. All other relevant data supporting the key findings of this study are available within the article and its Supplementary Information files or from the corresponding author upon reasonable request. Source data are provided with this paper. A reporting summary for this Article is available as a Supplementary Information file.
Abe, K.-I. et al. Minor zygotic gene activation is essential for mouse preimplantation development. Proc. Natl Acad. Sci. USA. 115, E6780–E6788 (2018).
Ram, P. T. & Schultz, R. M. Reporter gene expression in G2 of the 1-cell mouse embryo. Dev. Biol. 156, 552–556 (1993).
Schulz, K. N. & Harrison, M. M. Mechanisms regulating zygotic genome activation. Nat. Rev. Genet. 20, 221–234 (2019).
Latham, K. E., Garrels, J. I., Chang, C. & Solter, D. Quantitative analysis of protein synthesis in mouse embryos. I. Extensive reprogramming at the one- and two-cell stages. Development 112, 921 LP–921932 (1991).
Eckersley-Maslin, M. A., Alda-Catalinas, C. & Reik, W. Dynamics of the epigenetic landscape during the maternal-to-zygotic transition. Nat. Rev. Mol. Cell Biol. 19, 436–450 (2018).
Iurlaro, M., von Meyenn, F. & Reik, W. DNA methylation homeostasis in human and mouse development. Curr. Opin. Genet. Dev. 43, 101–109 (2017).
Ross, P. J. et al. Polycomb gene expression and histone H3 lysine 27 trimethylation changes during bovine preimplantation development. Reproduction 136, 777–785 (2008).
Funaya, S. & Aoki, F. Regulation of zygotic gene activation by chromatin structure and epigenetic factors. J. Reprod. Dev. 63, 359–363 (2017).
Ooga, M., Fulka, H., Hashimoto, S., Suzuki, M. G. & Aoki, F. Analysis of chromatin structure in mouse preimplantation embryos by fluorescent recovery after photobleaching. Epigenetics 11, 85–94 (2016).
Ahmed, K. et al. Global chromatin architecture reflects pluripotency and lineage commitment in the early mouse embryo. PLoS ONE 5, e10531–e10531 (2010).
Martin, C. et al. Genome restructuring in mouse embryos during reprogramming and early development. Dev. Biol. 292, 317–332 (2006).
Probst, A. V., Santos, F., Reik, W., Almouzni, G. & Dean, W. Structural differences in centromeric heterochromatin are spatially reconciled on fertilisation in the mouse zygote. Chromosoma 116, 403–415 (2007).
Du, Z. et al. Allelic reprogramming of 3D chromatin architecture during early mammalian development. Nature 547, 232–235 (2017).
Bultman, S. J. et al. Maternal BRG1 regulates zygotic genome activation in the mouse. Genes Dev. 20, 1744–1754 (2006).
Jimenez, R. et al. Maternal SIN3A regulates reprogramming of gene expression during mouse preimplantation development. Biol. Reprod. 93, 89 (2015).
Hernandez, C. et al. Dppa2/4 Facilitate Epigenetic Remodeling during Reprogramming to Pluripotency. Cell Stem Cell 23, 396–411.e8 (2018).
Wu, J. et al. The landscape of accessible chromatin in mammalian preimplantation embryos. Nature 534, 652–657 (2016).
Wu, J. et al. Chromatin analysis in human early development reveals epigenetic transition during ZGA. https://doi.org/10.1038/s41586-018-0080-8 (2018).
Li, L. et al. Single-cell multi-omics sequencing of human early embryos. Nat. Cell Biol. 20, 847–858 (2018).
Gao, L. et al. Chromatin accessibility landscape in human early embryos and its association with evolution. Cell 173, 248–259.e15 (2018).
Liu, L. et al. An integrated chromatin accessibility and transcriptome landscape of human pre-implantation embryos. Nat. Commun. 10, 364 (2019).
Lu, F. et al. Establishing Chromatin Regulatory Landscape during Mouse Preimplantation Development. Cell 165, 1375–1388 (2016).
Bogliotti, Y. S. et al. Transcript profiling of bovine embryos implicates specific transcription factors in the maternal-to-embryo transition. Biol. Reprod. https://doi.org/10.1093/biolre/ioz209 (2019).
Whiddon, J. L., Langford, A. T., Wong, C.-J., Zhong, J. W. & Tapscott, S. J. Conservation and innovation in the DUX4-family gene network. Nat. Genet. 49, 935–940 (2017).
Hendrickson, P. G. et al. Conserved roles of mouse DUX and human DUX4 in activating cleavage-stage genes and MERVL/HERVL retrotransposons. Nat. Genet. 49, 925–934 (2017).
De Iaco, A. et al. DUX-family transcription factors regulate zygotic genome activation in placental mammals. Nat. Genet. 49, 941–945 (2017).
Kruse, K. et al. Transposable elements drive reorganisation of 3D chromatin during early embryogenesis. bioRxiv 523712. https://doi.org/10.1101/523712 (2019).
Falco, G. et al. Zscan4: a novel gene expressed exclusively in late 2-cell embryos and embryonic stem cells. Dev. Biol. 307, 539–550 (2007).
Takahashi, K., Ross, P. J. & Sawai, K. The necessity of ZSCAN4 for preimplantation development and gene expression of bovine embryos. J. Reprod. Dev. 65, 319–326 (2019).
Wang, H. & Dey, S. K. Roadmap to embryo implantation: clues from mouse models. Nat. Rev. Genet. 7, 185–199 (2006).
Vassena, R. et al. Waves of early transcriptional activation and pluripotency program initiation during human preimplantation development. Development 138, 3699 LP–3693709 (2011).
Graf, A. et al. Fine mapping of genome activation in bovine embryos by RNA sequencing. Proc. Natl Acad. Sci. 111, 4139 LP–4134144 (2014).
Zhang, A. et al. Dynamic changes of histone H3 trimethylated at positions K4 and K27 in human oocytes and preimplantation embryos. Fertil. Steril. 98, 1009–1016 (2012).
Wu, X. et al. Multiple histone site epigenetic modifications in nuclear transfer and in vitro fertilized bovine embryos. Zygote 19, 31–45 (2011).
Liu, X. et al. Distinct features of H3K4me3 and H3K27me3 chromatin domains in pre-implantation embryos. Nature 537, 558–562 (2016).
Zhang, B. et al. Allelic reprogramming of the histone modification H3K4me3 in early mammalian development. Nature 537, 553–557 (2016).
Bogliotti, Y. S. et al. Efficient derivation of stable primed pluripotent embryonic stem cells from bovine blastocysts. Proc. Natl Acad. Sci. USA. 115, 2090–2095 (2018).
Fornes, O. et al. JASPAR 2020: update of the open-access database of transcription factor binding profiles. Nucleic Acids Res. https://doi.org/10.1093/nar/gkz1001 (2019).
Nishikimi, A., Mukai, J. & Yamada, M. Nuclear translocation of nuclear factor kappa B in early 1-cell mouse embryos. Biol. Reprod. 60, 1536–1541 (1999).
Paciolla, M. et al. Nuclear factor-kappa-B-inhibitor alpha (NFKBIA) is a developmental marker of NF-kappaB/p65 activation during in vitro oocyte maturation and early embryogenesis. Hum. Reprod. 26, 1191–1201 (2011).
Eckersley-Maslin, M. et al. Dppa2 and Dppa4 directly regulate the Dux-driven zygotic transcriptional program. Genes Dev. 33, 194–208 (2019).
Payer, B. et al. Stella is a maternal effect gene required for normal early development in mice. Curr. Biol. 13, 2110–2117 (2003).
Bakhtari, A. & Ross, P. J. DPPA3 prevents cytosine hydroxymethylation of the maternal pronucleus and is required for normal development in bovine embryos. Epigenetics 9, 1271–1279 (2014).
Leidenroth, A. & Hewitt, J. E. A family history of DUX4: phylogenetic analysis of DUXA, B, C and Duxbl reveals the ancestral DUX gene. BMC Evol. Biol. 10, 364 (2010).
Chen, Z. & Zhang, Y. Loss of DUX causes minor defects in zygotic genome activation and is compatible with mouse development. Nat. Genet. 51, 1 (2019).
Guo, M. et al. Precise temporal regulation of Dux is important for embryo development. Cell Res. https://doi.org/10.1038/s41422-019-0238-4 (2019).
De Iaco, A., Verp, S., Offner, S., Grun, D. & Trono, D. DUX is a non-essential synchronizer of zygotic genome activation. Development 147, dev177725 (2020).
Ko, M. S. H. Zygotic genome activation revisited: looking through the expression and function of Zscan4. Curr. Top. Dev. Biol. 120, 103–124 (2016).
Srinivasan, R. et al. Zscan4 binds nucleosomal microsatellite DNA and protects mouse two-cell embryos from DNA damage. Sci. Adv. 6, eaaz9115 (2020).
Liu, J. et al. The oncogene c-Jun impedes somatic cell reprogramming. Nat. Cell Biol. 17, 856–867 (2015).
Ghirlando, R. & Felsenfeld, G. CTCF: making the right connections. Genes Dev. 30, 881–891 (2016).
Wiekowski, M., Miranda, M. & DePamphilis, M. L. Requirements for promoter activity in mouse oocytes and embryos distinguish paternal pronuclei from maternal and zygotic nuclei. Dev. Biol. 159, 366–378 (1993).
Koehler, D. et al. Changes of higher order chromatin arrangements during major genome activation in bovine preimplantation embryos. Exp. Cell Res. 315, 2053–2063 (2009).
Takahashi, K. & Yamanaka, S. Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell 126, 663–676 (2006).
Di Giammartino, D. C. et al. KLF4 is involved in the organization and regulation of pluripotency-associated three-dimensional enhancer networks. Nat. Cell Biol. 21, 1179–1190 (2019).
Oldfield, A. J. et al. Histone-fold domain protein NF-Y promotes chromatin accessibility for cell type-specific master transcription factors. Mol. Cell 55, 708–722 (2014).
Marin, M., Karis, A., Visser, P., Grosveld, F. & Philipsen, S. Transcription factor Sp1 is essential for early embryonic development but dispensable for cell growth and differentiation. Cell 89, 619–628 (1997).
Memili, E. & First, N. L. Developmental changes in RNA polymerase II in bovine oocytes, early embryos, and effect of alpha-amanitin on embryo development. Mol. Reprod. Dev. 51, 381–389 (1998).
Adelson, D. L., Raison, J. M. & Edgar, R. C. Characterization and distribution of retrotransposons and simple sequence repeats in the bovine genome. Proc. Natl Acad. Sci. USA 4, 12855–12860 (2009).
Friedli, M. & Trono, D. The developmental control of transposable elements and the evolution of higher species. Annu. Rev. Cell Dev. Biol. 31, 429–451 (2015).
Spadafora, C. Endogenous reverse transcriptase: a mediator of cell proliferation and differentiation. Cytogenet. Genome Res. 105, 346–350 (2004).
Chuong, E. B., Elde, N. C. & Feschotte, C. Regulatory activities of transposable elements: from conflicts to benefits. Nat. Rev. Genet. 18, 71–86 (2017).
Raviram, R. et al. Analysis of 3D genomic interactions identifies candidate host genes that transposable elements potentially regulate. Genome Biol. 19, 216 (2018).
Yandım, C. & Karakülah, G. Expression dynamics of repetitive DNA in early human embryonic development. BMC Genomics 20, 439 (2019).
Franke, V. et al. Long terminal repeats power evolution of genes and gene expression programs in mammalian oocytes and zygotes. Genome Res. 27, 1384–1394 (2017).
Beraldi, R., Pittoggi, C., Sciamanna, I., Mattei, E. & Spadafora, C. Expression of LINE-1 retroposons is essential for murine preimplantation development. Mol. Reprod. Dev. 73, 279–287 (2006).
Bui, L. C. et al. Retrotransposon expression as a defining event of genome reprograming in fertilized and cloned bovine embryos. Reproduction 138, 289–299 (2009).
Bestor, T. H. & Bourc’his, D. Transposon silencing and imprint establishment in mammalian germ cells. Cold Spring Harb. Symp. Quant. Biol. 69, 381–387 (2004).
Pontis, J. et al. Hominoid-specific transposable elements and KZFPs facilitate human embryonic genome activation and control transcription in naive human ESCs. Cell Stem Cell 24, 724–735.e5 (2019).
Boroviak, T. et al. Single cell transcriptome analysis of human, marmoset and mouse embryos reveals common and divergent features of preimplantation development. Development 145, dev167833 (2018).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 17, 10–12 (2011).
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Lawrence, M. et al. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 9, e1003118 (2013).
Anders, S. & Huber, W. Differential expression analysis for sequence count data. Genome Biol. 11, R106 (2010).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Lerat, E., Fablet, M., Modolo, L., Lopez-Maestre, H. & Vieira, C. TEtools facilitates big data expression analysis of transposable elements and reveals an antagonism between their activity and that of piRNA genes. Nucleic Acids Res. 45, e17 (2017).
Ramírez, F., Dündar, F., Diehl, S., Grüning, B. A. & Manke, T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 42, W187–W191 (2014).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576–589 (2010).
Bailey, T. L. et al. MEME Suite: tools for motif discovery and searching. Nucleic Acids Res. 37, W202–W208 (2009).
Li, Z. et al. Identification of transcription factor binding sites using ATAC-seq. Genome Biol. 20, 45 (2019).
Durinck, S., Spellman, P. T., Birney, E. & Huber, W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 4, 1184 (2009).
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44–57 (2009).
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 37, 1–13 (2009).
Funding for this project was provided by NIH grant HD070044 to P.J.R. and R.M.S. M.M.H. was supported by a NIFA National Needs Fellowship Grant (USDA-NIFA Competitive Grant Project no. 2014–38420–21796) and an Austin Eugene Lyons Fellowship. Semen used for in vitro embryo production was provided by Semex (Madison, WI). The authors thank the entire P.J.R. laboratory for contributions to embryo production and lively discussion.
The authors declare no competing interests.
Peer review information Nature Communications thanks Kathy Niakan and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Halstead, M.M., Ma, X., Zhou, C. et al. Chromatin remodeling in bovine embryos indicates species-specific regulation of genome activation. Nat Commun 11, 4654 (2020). https://doi.org/10.1038/s41467-020-18508-3