Accessible chromatin reveals regulatory mechanisms underlying cell fate decisions during early embryogenesis

Fan, Tongqiang; Huang, Youjun

doi:10.1038/s41598-021-86919-3

Download PDF

Article
Open access
Published: 12 April 2021

Accessible chromatin reveals regulatory mechanisms underlying cell fate decisions during early embryogenesis

Tongqiang Fan¹ &
Youjun Huang¹

Scientific Reports volume 11, Article number: 7896 (2021) Cite this article

2926 Accesses
5 Citations
1 Altmetric
Metrics details

Subjects

Abstract

This study was conducted to investigate epigenetic landscape across multiple species and identify transcription factors (TFs) and their roles in controlling cell fate decision events during early embryogenesis. We made a comprehensively joint-research of chromatin accessibility of five species during embryogenesis by integration of ATAC-seq and RNA-seq datasets. Regulatory roles of candidate early embryonic TFs were investigated. Widespread accessible chromatin in early embryos overlapped with putative cis-regulatory sequences. Sets of cell-fate-determining TFs were identified. YOX1, a key cell cycle regulator, were found to homologous to clusters of TFs that are involved in neuron and epidermal cell-fate determination. Our research provides an intriguing insight into evolution of cell-fate decision during early embryogenesis among organisms.

Single-nucleus chromatin landscapes during zebrafish early embryogenesis

Article Open access 19 July 2023

An ATAC-seq atlas of chromatin accessibility in mouse tissues

Article Open access 20 May 2019

An atlas of dynamic chromatin landscapes in mouse fetal development

Article Open access 29 July 2020

Introduction

An outstanding challenge of developmental biology is to explain how multi-cell organisms originate from a single cell that mature through complex dynamic processes. In complex organs, the generation of a single lineage usually involves multiple steps of cell fate decisions. Comprehensively, understanding the pathways of cell lineage differentiation during in vivo development, especially transcriptional regulatory strategies at point of cell lineage segregation, and the common characteristics of multiple organisms, in critical for directing stem cell differentiation into desired cell types and the discovery of common ground of eukaryotes. Early embryo development of the metazoans and plants comprise a sequence of cell fate decisions in which cells are guided along a pathway of restricted potential and increasing specialisation.

During mammalian early embryo development, stem cell will undergo some cell fate determined phases, including ZGA (zygote genome activation), 1-, 2-, 4-, 8-, 16-, 32-cell, early blastocyst, and late blastocyst stages, in which 8–32-cell stages exhibit first cell fate decision, and early to late blastocyst stages occur second cell fate decision^1,2,3. The embryonic cell lineage of C. elegans has been traced from zygote to newly hatched larva⁴, in which 1-, 2-, 4-, 8-, and 16-cell stage embryos can represent early embryo cell lineage⁵. The control of the oocyte-to-embryo transition in Drosophila parallels that of other animals, however, in early Drosophila embryos, the master checkpoint laid on nuclear cycle from cycle 10 to 14⁶.

Compared to mammals, plants are ancient organisms, which require many centuries to obtain new organs. Shoot, root, flowers, fruits are continuously augmented to build sophisticated post-embryonic tissues. This process requires coordination of cell divisions, cell fate determination, and cell communication. Starting from fertilization, embryogenesis produces the first stem cell, and by the end of embryogenesis the zygote has transformed into a mature embryo that comprises the basic tissue types identical to any post-embryonic plant⁷. The processes of plant early embryo development include multiple stages: two cell, eight cell, 16 cell, globular, heart, torpedo, and bent stages⁸.

TFs (transcription factors) play an irreplaceable role in cell fate determination during embryogenesis⁹. The identities of cell types are also influenced by environmental signal transduction, in turn, results in the activation or inhibition of TFs¹⁰. Much progress has been made in understanding how core regulators such as OCT4, NANOG, and SOX2 as well as transcriptional effectors of signaling pathways, such as SMAD1/2/3/4, and TCF3, control the regulatory circuity that control earliest stage of embryonic development¹¹. During early embryogenesis, inner cells develop a stable regulation circuit, in which OCT4, SOX2, and NANOG are assigned to promote pluripotency to determine ICM (inner cell mass) cell fate¹². In contrary, outer cells upregulate TFs such as CDX2 and EOMES to promote the formation of trophectoderm-destined cells^13,14. Regulation of trophectoderm targets by OCT4, SOX2 and NANOG in the pluripotent lineage¹⁵, accompanied by the autoregulatory properties of OCT4 and CDX2¹⁶, ensure the maintenance of lineage segregation. SALL4, which establishes and maintains ICM integrity by upregulating Oct4 and Nanog expression¹⁷; TEAD4, which acts upstream regulator of CDX2 during trophectoderm development¹⁸; and episomal expression of the early trophoblast TFs such as TCFAP2C, GATA3, ELF5 or downregulation of the pluripotency factor OCT4 can induce trophoblast cell fate determination in ES cells^19,20,21; are all important supplements to the regulatory networks of the first cell fate decision.

Generally, TFs bind to open chromatin of cis-regulatory regions, such as promoters, and enhancers, however, a subset called ‘pioneer factor’ are dominant in their ability to engage silent, unmarked chromatin and initiate the recruitment of other factors, thereby creating a permissive state for gene activation^22,23.

The development of a multicellular organism with its organs and tissues is a reproducible event that shows high coordination between the increase of cell mass and the diversification of cell population. These reproducibility programmes underly species cell-fate decision events^24,25, which are implemented by gene regulatory networks (GRNs), which are basic units of molecular activity that establish and maintain functional tissues by promoting sequential, irreversible gene expression that link genes to cell lineages^26,27. The regulation mechanisms of cell-fate decisions lack of systematic research during early embryonic development at the chromatin level, especially with regard to cross-species joint research. Here, we focused on the functions of TFs and GRNs associated with cell-fate decision events during early embryogenesis. We used publicly available ATAC-seq²⁸ datasets of six species, human, mouse, Drosophila, C. elegans, Arabidopsis, and yeast (we termed cell cycle stage as early embryogenesis, as lifespan of yeast is short and cell-fate decision events may occur as early as cell cycle) to question whether there are common traits of TFs during early embryonic development and to investigate underlying transcriptional regulatory mechanism of cell fate control. We found many TFs match their functions of various cell fate determination by investigation of the dynamic chromatin changes and gene expression patterns of various species during different development stages. Furthermore, we quantified TFs abundance from different cells during early embryonic development. Finally, we investigated the evolutionary mechanisms underlying cell-fate determination. After data mining, we focused on YOX1, a key cell-cycle regulator in yeast, combined with homology, transcriptomes, and regulatory networks, the conserved roles of the homologs were found to be crucial for cell fate determination.

Results and discussion

Dynamic chromatin changes over developmental stages

Epigenome mapping is a powerful method for cataloging functional elements throughout the genome²⁹, and it can provide insights into the regulatory mechanisms that underlie changes of cell fate³⁰. To investigate the mechanisms underlying cell fate determination, we applied ATAC-seq datasets and standard data analysis pipeline (Figure S1) of six species (A. thaliana, C. elegans, H. sapiens, M. musculus, D. melanogaster, S. cerevisiae) as they have emerged as most appreciated models for system biological research. The detailed information about the ATAC-seq samples that we used were listed in Table S1 and S2. Firstly, we checked quality of all the raw materials, and the results showed that the insert size distribution of each ATAC-seq library displays a stereotypical 150 bp periodicity that consistent with the expected nucleosome occupancy of chromatin. However, the nucleosome occupancy of Arabidopsis was not so obvious, as plants have mitochondrial and chloroplast genomes, which are completely accessible to Tn5, and likely depletes Tn5 activity from the nuclear genome³¹ (Figure S2). Then, we checked the number of reads mapped to each chromosome (Fig. 1D, Fig. S5). The result showed highly similar reads distribution pattern, indicating of high sample quality. We designed a stringent computational framework to integrate all the samples from different species with unified parameters, resulting in the identification of 25,000–65,000 high-confidence, accessible peaks for Arabidopsis, 30,000–54,000 for Drosophila, 28,000–1,250,000 for human, 8000–650,000 for mouse, 2000–3500 for yeast, and 15,000–28,000 for nematode (Fig. 1A, Fig. S3). Examination of peak signals versus uniquely mapped reads revealed that the signal enrichments consistently plateau at greater sequencing depths (Fig. 1B, Fig. S4).

To investigate dynamics chromatin changes over different developmental stages of each species, we used deepTools2 software³². Visualization of all the ATAC-seq datasets revealed that with the developmental stages proceeding, most peaks were in promoter-TSS (transcription start site) region (Fig. 1C,E, Figs. S6, S7) indicating these binding sites were predominantly located around TSSs indicating these regions are critical for TF binding and transcription regulating. Histone modifications, function as a prerequisite for dynamic chromatin state changes allow perpetual diversification of epigenome³³. We found that H3K4me3 and H3K27ac modifications were associated with relatively higher peak density, compared to H3K4me1 and H3K27me3 with low density (Fig. 1F, Fig. S8). Because previous studies have demonstrated that H3K4me3 and H3K27ac were commonly associated with the activation of transcription³⁴ and mark spot of active enhancers³⁵, respectively, however, H3K4me1 and H3K27me3 were associated with transcriptional silencing and downregulation of nearby genes^36,37.

Taken together, these findings showed comparable open chromatin landscapes in early embryos, as early embryo samples tend to enrich more accessible signals compared to mature tissue samples.

Chromatin accessibility extends the dictionary of cis-regulatory elements

In a comparison of open chromatin among epigenomes of human, mouse, Drosophila, worm, Arabidopsis, and yeast, we found the genomic distribution of THSs in each were highly similar, as majority of the peaks were enriched in promoter regions, except for human and mouse samples (Fig. 2A, Fig. S9). However, more than 90% of THSs lie outside of transcribed regions, and the majority of these THSs were found within 3 kb upstream of TSS in all species but for human, mouse, and fruit fly. The differences in reads distribution between advanced organisms and relatively lower livings may due to the fact that transcriptional regulatory elements (TREs) in plants and microbes are generally less numerous and closer to the genes they regulate than those of advanced genomes. For example, the median distance the enhancer and the TSSs of their target genes in fruit fly was reported to be 10 kb³⁸. It was also uncovered that in human T cells, 91% of the enhancers, with a median distance of 130 kb between promoters, far greater than the distance across plant and fungi genome³⁹. Interestingly, compared to embryonic stem cells, more peaks were enriched in the CD4⁺T cell promoter regions. Previous research has demonstrated that more promoters marked by H3K4me3 during early phase of CD4 T cell activation, which could enhance accessible chromatin status and reinforce activation-induced upregulation of gene expression⁴⁰. Overall, it is clear that in all species the majority of THSs are located in promoter regions, suggesting that accessible chromatin overlaps extensively with putative cis-regulatory sequences.

While assessment of open chromatin plays an important role in predicting regulatory element in the genome^29,41, it does not provide direct information of functional activity. To address this issue, we asked whether shared open chromatin regions exist within all the datasets in each species, and what the role of regulatory elements that bound to these shared regions played. The results showed that there were 1082 shared peaks for A. thaliana, 7519 for C. elegans, 7970 for D. melanogaster, 1995 for S. cerevisiae, 8036 for H. sapiens, and 4151 for M. musculus. Functional enrichment annotation of these overlapped peaks shows that they were almost enriched in transcription and regulation of transcription (Fig. 2B, Fig. S10), indicating that majority of TFs that bound to accessible regions are involved in basal transcriptional activity. Then, we evaluated the ATAC-seq enrichment profiles of all samples in each species, the results showed differential ATAC-seq signals between different tissues/strains or developmental stages, indicating tissue or developmental heterogeneity (Fig. 2C, Fig. S11), except for C. elegans, which showed comparable signals in different developmental stages. And the high correlation of ATAC-seq signal between each of the sample (Fig. 2C, Fig. S11) demonstrates the high reproducibility.

Collectively, these results suggest that TREs tend to be focused near the promoter rather than at more distal regions. The hypothesis implicit that open chromatin site near a TSS reflect TREs that regulate TSS rather than more distal regions, and that promoter-binding upstream elements contribute the majority of regulatory effects. And the TREs bound to promoters generally are in basal transcription and regulation of transcription function. Interestingly, these assumptions were previously validated in wet lab showing that an upstream fragment of several kilobases is capable of recapitulating native transcription patterns^42,43,44, which are consistent to our findings that upstream THSs are the most abundant category of accessible chromatin sites.

Taken together, these data indicated that THSs in animal and plant genomes showed largely comparable landscapes and cis-regulatory elements that bound to open chromatin regions mainly play roles in transcription and regulation of transcription.

Identification of cell-fate TFs during early embryogenesis

We applied HOMER findmotif to determine what TFs that bound to these open chromatin regions. And 400 for A. thaliana, 38 for C. elegans, 93 for D. melanogaster, 109 for S. cerevisiae, 414 for H. sapiens, and 398 for M. musculus were identified (Fig. 3A, Table S3). Gene family classifications showed that majority of the identified TFs were enriched in Homeobox and C2H2 zinc finger family (Fig. 3B, Fig. S12). Motif discovery indicated that, PIF4, PCF, BIM1, and JKD genes were highly enriched for root and seedlings in Arabidopsis, elf-1, hlh-30, dpl-1, eor-1, pha-4, and pqm-1 were highly enriched during larva development in C. elegans, zld, Dref, and Trl for D. melanogaster during nuclear cycle period, ABF1, REB1, AZF1, OPI1, and RSC3 for different strains of S. cerevisiae, CTCF, BORIS, SOX2, NFYA, SP1, OCT4, and NANOG for H. sapiens during embryonic stem cell development, and JunB, Batf, Nanog, and AP-1 for M. musculus during induced pluripotent stem cell development (Fig. 3C, Fig. S13). To investigate the functions of these TFs, we performed functional GO (gene ontology) analysis. The results showed that these TFs were almost involved in transcription and regulation of transcription. Interestingly, we also found that some TFs were TFs involved in cell fate decisions (cell fate commitment, and cell fate specification) (Fig. 3C, Fig. S13).

To investigate the expression patterns of these TFs across different tissues or strains, we collected expression profiles from public databases. For Arabidopsis, the TFs such as GATA1, TCP3, CDF3, PIF4, CCA1, LHY, SPL1, and MYB38 were highly expressed in different mature tissues, and ABI5, WUS, HB5, WIP5, and SHP1 were lowly expressed even unexpressed (Figure S13), as these genes are involved in nuclear cycle or early embryonic development^{8,45,46,47,48}. For C. elegans, all identified TFs were deemed to differentially expressed across multiple strains (Figure S13). For Drosophila, some TFs such as Kr, bcd, zen, cad, and twi hardly expressed in various mature tissues, as these TFs previously supposed to play major role in early embryonic development of Drosophila^{49,50,51,52,53}. For S. cerevisiae, it is obvious that the identified TFs were differentially expressed across all yeast strains and were higher in strain w303a than other strains (Figure S13). For H. sapiens, we found some TFs, PAX6, SOX2, POU3F1, HOXA10, CDX2, NANOG, TEAD4, and OCT4 were scarcely expressed across mature tissues, because they function in early stem cell development^54,55,56,57. For M. musculus, we also found some TFs, Cdx2, Oct4, Eomes, Esrrb, Gsc, and Nanog, were scarcely expressed in mature tissues, as these TFs constitute an important reservoir for early embryonic development^57,58,59.

Additionally, we found a set of TF complexes which were pertinent to cell proliferation (Oct4::Sox17, promotes cell development and differentiation⁶⁰, and OCT4-SOX2-TCF-NANOG, forms core regulatory circuitry of ES cells, critical for pluripotency and self-renewal⁶¹), cell differentiation (RAR/RXR, triggers pluripotent cell differentiation⁶², NF1::FOXA1, mediate gene expression and cell differentiation in prostate⁶³), tumorigenesis, and immunogenesis (Figure S14), demonstrating that even in early embryos, these TFs that involved in oncogenesis, and tumor suppress are also expressed to maintain normal cell divisions and differentiation of early embryo.

To further validate some key TFs that function in early embryos, we visualized the ATAC-seq signal enrichment near them across all samples in each species using IGV (Integrative Genomics Viewer). The results showed that for the developmentally regulated genes, such as GATA6, NANOG, SMAD4, and FOXA1 in human, were found elevated ATAC-seq enrichment at annotated or putative enhancers and promoters during embryonic development instead of in mature cells (Fig. 1G). For TFs such as, Oct4, Sox4, Eomes, and Gata4, we also observed increased signals during mouse embryonic development (Figure S15), which comparable to Su(H), zen, Abd-B, and twi during Drosophila nuclear cycle (Fig. S15). However, for WUS, ATML1, JKD, and KAN in Arabidopsis, they showed distinct signal intensity over different tissues or under different treatments (Figure S15). Nevertheless, FKH1, STE12, MSN2, and DIG1 in S. cerevisiae showed comparable signal intensity over different strains (Figure S15).

Overall, by integrating the information of cell-fate-determined TFs and the transcriptomes, we delineated that these cell-type-specific TFs showed high tissue or developmental heterogeneity.

Regulatory networks of cell-fate decision in early embryo

Cell fate decisions play a key role in crucial processes such as tissue repair, immune response, or embryonic development^64,65,66. Here, we identified numerous TFs that are involved in cell fate control (Fig. 4, Fig. S16). For each species, using public expression profiles of early embryonic development, we have not only verified high expression values of some previously widely accepted early embryonic TFs (Fig. 4A), but also found some cell-fate determining TFs that were highly expressed that previously unreported during early embryonic development (Fig. 4B, Fig. S16), indicating they may play roles in early embryos. However, we also found some TFs previously reported to play major roles during early embryogenesis in Drosophila, had a low expression pattern during early embryonic stage, such as pnr⁶⁷, vnd⁶⁸, and Ubx⁶⁹ (Figure S16). The expression profiles of some previously unreported TFs that function in early embryogenesis were also have high correlations with some early embryonic TFs, such as Jra, Blimp-1, hth, and Tk in Drosophila, NR2E1, EBF2, EPAS1, TP53, and CEBPB in human.

To comprehensively resolve the mystery of regulatory mechanisms of cell fate control during early embryogenesis, we combined cell-fate TFs of six species to construct TF regulatory network to predict the regulatory circuit based on their homology relationships (Figure S17). And, we investigated some homolog TFs in other five species for human TFs (Table S4). Surprisingly, these homolog TFs are also previously reported to be involved in cell fate control during early embryonic development.

For Arabidopsis, we analyzed several key TFs, which play key roles during root epidermis patterning, seeding, leaf, and QC (quiescent center) development in details (Fig. 5A). Four cell-fate-determining TFs, JKD, GL2, GL3, and EGL3, which are homolog to PRDM14, GSC, and MITF in human, respectively, are indispensable for controlling the patterns of epidermis in the Arabidopsis root meristem⁷⁰. HAT3, homologs to PAX6 and NANOG, combined with HAT2, BZR1, and BIM1 to promote seedling development^70,71,72. We assume that BIM1 may play a role as a signal integrator to integrate signals from HAT2, HAT3, and BZR1 to promote seedling development (Fig. 5A). Another cluster of TFs, KAN, PHB, PHV, and BIM1, in which PHB and PHV are homolog to ISL1, contribute to promote Arabidopsis leaf development and leaf adaxial polarity^73,74. We hypothesize that BIM1 may function as downstream target genes of PHV to regulate leaf development (Fig. 5A). The last cluster TFs that we found have homologs of human early embryonic TFs are HDG11, KAN, WUS, PLT1, and WIP4, in which HDG11 and WUS are homolog to POU5F1 and ASCL1, respectively. Previous studies have demonstrated that WUS, PLT1, and WIP4 all contribute to the cell-fate determination of QC^47,75,76. Therefore, we conject that HDG11 and KAN may function as upstream target genes of WUS to form the HDG11-KAN-WUS-PLT1-WIP4 complex to control the cell-fate determination of QC. The expression value of all cell-fate-determined TFs mentioned above all keep high levels during Arabidopsis early embryonic development (Fig. 5B), indicating they may play major roles in Arabidopsis early embryos.

For C. elegans, we identified several early embryonic TFs, hlh-2, pha-4, elt-1, hlh-1, and pal-1, in which the former three are homolog to TCF3, FOXA1/2, and GATA2/3/6, respectively (Fig. 5C). Previous demonstrated elt-1 and pal-1 are critical for the specification of epidermal cell fates^77,78, furthermore, in our study, we presumed that hlh-2 and hlh-1 may act as upstream target TFs of pal-1, and pha-4 functions as binding protein of elt-1, and these five TFs function together to control epidermal cell fate. The expression patterns of these five TFs showed highest levels at 16-cell stage (Fig. 5D), indicating 16-cell stage may be a critical timepoint for epidermal cell fate determination.

As the propagating method of S. cerevisiae is budding reproduction without embryo development, we analogously regarded the cell cycle period as embryonic development stage. We identified several TFs, RIM101, FKH1, FKH2, MSN2, MSN4, ABF2, DAL80, and CBF1, which homolog to FOXA1, FOXA2, KLF4, PRDM14, SOX2, SOX17, GATA2/3/6, and MITF, respectively (Fig. 5E). Previous studies showed that these TFs were all involved in stress responses^79,80,81. So, we proposed a regulatory circuit that regulate the progressive process of stress responses based on their interaction relationships (Fig. 5E). And, we noticed these TFs kept high expression values during the full stages of cell cycle (Fig. 5F), indicating that yeast is susceptible to external or internal damages, TFs that regulate the defense systems need to be constantly functioning.

For D. melanogaster, we identified several clusters of TFs that involved in eye, gland, and nerve system cell fate determination (Fig. 5G). Firstly, a cluster of TFs, zen, tll, toy, pnr, Mad, Med, and lz, which homolog to NANOG, NR2F2, PAX6, GATA2/3/6, SMAD2, and SMAD4, respectively, were previously reported to be involved in pattern formation of eye cell fate^82,83,84, we proposed the model for eye cell fate decisions, lz-Med-Mad-pnr-toy-tll-zen, in which lz bound to Med, and Med bound to Mad, to promote the expression of Mad, as a research have shown that lz encodes a TF involved in prepatterning photoreceptor precursors in the Drosophila eye⁸⁵. Then, prd, fkh, pnr, and pan, in which the latter three are homolog to FOXA1/2, GATA2/3/6, TCF7L2, and SOX13, respectively, were demonstrated to be indispensable for gland cell fate determination^{86,87,88,89,90}, we proposed a regulatory model, pan-pnr-fkh-prd, in which pnr served as a binding protein, which bound to fkh, and prd may function as terminal target gene. Overall, these TFs function together to promote gland cell fate determination and cell development. Thirdly, su(Hw), ac, Su(H), Kr, tll, D, pnr, pnt, and gcm, homolog to PRDM14, ASCL1, RBPJ, KLF4, NR2F2, and SOX1/2, were reported to constitute important reservoirs for nervous system cell fate decisions^91,92,93. Based on the interactions of these TFs, we proposed a regulatory circuit, which is required for the cell fate control and development of nervous system (Fig. 5G). Expression profiles of these TFs showed that they started to play functions from nuclear cycle stage 14, regardless of some had high expression values from the very beginning of the nuclear cycle (Fig. 5H).

We have identified 24 human homologue TFs in mouse (Fig. 5I). Similar to humans, these TFs also function to determine cell fate or promote cell development during early embryonic developmental stage. Furthermore, the interactions of TFs were almost the same, and the expression patterns of these TFs were slightly different (Fig. 5J).

Taking together, by integrating the information about cis-regulatory elements and the transcriptomes, we scratched the surface of cell-fate-determining regulatory networks during early embryonic development that is orchestrated by a set of TFs and their targets.

Evolution of cell fate decision in early embryos

To further investigate the evolution characteristics of cell fate decisions, we focused on YOX1, a key G1/S transition regulator in yeast. We discovered a cluster of homeobox TFs in human, mouse, fruit fly, nematode, and Arabidopsis that orthologous to YOX1. The orthologous TFs in former four species are involved in neuron cell fate determination, while in Arabidopsis promote epidermis cell fate decision (Fig. 6A). Expression profiles of these TFs showed relatively high expression values during early stage of embryonic development, indicating cell fate decision events may occur during early embryogenesis. While distinct expression patterns across species during embryonic development might be a cue for the differences in the determination of different cell fates (Fig. 6B). Phylogenetic analysis of protein sequence of these homologous TFs indicated that TFs in yeast and Arabidopsis are more ancient in evolution, compared to that of advanced organisms (human, mouse, fruit fly, and worm) (Fig. 6C). Because yeast and Arabidopsis have experienced an ancient whole-genome duplication event⁹⁴. We further investigated whether consensus sequence of these TFs shared. And the result showed a motif in homeobox domain from residues 20 to 59 was conserved, indicating the critical functions in cell fate decision events (Fig. 6D).

Then, we constructed an integrated network to investigate the transcriptional regulatory functions of above TFs in cell fate decision events (Fig. 7). YOX1, a TF expressed in mid-G1 through early S phage, interplays with S-specific TF—YHP1, function as transcriptional repressor to negatively regulate MCM1-FKH2-NDD1-mediated G2/M-G1 transition during cell cycle progression⁹⁵. Recently, a research has reported that ROX1 is in promotion of RAP1-HAP1-MSN4 module, which is an important branch for G2/M to G1 phase transition in yeast⁹⁶. By homology analysis, we found YOX1 was homologous to three TFs (Pou5f1, Nanog, and Pax6) in mouse, which were previously reported to be involved in restriction of a cluster of neuron identity maintainers (Sox1, Sox2, Sox17, and Tcf7l2), which were homologous to ROX1^97,98. These maintainers, in turn, suppress the expression of neural differentiation effectors, including Irx1, Irx2, Zic1, and Zic2^99,100,101. Nevertheless, the ortholog of YOX1 in worm, unc-86 were involved in activation of and interplay with several TFs, including vab-3, ttx-3, and mec-3, to define neuron identity^102,103. For fruit fly, YOX1 was orthologous to vnd, which were reported to interplay with ind, D (Dichaete), and msh to regulate neuroblast cell fate¹⁰⁴. There are two models were proposed to regulate neuroblast cell fate determination, achaete–scute complex¹⁰⁵ and ‘neuroblast clock’¹⁰⁶, in which former one acts as proneural cluster and was activated by vnd to promote neuroblast formation¹⁰⁷. We hypothesized that interactions of vnd, ind, D, and msh may also positive regulate neuroblast clock model in the manner as achaete–scute complex. Meanwhile, homologs of YOX1 in Arabidopsis, GL2 and HDG11, interact with each other, play an intermediate role of a positive feedback loop to promote epidermis cell fate determination¹⁰⁸. GL2 was positively regulated by upstream complex, called WER-GL3/EGL3-TTG transcriptional complex^109,110, in turn, leading to the activation of downstream target gene MYB23¹⁰⁹. Then, MYB23 interact with WER-GL3/EGL3-TTG complex to form a positively regulatory loop¹¹⁰.

Above all, different (epidermis) or similar (neuron) cell fate decision events among different species not only depend on sequence characteristics and expression patterns of TFs but also the roles they played in regulatory networks. And conserved motifs may contribute to their conserved functions in different species.

Methods

ATAC-seq data acquisition

The raw ATAC-seq datasets were collected from NCBI Gene Expression Omnibus (GEO). We selected data from Arabidopsis thaliana, Drosophila melanogaster, Homo sapiens, Mus musculus, Saccharomyces cerevisiae, and Caenorhabditis elegans, as these species are well-annotated models for transcriptional regulation researches. For Arabidopsis samples, ATAC-seq data were collected from GSE101940¹¹¹, GSE89346¹¹², GSE101482¹¹³, and GSE85203³¹. For Drosophila samples, ATAC-seq data were collected from GSE104957¹¹⁴ and GSE83851¹¹⁵. For human samples, ATAC-seq data were collected from GSE101571¹¹⁶, GSE85332¹¹⁷, and GSE47753²⁸. For mouse samples, ATAC-seq data were collected from GSE110264¹¹⁸, GSE79230¹¹⁹, GSE82010¹²⁰, and GSE67298¹²¹. For yeast samples, ATAC-seq data were collected from GSE111815¹²², GSE101290¹²³, and GSE66386¹²⁴. For worm samples, ATAC-seq data were collected from GSE114439¹²⁵ and GSE98758¹²⁶.

ATAC-seq data analysis

The raw ATAC-seq datasets from six difference species (human, mouse, A. thaliana, fruitfly, C. elegans, and yeast) were trimmed via trim-galore (http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/), with parameters − q 20 − phred33 − nextera − length 20 − e 0.1 − stringency 3. Then the clean reads were quality-controlled by FastQC (v0.11.7, https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) and MultiQC v 1.5¹²⁷. After quality control, the clean reads from all samples were mapped to corresponding reference genome (hg38, mm10, TAIR10, BDGP6, IRGSP-1.0, and R64-1-1). For paired-end reads longer than 50 bp, bowtie2 v 2.3.4.2¹²⁸ was applied with parameter—very-sensitive—maxins 2000, conversely, single-end reads were set to—very-sensitive. Paired-ends short or equal to 50 bp, we adopt bowtie v 1.2.2¹²⁹ with − X 2000 − m 1 parameters to allow up to 2 kb fragments to align and only uniquely mapped reads to retain²⁸. All unmapped, MAPQ < 30, and PCR duplicates were removed (samtools − F 0 × 4; samtools view q 30; sambamba markdup). Bam files were then converted to bed files and shifted using a custom shell script to reflect a 4 bp increase on the plus strand and a 5 bp decrease on the minus strand as recommended by²⁸. Replicate samples were merged by samtools merge¹³⁰.

Genomic tracks generation

For normalization and visualization, the sorted, filtered and merged .bam files from each sample were converted to bigwig format using bamCoverage utility in deepTools v3.3.0³² with parameters –binSize 1 –ignoreDuplicates –skipNonCoveredRegions –normalizeUsing RPKM. The normalized ATAC-seq signal for a scaled region representing each of the genes in our gene subsets plus/minus 2 kb were compiled and plotted using the computeMatrix and plotHeatmap programs from deepTools package. All genomic track visualization was performed using Integrative Genomics Viewer (IGV) v2.4.16¹³¹.

Peak calling

Peak calling on ATAC-seq data was performed using MACS2¹³² callpeak with parameters -g (tair10: 1.1e8, dm6: 1.4e8, hg38: 2.8e9, IRGSP-1.0: 3.7e8, mm10: 2.5e9, sacCer3: 1.2e7) − q 0.05 − extsize 200 − nomodel − shift − 100 − nolambda − keep-dup all. These parameters set a smoothing window of 200 bp between peaks before they are merged into a single peak and allow identification of variable length peaks, respectively.

Expression and correlation of overlapped accessible regions

After peak calling, we summarized the peaks called from each species by Intervene¹³³. We counted the number of reads that were enriched in overlapped peak regions by using featureCounts¹³⁴. Peak counts were normalized to log₁₀(FPKM + 1). Heatmaps of the expression of overlapped peaks were plotted to show differentially expressed peaks in all samples of each species. The count matrix of all the ATAC samples in six species was used to calculate and visualize the Spearman correlation for every sample pair by corrplot¹³⁵ package in R.

Peak distribution and functional enrichment annotation

We randomly selected 10,000 peaks and histone modification sites (H3K4me1, H3K4me2, H3K4me3, H3K27ac, H3K27me3, H3K36me3) in all samples of each species to show the distribution patterns of peaks in chromosomes by RIdeogram¹³⁶. The UCSC genomic annotation was used to associate peaks with different genomic regions. Then we called the annotatePeak function from the R/Bioconductor ChIPseeker¹³⁷ package for genomic annotation. Promoters were considered to be ± 3 kb from TSS and all the regions that did not fall within exons, introns, UTRs or promoters were classified as distal intergenic regions. The annotated peaks from ChIPseeker above were functionally enriched by compareCluster function from clusterProfiler¹³⁸ package with default parameters.

Transcript factor motif discovery and gene ontology

The peaks generated from ATAC-seq datasets were used for de novo motif analysis using HOMER v4.10¹³⁹ against the JASPAR, DMMPMM, Yeast, AthaMap, and Homer databases with parameters − size 400 − len 8,10,12. De novo motifs were retained if the p value < 0.01 and (< percent of target >/< percent of background >) > 1.0. Gene Ontology enrichment for these motifs/transcription factors was performed using Metascape¹⁴⁰. Those GO terms had a false discovery rate (FDR) of 0.05 or less were considered significant.

Transcriptional regulatory network construction

To explore the transcriptional regulatory basis of six species, we used BLAST to find ortholog genes of human TFs that play cell-fate-choice function, after which we used Cytoscape to construct a comprehensive network of six species. The regulatory relationships of different TFs were predicted based on STRING and TF2Network databases.

Conclusion

Study of the cell-fate decision across multiple species is still a long way to go, and epigenomic research seems to contribute to some extent. The findings in this study proposed possible molecules for further research of cell-fate determination. We speculate that both the TFs and motifs identified in the integration analysis of this study can be further investigated. Furthermore, the findings presented herein can be correlated with single-cell strategies, such as single-cell RNA-seq and single-cell ATAC-seq in order to uncover the mysterious veil of the evolutionary basis of cell-fate decision events.

References

Hamatani, T. et al. Dynamics of global gene expression changes during mouse preimplantation development. Dev. Cell 6, 117–131 (2004).
Article CAS PubMed Google Scholar
Wang, Q. T. et al. A genome-wide study of gene activity reveals developmental signaling pathways in the preimplantation mouse embryo. Dev. Cell 6, 133–144 (2004).
Article CAS PubMed Google Scholar
Zernicka-Goetz, M., Morris, S. A. & Bruce, A. W. Making a firm decision: Multifaceted regulation of cell fate in the early mouse embryo. Nat. Rev. Genet. 10, 467–477 (2009).
Article CAS PubMed Google Scholar
Sulston, J. E., Schierenberg, E., White, J. G. & Thomson, J. N. The embryonic cell lineage of the nematode Caenorhabditis elegans. Dev. Biol. 100, 64–119 (1983).
Article CAS PubMed Google Scholar
Tintori, S. C., Osborne Nishimura, E., Golden, P., Lieb, J. D. & Goldstein, B. A transcriptional lineage of the early C. elegans embryo. Dev. Cell 38, 430–440 (2016).
Article CAS PubMed PubMed Central Google Scholar
Vergassola, M., Deneke, V. E. & Di Talia, S. Mitotic waves in the early embryogenesis of Drosophila: Bistability traded for speed. Proc. Natl. Acad. Sci USA 115, E2165–E2174 (2008).
CAS Google Scholar
ten Hove, C. A., Lu, K. J. & Weijers, D. Building a plant: Cell fate specification in the early Arabidopsis embryo. Development 142, 420–430 (2015).
Article PubMed CAS Google Scholar
Boscá, S., Knauer, S. & Laux, T. Embryonic development in Arabidopsis thaliana: From the zygote division to the shoot meristem. Front. Plant Sci. 2, 93 (2011).
Article PubMed PubMed Central CAS Google Scholar
Stadhouders, R., Filion, G. J. & Graf, T. Transcription factors and 3D genome conformation in cell-fate decision. Nature 569, 345–354 (2019).
Article CAS PubMed ADS Google Scholar
Rhee, C., Kim, J. & Tucker, H. O. Transcriptional regulation of the first cell fate decision. J. Dev. Biol. Regen. Med. 1, 102 (2017).
PubMed PubMed Central Google Scholar
Boyer, L. A. et al. Core transcriptional regulatory circuitry in human embryonic stem cells. Cell 122, 947–956 (2005).
Article CAS PubMed PubMed Central Google Scholar
Tsankov, A. M. et al. Transcription factor binding dynamics during human ES cell differentiation. Nature 518, 344–349 (2015).
Article CAS PubMed PubMed Central ADS Google Scholar
Strumpf, D. et al. Cdx2 is required for correct cell fate specification and differentiation of trophectoderm in the mouse blastocyst. Development 132, 2093–2102 (2005).
Article CAS PubMed Google Scholar
Russ, A. P. et al. Eomesodermin is required for mouse trophoblast development and mesoderm formation. Nature 404, 95–99 (2000).
Article CAS PubMed ADS Google Scholar
Loh, Y. H. et al. The Oct4 and Nanog transcription network regulates pluripotency in mouse embryonic stem cells. Nat. Genet. 38, 431–440 (2006).
Article CAS PubMed Google Scholar
Chew, J. L. et al. Reciprocal transcriptional regulation of Pou5f1 and Sox2 via the Oct4/Sox2 complex in embryonic stem cells. Mol. Cell Biol. 25, 6031–6046 (2005).
Article CAS PubMed PubMed Central Google Scholar
Zhang, J. et al. Sall4 modulates embryonic stem cell pluripotency and early embryonic development by the transcriptional regulation of Pou5f1. Nat. Cell Biol. 8, 1114–1123 (2006).
Article CAS PubMed Google Scholar
Nishioka, N. et al. Tead4 is required for specification of trophectoderm in pre-implantation mouse embryos. Mech. Dev. 125, 270–283 (2008).
Article CAS PubMed Google Scholar
Kuckenberg, P. et al. The transcription factor TCFAP2C/AP-2γ cooperates with CDX2 to maintain trophectoderm formation. Mol. Cell Biol. 30, 3310–3320 (2010).
Article CAS PubMed PubMed Central Google Scholar
Ralston, A. et al. Gata3 regulates trophoblast development downstream of Tead4 in parallel to Cdx2. Development 137, 395–403 (2010).
Article CAS PubMed Google Scholar
Ng, R. K. et al. Epigenetic restriction of embryonic cell lineage fate by methylation of Elf5. Nat. Cell Biol. 10, 1280–1290 (2008).
Article CAS PubMed PubMed Central Google Scholar
Zaret, K. S. & Carroll, J. S. Pioneer transcription factors: Establishing competence for gene expression. Genes Dev. 25, 2227–2241 (2011).
Article CAS PubMed PubMed Central Google Scholar
Iwafuchi-Doi, M. & Zaret, K. S. Cell fate control by pioneer transcription factors. Development 143, 1833–1837 (2016).
Article CAS PubMed PubMed Central Google Scholar
Sulston, J. E. & Horvitz, H. R. Post-embryonic cell lineages of the nematode, Caenorhabditis elegans. Dev. Biol. 56, 110–156 (1977).
Article CAS PubMed Google Scholar
Volpato, V. et al. Reproducibility of molecular phenotypes after long-term differentiation to human iPSC-derived neurons: A multi-site omics study. Stem Cell Rep. 11, 897–911 (2018).
Article CAS Google Scholar
Levine, M. & Davidson, E. H. Gene regulatory networks for development. Proc. Natl. Acad. Sci. USA 102, 4936–4942 (2005).
Article CAS PubMed ADS PubMed Central Google Scholar
Uzkudun, M., Marcon, L. & Sharpe, J. Data-driven modelling of a gene regulatory network for cell fate decisions in the growing limb bud. Mol. Syst. Biol. 11, 815 (2015).
Article PubMed PubMed Central CAS Google Scholar
Buenrostro, J. D., Giresi, P. G., Zaba, L. C., Chang, H. Y. & Greenleaf, W. J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods 10, 1213–1218 (2013).
Article CAS PubMed PubMed Central Google Scholar
ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Article ADS CAS Google Scholar
Rivera, C. M. & Ren, B. Mapping human epigenomes. Cell 155, 39–55 (2013).
Article CAS PubMed Google Scholar
Lu, Z., Hofmeister, B. T., Vollmers, C., DuBois, R. M. & Schmitz, R. J. Combining ATAC-seq with nuclei sorting for discovery of cis-regulatory regions in plant genomes. Nucleic Acids Res. 45, e41 (2017).
Article CAS PubMed Google Scholar
Ramirez, F. et al. deepTools2: A next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 44, W160–W165 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bannister, A. J. & Kouzarides, T. Regulation of chromatin by histone modifications. Cell Res. 21, 381–395 (2011).
Article CAS PubMed PubMed Central Google Scholar
Chen, K. et al. Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor-suppressor genes. Nat. Genet. 47, 1149–1157 (2015).
Article CAS PubMed PubMed Central Google Scholar
Creyghton, M. P. et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc. Natl. Acad. Sci. USA 107, 21931–21936 (2010).
Article CAS PubMed ADS PubMed Central Google Scholar
Cheng, J. et al. A role for H3K4 monomethylation in gene repression and partitioning of chromatin readers. Mol. Cell 53, 979–992 (2014).
Article CAS PubMed PubMed Central Google Scholar
Reddington, J. P. et al. Redistribution of H3K27me3 upon DNA hypomethylation results in de-repression of Polycomb target genes. Genome Biol. 14, R25 (2013).
Article PubMed PubMed Central CAS Google Scholar
Kvon, E. Z. et al. Genome-scale functional characterization of Drosophila developmental enhancers in vivo. Nature 512, 91–95 (2014).
Article CAS PubMed ADS Google Scholar
Mumbach, M. R. et al. Enhancer connectome in primary human cells identifies target genes of disease-associated DNA elements. Nat. Genet. 49, 1602–1612 (2017).
Article CAS PubMed PubMed Central Google Scholar
LaMere, S. A., Thompson, R. C., Komori, H. K., Mark, A. & Salomon, D. R. Promoter H3K4 methylation dynamically reinforces activation-induced pathways in human CD4 T cells. Genes Immun. 17, 283–297 (2016).
Article CAS PubMed PubMed Central Google Scholar
Yue, F. et al. A comparative encyclopedia of DNA elements in the mouse genome. Nature 515, 355–364 (2014).
Article CAS PubMed PubMed Central Google Scholar
Cooper, S. J., Trinklein, N. D., Anton, E. D., Nguyen, L. & Myers, R. M. Comprehensive analysis of transcriptional promoter structures and function in 1% of the human genome. Genome Res. 16, 1–10 (2005).
Article PubMed CAS Google Scholar
Andersen, P. K., Lykke-Andersen, S. & Jensen, T. H. Promoter-proximal polyadenylation sites reduce transcription activity. Genes Dev. 26, 2169–2179 (2012).
Article CAS PubMed PubMed Central Google Scholar
Mikhaylichenko, O. et al. The degree of enhancer or promoter activity is reflected by the levels and directionality of eRNA transcription. Genes Dev. 32, 42–57 (2018).
Article CAS PubMed PubMed Central Google Scholar
Lopez-Molina, L., Mongrand, S., McLachlin, D. T., Chait, B. T. & Chua, N. H. ABI5 acts downstream of ABI3 to execute an ABA-dependent growth arrest during germination. Plant J. 32, 317–328 (2002).
Article CAS PubMed Google Scholar
De Smet, I. et al. Transcriptional repression of BODENLOS by HD-ZIP transcription factor HB5 in Arabidopsis thaliana. J. Exp. Bot. 64, 3009–3019 (2013).
Article PubMed PubMed Central CAS Google Scholar
Crawford, B. C. et al. Plant development. Genetic control of distal stem cell fate within root and embryonic meristems. Science 347, 655–659 (2015).
Article MathSciNet CAS PubMed ADS Google Scholar
Ehlers, K. et al. The MADS box genes ABS, SHP1, and SHP2 are essential for the coordination of cell divisions in ovule and seed coat development and for endosperm formation in Arabidopsis thaliana. PLoS One 11, e0165075 (2016).
Article PubMed PubMed Central CAS Google Scholar
Knipple, D. C., Seifert, E., Rosenberg, U. B., Preiss, A. & Jäckle, H. Spatial and temporal patterns of Krüppel gene expression in early Drosophila embryos. Nature 317, 40–44 (1985).
Article CAS PubMed ADS Google Scholar
Ali-Murthy, Z. & Kornberg, T. B. Bicoid gradient formation and function in the Drosophila pre-syncytial blastoderm. Elife 5, e13222 (2016).
Article PubMed PubMed Central Google Scholar
Rushlow, C., Colosimo, P. F., Lin, M., Xu, M. & Kirov, N. Transcriptional regulation of the Drosophila gene zen by competing Smad and Brinker inputs. Genes Dev. 15, 340–351 (2001).
Article CAS PubMed PubMed Central Google Scholar
Kimelman, D. & Martin, B. L. Anterior-posterior patterning in early development: Three strategies. Wiley Interdiscip. Rev. Dev. Biol. 1(2), 253–266 (2012).
Article CAS PubMed Google Scholar
Ip, Y. T., Park, R. E., Kosman, D., Yazdanbakhsh, K. & Levine, M. dorsal-twist interactions establish snail expression in the presumptive mesoderm of the Drosophila embryo. Genes Dev. 6, 1518–1530 (1992).
Article CAS PubMed Google Scholar
Terzić, J. & Saraga-Babić, M. Expression pattern of PAX3 and PAX6 genes during human embryogenesis. Int. J. Dev. Biol. 43, 501–508 (1999).
PubMed Google Scholar
Wu, X. et al. The POU Domain transcription factor POU3F1 is an important intrinsic regulator of GDNF-induced survival and self-renewal of mouse spermatogonial stem cells. Biol. Reprod. 82, 1103–1111 (2010).
Article CAS PubMed PubMed Central Google Scholar
Kumar, R. P. et al. Regulation of energy metabolism during early mammalian development: TEAD4 controls mitochondrial transcription. Development 145, dev162644 (2018).
Article PubMed PubMed Central CAS Google Scholar
Göke, J. et al. Combinatorial binding in human and mouse embryonic stem cells identifies conserved enhancers active in early embryonic development. PLoS Comput. Biol. 7, e1002304 (2011).
Article PubMed PubMed Central CAS Google Scholar
Toyooka, Y., Oka, S. & Fujimori, T. Early preimplantation cells expressing Cdx2 exhibit plasticity of specification to TE and ICM lineages through positional changes. Dev. Biol. 411, 50–60 (2016).
Article CAS PubMed Google Scholar
Arnold, S. J., Hofmann, U. K., Bikoff, E. K. & Robertson, E. J. Pivotal roles for eomesodermin during axis formation, epithelium-to-mesenchyme transition and endoderm specification in the mouse. Development 135, 501–511 (2008).
Article CAS PubMed Google Scholar
Ying, L., Mills, J. A., French, D. L. & Gadue, P. OCT4 coordinates with WNT signaling to pre-pattern chromatin at the SOX17 locus during human ES cell differentiation into definitive endoderm. Stem Cell Rep. 5, 490–498 (2015).
Article CAS Google Scholar
Cole, M. F., Johnstone, S. E., Newman, J. J., Kagey, M. H. & Young, R. A. Tcf3 is an integral component of the core regulatory circuitry of embryonic stem cells. Gens Dev. 22, 746–755 (2008).
Article CAS Google Scholar
Chatagnon, A. et al. RAR/RXR binding dynamics distinguish pluripotency from differentiation associated cis-regulatory elements. Nucleic Acids Res. 43, 4833–4854 (2015).
Article CAS PubMed PubMed Central Google Scholar
Grabowska, M. M. et al. NF1 transcription factors interact with FOXA1 to regulate prostate-specific gene expression. Mol. Endocrinol. 28, 949–964 (2014).
Article PubMed PubMed Central CAS Google Scholar
de Lázaro, I. & Kostarelos, K. Engineering cell fate for tissue regeneration by in vivo transdifferentiation. Stem Cell Rev. Rep. 12, 129–139 (2016).
Article PubMed CAS Google Scholar
Petrocca, F. & Lieberman, J. Micromanagers of immune cell fate and function. Adv. Immunol. 102, 227–244 (2009).
Article CAS PubMed Google Scholar
Takaoka, K. & Hamada, H. Cell fate decisions and axis determination in the early mouse embryo. Development 139, 3–14 (2012).
Article CAS PubMed Google Scholar
Sandler, J. E. & Stathopoulos, A. Stepwise progression of embryonic patterning. Trends Genet. 32, 432–443 (2016).
Article CAS PubMed PubMed Central Google Scholar
Reeves, G. T. & Stathopoulos, A. Graded dorsal and differential gene regulation in the Drosophila embryo. Cold Spring Harb. Perspect. Biol. 1, a000836 (2009).
Article PubMed PubMed Central CAS Google Scholar
Qian, S., Capovilla, M. & Pirrotta, V. The bx region enhancer, a distant cis-control element of the Drosophila Ubx gene and its regulation by hunchback and other segmentation genes. EMBO J. 10, 1415–1425 (1991).
Article CAS PubMed PubMed Central Google Scholar
Hassan, H., Scheres, B. & Blilou, I. JACKDAW controls epidermal patterning in the Arabidopsis root meristem through a non-cell-autonomous mechanism. Development 137, 1523–1529 (2010).
Article CAS PubMed Google Scholar
Liu, K. et al. ERF72 interacts with ARF6 and BZR1 to regulate hypocotyl elongation in Arabidopsis. J. Exp. Bot. 69, 3933–3947 (2018).
Article CAS PubMed PubMed Central Google Scholar
Zhang, D. et al. Transcription factors involved in brassinosteroid repressed gene expression and their regulation by BIN2 kinase. Plant Signal Behav. 9, e27849 (2014).
Article CAS PubMed PubMed Central Google Scholar
Prigge, M. J. et al. Class III homeodomain-leucine zipper gene family members have overlapping, antagonistic, and distinct roles in Arabidopsis development. Plant Cell 17, 61–76 (2005).
Article CAS PubMed PubMed Central Google Scholar
Liang, T. et al. UVR8 interacts with BES1 and BIM1 to regulate transcription and photomorphogenesis in Arabidopsis. Dev. Cell 44, 512–523 (2018).
Article CAS PubMed Google Scholar
Kamiya, N., Nagasaki, H., Morikami, A., Sato, Y. & Matsuoka, M. Isolation and characterization of a rice WUSCHEL-type homeobox gene that is specifically expressed in the central of cells of a quiescent center in the root apical meristem. Plant J. 35, 429–441 (2003).
Article CAS PubMed Google Scholar
Rovere, F. D., Fattorini, L., Ronzan, M., Falasca, G. & Altamura, M. M. The quiescent center and the stem cell niche in the adventitious roots of Arabidopsis thaliana. Plant Signal Behav. 11, e1176660 (2016).
Article PubMed PubMed Central CAS Google Scholar
Page, B. D., Zhang, W., Steward, K., Blumenthal, T. & Priess, J. R. ELT-1, a GATA-like transcription factor, is required for epidermal cell fates in Caenorhabditis elegans embryos. Genes Dev. 11, 1651–1661 (1997).
Article CAS PubMed Google Scholar
Baugh, L. R. et al. The homeodomain protein PAL-1 specifies a lineage-specific regulatory network in the C. elegans embryo. Development 132, 1843–1854 (2005).
Article CAS PubMed Google Scholar
Serra-Cardona, A., Canadell, D. & Ariño, J. Coordinate responses to alkaline pH stress in budding yeast. Microb. Cell 2, 182–196 (2015).
Article CAS PubMed PubMed Central Google Scholar
Postnikoff, S. D., Malo, M. E., Wong, B. & Harkness, T. A. The yeast forkhead transcription factors fkh1 and fkh2 regulate lifespan and stress response together with the anaphase-promoting complex. PLoS Genet. 8, e1002583 (2012).
Article CAS PubMed PubMed Central Google Scholar
Sadeh, A., Movshovich, N., Volokh, M., Gheber, L. & Aharoni, A. Fine-tuning of the Msn2/4-mediated yeast stress responses as revealed by systematic deletion of Msn2/4 partners. Mol. Biol. Cell 22, 3127–3138 (2011).
Article CAS PubMed PubMed Central Google Scholar
Friedrich, M. Evolution of insect eye development: First insights from fruit fly, grasshopper and flour beetle. Integr. Comp. Biol. 43, 508–521 (2003).
Article PubMed Google Scholar
Yeung, K. et al. Integrative genomic analysis reveals novel regulatory mechanisms of eyeless during Drosophila eye development. Nucleic Acids Res. 46, 11743–11758 (2018).
Article CAS PubMed PubMed Central Google Scholar
Roignant, J. Y. & Treisman, J. E. Pattern formation in the Drosophila eye disc. Int. J. Dev. Biol. 53, 795–804 (2009).
Article CAS PubMed PubMed Central Google Scholar
Daga, A., Karlovich, C. A., Dumstrei, K. & Banerjee, U. Patterning of cells in the Drosophila eye Lozenge, which shares syntenic domains with AML1. Genes Dev. 10, 1194–1205 (1996).
Article CAS PubMed Google Scholar
Xue, L. & Noll, M. Dual role of the Pax gene paired in accessory gland development of Drosophila. Development 129, 339–346 (2002).
Article CAS PubMed Google Scholar
Myat, M. M., Isaac, D. D. & Andrew, D. J. Early genes required for salivary gland fate determination and morphogenesis in Drosophila melanogaster. Adv. Dent. Res. 14, 89–98 (2000).
Article CAS PubMed Google Scholar
Abrams, E. W. & Andrew, D. J. CrebA regulates secretory activity in the Drosophila salivary gland and epidermis. Development 132, 2743–2758 (2005).
Article CAS PubMed Google Scholar
Minakhina, S., Tan, W. & Steward, R. JAK/STAT and the GATA factor Pannier control hemocyte maturation and differentiation in Drosophila. Dev. Biol. 352, 308–316 (2011).
Article CAS PubMed PubMed Central Google Scholar
Zhang, C. U., Blauwkamp, T. A., Burby, P. E. & Cadigan, K. M. Wnt-mediated repression via bipartite DNA recognition by TCF in the Drosophila hematopoietic system. PLoS Genet. 10, e1004509 (2014).
Article PubMed PubMed Central CAS Google Scholar
Harris, R. M., Pfeiffer, B. D., Rubin, G. M. & Truman, H. W. Neuron hemilineages provide the functional ground plan for the Drosophila ventral nervous system. Elife 4, e04493 (2015).
Article PubMed Central Google Scholar
Romani, S. et al. Krüppel, a Drosophila segmentation gene, participates in the specification of neurons and glial cells. Mech. Dev. 60, 95–107 (1996).
Article CAS PubMed Google Scholar
Soustelle, L. & Giangrande, A. Novel gcm-dependent lineages in the postembryonic nervous system of Drosophila melanogaster. Dev. Dyn. 236, 2101–2108 (2007).
Article CAS PubMed Google Scholar
Crow, K. D. & Wagner, G. P. What is the role of genome duplication in the evolution of complexity and diversity?. Mol. Biol. Evo. 23, 887–892 (2006).
Article CAS Google Scholar
Cho, C. Y., Kelliher, C. M. & Haase, S. B. The cell-cycle transcriptional network generates and transmits a pulse of transcription once each cell cycle. Cell Cycle 18, 363–378 (2019).
Article CAS PubMed PubMed Central Google Scholar
Panchy, N. L., Lloyd, J. P. & Shiu, S. H. Improved recovery of cell-cycle gene expression in Saccharomyces cerevisiae from regulatory interactions in multiple omics data. BMC Genom. 21, 159 (2020).
Article CAS Google Scholar
Wang, Z., Oron, E., Nelson, B., Razis, S. & Ivanova, N. Distinct lineage specification roles for NANOG, OCT4, and SOX2 in human embryonic stem cells. Cell Stem Cell 10, 440–454 (2012).
Article CAS PubMed Google Scholar
Lee, H. K., Lee, H. S. & Moody, S. A. Neural transcription factors: From embryos to neural stem cells. Mol. Cells 37, 705–712 (2014).
Article PubMed PubMed Central CAS Google Scholar
Becker, M. B., Zulch, A., Bosse, A. & Gruss, P. Irx1 and Irx2 expression in early lung development. Mech. Dev. 106, 155–158 (2001).
Article CAS PubMed Google Scholar
McMahon, A. R. & Merzdorf, C. S. Expression of the zic1, zic2, zic3, and zic4 genes in early chick embryos. BMC Res. Notes 3, 167 (2010).
Article PubMed PubMed Central CAS Google Scholar
Luo, Z. J. et al. Zic2 is an enhancer-binding factor required for embryonic stem cell specification. Mol. Cell 57, 685–694 (2016).
Article CAS Google Scholar
Baum, P. D., Guenther, C., Frank, C. A., Pham, B. V. & Garriga, G. The Caenorhabditis elegans gene ham-2 links Hox patterning to migration of the HSN motor neuron. Genes Dev. 13, 472–483 (1999).
Article CAS PubMed PubMed Central Google Scholar
Altun-Gultekin, Z. et al. A regulatory cascade of three homeobox genes, ceh-10, ttx-3 and ceh-23, controls cell fate specification of a defined interneuron class in C. elegans. Development 128, 1951–1969 (2001).
Article CAS PubMed Google Scholar
Seibert, J. & Urbach, R. Role of en and novel interactions between msh, ind, and vnd in dorsoventral patterning of the Drosophila brain and ventral nerve cord. Dev. Biol. 346, 332–345 (2010).
Article CAS PubMed Google Scholar
Skeath, J. B. & Carroll, S. B. The achaete–scute complex: Generation of cellular pattern and fate within the Drosophila nervous system. FASEB J. 8, 714–721 (1994).
Article CAS PubMed Google Scholar
Homem, C. C. & Knoblich, J. A. Drosophila neuroblasts: A model for stem cell biology. Development 139, 4297–4310 (2012).
Article CAS PubMed Google Scholar
Skeath, J. B., Panganiban, G. F. & Carroll, S. B. The ventral nervous system defective gene controls proneural gene expression at two distinct steps during neuroblast formation in Drosophila. Development 120, 1517–1524 (1994).
Article CAS PubMed Google Scholar
Khosla, A. et al. HD-Zip proteins GL2 and HDG11 have redundant functions in Arabidopsis trichomes, and gl2 activates a positive feedback loop via MYB23. Plant Cell 26, 2184–2200 (2014).
Article CAS PubMed PubMed Central Google Scholar
Schiefelbein, J., Huang, L. & Zheng, X. H. Regulation of epidermal cell fate in Arabidopsis roots: The importance of multiple feedback loops. Front. Plant Sci. 5, 47 (2014).
Article PubMed PubMed Central Google Scholar
Bruex, A. et al. A gene regulatory network for root epidermis cell differentiation in Arabidopsis. PLoS Genet. 8, e1002446 (2012).
Article CAS PubMed PubMed Central Google Scholar
Sijacic, P., Bajic, M., McKinney, E. C., Meagher, R. B. & Deal, R. B. Changes in chromatin accessibility between Arabidopsis stem cells and mesophyll cells illuminate cell type-specific transcription factor network. Plant J. 94, 215–231 (2018).
Article CAS PubMed PubMed Central Google Scholar
Jégu, T. et al. The Arabidopsis SWI/SNF protein BAF60 mediates seedling growth control by modulating DNA accessibility. Genome Biol. 18, 114 (2017).
Article PubMed PubMed Central CAS Google Scholar
Maher, K. A. et al. Profiling of accessible chromatin regions across multiple plant species and cell types reveals common gene regulatory principles and new control modules. Plant Cell 30, 15–36 (2018).
Article CAS PubMed Google Scholar
Haines, J. & Eisen, M. B. Patterns of chromatin accessibility along the anterior–posterior axis in the early Drosophila embryo. PLoS Genet. 14, e1007367 (2018).
Article PubMed PubMed Central CAS Google Scholar
Blythe, S. A. & Wieschaus, E. F. Establishment and maintenance of heritable chromatin structure during early Drosophila embryogenesis. Elife 5, e20148 (2016).
Article PubMed PubMed Central Google Scholar
Wu, J. et al. Chromatin analysis in human early development reveals epigenetic transition during ZGA. Nature 557, 256–260 (2018).
Article CAS PubMed ADS Google Scholar
Liu, Q. et al. Genome-wide temporal profiling of transcriptome and open chromatin of early cardiomyocyte differentiation derived from hiPSCs and hESCs. Circ. Res. 121, 376–391 (2017).
Article CAS PubMed PubMed Central Google Scholar
Cao, S. et al. Chromatin accessibility dynamics during chemical induction of pluripotency. Cell Stem Cell 22, 529–542 (2018).
Article CAS PubMed Google Scholar
Jung, Y. H. et al. Chromatin states in mouse sperm correlate with embryonic and adult regulatory landscape. Cell Rep. 18, 1366–1382 (2017).
Article CAS PubMed PubMed Central Google Scholar
Su, Y. et al. Neuronal activity modifies the chromatin accessibility landscape in the adult brain. Nat. Neurosci. 20, 476–483 (2017).
Article CAS PubMed PubMed Central Google Scholar
Maza, I. et al. Transient acquisition of pluripotency during somatic cell transdifferentiation with iPSC reprogramming factors. Nat. Biotechnol. 33, 769–774 (2015).
Article CAS PubMed PubMed Central Google Scholar
Dronamraju, R. et al. Spt6 association with RNA polymerase II directs mRNA turnover during transcription. Mol. Cell 70, 1054–1066 (2018).
Article CAS PubMed PubMed Central Google Scholar
Gowans, G. J. et al. INO80 chromatin remodeling coordinates metabolic homeostasis with cell division. Cell Rep. 22, 611–623 (2018).
Article CAS PubMed PubMed Central Google Scholar
Schep, A. N. et al. Structured nucleosome fingerprints enable high-resolution mapping of chromatin architecture within regulatory regions. Genome Res. 25, 1757–1770 (2015).
Article CAS PubMed PubMed Central Google Scholar
Jänes, J. et al. Chromatin accessibility dynamics across C. elegans development and aging. Elife 7, e37344 (2018).
Article PubMed PubMed Central Google Scholar
Kolundzic, E. et al. FACT sets a barrier for cell fate reprogramming in Caenorhabditis elegans and human cells. Dev. Cell 46, 611–626 (2018).
Article CAS PubMed PubMed Central Google Scholar
Ewels, P., Magnusson, M., Lundin, S. & Käller, M. MultiQC: Summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32, 3047–3048 (2016).
Article CAS PubMed PubMed Central Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Article CAS PubMed PubMed Central Google Scholar
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).
Article PubMed PubMed Central CAS Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central CAS Google Scholar
Thorvaldsdottir, H., Robinson, J. T. & Mesirov, J. P. Integrative genomics viewer (IGV): High-performance genomics data visualization and exploration. Brief Bioinform. 14, 178–192 (2013).
Article CAS PubMed Google Scholar
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Article PubMed PubMed Central CAS Google Scholar
Khan, A. & Mathelier, A. Intervene: A tool for intersection and visualization of multiple gene or genomic region sets. BMC Bioinform. 18, 287 (2017).
Article CAS Google Scholar
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: An efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Article CAS PubMed Google Scholar
Wei, T. Y., & Simko, V. R. package "corrplot": Visualization of a Correlation Matrix (Version 0.84). https://github.com/taiyun/corrplot (2017).
Hao, Z. D. et al. RIdeogram: Drawing SVG graphics to visualize and map genome-wide data on the idiograms. PeerJ Comput. Sci. 6, e251 (2020).
Article PubMed Central PubMed Google Scholar
Yu, G., Wang, L. G. & He, Q. Y. ChIPseeker: An R/bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics 31, 2382–2383 (2015).
Article CAS PubMed Google Scholar
Yu, G., Wang, L. G., Han, Y. & He, Q. Y. clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS 16, 284–287 (2012).
Article CAS PubMed PubMed Central Google Scholar
Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576–589 (2010).
Article CAS PubMed PubMed Central Google Scholar
Zhou, Y. et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat. Commun. 10, 1523 (2019).
Article PubMed PubMed Central ADS CAS Google Scholar

Download references

Acknowledgements

The research was financially supported by the National Natural Science Foundation of China (31971672, 31470682 and 31670682), the Natural Science Foundation of Zhejiang Province (LY18C150002).

Author information

Authors and Affiliations

State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Lin’an, Hangzhou, 311300, People’s Republic of China
Tongqiang Fan & Youjun Huang

Authors

Tongqiang Fan
View author publications
You can also search for this author in PubMed Google Scholar
Youjun Huang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.F. carried out the data analysis and drafted the manuscript. Y.H. participated in the design of the study and coordination and helped to draft the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Youjun Huang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fan, T., Huang, Y. Accessible chromatin reveals regulatory mechanisms underlying cell fate decisions during early embryogenesis. Sci Rep 11, 7896 (2021). https://doi.org/10.1038/s41598-021-86919-3

Download citation

Received: 17 August 2020
Accepted: 22 March 2021
Published: 12 April 2021
DOI: https://doi.org/10.1038/s41598-021-86919-3

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Single-nucleus chromatin landscapes during zebrafish early embryogenesis

An ATAC-seq atlas of chromatin accessibility in mouse tissues

An atlas of dynamic chromatin landscapes in mouse fetal development

Introduction

Results and discussion

Dynamic chromatin changes over developmental stages

Chromatin accessibility extends the dictionary of cis-regulatory elements

Identification of cell-fate TFs during early embryogenesis

Regulatory networks of cell-fate decision in early embryo

Evolution of cell fate decision in early embryos

Methods

ATAC-seq data acquisition

ATAC-seq data analysis

Genomic tracks generation

Peak calling

Expression and correlation of overlapped accessible regions

Peak distribution and functional enrichment annotation

Transcript factor motif discovery and gene ontology

Transcriptional regulatory network construction

Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links