Abstract
In contrast to rodents, the mechanisms underlying human trophectoderm and early placenta specification are understudied due to ethical barriers and the scarcity of embryos. Recent reports have shown that human pluripotent stem cells (PSCs) can differentiate into trophectoderm (TE)-like cells (TELCs) and trophoblast stem cells (TSCs), offering a valuable in vitro model to study early placenta specification. Here, we demonstrate that the VGLL1 (vestigial-like family member 1), which is highly expressed during human and non-human primate TE specification in vivo but is negligibly expressed in mouse, is a critical regulator of cell fate determination and self-renewal in human TELCs and TSCs derived from naïve PSCs. Mechanistically, VGLL1 partners with the transcription factor TEAD4 (TEA domain transcription factor 4) to regulate chromatin accessibility at target gene loci through histone acetylation and acts in cooperation with GATA3 and TFAP2C. Our work is relevant to understand primate early embryogenesis and how it differs from other mammalian species.
Similar content being viewed by others
Introduction
The placenta is the largest fetal supporting organ. It originates from the trophectoderm (TE) and constitutes, together with the pluripotent inner cell mass (ICM), the first lineage bifurcation event during development1. The ICM gives rise to the hypoblast and the epiblast, which will form the yolk sac and the three embryonic germ layers, respectively. The TE gives rise to the cytotrophoblast and ultimately the placenta. The placenta mediates exchanges between the maternal uterus and the fetus by supplying nutrients, producing hormones, eliminating waste products, and enabling gas diffusion via maternal blood circulation. In line with these functions, abnormalities in placental development are one of the key reasons for early pregnancy loss and are also responsible for many pregnancy-related diseases including preeclampsia2,3,4. Due to ethical considerations and difficulties in obtaining study material, the genetic regulatory mechanisms underlying the formation of human TE and early cytotrophoblast remain unclear.
In contrast to human, TE development has been relatively well explored in the mouse. TEAD4 (TEA domain transcription factor 4) has been identified as a key upstream transcription factor involved in TE formation. TE-specific genes, such as Cdx2, Gata3, and Fgfr2 fail to upregulate in Tead4 knockout mice resulting in pre-implantation embryonic lethality5,6,7. TEAD4 activity is tightly controlled by the Hippo pathway8,9,10. At the onset of mouse blastocyst formation, Hippo signaling is inactive in the outer embryonic layer of cells that will form the TE, and so its downstream target protein YAP (encoded by Yap1) is in an unphosphorylated form. Unphosphorylated YAP localizes in the nucleus and partners with TEAD4 to induce TE specifiers including Cdx2 and Gata311. In the ICM, the Hippo pathway is activated and leads to YAP phosphorylation, promoting cytoplasmic retention and proteasomal degradation. The YAP-related protein WWTR1 (also called as TAZ, encoded by Wwtr1) plays a similar functional role to YAP in the early mouse embryo12. Intriguingly, recent studies have revealed critical differences between mouse and human in the regulation of the Hippo pathway in early development13. For example, YAP is localized in the nucleus of both human TE and ICM cells, suggesting that additional or alternative regulatory mechanisms are involved in human TE formation. Moreover, YAP activity promotes human naïve pluripotent stem cell (PSC) generation and self-renewal14.
To explore early extraembryonic lineage specification, in vitro models have been developed. The derivation from early placenta samples of human trophoblast stem cells (TSCs), which can be used to model the villous cytotrophoblast of the human first-trimester placenta, provides a valuable tool for studying placenta formation15. This TSC derivation medium can also be applied to induce TSCs from human naïve PSCs, which mimic pre-implantation ICM cells16,17,18,19,20,21,22,23. Primed human PSCs, which mimic post-implantation epiblast cells, can acquire cytotrophoblast-like features using either BMP4 alone or in combination with small molecules (e.g., A83-01 and PD173074)24,25. However, the resulting cells only capture some cytotrophoblast characteristics and instead more closely resemble the amnion26,27. Primed human PSCs can also differentiate into functional TSCs with modified culture media22,28,29, but it is unclear whether this involves the same path as naïve PSCs. Recent reports have used these models to gain insight into early human placenta specification. For example, primate-specific miRNA on chromosome 19 (C19MC) and the transcription factors TEAD1 and TEAD4 are essential for producing human TSCs30,31,32. Mechanisms related to the onset of TE specification and most aspects of human TSC induction and maintenance remain in general poorly understood.
In this study, we used human naïve PSCs generated with our recently developed 4CL culture medium33 to produce TE-like cells (TELCs) and TSCs. We identified the transcriptional co-factor VGLL1 (vestigial-like family member 1) as a key regulator for the conversion of TELCs and maintenance of TSCs. VGLL1 has been previously reported to be highly expressed in the human placenta and is involved in preeclampsia34,35. Moreover, it can form a complex with TEAD4 in cancer cells36,37,38. We showed that in TELCs and TSCs, VGLL1 works through the interaction with TEAD4 by inducing histone H3 lysine 27 acetylation (H3K27ac) at target loci including cell cycle genes and TE/TSC-related genes. Besides, we found that VGLL1 and TEAD4 cooperate with GATA3 and TFAP2C. Importantly, VGLL1 is highly expressed in primate TE but negligibly in mouse, highlighting a potential interspecies difference in early placenta specification. We have thus uncovered a key regulatory mechanism of human early placenta formation whose further characterization may facilitate a deeper understanding of both normal and abnormal developmental processes.
Results
The transcriptional co-factor VGLL1 is highly expressed during primate TE induction
We produced TELCs from human naïve H9 embryonic stem cells (ESCs) generated with our newly established 4CL medium33 using a previously reported TELC conversion protocol26 (Fig. 1a). Flattened epithelial-like cells with a TE-like morphology were observed at day 5 (Fig. 1b). Downregulation of classical (e.g., NANOG and OCT4) and naïve (e.g., KLF17 and DPPA3) pluripotency genes, expression of naïve/TE shared genes such as TEAD4 and TFAP2C, and upregulation of TE-enriched genes like GATA3 and ENPEP were validated by immunofluorescence microscopy and/or bulk RNA-sequencing (RNA-seq) (Fig. 1c, d and Supplementary Fig. 1a). To assess the fidelity with which TELCs derived from 4CL ESCs reflect preimplantation embryo TE cells, we performed droplet-based single-cell RNA sequencing (scRNA-seq)39 at day 5 of TELC induction (TELC-D5) from 4CL ESCs and integrated them with previously reported scRNA-seq datasets of 4CL naïve ESCs33, human preimplantation40,41 and postimplantation embryo cells42. Uniform manifold approximation and projection (UMAP) representation showed that TELC-D5 cells clustered closely with human preimplantation embryo TE cells and their gene expression pattern was comparable (Fig. 1e, f). Pearson correlation analysis among the subclusters within TELC-D5 cells showed that they are highly correlated (correlation score > 0.9) except for two subclusters (cluster 7 and 8, 2.08 % of total TELC-D5 cells); cluster 7 also expressed SCT related genes (Supplementary Fig. 1b-d). These results suggest that 4CL ESC-derived TELC-D5 cells are relatively homogenous and resemble the human preimplantation embryo TE. We then determined the differentially expressed genes (DEGs) between naïve ESCs and TELC-D5 cells, and found VGLL1 as the most significantly upregulated gene in TELCs (Fig. 1g). In contrast, YAP1 and TEAD4 were only moderately upregulated in TELC-D5 cells. We confirmed the exclusivity of VGLL1 expression in TELCs by immunofluorescence microscopy, real-time quantitative PCR (RT-qPCR) and Western blotting (Fig. 1c, h, i). Western blotting also validated the downregulation of the pluripotency genes NANOG, OCT4 and SOX2 (Fig. 1i). Likewise, it confirmed that YAP and TEAD4 are slightly upregulated in TELC-D5 cells compared with naïve ESCs.
The high levels of VGLL1 in TELCs derived from naive ESCs prompted us to study its expression kinetics during early in vivo development. The human early blastocyst corresponding to embryonic day 5 (E5) is comprised of cells belonging to the earliest stage of TE and ICM lineage. We investigated the identities and gene expression patterns of human E5 cells from a previously reported scRNA-seq dataset40. UMAP segregated the TE and ICM cells in addition to pre-lineage cells in E5 cells (Fig. 2a). Classical and naive pluripotency genes were highly expressed in pre-lineage and ICM cells but downregulated in TE cells (Supplementary Fig. 1e). TE-related transcription factors like GATA3, GATA2 and CDX2 were upregulated in TE cells compared with pre-lineage or ICM cells (Fig. 2b, Supplementary Fig. 1e). VGLL1 was highly and specifically expressed in TE cells. This is relevant because context-dependent highly expressed genes often function as key regulators43. Unlike mouse, genes like ELF5 and EOMES were only marginally expressed at this stage (Supplementary Fig. 1e), which is consistent with previous report44. To better understand the dynamic changes of gene expression during the human TE and ICM lineage bifurcation, we performed pseudotime analysis of the scRNA-seq data using Monocle 245,46,47 (Fig. 2c). As expected, pluripotency genes such as OCT4 and SOX2 increased in the pre-lineage to ICM branch and were downregulated in the pre-lineage to TE branch (Supplementary Fig. 1f). In contrast, naive/TE shared genes like TEAD4, YAP1 and TFAP2C remained at similar expression levels along the pseudotime or were moderately upregulated, whereas TE-enriched genes like GATA3, GATA2 and CDX2 were progressively upregulated (Fig. 2d, Supplementary Fig. 1f). Further supporting a role in TE specification, VGLL1 showed a dramatic upregulation at the onset of the TE branch (Fig. 2d). The quick induction and highly selective expression of VGLL1 in TE were further confirmed by using another human early embryo scRNA-seq dataset41 (Supplementary Fig. 1g-j).
In addition, we investigated the cell types and gene expression pattern of early Macaca fascicularis monkey embryo using a reported scRNA-seq dataset48 and found that, similar to human, VGLL1 is highly expressed in a TE-specific manner (Fig. 2e, f). Pseudotime analysis also showed that VGLL1 is strongly induced since the beginning of TE lineage specification (Fig. 2g, h). In contrast, scRNA-seq of early mouse embryo development49 showed that Vgll1 is marginally expressed and minimally upregulated at the onset of mouse TE induction in the 16-cell stage (Fig. 2i-l), which is consistent with previous report that VGLL1 is not involved in mouse trophoblast specification34. These results indicate that VGLL1 may function as a TE induction regulator in primates.
VGLL1 regulates TELC generation from human naive PSCs
To explore whether VGLL1 has a functional role in human TE specification, we depleted it using short-hairpin RNAs (shRNAs; shVGLL1−2 and −4) during the 4CL naive PSC to TELC induction. Knockdown efficiency was confirmed by RT-qPCR and Western blotting (Fig. 3a, b). Notably, we observed that self-renewal (cell proliferation) during the conversion was severely compromised in shVGLL1 transduced cells compared to the shLuc control, and this was also accompanied by a lack of TE-like morphology (Fig. 3c, d). We then performed bulk RNA-seq for shLuc and shVGLL1 transduced TELC-D5 cells. Consistent with the reduced proliferation and lack of TE-like morphology, cell-cycle/self-renewal genes such as MKI67, CDK1/3/7 and CDC42, and TE genes like GATA3, ENPEP, NR2F2 and TEAD1, were substantially downregulated in shVGLL1 transduced cells compared to control shLuc (Fig. 3e). There was relatively less downregulation of pluripotency genes in shVGLL1 cells compared to shLuc control (Fig. 3e). These results were validated by RT-qPCR (Fig. 3f). In agreement, gene ontology (GO) analysis for the downregulated genes with shVGLL1 showed enrichment of terms related to cell cycle DNA replication, cell cycle checkpoint, in utero embryonic development and placenta development (Fig. 3g). Likewise, upregulated genes in shVGLL1 transduced cells compared to control shLuc were related to development and cell fate commitment (Fig. 3h). To further validate the function of VGLL1 in TELC induction, we generated VGLL1 knockout (KO) clones (C45 and C68) using CRISPR-Cas9 technology with H9 ESCs (Fig. 3i, j). Consistent with the shRNA-mediated VGLL1 knockdown, both VGLL1 KO clones, compared to wild-type (WT) clones, showed reduced TE gene expression determined by RNA-seq as well as compromised cell proliferation confirmed by phase contrast imaging and cell counting (Fig. 3k-m). The downregulated genes in VGLL1 KO cells were associated with in utero embryonic development, placenta development and positive regulation of cell cycle determined by GO enrichment analysis (Fig. 3n).
Trophoblast lineage differentiation potential may vary with cell lines and starting naive PSC states17. We thus performed VGLL1 knockdown in another human induced pluripotent stem cell (iPSC) line, UH1050,51, and tested other well-established human naive PSC culture conditions: PXGL and HENSM52,53. Consistently, VGLL1 knockdown during TELC induction resulted in reduced TE gene expression and compromised cell proliferation in UH10 iPSCs in 4CL medium or H9 ESCs in PXGL and HENSM media (Supplementary Fig. 2a-i). These findings demonstrate that VGLL1 is required for the induction of human TELCs regardless of the cell line or starting naive PSC medium.
To exclude the possibility that the effect of VGLL1 depletion on TE gene expression is due to cell cycle arrest, we generated two effective shRNA lentiviruses targeting TP53 (shTP53−1 and shTP53−2) and then transduced them into WT and VGLL1 KO (C45 and C68) H9 ESCs, along with shLuc as control (Supplementary Fig. 2j). In all settings, we observed a 50-70% increase of cell number in TP53 knockdown cells relative to shLuc control cells, as expected (Supplementary Fig. 2k). Despite the increased cell number, we did not observe upregulation of TE genes like GATA2, GATA3, TFAP2C and YAP1 in TP53 knockdown cells compared with shLuc control cells, providing evidence that the regulatory function of VGLL1 on TE gene expression is independent of cell cycle arrest (Supplementary Fig. 2l).
Therefore, we have demonstrated that VGLL1 is a critical regulator of the human naive PSC to TELC transition, highlighting a major difference with mouse.
VGLL1 works in tandem with TEAD4 to regulate human TELC specification
To dissect the function of VGLL1, we performed Cleavage Under Targets and Tagmentation (CUT&Tag)54 for VGLL1 in TELC-D5 cells differentiated from 4CL H9 ESCs to determine its genomic occupancy. A total of 54,773 binding loci were captured (Supplementary Data 1). Peak distribution analysis revealed that VGLL1 primarily binds to promoter, distal intergenic and intronic regions (Fig. 4a). This occupancy pattern suggested that VGLL1 regulates target genes by controlling both promoter and enhancer activity. Consistently, cross-comparison with our bulk RNA-seq results showed that 78.2% of downregulated genes in shVGLL1 compared with shLuc are bound by VGLL1 (Fig. 4b), supporting a direct regulatory effect of VGLL1 on these genes. Genes bound by VGLL1 were enriched for GO terms such as placenta development, metaphase/anaphase transition of mitotic cell cycle, and cilium assembly (Fig. 4c), among others. This shows that VGLL1 regulates genes involved in multiple aspects of TE function.
VGLL1 is a transcriptional co-factor55 and is not predicted to bind DNA directly. Therefore, we performed motif discovery analysis to uncover its potential transcription factor partners. The DNA-binding motif for TEAD4, a well-known transcription factor necessary for TE specification in mouse5,7, was highly enriched in VGLL1-bound sites (Fig. 4d). We then asked whether VGLL1 physically interacts with TEAD4. We first did a reciprocal co-immunoprecipitation in two cell lines (H9 ESCs and UH10 iPSCs) at day 5 of TELC induction from 4CL PSCs and found that VGLL1 and TEAD4 strongly interact with each other (Fig. 4e and Supplementary Fig. 3a). Moreover, in line with previous reports32, there was reduced cell proliferation and compromised TELC identity when we knocked down or knocked out TEAD4 during the TELC conversion, irrespective of the cell lines or naive culture conditions (Supplementary Fig. 3b–r). We also confirmed that the effect of TEAD4 suppression on TE gene expression is not due to cell cycle arrest using 4CL H9 ESCs (Supplementary Fig. 4a–c). These results demonstrate that VGLL1 forms a complex with TEAD4 to regulate human TELC induction.
TEAD4 controls mouse and human trophoblast lineage specification by interacting with nuclear YAP and WWTR1 to regulate target gene expression11,56. To explore the potential functional relationship of different co-factors with TEAD4, we examined the interaction of TEAD4 with YAP and WWTR1 through co-immunoprecipitation in 4CL H9 ESCs and UH10 iPSCs. We observed that TEAD4 also interacts with them, but to a much lesser extent than with VGLL1 (Fig. 4e, Supplementary Fig. 3a). Notably, VGLL1 did not interact with YAP or WWTR1. Consistently, the knockdown of both YAP1 or WWTR1 hindered TELC induction (Supplementary Fig. 4d–k), indicating that TEAD4 recruits all three co-factors to regulate human TELC induction.
To further illuminate how VGLL1 works in tandem with TEAD4 along with other co-factors like YAP, we performed CUT&Tag for TEAD4 and YAP in TELC-D5 cells differentiated from 4CL H9 ESCs. Genome-wide binding analysis revealed that regions bound by TEAD4 and YAP were also largely located at the promoter, distal intergenic and intronic regions (Fig. 4f, g). Co-occupancy analysis between VGLL1, YAP and TEAD4 showed that VGLL1 and TEAD4 share a large number (43,048) of binding sites, among which 57 % were also co-bound by YAP (24,626) (Fig. 4h). This data correlates with the co-immunoprecipitation results indicating that TEAD4 interacts more strongly with VGLL1 than with YAP (Fig. 4e, Supplementary Fig. 3a). Genes co-bound by VGLL1, TEAD4 and YAP included cell cycle/self-renewal genes and TE-related genes (Fig. 4h). We then performed GO analysis for genes co-bound by all these three factors, and those bound by either two of them in TELC-D5 cells. VGLL1-TEAD4 only or VGLL1-TEAD4-YAP co-bound genes were enriched in terms related to placenta development, cell cycle regulation and, interestingly, histone acetylation, while YAP-TEAD4 only co-bound genes were enriched for other biological processes such as protein polyubiquitination and positive regulation of autophagy (Fig. 4i). The distribution of these three factors across the genome indicates that VGLL1 and YAP have both shared and unique functions. We also tested the subcellular localization of VGLL1, TEAD4 and YAP in TELC-D5 cells derived from 4CL H9 ESCs. Subcellular fractionation followed by Western blotting showed that VGLL1 and TEAD4 mainly localized in the nucleus (Supplementary Fig. 4l), while YAP was more restricted to the cytoplasm. The distinct subcellular distribution further supports that TEAD4 interact less with YAP compared to VGLL1.
These results demonstrate that VGLL1 functions as a strong co-factor for TEAD4 in addition to YAP and WWTR1, potentially adding more flexibility to safeguard TE development in human.
VGLL1 controls human TELC induction by promoting chromatin remodeling at target loci
We next aimed to elucidate the underlying mechanism by which VGLL1 and TEAD4 regulate the expression of TE genes. Gene expression is highly dependent on the enrichment of active histone marks at target loci and in recent years histone acetylation has emerged as a major regulator of cell (including PSC) fate transitions57. We thus tested whether VGLL1 knockdown affects histone acetylation in TELCs derived from 4CL H9 ESCs and UH10 iPSCs. Western blotting showed that total histone H3 acetylation (H3ac) and histone H4 acetylation (H4ac) levels were decreased upon VGLL1 depletion in TELC-D5 cells compared to the shLuc control (Fig. 5a and Supplementary Fig. 5a). Importantly, histone H3 lysine 27 acetylation (H3K27ac), which often marks promoters and enhancers of active genes, was also reduced upon VGLL1 knockdown. This finding suggested that VGLL1 is involved in H3K27ac deposition. Inspection of the sites co-occupied by VGLL1 and TEAD4 and H3K27ac showed a high degree of overlap, with 73.3% percent of VGLL1-TEAD4 co-bound sites displaying enriched H3K27ac (Fig. 5b). These loci included genes related to self-renewal and TE specification/function genes. Enriched GO terms corresponding to these shared loci belonged to categories related to placenta development and cell cycle (Fig. 5c).
To better understand the relationship between TEAD4 and VGLL1 co-binding with H3K27ac enrichment, we also performed CUT&Tag for TEAD4 and H3K27ac in naïie PSCs. We first compared the TEAD4 binding pattern between 4CL H9 ESCs and TELC-D5 cells and identified three categories (naïve-specific-, shared- and TE-specific loci) of TEAD4-binding peaks (Fig. 5d). Naïve ESC-specific TEAD4-bound genes included pluripotency transcription factors such as OCT4, SOX2, KLF17 and DPPA5, consistent with the observation that TEAD4 buffers pluripotency genes in human PSCs58. Shared TEAD4-bound genes included many cell-cycle regulators and naïve/TE-shared genes such as CDK1, MKI67, TEAD4 and TFAP2C. As expected, the largest number of genes corresponded to TE-specific TEAD4-bound genes, which included many TE-enriched genes like VGLL1, GATA3, CDX2 and NR2F2. Next, we performed a signal density pile-up analysis for H3K27ac (naïve ESCs and TELC-D5 cells) and VGLL1 (TELC-D5 cells) occupancy based on the categories of the identified TEAD4-binding peaks (Fig. 5e). Notably, we observed that H3K27ac levels in naïve-specific TEAD4-bound genes were high in naïve ESCs but low in TELC-D5 cells. In contrast, H3K27ac levels in shared and TE-specific TEAD4-bound genes were increased in TELC-D5 cells, being more obviously in the latter case.
Because high H3K27ac levels influence target gene expression at least in part by increasing local chromatin accessibility59, we also did an assay for transposase-accessible chromatin using sequencing (ATAC-seq) for TELC-D5 cells differentiated from 4CL H9 ESCs. This showed that 95.27% of the loci co-bound by VGLL1-TEAD4 and enriched with H3K27ac display accessible chromatin (Fig. 5f, g). The regions with overlapping peaks included genes involved in placenta development and cell cycle regulation (Fig. 5h, i and Supplementary Fig. 5b, c). On the contrary, open chromatin regions only bound by VGLL1 or TEAD4 were mostly related with other lineages (Supplementary Fig. 5d-f). Of note, VGLL1 and TEAD4 were also bound by themselves and enriched for H3K27ac, indicating a positive feedback regulation (Fig. 5i and Supplementary Fig. 5c). To provide more evidence of the functional cooperation between VGLL1 and TEAD4 and the role of VGLL1 in histone acetylation, we measured TEAD4 binding and H3K27ac enrichment using CUT&Tag and chromatin accessibility using ATAC-seq in VGLL1 KO cells at day 5 of TELC induction from 4CL H9 ESCs. In VGLL1 KO cells, the average genome binding intensity of TEAD4 and the enrichment of H3K27ac at VGLL1-binding sites declined compared to WT cells (Fig. 5j, k). Along with the decrease of H3K27ac marks, the chromatin accessibility at these regions was also reduced in VGLL1 KO cells (Fig. 5l).
Overall, we have demonstrated that TEAD4 becomes redistributed in TELCs compared to naïve PSCs and highlighted a link between VGLL1 recruitment by TEAD4 at target loci and chromatin opening through increased histone acetylation.
VGLL1 is necessary to sustain the expandable human TSC identity
Upon implantation, the TE develops into the mononucleated cytotrophoblast. Cytotrophoblast cells self-renew while simultaneously produce the multinucleated syncytiotrophoblast (SCT) cells and the highly invasive extravillous trophoblast (EVT) cells, both of which are critical for forming the placenta. To further enhance our understanding about the role of VGLL1 in early human placenta development, we tested whether it also functions in maintaining TSC identity. We performed TSC derivation from 4CL H9 ESCs using a previously described medium15. TELC-D5 cells were cultured in this medium for another 5-7 days to convert them into stable and expandable TSCs, which expressed VGLL1, TEAD4, YAP and showed loss of pluripotency genes like NANOG, OCT4 and SOX2 at the protein level (Fig. 1a, Fig. 1i and Supplementary Fig. 6a). Consistent with the findings in TELCs, VGLL1 was among the highest expressed genes in TSCs compared to naïve ESCs, as assessed by bulk RNA-seq and validated with RT-qPCR (Supplementary Fig. 6b-d). Likewise, VGLL1 knockdown compromised TSC self-renewal as detected by a change in cell morphology and reduced proliferation compared to the shLuc control (Fig. 6a, b and Supplementary Fig. 6e, f). Bulk RNA-seq analysis also showed that cell cycle- and TSC-related (e.g., GATA3, CAPN6, GCM1) genes were downregulated in VGLL1 depleted cells (Fig. 6c). GO analysis for downregulated genes in VGLL1 knockdown cells included biological processes such as cell cycle regulation and placenta development (Fig. 6d). The need for VGLL1 was further validated by performing shRNA knockdown in UH10 iPSCs and embryo blastocyst-derived TSCs (TSCBT)15 (Supplementary Fig. 6g-l). To assess the functional role of VGLL1 during SCT and EVT differentiation, expandable TSCs derived from TELCs differentiated from 4CL H9 ESCs were transduced with two effective shRNAs for VGLL1 along with shLuc as control. Subsequently, SCT and EVT differentiation were carried out at the time of shRNA transduction as previously described60. VGLL1 knockdown resulted in reduced cell proliferation, widespread cell death and failure to differentiate into SCT or EVT cells, while the control (shLuc) cells successfully differentiated and they expressed specific markers, SDC1 for SCT and HLA-G for EVT (Supplementary Fig. 6m, n). This demonstrates the importance of VGLL1 in maintaining the functional potential of TSCs.
To study the regulatory mechanism of VGLL1 in maintaining TSC identity, we first confirmed the interaction between VGLL1 and TEAD4 in TSCs derived from H9 ESCs and UH10 iPSCs by performing reciprocal co-immunoprecipitation (Fig. 6e and Supplementary Fig. 7a). Consistent with TELC-D5 cells, TEAD4 showed stronger interaction with VGLL1 compared with YAP and WWTR1, even though their knockdown also compromised TSC identity (Supplementary Fig. 7b–i). In addition, we studied whether the subcellular localization of VGLL1 and TEAD4 is similar to TELC-D5 cells. Unlike TELCs, VGLL1 was evenly distributed between the nucleus and cytoplasm in TSCs, while TEAD4 tended to localize more in the nucleus (Supplementary Fig. 7j). Additionally, we demonstrated that TEAD4 is important for preserving TSC identity by performing shRNA knockdown in TSCs derived from different PSC lines and TSCBT (Supplementary Fig. 7k–u). These results indicate that TEAD4 is also required to safeguard TSC differentiation potential (Supplementary Fig. 7v, w).
We then performed CUT&Tag for VGLL1 and TEAD4 in H9 TSCs and noticed that as in TELC-D5 cells, they share a large proportion (~80%) of genomic binding sites (Supplementary Fig. 8a). Similarly, TEAD4 was one of the most enriched motifs at VGLL1-bound sites (Supplementary Fig. 8b). We also noticed a decrease in the global levels of H3ac, H3K27ac and H4ac levels upon VGLL1 knockdown in H9 ESCs and UH10 iPSCs (Fig. 6f and Supplementary Fig. 8c). A large proportion of VGLL1- and TEAD4-shared regions corresponded to H3K27ac CUT&Tag peaks (80.7%) and chromatin accessible regions detected by ATAC-seq (93.17% of VGLL1, TEAD4 and H3K27ac shared sites), and these genes were related to placenta development and cell cycle (Fig. 6g and Supplementary Fig. 8d–i).
To explore a potential functional difference of the VGLL1-TEAD4 complex in TELCs and TSCs, we compared the binding pattern of VGLL1 and TEAD4, which showed different groups. Group 1 peaks, which take up a substantial proportion of VGLL1 and TEAD4 co-bound loci, were shared between TELCs and TSCs (Fig. 6h). Group 2 peaks were co-bound by VGLL1 and TEAD4 only in TELCs, while group 3 were co-bound by VGLL1 and TEAD4 only in TSCs. Group 1 peaks included genes with functions mostly related to placenta development, cell cycle regulation, histone modification and especially histone acetylation (Fig. 6i), consistent with our functional study in TELCs and TSCs. Group 2 peaks corresponded to genes involved not only in placenta development but also in various other developmental processes such as mesonephric epithelium development and sensory system development. We speculate that binding of VGLL1 and TEAD4 at these genes correlates with an unstable preliminary extraembryonic cell fate, in which the chromatin regions of different lineage genes are still accessible for binding. In this regard, TE cells isolated from human blastocysts have the capacity to develop into ICM cells and express NANOG, suggesting that their identity is not yet fully committed61. Group 3 peaks comprised genes participating in signaling pathways and functional terms like phagocytosis and endocytosis. Consistent with the notion that VGLL1-TEAD4 co-bound sites are enriched with H3K27ac, we found that this active histone mark was also positively correlated with VGLL1-TEAD4 binding sites for group 2 and group 3. For example, genes like APOB and GREM2 are bound by the VGLL1-TEAD4 complex only in TELCs (group 2) and their promoter and enhancer regions were also enriched for H3K27ac (Fig. 6j and Supplementary Fig. 8j). On the contrary, genes like PSG8 and PSG1 were co-bound by VGLL1 and TEAD4 in TSCs (group 3), with higher adjacent H3K27ac levels.
We have demonstrated that the function of the tandem VGLL1-TEAD4 in controlling self-renewal and cell identity is conserved in both TELCs and TSCs derived from human naïve PSCs, although there are differences between both cell types.
VGLL1-TEAD4 cooperate with GATA3 and TFAP2C to control the human TELC gene regulatory network
Cellular states are maintained by master gene regulatory networks (GRNs) consisting of key transcription factors and co-regulators62. It is known that the transcription factors GATA3 and TFAP2C are essential regulators of mouse and human TE specification63,64. TEAD4, GATA3 and TFAP2C are required for activating CDX2 expression and inducing TE lineage specification in mouse preimplantation embryos and GATA3 binds to the VGLL1 promoter to promote its expression in human trophoblast progenitors65. The regulatory network of TEAD4, GATA3 and TFAP2C in human models has not been studied in detail yet. To elucidate whether the VGLL1-TEAD4 complex acts synergistically with these two transcription factors, we performed CUT&Tag for GATA3 and TFAP2C in human TELC-D5 cells differentiated from 4CL H9 ESCs and compared it with the genome-wide binding of VGLL1 and TEAD4. Pairwise comparison showed that VGLL1 and TEAD4 shared 79.47% and 69.62%, respectively, of their binding sites with GATA3, and less than 50% with TFAP2C (49.12% and 45.52%, respectively), suggesting a stronger functional correlation between VGLL1, TEAD4 and GATA3 (Fig. 7a). In line with this, TE gene induction was impaired when GATA3 was knocked down during the TELC induction (Supplementary Fig. 8k, l). Besides, we noticed a relatively lower co-binding between GATA3 and TFAP2C (37.98%), indicating that they bind to the VGLL1-TEAD4 complex both independently and in combination (Fig. 7a). A four-way Venn diagram comparison of the binding pattern of these four transcription factors showed the largest overlaps among VGLL1-TEAD4-GATA3-TFAP2C and VGLL1-TEAD4-GATA3 bound loci, whereas VGLL1-TEAD4-TFAP2C had a smaller overlap (Fig. 7b). Further investigation revealed that genes co-bound by all four factors were related to placenta development, cell growth regulation and histone modification (Fig. 7c), whereas genes co-bound by VGLL1-TEAD4-GATA3 were associated with multiple lineage development (Fig. 7d). These results suggest that the VGLL1-TEAD4 complex may function as an interacting hub for other human TE regulators including GATA3 and TFAP2C. Supporting these observations, co-immunoprecipitation showed interaction between these four factors in TELC-D5 cells (Supplementary Fig. 8m). Besides, the binding motifs of TEAD4, GATA3 and TFAP2C were all enriched in VGLL1-bound sites, TEAD4 and TFAP2C motifs were enriched in GATA3-bound sites, and TEAD4 and GATA3 motifs were enriched in TFAP2C-bound sites (Fig. 4d and Fig. 7e, f). We also identified that VGLL1, TEAD4, GATA3 and/or TFAP2C co-bound genes were related to TE/TSC identity (Fig. 7g). Additionally, we observed binding of these factors at their own regulatory regions, pointing at positive feedback mechanisms. These results provide additional insights on the role of VGLL1 as a master regulator of human early placenta formation.
Discussion
During human preimplantation embryonic development, early embryonic cells divide and differentiate to form a layer of epithelial cells, the TE, that consists of the outer cells of the blastocyst and later forms the placenta. The transiently formed TE lineage will develop into the cytotrophoblast, which is the stem cell niche of the placenta villi and gives rise to the multinucleated SCT and the invasive EVT. Failure to properly proceed with any of these events leads to pregnancy loss or placental diseases66. Human cytotrophoblast identity can be captured in vitro from embryo blastocysts, first-trimester placenta or PSCs in the form of TSCs15,16,19,21,27, which self-renew and can be differentiated into SCT and EVT. While the developmental trajectories are reasonably well understood, the key regulators and pathways underlying human TE formation and TSC maintenance are only beginning to be investigated and outstanding questions remain.
Using TELCs and TSCs derived from human naïve PSCs, we have demonstrated that VGLL1 functions as an essential regulator of their induction and maintenance. VGLL1 depletion in TELCs or TSCs affects an array of cell cycle-related genes, TE/placenta-related transcription factors and other regulators, which impairs TELC/TSC proliferation and cell identity. VGLL1 was known to be expressed in the placenta and be involved in preeclampsia34,35 but the spatiotemporal kinetics of its expression during mammalian early development at the single-cell level had not been clarified. We show that VGLL1 expression substantially increases at the onset of both human and monkey TE specification, consistent with the idea that stage-specific regulators often play critical roles in controlling cell fate.
VGLL1 has been shown to be required for the induction of a panel of TE-related genes in primed human PSCs treated with BMP434, but this model does not recapitulate TE differentiation26,27. We show that VGLL1 cooperates with TEAD4 to regulate TE/TSC identity. Previous work had shown that VGLL1 interacts with TEAD4 to regulate gastric cancer malignancy36 but a role for the VGLL1-TEAD4 complex in early human placenta formation had not been described. Interestingly, mouse TE formation in vivo requires nuclear translocation of YAP and the interaction with TEAD4, but VGLL1 is negligibly expressed at this stage34. In contrast, YAP is mostly cytoplasmic in human TE derived from naïve PSCs. In human first trimester placenta, YAP is located in both the nucleus and cytoplasm13, suggesting that as development progresses the localization of YAP is dynamically regulated. Importantly, YAP/WWTR1 have been demonstrated to regulate human trophoblast cell identity13,56. Our findings suggest that YAP, WWTR1 and VGLL1 are critical partners of TEAD4 in TELCs and TSCs generated from human naïve PSCs. Further studies will be necessary to ascertain the potential crosstalk between them in the regulation of human TELCs/TSCs and the early placenta, and also the upstream signals controlling the subcellular distribution and function of both cofactors. Similarly, TEAD1 has recently been shown to regulate naïve PSC-derived TSC induction30 and it is tempting to speculate that this transcription factor can also recruit VGLL1.
VGLL1 acts by promoting H3K27ac deposition at target loci, which in turn increases chromatin accessibility. This is consistent with the role of TEAD4 in promoting histone acetylation in cancer cells67. To fully understand this process, it will be important to determine what epigenetic modulators interact with VGLL1, which can for example be done by performing immunoprecipitation coupled with mass spectrometry. Furthermore, we show that the VGLL1-TEAD4 complex functions synergistically with other known master TE regulators such as the transcription factors GATA3 and TFAP2C68. The overall genome-wide binding overlap for all these factors is, however, only partial, pointing at both shared and unique functions. Importantly, some of these regulators are also enriched in naïve PSCs (e.g., TFAP2C, TEAD4) and amnion (e.g., GATA3), among other lineages. In the case of TEAD4, its binding is reorganized and expanded in TELCs compared to naïve PSCs, but some regions are shared. This suggests preparation for lineage switching when VGLL1 is recruited. Clarification of how these and other factors coordinate with each other to achieve their goals will help explain the continuum of cell fate decisions during early human development. An in-depth study of these mechanisms will not only provide new ways to understand normal and abnormal development but will also be instrumental to improve the quality and functionality of human PSC-derived embryo models69,70,71,72,73,74,75,76.
Methods
Inclusion and ethics statement
The use of human H9 ESCs (WiCell, WA09), UH10 iPSCs and TSCBT in this study is compliant with the ‘Guidance of the Ministry of Science and Technology for the Review and Approval of Human Genetic Resources’ and was approved by the ‘Life Science and Medical Ethics Committee’ of the Guangzhou Institutes of Biomedicine and Health under license number GIBH-LMEC2023-001-01(AL).
Cell culture
HEK293T cells and ICR mouse embryonic fibroblast-derived feeders were maintained in DMEM (Corning) supplemented with 10% fetal bovine serum (NATOCOR). Primed human PSCs, including H9 ESCs and UH10 iPSCs50,51 were routinely cultured in mTeSRTM1 medium (STEMCELL). For passaging, primed PSCs were washed with DPBS (Hyclone) once and treated with 0.5 mM EDTA (Invitrogen, 15575020) for 5 minutes. Then, EDTA was removed and cells were passaged as small clumps using a Pasteur pipette. Human TSCBT were cultured in TSC medium15 (DMEM/F12 supplemented with N2 and B27, penicillin-streptomycin, Glutamax, β-mercaptoethanol, 1.5 µg/ml L-ascorbic acid, 50 ng/ml EGF (PeproTech), 0.5 µM A83-01 (Selleck), 1 µM SB431542 (Selleck), 2 µM CHIR99021 (Axon), 0.8 mM VPA (Vetec) and 5 µM Y-27632 (Axon) supplemented with 10 µM Y-27632. Human H9 ESCs were purchased from WiCell Research Institute, human UH10 iPSCs were provided by Dr. Guangjin Pan (Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, China), and TSCBT were provided by Dr. Hiroaki Okae and Dr. Takahiro Arima (Department of Informative Genetics, Tohoku University Graduate School of Medicine). All cell lines were negative for mycoplasma.
Naïve PSC induction
Generation of 4CL human naïve PSCs was performed as reported33. Briefly, primed PSCs were dissociated into single cells with a 1:1 mixture of 0.5 mM EDTA and TrypLE Express (Gibco) and plated at a density of 100,000 to 150,000 cell/well of 6-well plate on feeders in mTeSRTM1 medium supplemented with 10 µM Y-27632 (Axon, 1683). Twenty-four hours later, the medium was switched to 4CL medium33, which was refreshed daily. Cells were passaged as single cells (1:5 to 1:8) every 3 to 4 days. PXGL and HENSM naïve PSCs were induced as previously reported52,53.
TE and TSC induction
TE and TSC induction were performed as previously described15,26. Briefly, for generating TE, human naïve PSCs were dissociated into single cells with a 1:1 mixture of 0.5 mM EDTA and TrypLE Express and plated at a density of 100,000 cell/well of 6-well plate on Geltrex in TE induction medium (1:1 mix of Neurobasal medium (Gibco) and DMEM/F12 (HyClone) supplemented with N2 (Gibco) and B27 (Gibco), penicillin-streptomycin (HyClone), Glutamax (Gibco), β-mercaptoethanol (Sigma), 1 µM PD0325901 (Axon) and 1 µM A83-01 (Selleck)) for 5 days. TELCs at day 5 were used for downstream analysis throughout this study. For generating expandable TSCs, TELCs at day 5 were dissociated into single cells with TrypLE and plated on Geltrex-coated 6-well plates at a 1:4-1:8 split ratio in TSC medium (DMEM/F12 supplemented with N2 and B27, penicillin-streptomycin, Glutamax, β-mercaptoethanol, 1.5 µg/ml L-ascorbic acid, 50 ng/ml EGF (PeproTech), 0.5 µM A83-01 (Selleck), 1 µM SB431542 (Selleck), 2 µM CHIR99021 (Axon), 0.8 mM VPA (Vetec) and 5 µM Y-27632 (Axon) supplemented with 10 µM Y-27632.
EVT and SCT differentiation
Differentiation of TSCs toward EVT and SCT were performed as previously described60. Briefly, for EVT differentiation, TSCs were plated at a density of 70,000 cell/well on Geltrex-coated 6-well plates in EVT medium (TeSR™-E8 medium (STEM CELL), 2.5 µM SB431542 and 2.5 ng/ml EGF). Over the course of 12 days, medium was refreshed every 2 days. For SCT differentiation, TSCs were plated at a density of 40,000 cell/per well on Geltrex-coated 6-well plates in SCT medium (TeSR™-E6 medium (STEM CELL), 20 ng/ml activin A and 50 ng/ml EGF). Medium was refreshed every two days during the course of 14 days.
shRNA mediated knockdown
For shRNA-mediated VGLL1, TEAD4, YAP1, WWTR1, TP53 and GATA3 knockdown experiments, shRNA inserts were cloned into pLKO.1 vector. Lentiviruses were generated with HEK293T cells using Lipofectamine 3000 (Invitrogen) following the manufacturer’s instructions. One day after passaging, human naïve PSCs, cells at day 1 of TELC-induction (for TP53 knockdown) or TSCs were infected with shRNA viruses for 6-8 hours. Transduced cells were subjected to puromycin selection (1 μg/mL) 36 hours later and continued for 1-2 days until control cells died. Transduced human naïve PSCs were used for TELC differentiation as previously described. Transduced TSCs were collected without further passaging or used for further EVT and SCT differentiation. All shRNA target sequences are listed in Supplementary Table 1.
Generation of knockout cells
Single guide RNAs (sgRNAs) targeting VGLL1 and TEAD4 exons were cloned into lentiCRISPR-v2 plasmids (Addgene, 52961) for KO cell line generation. H9 ESCs were transfected with sgRNA-coding plasmids using Lipofectamine 3000 (ThermoFisher) following the manufacturer’s instructions. Puromycin selection (1 µg/mL) was conducted 2 days post-transfection and continued for 1-2 days until control cells were dead. Single-cell derived clones were picked for further expansion and PCR-based genotyping was performed to test frameshift mutations. Homozygous KO were further confirmed by TA cloning followed by Sanger sequencing, and also by Western blotting. All sgRNA sequences are listed in Supplementary Table 2. Primers used for genomic PCR are listed in Supplementary Table 3.
Growth curve generation
A total of 1.25 × 104 TELCs or TSCs were seeded at day 0 per well of a 12-well plate, or a total of 8 × 104 TELCs or 2.5 × 104 TSCs were seeded at day 0 per well of a 6-well plate. The medium was changed every day. The cell number was counted for 3 wells as triplicates using a hemocytometer. Three independent experiments were performed.
Immunofluorescence, Western blotting and immunoprecipitation
For immunofluorescence, cells were washed with PBS and then fixed with 4% paraformaldehyde for 10 minutes at room temperature and permeabilized with 3% BSA (bovine serum albumin) +0.2% Triton X-100 in PBS for 60 minutes at room temperature. Cells were incubated with primary antibodies overnight at 4 °C and subsequently with secondary antibodies for 1 hour at room temperature. Finally, cells were stained with DAPI for 5 minutes and observed with a confocal microscope. Immunofluorescence was assessed using a Leica TCS SP2 spectral confocal microscope. For Western blotting, cells were washed with PBS and lysed on ice in RIPA buffer (50 mM Tris-HCl (pH 7.5), 150 mM NaCl, 0.25% sodium deoxycholate, 0.1% NP-40 and 0.1% Triton X-100) supplemented with a proteinase inhibitor (Cocktail, Roche). Samples were sonicated and subjected to SDS–PAGE and transferred onto a PVDF membrane (Millipore). Membranes were blocked with 5% non-fat milk in TBST and then sequentially incubated with primary antibody overnight at 4 °C and secondary antibodies for 1 hour at room temperature. Signals were detected by Amersham ECL (GE Healthcare) and visualized with a FUSION SOLO 4 M machine (Vilber Lourmat) and analyzed with FusionCapt Advance Solo 4.16.15. For nuclear-cytoplasmic fractionation, cells were scraped down from the culture plates and lysed in 150 μL lysis buffer (PBS containing 0.04% NP-40 and proteinase inhibitor) on ice with occasional pipetting. 50 μL of sample were kept as whole cell lysate input. The remaining samples were centrifuged and 50 μL supernatant was kept as a cytoplasmic fraction. The precipitated nuclear fraction was washed four times and sonicated. The whole cell lysate input, cytoplasmic fraction and nuclear fraction were used for SDS-PAGE analysis. For immunoprecipitation, cells were lysed in TNE lysis buffer (50 mM Tris-HCL (pH 7.5), 250 mM NaCl, 0.5% NP-40 and 1 mM EDTA) containing a proteinase inhibitor cocktail and rotated on ice for 15 minutes. Lysates were homogenized by a 0.4 mm syringe needle and then centrifuged at 13,000 g for 15 minutes at 4 °C. Supernatants were pre-cleaned with prewashed protein A/G for 1 hour at 4 °C. Same volume of TNEG buffer (50 mM tris-HCl (pH 7.5), 50 mM NaCl, 0.5% NP-40, 20% glycerol, and 1 mM EDTA) was added to dilute the supernatants. Primary antibodies were incubated with the diluted supernatants overnight at 4 °C and then incubated with prewashed protein A/G magnetic beads for 3 hours at 4 °C. Samples were then loaded on an Invitrogen magnetic separator and beads were washed four times with washing buffer (20 mM Tris-HCl pH 7.6 and 140 mM NaCl). Proteins were eluted with 1x loading buffer by boiling at 80 °C in a water bath for 10 minutes and subjected to SDS-PAGE analysis. Antibodies used for immunofluorescence, Western blotting and immunoprecipitation are listed in Supplementary Table 5. Unprocessed scans of Western blotting for all figures are provided in Supplementary Fig. 9.
RNA isolation, RT–qPCR and RNA-seq library preparation
Total RNA was isolated using RNAzol (MRC) based on the manufacturer’s instructions. RT-qPCR analysis was performed using the SYBR Premix ExTaq Kit (Takara, RR420A) with an ABI 7500 real-time PCR machine. All RT-qPCR primers are listed in Supplementary Table 4. Bulk RNA-seq library preparation and sequencing were performed at Berry Genomics Co. Ltd. (Beijing, China).
ATAC-seq and CUT&Tag library construction
For ATAC-seq library preparation, 50,000 cells were washed once with cold PBS and resuspended in 50 μl of lysis buffer (10 mM Tris-HCl (pH 7.4) 10 mM NaCl, 3 mM MgCl2, and 0.1% (v/v) IGEPAL CA-630 (Sigma)). The suspension was then centrifuged at 500 g for 10 minutes at 4 °C, followed by the addition of 50 μl of the transposition reaction mixture of the TruePrep DNA Library Prep Kit (Vazyme, TD502). Samples were then incubated at 37 °C for 30 minutes. Transposition reactions were cleaned up using the MinElute PCR Purification Kit (Qiagen, 28004). ATAC-seq libraries were subjected to five cycles for preamplification and amplified by PCR for an appropriate number of cycles. Amplified libraries were purified using the QIAquick PCR Purification Kit (Qiagen, 28104). Library concentration was measured using the VAHTSTM Library Quantification Kit (Vazyme, NQ101). For CUT&Tag library construction, Hyperactive In-Situ ChIP Library Prep Kit for Illumina (Vazyme, TD902-01) was used. In brief, 100,000 cells were incubated with Concanavalin A-coated magnetic beads and then primary antibodies were added to incubate overnight at 4 °C. Tubes were placed on a magnetic rack to remove the supernatants. Secondary antibodies were added and samples were incubated at room temperature for 1 hour. After placing the tubes on the magnetic rack and removing the supernatants, hyperactive pG-Tn5/pA-Tn5 transposon (incubated at room temperature for 1 hour) were added. Tagmentation was performed for 1 hour and DNA was extracted using phenol–chloroform–isoamyl alcohol. Purified DNA fragments were amplified by PCR for 15–20 cycles and the PCR products were purified following the kit’s instructions. CUT&Tag library construction were performed in duplicate. Libraries were sequenced by Berry Genomics Co. Ltd. (Beijing, China). Primary antibodies used for CUT&Tag library construction are listed in Supplementary Table 5.
Bulk RNA-seq, ATAC-seq and CUT&Tag data analysis
For bulk RNA-seq, reads were trimmed of adapters and low-quality reads were removed by fastp77 (v0.21.0) (https://github.com/OpenGene/fastp) with the default option. Trimmed data were aligned to the Ensembl v76 (hg38) transcript annotations using STAR78 (v2.7.3a) with the settings --outSAMtype BAM Unsorted, --quantMode TranscriptomeSAM, --outSAMheaderHD\@HD VN:1.4 SO:unsorted. Gene counts were calculated using RSEM (v1.2.18) with the settings --paired-end. After removing the genes with zero count in all samples, DEGs were determined using DESeq279 (v1.24) in R (v3.6.0). Heatmaps were generated using pheatmap (v1.0.12) in R (v3.6.0). Volcano plots were generated from DESeq2 output files using EnhancedVolcano (v1.8.0) in R (v3.6.0) with the settings pCutoff = 10e-3, FCcutoff = 1.0. Processing of the public bulk RNA-seq data was the same as that of our bulk RNA-seq data. For ATAC-seq and CUT&Tag, reads were trimmed of adapters and low-quality reads were removed by fastp77 (v0.21.0) (https://github.com/OpenGene/fastp) with the settings -g, --length_required = 10, --n_base_limit = 5. Trimmed data were first aligned to the hg38 assembly using Bowtie2 (v2.2.5) with the settings --very-sensitive. Low quality mapped reads were removed using Samtools with the settings -q 30. Duplicated reads were collapsed using Picard (v1.9.0). Peaks were called using MACS2 (v2.1.0) with the settings -B, --nomodel, --keep-dup 1, -g hs, --call-summits, -q 1e-05 and then were annotated using ChIPseeker80 (v1.26.2) annotatePeak tools. Pie charts of genomic annotation were generated using ggplot2 (v3.3.3) in R (v3.6.0). Motif discovery was performed using HOMER (v4.9.1) findMotifsGenome.pl tools with the default option. Venn diagrams were generated from MACS2 narrowPeak files using ChIPpeakAnno81,82 (v3.24.2) findOverlapsOfPeaks and makeVennDiagram tools. BigWig files were generated using MACS2 (v2.1.0) bdgcmp tools with the settings -m logLR, -p 0.00001 and deeptools(v3.4.3) bamCoverage tools with the settings —ignoreDuplicates, --normalizeUsing CPM, --minMappingQuality 30. Co-occupancy analysis by tag density pileups was done by deeptools (v3.4.3) computeMatrix reference-point tools with the settings -skipZeros, plotHeatmap tools with the settings --sortRegions keep and plotProfile tools with the settings —perGroup, --kmeans 1. Functional annotation was performed by ClusterProfiler83 (v3.6.0) enrichGO tools in R (v3.6.0) and barplots were generated using ggplot2 (v3.3.3) in R (v3.6.0).
Library preparation for scRNA-seq
ScRNA-seq libraries were prepared using a DNBelab C Series Single-Cell Library Prep set (MGI, 1000021082)84. Briefly, cells were dissociated into single cells with a 1:1 mixture of 0.5 mM EDTA and TrypLE Express (Gibco). After passing through 70 μM cell strainers, cell suspensions were converted to barcoded scRNA-seq libraries according to the manufacturer’s protocol. Qubit ssDNA Assay kit (Thermo Fisher Scientific, Q10212) was used to quantify the sequencing library concentration. The resulting libraries were sequenced using a DIPSEQ T1 sequencer at the China National GeneBank (CNGB) in Shenzhen.
scRNA-seq data analysis
Raw data processing
Raw sequencing reads were filtered and demultiplexed using PISA (v0.2) (https://github.com/shiquan/PISA). Filtered reads were aligned to hg38 genome reference using STAR78 (v2.7.3a) and sorted by sambamba (v0.7.0)85. Cell versus gene UMIs (Unique Molecular Identifiers) count matrices were generated by PISA.
Quality control
Count matrices were processed using the Seurat package (v3.1.4)86 in R (v3.6.0). Genes expressed in less than three cells were filtered out. Cells with less than 15% mitochondrial reads, less than 500 genes and 3,000 UMIs detected were excluded from further analysis. Doublet removal was performed by DoubletFinder87 with the default parameter. Public data were re-analyzed from E-MTAB-3929, PRJEB30442, GSE136447, GSE74767, GSE84892 and CNP0001454. Quality control for public datasets was processed according to their respective reference articles.
Analysis of scRNA-seq data
Seurat package88 (v3.1.4) in R (v3.6.0) was used to create Seurat objects for analysis. Public datasets were annotated using the gene lists provided in their original articles. We first used the Seurat FindIntegrationAnchors function to integrate our scRNA-seq data of TELC-D5 with human embryo public scRNA-seq data (E-MTAB-3929, PRJEB30442, GSE136447)40,41,42. Four Seurat objects were merged and then three thousand genes that were repeatedly variable across datasets were selected by SelectIntegrationFeatures tools. FindIntegrationAnchors was then performed with the settings reduction=rpca. All datasets were finally integrated by IntegrateData with default parameters. Principal components were selected by principal component analysis (PCA). UMAP was then used for visualization. Plots were generated using DimPlot and DotPlot. For scRNA-seq data of TELC-D5 cells, Seurat object was processed using NormalizeData, FindVariableFeatures, ScaleData, RunPCA and RunUMAP functions in turn. Dimension reduction started with PCA on 3,000 significantly variable genes and the first thirty principal components were selected as input for UMAP visualization. Correlation analysis was based on the 10,000 significantly variable genes and cor () function in R (v3.6.2). Correlation heatmap were generated by pheatmap (v1.0.12). UMAP was used for visualization of public embryo scRNA-seq data (E-MTAB-3929, GSE74767, GSE84892, PRJEB30442). The single-cell trajectory was analyzed using the count matrix of public embryo scRNA-seq data by Monocle 2 (v4.1.0). DEGs between cell types were identified using FindAllMarkers function of Seurat package with the settings only.pos = TRUE, min.pct = 0.25, logfc.threshold = 0.25 for trajectory analysis. Dimensions were reduced by using the reduceDimension function with the method of DDRTree. Lineage trajectory and branch points were then constructed. Pre-lineage of the trajectory was chosen as the root. Trajectory was plotted by plot_cell_trajectory function of Monocle 2 (v4.1.0). Expression patterns of key landmark genes along the trajectory were generated by plot_genes_branched_pseudotime function of Monocle 2. Plots were generated using stacked violin functions.
Statistics and reproducibility
Data of bar charts are represented as mean ± standard error of the mean (SEM). Significance was tested using a two-tailed unpaired Student’s t-test. The related P values or P value ranges are shown in the figure legend. In all cases, *P < 0.05, **P < 0.01, ***P < 0.001. The number of replicates for each experiment is presented in the figure legends.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
All bulk sequencing data (RNA-seq, ATAC-seq and CUT&Tag) and single-cell sequencing data (scRNA-seq) reported in this paper are deposited in the Gene Expression Omnibus (GEO) database under the accession number GSE193621. The raw single-cell sequencing data are also deposited in the CNGB Nucleotide Sequence Archive under the accession number CNP0004944. Public scRNA-seq dataset of human early embryo development were downloaded from EBI ArrayExpress (E-MTAB-3929)40, ENA database (PRJEB30442)41 and GEO database (GSE136447)42. scRNA-seq datasets of monkey and mouse early embryo development were downloaded from GEO database (GSE7476748, GSE8489249). Bulk RNA-seq and scRNA-seq dataset of 4CL H9 naïve ESCs were downloaded from CNGBdb (CNP0001454)33. Source data are provided with this paper.
References
Rossant, J. & Tam, P. P. L. Exploring early human embryo development. Science 360, 1075–1076 (2018).
Maltepe, E. & Fisher, S. J. Placenta: the forgotten organ. Annu. Rev. Cell Dev. Biol. 31, 523–552 (2015).
Turco, M. Y. & Moffett, A. Development of the human placenta. Development 146, dev163428 (2019).
Cao, Z.-B. et al. Chromatin remodeler INO80 mediates trophectoderm permeability barrier to modulate morula-to-blastocyst transition. Zool. Res. 42, 562–573 (2021).
Yagi, R. et al. Transcription factor TEAD4 specifies the trophectoderm lineage at the beginning of mammalian development. Development 134, 3827–3836 (2007).
Ralston, A. et al. Gata3 regulates trophoblast development downstream of Tead4 and in parallel to Cdx2. Development 137, 395–403 (2010).
Nishioka, N. et al. Tead4 is required for specification of trophectoderm in pre-implantation mouse embryos. Mech Dev 125, 270–283 (2008).
Karasek, C., Ashry, M., Driscoll, C. S. & Knott, J. G. A tale of two cell-fates: role of the Hippo signaling pathway and transcription factors in early lineage formation in mouse preimplantation embryos. Mol Hum Reprod 26, 653–664 (2020).
Sasaki, H. Roles and regulations of Hippo signaling during preimplantation mouse development. Dev Growth Differ 59, 12–20 (2017).
Soncin, F. & Parast, M. M. Role of Hippo signaling pathway in early placental development. Proc. Natl Acad. Sci. USA 117, 20354–20356 (2020).
Nishioka, N. et al. The Hippo signaling pathway components Lats and Yap pattern Tead4 activity to distinguish mouse trophectoderm from inner cell mass. Dev. Cell 16, 398–410 (2009).
Panciera, T., Azzolin, L., Cordenonsi, M. & Piccolo, S. Mechanobiology of YAP and TAZ in physiology and disease. Nat. Rev. Mol. Cell Biol. 18, 758–770 (2017).
Meinhardt, G. et al. Pivotal role of the transcriptional co-activator YAP in trophoblast stemness of the developing human placenta. Proc. Natl Acad. Sci. USA 117, 13562–13570 (2020).
Qin, H. et al. YAP induces human naive pluripotency. Cell Rep. 14, 2301–2312 (2016).
Okae, H. et al. Derivation of human trophoblast stem cells. Cell Stem Cell 22, 50–63.e56 (2018).
Castel, G. et al. Induction of human trophoblast stem cells from somatic cells and pluripotent stem cells. Cell Rep. 33, 108419 (2020).
Cinkornpumin, J. K. et al. Naive human embryonic stem cells can give rise to cells with a trophoblast-like transcriptome and methylome. Stem Cell Rep. 15, 198–213 (2020).
Dong, C. et al. Derivation of trophoblast stem cells from naive human pluripotent stem cells. Elife 9, e52504 (2020).
Karvas, R. M. et al. Stem-cell-derived trophoblast organoids model human placental development and susceptibility to emerging pathogens. Cell Stem Cell 29, 810–825.e818 (2022).
Liu, X. et al. Reprogramming roadmap reveals route to human induced trophoblast stem cells. Nature 586, 101–107 (2020).
Pham, T. X. A. et al. Modeling human extraembryonic mesoderm cells using naive pluripotent stem cells. Cell Stem Cell 29, 1346–1365.e1310 (2022).
Viukov, S. et al. Human primed and naïve PSCs are both able to differentiate into trophoblast stem cells. Stem Cell Rep. 17, 2484–2500 (2022).
Naama, M. & Buganim, Y. Human trophoblast stem cell-state acquisition from pluripotent stem cells and somatic cells. Curr. Opin. Genet. Dev. 81, 102084 (2023).
Xu, R. H. et al. BMP4 initiates human embryonic stem cell differentiation to trophoblast. Nat. Biotechnol. 20, 1261–1264 (2002).
Amita, M. et al. Complete and unidirectional conversion of human embryonic stem cells to trophoblast by BMP4. Proc. Natl Acad. Sci. USA 110, E1212–E1221 (2013).
Guo, G. et al. Human naive epiblast cells possess unrestricted lineage potential. Cell Stem Cell 28, 1040–1056.e1046 (2021).
Io, S. et al. Capturing human trophoblast development with naive pluripotent stem cells in vitro. Cell Stem Cell 28, 1023–1039 e1013 (2021).
Soncin, F. et al. Derivation of functional trophoblast stem cells from primed human pluripotent stem cells. Stem Cell Rep. 17, 1303–1317 (2022).
Wei, Y. et al. Efficient derivation of human trophoblast stem cells from primed pluripotent stem cells. Sci. Adv. 7, eabf4416 (2021).
Dong, C. et al. A genome-wide CRISPR-Cas9 knockout screen identifies essential and growth-restricting genes in human trophoblast stem cells. Nat. Commun. 13, 2548 (2022).
Kobayashi, N. et al. The microRNA cluster C19MC confers differentiation potential into trophoblast lineages upon human pluripotent stem cells. Nat. Commun. 13, 3071 (2022).
Saha, B. et al. TEAD4 ensures postimplantation development by promoting trophoblast self-renewal: an implication in early human pregnancy loss. Proc. Natl Acad. Sci. USA 117, 17864–17875 (2020).
Mazid, M. A. et al. Rolling back human pluripotent stem cells to an eight-cell embryo-like stage. Nature 605, 315–324 (2022).
Soncin, F. et al. Comparative analysis of mouse and human placentae across gestation reveals species-specific regulators of placental development. Development 145, dev156273 (2018).
Enquobahrie, D. A. et al. Differential placental gene expression in preeclampsia. Am. J. Obstet. Gynecol. 199, 566 e561–511 (2008).
Im, J. Y. et al. VGLL1 phosphorylation and activation promotes gastric cancer malignancy via TGF-beta/ERK/RSK2 signaling. Biochim. Biophys. Acta Mol. Cell Res. 1868, 118892 (2021).
Pobbati, A. V., Chan, S. W., Lee, I., Song, H. & Hong, W. Structural and functional similarity between the Vgll1-TEAD and the YAP-TEAD complexes. Structure 20, 1135–1140 (2012).
Kim, B. K. et al. PI3K/AKT/beta-catenin signaling regulates vestigial-like 1 which predicts poor prognosis and enhances malignant phenotype in gastric cancer. Cancers (Basel) 11, 1923 (2019).
Han, L. et al. Cell transcriptomic atlas of the non-human primate Macaca fascicularis. Nature 604, 723–731 (2022).
Petropoulos, S. et al. Single-Cell RNA-Seq reveals lineage and X chromosome dynamics in human preimplantation embryos. Cell 165, 1012–1026 (2016).
Meistermann, D. et al. Integrated pseudotime analysis of human pre-implantation embryo single-cell transcriptomes reveals the dynamics of lineage specification. Cell Stem Cell 28, 1625–1640 e1626 (2021).
Xiang, L. et al. A developmental landscape of 3D-cultured human pre-gastrulation embryos. Nature 577, 537–542 (2020).
Adachi, K. et al. Context-dependent wiring of Sox2 regulatory networks for self-renewal of embryonic and trophoblast stem cells. Mol. Cell 52, 380–392 (2013).
Blakeley, P. et al. Defining the three cell lineages of the human blastocyst by single-cell RNA-seq. Development 142, 3613 (2015).
Trapnell, C. et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat. Biotechnol. 32, 381–386 (2014).
Qiu, X. et al. Reversed graph embedding resolves complex single-cell trajectories. Nat. Methods 14, 979–982 (2017).
Qiu, X. et al. Single-cell mRNA quantification and differential analysis with Census. Nat. Methods 14, 309–315 (2017).
Nakamura, T. et al. A developmental coordinate of pluripotency among mice, monkeys and humans. Nature 537, 57–62 (2016).
Posfai, E. et al. Position- and Hippo signaling-dependent plasticity during lineage segregation in the early mouse embryo. Elife 6, e22906 (2017).
Huang, K. et al. BMI1 enables interspecies chimerism with human pluripotent stem cells. Nat. Commun. 9, 4649 (2018).
Zhu, Y. et al. Generating functional cells through enhanced interspecies chimerism with human pluripotent stem cells. Stem Cell Rep. 17, 1059–1069 (2022).
Yanagida, A. et al. Naive stem cell blastocyst model captures human embryo lineage segregation. Cell Stem Cell 28, 1016–1022.e1014 (2021).
Bayerl, J. et al. Principles of signaling pathway modulation for enhancing human naive pluripotency induction. Cell Stem Cell 28, 1549–1565.e1512 (2021).
Kaya-Okur, H. S. et al. CUT&Tag for efficient epigenomic profiling of small samples and single cells. Nat. Commun. 10, 1930 (2019).
Mori, S., Takeuchi, T., Ishii, Y. & Kukimoto, I. The Transcriptional Cofactor VGLL1 Drives Transcription of Human Papillomavirus Early Genes via TEAD1. J. Virol. 94, e01945–19 (2020).
Ray, S. et al. Hippo signaling cofactor, WWTR1, at the crossroads of human trophoblast progenitor self-renewal and differentiation. Proc. Natl Acad. Sci. USA 119, e2204069119 (2022).
Zhuang, Q. et al. NCoR/SMRT co-repressors cooperate with c-MYC to create an epigenetic barrier to somatic cell reprogramming. Nat. Cell Biol. 20, 400–412 (2018).
Beyer, T. A. et al. Switch enhancers interpret TGF-beta and Hippo signaling to control cell fate in human embryonic stem cells. Cell Rep. 5, 1611–1624 (2013).
Klemm, S. L., Shipony, Z. & Greenleaf, W. J. Chromatin accessibility and the regulatory epigenome. Nat. Rev. Genet. 20, 207–220 (2019).
Mischler, A. et al. Two distinct trophectoderm lineage stem cells from human pluripotent stem cells. J. Biol. Chem. 296, 100386 (2021).
De Paepe, C. et al. Human trophectoderm cells are not yet committed. Hum. Reprod. 28, 740–749 (2012).
Li, M. & Izpisua Belmonte, J. C. Deconstructing the pluripotency gene regulatory network. Nat. Cell Biol. 20, 382–392 (2018).
Bai, Q. et al. Dissecting the first transcriptional divergence during human embryonic development. Stem Cell Rev Rep 8, 150–162 (2012).
Benchetrit, H. et al. Extensive nuclear reprogramming underlies lineage conversion into functional trophoblast stem-like cells. Cell Stem Cell 17, 543–556 (2015).
Kumar, B. et al. Polycomb repressive complex 2 shields naïve human pluripotent cells from trophectoderm differentiation. Nat Cell Biol 24, 845–857 (2022).
Bacenkova, D. et al. In vitro model of human trophoblast in early placentation. Biomedicines 10, 904 (2022).
Chen, C. L. et al. Arginine is an epigenetic regulator targeting TEAD4 to modulate OXPHOS in prostate cancer cells. Nat. Commun. 12, 2398 (2021).
Krendl, C. et al. GATA2/3-TFAP2A/C transcription factor network couples human pluripotent stem cell differentiation to trophectoderm with repression of pluripotency. Proc. Natl Acad. Sci. USA 114, E9579–E9588 (2017).
Kagawa, H. et al. Human blastoids model blastocyst development and implantation. Nature 601, 600–605 (2022).
Yu, L. et al. Blastocyst-like structures generated from human pluripotent stem cells. Nature 591, 620–626 (2021).
Liu, X. et al. Modelling human blastocysts by reprogramming fibroblasts into iBlastoids. Nature 591, 627–632 (2021).
Ai, Z. et al. Dissecting peri-implantation development using cultured human embryos and embryo-like assembloids. Cell Res. 33, 661–678 (2023).
Liu, L. et al. Modeling post-implantation stages of human development into early organogenesis with stem-cell-derived peri-gastruloids. Cell 186, 3776–3792 e3716 (2023).
Oldak, B. et al. Complete human day 14 post-implantation embryo models from naive ES cells. Nature 622, 562–573 (2023).
Pedroza, M. et al. Self-patterning of human stem cells into post-implantation lineages. Nature 622, 574–583 (2023).
Weatherbee, B. A. T. et al. Pluripotent stem cell-derived model of the post-implantation human embryo. Nature 622, 584–593 (2023).
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Yu, G., Wang, L. G. & He, Q. Y. ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics 31, 2382–2383 (2015).
Zhu, L. J. Integrative analysis of ChIP-chip and ChIP-seq dataset. Methods Mol. Biol. 1067, 105–124 (2013).
Zhu, L. J. et al. ChIPpeakAnno: a Bioconductor package to annotate ChIP-seq and ChIP-chip data. BMC Bioinf. 11, 237 (2010).
Yu, G., Wang, L. G., Han, Y. & He, Q. Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16, 284–287 (2012).
Liu, C. et al. A portable and cost-effective microfluidic system for massively parallel singlecell transcriptome profiling. bioRxiv https://doi.org/10.1101/818450 (2019).
Tarasov, A., Vilella, A. J., Cuppen, E., Nijman, I. J. & Prins, P. Sambamba: fast processing of NGS alignment formats. Bioinformatics 31, 2032–2034 (2015).
Stuart, T. et al. Comprehensive integration of single-cell data. Cell 177, 1888–1902.e1821 (2019).
McGinnis, C. S., Murrow, L. M. & Gartner, Z. J. DoubletFinder: doublet detection in single-cell RNA sequencing data using artificial nearest neighbors. Cell Syst. 8, 329–337.e324 (2019).
Butler, A., Hoffman, P., Smibert, P., Papalexi, E. & Satija, R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol. 36, 411–420 (2018).
Acknowledgements
We thank all members of the Laboratory of Integrative Biology of the Guangzhou Institutes of Biomedicine and Health ang BGI research for their support. We thank the instrument platform and the supercomputing center of the Guangzhou Institutes of Biomedicine and Health for their technical support. We thank the CNGB for providing technical support. This work was supported by the National Natural Science Foundation of China (32370848, 32250710149 and U20A2015 to M.A.E., 32150410348 to M.A.M. and 32200669 to W.L.), the Guangdong Basic and Applied Basic Research Foundation (2021B1515120075 to M.A.E.), the Science and Technology Planning Project of Guangdong Province (2023B1212060050 to W.L.), the Guangzhou Science and Technology Foundation (2024A04J4702 and 202102020176 to W.L.), the CAS President’s International Fellowship Initiative (CAS-PIFI) for special experts (2020FSB0002 to M.A.M.) and the Foreign Expert Project-Young Talent Program (QN2022031001L to M.A.M.).
Author information
Authors and Affiliations
Contributions
W.L., M.A.M., and M.A.E. conceived the original idea and supervised this study. Z.L. contributed to the original idea. Y.Y. and W.J. conducted most of the experiments with help from Z.L., Y.Li, Hao L., L.F., Y.J., J.Lai., Haiwei L., B.J.S., Y.Z., Y.Lv., Y.S. and C.L. W.J. performed most of the bioinformatics analysis with help from J. Li, L.W. and Y.Lai. W.L., M.A.M., and M.A.E. interpreted the data and wrote the manuscript with input from W.J., A.P.H., T.Z. and L.L.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yang, Y., Jia, W., Luo, Z. et al. VGLL1 cooperates with TEAD4 to control human trophectoderm lineage specification. Nat Commun 15, 583 (2024). https://doi.org/10.1038/s41467-024-44780-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-024-44780-8
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.