Hijacking of transcriptional condensates by endogenous retroviruses

Asimi, Vahid; Sampath Kumar, Abhishek; Niskanen, Henri; Riemenschneider, Christina; Hetzel, Sara; Naderi, Julian; Fasching, Nina; Popitsch, Niko; Du, Manyu; Kretzmer, Helene; Smith, Zachary D.; Weigert, Raha; Walther, Maria; Mamde, Sainath; Meierhofer, David; Wittler, Lars; Buschow, René; Timmermann, Bernd; Cisse, Ibrahim I.; Ameres, Stefan L.; Meissner, Alexander; Hnisz, Denes

doi:10.1038/s41588-022-01132-w

Download PDF

Article
Open access
Published: 21 July 2022

Hijacking of transcriptional condensates by endogenous retroviruses

Vahid Asimi^1,2^na1,
Abhishek Sampath Kumar^1,3^na1,
Henri Niskanen¹^na1,
Christina Riemenschneider^1,3^na1,
Sara Hetzel ORCID: orcid.org/0000-0002-4783-3814¹,
Julian Naderi¹,
Nina Fasching⁴,
Niko Popitsch^4,5,
Manyu Du ORCID: orcid.org/0000-0001-5737-3578^6,7,
Helene Kretzmer¹,
Zachary D. Smith^8,9,
Raha Weigert¹,
Maria Walther¹,
Sainath Mamde¹,
David Meierhofer¹⁰,
Lars Wittler¹¹,
René Buschow ORCID: orcid.org/0000-0002-9800-2578¹²,
Bernd Timmermann¹³,
Ibrahim I. Cisse^6,7,
Stefan L. Ameres^4,5,
Alexander Meissner ORCID: orcid.org/0000-0001-8646-7469^1,8,9 &
…
Denes Hnisz ORCID: orcid.org/0000-0002-6256-1693¹

Nature Genetics volume 54, pages 1238–1247 (2022)Cite this article

27k Accesses
33 Citations
120 Altmetric
Metrics details

Subjects

Abstract

Most endogenous retroviruses (ERVs) in mammals are incapable of retrotransposition; therefore, why ERV derepression is associated with lethality during early development has been a mystery. Here, we report that rapid and selective degradation of the heterochromatin adapter protein TRIM28 triggers dissociation of transcriptional condensates from loci encoding super-enhancer (SE)-driven pluripotency genes and their association with transcribed ERV loci in murine embryonic stem cells. Knockdown of ERV RNAs or forced expression of SE-enriched transcription factors rescued condensate localization at SEs in TRIM28-degraded cells. In a biochemical reconstitution system, ERV RNA facilitated partitioning of RNA polymerase II and the Mediator coactivator into phase-separated droplets. In TRIM28 knockout mouse embryos, single-cell RNA-seq analysis revealed specific depletion of pluripotent lineages. We propose that coding and noncoding nascent RNAs, including those produced by retrotransposons, may facilitate ‘hijacking’ of transcriptional condensates in various developmental and disease contexts.

m⁶A RNA methylation regulates the fate of endogenous retroviruses

Article 13 January 2021

The RNA m⁶A reader YTHDC1 silences retrotransposons and guards ES cell identity

Article 03 March 2021

Transcription of MERVL retrotransposons is required for preimplantation embryo development

Article Open access 02 March 2023

Main

ERVs make up around 10% of mammalian genomes, and ERVs are repressed by multiple mechanisms including heterochromatin, DNA methylation and modification of their RNA transcripts^{1,2,3,4,5,6,7,8,9,10,11,12,13}. One of the best-studied repressive pathways involves the TRIM28 heterochromatin corepressor that is recruited by KRAB-ZFP transcription factors to ERVs in pluripotent embryonic stem cells (ESCs), where it recruits the histone H3 K9 methyltransferase SETDB1 and the heterochromatin protein HP1α that together establish a repressive chromatin environment^{7,9,14,15,16,17}. ERV derepression is associated with lethality at various embryonic stages and in ESCs deficient for the TRIM28-HP1α pathway^7,8,9,10, although most ERVs in mice and humans have lost their ability to undergo retrotransposition^1,2,3,4,5,6, and deletion of entire clusters of KRAB-ZFP factors does not lead to elevated transposition rates in mice¹⁸. These findings suggest that RNA transcripts produced by ERVs may contribute to developmental phenotypes associated with ERV derepression.

RNA has long been recognized as a component of phase-separated biomolecular condensates, including stress granules, splicing speckles and Cajal bodies¹⁹, and recent studies indicate that RNA may make important contributions to nuclear condensates formed by transcriptional regulatory proteins²⁰. During transcription, nascent RNA is thought to promote formation of transcriptional condensates enriched in RNA polymerase II (RNAPII) and the Mediator coactivator through electrostatic interactions that contribute to phase separation²¹. ERV RNAs can be transcribed at hundreds, if not thousands, of genomic loci, and many nuclear noncoding RNAs localize to the loci where they are produced²². These data lead us to hypothesize that ERV RNA transcripts may impact the genomic distribution of transcriptional condensates in cells deficient for ERV repression.

Here, we test the model that ERV RNA transcripts contribute to lethality associated with ERV derepression through disrupting the genomic distribution of transcriptional condensates. We found that RNAPII-containing condensates, typically associating with SE-driven pluripotency genes, are hijacked by transcribed ERV loci upon acute perturbation of the machinery responsible for ERV repression in ESCs. Condensate association was dependent on ERV RNAs and was rescued by ERV RNA knockdown or forced expression of pluripotency transcription factors. The results highlight an important role of ERV RNA transcripts in nuclear condensates in pluripotent cells.

Results

Rapid and selective degradation of TRIM28 in mESCs

ERVs, including intracisternal A-type particles (IAPs), are bound by members of the TRIM28-HP1α pathway and marked by H3K9me3 in murine ESCs (mESCs)^{7,9,14,15,16,17} (Fig. 1a,b and Supplementary Fig. 1a–g). In contrast, heterochromatin components tend not to occupy enhancers bound by the pluripotency transcription factors (TFs) OCT4, SOX2 and NANOG that drive the cell-type-specific transcriptional program of mESCs^7,23,24 (Fig. 1a,b and Supplementary Fig. 1h–j).

**Fig. 1: TRIM28 degradation leads to the reduction of SE transcription and loss of transcriptional condensates at SEs in mESCs.**

Resolving the direct consequences of ERV derepression has been impeded, in part, by limitations of classic gene disruption strategies and the essential nature of the TRIM28-HP1α pathway in mESCs^7,8,9,10. To overcome these challenges, we generated an mESC line that encodes degradation-sensitive TRIM28-FKBP alleles using the dTAG system (Fig. 1c and Supplementary Fig. 2a,b)²⁵. Directed differentiation and tetraploid aggregation assays confirmed that TRIM28-FKBP mESCs maintained pluripotency and a gene expression profile similar to parental V6.5 mESCs (Supplementary Fig. 2d–n). Endogenously tagged TRIM28 experienced reversible, ligand-dependent proteolysis with near-complete degradation after 6 h of exposure to the dTAG-13 ligand (Fig. 1d and Supplementary Fig. 2b). Quantitative mass spectrometry confirmed that TRIM28 degradation was highly selective up to 24 h of dTAG-13 treatment (Supplementary Fig. 2c). Short-term (up to 24 h) TRIM28 degradation did not substantially alter the protein levels of pluripotency markers (for example, OCT4, SOX2, SSEA-1) (Fig. 1d and Supplementary Fig. 2j-l), suggesting that acute TRIM28 degradation did not markedly alter the pluripotent state.

Reduced SE transcription in TRIM28-degraded mESCs

To monitor changes in transcriptional activity upon acute TRIM28 degradation, we used TT-SLAM-seq, a recently developed genome-wide nascent transcription readout²⁶. TT-SLAM-seq combines metabolic labeling and chemical nucleoside conversion (SLAM-seq)²⁷ with selective enrichment of newly synthesized RNA (TT-seq)²⁸ to detect nascent RNA transcription with high temporal resolution and sensitivity (Supplementary Fig. 3a–c). Consistent with previous reports^7,29, we observed derepression of several main classes of ERVs, including IAPs, MMERVK10C and MMERVK9C elements in TRIM28-degraded ESCs (Fig. 1e and Supplementary Fig. 4a–f) and loss of H3K9me3 at these sites (Supplementary Fig. 4g). Derepression of ERVs was also confirmed with extended TRIM28 degradation for 96 h and RNA-seq (Fig. 1e and Supplementary Fig. 4a–e). The TT-SLAM-seq data revealed around 250 genes whose transcription was significantly induced and around 300 genes whose transcription was significantly reduced upon 24 h of TRIM28 degradation (greater than twofold, false discovery rate (FDR) < 0.05) (Fig. 1f,g). The downregulated genes were enriched for SE-associated pluripotency genes (NES = −1.6, P < 10⁻³) (Fig. 1h). Downregulation of these genes was associated with the reduction of nascent transcription at the SEs (Fig. 1i and Extended Data Fig. 1a–d), which tended to precede the reduction of transcription at the SE-driven gene (Fig. 1f and Extended Data Fig. 1a–c). These results were unexpected, as TRIM28 binds to ERVs in mESCs and is not bound at enhancers or SEs (Fig. 1b and Supplementary Fig. 1i,j). These data reveal the direct transcriptional response to the loss of TRIM28 and suggest that acute TRIM28 degradation leads to reduction of SE transcription in ESCs.

Reduced SE-condensate association in TRIM28-degraded mESCs

Components of the transcription machinery, for example, RNAPII and the Mediator coactivator, form biomolecular condensates that associate with SEs in ESCs^30,31,32,33, and the presence of RNAPII condensates at genomic sites correlates with elevated transcriptional activity³². We thus hypothesized that reduction of SE transcription in TRIM28-degraded ESCs may be caused by reduced association of transcriptional condensates with SE loci. To test this idea, we visualized the genomic region containing the well-studied SE at the miR290-295 locus using nascent RNA-fluorescence in situ hybridization (FISH) and transcriptional condensates using immunofluorescence (IF) against RNAPII³¹. RNAPII puncta consistently colocalized with the miR290-295 locus in control ESCs, and the colocalization was reduced after 24 h of TRIM28 degradation (Fig. 1j and Extended Data Fig. 2a), while the overall level of RNAPII did not change (Extended Data Fig. 2b,c). Similar results were observed at the Fgf4 SE locus (Extended Data Fig. 2d,e). These data indicate that transcriptional condensates associate less with SE loci in TRIM28-degraded ESCs.

To further probe colocalization between RNAPII condensates and SEs, we used live-cell super-resolution photoactivated localization microscopy (PALM)³². We used an mESC line that encodes 24 copies of an MS2 stem-loop integrated into the SE-driven Sox2 gene, a transgene encoding the MCP MS2-binding protein with a SNAP tag (MCP-SNAP) and Rpb1 RNAPII subunit endogenously tagged with the Dendra2 photoconvertible fluorophore. We integrated the degradation-sensitive FKBP tag into the Trim28 locus in these cells, enabling acute TRIM28 degradation (Extended Data Fig. 2f). In this system, the MCP-SNAP protein can be used to visualize nascent RNA produced by the Sox2 gene, and the Dendra2 tag can be used to track RNAPII clusters³². We then visualized RNAPII clusters for 2 min in live mESCs using PALM and measured the size and distance of the RNAPII cluster nearest to the Sox2 locus. We found that 24 h dTAG treatment led to a significant reduction in the size of the RNAPII cluster nearest to the Sox2 locus (P = 2 × 10⁻⁴, Wilcoxon–Mann–Whitney test) (Fig. 1k) and an increase in the distance between the locus and the nearest RNAPII cluster (P = 0.04, Wilcoxon–Mann–Whitney test) (Fig. 1k), while the global size of RNAPII clusters in the cells and the average number of RNAPII clusters per cell did not change (Fig. 1k). These data indicate reduced association of RNAPII condensates at the Sox2 SE locus upon acute TRIM28 degradation in live cells.

Derepressed IAP RNA foci overlap RNAPII condensates

To investigate whether transcriptional condensates colocalize with derepressed ERVs, we visualized IAP ERV loci with RNA-FISH. Nuclear IAP foci became progressively apparent after 24–48 h of TRIM28 degradation (Fig. 2a and Extended Data Fig. 3a), and some nuclear IAP foci colocalized with RNAPII puncta visualized with IF (mean Manders’ overlap coefficient (M_OC), 0.193; n = 24 cells) (Fig. 2a and Extended Data Fig. 3a). Colocalization of IAP foci was similarly observed with Mediator puncta visualized with IF using antibodies against the MED1 (M_OC, 0.135; n = 24 cells) (Fig. 2b) and MED23 Mediator subunits (Extended Data Fig. 3b). Overall, ~20% of IAP foci were located within 200 nm of an RNAPII or MED1 puncta, a distance range compatible with regulatory interactions³² (Fig. 2c). Consistent with the colocalization of transcriptional condensates with IAP foci and their reduced colocalization with SEs, the occupancy of RNAPII, Mediator and the transcription-associated H3K27Ac chromatin mark increased at various ERV families already after 24 h of TRIM28 degradation, while their enrichment was reduced at SEs (Fig. 2d and Supplementary Fig. 5a,b).

**Fig. 2: Derepressed IAPs form nuclear foci that associate with RNAPII condensates and incorporate nearby genes.**

As transcriptional condensates may associate with multiple distant DNA sites, we explored the possibility that condensates associating with ERVs incorporate ERV-proximal genes. To this end, we visualized the Cthrc1 locus using nascent RNA-FISH, as Cthrc1 was among the top upregulated genes in the TT-SLAM-seq data after 24 h of TRIM28 degradation and is located within 100 kb of three ERVs (Fig. 2e). We found that the Cthrc1 locus colocalized with RNAPII puncta in TRIM28-degraded ESCs (Fig. 2f and Extended Data Fig. 4a). The locus also colocalized with puncta formed by the NFY TF, whose motif is highly enriched in the long terminal repeat (LTR) of IAPs and other ERVs (Fig. 2g and Extended Data Fig. 4b,c), but not with puncta of a control TF NRF1 (Extended Data Fig. 4d). Transient (30 min) treatment of the cells with 1.5% 1,6 hexanediol (1-6 HD)—a short chain aliphatic alcohol that dissolves various biomolecular condensates including RNAPII condensates³² (Extended Data Fig. 3c,d)— reduced the level of Cthrc1 nascent RNA (twofold, P < 0.05, t-test) in TRIM28-degraded cells, indicating that RNAPII condensates contribute to the upregulation of this gene (Extended Data Fig. 3e). We then used CRISPR–Cas9 to delete the three ERVs at the Cthrc1 locus and found that, in the absence of the three ERVs, induction of Cthrc1 and other genes in the locus was compromised upon TRIM28 degradation (Fig. 2h and Supplementary Fig. 6a–e). To further probe contacts between derepressed ERVs and genes, we performed in situ Hi-C in control and TRIM28-degraded ESCs. We found that 24 h of TRIM28 degradation did not lead to marked genome-wide changes in chromatin contacts (Supplementary Figs. 7a,b and 8a,b) but did lead to a shift of the most-induced ERV taxa from the inactive ‘B’ towards the active ‘A’ compartment (Fig. 2i and Supplementary Fig. 7c) and a moderate increase in the contact frequency of ERVs with transcribed genes and SEs (Fig. 2j and Supplementary Fig. 7d). These results demonstrate that transcriptional condensates may incorporate genes proximal to derepressed ERVs.

SE-enriched TFs rescue condensate localization

RNAPII- and Mediator-containing condensates are thought to be anchored at SEs by TFs that are enriched at these sites³⁴. One would thus expect that overexpression of SE-enriched TFs rescues the reduced association of transcriptional condensates with SEs in TRIM28-degraded cells. To test this idea, we generated degradation-sensitive TRIM28-FKBP alleles in an induced pluripotent stem cell (iPSC) line that contains integrated transgenes encoding OCT4, SOX2, KLF4 and MYC under a doxycycline-inducible promoter (Fig. 3a,b and Extended Data Fig. 5a,b)³⁵. The OCT4, SOX2 and KLF4 TFs are highly enriched at SEs in ESCs³⁶. TRIM28 degradation in the iPSCs led to the appearance of IAP foci as revealed by IAP RNA-FISH (Fig. 3c,d). Overexpression of OCT4, SOX2, KLF4 and MYC substantially reduced the fraction of iPSCs containing IAP foci (Fig. 3c,d) and overall IAP RNA level in the cell population (Fig. 3e). Furthermore, OCT4, SOX2, KLF4 and MYC overexpression rescued the extent of colocalization of RNAPII puncta with the miR290-295 SE locus in TRIM28-degraded cells (Fig. 3f) while the overall levels of RNAPII subunits did not change (Extended Data Fig. 5c,d). OCT4, SOX2, KLF4 and MYC overexpression also partially rescued the downregulation of the miR290-295 SE RNA and Pri-miR290-295 transcript in TRIM28-degraded cells (Fig. 3g) and nascent transcript levels at the Klf4, Fgf4, Oct4 and Mycn SE loci (Extended Data Fig. 5e–h). These results suggest that forced expression of SE-binding TFs prevents the loss of transcriptional condensates at the miR290-295 SE locus and attenuates IAP induction in TRIM28-degraded cells.

Roles of ERV RNA in condensate formation and localization

RNA is a key component of numerous biomolecular condensates¹⁹ and nascent RNA can enhance phase separation of transcriptional regulatory proteins²¹. Therefore, we hypothesized that RNA produced at ERV loci may contribute to the genomic localization of RNAPII-containing condensates. To test this idea, we knocked down various ERV RNAs in TRIM28-degraded cells. Expression of shRNAs targeting the four most prominent ERV families (IAPs, MMERVK10Cs, MMERVK9Cs, MMETns) partially rescued the downregulation of SEs and their associated genes after 24 h of TRIM28 degradation while knocking down IAPs alone did not (Extended Data Fig. 6a–d). However, expression of the shRNAs for 24 h before inducing TRIM28 degradation (for 24 h) almost entirely rescued the upregulation of ERV transcript levels (Fig. 4a,b and Extended Data Fig. 6e,g), the appearance of IAP RNA-FISH foci (Extended Data Fig. 6h), reduced transcription at SEs and their associated genes (Fig. 4b,c and Extended Data Fig. 6f) and the reduced association of RNAPII condensates at the mir290-295 SE locus (Fig. 4d and Extended Data Fig. 6i–j). These results indicate that knockdown of ERV RNAs rescues the decrease of SE transcription and reduced condensate localization at SEs in TRIM28-degraded cells.

**Fig. 4: Contributions of IAP RNA to condensate localization in vivo and condensate formation in vitro.**

To further dissect the relationship between ERV RNA and transcriptional condensates, we performed in vitro reconstitution experiments. We purified recombinant, mCherry-tagged C-terminal domain (CTD) of RNAPII, which was shown previously to form condensates in vitro^30,33, and mixed it with fluorescein-labeled in vitro transcribed IAP RNA fragments. The IAP RNA fragments facilitated RNAPII CTD droplet formation in a dose-dependent manner (Fig. 4e,f and Extended Data Fig. 7a), and the IAP RNA was enriched within RNAPII CTD droplets in a dose-dependent manner (Fig. 4e,g). IAP RNA also facilitated condensation of the intrinsically disordered region (IDR) of the MED1 Mediator subunit (Extended Data Fig. 7b–e)—a frequently used in vitro model of Mediator^31,33,34. Furthermore, IAP RNA enhanced droplet formation of purified recombinant HP1α (an in vitro model of heterochromatin³⁷), but the optimal concentration of the RNA for HP1α was about fivefold lower than that for MED1 IDR in this in vitro system (Fig. 4h and Extended Data Fig. 8a,b). As expected, various other RNAs for example SE RNA²¹ and RNA from main satellite repeats³⁸ also enhanced droplet formation of MED1 IDR and HP1α in vitro, but the difference in the optimal RNA concentration stayed consistently about fivefold (Extended Data Fig. 8a,b). Moreover, IAP RNA fragments facilitated partitioning of both the MED1 IDR and NFYC-IDR into IAP-RNA-containing heterotypic droplets (Fig. 4i,j and Extended Data Fig. 7f–i). These results indicate that IAP RNA can enhance droplet formation of key transcriptional regulatory proteins, and suggest a mechanistic basis for the difference of the effect of RNA on heterochromatin and transcriptional condensates.

Transgenic ERVs compete with SEs for activators

Derepressed ERV loci seem to compete for transcriptional condensates with SEs, in part through producing RNA that facilitates condensation of transcriptional activators. To probe this competition directly, we investigated whether simultaneously activated transcription at repetitive loci (for example, ERVs) could compromise transcription at SEs. First, we attempted to activate IAPs using CRISPRa³⁹, but targeting a dCas9-VP64 protein to IAPs with several guide RNAs failed to produce meaningful transcription at those elements. We then mimicked the effect of simultaneous ERV induction by generating an mESC line containing multiple copies of an integrated PiggyBac transposon (Extended Data Fig. 9a–d). The transposon encoded a Dox-inducible green fluorescent protein (GFP) transgene with a polyA between two loxP sites, and ~900 bp fragments of IAPEz ERVs (Fig. 4k). Transfection of a plasmid encoding a Cre-recombinase enabled the generation of an isogenic line (‘IAPEz line’) encoding Dox-inducible IAPEz transgenes with the same copy number and insertion sites as GFP in the parental mESC line (‘GFP line’) (Fig. 4k). Quantitative PCR with reverse transcription (qRT–PCR) analyses confirmed induction of either GFP or IAPEz transcription upon Dox treatment in the respective lines (Fig. 4l). Moreover, induction of IAPEz transcription led to a rapid reduction of SE transcription at the miR290-295, Klf4 and Fgf4 loci, and reduced transcript levels of the associated genes, whereas it generally did not affect transcript levels of typical enhancer-associated genes (Fig. 4m and Extended Data Fig. 9e). In contrast, induction of GFP transcription had only a mild effect on SEs (Fig. 4m and Extended Data Fig. 9e). Consistent with the specific effect of IAPEz RNA induction, cellular fractionation experiments revealed that about twice as much of the IAPEz RNA is retained in the nuclear fraction compared with the GFP RNA (Extended Data Fig. 9f). Similar results were observed in a second pair of mESC lines in which the IAPEz fragment was substituted with fragments of MMERVK10C ERVs of around 900 bp (Extended Data Fig. 9g–i). Induction of MMERVK10C transcription from a PiggyBac transposon compromised transcription at SEs and their associated genes (Extended Data Fig. 9g–i). These results demonstrate that simultaneous activation of transgenic ERVs may compromise SE transcription in mESCs, and the ERV RNA seems to play an important role in this process.

ERV derepression correlates with loss of pluripotent cells

The above results suggest that pluripotent stem cells fail to maintain transcription of SE-driven genes when ERV repression is compromised. This model predicts that the amount of ERV products would correlate with the inability of embryos to maintain a pluripotent compartment. To test this model in vivo, we used our recently developed zygotic perturbation platform (Fig. 5a)^40,41,42. We generated zygotic deletion mutants of TRIM28, SETDB1, HP1α and other epigenetic regulators implicated in ERV repression, and assayed the timing and amount of the GAG protein produced by IAPs (Fig. 5b). IAP GAG foci were detected in E3.5 blastocysts of TRIM28, SETDB1 and KDM1A knockout (KO) mutants, and these mutations were lethal at around E6.5 in embryos (Fig. 5b,c) suggesting that early appearance of IAP GAG foci may correlate with the onset of embryonic lethality.

**Fig. 5: Early ERV activation correlates with depletion of pluripotent lineages in mouse embryos.**

To probe which cell types are affected by derepression of ERVs, we used single-cell RNA-seq (scRNA-seq). We created a reference cell state map of an early mouse embryo spanning a window of developmental stages (E5.5 to E7.0) that encompass the onset of lethality in TRIM28 KO mice (around E6.5) (Extended Data Fig. 10a–f,i)^7,43,44. We then generated TRIM28-deficient embryos using Cas9/sgRNA delivery into zygotes (Extended Data Fig. 10a–c) and mapped cell states using scRNA-seq (Fig. 5d and Extended Data Fig. 10g). The E6.5 TRIM28 KO scRNA-seq data revealed a dramatic scarcity of epiblast cells normally derived from pluripotent cells of the inner cell mass (ICM) (Fig. 5d), and an abundance of extraembryonic lineages (for example, parietal endoderm) (Fig. 5e and Supplementary Fig. 9a,b). Quantifying the fraction of reads mapping to various ERV taxa revealed that IAPs and MMERVK10Cs were derepressed in all cell types, while MMERVK9Cs and MMETns were derepressed specifically in epiblast cells, resulting in a higher total fraction of reads from ERVs (Fig. 5e and Extended Data Fig. 10h). These data indicate that the amount of ERV RNA transcripts is especially high in pluripotent cells in TRIM28 KO embryos.

IF imaging corroborated specific depletion of pluripotent cells in early embryos. The pluripotency factors NANOG, OCT4, SOX2 and KLF4 were already virtually absent in the ICM of TRIM28 KO E3.5 blastocysts (Fig. 5f and Supplementary Fig. 10a-b), and the inner part of the blastocysts was instead populated by cells expressing the endoderm marker GATA6 (Fig. 5f). This phenotype is reminiscent of NANOG knockout embryos, in which the pluripotent ICM is replaced by GATA6-expressing parietal endoderm-like cells (Fig. 5f)⁴⁵. These data are consistent with upregulation of endoderm markers observed in TRIM28-degraded ESCs (Supplementary Fig. 11a-c). Overall, these findings suggest that extended ERV derepression results in the loss of expression of pluripotency genes and consequent depletion of pluripotent cells in early mouse embryos.

Discussion

The results presented here support a model in which ERV retrotransposons have the capacity to hijack biomolecular condensates formed by key transcriptional regulatory proteins in pluripotent cells (Fig. 5g). This model may help explain why thousands of transposition-incapable ERVs are repressed in mammals and how their reactivation could alter cellular fates in the absence of retrotransposition^{7,9,18,46,47,48,49,50}.

Derepressed ERVs seem to compete with SEs for transcriptional condensates in pluripotent cells, in part through the RNA transcripts they produce. In vitro, various RNA species facilitate phase separation of transcriptional regulators and heterochromatin proteins, primarily via engaging in electrostatic interactions^21,38. In ESCs, forced transcription of ERV RNA from multiple loci led to a profound decrease in SE transcription while transcription of GFP RNA had a moderate effect (Fig. 4k–m and Extended Data Fig. 9g–i), suggesting sequence contribution to the impact of ERV RNAs on RNAPII-containing condensates in vivo. Recent studies reported that m6A methylation plays an important role in repressing ERV transcripts^11,12,13. The contribution of ERV RNA to the hijacking of transcriptional condensates from SEs may explain the in vivo role and importance of ERV RNA modifications.

Many nuclear noncoding RNAs are known to localize to the loci where they are produced²², but their functions are mysterious. Transient expression of ERVs and nuclear IAP foci have been described in early-stage human and mouse embryos^48,49,51,52 and adult immune cells⁵³, suggesting that ERV RNAs may play important roles in transcriptional programs during mammalian development. Consistently, previous studies suggested that derepressed transposons can act as enhancers and alternative promoters of cellular genes^{29,48,53,54,55}. RNA transcripts produced by ERVs may thus contribute to the genomic distribution of condensates in various developmental contexts.

Condensate hijacking by ERVs may contribute to disease. Trim28 haploinsufficiency is associated with obesity⁵⁶ and predisposes to Wilms’ tumor⁵⁷. Some ERVs may function as enhancers in acute myeloid leukemia⁵⁸, and ERV transcription is associated with neurological diseases⁵⁹ such as amyotrophic lateral sclerosis⁶⁰ and schizophrenia⁶¹. The capacity of ERV RNAs to hijack transcriptional condensates may shed light on the molecular basis of these conditions.

Methods

Licenses

All animal procedures were performed in our specialized facility, following all relevant animal welfare guidelines and regulations, approved by the Max Planck Institute for Molecular Genetics and LAGeSo, Berlin (license number, G0247/18-SGr1; and Harvard University (IACUC protocol 28-21)). S2 work was performed following all relevant guidelines and regulations, approved by the Max Planck Institute for Molecular Genetics and the local authorities LAGeSo, Berlin (license number, 222/15-17a).

Cell culture

The V6.5 mESCs and iPSCs were cultured on irradiated primary mouse embryonic fibroblasts (MEFs) under standard serum/leukemia inhibitory factor (LIF) conditions (KO DMEM containing 15% fetal bovine serum (FBS), supplemented with 1× penicillin/streptomycin, 1× GlutaMAX supplement, 1× nonessential amino acids, 0.05 mM β-mercaptoethanol (all from Gibco) and 1,000 U ml^–1 LIF).

For ChIP–seq, TT-SLAM–seq and RNA-seq experiments, mESCs were depleted from MEFs by incubating them on gelatin-coated cell culture plates for 45 min at 37 °C, allowing MEFs to attach while mESCs remain in suspension. MEF depletion was performed twice, after which mESCs were seeded on gelatin-coated plates and maintained in serum/LIF conditions with 2,000 U ml^–1 LIF.

For RNA-FISH combined with IF, MEF-depleted cells were grown on round 18-mm glass coverslips (Roth LH23.1). Coverslips were coated with 5 µg ml^–1 of poly-l-ornithine (Sigma-Aldrich, catalog no. P4957) for 30 min at 37 °C and with 5 µg ml^–1 of Laminin (Corning, catalog no. 354232) overnight at 37 °C.

To perturb RNAPII condensates, cells were treated for 30 min with 1.5% 1-6 HD (Sigma) in serum/LIF conditions with 2,000U ml^–1 LIF (Extended Data Fig. 3c,d).

Generation of the TRIM28-FKBP ESC line

To knock in the degradation-sensitive FKBP^F36V tag at the N-terminus of TRIM28, a repair template containing homology arms spanning upstream and downstream of the target site was cloned into a pUC19 vector (NEB) (Supplementary Fig. 2a). The repair template included a mRuby2 fluorescent protein sequence, P2A linker and the FKBP tag sequence (Supplementary Fig. 2a)²⁵. mRuby2 sequence was amplified from the mRuby2-N1 plasmid (Addgene, catalog no. 54614), and the P2A-FKBP sequence was amplified from the PITCh dTAG donor vector (Addgene, catalog no. 91792). A guide RNA (Supplementary Table 1b) targeting the N-terminus of TRIM28 was cloned into the sgRNA-Cas9 vector pX458 (Addgene, catalog no. 48138). The repair template and the sgRNA-Cas9 vector were transfected into V6.5 mESCs and iPSCs by nucleofection using Amaxa 4D Nucleofector X Unit (Lonza) according to the manufacturer’s instructions. To screen for positive integrations, the transfected cells were sorted for mRuby2 fluorescent protein expression with flow cytometry. The sorted cells were seeded as single cells and expanded for a few days. Single colonies were picked and genotyped for the correct integration with Western blot.

TRIM28 degradation

Before treatment, cells were seeded on 0.2% gelatin-coated plates after two rounds of MEF depletion. For degradation of TRIM28, 500 nM of dTAG-13 compound²⁵ was mixed with mESC medium (supplemented with 2,000 U mL–1 LIF) and incubated for the time indicated; medium was changed daily for fresh dTAG-13.

RNA-FISH combined with IF

RNA-FISH combined with IF was performed essentially as described³¹. For IF, dTAG-13- or DMSO- treated cells were fixed in 4% paraformaldehyde for 10 min at RT and stored in PBS at 4 °C. All buffers and antibodies were diluted in RNase-free PBS (Thermo Fisher, catalog no. AM9624). Cells were permeabilized with 0.5% Triton X-100 (Thermo Fisher, catalog no. 85111) for 10 min at RT, followed by three consecutive 5 min PBS washes. Cells were then incubated in the primary antibody (RNAPII (Abcam, catalog no. ab817) at 1:500, NFY-A (Santa Cruz, catalog no. sc-17753 X) at 1:250, NRF1 (Abcam, catalog no. ab55744) at 1:500, MED1 (Abcam, catalog no. ab64965) at 1:500 and MED23 (Bethyl Labs, catalog no. A300-425A) in PBS overnight. After two 5 min PBS washes, cells were incubated in the secondary antibody (Invitrogen, goat anti-mouse Alexa Fluor 488 (catalog no. A-11001) or goat anti-rabbit Alexa Fluor 488 (catalog no. A-11008)) at 1:500 in PBS for 60 min at room temperature. Cells were washed twice in PBS for 5 min and re-fixed with 4% paraformaldehyde in PBS for 10 min at room temperature. Following two 5 min PBS washes, cells were washed once with 20% Stellaris RNA-FISH Wash Buffer A (Biosearch Technologies, catalog no. SMF-WA1-60) and 10% deionized formamide (EMD Millipore, catalog no. S4117) in RNase-free water (Invitrogen, catalog no. 10977035) for 5 min at RT. Cells were hybridized with 90% Stellaris RNA-FISH Hybridization Buffer (Biosearch Technologies, catalog no. SMF-HB1-10), 10% deionized formamide and 12.5 or 25 µM Stellaris RNA-FISH probes. Probes were hybridized in a humidified chamber overnight at 37 °C. Cells were washed with Wash Buffer A for 30 min at 37 °C and stained with 0.24 µg ml^–1 4,6-diamidino-2-phenylindole (DAPI) in Wash Buffer A for 3 min at room temperature. Cells were washed with Stellaris RNA-FISH Wash Buffer B (Biosearch Technologies, catalog no. SMF-WB1-20) for 5 min at RT, mounted onto glass microscopy slides with Vectashield mounting medium (Vector Laboratories, catalog no. H-1900) and sealed using transparent nail polish. Images were acquired with LSM880 Airyscan microscope equipped with a Plan-Apochromat ×63/1.40 oil differential interference contrast objective or Z1 Observer (Zeiss) microscope with ×100 magnification with Zen 2.3 v.2.3.69.1016 (blue edition) or Zen (black edition). Images were processed with ZEN 3.1 (Zeiss) and ImageJ software v.2.1.0/1.53i (Figs. 1j, 2a–b, 2f–g, 3f and 4d and Extended Data Figs. 2a, 3a–b,d, 4d and 6h). ImageJ colocalization plugins were used for colocalization analysis of ERV IAP RNA-FISH with RNAPII and MED1 IF^62,63. For nearest RNAPII cluster distance analysis in the miR290-295 RNA-FISH dataset, z-projections consisting of ±4.5 slices around the FISH spot were obtained in both channels and thresholded to allow detection of individual RNAPII clusters. Center of mass distances to the nearest cluster were calculated using FIJI (DiAna)⁶³. RNA-FISH probes were designed and generated by Biosearch Technologies Stellaris RNA-FISH to target introns of miR290-295 primary transcript and Cthrc1, and IAPEz transcripts. Sequences of RNA-FISH probes are available in Supplementary Table 1a.

Live-cell PALM

Live-cell PALM imaging was carried out as described before^32,64,65. mESCs used for live-cell PALM imaging were derived from R1 background, with the Sox2 gene tagged with 24 repeats of MS2 stemloops at its mRNA 3′ end, Rpb1 tagged with Dendra2 at its N-terminus, EF1α-NLS-MCP-SNAP inserted stably into the genome and both alleles of Trim28 tagged with the degradation-sensitive FKBP tag (Extended Data Fig. 2f). Cells were simultaneously illuminated with 1.3 W cm^–2 near UV light (405 nm) for photoconversion of Dendra2 and 3.2 kW cm^–2 (561 nm) for fluorescence detection with an exposure time of 50 ms. We acquired images of Dendra2-RNAPII for 100 s (2,000 frames) for quantification of Pol II clusters. For dual-color imaging, cells were incubated with 100 nM JF646 SNAP ligand for 20 min and washed with 2i medium, followed by 30 min incubation in 2i media without JF646-HaloTaq ligands, to wash out unbound SNAP ligands before fluorescence imaging in L-15 medium. We acquired 50 frames (2.5 s) with 642 nm excitation with a power intensity of 2.5 kW cm^–2 and quickly switched to simultaneous 405/561 imaging for PALM. Super-resolution images were reconstructed and analyzed using MTT⁶⁶ and qSR⁶⁷. RNAPII cluster size was defined as the total number of localizations within the image acquisition time (100 s). The distance was calculated as the distance between the center of the MS2 nascent transcription site and the center of the nearest RNAPII cluster (Fig. 1k).

TT-SLAM-seq

TT-SLAM-seq was performed as described previously²⁶. Briefly, cells were treated with DMSO or 500 nM dTAG-13 for 2, 6 or 24 h and subjected to 15 min of 4-thiouridine (4sU) labeling using 500 µM 4sU. Total RNA was extracted with Trizol (Ambion) and 24:1 chloroform:isoamylalcohol (Sigma) while using 0.1 mM dithiothreitol (DTT) in isopropanol precipitation and ethanol washes. For each sample, 50 µg of total RNA was fragmented with Magnesium RNA Fragmentation Module (NEB), and fragmentation buffer was removed from samples with ethanol precipitation in presence of 0.1 mM DTT. RNA was then resuspended in 350 µl RNase-free water, diluted in biotinylation buffer (200 mM HEPES pH 7.5, and 10 mM EDTA) and topped up with 5 µg MTS-Biotin (previously diluted to 50 µg ml^–1 in dimethylformamide) to reach a final volume of 500 µl. The biotinylation reaction was incubated for 30 min at room temperature while keeping samples in rotation and protected from light. Unbound biotin was removed with acid-phenol:chloroform extraction (125:24:1, Ambion) and isopropanol precipitation. Biotinylated RNA was resuspended in 100 µl RNase-free water, denatured in 65 °C for 10 min and then cooled on ice for 5 min. The biotinylated RNA was captured with 100 µl µMACS streptavidin beads (Miltenyi) by incubating for 15 min in rotation while keeping samples protected from light. µMACS columns were equilibrated on magnetic stand with nucleic acid equilibration buffer and two times with biotinylation buffer (20 mM HEPES, 1 mM EDTA, pH 8). Beads were transferred to columns and washed three times with wash buffer (100 mM Tris-HCl pH 7.5, 10 mM EDTA, 1 M NaCl and 0.1 % Tween 20), and labeled RNA was eluted twice with a total 200 µl of 100 mM DTT. RNA was cleaned up with RNeasy Minelute columns (Qiagen) and eluted to RNase-free water with 1 mM DTT. 4sU residues of RNA were alkylated with iodoacetamide treatment (10 mM iodoacetamide in 50 mM NaPO₄, pH 8 and 50 % DMSO) by incubating samples in 50 °C for 15 min, followed by quenching with 20 mM DTT. RNA samples were purified with ethanol precipitation and treated with Turbo DNase (Invitrogen). Sequencing libraries were prepared with NEBNext Ultra II Directional RNA Library Prep Kit and NEBNext Multiplex Oligos (NEB), according to manufacturer’s instructions, except using 8 min incubation time in fragmentation step.

Generating wild-type and mutant mouse embryos

Zygotes were generated by in vitro fertilization (IVF) as previously described⁶⁸. Briefly, B6D2F1 female mice aged 7–9 weeks were superovulated with two rounds of hormone injections (5 IU of pregnant mare serum gonadotrophin followed by 5 IU of human chorionic gonadotrophin after 46 h). Oocytes were isolated and cultured in pre-gassed KSOM medium before IVF. F1 (C57BL/6J × Castaneous) sperm isolated from the cauda epididymis were thawed and used for IVF. At 6 h after fertilization, zygotes were washed in M2 medium for multiple rounds and then prepared for electroporation. Alt-R CRISPR–Cas9 and guide RNAs ribonucleoproteins were prepared as described previously⁴⁰. Guide RNAs used to target the genes are listed in Supplementary Table 1b. Zygotes were washed in three drops of OptiMEM Reduced Serum Medium (Thermo Fisher Scientific) before electroporation. NEPA21 electroporator (NEPAgene) was used for electroporating zygotes with the following settings for a small chamber: four poring pulses of 34 V for 2.5 ms with an interval of 50 ms were used to generate pores in the zona pellucida layer. Voltage decay was set at 10% and (+) polarity. To enable intake of the ribonucleoproteins, five transfer pulses of 5 V were applied for 50 ms each with an interval of 50 ms. Voltage decay for the transfer was set at 40% with an alternating polarity of (+) and (−). Electroporated zygotes were washed in three drops of KSOM medium and cultured in pre-gassed KSOM drops until blastocyst stage under standard embryo culture conditions. Blastocysts were scored for viability and morphology and retransferred bilaterally in a clutch of 15 blastocysts per uterine horn into day 2.5 pseudopregnant CD-1 surrogate female mice. E6.5 embryos were dissected from the uterus in 1× Hanks’ Balanced Salt Solution and used for further analysis. E5.5 wild-type embryos were generated with the setup, and mock electroporation with guide targeting GFP sequence was used.

scRNA-seq of embryos

E5.5 wild-type and E6.5 TRIM28 mutant embryos were dissected from the decidua in 1× Hanks’ Balanced Salt Solution and then washed in 1× PBS. Reichert’s membrane was removed carefully with sharp forceps and glass capillaries, and the embryos were washed in 1× PBS with 0.4% BSA. The embryos were disaggregated with TrypLE Express (Gibco) with gentle pipetting every 10 min up to a total of 40 min at 37 °C. The dissociated cells were counted for viability and then washed in 1× PBS with 0.4% BSA for a total of three washes at 4 °C and 1,200 r.p.m. for 5 min. The cells were subjected to scRNA-seq using a 10x Genomics Chromium Single Cell 3′ v.2 kit. Single-cell libraries were generated following the manufacturer’s instructions with the exception of the cycle number used. Libraries were sequenced on a Novaseq6000 with asymmetric reads and a depth of 300–350 million fragments per library.

Average image and radial distribution analysis

The image analysis pipeline used for the colocalization analysis of RNA-FISH combined with IF was described previously³¹. Briefly, MATLAB scripts were used to identify RNA-FISH foci in z stacks through intensity thresholding (the same threshold was used for image sets shown on the same figure panels) and create RNA-FISH signal centroids (x, y, z) that were stitched together and positioned in a box of size l = 1.5 μm. For identified FISH foci, signal from corresponding location in the IF channel was collected in the l × l square centered at the RNA-FISH focus at each corresponding z-slice. The IF signal centered at FISH foci for each FISH and IF pair were then combined to calculate an average intensity projection, providing averaged data for IF signal intensity within a l × l square centered at FISH foci. The same process was carried out for the FISH signal intensity centered on its own coordinates, providing averaged data for FISH signal intensity within a l × l square centered at FISH foci. As a control, this same process was carried out for IF signal centered at random nuclear positions generated using custom Python scripts. These average intensity projections were then used to generate two-dimensional contour maps of the signal intensity or radial distribution plots. Contour plots are generated using inbuilt functions in MATLAB. The intensity radial function ((r)) is computed from the average data. For the contour plots of the IF channel, an intensity colormap consisting of 14 bins with gradients of black, violet and green was generated. For the FISH channel, black to magenta was used. The generated colormap was employed to 14 evenly spaced intensity bins for all IF plots. The averaged IF centered at FISH or at randomly selected nuclear locations were plotted using the same color scale. For the radial distribution plots, the Spearman correlation coefficients, r, were computed and reported between the FISH and IF (centered at FISH) signal. A two-tailed Student’s t-test, comparing the Spearman correlation calculated for all pairs, was used to generate P values (Figs. 1j, 2f–g, 3f and 4d and Extended Data Figs. 2a and 4d).

Bioinformatics

All analyses were carried out using R v.3.6.3 unless stated otherwise.

TT-SLAM-seq processing

Raw reads were trimmed by quality, Illumina adapter content and polyA content analogous to the RNA-seq samples and aligned with STAR with parameters ‘–outFilterMultimapNmax 50–outReadsUnmapped Fastx’ to the SILVA database⁶⁹ (downloaded 6 March 2020) to remove rRNA content. Unaligned reads were afterwards reverse-complemented using the seqtk ‘seq’ command (https://github.com/lh3/seqtk, v.1.3-r106; parameters: -r). Reverse-complemented reads were processed using SLAM-DUNK⁷⁰ with the ‘all’ command (v.0.4.1; parameters: -rl 100 -5 0) with the GENCODE gene annotation (VM19) as ‘-b’ option. Reads with a ‘T > C’ conversion representing nascent transcription were filtered from the BAM files using alleyoop (provided together with SLAM-DUNK) with the ‘read-separator’ command. Counts per gene were quantified based on the ‘T > C’-converted reads using htseq-count (v.0.11.4; parameters:–stranded=yes,–nonunique=all)⁷¹. FPKM values were calculated based on the resulting counts. For genome-wide coverage tracks, technical replicates were merged using samtools ‘merge’⁷². Coverage tracks for single and merged replicates were obtained using deepTools bamCoverage⁷² (v.3.4.3; parameters:–normalizeUsing CPM) separately for the forward and reverse strand based on the ‘T > C’-converted reads.

Enhancer and SE annotation

The annotation of SErs, enhancers and enhancer constituents was taken from Whyte et al.⁷³. Coordinates were lifted from mm9 to mm10 using UCSC liftOver. These coordinates were used throughout this study for all enhancer-associated analyses (Supplementary Table 2).

Retrotransposon element definition

The genome-wide retrotransposon annotation of LTR, LINE and SINE elements was downloaded from Repbase⁷⁴. Based on the Repbase classification system, we used the element annotation as LTR, LINE or SINE as the retrotransposon classes. Retrotransposon families considered in this study were L1 and L2 elements (LINE), ERV1, ERV3, ERVK, ERVL and MALR (LTR), as well as Alu, B2, B4 and MIR elements (SINE). Repeat subfamilies used in this study were subdivided into IAP, MMERVK and MMETn (ERVK) elements. IAPs and MMERVKs consist of multiple different subfamilies as annotated by Repbase (Supplementary Figs. 1a and 4a), which we summarized under these broader terms. The classification is consistent with retrotransposon classification described in previous studies^1,75,76.

Full-length retrotransposons were defined based on the Repbase repeat annotation. For full-length ERVK elements, we required the element to consist of an inner part with two flanking LTRs. First, elements annotated as inner parts (containing the keyword ‘int’) were merged if they belonged to the same subfamily and were located within maximal 200 bp of each other. Second, only the merged inner parts with an annotated ERVK LTR within a distance of, at most, 50 bp on each side were selected as full-length element candidates. For IAPs specifically, only LTRs that belonged to an IAP subfamily were considered. No size restrictions were applied on the inner parts or LTRs, which could lead to potential false positive candidates that are too truncated to be able to be transcribed, but, on the other hand, provides an unbiased definition of full-length repeat elements. The subfamily per element was defined based on the inner part. Inner parts flanked by only one LTR were termed half-length elements. LTRs without an inner part were termed solo LTRs. To provide a broad overview of potential full-length L1 elements, only annotated elements with a size of greater than 6 kb were shown. The genomic coordinates of retrotransposons are listed in Supplementary Table 2a–e.

scRNA-seq processing

Fastq files for the wild-type timepoints E6.5 and E7.0 were downloaded from GEO (Supplementary Table 1c)⁷⁷. For the wild-type time point E5.5 and the Trim28 KO, raw reads (fastq) were generated using Cell Ranger (https://support.10xgenomics.com/single-cell-gene-expression/software/downloads/latest) (v.4) from 10x Genomics Inc. with the command ‘cellranger mkfastq.’ Reads from all timepoints were aligned against the mouse genome (mm10), and barcodes and unique molecular identifiers were counted using ‘cellranger count’. Multiple sequencing runs were combined using ‘cellranger aggr.’

Retrotransposon expression quantification

Global repeat expression quantification from RNA-seq, TT-SLAM-seq and scRNA-seq (Fig. 1e and Supplementary Fig. 4b-e) was carried out as described⁴⁰. Briefly, to estimate the expression for each retrotransposon subfamily without bias due to gene expression, only reads not overlapping any gene were considered for the analysis. Reads overlapping splice sites, as well as reads with a high polyA content, were removed. The remaining reads were counted per subfamily only if they aligned uniquely or multiple times to elements of the same subfamily. Here, any annotated element of a specific subfamily from Repbase was considered independent of our full-length ERVK annotations. Reads aligning to multiple elements were counted only once. For scRNA-seq samples, reads were counted per subfamily, sample and cell state. The number of reads per subfamily was normalized by library size for RNA-seq and TT-SLAM-seq samples and normalized by reads aligning to genes and repeats for scRNA-seq samples. Fold change (FC) was calculated with respect to the DMSO or wild-type samples.

Statistical tests

The statistical significance of the difference of IAP expression between DMSO control and dTAG timepoints for TT-SLAM-seq and RNA-seq was calculated using an unpaired two-sided t-test (Fig. 1e). Statistical significance of differences in FC (versus DMSO) in control versus 1-6 HD-treated cells was estimated with unpaired two-sided t-test (Extended Data Fig. 3e). All other tests are described in the figure legends.

Definition of boxplot elements

In Figs. 1i and 4c,f –h,j, Extended Data Figs. 7c,e and 8b and Supplementary Fig. 7c, elements depicted in boxplots are as follows: middle line, median; box limits, upper and lower quartile; whiskers, 1.5× interquartile range. In Extended Data Figs. 2b–d and 5c, elements depicted in dot plots are as follows: middle line, mean; whiskers, s.d.; points, all data points.

Statistics and reproducibility

For all RNA-FISH combined with IF experiments, the target combination of gene transcript and transcriptional activator was probed on one coverslip of mESCs and at least two viewpoints were acquired. The number of detected foci included in the radial plot analysis is indicated under n_foci in Figs. 1j, 2f,g, 3f and 4d and Extended Data Figs. 2b,d, 4d and 6j. For Fig. 2a,b, n indicates the number of analyzed nuclei collected from at least three viewpoints, whereas the total number of detected IAPez foci is indicated in Fig. 2c (1,774 for RNAPII and 2,735 for MED1). Colocalizing foci (distance <200 nm) from Fig. 2a,b are indicated in Fig. 2c (344 of 1,774 for RNAPII and 381 of 2,735 for MED1). In Fig. 1h, enhancer constituents with significant transcription (FPKM > 1) are included (n = 117 for super-enhancers, n = 153 for typical enhancers).

IAP RNA-FISH–RNAPII IF experiments were repeated three times. Images and analysis of one representative experiment are displayed in Fig. 2a, and those from a second replicate experiments in Extended Data Fig. 3a. IAP RNA-FISH–MED1/MED23 IF images are from one biological replicate staining experiment (Fig. 2b and Extended Data Fig. 3b). IF images of 1-6 HD-treatment experiments (Extended Data Fig. 3d) and Cthrc1 RNA–NRF1 IF images are from one biological replicate staining experiment (Extended Data Fig. 4d).

For in vitro biochemistry experiments (Fig. 4e,i and Extended Data Figs. 7a,b,d,g,h and 8a), at least one independent slide containing the indicated mix was imaged and at least five independent viewpoints were acquired for each slide. Data are displayed as boxplots (Fig. 4f,g,h,j and Extended Data Figs. 7c,e,i and 8b), and each dot represents an individual droplet (n is the total number of droplets). In the boxplots, the lower box limit was set to the 25th percentile, upper box limit was set to the 75th percentile, the center line indicates the median and the whiskers represent the range within 1.5× interquartile. The following numbers of viewpoints and droplets were analyzed (formatted as condition per number of replicates per number of viewpoints per experiment per total number of droplets shown if the figure contains a boxplot): Fig. 4e,f,g, (IAP RNA: 0 nM; Fluorescein: 73 nM)/3/5/4,152, (3.7 nM)/3/5/1,337, (37 nM)/3/5/8,142, (73 nM)/3/5/11,018; Fig. 4h, (all conditions)/1/5/5 (each dot represents an image); Fig. 4i,j, (IAP RNA: 0 nM)/2/5/308, (15 nM)/2/5/332; Extended Data Fig. 7a–e, (IAP RNA: 0 nM; Fluorescein: 73 nM)/2/5/945, (3.7 nM)/2/5/293, (37 nM)/2/5/217, (73 nM)/2/5/321; Extended Data Fig. 7g, (all conditions)/2/5; Extended Data Fig. 7h,i, (mEGFP)/2/5/337, (NFYC-IDR-mEGFP)/2/5/434; Extended Data Fig. 8a,b, (IAP RNA-Cy5 all combinations)/1/5/5 images, (Maj Sat Repeat RNA-Cy5 all combinations)/1/10/10 images, (MiR290-295 SE RNA-Cy5 with MED1 IDR-mEGFP)/1/10/10 images and (MiR290-295 SE RNA-Cy5 with HP1α-mCherry)/1/20/20 images).

Sample sizes in Fig. 1k are as follows: left, sample size: 157 (DMSO), 112 (dTAG-13) cells; middle left, sample size: 157 (DMSO), 112 (dTAG-13) cells; middle, right, sample size: 2,591 (DMSO), 1,572 (dTAG-13) RNAPII clusters; right, number of RNAPII clusters per cell: sample size: 94 (DMSO), 59 (dTAG-13) cells.

Sample sizes in Fig. 4c are as follows: left panel shows SE constituents with FPKM > 0.05 (n = 163), middle panel contains SE-associated genes (n = 185) and right panel includes all active genes FPKM > 1 (n = 11,525).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All data are available in the Supplementary Information. Sequence data were deposited at GEO under the accession number GSE159468. Mass spectrometry data were deposited at ProteomeXchange under the accession ID PDX021895. Plasmids generated in the study are available at Addgene. Source data are provided with this paper.

Code availability

All custom code used in this study was deposited along with the raw data at Zenodo (https://doi.org/10.5281/zenodo.6521914).

References

Thompson, P. J., Macfarlan, T. S. & Lorincz, M. C. Long terminal repeats: from parasitic elements to building blocks of the transcriptional regulatory repertoire. Mol. cell 62, 766–776 (2016).
Article CAS PubMed PubMed Central Google Scholar
Grewal, S. I. & Jia, S. Heterochromatin revisited. Nat. Rev. Genet. 8, 35–46 (2007).
Article CAS PubMed Google Scholar
Cordaux, R. & Batzer, M. A. The impact of retrotransposons on human genome evolution. Nat. Rev. Genet. 10, 691–703 (2009).
Article CAS PubMed PubMed Central Google Scholar
Payer, L. M. & Burns, K. H. Transposable elements in human genetic disease. Nat. Rev. Genet. 20, 760–772 (2019).
Article CAS PubMed Google Scholar
Rowe, H. M. & Trono, D. Dynamic control of endogenous retroviruses during development. Virology 411, 273–287 (2011).
Article CAS PubMed Google Scholar
Wells, J. N. & Feschotte, C. A. A field guide to eukaryotic transposable elements. Annu Rev. Genet. 54, 539–561 (2020).
Article CAS PubMed PubMed Central Google Scholar
Rowe, H. M. et al. KAP1 controls endogenous retroviruses in embryonic stem cells. Nature 463, 237–240 (2010).
Article CAS PubMed Google Scholar
Cammas, F. et al. Mice lacking the transcriptional corepressor TIF1beta are defective in early postimplantation development. Development 127, 2955–2963 (2000).
Article CAS PubMed Google Scholar
Matsui, T. et al. Proviral silencing in embryonic stem cells requires the histone methyltransferase ESET. Nature 464, 927–931 (2010).
Article CAS PubMed Google Scholar
Hu, G. et al. A genome-wide RNAi screen identifies a new transcriptional module required for self-renewal. Genes Dev. 23, 837–848 (2009).
Article CAS PubMed PubMed Central Google Scholar
Xu, W. et al. METTL3 regulates heterochromatin in mouse embryonic stem cells. Nature 591, 317–321 (2021).
Article CAS PubMed Google Scholar
Chelmicki, T. et al. m(6)A RNA methylation regulates the fate of endogenous retroviruses. Nature 591, 312–316 (2021).
Article CAS PubMed Google Scholar
Liu, J. et al. The RNA m(6)A reader YTHDC1 silences retrotransposons and guards ES cell identity. Nature 591, 322–326 (2021).
Article CAS PubMed Google Scholar
Lachner, M., O’Carroll, D., Rea, S., Mechtler, K. & Jenuwein, T. Methylation of histone H3 lysine 9 creates a binding site for HP1 proteins. Nature 410, 116–120 (2001).
Article CAS PubMed Google Scholar
Wolf, G. et al. The KRAB zinc finger protein ZFP809 is required to initiate epigenetic silencing of endogenous retroviruses. Genes Dev. 29, 538–554 (2015).
Article CAS PubMed PubMed Central Google Scholar
Schultz, D. C., Ayyanathan, K., Negorev, D., Maul, G. G. & Rauscher, F. J. 3rd SETDB1: a novel KAP-1-associated histone H3, lysine 9-specific methyltransferase that contributes to HP1-mediated silencing of euchromatic genes by KRAB zinc-finger proteins. Genes Dev. 16, 919–932 (2002).
Article CAS PubMed PubMed Central Google Scholar
Yang, B. X. et al. Systematic identification of factors for provirus silencing in embryonic stem cells. Cell 163, 230–245 (2015).
Article CAS PubMed PubMed Central Google Scholar
Wolf, G. et al. KRAB-zinc finger protein gene expansion in response to active retrotransposons in the murine lineage. eLife 9, e56337 (2020).
Article CAS PubMed PubMed Central Google Scholar
Roden, C. & Gladfelter, A. S. RNA contributions to the form and function of biomolecular condensates. Nat. Rev. Mol. Cell Biol. 22, 183–195 (2021).
Article CAS PubMed Google Scholar
Sharp, P. A., Chakraborty, A. K., Henninger, J. E. & Young, R. A. RNA in formation and regulation of transcriptional condensates. RNA 28, 52–57 (2022).
Article CAS PubMed PubMed Central Google Scholar
Henninger, J. E. et al. RNA-mediated feedback control of transcriptional condensates. Cell 184, 207–225.e24 (2021).
Article CAS PubMed Google Scholar
Quinodoz, S. A. et al. RNA promotes the formation of spatial compartments in the nucleus. Cell 184, 5775–5790.e30 (2021).
Article CAS PubMed Google Scholar
Elsasser, S. J., Noh, K. M., Diaz, N., Allis, C. D. & Banaszynski, L. A. Histone H3.3 is required for endogenous retroviral element silencing in embryonic stem cells. Nature 522, 240–244 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bulut-Karslioglu, A. et al. Suv39h-dependent H3K9me3 marks intact retrotransposons and silences LINE elements in mouse embryonic stem cells. Mol. Cell 55, 277–290 (2014).
Article CAS PubMed Google Scholar
Nabet, B. et al. The dTAG system for immediate and target-specific protein degradation. Nat. Chem. Biol. 14, 431–441 (2018).
Article CAS PubMed PubMed Central Google Scholar
Reichholf, B. et al. Time-resolved small RNA sequencing unravels the molecular principles of microRNA homeostasis. Mol. Cell 75, 756–768 e757 (2019).
Article CAS PubMed PubMed Central Google Scholar
Muhar, M. et al. SLAM-seq defines direct gene-regulatory functions of the BRD4-MYC axis. Science 360, 800–805 (2018).
Article CAS PubMed PubMed Central Google Scholar
Schwalb, B. et al. TT-seq maps the human transient transcriptome. Science 352, 1225–1228 (2016).
Article CAS PubMed Google Scholar
Rowe, H. M. et al. TRIM28 repression of retrotransposon-based enhancers is necessary to preserve transcriptional dynamics in embryonic stem cells. Genome Res. 23, 452–461 (2013).
Article CAS PubMed PubMed Central Google Scholar
Boehning, M. et al. RNA polymerase II clustering through carboxy-terminal domain phase separation. Nat. Struct. Mol. Biol. 25, 833–840 (2018).
Article CAS PubMed Google Scholar
Sabari, B. R. et al. Coactivator condensation at super-enhancers links phase separation and gene control. Science 361, eaar3958 (2018).
Article PubMed PubMed Central CAS Google Scholar
Cho, W. K. et al. Mediator and RNA polymerase II clusters associate in transcription-dependent condensates. Science 361, 412–415 (2018).
Article CAS PubMed PubMed Central Google Scholar
Guo, Y. E. et al. Pol II phosphorylation regulates a switch between transcriptional and splicing condensates. Nature 572, 543–548 (2019).
Article CAS PubMed PubMed Central Google Scholar
Boija, A. et al. Transcription factors activate genes through the phase-separation capacity of their activation domains. Cell 175, 1842–1855 e1816 (2018).
Article CAS PubMed Google Scholar
Wernig, M. et al. A drug-inducible transgenic system for direct reprogramming of multiple somatic cell types. Nat. Biotechnol. 26, 916–924 (2008).
Article CAS PubMed PubMed Central Google Scholar
Whyte, W. A. et al. Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell 153, 307–319 (2013).
Article CAS PubMed PubMed Central Google Scholar
Larson, A. G. et al. Liquid droplet formation by HP1alpha suggests a role for phase separation in heterochromatin. Nature 547, 236–240 (2017).
Article CAS PubMed PubMed Central Google Scholar
Huo, X. et al. The nuclear matrix protein SAFB cooperates with major satellite RNAs to stabilize heterochromatin architecture partially through phase separation. Mol. Cell 77, 368–383.e7 (2020).
Article CAS PubMed Google Scholar
Konermann, S. et al. Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex. Nature 517, 583–588 (2015).
Article CAS PubMed Google Scholar
Grosswendt, S. et al. Epigenetic regulator function through mouse gastrulation. Nature 584, 102–108 (2020).
Article CAS PubMed PubMed Central Google Scholar
Smith, Z. D. et al. Epigenetic restriction of extraembryonic lineages mirrors the somatic transition to cancer. Nature 549, 543–547 (2017).
Article PubMed PubMed Central CAS Google Scholar
Andergassen, D., Smith, Z. D., Kretzmer, H., Rinn, J. L. & Meissner, A. Diverse epigenetic mechanisms maintain parental imprints within the embryonic and extraembryonic lineages. Dev. Cell 56, 2995–3005 e2994 (2021).
Article CAS PubMed Google Scholar
Messerschmidt, D. M. et al. Trim28 is required for epigenetic stability during mouse oocyte to embryo transition. Science 335, 1499–1502 (2012).
Article CAS PubMed Google Scholar
Sampath Kumar, A. et al. Loss of maternal Trim28 causes male-predominant early embryonic lethality. Genes Dev. 31, 12–17 (2017).
Article PubMed PubMed Central CAS Google Scholar
Mitsui, K. et al. The homeoprotein Nanog is required for maintenance of pluripotency in mouse epiblast and ES cells. Cell 113, 631–642 (2003).
Article CAS PubMed Google Scholar
Mouse Genome Sequencing, C. et al. Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520–562 (2002).
Article CAS Google Scholar
Maksakova, I. A. et al. Retroviral elements and their hosts: insertional mutagenesis in the mouse germ line. PLoS Genet. 2, e2 (2006).
Article PubMed PubMed Central CAS Google Scholar
Macfarlan, T. S. et al. Embryonic stem cell potency fluctuates with endogenous retrovirus activity. Nature 487, 57–63 (2012).
Article CAS PubMed PubMed Central Google Scholar
Wang, J. et al. Primate-specific endogenous retrovirus-driven transcription defines naive-like stem cells. Nature 516, 405–409 (2014).
Article CAS PubMed Google Scholar
Kunarso, G. et al. Transposable elements have rewired the core regulatory network of human embryonic stem cells. Nat. Genet. 42, 631–634 (2010).
Article CAS PubMed Google Scholar
Fadloun, A. et al. Chromatin signatures and retrotransposon profiling in mouse embryos reveal regulation of LINE-1 by RNA. Nat. Struct. Mol. Biol. 20, 332–338 (2013).
Article CAS PubMed Google Scholar
Grow, E. J. et al. Intrinsic retroviral reactivation in human preimplantation embryos and pluripotent cells. Nature 522, 221–225 (2015).
Article CAS PubMed PubMed Central Google Scholar
Chuong, E. B., Elde, N. C. & Feschotte, C. Regulatory evolution of innate immunity through co-option of endogenous retroviruses. Science 351, 1083–1087 (2016).
Article CAS PubMed PubMed Central Google Scholar
Jachowicz, J. W. et al. LINE-1 activation after fertilization regulates global chromatin accessibility in the early mouse embryo. Nat. Genet. 49, 1502–1510 (2017).
Article CAS PubMed Google Scholar
Fueyo, R., Judd, J., Feschotte, C. & Wysocka, J. Roles of transposable elements in the regulation of mammalian transcription. Nat. Rev. Mol. Cell Biol. 23, 481–497 (2022).
Article CAS PubMed Google Scholar
Dalgaard, K. et al. Trim28 haploinsufficiency triggers bi-stable epigenetic obesity. Cell 164, 353–364 (2016).
Article CAS PubMed PubMed Central Google Scholar
Diets, I. J. et al. TRIM28 haploinsufficiency predisposes to Wilms tumor. Int. J Cancer 145, 941–951 (2019).
Article CAS PubMed Google Scholar
Deniz, O. et al. Endogenous retroviruses are a source of enhancers with oncogenic potential in acute myeloid leukaemia. Nat. Commun. 11, 3506 (2020).
Article CAS PubMed PubMed Central Google Scholar
Geis, F. K. & Goff, S. P. Silencing and transcriptional regulation of endogenous retroviruses: an overview. Viruses 13, 884 (2020).
Article CAS Google Scholar
Li, W. et al. Human endogenous retrovirus-K contributes to motor neuron disease. Sci. Transl. Med. 7, 307ra153 (2015).
PubMed PubMed Central Google Scholar
Karlsson, H. et al. Retroviral RNA identified in the cerebrospinal fluids and brains of individuals with schizophrenia. Proc. Natl Acad. Sci. USA 98, 4634–4639 (2001).
Article CAS PubMed PubMed Central Google Scholar
Bolte, S. & Cordelieres, F. P. A guided tour into subcellular colocalization analysis in light microscopy. J. Microsc. 224, 213–232 (2006).
Article CAS PubMed Google Scholar
Gilles, J. F., Dos Santos, M., Boudier, T., Bolte, S. & Heck, N. DiAna, an ImageJ tool for object-based 3D co-localization and distance analysis. Methods 115, 55–64 (2017).
Article CAS PubMed Google Scholar
Cho, W. K. et al. RNA polymerase II cluster dynamics predict mRNA output in living cells. eLife 5, e13617 (2016).
Article PubMed PubMed Central CAS Google Scholar
Cisse, I. I. et al. Real-time dynamics of RNA polymerase II clustering in live human cells. Science 341, 664–667 (2013).
Article CAS PubMed Google Scholar
Serge, A., Bertaux, N., Rigneault, H. & Marguet, D. Dynamic multiple-target tracing to probe spatiotemporal cartography of cell membranes. Nat. Methods 5, 687–694 (2008).
Article CAS PubMed Google Scholar
Andrews, J. O. et al. qSR: a quantitative super-resolution analysis tool reveals the cell-cycle dependent organization of RNA polymerase I in live human cells. Sci. Rep. 8, 7424 (2018).
Article CAS PubMed PubMed Central Google Scholar
Nakagata, N. Cryopreservation of mouse spermatozoa and in vitro fertilization. Methods Mol. Biol. 693, 57–73 (2011).
Article CAS PubMed Google Scholar
Quast, C. et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 41, D590–D596 (2013).
Article CAS PubMed Google Scholar
Neumann, T. et al. Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets. BMC Bioinf. 20, 258 (2019).
Article CAS Google Scholar
Anders, S., Pyl, P. T. & Huber, W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2015).
Article CAS PubMed Google Scholar
Ramirez, F., Dundar, F., Diehl, S., Gruning, B. A. & Manke, T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 42, W187–W191 (2014).
Article CAS PubMed PubMed Central Google Scholar
Hnisz, D. et al. Super-enhancers in the control of cell identity and disease. Cell 155, 934–947 (2013).
Article CAS PubMed Google Scholar
Jurka, J. et al. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet. Genome Res. 110, 462–467 (2005).
Article CAS PubMed Google Scholar
Stocking, C. & Kozak, C. A. Murine endogenous retroviruses. Cell. Mol. Life Sci. 65, 3383–3398 (2008).
Article CAS PubMed PubMed Central Google Scholar
Crichton, J. H., Dunican, D. S., Maclennan, M., Meehan, R. R. & Adams, I. R. Defending the genome from the enemy within: mechanisms of retrotransposon suppression in the mouse germline. Cell. Mol. Life Sci. 71, 1581–1605 (2014).
Article CAS PubMed Google Scholar
Chan, M. M. et al. Molecular recording of mammalian embryogenesis. Nature 570, 77–82 (2019).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank T. Aktas and A. Bulut-Karslioglu for comments on the manuscript, C. Haggerty (MPIMG) for S2 cells, G. Winter (CeMM, Vienna), M. Jäger (CeMM, Vienna), M. Biesaga (IRB, Barcelona) and X. Salvatella (IRB, Barcelona) for sharing RNAPII antibody, A. Dall’Agnese and E. Guo for advice on FISH-IF, S. Basu for helping with colocalization analysis, S. Mackowiak for help with code, C. Althoff for help with Western blots, B. Lukaszewsa-McGreal for help with mass spectrometry, C. Hillgardt, C. Franke and the MPIMG transgenic facility for help with mouse work and the MPI-MG Sequencing core for sequencing. This work was funded by the Max Planck Society and partially supported by grants from the NIH (1P50HG006193 to A.M.; 5R01GM134734 to I.I.C.), the Deutsche Forschungsgemeinschaft (DFG) Priority Program SPP 2202 Grant HN 4/1-1 and HN 4/3-1 (to D.H.), the Austrian Academy of Sciences and the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (ERC-CoG-866166, RiboTrace) (to S.L.A.). H.N. is supported by fellowships from the Emil Aaltonen, Orion Research Foundation and Instrumentarium Science Foundation.

Funding

Open access funding provided by Max Planck Society.

Author information

These authors contributed equally: Vahid Asimi, Abhishek Sampath Kumar, Henri Niskanen, Christina Riemenschneider.

Authors and Affiliations

Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
Vahid Asimi, Abhishek Sampath Kumar, Henri Niskanen, Christina Riemenschneider, Sara Hetzel, Julian Naderi, Helene Kretzmer, Raha Weigert, Maria Walther, Sainath Mamde, Alexander Meissner & Denes Hnisz
Institute of Chemistry and Biochemistry, Freie Universität Berlin, Berlin, Germany
Vahid Asimi
Institute of Biotechnology, Technische Universität Berlin, Berlin, Germany
Abhishek Sampath Kumar & Christina Riemenschneider
Institute of Molecular Biotechnology (IMBA), Vienna BioCenter (VBC), Vienna, Austria
Nina Fasching, Niko Popitsch & Stefan L. Ameres
Department of Biochemistry and Cell Biology, Max Perutz Labs, University of Vienna, Vienna BioCenter (VBC), Vienna, Austria
Niko Popitsch & Stefan L. Ameres
Department of Physics, Massachusetts Institute of Technology (MIT), Cambridge, MA, USA
Manyu Du & Ibrahim I. Cisse
Department of Biological Physics, Max Planck Institute of Immunobiology and Epigenetics, Freiburg, Germany
Manyu Du & Ibrahim I. Cisse
Broad Institute of MIT and Harvard, Cambridge, MA, USA
Zachary D. Smith & Alexander Meissner
Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
Zachary D. Smith & Alexander Meissner
Max Planck Institute for Molecular Genetics, Mass Spectrometry Facility, Berlin, Germany
David Meierhofer
Department of Developmental Genetics, Max Planck Institute for Molecular Genetics, Berlin, Germany
Lars Wittler
Microscopy Core Facility, Max Planck Institute for Molecular Genetics, Berlin, Germany
René Buschow
Sequencing Core Facility, Max Planck Institute for Molecular Genetics, Berlin, Germany
Bernd Timmermann

Authors

Vahid Asimi
View author publications
You can also search for this author in PubMed Google Scholar
Abhishek Sampath Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Henri Niskanen
View author publications
You can also search for this author in PubMed Google Scholar
Christina Riemenschneider
View author publications
You can also search for this author in PubMed Google Scholar
Sara Hetzel
View author publications
You can also search for this author in PubMed Google Scholar
Julian Naderi
View author publications
You can also search for this author in PubMed Google Scholar
Nina Fasching
View author publications
You can also search for this author in PubMed Google Scholar
Niko Popitsch
View author publications
You can also search for this author in PubMed Google Scholar
Manyu Du
View author publications
You can also search for this author in PubMed Google Scholar
Helene Kretzmer
View author publications
You can also search for this author in PubMed Google Scholar
Zachary D. Smith
View author publications
You can also search for this author in PubMed Google Scholar
Raha Weigert
View author publications
You can also search for this author in PubMed Google Scholar
Maria Walther
View author publications
You can also search for this author in PubMed Google Scholar
Sainath Mamde
View author publications
You can also search for this author in PubMed Google Scholar
David Meierhofer
View author publications
You can also search for this author in PubMed Google Scholar
Lars Wittler
View author publications
You can also search for this author in PubMed Google Scholar
René Buschow
View author publications
You can also search for this author in PubMed Google Scholar
Bernd Timmermann
View author publications
You can also search for this author in PubMed Google Scholar
Ibrahim I. Cisse
View author publications
You can also search for this author in PubMed Google Scholar
Stefan L. Ameres
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Meissner
View author publications
You can also search for this author in PubMed Google Scholar
Denes Hnisz
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

V.A. conceived the study, generated mESC lines, designed and performed RNA-FISH-IF experiments, analyzed and interpreted data, supported TT-SLAM-seq experiments and revised the manuscript. A.S.K. conceived the study, generated and characterized mESC lines, performed directed differentiation experiments, scRNA-seq, repeat knockdown, IAP-gag RNA-FISH, in vivo KO experiments, tetraploid aggregation experiments, whole-embryo IF, flow cytometry experiments and analysis and revised the manuscript. H.N. conceived the study, performed ChIP–seq, TT-SLAM-seq and in situ Hi-C experiments, analyzed and interpreted the data from TT-SLAM-seq and Hi-C experiments, assisted in RNA-seq analysis, performed and supported qRT–PCR experiments, generated ERV-TKO cell lines and revised the manuscript. C.R. conceived the study, designed and performed OSKM ectopic expression and ERV overexpression experiments, generated and characterized iPSC-OSKM-TRIM28-FKBP, mESC-IAPEz and mESC-MMERVK10C cell lines and revised the manuscript. S.H. performed initial processing of RNA-seq, TT-SLAM-seq, ChIP–seq and scRNA-seq datasets, analyzed and interpreted RNA-seq, ChIP–seq and scRNA-seq data, assisted in the TT-SLAM-seq data analysis and revised the manuscript. J.N. performed protein purification and droplet assays. N.F., N.P. and S.L.A. supported TT-SLAM-seq methodology and data analysis. M.D. and I.I.C. designed, performed and analyzed time-correlated PALM experiments. H.K. assisted in scRNA-seq analysis. Z.D.S. and L.W. performed in vivo Trim28 KO and tetraploid aggregation experiments. R.W. generated and performed NPC experiments. M.W. supported cloning and qRT–PCR experiments. R.B. performed quantification for IAP-gag RNA-FISH. S.M. wrote code for image analysis. D.M. performed mass spectrometry. B.T. supported next-generation sequencing methodology. A.M. conceived the study and supervised the work. D.H. conceived the study, supervised the work and wrote the manuscript.

Corresponding author

Correspondence to Denes Hnisz.

Ethics declarations

Competing interests

S.L.A. declares competing interest based on a granted patent related to SLAMseq. S.L.A. is co-founder, advisor and member of the board of QUANTRO Therapeutics GmbH

Peer review

Peer review information

Nature Genetics thanks Alessio Zippo, Todd Macfarlan and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Reduction of super-enhancer transcription and the pluripotency circuit in TRIM28-degraded mESCs.

a. Acute reduction of transcription at the miR290-295 super-enhancer locus upon TRIM28-degradation. Displayed are genome browser tracks of ChIP-seq data (H3K27Ac, OCT4, SOX2, NANOG) in control mESCs, and TT-SLAM-seq data upon 0 h, 2 h, 6 h and 24 h dTAG-13 treatment at the miR290-295 locus. Rpm: reads per million. Co-ordinates are mm10 genome assembly co-ordinates. b. Acute reduction of transcription at the Mycn super-enhancer locus upon TRIM28-degradation. Displayed are genome browser tracks of ChIP-seq data (H3K27Ac, OCT4, SOX2, NANOG) in control mESCs, and TT-SLAM-seq data upon 0 h, 2 h, 6 h and 24 h dTAG-13 treatment at the Mycn locus. Rpm: reads per million. Co-ordinates are mm10 genome assembly co-ordinates. c. qRT-PCR validation of the TT-SLAM-seq data at the miR290-295 and Klf4 loci. Displayed are transcript levels after the indicated duration of dTAG-13 treatment. Values are displayed as mean ± SD from three independent experiments and are normalized to the level at 0 h. P values are from two-tailed t-tests. ****: P < 10⁻⁴, ***: P < 10⁻³, **: P < 10⁻², *: P < 0.05. d. Visualization of nascent transcripts at super-enhancers, enhancers and de-repressed LTR retrotransposons. Displayed are TT-SLAM-seq read densities from both strands within 4 kb around the indicated sites. The genomic features (the middle part of the plot) were length normalized. Meta-analyses of the mean signals are displayed above the heatmaps.

Extended Data Fig. 2 Loss of SE-association with RNAPII puncta at super-enhancers.

a. Analyses of cells used in Fig. 1j. (left) RNAPII IF intensity at the miR290-295 FISH foci (n_DMSO = 61, n_dTAG-13 = 50). (middle) RNAPII mean fluorescence intensity at random nuclear positions (n_DMSO = 61, n_dTAG-13 = 50). (right) Distance between the FISH focus and the nearest RNAPII puncta (n_DMSO = 67, n_dTAG-13 = 53). Data presented as mean values ± SD from one staining experiment. P values are from two-sided Mann-Whitney tests. NS: not significant. b. Images of RNA-FISH and IF signal. Nuclear periphery determined by DAPI staining is highlighted as a white contour. Also shown are averaged signal of either RNA FISH or RNAPII IF centered on the miR290-295 FISH foci or randomly selected nuclear positions. Data were collected as an independent replicate of experiments displayed in Fig. 1j. Scale bars: 2.5 μm. c. Analysis of cells used in panel ‘b’. (left) RNAPII IF intensity at the miR290-295 FISH foci (n_DMSO = 43, n_dTAG-13 = 25). (center) RNAPII mean fluorescence intensity (n_DMSO = 30, n_dTAG-13 = 40). (right) Number of RNAPII puncta on a representative set of cells (n_DMSO = 22, n_dTAG-13 = 22). Data are presented as mean values ± SD from one staining experiment. P values are from two-sided Mann-Whitney tests. NS: not significant. d. Images of individual z-slices (same z) of the Fgf4 RNA-FISH and IF signal. Nuclear periphery determined by DAPI staining is highlighted as a white contour. Also shown are averaged signals of either RNA FISH or RNAPII IF centered on the FISH foci or randomly selected positions. Scale bars: 2.5 μm. e. Analyses of cells used in panel ‘d.’ (left) RNAPII IF intensity at the Fgf4 FISH foci (n_DMSO = 53, n_dTAG-13 = 37). (right) RNAPII mean fluorescence intensity at random nuclear positions (n_DMSO = 53, n_dTAG-13 = 29). Data presented as mean values ± SD from one staining experiment. P values are from two-sided Mann-Whitney tests. NS: not significant. f. (left) Scheme of FKBP knock-in strategy in the R1 mESCs used in the PALM experiments. (right) TRIM28 Western blot in the R1 mESCs. Western blot was done once.

Source data

Extended Data Fig. 3 Condensate hijacking additional data.

a, b. Co-localization between the IAP RNA and (a) RNAPII puncta and (b) MED23 puncta in TRIM28-degraded mESCs. Displayed are separate images of the RNA-FISH and IF signal, and an image of the merged channels. The nuclear periphery determined by DAPI staining is highlighted as a white contour. The zoom column displays the region of the images highlighted in a yellow box zoomed in for greater detail. After 24 h dTAG-13 treatment, small nuclear puncta appear, and after 48 h of dTAG-13 treatment, large nuclear foci are visible. Scale bars: 2.5 μm. c. Scheme of the 1-6 hexanediol (1-6 HD) treatment experiments. d. Representative images of RNAPII immunofluorescence in control and 1-6 HD-treated cells. 1-6 HD partially dissolved the punctate localization of RNAPII. Scale bars: 5 μm. e. Transcription of the nascent Cthcr1 RNA is reduced by 30 min 1% 1-6 hexanediol-treatment in TRIM28-degraded cells. The bar plots show qRT-PCR data as fold change normalized to the DMSO control across 6 and 3 biological replicates for 24 h and 48 h timepoints, respectively. Note that the IAP RNA does not contain introns; thus, the IAP RNA qRT-PCR detects the steady state pool of IAP RNAs. Each dot represents a data point, and bar indicates the mean. P values are from two-tailed t tests. NS: not significant.

Extended Data Fig. 4 Additional characterization NFY and NRF1.

a. The RNAPII IF signal at Cthrc1 FISH foci is higher than the average nuclear signal. Quantification of the RNAPII IF intensity at Cthrc1 FISH foci (n = 20) and nuclei (n = 47) in the cells analyzed in Fig. 2f is shown. Data are presented as mean values ± SD from one staining experiment. P value is from a two-sided Mann-Whitney test. b. The sequence of IAPs is enriched for various TF binding motifs, including the motif of NFY. Top: schematic of an IAP element. Bottom: motif images, adjusted P values and motif IDs, and the expression level of the TF in mESC RNA-seq data. Displayed are the top-scoring motifs based on adjusted P-value. Motifs were filtered for redundancy. c. The NFYA IF signal at Cthrc1 FISH foci is higher than the average nuclear signal. Quantification of the NFYA IF intensity at Cthrc1 FISH foci (n = 14) and nuclei (n = 16) in the cells analyzed in Fig. 2g is shown. Data are presented as mean values ± SD from one staining experiment. P value is from a two-sided Mann-Whitney test. d. NRF1 puncta do not co-localize with the nascent RNA Cthrc1 in TRIM28-degraded mESCs. Displayed are separate images of the RNA-FISH and IF signal, and an image of the merged channels. The nuclear periphery determined by DAPI staining is highlighted as a white contour. Also shown are averaged signals of either RNA FISH or NRF1 IF centered on the Cthrc1 FISH foci or randomly selected nuclear positions. Scale bars: 2.5 μm.

Extended Data Fig. 5 Additional characterization of the OSKM/dTAG-13 experiments.

a. Western blot validation of the FKBP degron tag and its ability to degrade TRIM28 in iPSCs. Washout of the dTAG-13 ligand (24 h) indicates reversibility of degradation. Western blot experiments were performed twice and one representative image is shown. Actin is shown as the loading control. b. Western blot validation of the OSKM ectopic expression in the iPSC line. Western blot experiments were performed three times and one representative image is shown. HSP90 is shown as the loading control. c. dTAG-13 treatment leads to reduced RNAPII immunofluorescence signal at miR290-295 FISH foci which is rescued by OSKM ectopic expression, while overall RNAPII levels do not change. (top) Quantification of RNAPII mean fluorescence intensity (n = 117 for DMSO, n = 138 for dTAG-13, n = 110 for Dox+dTAG-13) in the cells used in Fig. 3f. (bottom) Quantification of RNAPII IF intensities at the miR290-295 FISH foci (n = 128 for DMSO, n = 39 for dTAG-13, n = 89 for Dox+dTAG-13) detected in the cells used in Fig. 3f. Data are presented as mean values ± SD from one staining experiment. P value is from a two-sided Mann-Whitney test. d. Mass spectrometry-detected protein abundance for three individual replicate samples after 0 h, 24 h, 48 h, 72 h and 96 h dTAG-13 treatment of mESCs. RNAPII subunits are highlighted in green. Mediator complex subunits are highlighted in red. e–h. qRT-PCR data normalized to the 0 h of dTAG-13 treatment. Data are from three independent biological replicates (that is, three wells on a tissue culture plate) and are presented as mean values ± SD. The experiment was repeated three times, and data from one representative experiment are shown. P values are from two-tailed t-tests. *: P < 0.05, ***: P < 10⁻³, ****: P < 10⁻⁴.

Source data

Extended Data Fig. 6 Knockdown of ERV RNA rescues super-enhancer transcription in TRIM28-degraded cells.

a. Scheme of knockdown experiments with simultaneous TRIM28 degradation. b. qRT-PCR analysis from three independent biological replicates. Data presented as mean values ± SD. Experiment was performed twice, and the data shown are from one representative experiment. P values are from two-tailed t tests. ****: P < 10⁻⁴, ***: P < 10⁻³, **: P < 10⁻², *: P < 0.05. c. Scheme of IAP knockdown experiments. d. qRT-PCR data displayed as fold change normalized to the 0 h control from three independent biological replicates. Data are presented as mean values ± SD. Each dot represents the mean of the three biological replicates of an individual experiment. P values are from two-tailed t-tests. ****: P < 10⁻⁴, ***: P < 10⁻³, **: P < 10⁻², *: P < 0.05. e. Scheme of the experiment in which shRNAs are induced for 24 h and then treated either with DMSO (yellow), dTAG-13 (orange) or with dTAG-13 and Dox (maroon) for additional 24 h. f. qRT-PCR data as fold change normalized to the Dox (24 h) treatment control from three independent biological replicates. Data are presented as mean values ± SD. Experiment was performed twice, and data from one representative experiment is shown. P values are from two-tailed t-tests. ****: P < 10⁻⁴, ***: P < 10⁻³, **: P < 10⁻², *: P < 0.05. g. RNA levels detected with total RNA-seq. Values from three biological replicates are normalized to levels detected at 0 h. Red arrowheads highlight the ERV taxa against whose sequences the shRNAs were designed. h. Representative images of IAP RNA FISH in cells described in panels (e–f). Scale bar: 2.5 μm. i. Analyses of cells used in Fig. 4d. (left) RNAPII IF intensities at the miR290-295 FISH foci (n_DMSO = 100, n_dTAG-13 = 52, n_Dox+dTAG-13 = 60). (right) RNAPII mean fluorescence intensity at random nuclear positions (n_DMSO = 97, n_dTAG-13 = 88, n_Dox+dTAG-13 = 60). Data presented as mean values ± SD from one staining experiment. P values are from two-sided Mann-Whitney tests. NS: not significant. j. Images of individual z-slices (same z) of the RNA-FISH and IF signal. Nuclear periphery determined by DAPI staining is highlighted as a white contour. Also shown are averaged signals of either RNA FISH or RNAPII IF centered on the miR290-295 FISH foci or randomly selected positions. Scale bars: 2.5 μm. The experiment is an independent biological replicate of the experiments shown in Fig. 4d.

Extended Data Fig. 7 IAP RNA facilitates droplet formation of transcriptional activators in vitro.

a. IAP RNA facilitates RNAPII CTD droplet formation in vitro. Displayed are representative images of droplet formation by purified RNAPII CTD-mCherry fusion proteins in the presence of in vitro transcribed IAP RNA fragments. Scale bar: 10 μm. b. IAP RNA facilitates MED1 IDR droplet formation in vitro. Displayed are representative images of droplet formation by purified MED1 IDR-mCherry fusion proteins in the presence of in vitro-transcribed IAP RNA. c. Partitioning ratio of MED1 IDR-mCherry into droplets at the indicated IAP RNA concentrations. Every dot represents a detected droplet. Data for the quantification was acquired from at least five images of two independent image series per condition. P value is a from a two-tailed t test. d. IAP RNA is enriched within MED1 IDR droplets. Displayed are representative images of the enrichment of fluorescein-labeled IAP RNA in MED1 IDR-mCherry droplets. e. Quantification of the enrichment of fluorescein-labeled IAP RNA in MED1 IDR-mCherry droplets. Data for the quantification was acquired from at least five images of two independent image series per condition. P value is from a two-tailed t test. f. (left) Schematic model of the heterotrimeric NFY transcription factor. (right) Graphs plotting intrinsic disorder in the NFYA, NFYB and NFYC proteins. The NFYC-IDR cloned for subsequent experiments is highlighted with a blue bar. g. Concentration-dependent droplet formation by purified recombinant NFYC IDR-mEGFP. Scale bar: 10 μm. h. Representative images of droplet formation by purified NFYC IDR-mEGFP and MED1 IDR-mCherry fusion proteins. MED1 IDR-mCherry mixed with purified mGFP is included as a control. Scale bar: 10 μm. i. Partitioning ratio of NFYC IDR-mEGFP or mEGFP in MED1 IDR-mCherry droplets. Every dot represents a detected droplet. Data for the quantification was acquired from at least five images of two independent image series per condition. P value is a from a two-tailed t test.

Extended Data Fig. 8 Effects of various RNAs on MED1 IDR and HPα droplets in vitro.

a. RNA facilitates MED IDR and droplet HPα formation in vitro. Displayed are representative images of droplet formation by purified (left) MED1 IDR-mEGFP fusion protein and (right) HPα-mCherry fusion protein in the presence of in vitro transcribed RNA fragments. Scale bar: 10 μm. b. Quantification of the partitioning of (left) MED1 IDR and (right) HPIα into droplets in the presence of the indicated RNA species. Values are normalized against the partition ratio at no RNA added. Data for the quantification was acquired from at least five images of two independent image series per condition. IAP RNA quantification is the same plot displayed in Fig. 4h.

Extended Data Fig. 9 Induction of ERV transcription compromises super-enhancer transcription.

a. Western blot of TRIM28, OCT4 and SOX2 in the indicated cell lines. Western blot experiments were performed once. b. FACS analysis of GFP in the ‘GFP (IAPEz) line’ and ‘GFP (MMERVK10C) line’. c. Genotyping qRT-PCR of the ‘GFP (IAPEz) line’ and ‘GFP (MMERVK10C) line.’ Primer sets amplifying the transgenic GFP or repeat sequence (IAP or MMERVK10C) were used. Data are from triplicate experiments. d. qRT-PCR validation of IAPEz upregulation. Values are normalized against the IAPEz level in the corresponding DMSO condition. Data are presented as mean values ± SD from three biological replicates. e. Additional supporting data for the experiment in Fig. 4m. qRT-PCR data are shown as fold change normalized to the DMSO control treatment. Data are presented as mean values ± SD from three biological replicates. P values are from two-tailed t-tests. ****: P < 10⁻⁴, ***: P < 10⁻³, **: P < 10⁻², NS: not significant. f. The amount of GFP, IAPEz, Hprt and Malat1 RNA were quantified in the nuclear and cytoplasmic fractions by qRT-PCR. The values are normalized against the amount of in vitro transcribed Ttn RNA that was spiked in at equimolar amount to the cytoplasmic and nuclear fractions. The expression values are then displayed as the percentage of the RNA in the nuclear and cytoplasmic fractions. Hprt is used as a control of a cytoplasmic mRNA, and Malat1 is used as control RNA known to be enriched in the nucleus. Data are presented as mean values ± SD from three biological replicates. P values are from a two-way ANOVA Sidak’s multiple comparison test. ***: P < 10⁻³, NS: not significant. g. Schematic model of the experiment mimicking MMERVK10C transcription. mESC lines harboring ~61 copies of a PiggyBac transposon encoding a Dox-inducible GFP or MMERVK10C transgene were treated with Dox (to induce GFP or MMERVK10C expression). h. qRT-PCR validation of the ‘GFP line’ and ‘MMERVK line’. The bar plots show qRT-PCR data as fold change normalized to the DMSO control treatment. Data are presented as mean values ± SD from three biological replicates. N.d.: not detectable. i. qRT-PCR data as fold change normalized to the DMSO control treatment. Data are presented as mean values ± SD from three biological replicates. P values are from two-tailed t-tests. ****: P < 10⁻⁴, ***: P < 10⁻³, **: P < 10⁻², NS: not significant.

Source data

Extended Data Fig. 10 Reference cell state maps in early mouse embryos.

a. Scheme of the zygotic CRISPR/Cas9 – scRNA-seq platform. b. Cut site analysis. The number, type and distribution of reads mapping to the sites targeted with the TRIM28 guide RNAs is quantified in the scRNA-seq data. c. Immunofluorescence verification of TRIM28 KO. Representative images are shown, with the number of embryos where the immunofluorescence confirmed the genotype per the total number of embryos analyzed. The knockout experiment was performed five times independently, and the pool of embryos from all experiments were used for staining. Scale bars: 20 μm. d. UMAP of wild type early mouse embryos spanning E5.5-E7.0 developmental window. The wild type cells used in scRNA-seq experiments from E5.5, E6.5 and E7.0 developmental stages were included. e. RNA velocity map of wild type early mouse embryos spanning E5.5-E7.0 developmental window. f. Heatmap representation of the expression levels of marker genes of each cluster/cell state. g. Combined cell state map. The wild type cells and TRIM28 KO cells used in scRNA-seq experiments were included. h. Elevated IAP expression in TRIM28 KO cells. (left) Wild type cells used in scRNA-seq experiments from E5.5, E6.5 and E7.0 developmental stages are projected on the combined reference map and are colored according to the IAP expression of the corresponding embryo and cell state. (right) TRIM28 KO E6.5 cells are projected on the combined reference map and are colored according to IAP expression of the corresponding embryo and cell state. i. Bright-field images of representative embryos from E5.5 wild type and E6.5 TRIM28 KO. Dotted lines represent the embryo that was dissected out from the dense Reichert’s membrane. Scale bar is 100 µm.

Supplementary information

Supplementary Information

Supplementary methods, figure legends, table legends, references and figures.

Reporting Summary

Peer Review File

Supplementary Tables

Supplementary Tables 1–6.

Supplementary Data 1

Uncropped gel images for Supplementary Fig. 2b

Supplementary Data 2

Uncropped gel images for Supplementary Fig. 2m.

Supplementary Data 3

Uncropped gel images for Supplementary Fig. 6b.

Source data

Source Data Fig. 1

Uncropped blot images for Fig. 1d.

Source Data Fig. 3

Uncropped blot images for Fig. 3b.

Source Data Extended Data Fig. 2

Uncropped blot images for Extended Data Fig. 2f.

Source Data Extended Data Fig. 5

Uncropped blot images for Extended Data Fig. 5a,b.

Source Data Extended Data Fig. 9

Uncropped blot images for Extended Data Fig. 9a.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Asimi, V., Sampath Kumar, A., Niskanen, H. et al. Hijacking of transcriptional condensates by endogenous retroviruses. Nat Genet 54, 1238–1247 (2022). https://doi.org/10.1038/s41588-022-01132-w

Download citation

Received: 11 August 2021
Accepted: 26 May 2022
Published: 21 July 2022
Issue Date: August 2022
DOI: https://doi.org/10.1038/s41588-022-01132-w

This article is cited by

Comparative analysis of retroviral Gag-host cell interactions: focus on the nuclear interactome
- Gregory S. Lambert
- Breanna L. Rice
- Leslie J. Parent
Retrovirology (2024)
An activity-specificity trade-off encoded in human transcription factors
- Julian Naderi
- Alexandre P. Magalhaes
- Denes Hnisz
Nature Cell Biology (2024)
KAP1 negatively regulates RNA polymerase II elongation kinetics to activate signal-induced transcription
- Usman Hyder
- Ashwini Challa
- Iván D’Orso
Nature Communications (2024)
The homeobox transcription factor DUXBL controls exit from totipotency
- Maria Vega-Sendino
- Felipe F. Lüttmann
- Sergio Ruiz
Nature Genetics (2024)
Biomolecular condensates: insights into early and late steps of the HIV-1 replication cycle
- Francesca Di Nunzio
- Vladimir N. Uversky
- Andrew J. Mouland
Retrovirology (2023)

Subjects

Abstract

Similar content being viewed by others

Main

Results

Rapid and selective degradation of TRIM28 in mESCs

Reduced SE transcription in TRIM28-degraded mESCs

Reduced SE-condensate association in TRIM28-degraded mESCs

Derepressed IAP RNA foci overlap RNAPII condensates

SE-enriched TFs rescue condensate localization

Roles of ERV RNA in condensate formation and localization

Transgenic ERVs compete with SEs for activators

ERV derepression correlates with loss of pluripotent cells

Discussion

Methods

Licenses

Cell culture

Generation of the TRIM28-FKBP ESC line

TRIM28 degradation

RNA-FISH combined with IF

Live-cell PALM

TT-SLAM-seq

Generating wild-type and mutant mouse embryos

scRNA-seq of embryos

Average image and radial distribution analysis

Bioinformatics

TT-SLAM-seq processing

Enhancer and SE annotation

Retrotransposon element definition

scRNA-seq processing

Retrotransposon expression quantification

Statistical tests

Definition of boxplot elements

Statistics and reproducibility

Reporting summary

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links