Homotypic clustering of L1 and B1/Alu repeats compartmentalizes the 3D genome

Lu, J. Yuyang; Chang, Lei; Li, Tong; Wang, Ting; Yin, Yafei; Zhan, Ge; Han, Xue; Zhang, Ke; Tao, Yibing; Percharde, Michelle; Wang, Liang; Peng, Qi; Yan, Pixi; Zhang, Hui; Bi, Xianju; Shao, Wen; Hong, Yantao; Wu, Zhongyang; Ma, Runze; Wang, Peizhe; Li, Wenzhi; Zhang, Jing; Chang, Zai; Hou, Yingping; Zhu, Bing; Ramalho-Santos, Miguel; Li, Pilong; Xie, Wei; Na, Jie; Sun, Yujie; Shen, Xiaohua

doi:10.1038/s41422-020-00466-6

Download PDF

Article
Open access
Published: 29 January 2021

Homotypic clustering of L1 and B1/Alu repeats compartmentalizes the 3D genome

J. Yuyang Lu¹^na2,
Lei Chang ORCID: orcid.org/0000-0002-2606-3347^2,3^na2,
Tong Li ORCID: orcid.org/0000-0002-0786-5272¹^na2,
Ting Wang¹^na1,
Yafei Yin¹^na1,
Ge Zhan¹^na1,
Xue Han¹^na1,
Ke Zhang¹^na1,
Yibing Tao¹^na1,
Michelle Percharde^4,5,
Liang Wang¹,
Qi Peng¹,
Pixi Yan¹,
Hui Zhang¹,
Xianju Bi¹,
Wen Shao¹,
Yantao Hong¹,
Zhongyang Wu¹,
Runze Ma⁶,
Peizhe Wang¹,
Wenzhi Li¹,
Jing Zhang¹,
Zai Chang¹,
Yingping Hou²,
Bing Zhu⁶,
Miguel Ramalho-Santos⁷,
Pilong Li¹,
Wei Xie¹,
Jie Na¹,
Yujie Sun ORCID: orcid.org/0000-0002-9489-4820² &
…
Xiaohua Shen ORCID: orcid.org/0000-0002-4590-1502¹

Cell Research volume 31, pages 613–630 (2021)Cite this article

18k Accesses
77 Citations
31 Altmetric
Metrics details

Subjects

Abstract

Organization of the genome into euchromatin and heterochromatin appears to be evolutionarily conserved and relatively stable during lineage differentiation. In an effort to unravel the basic principle underlying genome folding, here we focus on the genome itself and report a fundamental role for L1 (LINE1 or LINE-1) and B1/Alu retrotransposons, the most abundant subclasses of repetitive sequences, in chromatin compartmentalization. We find that homotypic clustering of L1 and B1/Alu demarcates the genome into grossly exclusive domains, and characterizes and predicts Hi-C compartments. Spatial segregation of L1-rich sequences in the nuclear and nucleolar peripheries and B1/Alu-rich sequences in the nuclear interior is conserved in mouse and human cells and occurs dynamically during the cell cycle. In addition, de novo establishment of L1 and B1 nuclear segregation is coincident with the formation of higher-order chromatin structures during early embryogenesis and appears to be critically regulated by L1 and B1 transcripts. Importantly, depletion of L1 transcripts in embryonic stem cells drastically weakens homotypic repeat contacts and compartmental strength, and disrupts the nuclear segregation of L1- or B1-rich chromosomal sequences at genome-wide and individual sites. Mechanistically, nuclear co-localization and liquid droplet formation of L1 repeat DNA and RNA with heterochromatin protein HP1α suggest a phase-separation mechanism by which L1 promotes heterochromatin compartmentalization. Taken together, we propose a genetically encoded model in which L1 and B1/Alu repeats blueprint chromatin macrostructure. Our model explains the robustness of genome folding into a common conserved core, on which dynamic gene regulation is overlaid across cells.

Improving prime editing with an endogenous small RNA-binding protein

Article Open access 03 April 2024

DNA double-strand break–capturing nuclear envelope tubules drive DNA repair

Article 17 April 2024

Endogenous aldehyde-induced DNA–protein crosslinks are resolved by transcription-coupled repair

Article Open access 10 April 2024

Introduction

The mammalian genomic DNA that is roughly 2 meters long in a cell is folded extensively in order to fit the size of the nucleus with a diameter of ~5–10 μm.¹ Microscopic and 3C-based approaches reveal a hierarchical organization of the genome.^2,3,4,5 At the megabase scale, chromatin is subdivided into two spatially segregated compartments, arbitrarily labeled as A and B, with distinct transcriptional activity and histone modification as well as other features such as CpG frequency and DNA replication timing.^6,7,8,9,10 The euchromatic A compartment adopts a central position, whereas the heterochromatic B compartment moves towards the nuclear periphery and nucleolar regions.¹¹ This nuclear organization appears to be conserved from ciliates to humans and has been maintained in eukaryotes over 500 million years of evolution.¹² Within compartments at the kilobase-to-megabase scale, chromatin is organized in topologically associated domains (TADs), which serve as functional platforms for physical interactions between co-regulated genes and regulatory elements.¹³ At a finer scale, TADs are divided into smaller loop domains, in which distal regulatory elements such as enhancers come into direct contact with their target genes via chromatin loops.¹⁴ Intriguingly, most A/B compartments and TADs are relatively stable in different mouse and human cell types (Supplementary information, Text S1), whereas sub-TAD loops and a small fraction of lineage-specific regions with less pronounced compartment associations tend to be more variable for differential gene expression during cell-fate transition.^{13,15,16,17,18,19}

Evidence suggests that compartments and TADs may be formed by distinct mechanisms. TADs are thought to be formed by active extrusion of chromatin loops by the ring-shaped cohesin complex, which co-localizes with the insulator protein CTCF at the boundaries and anchor regions of contact domains and loops.^20,21 Depletion of CTCF disrupted TAD boundaries but failed to impact compartmentalization, whereas cohesion loss made TADs disappear but increased compartmentalization, although both eliminated sub-TAD loop contacts.^{22,23,24,25,26,27,28,29,30} These results indicate that compartmentalization of mammalian chromosomes emerges independently of proper insulation of TADs. A few mechanisms have been proposed for compartmentalization, such as anchoring heterochromatin to the nuclear lamina,^{31,32,33,34,35} preferential attraction of chromatin harboring similar histone modifications and regulators,^{4,36,37,38,39} and hypothetical models involving pairing of homologous sequences mediated by active transcription and phase separation of block copolymers.^{36,40,41,42,43,44} Although lamin-associated domains (LADs) contribute to a basal chromosome architecture, a large body of work has demonstrated a secondary role for lamina scaffolding in compartmental segregation of heterochromatin and euchromatin.^{31,32,33,34,35,45,46,47,48,49} In vitro assembled nucleosomal arrays harboring histone H3 lysine 9 di- and tri-methylation (H3K9me2 and H3K9me3) marks undergo phase separation with heterochromatin protein 1 (HP1) and associated proteins to form macromolecule-enriched liquid droplets, reminiscent of heterochromatin.³⁸ However, the role of histone modifications in regulating compartmentalization in vivo remains uncertain. Taken SUV39H H3K9 methyltransferases for example, SUV39H double-null cells still exhibit DAPI-dense heterochromatin foci despite the loss of pericentric H3K9me3 marks;³⁹ and double knockout mice of SUV39H survive at birth with abnormalities.⁴⁸ A phase-separation model of block copolymers with similar activity appears attractive in explaining compartmental formation.^{12,34,40,41,42,43,44} However, this hypothesis remains inconclusive, owing to a large void of identification and experimental validation of the molecular drivers that underlie compartmental segregation of euchromatin and heterochromatin.

Repetitive elements comprise more than half of human and mouse genomes.^50,51 Once regarded as genomic parasites,⁵² retrotransposons have been recently implicated in playing active roles in re-wiring the genome and gene expression programs in diverse biological processes.^{53,54,55,56,57,58,59,60,61} Long and short interspersed nuclear elements (LINEs and SINEs, respectively) are the two predominant subfamilies of retrotransposons in most mammals.⁶² L1 (also named as LINE1 or LINE-1) is the most abundant subclass of all repeats, making up to 19% and 17% (0.9–1.0 million copies) of the genome in mouse and human, respectively.⁶³ B1 in mouse and its closely related, primate-specific Alu elements in human are the most abundant subclass of SINEs, constituting 3%–11% (0.6–1.3 million copies) of mouse and human genomes.^64,65 L1 and B1/Alu have distinct nucleotide compositions and sequence lengths. L1 elements are 6–7 kb long and AT-rich, while Alu elements are ~300 bp long and rich in G and C nucleotides.⁶⁶ Analysis of metaphase chromosome banding showed roughly inverse distributions of L1 and Alu elements in chromosomal regions with distinct biochemical properties.^45,67,68 Initial studies suggested that Alu/B1 elements appear to be enriched in gene-rich, euchromatic A compartments, whereas L1 elements tend to be enriched in gene-poor, heterochromatic B compartments that interact with lamina-associated domains.^{35,45,47,69,70,71} However, evidence to pinpoint a role for L1 and B1/Alu repeats in organizing the genome has to our knowledge not been reported, albeit fragmented information about their localizations in scattered reports (Supplementary information, Text S1). Systematic mapping and visualization of L1 and B1/Alu distributions are still lacking.

We have postulated that the primary DNA sequences, particularly abundant repetitive elements embedded in the genome, may instruct genome folding.⁶¹ Here, we report that L1 and B1/Alu repeats tend to cluster with sequences from their own repeat subfamily and form grossly exclusive domains in the nuclear space, which efficiently explains and predicts the compartmental organization revealed by Hi-C. The segregated pattern of L1-rich sequences in the nuclear and nucleolar peripheries and B1/Alu-rich sequences in the nuclear interior is highly conserved across a variety of mouse and human cells, and re-occurs during the cell cycle. In addition, de novo establishment of nuclear segregation of L1- and B1-rich compartments is coincident with the formation of higher-order chromatin structures during early embryogenesis, and appears to be critically regulated by L1 and B1 repeat RNA. Importantly, depletion of L1 RNA in mouse embryonic stem cells (mESCs) significantly weakens spatial contacts of homotypic repeat DNA, disrupts the nuclear localization and segregation of L1- or B1-rich chromosomal sequences, and leads to attenuated compartmentalization of the higher-order chromatin structure. Moreover, we show that recombinant HP1α is able to bind RNA and to phase separate in the presence of RNA or DNA in vitro. Genome-wide co-localization of L1 and HP1α renders these repeat DNA and RNA sequences an advantage in promoting HP1α phase separation in heterochromatin contexts. Altogether, our findings suggest a genetically encoded mechanism by which L1 and B1/Alu repeats organize chromatin macrostructure at the compartmental level, providing an important clue to the conservation and robustness of the higher-order chromatin structure across mouse and human.

Results

L1 and B1/Alu distributions correlate with global compartmentalization in mouse and human

We analyzed the genomic positions of the major repeat subfamilies in mouse and observed positive correlations within L1 or SINE B1 subfamilies, but strong inverse correlations between them (Supplementary information, Fig. S1a). This suggests that L1 and B1 elements tend to be positioned away from each other in the genome, while repeats from the same subfamily tend to be clustered. The non-random positioning of repeat sequences in the genome prompted us to examine their relative distributions in high-order chromatin structures. We first analyzed the published Hi-C data from mESCs.⁷² Dense L1 and B1 repeats appear to be enriched in distinct compartments across the mouse genome, and within a compartment they are evenly distributed without obvious bias towards the boundary (Fig. 1a, b). L2 repeats show weak enrichments in B1-rich compartments, whereas other types of retrotransposons such as ERV1 and ERVK tend to be randomly distributed (Fig. 1a and Supplementary information, Fig. S1b). The compartments marked by B1 repeats show enrichment of active histone marks (H3K4me3, H3K9ac, H3K27ac, H3K36me3), strong binding of RNA polymerase II (Pol II), and high levels of chromatin accessibility and transcription activity. In contrast, the compartments marked by L1 repeats show signatures of heterochromatin, including enrichment of the repressive H3K9me2 and H3K9me3 marks, and strong binding of heterochromatin proteins such as HP1α and the nuclear corepressor KRAB-associated protein-1 (KAP1 or TRIM28) (Fig. 1a and Supplementary information, Fig. S1b).

**Fig. 1: B1- and L1-rich genomic regions homotypically interact, characterize and predict Hi-C compartments.**

We then performed a quantitative sequence analysis of annotated A/B compartments in six distinct mouse and human cell types.^20,72 All cells exhibit consistently high levels of SINE repeats (including B1, B2, B4 in mouse and Alu in human) in the A compartments and L1 repeats (including truncated or intact, and evolutionary old or young L1s) in the B compartments (Fig. 1b and Supplementary information, Fig. S1c–e). In contrast, L2 and ERV1 repeats fail to show consistent enrichments across mouse and human. In addition, unsupervised clustering revealed that the genomic positions of A/B compartments are highly similar across six cell types, with an average Spearman correlation coefficient > 0.73 within species and > 0.52 between species (Supplementary information, Fig. S1f). Compared to other subclass repeats, L1 and B1/Alu are most strongly related to the high-order chromatin structures, and their distributions appear to be conserved in homologous regions of the mouse and human genomes (Supplementary information, Fig. S1f). For example, a region in mouse chromosome 2 (chr2: 140–170 Mb) and its syntenic region in human chromosome 20 (chr20: 6–50 Mb) show similar patterns of Hi-C contact probabilities, and gene and repeat compositions and distributions in the corresponding A and B compartments along the DNA sequences (Fig. 1c).

We further analyzed the published datasets of higher-order chromatin interactions in 21 primary human tissues and cell lines.¹⁷ On the basis of the PC1 values of a principal components analysis on the Hi-C correlation matrix reported by Schmitt et al.,¹⁷ we found that A/B-compartmental associations are highly correlated across all 21 examined samples with correlation coefficients ranging 0.47–0.99 and a median value of 0.79 (Supplementary information, Fig. S2a). The degree of compartmental conservation is highly significant (P < 2.2e-16), as ~80% of the genome shows consistent compartmental labeling in at least 16 samples and ~40% is invariant in all 21 samples, in contrast to 7% and 0% to be expected by chance, respectively (Supplementary information, Fig. S2b, c). Most of compartmental switches that account for 20% of the genome occur in one or few (≤ 5) samples with less pronounced compartmental labeling (gray highlighted regions with low absolute values of PC1 in Supplementary information, Fig. S2d; see also Supplementary information, Text S1). Thus, despite some switching events occurring in individual cells, global compartmentalization is rather stable. Consistently, the genomic regions with conserved A or B compartments across samples exhibit significantly higher levels of Alu or L1 repeats, respectively (Supplementary information, Fig. S2e). Altogether, these results indicate that co-segregation of B1/Alu and L1 repeats with the A and B compartments appears to be stable in different cell types in mouse and human.

Homotypic clustering of L1 and B1 repeats characterizes and predicts compartmental organization

To have a close look at repeat distribution and the higher-order chromatin structure, we took mouse chromosome 17 (chr17, 95 Mb in length) as an example to overlay L1 and B1 features on the Hi-C interaction matrix of mESCs. Interestingly, the plaid pattern of enriched and depleted interaction blocks in the Hi-C map is largely correlated with the compositions and distributions of B1 and L1 along the whole chr17 (Fig. 1d). In a 42-Mb region of chr17, four L1-rich compartments (denoted by c, e, g and i) and three B1-rich compartments (denoted by D, F and H) are alternately positioned along the linear DNA sequence (Fig. 1e). Strong interactions were observed within L1-rich compartments (represented by ce, cg, eg, ei and gi, dotted boxes) or B1-rich compartments (represented by DF, DH and FH, dotted boxes), but not between these two compartments (Fig. 1e). The interaction frequencies between D and F (DF) or between c and e (ce) are much stronger than those of D or F with c or e (cD, De, eF), despite the fact that these regions are closer in the linear sequence. Note that L1- or B1-rich segments often span several adjacent TADs (Fig. 1e and Supplementary information, Fig. S2f), consistent with the findings that TADs are smaller, structural units of compartments.^73,74 L1 and B1 compositions within a TAD also exhibit strong anti-correlations (Pearson correlation coefficient <−0.75) across 2200 annotated TADs in mESCs (Supplementary information, Fig. S2g). This observation is consistent with repeat analyses at the genome-wide and compartmental levels (Fig. 1a and Supplementary information, Fig. S1a), illustrating a mutually exclusive distribution of L1 or B1-rich sequences along the genome.

Conversion of Hi-C contact frequencies into Pearson correlation coefficients sharpened our view of the long-range chromatin interactions (Fig. 1f). By visual inspection, we found that the plaid pattern of the Hi-C correlation map precisely matches the distribution and interaction status of L1 and B1. L1-rich or B1-rich regions show strong enrichment of contacts with regions containing the same repeat type (red blocks in Fig. 1f). We refer to these as homotypic contacts. Contacts between regions containing the other repeat type (heterotypic interactions) are strongly depleted (blue blocks in Fig. 1f). For example, in one region of chr17 (35 to 95 Mb), L1-rich segments (from e to u) and B1-rich segments (from F to T) exhibit high frequencies of homotypic contacts (Fig. 1f, highlighted by arrows), but strong depletion of heterotypic contacts. Similarly, homotypic contacts between L1-rich regions or B1-rich regions were also observed between chromosomes, as illustrated by chromosomes 17 and 19 (Supplementary information, Fig. S2h). These results indicate that genomic regions containing B1 or L1 repeats tend to interact with genomic regions containing repeat sequences from similar subfamilies, but not from different subfamilies, regardless of linear proximity, which characterizes the organization at intra- and inter-chromosomal levels.

Next, we sought to predict compartmental organization based on repeat distributions. We used the criterion of log₂ ratio of B1 to L1 density [log₂(B1/L1)] larger or smaller than 0 for B1-rich or L1-rich compartments, respectively. About 540 B1-rich and 648 L1-rich compartments were identified with a median size of 1.2 Mb across the mouse genome (Supplementary information, Table S1). The numbers and sizes of these B1- and L1-rich compartments called de novo are comparable to those of A and B compartments annotated by Hi-C in mESCs (366 and 364, respectively, with a median size of 1.9 Mb). Importantly, 82% of B1-rich compartments and 77% of L1-rich compartments are overlapped with annotated A or B compartments, respectively (Fig. 1g and Supplementary information, Fig. S3). Only 18% to 23% of compartments show inconsistent labeling between our prediction and Hi-C. We then analyzed genomic features in these ‘falsely’ labeled regions. Intriguingly, L1-rich regions that fall into Hi-C-annotated A compartments (designated as ‘L1.A’) still exhibit a high level of heterochromatic H3K9me3 mark and low levels of chromatin accessibility and gene expression, and contain genes enriched in specialized functions such as responses to pheromone and immunoglobulin and synapse (Supplementary information, Fig. S4a–c). Similarly, B1-rich regions that fall into Hi-C-annotated B compartments (designated as ‘B1.B’) exhibit high levels of chromatin accessibility and gene expression but low H3K9me3 binding (Supplementary information, Fig. S4a–c). In addition, L1.A and B1.B regions exhibit significantly less pronounced PC1 values (close to zero) than those consistent regions (B1.A and L1.B) (Supplementary information, Fig. S4a). Thus, a mere usage of B1 to L1 density ratios successfully re-constructs most of A and B compartments annotated by Hi-C, which suggests that the linear genomic DNA repeats contain the macroscopic structural information. Taken together, homotypic clustering of regions rich in B1 or L1 repeats nicely explains and predicts genome organization at the compartmental level.

Nuclear segregation of L1- and B1/Alu-rich compartments is conserved

High-resolution imaging of L1 and B1 distributions in the conventional nucleus remains lacking, despite initial evidence of their differential localization.^45,68 To visualize their positioning in the nuclear space, we performed dual-color DNA fluorescence in situ hybridization (FISH) using fluorescence-tagged oligonucleotide probes that specifically target the consensus sequences of B1 and L1 elements (Fig. 2a). Strikingly, L1 and B1 exhibit distinct yet complementary nuclear localizations in mESCs (Fig. 2b and Supplementary information, Fig. S5a). B1 DNA shows punctate signals in the nuclear interior. In contrast, L1 DNA exhibits highly organized and concentrated signals that line the periphery of the nucleus and nucleolus. Weak L1 signals were also detected in a few areas of the nuclear interior subregions where B1 signals were absent. Both B1 and L1 signals are absent from DAPI-dense regions, which likely represent satellite repeat-enriched chromocenters.⁷⁵

**Fig. 2: DNA FISH reveals the spatial segregation of L1 and B1 compartments.**

To confirm the L1 localization at the nucleolar periphery in mESCs, we performed DNA immuno-FISH, using an antibody against the nucleolar marker Nucleolin (NCL). Indeed, L1 signals surround and partially overlap with the ring-shaped signals of NCL at the nucleolar periphery (Fig. 2c). The localization of L1 surrounding the nucleus and nucleolus is consistent with sequencing-based analysis of nucleolus- and lamina-associated domains (NADs and LADs), in which L1-rich sequences are sequestered.^61,69,76,77 In addition, to further confirm nuclear colocalization of L1-rich sequences in B compartments and B1-rich sequences in A compartments, we performed Oligopaint DNA FISH for five representative loci, each of which ranging from ~100 to 1 Mb was targeted by a set of 500–4500 DNA probes (targeting single-copy sequences) at a density of 200–300 bp per probe. Indeed, three regions (F, H, and R) annotated in the A compartment are colocalized with B1 FISH signals in the nuclear interior (205 out of 217 nuclei), whereas two B compartment-associated regions (g and q) are colocalized with L1 FISH signals in either LAD or NAD (248 out of 251 nuclei) (Fig. 2d and Supplementary information, Fig. S5b, c).

Moreover, to ask whether L1 and B1 localizations might vary with cell type, we analyzed four additional cell lines, including mouse neural stem cells (NSC), fibroblasts (NIH3T3), and myoblasts (C2C12), and human cervical cancer cells (HeLa) (Fig. 2c). Similar to mESCs, all these cells show non-overlapping localizations of B1/Alu in the nuclear interior and L1 at the nuclear and nucleolar peripheries. Thus, consistent with Hi-C results, the segregated staining pattern of B1/Alu and L1 further demonstrates that homotypic clustering of similar repeat sequences in the nuclear space divides the nucleus into distinct territories. This pattern is conserved across different cell types in mouse and human.

Dynamic re-construction of L1 and B1/Alu segregation during the cell cycle

We then asked whether the nuclear segregation of L1- and B1-rich compartments could be re-constructed during mitosis when chromatin structure undergoes dynamic reorganization. DNA FISH analysis of synchronous mESCs showed that L1 and B1 localizations change dramatically at different cell cycle stages (Fig. 3a and Supplementary information, Fig. S6a). S-phase cells show non-overlapping and complementary localizations of L1 and B1 repeats (Figs. 2b and 3a). This is similar to the pattern we observed previously in asynchronous mESCs, more than half of which are in the S phase of the cell cycle (Supplementary information, Fig. S6a). However, L1 and B1 DNA signals are mixed on mitotic chromosomes in metaphase (M phase, including prophase and anaphase), when the nuclear membrane and nucleoli are disassembled. As the cell cycle progresses into the G1 phase, L1 and B1 DNA start to segregate again (Fig. 3a and Supplementary information, Fig. S6b). To quantify the degree of segregation, we defined a FISH-based segregation index as the negative value of Pearson’s correlation coefficient of L1 and B1 DNA signals in the nucleus. The FISH segregation index is lowest in M-phase cells, but increases significantly in the G1 phase and peaks in the S phase (Fig. 3b).

**Fig. 3: Dynamic segregation of L1 and B1 compartments during the cell cycle and embryonic development.**

To provide molecular evidence for the segregation of repeats during the cell cycle, we analyzed the published Hi-C data from cell-cycle synchronized mESCs and HeLa cells.^78,79 In both cell types, G1-phase cells exhibit a classic plaid pattern of hierarchical interactions, with enriched and depleted interaction blocks outside of the diagonal region of the Hi-C interaction heatmap (Fig. 3c, d and Supplementary information, Fig. S6c, d). In contrast, M-phase cells exhibit stronger signals along the diagonal, which represent the linearly organized, longitudinally compressed array of consecutive chromatin loops. To quantify this difference, we defined a Hi-C-based segregation index by calculating the ratio of homotypic versus heterotypic interaction frequencies between L1 and B1/Alu subfamilies. Indeed, the Hi-C segregation index is significantly higher in G1-phase cells than in M-phase cells (Fig. 3c and Supplementary information, Fig. S6c). These results indicate that segregation of B1/Alu and L1 repeats is dispersed by mitosis, and is re-established when the higher-order chromatin structure forms during each cell cycle in both mouse and human cells. This finding agrees with previous reports that in metaphase, chromosome folding becomes homogeneous and large megabase-scale A and B compartments are lost, whilst in interphase, chromosomes return to a highly compartmentalized state.^78,79

Dynamic establishment of L1 and B1 segregation in early embryogenesis

After fertilization, the chromatin undergoes extensive reprogramming from a markedly relaxed state in zygotes to fully organized structures in blastocysts.^80,81,82 We performed a time-course DNA FISH analysis of L1 and B1 in early mouse embryos. During embryonic divisions, L1 and B1 signals are largely overlapping in zygotes, and become progressively more segregated in 2-cell, 4-cell, morula and blastocyst embryos (Fig. 3e, f). Consistently, analysis of the published Hi-C data of early embryos⁸¹ showed that early 2-cell embryos exhibit prevalent cis-chromosomal contacts along the diagonal of the Hi-C interaction map, whereas the plaid patterns of Hi-C interactions become readily detectable in late 2-cell embryos and are fully established in the inner cell mass (ICM) cells of blastocysts (Fig. 3g and Supplementary information, Fig. S7a). Plotting the FISH and Hi-C segregation indexes showed a gradual increase of L1 and B1 segregation along the course of blastocyst development, which reaches the highest level in blastocysts or in mESCs (Fig. 3f, h). We conclude that, in early embryos, compartmentalization of L1- and B1-rich regions appears to be established in a stepwise manner, coincident with de novo establishment of higher-order chromatin structures. Notably, the greatest change (steepest trend-line) of FISH segregation indexes occurs between the zygote and the late 2-cell stage (Fig. 3f), which implies that the initiation of B1 and L1 compartmentalization may coincide with the zygotic genome activation, during which massive transcription switches on.

It was reported that inhibition of Pol II by α-amanitin caused embryonic arrest at the late 2-cell stage,⁸³ yet the higher-order chromatin structure could still be established.^80,81 However, compared to the control groups, we found that α-amanitin-treated embryos exhibited significantly lower L1/B1 segregation indexes and less clear patterns of Hi-C plaids (Fig. 3g, h). At 20 h, α-amanitin-treated embryos showed a low median level of L1/B1 segregation indexes and a Hi-C pattern with extensive diagonal signals that are similar to those of early 2-cell embryos, while the control group had proceeded into the late 2-cell stage (Fig. 3g, h). At 45 h, the control group had proceeded into the 8-cell and morula stages, whereas α-amanitin-treated embryos (45 h) still showed low L1/B1 segregation and a Hi-C plaid pattern similar to that of late 2-cell embryos, despite the segregation index modestly increases compared with that of embryos at 20 h (Fig. 3g, h and Supplementary information, Fig. S7b). These results indicate a delayed and incomplete formation of the higher-order chromatin structure in the absence of zygotic Pol II transcription in mouse.

In accordance with delayed chromatin folding in embryos, treatments of mESCs with the drug 5,6-dichloro-1-β-d-ribofuranosylbenzimidizole (DRB) which inhibits Pol II transcription elongation, led to a partial loss of L1 perinucleolar localization and a gain of mixed nuclear L1 and B1 signals (Fig. 4a, b and Supplementary information, Fig. S7c). Inhibition of both Pol I and II by a high concentration of actinomycin D (ActD) had a more severe effect compared to DRB treatment (Fig. 4a, b). Thus, in both ESCs and early embryos, inhibition of Pol II transcription appears to partially, but not completely, block L1/B1 segregation. These results imply that L1/B1 compartmentalization is likely to be autonomously initiated and subsequently facilitated by transcription.

**Fig. 4: Repeat RNA and transcription promote the spatial segregation of L1 and B1 compartments.**

Repeat transcripts promote L1 and B1 segregation in embryonic cells

In an effort to link repeat function with chromatin structure, we sought to explore the role of repeat RNA that is transcribed from L1 and B1 sequences. Both L1 and B1 repeats are activated and highly expressed in two-cell embryos (Supplementary information, Fig. S7d).^71,84,85,86 We have reported previously that depletion of L1 RNA by an antisense morpholino (AMO) in mouse embryos led to arrest at the 2-cell stage; and in mESCs, its depletion led to reduced proliferation and global de-repression of hundreds of L1-associated genes; however, it did not alter the expression of OCT4 and NANOG, two known master regulators of the pluripotency program, nor induced ESC differentiation.^61,87 Using the same L1 AMO sequence, we depleted L1 RNA by 17.4% on average shown by RNA FISH (n = 16 embryos; Supplementary information, Fig. S8a). This modest depletion is consistent with the general consensus that AMO acts through steric blockage of its target RNA rather than inducing RNA degradation. In concordance with the previous report by Percharde et al.,⁸⁴ more than 91.3% of embryos (42 out of 46) were arrested at the 2-cell stage in contrast to only 15.4% of embryos treated with scramble AMO that were arrested (Fig. 4c), indicating effective inhibition of L1 RNA.

We also sought to perturb B1 expression by microinjecting B1 antisense oligonucleotides (ASO) into mouse zygotes. Two B1 ASOs significantly downregulated B1 RNA levels by 36% shown by RNA FISH (n = 17 embryos; Supplementary information, Fig. S8b). Strikingly, embryos depleted of B1 RNA were able to pass the first embryonic division, but failed to divide further and became arrested at the 2-cell stage (n = 79 embryos; Fig. 4c), indicating an essential requirement for B1 RNA in embryonic development. We collected these embryos for DNA FISH analysis when the control group injected with scramble AMO or ASO had grown to the late 2-cell stage. Compared to the control embryos, both L1- and B1-depleted embryos exhibited significantly lower L1/B1 FISH segregation indexes (Fig. 4d and Supplementary information, Fig. S8c, d), indicating delayed segregation of L1 and B1 compartments.

In order to dissect the effects independent of embryonic progression, we then tried to deplete B1 and L1 transcripts in mESCs. B1/Alu repeats have been broadly implicated in diverse processes, including transcription, RNA processing, and nuclear export.^88,89,90,91 Treatment of mESCs with B1 ASO led to severe cell death within hours of transfection (data not shown), which precluded direct assessment of B1 RNA in chromatin organization. It has been suggested that nuclear organization is critically dependent on interactions within heterochromatin,^34,92 where L1, the most abundant one of all repeat subclasses, is predominantly enriched. In subsequent analysis, we in-depth characterized the effects of depleting L1 RNA on chromatin organization.

Treatments of mESCs with L1 AMO led to a depletion of L1 RNA by 28% (n = 30 cells) (Supplementary information, Fig. S8e, f). At both 12 and 36 h post transfection of L1 AMO, nucleoplasmic signals of L1 DNA were obviously increased and perinucleolar L1 signals became fuzzy or absent (Fig. 4e). B1 signals became more uniformly dispersed in the nucleoplasm, in contrast to punctate staining of B1 in the control mESCs. Emergence of the overlapping L1 and B1 FISH signals is indicative of decreased homotypic clustering and segregation of L1 and B1 DNA. In comparison, treatment with the drug azidothymidine (AZT), which blocks L1 retrotransposition activity,⁹³ failed to affect the nuclear localization of L1 and B1 as well as L1 RNA levels (Fig. 4e and Supplementary information, Fig. S8g), illustrating an effect independent of L1 retrotransposition activity.

Image quantification of a large number of L1-depleted cells (n = 41 cells randomly picked at 12 and 36 h) showed significantly lower FISH segregation indexes (Fig. 4f), compared to mESCs treated with scramble AMO or AZT (n = 43 cells). This indicates that the most majorities of L1 AMO-treated cells exhibit decreased L1/B1 segregation. In contrast, increases of 2-cell-like cells and G2/M-arrested cells occur in small populations, from 2% to 9% and from 13% to 31%, respectively.⁸⁷ In addition, cells in M phase show a drastically different staining pattern of L1 and B1 from cells in S phase (Fig. 3a). These observations argue against a secondary effect due to changes in mESC state upon depletion of L1 RNA.

Hi-C reveals a key role of L1 RNA in maintaining the 3D chromatin structure

To reveal molecular defects in detail, we performed Hi-C analysis at 36 h after transfecting L1 AMO into mESCs. Direct visualization of Hi-C interaction maps revealed obvious differences in the plaid pattern of L1-depleted and control mESCs across mouse chromosomes (Fig. 5a and Supplementary information, Fig. S9a). In L1-depleted mESCs, as illustrated by chr17, Hi-C contact signals were abnormally increased along the diagonal line, whereas the plaid signals outside of the diagonal regions became fuzzy or even lost (Fig. 5a, panel (i)). Accordingly, a comparison of Hi-C contact frequencies of the control versus L1-depleted cells showed decreased ratios (in blue) across the diagonal and increased ratios (in red) in the periphery (Fig. 5a, panel (ii)). Examination of all 20 mouse chromosomes revealed similar changes (Supplementary information, Fig. S9a), indicating enhanced local chromosomal contacts but decreased long-range interactions across the genome in L1-depleted mESCs.

**Fig. 5: L1 RNA is required for the formation and maintenance of higher-order chromatin structure.**

A zoomed-in view of a 42-Mb region of chr17 in L1-depleted mESCs further shows decreased homotypic chromatin contacts between L1-rich or B1-rich compartments, but aberrantly increased heterotypic contacts (Fig. 5a, b). For example, B1–B1 interactions (represented by DF, DH and FH) or L1–L1 interactions (represented by ce, cg, ci, eg, ei and gi) were downregulated in L1-depleted cells, whereas aberrant B1–L1 contacts (represented by cD, De, Dg, Di, eF, Fg, and Hi) were increased. At the genome-wide level, L1-depleted cells show significantly lower Hi-C segregation indexes of L1 and B1 compared to control mESCs (Fig. 5c).

To evaluate potential changes in the higher-order chromatin structure, we quantified and plotted compartment strength based on the ratio of homotypic (A–A and B–B) to heterotypic (A–B or B–A) compartmental interactions.^34,94 Saddle plots of compartmental interactions show obvious decreases in homotypic B–B and A–A interactions and increases in aberrant A–B interactions upon L1 AMO (Fig. 5d, e and Supplementary information, Fig. S9b). Compared to scramble AMO samples, L1-depleted cells exhibit significantly reduced compartment strength (2.6–2.8 versus 3.5–3.7, P < 0.01). Thus, consistent with DNA FISH, Hi-C analysis further revealed that depletion of L1 RNA causes abnormal increases in heterotypic contacts and genome-wide decreases in homotypic repeat contacts, L1/B1 segregation, and A/B compartmentalization. We noted that depletion of L1 did not alter TAD boundaries (Supplementary information, Fig. S9c). The finding that L1 RNA regulates compartmental organization but not TADs is in line with the notion that the formation of compartments and TADs may involve distinct mechanisms.²⁹

L1 RNA regulates spatial contacts of L1/B1-rich sequences

Having shown a global role of L1 RNA in the regulation of chromatin organization, next we performed Oligopaint dual-color DNA FISH to ask whether depletion of L1 might affect the nuclear localization of specific chromosomal segments. We first chose two heterotypic repeat regions on chr17, e (L1-rich) and F (B1-rich), which are ~100-kb in length covered by 500 oligo probes (Fig. 6a and Supplementary information, Fig. S5b). The e and F sites are juxtaposed to each other with a linear genomic distance of 4.39 Mb. In the nuclear space, they are positioned far away from each other with a median distance of 1.72 ± 0.55 µm in control mESCs, whereas depletion of L1 RNA significantly shortened the spatial distance between them to 1.27 ± 0.47 µm (Fig. 6b, c, left).

**Fig. 6: Depletion of L1 RNA alters nuclear localizations of L1/B1-rich sequences.**

Next, we chose two large L1-rich regions, g and q, each of which is ~1 Mb in length covered by 4500 probes (Fig. 6a and Supplementary information, Fig. S5b). The g and q regions are separated by multiple compartmental domains with a linear distance of 26.4 Mb in sequence; however, they reside in close spatial proximity (1.08 ± 0.45 µm) in the nuclei of control cells. L1 depletion significantly increased the nuclear distance between g and q to 1.81 ± 0.82 µm (Fig. 6b, c, right). These opposing changes observed between homotypic L1–L1 and heterotypic L1–B1 repeat contacts provide visual evidence for L1 RNA in the regulation of nuclear positioning of specific chromosomal segments. We noted that L1 AMO led to moderate but significant increases in the volume of two large L1-rich segments (g and q), implying heterochromatin decompaction (Supplementary information, Fig. S10a). In addition, L1 AMO did not alter the genome-wide binding of H3K9me3 (Supplementary information, Fig. S11), arguing against an indirect consequence due to loss of heterochromatic histone marks.

Taken together, the combined analyses of the whole nucleus and individual loci by sequencing and imaging approaches convincingly demonstrate that L1 RNA critically promotes spatial interactions of homotypic repeats and compartmental segregation of L1- and B1-rich chromosomal sequences. By comparison, depletion of CTCF or RAD21, the core component of cohesion, failed to affect homotypic contacts and nuclear segregation of L1 and B1 repeats (Supplementary information, Fig. S10b–i), which is consistent with the Hi-C results reported previously.^22,23,95

L1 repeats promote phase separation of HP1α

To have a glimpse of the mechanism by which L1 regulates chromatin organization, we investigated the interplay of L1 repeat DNA and RNA with HP1α, a known H3K9me3 reader in heterochromatin organization. We have shown that HP1α binds strongly to L1-rich heterochromatin, but is depleted in B1-rich regions (Figs. 1a and 7a–c and Supplementary information, Fig. S12a). Previously, we performed chromatin isolation by L1 RNA purification followed by sequencing (ChIRP-seq).⁶¹ L1 RNA is significantly enriched in L1 DNA-associated compartments with high levels of H3K9me3 and HP1α signals, but is depleted in B1-associated compartments (Fig. 7a–c). We enriched heterochromatins by sucrose-gradient centrifugation of UV-crosslinked chromatin fragments and performed transcript analysis. Consistently, L1 transcripts are preferentially enriched in heterochromatic fractions that are depleted of H3K4me3 and snRNP70 (Fig. 7d). In addition, L1 RNA signals show no overlap with B1 RNA in the nucleus as shown by RNA FISH (Supplementary information, Fig. S12b).

HP1α has been reported to bind RNA with a preference towards nuclear RNA and major forward transcripts produced from satellite repeats.^96,97,98,99 RNA immunoprecipitation in mESCs showed that HP1α binds to L1 transcripts (Supplementary information, Fig. S12c, d). To test their direct interactions, we purified human HP1α proteins and various RNA fragments in roughly 1-kb length transcribed in vitro. The full-length L1 sequence (6544 bp) was arbitrarily truncated into 8 overlapping 1-kb fragments (F1 to F8) in order to be efficiently produced by in vitro transcription (Supplementary information, Fig. S13a). For comparison, we also generated two synthetic DNA and RNA sequences in 1-kb length, comprising 8 tandem copies of either B1 element or scrambled B1 (designated as 8× B1 or 8× SCR, respectively). L1 mix as well as two synthetic fragments in DNA or RNA efficiently pulled down recombinant HP1α (Supplementary information, Fig. S13b). In addition, L1 RNA–HP1α interactions were robustly detected in highly stringent conditions with up to 1 M of salt and urea (Fig. 7e). These results indicate that HP1α exhibits strong binding activities towards RNA and DNA in vitro.

Recent studies have reported that HP1α forms phase-separated droplets in the presence of DNA or nucleosomes in vitro, and heterochromatin formation may entail a phase-separation mechanism.^{37,38,100,101,102} Indeed, the L1 DNA mix as well as two synthetic DNA controls (8× B1 and 8× SCR) promote the phase separation of HP1α, which fails to phase separate on its own (Supplementary information, Fig. S13c, d). Consistent with its strong RNA-binding activity, HP1α also phase separates with the L1 RNA mix in a concentration-dependent manner to form spherical droplets with liquid-like properties, such as fusion of droplets and rapid recovery after photo-bleaching (Fig. 7f and Supplementary information, Fig. S13d–f). Intriguingly, L1 RNA fragments from F3 to F6 with low GC contents (< 40%) covering the inter-ORF and central conserved ORF2 sequence (3.2-kb) tend to have higher activities in promoting HP1α droplet formation compared to F1 and F8 fragments with GC contents of 45%–56% (Supplementary information, Fig. S13g, h). In addition, careful examination of HP1α ChIP-seq in mESCs showed that HP1α preferentially binds to the central region of L1 repeat DNA (Supplementary information, Fig. S13i), which hints some degree of specificity in L1–HP1α interplay. However, there was no obvious difference detected between L1 mix and 8× B1 (60% in GC) RNA/DNA in HP1α phase separation (Supplementary information, Fig. S13j). Although we cannot conclude a sequence specificity of HP1α, clearly, HP1α shows strong RNA- and DNA-binding activities and phase-separates in the presence of RNA and DNA in vitro. Given the abundance and co-residence of L1 repeat DNA and RNA with HP1α within the nucleus, it is tempting to speculate that their co-localization may provide a means of specificity for L1 in promoting HP1α phase separation during heterochromatin formation.

Discussion

Although tremendous efforts have been dedicated to studies of structural chromatin proteins and cataloging chromatin maps, the role of DNA sequences in 3D genome organization has been largely ignored. Interestingly, the overall higher-order chromatin structure has been reported to be stable across different cell types and conserved in related species, despite occasional compartment switches in a small portion of the genome in a given cell (Supplementary information, Text S1). This remarkable conservation of chromatin compartments suggests a fundamental principle which all cells stick to while coping with shifting signals in different cell fates. Compared to transcription and epigenetic modifications, the primary DNA sequence has an unparalleled advantage to directly control and govern the stability of 3D genome folding due to its static nature during development. Then, the question comes what DNA sequences serve such a task as the blueprint of 3D genome folding. By employing in silico polymer simulations to interpret microscopy and Hi-C data, Mirny and colleagues have suggested that compartmental segregation may occur through a microphase-separation mechanism of block copolymers.^29,34 Together with the Solovei group, they further proposed that interactions between heterochromatic regions, rather than euchromatic contacts and lamina-heterochromatin interactions, are crucial for compartmentalization of the genome in both inverted and conventional nuclei.^29,34 However, what was unknown in their model is the molecular determinants, particularly the genetic information of block copolymers that are responsible for chromatin compartmentalization.

In this study, we reveal a remarkable correlation between repeat distribution and compartmental organization of the higher-order chromatin structure. First, using complementary genomics and imaging approaches, we demonstrate that the self-clustering of L1 and B1/Alu repeats forms grossly exclusive nuclear domains that are highly correlated with and predict the known A/B compartments, and that nuclear segregation of L1-rich and B1/Alu-rich sequences is conserved across mouse and human cells, and can be dynamically established during cell division and early embryogenesis. Second, we show that depletion of L1 RNA by AMOs drastically alters repeat segregation and compartmentalization on a global scale and at individual loci by Hi-C, DNA FISH and Oligopaint FISH. Collectively, the overall positive correlation and the essentiality of L1 RNA in compartmental organization suggest a functional role for L1 repeats in driving genome folding. These results disfavor the notion that L1 or B1/Alu repeats are merely markers of large chromosomal segments with a different activity, although we cannot firmly exclude this possibility. Our model is also consistent with the growing evidence showing active roles of retrotransposons in re-wiring the genome and regulatory programs.^{53,54,55,56,57,58,59,60,61} As often a challenge for genome organization studies, we note that the current support for going beyond correlative evidence to really show a causative role of L1 is still limited.

L1 RNA tends to co-localize with L1 DNA sequences in regions enriched for the binding of HP1α and H3K9me3. Intriguingly, L1 RNAs can also be detected outside the nuclear and nucleolar periphery (Supplementary information, Fig. S12b). L1 RNA has a short half-life of 40 min.⁶¹ Torres-Padilla and colleagues reported previously that exogenous L1 RNA fails to rescue the chromatin defects upon abnormal silencing of L1 in mouse zygotes.⁷¹ Although these observations disfavor a trans-acting mechanism, it is possible that L1 transcripts might be mobilized to distal L1 DNA sequences. As 20%–40% of L1 repeats are located in annotated euchromatic compartments (Supplementary information, Fig. S1e), these L1 transcripts may be more readily visualized by microscopy. For the majority of L1 enriched in transcriptionally silenced heterochromatic environments, their expression might be temporally regulated in a more transient way, thus creating difficulty for direct visualization and detection. Studies of X chromosome inactivation revealed different roles for silenced and actively transcribed L1s in regulating heterochromatin formation induced by Xist.¹⁰³ Silent L1 repeats participate in the assembly of a heterochromatic compartment, whereas transient transcription of certain young L1s facilitates local propagation of the silencing into regions that would be otherwise prone to escape. Recently, we have reported that depletion of L1 RNA leads to relocation of L1-rich DNA from inactive domains to the nuclear interior and genome-wide de-repression of L1-associated genes.⁶¹ Together, these results indicate a role of L1 RNA in mediating its own DNA’s function. However, the questions arise of where and when L1 RNA is produced, how it is regulated, and whether L1 transcription also plays a role during heterochromatin formation and compartmentalization.

Recombinant HP1α binds and phase separates with all tested DNA and RNA fragments in vitro, yet L1 RNAs in the central conserved region (inter-ORF and ORF2) show high activities compared to L1 sequences in the 5’ and 3’ ends. In mESCs, HP1α tends to bind the central region of L1 repeat DNA. These observations imply some degree of weak sequence-specificity for HP1α in recognizing its targets. This notion is congruent with several reports that HP1α preferentially binds nuclear RNA and rRNA rather than tRNA and randomly chosen RNA,⁹⁸ and binds the forward strand but not the reverse of major repeat RNA.⁹⁹ In addition, extensive co-localization of L1 and HP1α suggests a location-derived specificity in cells, while other DNA- and/or RNA-binding proteins may also endow the specificity of endogenous HP1α to L1 repeats. On the other hand, in the presence of substantial concentrations of macromolecules in a crowded milieu of living cells, even non-specific interactions may contribute considerably to total free energy that drives the phase separation of heterochromatic domains.¹⁰⁴ Together, these results implicate L1 RNA and DNA in the phase separation of heterochromatin formation. Future work should dissect the specific domains or sequence features of L1 that mediate its interactions with HP1α and address their interplays in vivo.

As a chromatin-bound noncoding RNA, L1 RNA may facilitate heterochromatin formation through a number of ways, such as promoting HP1α phase separation by providing a means of multivalency and recruiting RNA-binding proteins to increase local molecular mass, or stabilizing DNA-binding activities of heterochromatin-associated proteins (for example, HP1α binds both L1 RNA and DNA, in a manner similar to YY1¹⁰⁵), or acting as a scaffold to anchor L1-rich chromosomal segments to the nucleolar and nuclear peripheries. Cases for RNA in organizing subnuclear domains have been reported. For example, Xist RNA binds the lamin B receptor in the inner nuclear membrane to anchor the inactive X chromosome.¹⁰⁶ Transcription of satellite repeats precedes chromocenter formation and their transcripts help to recruit SUV39H to centromeric DNA sequences.^107,108 In-depth investigation of the expression, function and mechanism of L1 RNA could be a subject of future studies.

Based on our results together with previous reports, we propose a hypothetical model in which repetitive elements organize the macroscopic structure of the genome at three hierarchical levels (Fig. 7g). First, B1/Alu and L1 repeats serve as the genetic basis for A and B compartments, respectively. The abundance and scattering of these repeat elements in the genome provide numerous nucleation points or ‘structural codes’ to seed the formation of nuclear subdomains. Homotypic clustering of L1-rich or B1/Alu-rich regions initiates genome folding. Second, the structural information embedded in linear genomic DNA repeats is in part transacted by their transcripts, particularly L1 RNA as demonstrated in this study, into spatially ordered chromatin in the nucleus, likely through a phase-separation mechanism. Phase separation of individual subdomains based on differences in activity and protein composition may also lead to their segregation in the nuclear space and the eventual formation of distinct L1-rich heterochromatin and B1/Alu-rich euchromatin domains. However, the contribution of B1/Alu transcripts to 3D genome organization remains to be tested. Third, chromatin compartmentalization may be further reinforced and stabilized through the attachment of repeat DNA sequences to subnuclear structures such as the nucleolus and nuclear speckles.¹⁰⁹ Evidence to support this notion includes our observation that L1 repeats are preferentially localized at the nuclear and nucleolar peripheries, and depletion of L1 RNA disrupts the localization of L1 DNA at these sites.

L1 or B1/Alu DNA tends to associate with distinct sets of histone marks and transcription and chromatin regulators.⁶¹ For example, heterochromatin proteins such as HP1α, KAP1, and SETDB1 are specifically enriched on L1 elements, whereas general transcription factors such as GTF3C2 and CEBPB and RNA Pol II subunits show enriched binding on B1/Alu.⁶¹ We envision that repeat RNA may provide additional layers of regulatory specificity and multivalency to generate molecular crowding. Specific interactions among similar RNAs and proteins at homologous L1 or B1/Alu chromosomal segments may not only enhance subdomain formation, but also promote their segregation through phase separation. Selectivity for similar binding partners has been reported by the Tjian group.¹¹⁰ There does not appear to be cross-talk between different transcription factors; instead, they interact among themselves to form concentrated hubs on synthetic lacO DNA arrays in cells.¹¹⁰ In addition, different biophysical properties of phase-separated L1 or B1 subdomains, for example chromatin compactness, may further promote their segregation.

We have shown that inhibition of Pol II in both embryos and mESCs led to delayed and incomplete formation of L1/B1 segregation and Hi-C plaid patterns without abolishing genome folding. Previously, the Solovei group reported clustering of exogenous sequences with genomic segments of the same repeat class.⁴⁴ When a human artificial chromosome was introduced into mouse ESCs, human L1s spatially interact with mouse L1-rich regions but avoid the SINE-rich regions, and vice versa for human SINEs.⁴⁴ Based on these findings, we posit that L1/B1 repeats may represent autonomously functional units of the genome, and homotypic repeat clustering initiates compartmentalization, which is subsequently facilitated by transcription. An autonomous model for DNA sequence-dependent nuclear organization has been well demonstrated by the self-assembly of tandem repeat sequences such as ribosomal DNA (rDNA) and satellite repeats, which promotes high-order assemblages of the nucleolus and pericentromeric domains, respectively.^111,112

Although different in nucleotide sequences and length, both primate-specific Alu and rodent-specific B1 elements belong to the same class of SINE repeats, which originate from a common ancient ancestor, 7SL RNA, prior to the primate-rodent split about 80 million years ago,¹¹³ arguing against convergent evolution in driving retrotransposons in genome organization. Comparative genomics analysis across mammalian genomes has revealed that large-scale conserved patterns of retrotransposon accumulation follow similar evolutionary trajectories through conservation of synteny, gene regulation and nuclear organization, in spite of dissimilar retrotransposons.¹¹⁴ In addition, it has been reported that the landscape of endogenous L1 elements differs significantly from that of new L1 retrotransposon insertions, which broadly target all regions of the human genome, being insensitive to chromatin state.^115,116 This suggests that purifying selection, rather than biased insertions, reshapes the genomic distributions of L1 and Alu/B1 post their integration.^{61,115,117,118} We speculate that the genetic marking of compartments with distinct activity is so important that during evolution it has imposed selective pressures on the most abundant subfamilies of LINE and SINE repeats for them to accumulate in specific compartments. In comparison, we find ERV retrotransposons to be randomly distributed, which is consistent with a previous report by the Ren group that no genome-wide enrichment of ERVs was found at TAD boundary.¹³ Recently, the Ren group reported that transcriptionally active HERV-H repeats, a subclass of ERVs, demarcate TADs in human hESCs.^119,120 In fact, among > 1000 HERV-H repeats in human, < 50 show detectable expression in hESCs and only ~20 have a TAD boundary structure as indicated by directionality index. It is likely that a very small proportion of ERVs act at TAD and loop boundaries for more specific local chromatin regulation in a few genomic loci.

In summary, our study provides important initial evidence to unravel the fundamental principle of 3D genome organization. As discovered by Anfinsen in the late 1950s, the amino acid sequence of a protein determines its structure and function.^121,122 Analogously, we propose that the primary DNA sequences, particularly L1 and B1/Alu elements, dictate how the genome folds and functions. We envision that genome folding occurs autonomously, through a process that is driven by homotypic clustering of regions containing L1 or B1/Alu repeat sequences, which could be further facilitated by transcription processes, transcripts produced at these repeat elements, regulatory proteins, and perhaps a combination of these factors that act above and beyond repeat DNA sequences to influence their chromatin states. The widespread yet conserved distribution of homologous repeats in mammalian genomes render them a unique advantage to perform such a task as the blueprint for genome organization and function, compared to histone marks and transcription activities. Structural information embedded in L1 and B1/Alu repeats may be universally recognizable, thus contributing to the high degree of stability and conservation in the compartmental organization that is observed across mouse and human cell types. We want to note that L1 and B1/Alu compartmental domains represent a structural and functional ground state of chromatin organization, on which subsequent regulatory features, such as dynamic enhancer–promoter interactions, are overlaid. Nevertheless, the same principle of homotypic clustering, phase separation and spatial segregation of chromosomal segments may be reiterated at different genomic scales, consequently folding the genome. Lastly, our study calls for more work towards a complete understanding how genome folding occurs, particularly, on revealing the causality and mechanisms how these repetitive sequences act.

Materials and methods

Genomic analysis of repeat sequences

The reference catalog of repetitive elements was built from RepeatMasker annotations.¹²³ We used the 10-kb bin in all repeat analyses unless otherwise indicated. For de novo compartment calling, the mouse genome was first segmented into 100-kb bins, and the densities of L1 and B1 repeats were normalized to their genome background (19% for L1 and 3% for B1), and log₂ of the ratio of normalized B1 to L1 densities [log₂(B1/L1)] was calculated. The adjacent regions with size >500 kb were kept (85% left) and assigned as B1-rich (540) or L1-rich (648) compartments with a median size of 1.1 Mb to 1.3 Mb (Supplementary information, Table S1).

mESC and embryonic experiments

For cell-cycle synchronization, mESC (J1) cells¹²⁴ were treated with 1.25 mM Thymidine for 14 h and then 50 ng/mL Nocodazole for 7 h. G1 and S phase cells were collected at 1.5 h and 7 h, respectively, after Nocodazole release. mESCs were treated with DRB (100 μM) and ActD (1 μg/mL) for 3 h to inhibit transcription. For heterochromatin fractionation, sucrose gradient centrifugation of mESC nuclear extracts was performed as previously reported with modifications.^125,126 For embryonic microinjection, ASO (5 μM) and AMO (1 mM) was injected into PN3 zygotes on a Leica DMI3000B microscope equipped with a Leica micromanipulator.

Imaging analysis

DNA FISH,⁴ immuno-FISH,¹⁰⁹ and RNA FISH¹²⁷ in mESCs and embryos were performed as previously described with modifications. FISH Probes targeting consensus sequence of L1 and B1 (Fig. 2a) were used for both mESCs and embryos (Supplementary information, Table S2). For Oligopaint FISH analysis of non-repetitive sequences in A/B compartments, each of four regions in ~100-kb length, including B1-rich regions F, H and R, and L1-rich region e, is targeted by 500 DNA probes; and each of two large L1-rich regions g and q (~1 Mb) is targeted by 4500 DNA probes. Probe sets are shown in Supplementary information, Table S3. Image acquisition and quantification were conducted with UltraVIEW VoX spinning disc microscope (PerkinElmer) and Imaris version 8.4.1.

RNA depletion by AMO or ASO

To deplete L1 RNA in mouse embryos and mESCs, we used the same morpholino antisense oligonucleotide (AMO or ASO) as Percharde et al. previously used.⁸⁷ To deplete B1 RNA, two antisense oligonucleotide (ASO) sequences targeting the B1 consensus sequence were synthesized by IDT (Integrated DNA technologies). Sequences are listed in Supplementary information, Table S4.

Hi-C analysis

Small-scale in situ Hi-C (sisHi-C) was performed as previously described,⁸¹ following two independent experiments of L1 or scramble (SCR) AMO transfection. The summary statistics for Hi-C quality control is shown in Supplementary information, Table S5. Paired-end raw reads of Hi-C library data were processed with HiCPro (version 2.7.7) as described.¹²⁸ A and B compartments⁶ and compartmentalization strength^23,34,94 were identified as described previously.

Segregation index

The FISH-based segregation index is defined as the negative value of Pearson’s correlation coefficient of L1 and B1 DNA signals in the nucleus. The Hi-C-based segregation index is defined as the ratio of homotypic versus heterotypic interaction frequencies between L1 and B1/Alu subfamilies.

In vitro pull-down and phase separation assays

A series of eight 1-kb fragments, designated as F1 to F8, were produced to cover the full-length 6544-kb L1 sequence by PCR (Supplementary information, Fig. 13a). Two artificial fragments comprising eight tandem copies of either B1 element (8× B1, in 1-kb length) or scrambled B1 sequence (8× SCR, in 1-kb length) were used for comparison. Biotin-labeled RNA was obtained by in vitro transcription for the pull-down experiment. We purified the recombinant human HP1α protein as previously described.³⁸ In phase separation assays, recombinant HP1α with DNA or RNA fragments for L1, 8× B1, or 8× SCR were incubated at 4 °C overnight.

Statistical analysis

Statistical analyses were carried out using Excel or R (version 3.4.3).

Please also see Supplementary information, Data S1 for details.

Data availability

Sequencing data generated in this study have been deposited into the Gene Expression Omnibus database under accession numbers GSE123806 and GSE125766.

References

Annunziato, A. T. DNA packaging: nucleosomes and chromatin. Nature Educ. 1, 26 (2008).
Google Scholar
Mirny, L. A. The fractal globule as a model of chromatin architecture in the cell. Chromosome Res. 19, 37–51 (2011).
Article CAS PubMed PubMed Central Google Scholar
Boettiger, A. N. et al. Super-resolution imaging reveals distinct chromatin folding for different epigenetic states. Nature 529, 418–422 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wang, S. et al. Spatial organization of chromatin domains and compartments in single chromosomes. Science 353, 598–602 (2016).
Article CAS PubMed PubMed Central Google Scholar
Dekker, J., Rippe, K., Dekker, M. & Kleckner, N. Capturing chromosome conformation. Science 295, 1306–1311 (2002).
Article CAS PubMed Google Scholar
Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
Article CAS PubMed PubMed Central Google Scholar
Dekker, J., Marti-Renom, M. A. & Mirny, L. A. Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data. Nat. Rev. Genet. 14, 390–403 (2013).
Article CAS PubMed PubMed Central Google Scholar
Rivera-Mulia, J. C. & Gilbert, D. M. Replication timing and transcriptional control: beyond cause and effect-part III. Curr. Opin. Cell Biol. 40, 168–178 (2016).
Article CAS PubMed PubMed Central Google Scholar
Gibcus, J. H. & Dekker, J. The hierarchy of the 3D genome. Mol. Cell 49, 773–782 (2013).
Article CAS PubMed PubMed Central Google Scholar
Bonev, B. & Cavalli, G. Organization and function of the 3D genome. Nat. Rev. Genet. 17, 661–678 (2016).
Article CAS PubMed Google Scholar
Buchwalter, A., Kaneshiro, J. M. & Hetzer, M. W. Coaching from the sidelines: the nuclear periphery in genome regulation. Nat. Rev. Genet. 20, 39–50 (2019).
Article CAS PubMed PubMed Central Google Scholar
Solovei, I., Thanisch, K. & Feodorova, Y. How to rule the nucleus: divide et impera. Curr. Opin. Cell Biol. 40, 47–59 (2016).
Article CAS PubMed Google Scholar
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kadauke, S. & Blobel, G. A. Chromatin loops in gene regulation. Biochim. Biophys. Acta 1789, 17–25 (2009).
Article CAS PubMed Google Scholar
Dixon, J. R. et al. Chromatin architecture reorganization during stem cell differentiation. Nature 518, 331–336 (2015).
Article CAS PubMed PubMed Central Google Scholar
Stadhouders, R. et al. Transcription factors orchestrate dynamic interplay between genome topology and gene regulation during cell reprogramming. Nat. Genet. 50, 238–249 (2018).
Article CAS PubMed PubMed Central Google Scholar
Schmitt, A. D. et al. A compendium of chromatin contact maps reveals spatially active regions in the human genome. Cell Rep. 17, 2042–2059 (2016).
Article CAS PubMed PubMed Central Google Scholar
Rennie, S., Dalby, M., van Duin, L. & Andersson, R. Transcriptional decomposition reveals active chromatin architectures and cell specific regulatory interactions. Nat. Commun. 9, 487 (2018).
Article PubMed PubMed Central Google Scholar
Stevens, T. J. et al. 3D structures of individual mammalian genomes studied by single-cell Hi-C. Nature 544, 59–64 (2017).
Article CAS PubMed PubMed Central Google Scholar
Rao, S. S. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
Article CAS PubMed PubMed Central Google Scholar
Splinter, E. et al. CTCF mediates long-range chromatin looping and local histone modification in the beta-globin locus. Genes Dev. 20, 2349–2354 (2006).
Article CAS PubMed PubMed Central Google Scholar
Rao, S. S. P. et al. Cohesin loss eliminates all loop domains. Cell 171, 305–320 (2017).
Article CAS PubMed PubMed Central Google Scholar
Nora, E. P. et al. Targeted degradation of CTCF decouples local insulation of chromosome domains from genomic compartmentalization. Cell 169, 930–944 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wutz, G. et al. Topologically associating domains and chromatin loops depend on cohesin and are regulated by CTCF, WAPL, and PDS5 proteins. EMBO J. 36, 3573–3599 (2017).
Article CAS PubMed PubMed Central Google Scholar
Schwarzer, W. et al. Two independent modes of chromatin organization revealed by cohesin removal. Nature 551, 51–56 (2017).
Article PubMed PubMed Central Google Scholar
Bintu, B. et al. Super-resolution chromatin tracing reveals domains and cooperative interactions in single cells. Science 362, eaau1783 (2018).
Google Scholar
Gassler, J. et al. A mechanism of cohesin-dependent loop extrusion organizes zygotic genome architecture. Embo J. 36, 3600–3618 (2017).
Article CAS PubMed PubMed Central Google Scholar
Haarhuis, J. H. I. et al. The cohesin release factor WAPL restricts chromatin loop extension. Cell 169, 693–707 e614 (2017).
Article CAS PubMed PubMed Central Google Scholar
Nuebler, J., Fudenberg, G., Imakaev, M., Abdennur, N. & Mirny, L. A. Chromatin organization by an interplay of loop extrusion and compartmental segregation. Proc. Natl. Acad. Sci. USA 115, E6697–E6706 (2018).
Article CAS PubMed Google Scholar
Kubo, N. et al. Preservation of chromatin organization after acute loss of CTCF in mouse embryonic stem cells. bioRxiv https://doi.org/10.1101/118737 (2017).
van Steensel, B. & Belmont, A. S. Lamina-associated domains: links with chromosome architecture, heterochromatin, and gene repression. Cell 169, 780–791 (2017).
Article PubMed PubMed Central Google Scholar
Guelen, L. et al. Domain organization of human chromosomes revealed by mapping of nuclear lamina interactions. Nature 453, 948–951 (2008).
Article CAS PubMed Google Scholar
Zullo, J. M. et al. DNA sequence-dependent compartmentalization and silencing of chromatin at the nuclear lamina. Cell 149, 1474–1487 (2012).
Article CAS PubMed Google Scholar
Falk, M. et al. Heterochromatin drives compartmentalization of inverted and conventional nuclei. Nature 570, 395–399 (2019).
Article CAS PubMed PubMed Central Google Scholar
Wijchers, P. J. et al. Characterization and dynamics of pericentromere-associated domains in mice. Genome Res. 25, 958–969 (2015).
Article CAS PubMed PubMed Central Google Scholar
Jost, D., Carrivain, P., Cavalli, G. & Vaillant, C. Modeling epigenome folding: formation and dynamics of topologically associated chromatin domains. Nucleic Acids Res. 42, 9553–9561 (2014).
Article CAS PubMed PubMed Central Google Scholar
Strom, A. R. et al. Phase separation drives heterochromatin domain formation. Nature 547, 241–245 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wang, L. et al. Histone modifications regulate chromatin compartmentalization by contributing to a phase separation mechanism. Mol. Cell 76, 646–659 (2019).
Article CAS PubMed Google Scholar
Maison, C. et al. Higher-order structure in pericentric heterochromatin involves a distinct pattern of histone modification and an RNA component. Nat. Genet. 30, 329–334 (2002).
Article PubMed Google Scholar
Cook, P. R. & Marenduzzo, D. Transcription-driven genome organization: a model for chromosome structure and the regulation of gene expression tested through simulations. Nucleic Acids Res. 46, 9895–9906 (2018).
Article CAS PubMed PubMed Central Google Scholar
Rowley, M. J. et al. Evolutionarily conserved principles predict 3D chromatin organization. Mol. Cell 67, 837–852 (2017).
Article CAS PubMed PubMed Central Google Scholar
Cook, P. R. Predicting three-dimensional genome structure from transcriptional activity. Nat. Genet. 32, 347–352 (2002).
Article CAS PubMed Google Scholar
Tang, S. J. Potential role of phase separation of repetitive DNA in chromosomal organization. Genes (Basel) 8, 279 (2017).
Article Google Scholar
van de Werken, H. J. G. et al. Small chromosomal regions position themselves autonomously according to their chromatin class. Genome Res. 27, 922–933 (2017).
Article PubMed PubMed Central Google Scholar
Solovei, I. et al. Nuclear architecture of rod photoreceptor cells adapts to vision in mammalian evolution. Cell 137, 356–368 (2009).
Article CAS PubMed Google Scholar
Amendola, M. & van Steensel, B. Nuclear lamins are not required for lamina-associated domain organization in mouse embryonic stem cells. EMBO Rep. 16, 610–617 (2015).
Article CAS PubMed PubMed Central Google Scholar
Lenain, C. et al. Massive reshaping of genome-nuclear lamina interactions during oncogene-induced senescence. Genome Res. 27, 1634–1644 (2017).
Article CAS PubMed PubMed Central Google Scholar
Zheng, X. et al. Lamins organize the global three-dimensional genome from the nuclear periphery. Mol. Cell 71, 802–815 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chang, L. et al. Nuclear peripheral chromatin-lamin B1 interaction is required for global integrity of chromatin architecture and dynamics in human cells. Protein Cell https://doi.org/10.1007/s13238-020-00794-8 (2020).
Biemont, C. A brief history of the status of transposable elements: from junk DNA to major players in evolution. Genetics 186, 1085–1093 (2010).
Article CAS PubMed PubMed Central Google Scholar
de Koning, A. P., Gu, W., Castoe, T. A., Batzer, M. A. & Pollock, D. D. Repetitive elements may comprise over two-thirds of the human genome. PLoS Genet. 7, e1002384 (2011).
Article PubMed PubMed Central Google Scholar
Orgel, L. E. & Crick, F. H. Selfish DNA: the ultimate parasite. Nature 284, 604–607 (1980).
Article CAS PubMed Google Scholar
Rebollo, R., Romanish, M. T. & Mager, D. L. Transposable elements: an abundant and natural source of regulatory sequences for host genes. Annu. Rev. Genet. 46, 21–42 (2012).
Article CAS PubMed Google Scholar
Bourque, G. et al. Evolution of the mammalian transcription factor binding repertoire via transposable elements. Genome Res. 18, 1752–1762 (2008).
Article CAS PubMed PubMed Central Google Scholar
Lynch, V. J., Leclerc, R. D., May, G. & Wagner, G. P. Transposon-mediated rewiring of gene regulatory networks contributed to the evolution of pregnancy in mammals. Nat. Genet. 43, 1154–1159 (2011).
Article CAS PubMed Google Scholar
Grow, E. J. et al. Intrinsic retroviral reactivation in human preimplantation embryos and pluripotent cells. Nature 522, 221–225 (2015).
Article CAS PubMed PubMed Central Google Scholar
Chuong, E. B., Elde, N. C. & Feschotte, C. Regulatory evolution of innate immunity through co-option of endogenous retroviruses. Science 351, 1083–1087 (2016).
Article CAS PubMed PubMed Central Google Scholar
Durruthy-Durruthy, J. et al. The primate-specific noncoding RNA HPAT5 regulates pluripotency during human preimplantation development and nuclear reprogramming. Nat. Genet. 48, 44–52 (2016).
Article CAS PubMed Google Scholar
Wang, J. et al. Primate-specific endogenous retrovirus-driven transcription defines naive-like stem cells. Nature 516, 405–409 (2014).
Article CAS PubMed Google Scholar
Liu, N. et al. Selective silencing of euchromatic L1s revealed by genome-wide screens for L1 regulators. Nature 553, 228–232 (2018).
Article CAS PubMed Google Scholar
Lu, J. Y. et al. Genomic repeats categorize genes with distinct functions for orchestrated regulation. Cell Rep. 30, 3296–3311 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mandal, P. K. & Kazazian, H. H. Jr. SnapShot: vertebrate transposons. Cell 135, 192–192.e1 (2008).
Article CAS PubMed Google Scholar
Taylor, M. S. et al. Affinity proteomics reveals human host factors implicated in discrete stages of LINE-1 retrotransposition. Cell 155, 1034–1048 (2013).
Article CAS PubMed PubMed Central Google Scholar
Mouse Genome Sequencing Consortium et al. Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520–562 (2002).
Article Google Scholar
Lander, E. S. et al. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).
Article CAS PubMed Google Scholar
Jurka, J., Kohany, O., Pavlicek, A., Kapitonov, V. V. & Jurka, M. V. Duplication, coclustering, and selection of human Alu retrotransposons. Proc. Natl. Acad. Sci. USA 101, 1268–1272 (2004).
Article CAS PubMed Google Scholar
Korenberg, J. R. & Rykowski, M. C. Human genome organization: Alu, lines, and the molecular structure of metaphase chromosome bands. Cell 53, 391–400 (1988).
Article CAS PubMed Google Scholar
Bolzer, A. et al. Three-dimensional maps of all chromosomes in human male fibroblast nuclei and prometaphase rosettes. PLoS Biol. 3, e157 (2005).
Article PubMed PubMed Central Google Scholar
Meuleman, W. et al. Constitutive nuclear lamina-genome interactions are highly conserved and associated with A/T-rich sequence. Genome Res. 23, 270–280 (2013).
Article CAS PubMed PubMed Central Google Scholar
Deininger, P. Alu elements: know the SINEs. Genome Biol. 12, 236 (2011).
Article CAS PubMed PubMed Central Google Scholar
Jachowicz, J. W. et al. LINE-1 activation after fertilization regulates global chromatin accessibility in the early mouse embryo. Nat. Genet. 49, 1502–1510 (2017).
Article CAS PubMed Google Scholar
Fraser, J. et al. Hierarchical folding and reorganization of chromosomes are linked to transcriptional changes in cellular differentiation. Mol. Syst. Biol. 11, 852 (2015).
Article PubMed PubMed Central Google Scholar
Sexton, T. & Cavalli, G. The role of chromosome domains in shaping the functional genome. Cell 160, 1049–1059 (2015).
Article CAS PubMed Google Scholar
Dixon, J. R., Gorkin, D. U. & Ren, B. Chromatin domains: the unit of chromosome organization. Mol. Cell 62, 668–680 (2016).
Article CAS PubMed PubMed Central Google Scholar
Mateos-Langerak, J. et al. Pericentromeric heterochromatin domains are maintained without accumulation of HP1. Mol. Biol. Cell 18, 1464–1471 (2007).
Article CAS PubMed PubMed Central Google Scholar
Peric-Hupkes, D. et al. Molecular maps of the reorganization of genome-nuclear lamina interactions during differentiation. Mol. Cell 38, 603–613 (2010).
Article CAS PubMed PubMed Central Google Scholar
Nemeth, A. et al. Initial genomics of the human nucleolus. PLoS Genet. 6, e1000889 (2010).
Article PubMed PubMed Central Google Scholar
Nagano, T. et al. Cell-cycle dynamics of chromosomal organization at single-cell resolution. Nature 547, 61–67 (2017).
Article CAS PubMed PubMed Central Google Scholar
Naumova, N. et al. Organization of the mitotic chromosome. Science 342, 948–953 (2013).
Article CAS PubMed PubMed Central Google Scholar
Ke, Y. et al. 3D chromatin structures of mature gametes and structural reprogramming during mammalian embryogenesis. Cell 170, 367–381 e320 (2017).
Article CAS PubMed Google Scholar
Du, Z. et al. Allelic reprogramming of 3D chromatin architecture during early mammalian development. Nature 547, 232–235 (2017).
Article CAS PubMed Google Scholar
Flyamer, I. M. et al. Single-nucleus Hi-C reveals unique chromatin reorganization at oocyte-to-zygote transition. Nature 544, 110–114 (2017).
Article CAS PubMed PubMed Central Google Scholar
Qiu, J. J. et al. Delay of ZGA initiation occurred in 2-cell blocked mouse embryos. Cell Res. 13, 179–185 (2003).
Article CAS PubMed Google Scholar
Fadloun, A. et al. Chromatin signatures and retrotransposon profiling in mouse embryos reveal regulation of LINE-1 by RNA. Nat. Struct. Mol. Biol. 20, 332–338 (2013).
Article CAS PubMed Google Scholar
Rothstein, J. L. et al. Gene-expression during preimplantation mouse development. Gene Dev. 6, 1190–1201 (1992).
Article CAS PubMed Google Scholar
Abe, K. et al. The first murine zygotic transcription is promiscuous and uncoupled from splicing and 3’ processing. EMBO J. 34, 1523–1537 (2015).
Article CAS PubMed PubMed Central Google Scholar
Percharde, M. et al. A LINE1-nucleolin partnership regulates early development and ESC Identity. Cell 174, 391–405 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chen, C., Ara, T. & Gautheret, D. Using Alu elements as polyadenylation sites: a case of retroposon exaptation. Mol. Biol. Evol. 26, 327–334 (2009).
Article CAS PubMed Google Scholar
Dominissini, D., Moshitch-Moshkovitz, S., Amariglio, N. & Rechavi, G. Adenosine-to-inosine RNA editing meets cancer. Carcinogenesis 32, 1569–1577 (2011).
Article CAS PubMed Google Scholar
Lubelsky, Y. & Ulitsky, I. Sequences enriched in Alu repeats drive nuclear localization of long RNAs in human cells. Nature 555, 107–111 (2018).
Article CAS PubMed PubMed Central Google Scholar
Polak, P. & Domany, E. Alu elements contain many binding sites for transcription factors and may play a role in regulation of developmental processes. BMC Genom. 7, 133 (2006).
Article Google Scholar
Houda Belaghzal, T. B. et al. Compartment-dependent chromatin interaction dynamics revealed by liquid chromatin Hi-C. bioRxiv https://doi.org/10.1101/704957 (2019).
Xie, Y., Rosser, J. M., Thompson, T. L., Boeke, J. D. & An, W. Characterization of L1 retrotransposition with high-throughput dual-luciferase assays. Nucleic Acids Res. 39, e16 (2010).
Article PubMed PubMed Central Google Scholar
Abramo, K. et al. A chromosome folding intermediate at the condensin-to-cohesin transition during telophase. Nat. Cell Biol. 21, 1393–1402 (2019).
Article CAS PubMed PubMed Central Google Scholar
Rhodes, J. D. P. et al. Cohesin disrupts polycomb-dependent chromosome interactions in embryonic stem cells. Cell Rep. 30, 820–835 (2020).
Article CAS PubMed PubMed Central Google Scholar
Keller, C. et al. HP1(Swi6) mediates the recognition and destruction of heterochromatic RNA transcripts. Mol. Cell 47, 215–227 (2012).
Article CAS PubMed Google Scholar
Stunnenberg, R. et al. H3K9 methylation extends across natural boundaries of heterochromatin in the absence of an HP1 protein. EMBO J. 34, 2789–2803 (2015).
Article CAS PubMed PubMed Central Google Scholar
Muchardt, C. et al. Coordinated methyl and RNA binding is required for heterochromatin localization of mammalian HP1 alpha. EMBO Rep. 3, 975–981 (2002).
Article CAS PubMed PubMed Central Google Scholar
Maison, C. et al. SUMOylation promotes de novo targeting of HP1alpha to pericentric heterochromatin. Nat. Genet. 43, 220–227 (2011).
Article CAS PubMed Google Scholar
Larson, A. G. et al. Liquid droplet formation by HP1alpha suggests a role for phase separation in heterochromatin. Nature 547, 236–240 (2017).
Article CAS PubMed PubMed Central Google Scholar
Zwicker, D., Decker, M., Jaensch, S., Hyman, A. A. & Julicher, F. Centrosomes are autocatalytic droplets of pericentriolar material organized by centrioles. Proc. Natl. Acad. Sci. USA 111, E2636–E2645 (2014).
Article CAS PubMed Google Scholar
Li, P. et al. Phase transitions in the assembly of multivalent signalling proteins. Nature 483, 336–340 (2012).
Article CAS PubMed PubMed Central Google Scholar
Chow, J. C. et al. LINE-1 activity in facultative heterochromatin formation during X chromosome inactivation. Cell 141, 956–969 (2010).
Article CAS PubMed Google Scholar
Marenduzzo, D., Finan, K. & Cook, P.R. The depletion attraction: an underappreciated force driving cellular organization. J. Cell Biol. 175, 681–686 (2006).
Sigova, A. A. et al. Transcription factor trapping by RNA in gene regulatory elements. Science 350, 978-981 (2015).
Article CAS PubMed PubMed Central Google Scholar
McHugh, C. A. et al. The Xist lncRNA interacts directly with SHARP to silence transcription through HDAC3. Nature 521, 232–236 (2015).
Article CAS PubMed PubMed Central Google Scholar
Probst, A. V. et al. A strand-specific burst in transcription of pericentric satellites is required for chromocenter formation and early mouse development. Dev. Cell 19, 625–638 (2010).
Article CAS PubMed Google Scholar
Velazquez Camacho, O. et al. Major satellite repeat RNA stabilize heterochromatin retention of Suv39h enzymes by RNA-nucleosome association and RNA: DNA hybrid formation. Elife 6, e25293 (2017).
Quinodoz, S. A. et al. Higher-order inter-chromosomal hubs shape 3D genome organization in the nucleus. Cell 174, 744–757 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chong, S. et al. Imaging dynamic and selective low-complexity domain interactions that control gene transcription. Science 361. eaar2555 (2018).
Falahati, H., Pelham-Webb, B., Blythe, S. & Wieschaus, E. Nucleation by rRNA dictates the precision of nucleolus assembly. Curr. Biol. 26, 277–285 (2016).
Article CAS PubMed PubMed Central Google Scholar
Tjong, H. et al. Population-based 3D genome structure analysis reveals driving forces in spatial genome organization. Proc. Natl. Acad. Sci. USA 113, E1663–E1672 (2016).
Article CAS PubMed Google Scholar
Tsirigos, A. & Rigoutsos, I. Alu and b1 repeats have been selectively retained in the upstream and intronic regions of genes of specific functional classes. PLoS Comput. Biol. 5, e1000610 (2009).
Article PubMed PubMed Central Google Scholar
Buckley, R. M., Kortschak, R. D., Raison, J. M. & Adelson, D. L. Similar evolutionary trajectories for retrotransposon accumulation in mammals. Genome Biol. Evol. 9, 2336–2353 (2017).
Article CAS PubMed PubMed Central Google Scholar
Sultana, T. et al. The landscape of L1 retrotransposons in the human genome is shaped by pre-insertion sequence biases and post-insertion selection. Mol. Cell 74, 555–570 e557 (2019).
Article CAS PubMed Google Scholar
Flasch, D. A. et al. Genome-wide de novo L1 retrotransposition connects endonuclease activity with replication. Cell 177, 837–851 (2019).
Article CAS PubMed PubMed Central Google Scholar
Pavlicek, A. et al. Similar integration but different stability of Alus and LINEs in the human genome. Gene 276, 39–45 (2001).
Article CAS PubMed Google Scholar
Graham, T. & Boissinot, S. The genomic distribution of L1 elements: the role of insertion bias and natural selection. J. Biomed. Biotechnol. 2006, 75327 (2006).
Article PubMed PubMed Central Google Scholar
Kruse, K. et al. Transposable elements drive reorganisation of 3D chromatin during early embryogenesis. bioRxiv https://doi.org/10.1101/523712 (2019).
Zhang, Y. et al. Transcriptionally active HERV-H retrotransposons demarcate topologically associating domains in human pluripotent stem cells. Nat. Genet. 51, 1380–1388 (2019).
Article CAS PubMed PubMed Central Google Scholar
Anfinsen, C. B. The formation and stabilization of protein structure. Biochem. J. 128, 737–749 (1972).
Article CAS PubMed PubMed Central Google Scholar
Anfinsen, C. B., Redfield, R. R., Choate, W. L., Page, J. & Carroll, W. R. Studies on the gross structure, cross-linkages, and terminal sequences in ribonuclease. J. Biol. Chem. 207, 201–210 (1954).
Article CAS PubMed Google Scholar
Tempel, S. Using and understanding RepeatMasker. Methods Mol. Biol. 859, 29–51 (2012).
Article CAS PubMed Google Scholar
Shen, X. et al. Jumonji modulates polycomb activity and self-renewal versus differentiation of stem cells. Cell 139, 1303–1314 (2009).
Article PubMed PubMed Central Google Scholar
Becker, J. S. et al. Genomic and proteomic resolution of heterochromatin and its restriction of alternate fate genes. Mol. Cell 68, 1023–1037 (2017).
Article CAS PubMed PubMed Central Google Scholar
Bhatt, D. M. et al. Transcript dynamics of proinflammatory genes revealed by sequence analysis of subcellular RNA fractions. Cell 150, 279–290 (2012).
Article CAS PubMed PubMed Central Google Scholar
Tsanov, N. et al. smiFISH and FISH-quant–a flexible single RNA detection approach with super-resolution capability. Nucleic Acids Res. 44, e165 (2016).
Article PubMed PubMed Central Google Scholar
Servant, N. et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16, 259 (2015).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank the Shen Laboratory members for insightful discussions, and we thank Benoit G. Bruneau and Robert J. Klose for providing CTCF-AID and RAD21-AID mESCs. This work was supported in part by the National Basic Research Program of China (2018YFA0107604, 2017YFA0504204 to X.S.); the National Natural Science Foundation of China (31925015, 31630095 to X.S.; 21825401 to Y.S.); the National Key R&D Program of China (2017YFA0505300 to Y.S.); Beijing Advanced Innovation Center for Structural Biology (to X.S). J.Y.L. is a Shuimu Tsinghua Scholar.

Author information

These authors contributed equally: Ting Wang, Yafei Yin, Ge Zhan, Xue Han, Ke Zhang, Yibing Tao
These authors contributed equally as co-first authors: J. Yuyang Lu, Lei Chang, Tong Li

Authors and Affiliations

Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
J. Yuyang Lu, Tong Li, Ting Wang, Yafei Yin, Ge Zhan, Xue Han, Ke Zhang, Yibing Tao, Liang Wang, Qi Peng, Pixi Yan, Hui Zhang, Xianju Bi, Wen Shao, Yantao Hong, Zhongyang Wu, Peizhe Wang, Wenzhi Li, Jing Zhang, Zai Chang, Pilong Li, Wei Xie, Jie Na & Xiaohua Shen
State Key Laboratory of Membrane Biology, Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, and College of Future Technology, Peking University, Beijing, 100871, China
Lei Chang, Yingping Hou & Yujie Sun
Bioland Laboratory (Guangzhou Regenerative Medicine and Health Guangdong Laboratory), Guangzhou, Guangdong, 510005, China
Lei Chang
MRC London Institute of Medical Sciences (LMS), London, W120NN, UK
Michelle Percharde
Institute of Clinical Sciences (ICS), Faculty of Medicine, Imperial College London, London, W120NN, UK
Michelle Percharde
National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
Runze Ma & Bing Zhu
Lunenfeld-Tanenbaum Research Institute, University of Toronto, Toronto, Ontario, M5T 3H7, Canada
Miguel Ramalho-Santos

Authors

J. Yuyang Lu
View author publications
You can also search for this author in PubMed Google Scholar
Lei Chang
View author publications
You can also search for this author in PubMed Google Scholar
Tong Li
View author publications
You can also search for this author in PubMed Google Scholar
Ting Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yafei Yin
View author publications
You can also search for this author in PubMed Google Scholar
Ge Zhan
View author publications
You can also search for this author in PubMed Google Scholar
Xue Han
View author publications
You can also search for this author in PubMed Google Scholar
Ke Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yibing Tao
View author publications
You can also search for this author in PubMed Google Scholar
Michelle Percharde
View author publications
You can also search for this author in PubMed Google Scholar
Liang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Qi Peng
View author publications
You can also search for this author in PubMed Google Scholar
Pixi Yan
View author publications
You can also search for this author in PubMed Google Scholar
Hui Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xianju Bi
View author publications
You can also search for this author in PubMed Google Scholar
Wen Shao
View author publications
You can also search for this author in PubMed Google Scholar
Yantao Hong
View author publications
You can also search for this author in PubMed Google Scholar
Zhongyang Wu
View author publications
You can also search for this author in PubMed Google Scholar
Runze Ma
View author publications
You can also search for this author in PubMed Google Scholar
Peizhe Wang
View author publications
You can also search for this author in PubMed Google Scholar
Wenzhi Li
View author publications
You can also search for this author in PubMed Google Scholar
Jing Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zai Chang
View author publications
You can also search for this author in PubMed Google Scholar
Yingping Hou
View author publications
You can also search for this author in PubMed Google Scholar
Bing Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Miguel Ramalho-Santos
View author publications
You can also search for this author in PubMed Google Scholar
Pilong Li
View author publications
You can also search for this author in PubMed Google Scholar
Wei Xie
View author publications
You can also search for this author in PubMed Google Scholar
Jie Na
View author publications
You can also search for this author in PubMed Google Scholar
Yujie Sun
View author publications
You can also search for this author in PubMed Google Scholar
Xiaohua Shen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.S. and J.Y.L. conceived of the study. X.S. and Y.S. supervised the study. X.S., Y.S., J.Y.L., and L.C. designed the experiments. J.Y.L. performed bioinformatics analysis. L.C. conducted all imaging experiments with help from Y.Y., G.Z., T.W., X.H., and J.Z. T.L. performed phase separation and RNA pull-down experiments. K.Z., Q.P., and P.Y. prepared the Hi-C library. T.W., W.L., and P.W. did embryo injections. P.L. and L.W. provide HP1α proteins. W.X., Y.T., J.N., H.Z., X.B., W.S., Z.W., R.M., B.Z., M.R.-S., M.P., Z.C., Y. Hong, and Y. Hou contributed technical assistance/suggestions. X.S., J.Y.L., and L.C. wrote the manuscript with input from all authors. T.W., Y.Y., G.Z., X.H., K.Z. and Y.T. contributed equally to this work.

Corresponding authors

Correspondence to Yujie Sun or Xiaohua Shen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Supplementary information

Supplementary information, Figure S1

Supplementary information, Figure S2

Supplementary information, Figure S3

Supplementary information, Figure S4

Supplementary information, Figure S5

Supplementary information, Figure S6

Supplementary information, Figure S7

Supplementary information, Figure S8

Supplementary information, Figure S9

Supplementary information, Figure S10

Supplementary information, Figure S11

Supplementary information, Figure S12

Supplementary information, Figure S13

Supplementary information, Table S1

Supplementary information, Table S2

Supplementary information, Table S3

Supplementary information, Table S4

Supplementary information, Table S5

Supplementary information, Text S1

Supplementary information, Data S1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lu, J.Y., Chang, L., Li, T. et al. Homotypic clustering of L1 and B1/Alu repeats compartmentalizes the 3D genome. Cell Res 31, 613–630 (2021). https://doi.org/10.1038/s41422-020-00466-6

Download citation

Received: 11 August 2020
Accepted: 17 December 2020
Published: 29 January 2021
Issue Date: June 2021
DOI: https://doi.org/10.1038/s41422-020-00466-6

This article is cited by

Emergence of replication timing during early mammalian development
- Tsunetoshi Nakatani
- Tamas Schauer
- Maria-Elena Torres-Padilla
Nature (2024)
Mapping crossover events of mouse meiotic recombination by restriction fragment ligation-based Refresh-seq
- Yan Wang
- Yijun Chen
- Fuchou Tang
Cell Discovery (2024)
Regulation and function of transposable elements in cancer genomes
- Michael Lee
- Syed Farhan Ahmad
- Jian Xu
Cellular and Molecular Life Sciences (2024)
Conserved chromatin and repetitive patterns reveal slow genome evolution in frogs
- Jessen V. Bredeson
- Austin B. Mudd
- Daniel S. Rokhsar
Nature Communications (2024)
Dynamic chromatin architectures provide insights into the genetics of cattle myogenesis
- Jie Cheng
- Xiukai Cao
- Hong Chen
Journal of Animal Science and Biotechnology (2023)