Signal-induced enhancer activation requires Ku70 to read topoisomerase1–DNA covalent complexes

Enhancer activation serves as the main mechanism regulating signal-dependent transcriptional programs, ensuring cellular plasticity, yet central questions persist regarding their mechanism of activation. Here, by successfully mapping topoisomerase I–DNA covalent complexes genome-wide, we find that most, if not all, acutely activated enhancers, including those induced by 17β-estradiol, dihydrotestosterone, tumor necrosis factor alpha and neuronal depolarization, are hotspots for topoisomerase I–DNA covalent complexes, functioning as epigenomic signatures read by the classic DNA damage sensor protein, Ku70. Ku70 in turn nucleates a heterochromatin protein 1 gamma (HP1γ)–mediator subunit Med26 complex to facilitate acute, but not chronic, transcriptional activation programs. Together, our data uncover a broad, unappreciated transcriptional code, required for most, if not all, acute signal-dependent enhancer activation events in both mitotic and postmitotic cells.

Forty years after the initial discovery of transcriptional enhancers 1,2 , gene control by enhancers is considered one of the dominant molecular mechanisms underlying cell-type and signal-specific transcriptional diversity in metazoans. Thus, in addition to their critical functions in cell-type determination and differentiation, transcriptional enhancers serve as the predominant regulators of the precise, rapidly altered patterns of gene regulation in response to diverse acute signaling pathways [3][4][5][6][7] , reflecting the preferential binding of many of these signal-dependent transcription factors to enhancers, rather than promoters. Interestingly, signal-dependent enhancer activation generally precedes activation of the cognate promoter 8,9 , and robust, signal-regulated enhancers exhibit increased eRNA transcription and concomitant condensation of RNA-dependent ribonucleoprotein (RNP) complexes upon acute stimulation, which mature to a more gel-like state upon chronic stimulation [10][11][12][13] . However, whether there is a universal strategy required for signal-dependent enhancer activation is largely unknown. Intriguingly, recent reports have revealed that active enhancers are hotspots for DNA single-strand breaks, or DNA nicks 14,15 , which are the most common form of DNA damage. Yet the role of these putative DNA nicks in the signal-dependent transcriptional activation of functional enhancers remains unclear.
In this article, we interrogate the potential role of the 'nicked' covalent intermediate that topoisomerase I (Top1) makes with genomic DNA (TOP1cc) in response to acute activating signals. Although Top1 has emerged as a critical component of the transcriptional machinery at promoters, few experiments have examined its role at robustly activated enhancers, because mapping the genomic landscape of TOP1cc in mammalian cells has remained a major challenge. First, TOP1cc is transient and not readily detectable 16 . Second, traditional chromatin immunoprecipitation (ChIP) assays induce very high artificial formation of poly ADP-ribose 17 , which is involved
Article https://doi.org/10.1038/s41594-022-00883-8
This approach affords a dramatic improvement in detection of the specific TOP1cc actions compared with previous TOP1-seq approaches 30 , which, for example, in human colon cancer HCT116 cells, detected only 508 peaks of Top1-dependent single-stranded DNA nicks in the genome. We also compared TOP1cc signals with traditional ChIP-seq data targeting Top1 in human prostate cancer LNCAP cells (Extended Data Fig. 1a) and found that, although 62.3% of TOP1cc-enriched regions colocalized with Top1-enriched regions (Extended Data Fig. 1b-d), TOP1cc could be detected at only around 14.1% of the Top1-enriched regions detected with ChIP-seq 29 , which might reflect assay noise due to the formaldehyde fixation 17 .
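The two percentages in the comparison above differ because each uses a different denominator: one counts TOP1cc regions, the other counts Top1 ChIP-seq regions. A minimal Python sketch of such a reciprocal overlap calculation is given below. It assumes peaks are represented as half-open (chrom, start, end) intervals; the function name `fraction_overlapping` and all coordinates are hypothetical toy values, not data or code from this study.

```python
def fraction_overlapping(query, reference):
    """Fraction of query intervals overlapping at least one reference interval.

    Intervals are (chrom, start, end) tuples, treated as half-open [start, end).
    Brute-force comparison; adequate for a toy example only.
    """
    if not query:
        return 0.0
    hits = 0
    for chrom, start, end in query:
        # Two intervals on the same chromosome overlap if each starts
        # before the other ends.
        if any(c == chrom and s < end and start < e for c, s, e in reference):
            hits += 1
    return hits / len(query)


# Hypothetical toy peak sets (not real coordinates).
top1cc_peaks = [("chr1", 100, 200), ("chr1", 500, 600), ("chr2", 50, 150)]
top1_chip_peaks = [
    ("chr1", 150, 400), ("chr1", 900, 1000), ("chr2", 0, 60),
    ("chr2", 300, 400), ("chr3", 10, 20), ("chr3", 100, 200),
]

# Same overlapping pairs, different denominators.
frac_cc_in_chip = fraction_overlapping(top1cc_peaks, top1_chip_peaks)
frac_chip_with_cc = fraction_overlapping(top1_chip_peaks, top1cc_peaks)
print(f"TOP1cc peaks overlapping Top1 ChIP peaks: {frac_cc_in_chip:.1%}")
print(f"Top1 ChIP peaks overlapping TOP1cc peaks: {frac_chip_with_cc:.1%}")
```

In this toy case the larger ChIP-seq peak set yields the smaller fraction, which is the same qualitative pattern as the 62.3% versus 14.1% comparison.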

Top1 is required for acute transcriptional activation
In exploring the function of TOP1cc at active enhancers, we found that bromodomain-containing protein 4 (BRD4), a well-known transcriptional activator 31 , was correlated strongly and positively with TOP1cc at the enhancers in E 2 -treated MCF7 cells (Extended Data Fig. 2a,b). We thus employed hypergeometric optimization of motif enrichment (HOMER) 4 to investigate motifs enriched at enhancers marked with TOP1cc in E 2 -treated MCF7 cells, finding that estrogen response elements (ERE), Forkhead Box A1 (FOXA1) and Grainyhead-like protein 2 (GRHL2) motifs were highly represented (Fig. 2a), all of which are enriched in ERα transcriptionally activated enhancers 32,33 . Strikingly, ERα was detected in 68.5% of TOP1cc-enriched enhancers in E 2 -treated MCF7 cells by ChIP-seq (Fig. 2b and Extended Data Fig. 2c), and TOP1cc signals were increased (Extended Data Fig. 2d) following the induction of ERα signals (Extended Data Fig. 2e) at TOP1cc-enriched, robustly active MegaTrans enhancers, indicating that the recruitment of ERα to these signal-dependent active enhancers might promote the formation of TOP1cc.
Further, we found that knockdown of Top1 (Extended Data Fig. 3a) impaired the E 2 -dependent eRNA transcription at TOP1cc-enriched, robustly active MegaTrans enhancers (Extended Data Fig. 3b,c). In contrast, enhancers at which TOP1cc was not enriched were minimally impaired by Top1 knockdown (Extended Data Fig. 3d). Validating the importance of TOP1cc for signal-dependent enhancer activation, we found that activation of the robustly E 2 -activated Tff1 and Greb1 enhancers could be rescued in cells expressing wt Top1, but not in cells expressing the catalytically dead Top1-Y723F (Fig. 2c).
We next asked whether TOP1cc is also required for the chronic activation of the ERα-marked active enhancers, characterized by similar ERα binding but less robust eRNA transcription 13 . We found that TOP1cc was, at best, minimally detected at the MegaTrans enhancers following chronic (around 14 h) E 2 stimulation (Fig. 2d and Extended Data Fig. 4a-d). In concert with these findings, the acute E 2 -dependent activation of the cognate target genes of the MegaTrans enhancers was inhibited by Top1 knockdown (Fig. 2e), while the activation of these cognate target genes following chronic E 2 treatment was not impaired (Fig. 2f). Lastly, given the role of eRNA in the assembly and physical properties of MegaTrans enhancer condensates 12,13 , we compared eRNA levels and found that the eRNA expression level at the TOP1cc-enriched, acutely activated MegaTrans enhancers was much higher than that at non-TOP1cc-enriched MegaTrans enhancers (Extended Data Fig. 4e,f).
in the formation and release of TOP1cc 16,18 . Third, current assays for DNA-protein intermediates, such as immunocomplex of enzyme and trapped in agarose DNA immunostaining assays, are low throughput 19,20 , while chaotropic salt isolation 21 , fluorescein isothiocyanate labeling 22 and rapid approach to DNA adduct recovery 23 approaches, and even the recently developed repair-seq 14 , synthesis-associated with repair sequencing and S1-END-seq 15 , have failed to identify the DNA-bound sites of the specific proteins interrogated. Therefore, the function of TOP1cc in translating activation signals into context-specific responses remains poorly understood.
Here, by taking advantage of a specific antibody recognizing TOP1cc, in conjunction with CUT&RUN assays, we unexpectedly find that TOP1cc serves as an epigenomic signature recognized by the DNA damage sensor protein Ku70, which has been classically linked to the double-strand DNA damage repair pathway. We provide evidence that Ku70 is an invariant requirement for acute signal-dependent enhancer activation, acting to tether HP1γ-Med26 to facilitate the phosphorylation of serine 5 at RNA polymerase II (Pol II) to promote the transcriptional elongation of the enhancers. Remarkably, we find that not only 17β-estradiol (E 2 ) but also dihydrotestosterone (DHT), tumor necrosis factor alpha (TNFα) and even neuronal depolarization also require the actions of TOP1cc and the reading of the TOP1cc by Ku70 for acute enhancer activation. The discovery of TOP1cc as an epigenomic signature in a broad swath of acute signal-dependent enhancer activation events uncovers a conceptually new dimension in the regulation of enhancer function in mammalian cells.

TOP1cc is detected at signal-dependent active enhancers
An ideal system to explore any potential link between the actions of Top1 at enhancers and signal-dependent enhancer activation is afforded by the E 2 -induced rapid assembly of a megadalton-sized enhancer complex (MegaTrans complex) on robustly activated estrogen receptor alpha (ERα)-bound MegaTrans enhancers, which control E 2 -regulated transcriptional programs [24][25][26] . To avoid formaldehyde fixation-induced high artificial formation of poly ADP-ribose 17 , which is involved in the formation and release of TOP1cc 18 , we employed CUT&RUN assays to uncover the genomic landscape of TOP1cc by utilizing a monoclonal antibody with specificity for TOP1cc. We found 18,308 TOP1cc-enriched regions in E 2 (1 h)-treated human breast cancer MCF7 cells, of which 55.9% were localized to intronic and intergenic regions and around 31.6% were localized at promoters (Fig. 1a). Indeed, TOP1cc was associated strongly with active enhancers, as measured by H3K27Ac and H3K4me2, but not with condensed chromatin, as identified by H3K9me3 (Fig. 1b,c). To verify the authenticity of the TOP1cc detected in the genome, the requirement for enzymatically active Top1 was tested by employing CRISPR interference (CRISPRi; dCas9 fused with KRAB protein) 27 to suppress the endogenous Top1 gene promoter in MCF7 cells expressing tetracycline-inducible wild type (wt) or the catalytically dead mutant (Y723F) 16,28,29 Top1 (Fig. 1d). Our results showed that the appearance of TOP1cc was strikingly diminished in Top1-Y723F-expressing cells (Fig. 1e), confirming the authenticity of the TOP1cc loci detected in the genome.
shows the genome-wide distribution of TOP1cc CUT&RUN signals in E 2 -treated MCF7 cells.
b, Numbers of TOP1cc-enriched regions at ATAC-seq peaks, ChIP-seq peaks of enhancer-related marks (H3K4me2 and H3K27Ac), additional chromatin marks at accessible regions (H3K4me3, H4K16Ac, H2A.Z, SMC1, RAD21 and CTCF), chromatin silencing marks (H4K20me1, H3K27me3 and H3K9me3) and 179,220 randomly selected genomic regions in E 2 -treated MCF7 cells. c, Genomic browser images show the TOP1cc CUT&RUN signals, Top1 and H3K27Ac ChIP-seq signals at selected gene loci. IgG and H3K27me3 CUT&RUN signals serve as the experimental controls for the CUT&RUN assays. Enhancers are highlighted with light-brown boxes. d, Western blots show the expression of BLRP-tagged wt and Y723F enzymatically dead mutant Top1 in MCF7 cells. Endogenous Top1 was inhibited with dCas9-KRAB, and wt and Y723F enzymatically dead mutant Top1 were expressed using DOX; three biological replicates. e, TOP1cc signals (n = 18,308) in E 2 -treated BLRP-tagged wt- and enzymatically dead mutant Top1-Y723F-expressing MCF7 cells detected by CUT&RUN assays are shown with heatmaps. An additional 5 kb from the center of the peaks is shown, and the color scale shows the normalized tag numbers. Center lines show the medians, box limits indicate the 25th and 75th percentiles and whiskers extend 1.5× the interquartile range from the 25th and 75th percentiles. P value generated from unpaired two-tailed t-test denotes statistical differences between wt and Y723F conditions. Uncropped images for d and data for graphs in b are available as Source data.
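The boxplot conventions stated in this legend (median center line, 25th and 75th percentile box limits, whiskers at 1.5× the interquartile range) can be made concrete with a short Python sketch. Two choices here are assumptions, since the plotting software and settings are not stated in this section: quartiles use the "inclusive" interpolation method, and whiskers are drawn Tukey-style to the most extreme data point lying inside the 1.5× interquartile range fences. The function name `box_stats` and the input values are hypothetical.

```python
from statistics import quantiles


def box_stats(values):
    """Summary statistics for a boxplot with 1.5x IQR whiskers.

    Assumptions: 'inclusive' quartile interpolation and Tukey-style
    whiskers ending at the most extreme point inside the fences.
    """
    data = sorted(values)
    q1, median, q3 = quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    inside = [v for v in data if lower_fence <= v <= upper_fence]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "whisker_low": inside[0],
        "whisker_high": inside[-1],
        "outliers": [v for v in data if v < lower_fence or v > upper_fence],
    }


# Hypothetical values standing in for normalized tag counts.
toy_values = [1, 2, 3, 4, 5, 6, 7, 8, 9, 30]
stats = box_stats(toy_values)
print(stats)
```

With these toy values the point at 30 falls outside the upper fence and would be drawn as an outlier rather than extending the whisker.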

Ku70 is tethered to TOP1cc-enriched active enhancers
To explore the mechanisms underlying TOP1cc-mediated transcriptional activation, proteins interacting with Top1 were identified by reversible crosslink immunoprecipitation (ReCLIP) 34 and mass spectrometry (MS). We found that the components of the nonhomologous end joining (NHEJ) pathway such as catalytic subunit of DNA protein kinase (DNA-PKcs), paralogues of XRCC4 and XLF (PAXX), Ku70 and Ku80 were the main interactants that increased in E 2 -treated MCF7 cells ( Fig. 3a and Supplementary Tables 1-3), and the reciprocal immunoprecipitation (IP) confirmed that these components interact with Top1 ( Fig. 3b).
We then investigated the genomic landscape of DNA-PKcs, Ku70 and Ku80 by ChIP-seq experiments. Both DNA-PKcs and Ku70 were induced at TOP1cc-enriched enhancers in response to E 2 treatment (Fig. 3c). Furthermore, the recruitment of Ku70 was correlated positively with TOP1cc at the TOP1cc-enriched MegaTrans enhancers (Fig. 3d). We next tested whether TOP1cc could function as a molecular signature that is 'read' by Ku70, finding that both the Top1-dependent DNA nicking (Top1-DN) activity, which is abolished by the Top1-Y723F mutation, and the presence of DNA were required for the interaction between Top1 and Ku70 by IP (Fig. 3e,f), suggesting that Ku70 is recruited to TOP1cc-enriched genomic regions resulting from Top1-induced single-strand DNA nicking at enhancers upon acute signal stimulation.
In contrast, Ku80, which is required for double-strand DNA damage repair by forming a heterodimer with Ku70 (ref. 35 ), was not at all, or only minimally, increased following the induction of TOP1cc signals at these TOP1cc-enriched MegaTrans enhancers (Fig. 3g,h). Interestingly, in genomic regions that were cobound with Ku70 and Ku80, and without any transcriptional activation as indicated by diminished RNA Pol II induction upon E 2 stimulation, Ku70 was not induced; in contrast, in regions cobound with Ku70, Ku80 and exhibiting Pol II induction, Ku70 was induced upon acute E 2 stimulation (Fig. 3i). Finally, while the DNA damage sensor proteins including DNA-PKcs and Ku70 were required for the transcriptional activation of E 2 -regulated enhancers, such as the Greb1 and Tff1 enhancers, knockdown of DNA damage repair proteins, such as XRCC4, was not required (Fig. 3j and Extended Data Fig. 5), confirming the role of the classic DNA damage sensor protein Ku70, in TOP1cc-induced acute transcriptional regulation at signal-dependent active enhancers.

HP1γ and Med26 are recruited to facilitate transcription
To explore how the classic DNA damage sensor protein 36 , Ku70, can induce transcriptional activation at enhancers, and noting that HP1γ, one of the main factors pulled down by TOP1cc, has been reported to be tethered to euchromatin by Ku70 (ref. 37 ), we tested the possibility that Ku70 functions to assemble the phosphorylated HP1γ on TOP1cc-enriched enhancers. Strikingly, our data indicated that HP1γ localization was highly correlated with the genomic landscape of TOP1cc and Ku70 (Fig. 4a). Moreover, HP1γ was induced at Ku70 and TOP1cc-enriched MegaTrans enhancers upon E 2 treatment, but not at other active enhancers (Fig. 4b). The interactions of HP1γ with Ku70 and Top1 were detected upon E 2 treatment by IP (Fig. 4c). Knockdown of Ku70 greatly decreased the enrichment of HP1γ at the TOP1cc-enriched MegaTrans enhancers (Fig. 4d). Consistently, knockdown of Top1 decreased the enrichment of HP1γ at those enhancers at which TOP1cc and HP1γ overlapped in ChIP-seq experiments (Fig. 4e), as exemplified by the Tff1 enhancer (Fig. 4f). In contrast, knockdown of Top1 had little or no effect on the enrichment of HP1γ at the nonoverlapping regions (Fig. 4g).
Indeed, minimal HP1γ binding was observed in regions harboring heterochromatin-associated lysine 9 trimethylated histone H3 (H3K9me3), which is the main marker directing the localization of HP1 family members in the genome (Extended Data Fig. 6a,b). The correlation between H3K9me3 and HP1γ at heterochromatin is high (Extended Data Fig. 6c), but the genome-wide correlation between H3K9me3 and HP1γ is quite low (Extended Data Fig. 6d), while the correlation between TOP1cc and HP1γ is high (Extended Data Fig. 6e), indicating a role for TOP1cc in tethering HP1γ to euchromatic regions. Then, to test whether phosphorylation of HP1γ 37 was important for the enrichment of HP1γ at enhancers, rescue experiments were performed. We found that mutation of Ser83 to alanine (S83A), which prevents the phosphorylation of HP1γ at Ser83, decreased HP1γ recruitment to enhancers. In contrast, mutation of Ser83 to the phosphomimetic aspartate (S83D) resulted in increased HP1γ recruitment to the enhancers (Fig. 4h).

Fig. 2 | TOP1cc is required for acute signal-dependent enhancer activation. a, Motifs for sequences within 200 bp of the summit of the TOP1cc peaks at the enhancers in E 2 -treated MCF7 cells are presented by bar graph. P values generated from two-tailed binomial test denote statistical differences between the target and background sequences for enrichment. b, ERα ChIP-seq signals and TOP1cc CUT&RUN signals at the enhancers in MCF7 cells are shown with heatmaps. An additional 3 kb from the center of the peaks is shown. c, RT-qPCR results show that the transcriptional activation of Tff1 and Greb1 enhancers could be rescued by DOX-induced expression of wt Top1, but not the enzymatically dead mutant Top1-Y723F. d, Heatmaps show TOP1cc CUT&RUN signals at MegaTrans enhancers upon Veh (-E 2 ), acute (1 h) and chronic (around 14 h) E 2 treatment. An additional 5 kb from the center of the peaks is shown, and the color scale shows the normalized tag numbers. e, RT-PCR shows that Top1 is required for the acute activation of E 2 -induced transcriptional programs. f, RT-qPCR results show that the chronic activation of the cognate target genes of MegaTrans enhancers is not Top1-dependent. For c, e and f, data are shown as mean ± s.d. (n = 3 (three independent biological replicates), two-tailed Student's t-test); NS, not statistically significant. Raw data for graphs in c, e and f are available as Source data.
ChIP-seq tags (log 2 ) at TOP1cc-enriched MegaTrans enhancers (n = 481). i, Violin plots show normalized Ku70 and Ku80 ChIP-seq tags (log 2 ) at Pol II unchanged genomic regions (n = 3,466) and E 2 -dependent Pol II increased genomic regions (n = 3,907). j, RT-qPCR results show that the E 2 -dependent transcriptional activation of Greb1 and Tff1 enhancer RNAs is impaired. Data are shown as mean ± s.d. (n = 3 (three independent biological replicates), two-tailed Student's t-test).
For h and i, center lines show the medians, box limits indicate the 25th and 75th percentiles and whiskers extend 1.5× the interquartile range from the 25th and 75th percentiles. P values generated from unpaired two-tailed t-test denote statistical differences between -E 2 and +E 2 conditions, and the median value of normalized Pol II ChIP-seq tags (log 2 ) are listed under the boxplots. Uncropped images for b, e and f and data for graphs in j are available as Source data.
Assessing the function of HP1γ in mediating transcriptional activation at enhancers, we found that, whereas Pol II exhibited only modest changes at other active enhancers upon depletion of HP1γ, Pol II and other coactivators including BRD4, CBP and GATA3 were decreased dramatically at TOP1cc-enriched MegaTrans enhancers (Fig. 4i and Extended Data Fig. 7). In exploring why HP1γ, a protein primarily associated with transcriptional silencing, was involved in transcriptional activation at enhancers, we were cognizant of the fact that an alternative mediator subunit, Med26, harbors a canonical PXVXL motif at the extreme C-terminus that is sufficient to bind HP1γ 38 . Accordingly, we tested this possibility experimentally, finding that Med26 was highly recruited to the MegaTrans enhancers in response

General requirement for TOP1cc-Ku70-HP1γ for transcription
Based on the discovery of TOP1cc-Ku70-HP1γ in E 2 -dependent enhancer activation, we investigated whether this strategy might represent a common or even general mechanism underlying acute signal-induced enhancer activation. We tested whether TOP1cc-Ku70-HP1γ are employed in other acute signal-induced transcriptional activation events, consequent to the induction of TOP1cc at their corresponding enhancers. Strikingly, TOP1cc-Ku70-HP1γ were present at the acute TNFα-induced p65-bound active proinflammatory enhancers ( Fig. 5a) and acute DHT-induced androgen receptor (AR)-bound active enhancers (Fig. 5b), wherein induction of the target genes mediated by the acute treatment of the TNFα or DHT was, in each case, diminished by the knockdown of the Top1, Ku70 or Hp1γ (Fig. 5c).
To further generalize the importance of TOP1cc, we assessed enhancer activation during the depolarization of primary murine neuronal cultures, using a standard KCl-mediated depolarization protocol to mimic neuronal activity stimulation (Extended Data Fig. 10a) 41 . We identified 1,344 active enhancers in primary cortical neurons, of which 737 exhibited induction of TOP1cc signals upon KCl activation (Fig. 5d). Specifically, binding motifs for transcription factors such as MEF2a, MEF2b, MEF2c and MEF2d, which are known to be crucial for neuronal activity-regulated gene transcription 42 , were highly represented at TOP1cc-enriched enhancers (Extended Data Fig. 10b), but not at non-TOP1cc enhancers (Extended Data Fig. 10c). TOP1cc at the enhancers adjacent to neuronal activity-regulated genes, such as Npas4 and Fos, was upregulated following KCl treatment in primary cortical neuronal cultures (Fig. 5e). Ku70-HP1γ was also induced at these TOP1cc-enriched neuronal enhancers upon KCl-mediated depolarization (Extended Data Fig. 10d).

TOP1cc serves as a universal epigenomic signature
Because most DNA-binding factors activated in response to diverse signals bind primarily to cognate DNA sites in enhancers, rather than promoters, activation of enhancers serves as the main mechanism regulating acute signal-dependent modulation of transcriptional programs in virtually all cell types. Here, we address whether there is an as yet unappreciated molecular strategy that underlies most, if not all, signal-dependent enhancer activation events, despite the diversity of primary sequence, cell type and activating signal. Specifically, by employing a specific antibody recognizing Top1-DNA transient intermediate with CUT&RUN assays to detect TOP1cc, we established a powerful technology that permitted us to provide evidence that TOP1cc is required to achieve robust enhancer activation following acute signals in all the systems that we have examined. We provide evidence that TOP1cc and the subsequent downstream events are required for diverse types of signal-induced enhancer activation, including not only estrogen, androgen and TNFα-activated enhancers (regulating proinflammatory genes), but also for neuronal depolarization-induced enhancer activation.
The role of Top1 in mediating acute, but not chronic, signal-dependent enhancer activation correlated with the distinct physicochemical properties of condensates established on acutely versus chronically stimulated enhancers. That is, whereas the E 2 -dependent assembly of RNP condensates on acutely stimulated enhancers displays properties of liquid-liquid phase separation, chronically activated enhancers show progressive maturation of RNPs to a distinct, perhaps more gel-like, state 13 . Importantly, from our studies, eRNA appears to be capable not only of promoting liquid-liquid phase separation but also of augmenting the formation of TOP1cc. These two effects are probably interrelated, consistent with the ability of condensates to increase enzymatic activity 11 .
Notably, the assay we describe here, utilizing a monoclonal antibody for Top1 covalently bound to DNA, distinguishes Top1-mediated nicks from single-strand DNA nicks associated with long patch DNA damage repair on neuronal specific enhancers 15 , which are also enriched at active enhancers, but are not linked with transcriptional activation. Collectively, we uncover here an unappreciated mechanism underlying a broad range of signal-dependent enhancer activation events, wherein signal-dependent binding of regulated transcription factors to responsive enhancers elicits TOP1cc as an epigenomic signature for subsequent transcriptional activation.

Ku70 functions as the 'reader' of the Top1-DN
Remarkably, the acute signal-induced TOP1cc licenses the recruitment of Ku70, which functions as a 'reader' of TOP1cc. It has long been assumed that the key to Top1 actions is to relieve torsional stress arising from robust transcription at coding gene bodies and promoters. However, we note that dynamic supercoiled DNA has not been detected at regulatory enhancers 43 , and that the stabilized TOP1cc generated by Top1 inhibitors, which are well known to inhibit the relief of torsional stress 44 , can induce the transcription of nascent RNA 45 . We find here that TOP1cc can be co-opted as an epigenetic signature for the recruitment of Ku70, hinting at a noncanonical role for TOP1cc in the mobilization of transcription factors rather than relief of torsional stress per se.
Indeed, in the traditional model for NHEJ, the initial recognition and binding of the Ku70-Ku80 heterodimer to double-strand breaks is followed by the recruitment of DNA-PKcs and formation of the DNA-PK holoenzyme, resulting in the formation of a DNA-PK dimer mediated by the conserved C-terminal helix of Ku80. This DNA-PK dimer then acts as a platform for binding of other proteins and brings the broken DNA ends close together, allowing for completion of the process of double-strand break repair through NHEJ 46,47 . Here, we find that Top1-induced single-strand DNA nicks do not require the presence of Ku80, and that Ku80 binding is not particularly induced at the enhancers upon acute signal-dependent stimulation, which is consistent with previous reports showing that Ku80 is not present in Ku70-HP1γ complexes 37 . Thus, in addition to its evolutionarily important functions in double-strand DNA damage repair events as a heterodimer with Ku80, we propose that Ku70 has acquired an independent function as a transcriptional coactivator in signal-induced enhancer activation, serving as a 'reader' of TOP1cc.

Ku70 facilitates the recruitment of HP1γ-Med26
HP1γ is a member of the heterochromatin protein 1 family that reads H3K9 methylation via a conserved chromodomain, but can associate with active gene promoters 48 , and interacts with Med26 through
Mediator has been demonstrated to be required for robust transcriptional activation 50,51 , and our data strongly suggest that Med26 at acutely activated enhancers facilitates the transition of Pol II from initiation condensates with unphosphorylated Ser2 to RNA processing or elongation condensates with phosphorylated Ser2. The functional importance of Med26 proposed in this article is consistent with a recent report showing that the knock-out of Med26 affects a larger gene expression program than knock-out of any other mediator subunit evaluated, including Med1, in mammalian cells 52 .
Taken together, we conclude that TOP1cc serves in effect as an epigenomic signature recognized by Ku70, leading to the nucleation of a HP1γ-Med26 complex required for robust transcriptional activation at acutely activated, signal-dependent enhancers (Fig. 5f). Strikingly, TOP1cc-dependent Ku70-HP1γ recruitment emerges from our studies as a general requirement for acute signal-dependent enhancer activation events in mammalian cells. This is consistent with a model in which the interaction of HP1γ with Med26 probably augments the function of Med26 in activating the elongation complex and increasing eRNA transcription and transcription factor condensation at acutely activated enhancers.

Online content
Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at https://doi.org/10.1038/s41594-022-00883-8.
Before plating neurons, cell culture plates were washed three times with sterile distilled water. Neurons were grown in neuronal medium consisting of Neurobasal medium containing B27 supplement (2%; Thermo Fisher), penicillin-streptomycin (50 g ml -1 ) and Glutamax (1×; Thermo Fisher). Neurons were subsequently plated and placed in a cell culture incubator that maintained a temperature of 37 °C and a CO 2 concentration of 5%; 10 min after plating neurons, the medium was aspirated completely from cells and replaced with fresh warm neuronal medium. Neurons were grown in vitro until day 7.

Drug treatment
For hormone treatments, cells were incubated at 37 °C and 5% CO 2 for at least 3 days in phenol red-free DMEM (GIBCO/Invitrogen) supplemented with 5% charcoal dextran-stripped FBS (GIBCO/Invitrogen). For MCF7 cells, 17-β-estradiol (Steraloids, Inc.) was added to a final concentration of 100 nM. TNFα (R&D systems) was added to a final concentration of 20 ng ml -1 . For LNCAP cells, 5α-DHT (Sigma) was added to a final concentration of 100 nM. The ethanol vehicle control was 0.05% in all samples. For the acute activation, cells were treated with drugs (ethanol or E 2 or TNFα or DHT) for 1 h for ChIP-seq or PRO-seq assays. To detect the changes of messenger RNA (mRNA) expression levels for the acute activation, cells were treated with drugs (ethanol or E 2 or TNFα or DHT) for 4 h and harvested for RNA isolation and subsequent RT-qPCR experiments. For the chronic activation, cells were treated with drugs (ethanol or E 2 ) for 14 h and harvested for RNA isolation and subsequent RT-qPCR experiments.
For ChIP-seq experiments, mouse cortical neurons were plated at an approximate density of 2 × 10 6 on 35-mm dishes. Neurons were plated in 2 ml neuronal medium. A 1 ml aliquot of the medium was replaced with 1 ml fresh warm medium on days 3 and 6. Before KCl depolarization, neurons were silenced with 1 μM tetrodotoxin (TTX; Fisher) on day 6. Neurons were subsequently stimulated on day 7 by adding warmed KCl depolarization buffer (170 mM KCl, 2 mM CaCl 2 , 1 mM MgCl 2 and 10 mM 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES)) directly to the neuronal culture to a final concentration of 31% in the neuronal culture medium within the culture plate or well for the indicated time.
11058-021), and incubated for 6 h, and then changed to phenol red-free medium. For quantitative PCR (qPCR) or western blots, cells were transfected in six-well plates in regular DMEM without antibiotics and 4 μl of 20 μM siRNAs were transfected with Lipofectamine 2000 reagent (Invitrogen catalog no. 11668-019). In both conditions, cells were transfected 2 days later with siRNA again in phenol red-free medium. At 3 days following transfection, cells were treated with ethanol or E 2 for 4 h and harvested for RNA isolation for acute activation, treated with ethanol or E 2 for 14 h and harvested for RNA isolation for chronic activation or treated with ethanol or E 2 for 1 h for PRO-seq assay.
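Returning to the KCl depolarization step described above, the arithmetic behind adding a 170 mM KCl buffer "to a final concentration of 31%" can be sketched as follows. This is a worked example under one stated assumption: that 31% means the buffer makes up 31% of the final culture volume (v/v). The function names are hypothetical, and the calculation gives only the KCl contributed by the buffer, not any KCl already present in the culture medium.

```python
def buffer_volume_ml(medium_ml, buffer_fraction):
    """Buffer volume to add so that it forms `buffer_fraction` of the final volume.

    Assumes buffer_fraction is volume/volume of the FINAL mixture.
    """
    if not 0 <= buffer_fraction < 1:
        raise ValueError("buffer_fraction must be in [0, 1)")
    return medium_ml * buffer_fraction / (1 - buffer_fraction)


def added_concentration_mM(stock_mM, buffer_fraction):
    """Concentration contributed by the buffer after dilution into the culture."""
    return stock_mM * buffer_fraction


medium_ml = 2.0          # culture volume per 35-mm dish, as stated in the text
buffer_fraction = 0.31   # assumed to be v/v of the final volume
kcl_stock_mM = 170.0     # KCl in the depolarization buffer, as stated in the text

buffer_ml = buffer_volume_ml(medium_ml, buffer_fraction)
kcl_added_mM = added_concentration_mM(kcl_stock_mM, buffer_fraction)
print(f"Add about {buffer_ml:.2f} ml buffer to {medium_ml:.1f} ml medium")
print(f"KCl contributed by the buffer: about {kcl_added_mM:.1f} mM")
```

Under this assumption, roughly 0.9 ml of buffer is added per 2 ml of culture, contributing about 52.7 mM KCl.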

Short hairpin RNA (shRNA) lentivirus package and infection
pLKO lentiviral shRNA constructs and control shRNA constructs were purchased from Addgene. The sequences of the shRNAs used in this article are shown in Supplementary Table 4. Knockdown experiments with lentiviral shRNAs were conducted according to the standard lentivirus packaging and transduction protocols from Addgene. pLKO-based lentiviral shRNA plasmids were cotransfected with packaging plasmids (psPAX2 and pMD2.G) into 293T cells. Lentiviruses were harvested, concentrated and used for MCF7 cell infection. Stable knockdown MCF7 cells were selected with 1 μg ml -1 puromycin and collected for experiments within 5 days. Before collection, the cells were grown for 3 days in phenol red-free DMEM (GIBCO/Invitrogen) supplemented with 5% charcoal dextran-stripped FBS (GIBCO/Invitrogen) and 0.5 μg ml -1 puromycin for continued selection to achieve better knockdown.

RNA isolation and RT-qPCR
RNA was isolated using the Quick-RNA Miniprep Kit (Zymo Research, catalog no. R1054), and genomic DNA was removed from 1 μg of total RNA with the DNA-free DNA removal kit (Invitrogen, catalog no. AM1609). Total RNA was then reverse transcribed using the SuperScript III first-strand synthesis system with random hexamers (Invitrogen, catalog no. 18080-051). qPCR was performed with StepOne Plus (Applied Biosystems). Primer sequences used for the different gene targets are shown in Supplementary Table 4. All primers were checked on a standard curve, and efficiencies were verified to be near 100%. β-Actin mRNA or glyceraldehyde-3-phosphate dehydrogenase (GAPDH) mRNA was used as the internal control. Relative mRNA levels were calculated by the ΔΔCt method with the vehicle (ethanol) used as the calibrator. The experiments were repeated at least three times, and one representative plot is shown in all figures; most P values were obtained using a two-tailed Student's t-test.
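The ΔΔCt calculation described above can be sketched as follows. This is a minimal illustration that assumes primer efficiencies of exactly 100% (a doubling per cycle); the function name and the Ct values are hypothetical and are not taken from the study.

```python
def relative_expression(ct_target_treated, ct_ref_treated,
                        ct_target_vehicle, ct_ref_vehicle):
    """Relative mRNA level by the ddCt method (assumes 100% primer efficiency)."""
    # dCt: target gene Ct normalized to the internal control (e.g. actin or GAPDH)
    d_ct_treated = ct_target_treated - ct_ref_treated
    d_ct_vehicle = ct_target_vehicle - ct_ref_vehicle
    # ddCt: treated sample relative to the vehicle (ethanol) calibrator
    dd_ct = d_ct_treated - d_ct_vehicle
    return 2.0 ** (-dd_ct)


# Hypothetical Ct values, for illustration only
fold = relative_expression(22.0, 16.0, 25.0, 16.0)
print(fold)  # 8.0
```

A target that crosses threshold three cycles earlier in the treated sample, with an unchanged internal control, corresponds to an eightfold induction.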

MCF7 Tet-On stable cell line construction
For the BLRP-2XHA-Top1 wt or Y723F mutant Top1 cell lines, the MCF7 Tet-On Advanced cell line (catalog no. 631153) was purchased from Takara Clontech. pRev-TRE-Flag-2XHA-Top1 or pRev-TRE-Flag-2XHA-Top1-Y723F was transfected into MCF7 Tet-On cells with the Tet-free serum supplied. At 2 days after transfection, cells were selected with hygromycin for 2 weeks. Colonies were picked and verified by western blot after doxycycline (DOX) induction. After titration, 5 μg ml⁻¹ DOX was chosen to achieve a level of HA-tagged Top1 expression similar to that of endogenous Top1.

Immunoblotting and coimmunoprecipitations
Cells were washed with ice-cold PBS and harvested in cold PBS. For the preparation of whole cell extracts, pellets were resuspended in lysis buffer containing 20 mM Tris-Cl (pH 8.0), 137 mM NaCl, 10% glycerol, 1% Nonidet P-40 and a mixture of protease inhibitors. Samples were sonicated using a Bioruptor (Diagenode) for 16 min at medium power, with an interval of 30 s between pulses. Following sonication, samples were centrifuged for 2 × 5 min at 21,000g. After preclearing, IP was performed overnight at 4 °C using the indicated antibodies and protein G-Sepharose. IP was followed by five washes in 1 ml lysis buffer performed at 4 °C, and protein complexes were denatured in Laemmli sample buffer (2% SDS, 10% glycerol, 60 mM Tris-Cl (pH 6.8), 0.01% bromophenol blue, 100 mM dithiothreitol (DTT)) for 5 min at 95 °C and resolved on NuPAGE Novex 4-12% Bis-Tris Protein Gels (Invitrogen catalog no. NP0336PK2). After electrophoretic transfer, the PVDF membrane was blocked by incubation at room temperature for 1 h in Blocker Casein in TBS (Thermo Scientific, catalog no. 37532). Complexes were revealed with Clarity Western ECL substrate (Bio-Rad catalog no. 170-5061), as recommended by the manufacturer.

ReCLIP and MS
ReCLIP was performed using dithiobis(succinimidyl propionate) (DSP, Thermo Scientific) following previous reports with minor changes 34,54 . Briefly, 4 × 10⁹ MCF7 cells were incubated in PBS containing 0.6 mM DSP for 30 min at 4 °C with mild rotation. After removing PBS, the remaining DSP was quenched by incubating the cells in TBS (50 mM Tris-Cl pH 7.4, 150 mM NaCl, 1 mM EDTA) for 15 min on ice. The cell nuclei were isolated through hypotonic lysis, and then lysed in lysis buffer (50 mM Tris-Cl pH 7.5, 150 mM NaCl, 0.25% Triton X-100, 0.25% Na-deoxycholate, 0.05% SDS, 1 mM EDTA, 5% glycerol) containing 5 mM MgCl2, 300 mM NaCl and protease inhibitor cocktail (Roche). The nuclear extracts were then sonicated for 10 min, followed by rotation for 1 h at 4 °C and centrifugation at 20,000g for 10 min. For immunopurification of the TOP1 protein complexes, nuclear extracts were incubated overnight with 100 μl streptavidin beads (Thermo Fisher Scientific) at 4 °C. After binding of the protein complexes, the streptavidin beads were washed sequentially with TBS, lysis buffer, high-salt lysis buffer (500 mM NaCl) and TBS. Finally, the beads were resuspended in 100 μl 2× SDS-sample buffer containing β-mercaptoethanol and heated for 10 min at 95 °C to elute the bound proteins. Protein complexes pulled down with streptavidin beads from precleared cell nuclei were employed for MS analysis at the University of California San Diego (UCSD) molecular mass spectrometry facility. Briefly, protein samples were diluted in TNE (50 mM Tris pH 8.0, 100 mM NaCl, 1 mM EDTA) buffer. RapiGest SF reagent (Waters Corporation) was added to the mix to a final concentration of 0.1% and samples were boiled for 5 min. Tris (2-carboxyethyl) phosphine (TCEP) was added to 1 mM (final concentration) and the samples were incubated at 37 °C for 30 min.
Subsequently, the samples were carbamidomethylated with 0.5 mg ml⁻¹ of iodoacetamide for 30 min at 37 °C followed by neutralization with 2 mM TCEP (final concentration). Protein samples prepared as above were digested with trypsin (trypsin:protein ratio, 1:50) overnight at 37 °C. RapiGest was degraded and removed by treating the samples with 250 mM HCl at 37 °C for 1 h followed by centrifugation at 21,130g for 30 min at 4 °C. The soluble fraction was then added to a new tube and the peptides were extracted and desalted using C18 desalting columns (Thermo Scientific, catalog no. PI-87782). Peptides were quantified using BCA assay and a total of 1 μg of peptides was injected for LC-MS analysis.

Ultra-high-pressure liquid chromatography coupled with tandem MS
Trypsin-digested peptides were analyzed by ultra-high-pressure liquid chromatography coupled with tandem MS (LC-MS/MS) using nanospray ionization. The nanospray ionization experiments were performed using an Orbitrap Fusion Lumos hybrid mass spectrometer (Thermo) interfaced with nanoscale reversed-phase UPLC (Thermo Dionex UltiMate 3000 RSLCnano System) using a 25 cm, 75-micron ID glass capillary packed with 1.7-μm C18 (130) BEH™ beads (Waters Corporation). Peptides were eluted from the C18 column into the mass spectrometer using a linear gradient (5-80%) of acetonitrile (ACN) at a flow rate of 375 μl min⁻¹ for 1.5 h. The buffers used to create the ACN gradient were Buffer A (98% H2O, 2% ACN, 0.1% formic acid) and Buffer B (100% ACN, 0.1% formic acid). MS parameters were as follows: an MS1 survey scan using the orbitrap detector (mass range (m/z): 400-1,500 (using quadrupole isolation), 120,000 resolution setting, spray voltage of 2,200 V, ion transfer tube temperature of 275 °C, AGC target of 400,000 and maximum injection time of 50 ms) was followed by data-dependent scans (top speed for most intense ions, with charge state set to include only +2-5 ions, and 5 s exclusion time, while selecting ions with minimal intensities of 50,000), in which the collision event was carried out in the high-energy collision cell (higher energy collision dissociation energy of 30%), and the fragment masses were analyzed in an ion trap mass analyzer (with ion trap scan rate of turbo, first mass m/z was 100, AGC target of 5,000 and maximum injection time of 35 ms). Protein identification was carried out using Peaks Studio v.8.5 (Bioinformatics Solutions Inc.). Proteins meeting the following criteria were selected: peptide -log10(P) (P value generated from two-tailed t-test) ≥15, protein -log10(P) (P value generated from two-tailed t-test) ≥25, protein unique peptides ≥2, de novo ALC score ≥50% and false discovery rate < 1%. All the selected proteins are shown in Supplementary Tables 1-3.
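As an illustration of how the per-protein selection thresholds listed above combine, the sketch below applies them to a table of hits. The record layout, field names and example values are hypothetical; Peaks Studio exports may be organized differently, and the dataset-level false discovery rate cutoff is assumed to have been applied upstream rather than modeled here.

```python
from dataclasses import dataclass


@dataclass
class ProteinHit:
    name: str
    peptide_score: float        # peptide -log10(P), hypothetical per-protein summary
    protein_score: float        # protein -log10(P)
    unique_peptides: int
    de_novo_alc_percent: float  # de novo ALC score, in percent


def passes_thresholds(hit: ProteinHit) -> bool:
    """Apply the per-protein cutoffs stated in the text (FDR handled upstream)."""
    return (hit.peptide_score >= 15
            and hit.protein_score >= 25
            and hit.unique_peptides >= 2
            and hit.de_novo_alc_percent >= 50)


# Hypothetical records, for illustration only
hits = [
    ProteinHit("protein_A", 32.1, 110.5, 6, 78.0),
    ProteinHit("protein_B", 18.4, 22.0, 1, 55.0),
]
selected = [h.name for h in hits if passes_thresholds(h)]
print(selected)  # ['protein_A']
```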

CUT&RUN assays
CUT&RUN was performed using the CUTANA CUT&RUN Protocol (www.epicypher.com), which is an optimized version of that previously described 55,56 . For each sample, 5 × 10⁵ cells were immobilized onto Concanavalin-A beads (EpiCypher catalog no. 21-1401) and incubated overnight (4 °C with gentle rocking) with 0.5 μg of antibody. CUT&RUN-enriched DNA was purified and 10 ng was used to prepare sequencing libraries with the KAPA HTP/LTP Library Preparation Kits (Roche catalog no. 07961880001). Libraries were sequenced with an Illumina HiSeq 4000 or NovaSeq 6000 system according to the manufacturer's instructions. Paired-end FASTQ files were aligned to the hg38 or mm10 reference genome using Bowtie v.2. Only uniquely aligned reads were retained for subsequent analyses.

ChIP and ChIP-seq
ChIP was performed as described previously 57 . Briefly, cells were crosslinked with 1% formaldehyde at room temperature for 10 min. The crosslinking was then quenched with 0.125 M glycine for 5 min. Chromatin was fragmented using a Bioruptor Pico (Diagenode) for 10 min at high power, with an interval of 30 s between pulses, to obtain fragments of around 200 bp, and was precleared using 20 μl Protein G Dynabeads (Life Technologies, catalog no. 10009D). Subsequently, the soluble chromatin was incubated with 2-5 μg antibodies at 4 °C overnight. Immunoprecipitated complexes were collected using 30 μl per reaction of Protein G Dynabeads that had been saturated with PBS/1% BSA overnight at 4 °C. For all ChIPs, after decrosslinking overnight at 65 °C, final ChIP DNA was extracted and purified using QIAquick spin columns (Qiagen). ChIP-seq libraries were constructed following Illumina's ChIP-seq sample prep kit. The library was amplified by 14 cycles of PCR.

PRO-seq
PRO-seq experiments were performed as previously described with a few modifications 57,58 . Briefly, around 2 million MCF7 cells treated with E2 for 1 h were washed three times with cold PBS and then swelled sequentially in swelling buffer (10 mM Tris-HCl pH 7.5, 2 mM MgCl2, 3 mM CaCl2) for 10 min on ice, harvested and lysed in lysis buffer (swelling buffer plus 0.5% Nonidet P-40, 20 U of SUPERase-In and 10% glycerol). The resultant nuclei were washed two more times with 10 ml lysis buffer and finally resuspended in 100 μl freezing buffer (50 mM Tris-HCl pH 8.3, 40% glycerol, 5 mM MgCl2, 0.1 mM EDTA). For the run-on assay, resuspended nuclei were mixed with an equal volume of reaction buffer (10 mM Tris-HCl pH 8.0, 5 mM MgCl2, 1 mM dithiothreitol, 300 mM KCl, 20 units of SUPERase-In, 1% sarkosyl, 250 μM A/GTP, 50 μM biotin-11-C/UTP (Perkin-Elmer)) and incubated for 5 min at 30 °C. The resultant nuclear-run-on RNA was then extracted with TRIzol LS reagent (Life Technologies, catalog no. 10296-028) following the manufacturer's instructions. Nuclear-run-on RNA was fragmented to around 200-500 nt by alkaline base hydrolysis on ice for 30 min and neutralized by adding one volume of 1 M Tris-HCl pH 6.8. Excessive salt and residual NTPs were removed by using a P-30 column (Bio-Rad, catalog no. 732-6250), followed by treatment with DNase I (Promega catalog no. M6101) and Antarctic phosphatase (NEB catalog no. M0289L). Fragmented nascent RNA was bound to 10 μl of MyOne Streptavidin C1 dynabeads (Invitrogen, catalog no. 65001) following the manufacturer's instructions. The beads were washed twice in high salt (2 M NaCl, 50 mM Tris-HCl pH 7.5, 0.5% Triton X-100, 0.5 mM EDTA), once in medium salt (1 M NaCl, 5 mM Tris-HCl pH 7.5, 0.1% Triton X-100, 0.5 mM EDTA) and once in low salt (5 mM Tris-HCl pH 7.5, 0.1% Triton X-100). Bound RNA was extracted from the beads using Trizol (Invitrogen, catalog no. 15596-018) in two consecutive extractions, and the RNA fractions were pooled and ethanol precipitated. PRO-seq libraries were prepared with the NEBNext Small RNA Library Prep Kit (NEB, catalog no. E7330).

Deep sequencing
For all high-throughput sequencing, the extracted DNA libraries were sequenced with an Illumina HiSeq 4000 or NovaSeq 6000 system according to the manufacturer's instructions. DNA sequences generated by the Illumina Pipeline were aligned to the human (hg38) or mouse (mm10) genome assembly using Bowtie v.2 (ref. 59). The data were visualized by preparing custom tracks on the University of California Santa Cruz genome browser using the HOMER software package 4 . For each experiment presented in this study, the total number of mappable reads was normalized to 10⁷.
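The read-depth normalization mentioned above amounts to a single scaling factor per library. The sketch below shows the arithmetic under the assumption that raw per-region counts and the total number of mappable reads are already known; the function name and the example numbers are hypothetical.

```python
def normalize_counts(counts, total_mapped_reads, target=10_000_000):
    """Scale raw per-region read counts so the library totals `target` reads."""
    if total_mapped_reads <= 0:
        raise ValueError("total_mapped_reads must be positive")
    scale = target / total_mapped_reads
    return [c * scale for c in counts]


# A hypothetical library with 20 million mappable reads is scaled by 0.5
print(normalize_counts([5, 10], 20_000_000))  # [2.5, 5.0]
```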

Identification of ChIP-seq peaks and TOP1cc-enriched regions
ChIP-seq peak identification, quality control and motif analysis were performed using Samtools 60 and HOMER 4 as described in our previously published methods 25,61 . Briefly, we created tag directories for each individual sample and for the combined replicates of each treatment, allowing no more than two tags per base pair, and then normalized each directory by the total number of mapped tags such that each directory contained 10 million tags. We next made peak calls with a very low threshold, as required for irreproducible discovery rate (IDR) analysis (findPeaks -style factor -o auto), on the individual samples, combined replicates, individual pseudo-replicates and combined pseudo-replicates. We then applied the HOMER-IDR program 4 to format the data for the IDR R package to determine the IDR threshold and identify the top peaks above that threshold. TOP1cc peak identification, quality control and motif analysis were performed following the same rules we used for ChIP-seq.
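The text does not state how the pseudo-replicates were produced. A common convention in IDR workflows, assumed here purely for illustration, is to shuffle the tags of a sample (or of the pooled replicates) and split them into two halves; the function below is a hypothetical sketch of that step and is not part of HOMER.

```python
import random


def make_pseudoreplicates(tags, seed=0):
    """Randomly split a collection of tags into two equal-sized pseudo-replicates."""
    rng = random.Random(seed)   # fixed seed so the split is reproducible
    shuffled = list(tags)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:]


# Tags are represented here by integer IDs for simplicity
pseudo_1, pseudo_2 = make_pseudoreplicates(range(10))
print(len(pseudo_1), len(pseudo_2))  # 5 5
```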

Heatmap and tag density analyses
To generate histograms for the average distribution of tag densities, position-corrected, normalized tags in 100 bp windows were tabulated within the indicated distance from specific sites in the genome. Clustering plots for normalized tag densities at each genomic region were generated using HOMER 4 and then clustered using Gene Cluster 3.0 (ref. 62) and visualized using Java TreeView 63 .
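The windowed tabulation behind such histograms can be sketched as below. This is a simplified stand-in for the HOMER computation, assuming a single chromosome, tags given as integer positions, and no strand-aware position correction or depth normalization; the function name and the example coordinates are hypothetical.

```python
def average_tag_density(tag_positions, centers, flank=1000, window=100):
    """Average number of tags per window around a set of site centers."""
    n_bins = (2 * flank) // window
    totals = [0] * n_bins
    for center in centers:
        start = center - flank
        for pos in tag_positions:
            offset = pos - start
            if 0 <= offset < 2 * flank:
                totals[offset // window] += 1
    # Divide by the number of sites to obtain the average profile
    return [t / len(centers) for t in totals]


# One site at position 1000, profiled over +/-200 bp in 100 bp windows
profile = average_tag_density([950, 1000, 1050, 1099], [1000], flank=200)
print(profile)  # [0.0, 1.0, 3.0, 0.0]
```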

PRO-seq analysis
PRO-seq data analyses were performed as previously reported 57 . The sequencing reads were aligned to hg38 using Bowtie v.2 with very sensitive parameters. Common artifacts derived from clonal amplification were circumvented by retaining at most three tags from each unique genomic position, as determined from the mapping data.
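Capping the number of tags per unique position, as described above, can be illustrated with a short filter. The tuple representation of a tag (chromosome, position, strand) and the function name are assumptions made for this sketch.

```python
from collections import Counter


def cap_tags_per_position(tags, max_per_position=3):
    """Keep at most `max_per_position` tags for each (chrom, position, strand)."""
    seen = Counter()
    kept = []
    for tag in tags:
        if seen[tag] < max_per_position:
            seen[tag] += 1
            kept.append(tag)
    return kept


# Five identical tags (likely PCR duplicates) collapse to three; the unique tag is kept
tags = [("chr1", 100, "+")] * 5 + [("chr1", 200, "-")]
print(len(cap_tags_per_position(tags)))  # 4
```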
To determine E2-dependent changes in the gene body, sequencing reads for RefSeq genes were counted over the first 13 kb of the gene body, excluding the 500 bp promoter-proximal region, on the sense strand with respect to the gene orientation using HOMER 4 . EdgeR 64 was used to compute the significance of differential gene expression (fold change (FC) ≥ 1.5, false discovery rate ≤ 0.01). Additionally, a read density threshold (that is, normalized total read counts per kilobase) was used to exclude low-expressed genes. PRO-seq data were normalized to 10 million tags, and HOMER 4 was used to quantify eRNA expression by tabulating normalized tag numbers within ±1,000 bp of the center of the peaks. eRNAs with an FC > 1.5 in PRO-seq signal were considered differentially expressed.
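The gene-body counting window can be made concrete with a small helper. The sketch assumes one reading of the description: the window starts 500 bp downstream of the TSS and ends 13 kb downstream of the TSS or at the transcript end, whichever comes first. Other readings are possible, and the function name and coordinates are hypothetical.

```python
def gene_body_window(tss, tes, strand, skip=500, max_len=13_000):
    """Return (start, end) genomic coordinates of the counting window, or None."""
    if strand == "+":
        start = tss + skip
        end = min(tss + max_len, tes)
    elif strand == "-":
        # On the minus strand the gene runs toward smaller coordinates
        start = max(tss - max_len, tes)
        end = tss - skip
    else:
        raise ValueError("strand must be '+' or '-'")
    # Genes shorter than the skipped promoter-proximal region have no window
    return (start, end) if start < end else None


print(gene_body_window(1000, 50000, "+"))  # (1500, 14000)
print(gene_body_window(50000, 1000, "-"))  # (37000, 49500)
```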

Bioinformatic characterization of enhancer groups
We followed our previously published method to define enhancer groups in MCF7 cells 13 . Briefly, putative enhancer sites were first defined based on ChIP-seq enrichment of H3K27Ac (GSM1115992) flanking ±1,000 bp from the center of the ERα peaks or assay for transposase accessible chromatin with high-throughput sequencing (ATAC-seq) peaks.

Nature Structural & Molecular Biology
Article https://doi.org/10.1038/s41594-022-00883-8

ERα-marked MegaTrans enhancers were defined in our previous reports with the following criteria: (1) regions are at least 3 kb away from annotated transcription start sites (TSSs); (2) regions have at least 16 tags from H3K27Ac ChIP-seq normalized to 10 million tags; (3) regions have at least 10 tags from PRO-seq normalized to 10 million tags when MCF7 cells were treated with E2; and (4) the FC of eRNA expression between E2 and ethanol conditions was at least 1.5.
ERα-marked other active enhancers were defined by the following criteria: (1) regions were at least 3 kb away from annotated TSSs; (2) regions had at least 16 tags from H3K27Ac ChIP-seq normalized to 10 million tags; (3) regions had at least 10 tags from PRO-seq normalized to 10 million tags when MCF7 cells were treated with either ethanol or E2; and (4) the FC of eRNA expression between E2 and ethanol conditions was more than 0.67 and less than 1.5.
Other active enhancers were defined by the following criteria: (1) regions were at least 3 kb away from annotated TSSs and were not marked by ERα; (2) regions had at least 16 tags from H3K27Ac ChIP-seq normalized to 10 million tags; (3) regions had at least 10 tags from PRO-seq normalized to 10 million tags when MCF7 cells were treated with either ethanol or E2; and (4) the FC of eRNA expression between E2 and ethanol conditions was more than 0.67 and less than 1.5.
p65-marked proinflammatory enhancers were defined by the following criteria: (1) regions are at least 3 kb away from annotated TSSs; (2) regions have at least 16 tags from H3K27Ac ChIP-seq normalized to 10 million tags; and (3) regions have at least 16 tags from Pol II ChIP-seq and p65 ChIP-seq signals normalized to 10 million tags when MCF7 cells were treated with TNFα.
DHT-induced active enhancers were defined by the following criteria: (1) regions are at least 3 kb away from annotated TSSs; (2) regions have at least 16 tags from H3K27Ac ChIP-seq normalized to 10 million tags; (3) regions have at least 10 tags from Pol II ChIP-seq and AR ChIP-seq signals normalized to 10 million tags when LNCaP cells were treated with DHT; and (4) the FC of Pol II ChIP-seq tags between DHT and vehicle (Veh) conditions is at least 1.5.
KCl-induced neuronal enhancers were defined by the following criteria: (1) regions are at least 3 kb away from annotated TSSs; (2) regions have at least 16 tags from H3K27Ac ChIP-seq normalized to 10 million tags; (3) regions have at least 10 tags from Pol II ChIP-seq signals normalized to 10 million tags when primary cortical neurons were treated with KCl (30 min); and (4) the FC of Pol II ChIP-seq tags between KCl (30 min) and KCl (0 min) conditions is at least 1.5.
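The MCF7 enhancer groups defined above can be expressed as a simple decision rule. The sketch below covers only the three E2-related groups and rests on assumptions not stated in the text: all tag counts are already normalized to 10 million tags, MegaTrans enhancers carry an ERα peak, and a zero ethanol signal is treated as an infinite fold change. The function name, labels and example values are hypothetical.

```python
def classify_mcf7_enhancer(dist_to_tss_bp, h3k27ac_tags, pro_seq_e2_tags,
                           pro_seq_etoh_tags, has_era_peak):
    """Assign a putative enhancer to one of the E2-related groups, or None."""
    if dist_to_tss_bp < 3000 or h3k27ac_tags < 16:
        return None
    # Fold change of eRNA signal, E2 over ethanol (assumed handling of zero)
    if pro_seq_etoh_tags > 0:
        fc = pro_seq_e2_tags / pro_seq_etoh_tags
    else:
        fc = float("inf")
    if has_era_peak and pro_seq_e2_tags >= 10 and fc >= 1.5:
        return "megatrans"
    transcribed = max(pro_seq_e2_tags, pro_seq_etoh_tags) >= 10
    if transcribed and 0.67 < fc < 1.5:
        return "era_other_active" if has_era_peak else "other_active"
    return None


print(classify_mcf7_enhancer(5000, 20, 30, 10, True))   # megatrans
print(classify_mcf7_enhancer(5000, 20, 12, 11, True))   # era_other_active
print(classify_mcf7_enhancer(5000, 20, 12, 11, False))  # other_active
```

Regions that fail every rule, such as an E2-induced site without an ERα peak, return None because the stated criteria do not assign them to any of these groups.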

Motif analysis and gene ontology analysis
For de novo motif analysis, transcription factor motif finding was performed on ±200 bp relative to the centers defined from ChIP-seq peaks or TOP1cc peaks using HOMER 4 . Peak sequences were compared with random genomic fragments of the same size and normalized G/C content to identify motifs enriched in the ChIP-seq targeted sequence. Sequence logos were generated using WebLOGO 65 . Gene ontology analysis was performed with Metascape 66 .

Overlaps
The overlaps between sites identified in ChIP-seq for DNA-binding proteins and TOP1cc signals were calculated using BEDTools 67 and their statistical significance (versus background distribution) was confirmed using HOMER 4 .
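For readers unfamiliar with interval overlap, the core test is sketched below in plain Python rather than with BEDTools. It assumes BED-style half-open intervals (start inclusive, end exclusive) and uses a simple quadratic scan that is adequate only for small inputs; the function name and coordinates are hypothetical, and no significance testing is included.

```python
def count_overlapping(peaks_a, peaks_b):
    """Count peaks in A that overlap at least one peak in B by 1 bp or more."""
    by_chrom = {}
    for chrom, start, end in peaks_b:
        by_chrom.setdefault(chrom, []).append((start, end))
    n = 0
    for chrom, start, end in peaks_a:
        # Two half-open intervals overlap if each starts before the other ends
        if any(start < b_end and b_start < end
               for b_start, b_end in by_chrom.get(chrom, [])):
            n += 1
    return n


peaks_a = [("chr1", 100, 200), ("chr1", 300, 400), ("chr2", 100, 200)]
peaks_b = [("chr1", 150, 160), ("chr1", 400, 500), ("chr3", 100, 200)]
print(count_overlapping(peaks_a, peaks_b))  # 1
```

Only the first peak counts: the second merely abuts a peak in the other set, and the third lies on a chromosome with no peaks in the other set.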

Statistics and reproducibility
All qPCR experiments were performed with at least three independent biological replicates, and results are shown as means ± s.d. Statistical analyses were conducted using Prism v.6 software (GraphPad Software). Statistical comparisons between groups were analyzed for significance by paired two-tailed t-test. Differences are considered significant at P < 0.05. NS, not statistically significant; **P < 0.01; ***P < 0.001. The exact values of n, statistical measures (mean ± s.d.) and statistical significance are reported in the figure legends. For western blots in Figs. 1d, 3b,e,f and 4c and Extended Data Figs. 3a, 7a and 9a, at least two independent biological replicates were performed. For ChIP-seq, ATAC-seq and all the CUT&RUN assays, we initially generated two biological replicates and calculated the Pearson correlation. If the correlation was <0.9, additional replicates were generated. For PRO-seq experiments, we generated a minimum of three biological replicates. For all the boxplots for the genome-wide experiments analysis, unpaired two-tailed t-tests were performed. For all the Pearson correlations in Fig. 3d and Extended Data Fig. 6c-e, one-tailed t-tests were used.
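The replicate check described above can be sketched as follows. The text does not say which values were correlated; this example assumes paired per-region signal values (for instance, normalized tag counts over a common set of regions) for the two replicates, and the function names and data are hypothetical.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    # Raises ZeroDivisionError if either sequence is constant
    return sxy / math.sqrt(sxx * syy)


def needs_more_replicates(rep1, rep2, threshold=0.9):
    """True if the replicate correlation falls below the stated cutoff."""
    return pearson_r(rep1, rep2) < threshold


print(needs_more_replicates([1, 2, 3, 4], [1.1, 2.0, 2.9, 4.2]))  # False
print(needs_more_replicates([1, 2, 3, 4], [4, 3, 2, 1]))          # True
```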

Data resources
We used published ChIP-seq data from the Gene Expression Omnibus database for DNA-PKcs under accession number GSE60270 (ref. 24), and GRO-seq data for MCF7 with 1 h E2 treatment under accession number GSE45822 (ref. 25).

Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability
Most data are available in the main text or the supplementary materials. Whole genome sequencing datasets have been deposited in the NCBI Gene Expression Omnibus under accession number GSE135808. Please direct any requests for further information or reagents to the lead contact M.G.R., School of Medicine, UCSD, La Jolla, CA 92093, USA. Source data are provided with this paper.