3D genome alterations associated with dysregulated HOXA13 expression in high-risk T-lineage acute lymphoblastic leukemia

Yang, Lu; Chen, Fengling; Zhu, Haichuan; Chen, Yang; Dong, Bingjie; Shi, Minglei; Wang, Weitao; Jiang, Qian; Zhang, Leping; Huang, Xiaojun; Zhang, Michael Q.; Wu, Hong

doi:10.1038/s41467-021-24044-5

Article
Open access
Published: 17 June 2021

Lu Yang ORCID: orcid.org/0000-0001-6574-6194^1,2,
Fengling Chen³,
Haichuan Zhu^1,2,
Yang Chen³,
Bingjie Dong^1,2,
Minglei Shi^3,4,
Weitao Wang^1,2,
Qian Jiang⁵,
Leping Zhang⁶,
Xiaojun Huang^2,5,
Michael Q. Zhang^3,4,7 &
Hong Wu ORCID: orcid.org/0000-0001-7545-7919^1,2,5

Nature Communications volume 12, Article number: 3708 (2021)

Abstract

3D genome alterations can dysregulate gene expression by rewiring enhancer-promoter interactions and lead to disease. We report integrated analyses of 3D genome alterations and differential gene expression in 18 newly diagnosed T-lineage acute lymphoblastic leukemia (T-ALL) patients and 4 healthy controls. 3D genome organization at the levels of compartments, topologically associated domains and loops could hierarchically classify different subtypes of T-ALL according to the T cell differentiation trajectory, similar to gene expression-based classification. Thirty-four previously unrecognized translocations and 44 translocation-mediated neo-loops are mapped by Hi-C analysis. We find that neo-loops formed in the non-coding region of the genome could potentially regulate ectopic expression of TLX3, TAL2 and HOXA transcription factors via enhancer hijacking. Importantly, both translocation-mediated neo-loops and NUP98-related fusions are associated with HOXA13 ectopic expression. Patients with HOXA11-A13 expression, but not expression of other genes in the HOXA cluster, have an immature immunophenotype and poor outcomes. Here, we highlight the potentially important roles of 3D genome alterations in the etiology and prognosis of T-ALL.

Introduction

T-ALL is an aggressive hematological malignancy caused by genetic and epigenetic alterations that affect normal T-cell development¹. T-ALL represents 15% of pediatric and 25% of adult cases of acute lymphoblastic leukemia (ALL)^1,2. Early T-cell precursor ALL (ETP ALL) is a high-risk subtype characterized by an immature immunophenotype and a gene expression profile similar to that of early T-cell precursors^3,4. Compared to ETP ALL, non-ETP ALLs, including HOXA-, TLX-, and TAL-related subgroups, are blocked at later T-cell differentiation stages⁵. Recent whole-exome and RNA sequencing analyses of large T-ALL cohorts have identified driver mutations, dysregulated oncogenic transcription factors, and pathways as the major contributors to its pathogenesis^4,6,7,8. However, whether the noncoding region of the genome and 3D genomic structure play important roles in T-ALL development is largely unknown.

Genomes are hierarchically organized into multi-scale structural units, including compartments, topologically associated domains (TADs), and loops, which can be identified by Hi-C⁹. At the megabase scale, genomes are segregated into A and B compartments, which broadly correspond to transcriptionally active and inactive regions of the genome, respectively¹⁰. The A and B partitioning of the genome is dynamic, and A-to-B or B-to-A switches have been reported during normal development¹¹ and in disease states¹². TADs are genomic regions separated by the binding of insulating proteins, such as CTCF. An important function of a TAD is to keep enhancers and the genes they control inside the same domain^9,13; DNA elements within a TAD therefore preferentially form intradomain rather than interdomain interactions¹³. At the kilobase scale, linear DNA is folded into loops, probably by loop extrusion, through the action of CTCF and the cohesin complex^14,15. Loops frequently bridge promoters and enhancers⁹ and are further organized into individual TADs¹³. Each of these layers of organization has pronounced effects on gene expression^9,10,16,17. Recently, Kloetgen et al. discovered that recurrent TAD boundary changes in the MYC locus are associated with MYC dysregulation, while NOTCH pathway activation can also regulate 3D genome organization in T-ALL¹⁸.

Chromosomal rearrangements are common in cancers and have the potential to disrupt TAD boundaries. Disruption of a TAD boundary can create ectopic loops (neo-loops) between enhancers and promoters that are normally separated, a process termed enhancer hijacking. Enhancer hijacking can result in aberrant gene expression, including ectopic expression of oncogenes^19,20,21,22,23,24,25,26. However, since enhancer hijacking often happens in the noncoding regions of the genome, it is difficult to identify such events via whole-exome and RNA sequencing analyses.

To determine whether alterations in the 3D genome organization are associated with malignant transformation of T-ALL, we conduct BL-Hi-C²⁷ analysis using purified primary leukemic blasts from 18 newly diagnosed T-ALL patients, including 8 ETP ALL and 10 non-ETP ALL, two clinical subtypes of T-ALL, as well as normal peripheral T cells from 4 healthy volunteers. The maximum resolutions of the chromatin contact maps for ETP, non-ETP ALL, and normal samples are ~3.5, 3.5, and 10 kb, respectively (Supplementary Data 1). We focus our analysis on chromosomal translocations, especially those involving the noncoding regions of the genome. Among the 34 previously unrecognized translocations, we identify recurrent HOXA13 translocations that cause “neo-loop” formation. Meanwhile, we discover that T-ALLs with NUP98-related fusions are associated with enhanced loop structures within the 5′HOXA TAD. Taken together, our findings suggest that chromosomal rearrangements can reshape the loop structures of the HOXA locus in T-ALL by “cis” (enhancer hijacking) and “trans” (oncogenic fusion events) mechanisms. Furthermore, by studying the association between 3D genome alterations and clinical phenotypes, we find that ectopic expression of the HOXA11-A13 genes is associated with an immature ETP immunophenotype and poor outcome of T-ALL.

Results

3D genome landscape in T-ALL

Principal component analysis (PCA) at the levels of the compartment, TAD, and loop structures demonstrated that the T-ALL samples could be separated from the control samples by PC1, while ETP and non-ETP ALL could be separated by PC2 at all three architectural levels (Fig. 1a, upper panels) and be further delineated by hierarchical clustering analysis (Fig. 1a, lower panels). By detailed comparisons of the 3D chromosomal organizations of the T-ALL samples and the healthy controls, we identified compartment switches corresponding to 3% of the genome, ~700 differential TAD boundaries and more than 6000 differential looping events (Supplementary Fig. 1a). These results indicate that there are multi-scale chromatin structural differences between T-ALLs and normal T cells. Such differences could be associated with events leading to T-ALL leukemogenesis or simply reflect the different developmental stages corresponding to T-ALL and normal T cells.

**Fig. 1: 3D genome architectures in T-ALLs.**

Correlations between 3D genome alterations and differential gene expressions in T-ALL

To investigate the potential impact of 3D genome alterations on T-ALL development, we performed RNA-seq analysis on all samples. PCA and hierarchical clustering revealed that the transcriptome changes were highly correlated with the 3D genome changes (compare Fig. 1a, b), similar to the recent publication by Kloetgen et al.¹⁸. By integrating Hi-C and RNA-seq data, we found that a large fraction (996/3392, 29%) of the differentially expressed genes (DEGs) was associated with 3D genome alterations (Supplementary Data 2).

Compared with normal T cells, genes associated with the B-to-A compartment switches, increased domain scores (D-score), and enhanced loops in T-ALL were mostly upregulated (Fig. 1c, red bars and Fig. 1d, “Methods”), and were enriched in pathways such as hematopoietic cell lineage, transcriptional misregulation in cancer, and cell cycle (Supplementary Fig. 1e). In contrast, genes associated with the A-to-B compartment switches, decreased D-scores, and reduced loops in T-ALL were mostly downregulated (Fig. 1c, blue bars and Fig. 1d), and were enriched in pathways such as cytokine–cytokine receptor interaction and T-cell receptor signaling (Supplementary Fig. 1e).

To evaluate the potential impacts of copy-number variations (CNVs) on these 3D genome alteration-associated DEGs, we analyzed CNV data of 242 T-ALL samples from Liu et al.⁶ and identified 110 upregulated genes with copy-number gain and 250 downregulated genes with copy-number loss. By aligning these DEGs, we found that only 1 of the 568 3D genome alteration-associated upregulated DEGs exhibited copy-number gain and only 8 of the 428 3D genome alteration-associated downregulated DEGs had copy-number loss (data not shown). Therefore, the majority of the dysregulated genes associated with 3D genome alterations are not attributable to CNV changes.
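The comparison above amounts to intersecting two gene lists and reporting the overlap as a fraction. A minimal sketch follows, assuming gene sets are available as plain collections of identifiers; the function name `overlap_summary` and the placeholder gene identifiers are hypothetical and are not taken from the study's data or code.

```python
def overlap_summary(altered_degs, cnv_genes):
    """Return (n_shared, n_total, fraction) for DEGs that also carry a concordant CNV.

    altered_degs: identifiers of 3D genome alteration-associated DEGs
    cnv_genes:    identifiers of genes with a concordant copy-number change
    """
    altered = set(altered_degs)
    shared = altered & set(cnv_genes)
    total = len(altered)
    fraction = len(shared) / total if total else 0.0
    return len(shared), total, fraction


# Toy input with placeholder identifiers (not real results).
up_with_3d_change = ["GENE_A", "GENE_B", "GENE_C", "GENE_D"]
up_with_copy_gain = ["GENE_B", "GENE_X"]
n_shared, n_total, fraction = overlap_summary(up_with_3d_change, up_with_copy_gain)
print(f"{n_shared}/{n_total} ({fraction:.1%}) of the DEGs overlap a copy-number gain")
```

Applied to the counts reported above, the same arithmetic gives 1/568 and 8/428, both well below 2%.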

Among the upregulated DEGs, CDK6 is a potential target for T-ALL treatment²⁸. The CDK6 locus exhibited a strong intra-TAD interaction and its expression was upregulated in all T-ALL samples (Fig. 1e). Similarly, several upregulated oncogenic driver genes and T-ALL-associated transcription factors, such as MYB, MYCN, BCL11A, SOX4, and WT1, also had increased D-scores (Supplementary Fig. 1f). SOX4 was a unique case among these dysregulated transcription factors, as the Hi-C map showed a B-to-A compartment change, an increased D-score, and new loop formation between its promoter and 3 distal enhancers in T-ALL (Fig. 1f and Supplementary Data 2).

Comparing H3K27ac and CTCF ChIP data at the CDK6 and SOX4 loci in normal T cells and T-ALL Jurkat cells, we also noticed that genomic structure changes often coincide with changes in CTCF binding or H3K27ac modification. We then calculated the co-localization ratio between loop anchors and H3K27ac modification or CTCF binding sites and found that T-ALL-specific loop anchors exhibit a significant enrichment of T-ALL-specific H3K27ac modification or CTCF binding. The same pattern was also seen in normal T cells (Supplementary Fig. 1g). Together, these data suggest that 3D genome alterations, as well as their associated dysregulated gene expression, are more closely associated with epigenetic changes such as CTCF binding and histone modification than with CNV events.

ETP and non-ETP ALL subtypes represent different “frozen stages” of T-cell development

Our 3D genome landscape analyses could separate the ETP ALL samples from the non-ETP ALL samples (Fig. 1a), suggesting that the chromosomal organizations of T-ALL may represent different “frozen stages” of T-cell development²⁹. To test this hypothesis, we first projected the T-ALL samples onto the T-cell developmental trajectory (Fig. 2a, upper) defined by RNA-seq analysis³⁰. PCA revealed that most of the ETP ALL samples were arrested at the immature stage, corresponding to the LMPP to Thy1 stages, while the non-ETP samples were arrested at the Thy2 to Thy4 stages (Fig. 2a, lower). The lack of TCR rearrangement in most of the ETP ALL samples and the different rearrangements in individual non-ETP samples further support the notion that ETP and non-ETP ALL are arrested at different developmental stages (Fig. 2b and Supplementary Fig. 2b, c). Sample 093 was a unique case as it fell between ETP and non-ETP ALL (Figs. 1a, b and 2a) and had significant TCR rearrangement (Supplementary Fig. 2b). We also observed a lack of expression of RAG1 and PTCRA, which are essential for TCR V(D)J rearrangements, in most of the ETP ALL samples (Fig. 2c).

**Fig. 2: ETP and non-ETP ALLs have different loop structures.**

Since the ETP and non-ETP ALLs can be better separated at the loop level (Fig. 1a), we further analyzed the differences in loop structures between ETP and non-ETP ALL samples and identified 1820 enhanced and 831 reduced loops in ETP ALL (Fig. 2d). When plotting gene expression changes between ETP and non-ETP ALL against the combined p value of the loop strength and D-score changes, we found a strong positive correlation (Pearson’s correlation coefficient 0.685; Fig. 2e). Approximately 20% and 16% of the upregulated genes in ETP and non-ETP ALL, respectively, harbored consistent chromatin structure changes, including key transcription factors or oncogenes, such as CEBPA, MYCN, and LYL1 for ETP and LEF1, TCF12 and PAX9 for non-ETP ALL (Fig. 2e and Supplementary Data 3).
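The paragraph above combines two per-gene p values (loop strength and D-score) and correlates the result with expression change. The text does not state how the p values were combined, so the sketch below uses Fisher's method purely as an assumption, together with a plain Pearson correlation; the function names `fisher_combined_p` and `pearson_r` are hypothetical.

```python
import math


def fisher_combined_p(p_values):
    """Combine independent p values (each > 0) with Fisher's method.

    The statistic X = -2 * sum(ln p) follows a chi-square distribution with
    2k degrees of freedom; for even degrees of freedom the survival function
    has the closed form exp(-X/2) * sum_{i<k} (X/2)^i / i!.
    """
    k = len(p_values)
    half_stat = -sum(math.log(p) for p in p_values)  # equals X / 2
    term, total = 1.0, 1.0
    for i in range(1, k):
        term *= half_stat / i
        total += term
    return math.exp(-half_stat) * total


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)
```

In use, each gene would receive a combined p value from its loop and D-score tests, and a signed transform of that value could be plotted against the log2 expression fold change; the choice of transform is likewise an assumption here.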

Gene ontology analysis further revealed that genes associated with the ETP ALL-enhanced loops were enriched in immune response-activating signal transduction, myeloid cell differentiation, and regulation of B-cell activation, consistent with the definition of ETP ALL (Supplementary Fig. 2a, left). Genes associated with the non-ETP ALL-enhanced loops were enriched in terms such as positive regulation of RNA metabolism, transcription, and TCR V(D)J recombination (Supplementary Fig. 2a, right).

The 3D genome analysis also provided a potential explanation for ETP and non-ETP ALL-specific transcription factor expressions. For example, we detected subtype-specific loops and expression patterns in the MEF2C locus in ETP and the PAX9 locus in the non-ETP ALL samples, respectively, which were associated with H3K27ac marks in the ETP ALL sample 077, ETP ALL cell line KE37, and non-ETP ALL cell line Jurkat (Fig. 2f, g). Collectively, the differential gene expression profiles between ETP and non-ETP ALL subtypes represent T progenitor cell arrests at different T-cell developmental stages, which are tightly associated with alterations of 3D genomes.

Hi-C analysis revealed previously unrecognized translocations in T-ALL

Chromosomal rearrangement is one of the major driving forces for tumorigenesis³¹, especially for leukemia¹. By adapting hic_breakfinder³² (“Methods”), we identified 46 translocations in 14/18 T-ALL samples (Supplementary Data 4), of which 34 were newly discovered and 26 were interchromosomal events (Fig. 3a, red lines). Among the 78 unique breakpoints identified, 47% were located in noncoding regions, and 66% were located in the stable A compartment (Supplementary Fig. 3a). These newly identified translocations not only influenced the expression of the nearest genes (Fig. 3a) but also resulted in the formation of 44 neo-loops across the translocated chromosomes, which we named translocation-mediated neo-loops (Supplementary Fig. 3b and Supplementary Data 5, “Methods”). Interestingly, the ends of these translocation-mediated loops tend to anchor at pre-existing loop anchors and CTCF binding sites (Fig. 3b). Importantly, nearly 78% of the translocation-mediated loops with CTCF motifs were linked to pairs of convergently oriented CTCF motifs (Fig. 3c), indicating that these loops may be mediated by a loop extrusion mechanism, similar to canonical chromatin loops^15,33,34.
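The convergent-motif observation can be illustrated by classifying each loop from the strands of the CTCF motifs at its two anchors. The sketch below assumes the common convention that a forward (`+`) motif at the upstream anchor facing a reverse (`-`) motif at the downstream anchor is convergent, and that strands have already been expressed in the coordinate frame of the rearranged chromosome; the function names are hypothetical and this is not the pipeline used in the study.

```python
def classify_ctcf_pair(left_strand, right_strand):
    """Classify a loop by the CTCF motif strands at its left and right anchors."""
    pair = (left_strand, right_strand)
    if pair == ("+", "-"):
        return "convergent"  # motifs point toward each other
    if pair == ("-", "+"):
        return "divergent"   # motifs point away from each other
    if pair in (("+", "+"), ("-", "-")):
        return "tandem"      # motifs point in the same direction
    raise ValueError(f"unexpected strand pair: {pair!r}")


def convergent_fraction(loops):
    """Fraction of loops whose anchor motifs are convergent.

    loops: iterable of (left_strand, right_strand) tuples, restricted to
    loops that carry a CTCF motif at both anchors.
    """
    labels = [classify_ctcf_pair(left, right) for left, right in loops]
    return labels.count("convergent") / len(labels) if labels else 0.0


# Toy input (not real data): two convergent loops out of four.
toy_loops = [("+", "-"), ("+", "-"), ("-", "+"), ("+", "+")]
print(convergent_fraction(toy_loops))
```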

**Fig. 3: Chromosomal rearrangements in T-ALLs.**

Clinically, non-ETP ALL can be further classified into the HOXA, TLX, and TAL subtypes according to their gene expression profiles³⁵. Notably, there was a complete match between loop-based hierarchical clustering and non-ETP ALL subtypes, which were signified by chromosomal translocation-mediated dysregulation of T-ALL-associated transcription factors (Fig. 3d). Given that oncogenic transcription factors, such as NOTCH, can drive cancer-specific chromatin interactions³⁶, these results suggest that ectopically expressed transcription factors may shape the subgroup-specific loop organization and cause the unique gene expression profile of each T-ALL subgroup.

Potential mechanisms involved in translocation-mediated gene activation

With the translocation-mediated neo-loops, we could accurately assign the possible dysregulated genes affected by the translocations. We found that translocation could potentially activate T-ALL-associated transcription factors via either “trans” or “cis” mechanism. The “trans” mechanism involves translocations within the coding regions of the genome, which mediate gene fusions, such as the PSIP1-NUP98, SET-NUP214, and MLL-related gene fusion events KMT2A-MLLT1, PICALM-MLLT10, and DDX3X-MLLT10. As reported in previous studies, these fusion events could epigenetically activate HOXA cluster gene expressions^37,38,39,40.

The “cis” mechanism involved translocations within the noncoding region of the genome in the ETP, TLX, and TAL subtypes (Fig. 3d and Supplementary Fig. 3b), in which ectopically expressed transcription factors, such as TLX3, could hijack enhancers from the translocated BCL11B via translocation-mediated neo-loop formation (Fig. 3e and Supplementary Fig. 3c, d). This “cis” mechanism could involve the 2 BCL11B-TLX3, 1 TRB-TAL2, and 3 HOXA13 translocations identified in this study (Supplementary Fig. 3b and Supplementary Data 5). Interestingly, most of these putative hijacked enhancers are from genes that are normally expressed during T-cell development, such as BCL11B and TRB, which could lead to ectopic expression of T-ALL-associated transcription factors in the T lineage and block normal differentiation (Supplementary Fig. 3d).

3D genome alterations and ectopic HOXA gene expressions

The dysregulated HOXA cluster is a common feature of T-ALL³⁵ and is associated with poor prognosis^41,42. We therefore focused our attention on the relationship between 3D genome alterations and ectopic HOXA gene expression. We conducted unsupervised hierarchical clustering based on the levels of HOXA gene expression, which separated the 15 T-ALL samples without HOXA translocation into HOXA-negative (HOXA⁻) and HOXA-positive (HOXA⁺) groups. Translocation-mediated fusion events could be detected in 5/7 HOXA⁺ samples, implying that translocation may be the major driving force for HOXA activation. The HOXA⁺ T-ALLs could be further separated into two subgroups, the 3′HOXA⁺ and 5′HOXA⁺ subgroups, with respect to the location of the expressed HOXA genes within the HOXA cluster (Fig. 4a, b). The expression patterns of the three HOXA13 translocation cases (HOXA13-T, breakpoints shown in Fig. 4b) were closer to those in the 5′HOXA⁺ subgroup, characterized by ectopic HOXA13 expression (Fig. 4a).

**Fig. 4: Chromatin interaction profile and expression patterns of the *HOXA* cluster in T-ALLs.**

The HOXA cluster, which contains 11 genes, is transcriptionally repressed in normal T cells but can be transactivated in T-ALLs by fusion proteins that recruit histone methyltransferase DOT1L to the HOXA locus^37,43. Although this mechanism uncovered how the HOXA cluster is activated, it cannot explain the diverse HOXA expression patterns associated with different fusion proteins. By integrating Hi-C maps with HOXA gene expression patterns, we found that the differential HOXA gene expressions were associated with different 3D genome organizations.

Hi-C maps and CTCF motif orientations showed that the 11 HOXA genes were partitioned between two TADs (Fig. 4b and Supplementary Fig. 4a): the CTCF binding site C11/13 was used as the 3′ boundary of the 5′ TAD in all samples, while the 5′ boundaries of the 3′ TADs varied among different samples: C7/9 was used by most of the HOXA⁻ (6/8) and all 5′HOXA⁺ samples (2/2), while C10/11 was used by most of the 3′HOXA⁺ samples (4/5) (red and blue arrows/lines, respectively; Fig. 4b and Supplementary Fig. 4a).

We further identified six enhancer regions in each of the two TADs, labeled E1–E12, which could interact with the HOXA cluster (Fig. 4c). Using HOXA⁻ cases as the common reference (Fig. 4c, top panel), we calculated the overall differential interaction intensities. Although there was no significant difference between the healthy controls and the HOXA⁻ cases in the 12 interaction regions, we found significantly enhanced interactions between E2–E6 and genes in the 3′HOXA⁺ subgroup, as well as between E8, E9, E11, and E12 and genes in the 5′HOXA⁺ subgroup, either as a group average (Fig. 4c) or individually (Supplementary Fig. 4b, c). ChIP-seq analysis of the HOXA⁺ Loucy cell line indicated that these increased interactions may be correlated with gains in the H3K27ac histone mark (Fig. 4c). These results suggest that 3D genome organization is closely associated with the patterns of ectopic gene expression within the HOXA cluster.

Unique fusion events associated with ectopic HOXA11-A13 gene expression

By investigating the associations between different fusion events and ectopic expression of the HOXA genes, we found that MLL-related fusion events, such as KMT2A-MLLT1, PICALM-MLLT10, and DDX3X-MLLT10, were associated with the 3′HOXA subgroup, the PSIP1-NUP98 fusion event was associated with the 5′HOXA subgroup, while SET-NUP214 fusion events were associated with both subgroups (Fig. 4d).

We further verified these results by including a larger cohort of HOXA⁺ T-ALL samples from the St. Jude data set⁶. Out of 131 HOXA-positive T-ALL samples (82 from St. Jude, 39 from in-house data, and 10 from our study), we found 41 with the aforementioned fusion events and 4 with HOXA13-T. Among MLL-related fusions, 29/32 had HOXA7 expression, a signature of the 3′HOXA subgroup, while 5/6 NUP98-related fusions were HOXA13-positive, a signature of the 5′HOXA subgroup. SET-NUP214 fusion samples showed no specificity (Fig. 4e and Supplementary Fig. 4d). These results suggest that different translocation-mediated fusion events may preferentially alter the genome interactome of the HOXA cluster and control the expression of a specific set of HOXA genes.

Activation of 5′HOXA genes via potential enhancer hijacking

To explain the gene expression patterns seen in the four HOXA13-T samples, we mapped the breakpoints of the three T-ALL samples with Hi-C data. We found that all the breakpoints were located within the 5′ TAD, upstream of the HOXA13 gene. The translocation partner breakpoints lie in the gene bodies of the BCL11B and CDK6 genes on chromosomes 14 and 7, respectively, as well as upstream of the ERG gene on chromosome 21 (Fig. 5 and Supplementary Fig. 5). By examining the TAD structures associated with the translocations, using HOXA⁻ samples as controls, we found translocation-mediated neo-loop formation between the 5′ end of the HOXA cluster and cis-regulatory elements associated with the BCL11B, ERG, and CDK6 genes (Fig. 5a–c). BCL11B, ERG, and CDK6 were expressed in these T-ALL samples (Supplementary Figs. 4c and 5a, b), and ChIP-seq analysis of blast cells from case 077 and of Jurkat and Loucy cells confirmed that the elements associated with the neo-loops were marked by H3K27ac, a sign of active enhancers (Fig. 5a–c). These results suggest that the active enhancers from the BCL11B, ERG, and CDK6 genes are juxtaposed upstream of HOXA13 after translocation, promoting ectopic expression of HOXA13 in case 076 and HOXA9-A13 in case 077 (blue circles for neo-loops and green bar graphs for gene expression; Fig. 5a, b and Supplementary Fig. 3b). For case 108, the inter-TAD inversion could potentially lead to the adoption of the active CDK6 enhancers and neo-loop formation, leading to ectopic expression of the HOXA11-A13 genes (Fig. 5c).

**Fig. 5: Translocation-mediated enhancer hijacking and ectopic *HOXA* gene expressions in T-ALLs.**

To directly assess the capacity of the putative enhancers to potentiate gene expression, we cloned DNA fragments corresponding to the putative enhancers or adjacent DNA fragments into the luciferase reporter construct (E, enhancer fragment; C, adjacent control fragment; Fig. 5a–c and Supplementary Fig. 5c–e). When expressed in Jurkat cells, these putative enhancers led to robust reporter activities as compared to the control fragments (Fig. 5d). Most of the putative enhancers also showed robust reporter activities in Loucy cells (Supplementary Fig. 5f). These results support the notion that translocation and inversion could bring active enhancers close to otherwise silenced 5′HOXA genes through enhancer hijacking, leading to dysregulated gene expression. The existence of the CTCF binding sites C11/13 in case 076, C7/9 in case 077, and C10/11 in case 108 (Fig. 5e) may insulate the HOXA genes located in the 3′ TAD from the active enhancers, leading to 5′HOXA-specific expression patterns (Fig. 4b and Supplementary Fig. 4d).

HOXA11-A13 positivity is linked to inferior outcome in pediatric T-ALL

With heterogeneous HOXA gene expression patterns and different activation mechanisms discovered by Hi-C, we next investigated whether HOXA⁺ cases also represent heterogeneous clinical entities. By analyzing a cohort of T-ALL patients with outcome information (our unpublished results), we found that HOXA13 or HOXA11 positivity, alone or in combination, but not the expression of other HOXA genes, such as the previously used biomarker HOXA9, was associated with poor overall and event-free survival in young adult and pediatric T-ALLs (Fig. 6a and Supplementary Fig. 6a, b). In multivariate analysis, HOXA13⁺ status could serve as an independent predictor for the overall survival of pediatric and young adult T-ALLs (Fig. 6b).

**Fig. 6: Ectopic *HOXA11*-*A13* expressions are correlated with poor outcomes in pediatric and young adult T-ALLs.**

As almost all cases with HOXA13-T and NUP98-related fusion events belong to the ETP ALL subtype and were associated with ectopic expression of the HOXA11-A13 genes (Fig. 4a and Supplementary Fig. 4d), we extended our analysis by integrating our in-house data with data published by Liu et al.⁶ to further characterize the HOXA11-A13⁺ group. Compared to the HOXA1-A10⁺ cases, HOXA11-A13⁺ cases were indeed enriched for the ETP ALL subtype in this large cohort of 273 patients (Fig. 6c). We also checked mutations associated with the HOXA11-A13⁺ group in 325 samples with WES data and found that the rate of JAK-STAT pathway-associated mutations was significantly higher in the HOXA11-A13⁺ group than in the HOXA1-A10⁺ group (Fig. 6d). Therefore, HOXA11-A13 positivity, which can be identified either by gene expression or by cytogenetic analysis of the translocation and fusion events identified in this study, may serve as a biomarker for identification of T-ALL patients with poor prognosis. JAK-STAT inhibitor treatment may also benefit this group of patients.

Discussion

Most studies on the underlying mechanisms of T-ALL leukemogenesis have focused on the coding regions of the genome, and many disease-driving genes and pathways have been identified through WES or RNA-seq analyses. Our work, together with the recent publication by Kloetgen et al.¹⁸, provides a comprehensive view of the 3D chromosomal structures of T-ALL. Both works demonstrate that global 3D genome architecture can separate normal T cells and two T-ALL subtypes (Fig. 1), although caution must be taken in interpreting this result, as the differences detected by Hi-C may simply reflect the different developmental stages corresponding to T-ALL and normal T cells. The chromosomal structure-based clustering is consistent with gene expression-based grouping, implying that 3D genome alterations may be responsible for dysregulated gene expression in T-ALL (Fig. 2). Indeed, in our study ~29% of the genes differentially expressed between T-ALL and normal T cells, including dysregulated key transcription factors in ETP and non-ETP ALL, are associated with 3D genome alterations but not CNVs. While Kloetgen et al.¹⁸ focused on NOTCH pathway-regulated genomic structures and recurrent TAD fusion events that regulate MYC expression, we focused our attention on translocation events previously unknown to the field, especially those within the noncoding regions of the genome. Overall, both studies suggest that 3D genome alterations may be contributing factors for T-ALL leukemogenesis.

By employing high-resolution Hi-C maps, we identified 34 translocations in T-ALL that cannot be recognized by RNA-seq and WES (Fig. 3). Dysregulated oncogenic transcription factors, such as TAL1, TAL2, TLX3, LMO2, and HOXA, were associated with translocation-mediated genomic alterations in 78% (14/18) of cases, suggesting that translocations may play driver roles in T-ALL leukemogenesis. We further provided detailed contact maps between the HOXA cluster and the two TADs surrounding the cluster (Fig. 4). We demonstrated the tight associations among enhanced 3′TAD contacts, MLL-related gene fusion events, and 3′HOXA gene expression. Similarly, the 5′HOXA gene expression pattern, enhanced 5′TAD contacts, and NUP98-related fusion events showed a positive correlation. These results suggest that different fusion events may differentially activate HOXA gene expression by enhancing the interaction between the 3′ or 5′ TAD and the HOXA cluster. Importantly, we demonstrated translocation-mediated “neo-TAD” and “neo-loop” formation on the Hi-C contact maps of three HOXA13 translocation cases, and potential enhancer-hijacking mechanisms that were likely involved in T-ALL development (Fig. 5). Further study is required to demonstrate causality between “neo-TAD” and “neo-loop” formation and dysregulated transcription factors.

Our 3D structure-based analysis helped to separate T-ALL patients with ectopic HOXA cluster expression into two groups, one with HOXA11-A13 expression and the other with HOXA1-A10 expression (Figs. 4 and 6). Although ectopic expression of the HOXA1-A10 genes was not associated with the survival of T-ALL patients, HOXA11-A13 expression could serve as an independent predictor for poor overall survival of pediatric and young adult T-ALLs. Ectopic expression of HOXA11-A13 genes was associated with HOXA13 translocation and NUP98-related fusions, which could be readily identified by cytogenetic analysis. Patients with ectopic expression of HOXA11-A13 genes also have a higher rate of JAK-STAT pathway mutations, suggesting that JAK-STAT inhibitor treatment may benefit this group of patients. Together, our study indicates that 3D genome alterations may contribute to T-ALL development by regulating adjacent or distant gene expression via “cis” or “trans” mechanisms, and that understanding the underlying mechanisms may inform new strategies for patient stratification and targeted therapy.

Methods

Patients and samples

The T-ALL samples were collected from diagnostic bone marrow aspirations at the Peking University People’s Hospital of China. The patient characteristics are described in Supplementary Data 6. ETP ALL status was defined as previously published³. Leukemia blast cells were prepared by density-gradient centrifugation of bone marrow samples, and CD19⁻CD14⁻CD235⁻CD45⁺CD7⁺ cells were further purified by flow cytometry sorting using anti-human antibodies for RNA-seq and Hi-C library preparations. Peripheral blood samples were obtained from four healthy donors. T cells were purified using the EasySep™ Direct Human T Cell Isolation Kit (StemCell Technology #19661). This study was approved by the Ethics Committee of Peking University People’s Hospital. All patients and healthy donors provided written informed consent before any study procedure.

RNA-seq library preparation, data processing, and differential gene expression analysis

RNA-seq libraries were prepared with the TruSeq RNA Library Prep Kit v2 (Illumina). Paired-end RNA-seq reads of the 18 patients and 4 healthy controls were generated with an average depth of 15 million read pairs. Reads were aligned to the hg19 genome with TopHat (v2.1.0) using default settings⁴⁴. Duplicates were removed, and aligned reads were counted for each protein-coding gene using HTSeq⁴⁵, followed by FPKM transformation, which normalizes by gene exon length and sequencing depth. Raw RNA-seq data for Loucy and Jurkat were downloaded from the GEO database and analyzed as described above.
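The FPKM transformation mentioned above can be sketched as follows, assuming the standard definition (fragments per kilobase of exon per million mapped fragments). The function name `fpkm` and the example numbers are illustrative only and are not taken from the study.

```python
def fpkm(fragment_count, exon_length_bp, total_mapped_fragments):
    """Fragments per kilobase of exon per million mapped fragments.

    fragment_count:         fragments assigned to the gene
    exon_length_bp:         summed exon length of the gene, in base pairs
    total_mapped_fragments: all mapped fragments in the library
    """
    if exon_length_bp <= 0 or total_mapped_fragments <= 0:
        raise ValueError("exon length and library size must be positive")
    # 1e9 = 1e3 (bp per kilobase) * 1e6 (fragments per million)
    return fragment_count * 1e9 / (exon_length_bp * total_mapped_fragments)


# Illustrative numbers: 100 fragments on a 2 kb gene in a 10-million-fragment library.
print(fpkm(100, 2000, 10_000_000))
```

Dividing by both exon length and library size makes values comparable across genes of different lengths and across samples of different sequencing depth.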

DESeq2⁴⁶ was applied to identify the differentially expressed genes with FDR < 0.01 and fold change >2. Genes with fewer than five reads in 20% of the samples or with mean reads fewer than two were excluded. Signal tracks were generated by using BEDTools⁴⁷ genomeCoverageBed to produce bedGraph files scaled to one million reads per data set. Then, the UCSC Genome Browser utility⁴⁸ bedGraphToBigWig was used with default parameters to generate bigWig files.
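
The two numeric rules in this paragraph (the differential expression cutoffs and the per-million scaling of signal tracks) can be sketched as follows. This is an illustration under stated assumptions: the fold-change cutoff is taken to apply in both directions, and the function names are hypothetical rather than part of DESeq2 or BEDTools.

```python
import math


def classify_gene(log2_fold_change, padj, fdr_cutoff=0.01, fc_cutoff=2.0):
    """Label a gene as 'up', 'down', or 'ns' (not significant).

    Assumption: 'fold change > 2' is applied symmetrically, i.e. |log2FC| > 1.
    """
    if padj is None or math.isnan(padj) or padj >= fdr_cutoff:
        return "ns"
    threshold = math.log2(fc_cutoff)
    if log2_fold_change > threshold:
        return "up"
    if log2_fold_change < -threshold:
        return "down"
    return "ns"


def track_scale_factor(total_reads, target_reads=1_000_000):
    """Factor that rescales coverage to a library of one million reads."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return target_reads / total_reads
```

For example, a library of 20 million reads would be scaled by 0.05 so that tracks from different samples are comparable.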

ChIP-seq, ATAC-seq data processing, and motif analysis

H3K27ac ChIP of case 077 was performed on 10⁵ cells, according to the protocol of Hyperactive In-Situ ChIP Library Prep Kit for Illumina (Vazyme, TD 901). ChIP-seq reads were mapped to the hg19 genome with Bowtie2⁴⁹ (v2.3.5) using default parameters, while ATAC-seq reads were mapped with Bowtie2 using the parameters -X 2000 --no-mixed --no-discordant --no-unal. Aligned reads were filtered for a minimum MAPQ of 20, and duplicates were removed using SAMtools⁵⁰. Signal tracks and peaks were generated by using the --SPMR option in MACS2⁵¹. Then, the UCSC Genome Browser utility bedGraphToBigWig was used with default parameters to transform the bedGraph files to bigWig files. FIMO⁵² was used to detect the 20-bp CTCF motif from the Homer motif database in Loucy and Jurkat CTCF peaks with default parameters.

Hi-C and Hi-C data processing

Hi-C was performed on one million cells/sample, according to the BL-Hi-C protocol²⁷. Raw BL-Hi-C reads were processed by the in-house HiCpipe framework, which integrated several Hi-C analysis methods to generate multiple features of the Hi-C data. In particular, ChIA-PET2⁵³ was used to trim the bridge linkers and HiC-Pro⁵⁴ to align reads, filter artifact fragments, and remove duplicates; Juicer⁹ was applied to the resulting uniquely mapped contacts to generate individual or merged Hi-C files that could be deposited as contact matrices with multiple resolutions. Knight-Ruiz⁵⁵ (KR)-normalized matrices were used in the compartment and TAD analyses.

Compartment and TAD analysis

Compartments were calculated with the eigenvector command of Juicer on 100-kb-resolution KR-normalized Hi-C matrices. Each 100-kb bin was assigned to the A or B compartment by a majority rule requiring agreement in over 70% of samples.
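
The majority rule for a single bin can be sketched as below. This is a hypothetical illustration, not the HiCpipe code: it assumes the eigenvector signs have already been oriented so that positive values mark the A compartment, and that a bin without a >70% majority is left unassigned.

```python
def consensus_compartment(eigen_values, majority=0.70):
    """Assign one 100-kb bin to 'A', 'B', or None from per-sample eigenvector values.

    eigen_values: one first-eigenvector value per sample (None for missing data).
    Assumption: positive values correspond to compartment A in every sample.
    """
    valid = [v for v in eigen_values if v is not None]
    if not valid:
        return None
    frac_a = sum(1 for v in valid if v > 0) / len(valid)
    frac_b = sum(1 for v in valid if v < 0) / len(valid)
    if frac_a > majority:
        return "A"
    if frac_b > majority:
        return "B"
    return None  # no compartment reaches the required majority
```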

TAD boundaries were calculated by the Insulation score method⁵⁶ (with parameters: -is 1000000 -ids 200000 -im mean -nt 0.1) on pooled 40-kb Hi-C matrices of the healthy T-cell controls, ETP, and non-ETP samples. The resulting TAD boundaries were merged and assigned with relative insulation scores of all samples calculated from HiCDB⁵⁷. Differential TAD boundaries were defined with a t-test FDR < 0.01 and a difference between cases and controls higher than 50% quantile of the overall difference.

A TAD was defined when its boundaries were detected in at least two conditions among normal T cells, ETPs, and non-ETPs. The domain score⁵⁸ was calculated in each sample by dividing the intra-TAD interactions with all interactions connected to the corresponding TAD. Differential domain scores were calculated with a t-test FDR < 0.01 and fold change higher than 70% quantile of the overall fold change.
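
As a rough illustration of the domain score, the sketch below divides intra-TAD contacts by all contacts that involve the TAD. The matrix layout, the half-open bin range, and the inclusion of the diagonal are assumptions made for the example; the published implementation may differ in these details.

```python
def domain_score(matrix, start, end):
    """Fraction of a TAD's contacts that stay inside the TAD.

    matrix: symmetric contact matrix (list of lists) for one chromosome.
    start, end: bin range [start, end) covered by the TAD.
    """
    n = len(matrix)
    intra = 0.0
    total = 0.0
    for i in range(start, end):
        for j in range(n):
            if start <= j < end:
                if j >= i:  # count each intra-TAD bin pair once
                    intra += matrix[i][j]
                    total += matrix[i][j]
            else:
                total += matrix[i][j]  # contact between the TAD and the outside
    return intra / total if total else float("nan")


# Toy 4-bin matrix with a TAD spanning bins 0 and 1.
toy = [
    [10, 5, 1, 0],
    [5, 10, 2, 1],
    [1, 2, 10, 4],
    [0, 1, 4, 10],
]
toy_score = domain_score(toy, 0, 2)
```

In the toy matrix, 25 of the 29 contacts that involve the TAD fall inside it, so the score is 25/29.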

Loop detection and differential loop calling

Loops were called by HiCCUPS⁹ at 5- and 10-kb resolutions with default parameters (except -d 15000,20000) for pooled Hi-C matrices of the healthy T-cell controls, ETP and non-ETP samples, respectively. The differential loop detection method was adapted from Phanstiel et al.⁵⁹. Loops were split into two distance ranges (> or <150 kb) to minimize potential bias (Rubin et al.⁶⁰). Differential loops were called within each range (FDR < 0.1) and then combined.
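
The "split by distance, test within each range, then combine" step can be sketched in Python as follows. The per-loop p-values are assumed to come from an upstream test that is not reproduced here, the FDR control is assumed to be Benjamini-Hochberg, and the tuple layout is hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    n = len(pvalues)
    order = sorted(range(n), key=lambda i: pvalues[i])
    adjusted = [0.0] * n
    running_min = 1.0
    for rank in range(n, 0, -1):  # walk from the largest p-value down
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * n / rank)
        adjusted[idx] = running_min
    return adjusted


def call_differential_loops(loops, distance_cutoff=150_000, fdr=0.1):
    """loops: list of (loop_id, anchor1_mid, anchor2_mid, pvalue) tuples.

    FDR correction is done separately for short- and long-range loops,
    and the significant loop ids from both ranges are then combined.
    """
    short = [l for l in loops if abs(l[2] - l[1]) < distance_cutoff]
    long_range = [l for l in loops if abs(l[2] - l[1]) >= distance_cutoff]
    significant = []
    for group in (short, long_range):
        if not group:
            continue
        padj = benjamini_hochberg([l[3] for l in group])
        significant.extend(l[0] for l, q in zip(group, padj) if q < fdr)
    return significant
```

Correcting within each distance range keeps the many short-range loops from dominating the multiple-testing burden of the long-range ones, which is one plausible reading of the bias the text refers to.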

Loop aggregation and functional analysis

Aggregate peak analysis (APA) plots were generated to assess the quality of loop detection and explore the characteristics of different loop classes by the Juicer APA command⁹ under 5-kb resolution. Its output matrix was normalized by the loop number that contributed to the matrix generation. For analysis of the function of dynamic loops between non-ETP and ETP, the loop anchors were analyzed by GREAT⁶¹ (v3.0.0) using the nearest gene within 100 kb to generate the enriched biological process. In other sections, genes related to loops were determined if their promoter (5 kb around TSS) overlapped the loop anchors, and DAVID⁶² 6.8 was used for KEGG pathway enrichment analysis.
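
The promoter-to-anchor assignment can be illustrated with a small interval-overlap sketch. It assumes that "5 kb around TSS" means 5 kb on each side (the text could also mean a 5-kb window in total), and all gene names and coordinates are placeholders.

```python
def genes_at_loop_anchors(tss_by_gene, anchors, flank_bp=5000):
    """Return genes whose promoter window overlaps any loop anchor.

    tss_by_gene: {gene: (chrom, tss_position)}
    anchors: list of (chrom, start, end) with half-open coordinates.
    flank_bp: promoter half-width; 5000 is an assumed reading of the text.
    """
    hits = set()
    for gene, (chrom, tss) in tss_by_gene.items():
        p_start, p_end = max(0, tss - flank_bp), tss + flank_bp
        for a_chrom, a_start, a_end in anchors:
            # Standard half-open interval overlap test on the same chromosome.
            if a_chrom == chrom and p_start < a_end and a_start < p_end:
                hits.add(gene)
                break
    return sorted(hits)


# Placeholder genes and anchors for illustration only.
example_tss = {
    "GENE_A": ("chr7", 27_240_000),
    "GENE_B": ("chr7", 30_000_000),
    "GENE_C": ("chr1", 27_240_000),
}
example_anchors = [("chr7", 27_230_000, 27_236_000)]
example_hits = genes_at_loop_anchors(example_tss, example_anchors)
```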

Visualization and V4C plot generation

Tracks of Hi-C maps and ChIP-seq were generated by pyGenomeTracks⁶³. Hi-C maps of each condition were normalized by its cis interaction pairs. A visual 4C (V4C) plot for specific loci was generated as the interactions related to the corresponding viewpoint under 10-kb resolution.
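
A V4C profile is essentially one row of the contact matrix. The sketch below shows this under the assumption of a dense, already normalized matrix for a single chromosome; the function name and data layout are hypothetical.

```python
def virtual_4c(matrix, viewpoint_bp, resolution=10_000):
    """Return (bin_start_bp, contact_value) pairs for one viewpoint.

    matrix: dense contact matrix (list of lists) for one chromosome.
    viewpoint_bp: genomic position of the viewpoint on that chromosome.
    """
    bin_index = viewpoint_bp // resolution
    if not 0 <= bin_index < len(matrix):
        raise IndexError("viewpoint lies outside the matrix")
    return [(i * resolution, value) for i, value in enumerate(matrix[bin_index])]


toy_matrix = [
    [9, 3, 1],
    [3, 8, 2],
    [1, 2, 7],
]
profile = virtual_4c(toy_matrix, 15_000)  # viewpoint falls in the second bin
```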

Translocation and translocation-mediated loop detection with Hi-C

hic_breakfinder³² was adapted to detect translocations in 18 T-ALL patient samples. After we filtered out the “translocations” also detected in normal controls, the remaining translocations were manually assessed, and the precise breakpoints were determined. As the average depth of patient Hi-C samples is 486 million read pairs and the read length is 150 bp, Hi-C raw data were treated as single ends to refine the breakpoint locations to single base-pair resolution. Any single end that could be mapped to two different chromatin regions without the BL-Hi-C bridge linker in between was defined as a chimeric read. The chimeric reads detected from the BL-Hi-C data overlapped with the aforementioned translocations at 5–20-kb resolution. For each translocation, the exact locations (single base-pair resolution) supported by more than three chimeric reads were identified as the actual breakpoints and are reported in Supplementary Data 4.
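
The read-support filter for breakpoints can be sketched as a simple count over chimeric junctions. The input layout is hypothetical (one junction per chimeric read, already extracted from the alignments), and "more than three" is implemented as at least four reads.

```python
from collections import Counter


def supported_breakpoints(chimeric_junctions, min_reads=4):
    """Keep junctions supported by at least min_reads chimeric reads.

    chimeric_junctions: iterable of ((chrom_a, pos_a), (chrom_b, pos_b)),
    one entry per chimeric read, at single base-pair resolution.
    """
    counts = Counter()
    for end_a, end_b in chimeric_junctions:
        # Sort the two ends so that A-B and B-A reads count as the same junction.
        key = tuple(sorted((end_a, end_b)))
        counts[key] += 1
    return {junction: n for junction, n in counts.items() if n >= min_reads}


# Made-up junctions: the first is seen four times, the second only three times.
reads = (
    [(("chr5", 100), ("chr14", 200))] * 2
    + [(("chr14", 200), ("chr5", 100))] * 2
    + [(("chr1", 50), ("chr2", 60))] * 3
)
kept = supported_breakpoints(reads)
```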

As translocation-mediated loops are hard to identify with tools designed for intrachromosomal loop detection but easy to capture by visualization, their locations were manually recorded on interchromosomal Hi-C maps with the help of Juicebox⁶⁴, an interactive visualization software.

Translocation, translocation-mediated loop annotation and visualization

The nearest genes to translocation breakpoints were determined by BEDTools. Known translocations were collected from refs. ⁶,⁸ and ChimerPub⁶⁵. A translocation was considered novel if any of its breakpoints was not within 100 kb of any known breakpoint. For translocation-mediated loops, the genes with a promoter or gene body overlapping the loop anchors were annotated as the associated genes. A gene near a breakpoint was considered upregulated if its FPKM was greater than one and more than twofold higher than that of the control samples without nearby breakpoints. Hi-C heatmaps and Visual 4C plots of the reassembled chromatin were generated by MATLAB code.
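
The novelty and upregulation rules can be written out as two small predicates. This is a sketch under assumptions: the control level is summarized by the mean FPKM of the control samples (the text does not say which summary was used), and the data structures are hypothetical.

```python
def is_novel(translocation, known_breakpoints, window=100_000):
    """True if any breakpoint of the translocation is not within `window` bp
    of any known breakpoint.

    translocation: ((chrom, pos), (chrom, pos))
    known_breakpoints: list of (chrom, pos)
    """
    def near_known(bp):
        return any(bp[0] == k[0] and abs(bp[1] - k[1]) <= window
                   for k in known_breakpoints)

    return any(not near_known(bp) for bp in translocation)


def is_upregulated(case_fpkm, control_fpkms, min_fpkm=1.0, min_fold=2.0):
    """True if FPKM > 1 and more than twofold above the control mean (assumed summary)."""
    control_mean = sum(control_fpkms) / len(control_fpkms)
    return case_fpkm > min_fpkm and case_fpkm > min_fold * control_mean


# Made-up coordinates for illustration.
known = [("chr1", 1_000_000), ("chr2", 5_000_000)]
```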

Luciferase enhancer assays

Candidate enhancer and negative control regions were amplified by PCR using the primer sets listed in Supplementary Table 1 and cloned into the pGL4.26 vector (Promega), which contains a multiple cloning region for insertion of a response element of interest upstream of a minimal promoter and the firefly luciferase reporter gene. Firefly luciferase constructs and the control Renilla luciferase reporter vector were co-electroporated into Jurkat and Loucy cells with a Celetrix Transfection System Device (Dakewe, CTX-1500A). Electroporation conditions: Vset = 440 V, Tset = 30 ms, Punm = 1n, Tint = 1 ms. Luciferase activity was measured 60 h after electroporation with the Dual-Luciferase® Reporter Assay System (Promega, E1960) on a GloMax® 20/20 Luminometer (Promega).

Statistics

Specific statistical analyses are described in each section. In general, the Wilcoxon rank sum test was employed in R for comparisons of distributions. Survival analysis was performed by a Cox regression model using overall and event-free survival as outcomes. Overall survival was defined as the time from diagnosis to death from any cause. Event-free survival was defined as the time from diagnosis to treatment failure, relapse, or death from any cause. The proportional hazard assumption was tested. Variables tested in the multivariable Cox regression model were sex, age (pediatric vs. adult), white blood cell counts, hemoglobin levels, platelet counts, hepatosplenomegaly, percentage of blasts in the bone marrow, and MRD status. For in-house data, 86 patient samples with RNA-seq data were used for ETP enrichment analyses, of which 38 samples under the age of 40 with outcome and 63 samples with whole-exome sequencing (WES) data were used for survival and mutation analyses, respectively.

Table 1 Univariable and multivariable analysis of overall survival according to HOXA13 or HOXA11-13 expression and select variables.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The raw sequence data of Hi-C, RNA-Seq, ChIP-seq, and ATAC-seq reported in this paper have been deposited in the Genome Sequence Archive (GSA) for human under accession number HRA000113 and Gene Expression Omnibus (GEO) database under the accession number: GSE146901. GEO accession codes of the published data used in this study are as follows: CTCF ChIP-seq of CD4+ T cell and Jurkat cell line, GSE12889; CTCF ChIP-seq of Loucy cell line, GSE123214; sATAC-seq of CD4+ T cell, GSE87254; ATAC-seq of Jurkat cell line, GSE115438; H3K27ac ChIP-seq of CD4+ T cell, GSE122826; H3K27ac ChIP-seq of Jurkat cell line, GSE68978; H3K27ac ChIP-seq of Loucy cell line, GSE74311; RNA-Seq of Loucy cell line, GSE100694; RNA-seq of T-cell development, GSE69239. Graph files for the called peaks have been deposited on UCSC [https://genome.ucsc.edu/s/ChenFengling/TALL]. The remaining data are available within the article, Supplementary Information or available from the authors upon request. Source data are provided with this paper.

Code availability

All code for Hi-C analysis has been deposited on GitHub (https://github.com/ChenFengling/HiCpipe).

References

Belver, L. & Ferrando, A. The genetics and mechanisms of T cell acute lymphoblastic leukaemia. Nat. Rev. Cancer 16, 494–507 (2016).
Hunger, S. P. & Mullighan, C. G. Acute lymphoblastic leukemia in children. N. Engl. J. Med. 373, 1541–1552 (2015).
Coustan-Smith, E. et al. Early T-cell precursor leukaemia: a subtype of very high-risk acute lymphoblastic leukaemia. Lancet Oncol. 10, 147–156 (2009).
Zhang, J. et al. The genetic basis of early T-cell precursor acute lymphoblastic leukaemia. Nature 481, 157–163 (2012).
Wada, H. et al. Adult T-cell progenitors retain myeloid potential. Nature 452, 768–772 (2008).
Liu, Y. et al. The genomic landscape of pediatric and young adult T-lineage acute lymphoblastic leukemia. Nat. Genet. 49, 1211–1218 (2017).
Seki, M. et al. Recurrent SPI1 (PU.1) fusions in high-risk pediatric T cell acute lymphoblastic leukemia. Nat. Genet. 49, 1274–1281 (2017).
Chen, B. et al. Identification of fusion genes and characterization of transcriptome features in T-cell acute lymphoblastic leukemia. Proc. Natl Acad. Sci. U. S. A. 115, 373–378 (2018).
Rao, S. S. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
Bonev, B. et al. Multiscale 3D genome rewiring during mouse neural development. Cell 171, 557–572.e24 (2017).
Johnstone, S. E. et al. Large-scale topological changes restrain malignant progression in colorectal cancer. Cell https://doi.org/10.1016/j.cell.2020.07.030 (2020).
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
Sanborn, A. L. et al. Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes. Proc. Natl Acad. Sci. U. S. A. 112, E6456–E6465 (2015).
Fudenberg, G. et al. Formation of chromosomal domains by loop extrusion. Cell Rep. 15, 2038–2049 (2016).
Dixon, J. R. et al. Chromatin architecture reorganization during stem cell differentiation. Nature 518, 331–336 (2015).
Krijger, P. H. et al. Cell-of-origin-specific 3D genome structure acquired during somatic cell reprogramming. Cell Stem Cell 18, 597–610 (2016).
Kloetgen, A. et al. Three-dimensional chromatin landscapes in T cell acute lymphoblastic leukemia. Nat. Genet. 52, 388–400 (2020).
Spielmann, M., Lupianez, D. G. & Mundlos, S. Structural variation in the 3D genome. Nat. Rev. Genet. 19, 453–467 (2018).
Northcott, P. A. et al. Enhancer hijacking activates GFI1 family oncogenes in medulloblastoma. Nature 511, 428–434 (2014).
Peifer, M. et al. Telomerase activation by genomic rearrangements in high-risk neuroblastoma. Nature 526, 700–704 (2015).
Valentijn, L. J. et al. TERT rearrangements are frequent in neuroblastoma and identify aggressive tumors. Nat. Genet. 47, 1411–1414 (2015).
Zimmerman, M. W. et al. MYC drives a subset of high-risk pediatric neuroblastomas and is activated through mechanisms including enhancer hijacking and focal enhancer amplification. Cancer Discov. 8, 320–335 (2018).
Gryder, B. E. et al. Miswired enhancer logic drives a cancer of the muscle lineage. iScience 23, 101103 (2020).
Groschel, S. et al. A single oncogenic enhancer rearrangement causes concomitant EVI1 and GATA2 deregulation in leukemia. Cell 157, 369–381 (2014).
Weischenfeldt, J. et al. Pan-cancer analysis of somatic copy-number alterations implicates IRS4 and IGF2 in enhancer hijacking. Nat. Genet. 49, 65–74 (2017).
Liang, Z. et al. BL-Hi-C is an efficient and sensitive approach for capturing structural and regulatory chromatin interactions. Nat. Commun. 8, 1622 (2017).
Sawai, C. M. et al. Therapeutic targeting of the cyclin D3:CDK4/6 complex in T cell leukemia. Cancer Cell 22, 452–465 (2012).
Hu, G. et al. Transformation of accessible chromatin and 3D nucleome underlies lineage commitment of early T Cells. Immunity 48, 227–242.e8 (2018).
Casero, D. et al. Long non-coding RNA profiling of human lymphoid progenitor cells reveals transcriptional divergence of B cell and T cell lineages. Nat. Immunol. 16, 1282–1291 (2015).
Lee, J. K., Choi, Y. L., Kwon, M. & Park, P. J. Mechanisms and consequences of cancer genome instability: lessons from genome sequencing studies. Annu Rev. Pathol. Mech. Dis. 11, 283–312 (2016).
Dixon, J. R. et al. Integrative detection and analysis of structural variation in cancer genomes. Nat. Genet. 50, 1388–1398 (2018).
Nuebler, J., Fudenberg, G., Imakaev, M., Abdennur, N. & Mirny, L. A. Chromatin organization by an interplay of loop extrusion and compartmental segregation. Proc. Natl Acad. Sci. U. S. A. 115, E6697–E6706 (2018).
Davidson, I. F. et al. DNA loop extrusion by human cohesin. Science 366, 1338–1345 (2019).
Soulier, J. et al. HOXA genes are included in genetic and biologic networks defining human acute T-cell leukemia (T-ALL). Blood 106, 274–286 (2005).
Petrovic, J. et al. Oncogenic notch promotes long-range regulatory interactions within hyperconnected 3D cliques. Mol. Cell 73, 1174–1190.e12 (2019).
Bernt, K. M. et al. MLL-rearranged leukemia is dependent on aberrant H3K79 methylation by DOT1L. Cancer Cell 20, 66–78 (2011).
Jo, S. Y., Granowicz, E. M., Maillard, I., Thomas, D. & Hess, J. L. Requirement for Dot1l in murine postnatal hematopoiesis and leukemogenesis by MLL translocation. Blood 117, 4759–4768 (2011).
Nguyen, A. T., Taranova, O., He, J. & Zhang, Y. DOT1L, the H3K79 methyltransferase, is required for MLL-AF9-mediated leukemogenesis. Blood 117, 6912–6922 (2011).
Okada, Y. et al. hDOT1L links histone methylation to leukemogenesis. Cell 121, 167–178 (2005).
Matlawska-Wasowska, K. et al. MLL rearrangements impact outcome in HOXA-deregulated T-lineage acute lymphoblastic leukemia: a Children’s Oncology Group Study. Leukemia 30, 1909–1912 (2016).
Bond, J. et al. An early thymic precursor phenotype predicts outcome exclusively in HOXA-overexpressing adult T-cell acute lymphoblastic leukemia: a Group for Research in Adult Acute Lymphoblastic Leukemia study. Haematologica 101, 732–740 (2016).
Barry, E. R., Corry, G. N. & Rasmussen, T. P. Targeting DOT1L action and interactions in leukemia: the role of DOT1L in transformation and development. Expert Opin. Ther. Targets 14, 405–418 (2010).
Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 7, 562–578 (2012).
Anders, S., Pyl, P. T. & Huber, W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2015).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Kuhn, R. M., Haussler, D. & Kent, W. J. The UCSC genome browser and associated tools. Brief. Bioinform 14, 144–161 (2013).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Grant, C. E., Bailey, T. L. & Noble, W. S. FIMO: scanning for occurrences of a given motif. Bioinformatics 27, 1017–1018 (2011).
Li, G., Chen, Y., Snyder, M. P. & Zhang, M. Q. ChIA-PET2: a versatile and flexible pipeline for ChIA-PET data analysis. Nucleic Acids Res. 45, e4 (2017).
Servant, N. et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16, 259 (2015).
Knight, P. A. & Ruiz, D. A fast algorithm for matrix balancing. IMA J. Numer. Anal. 33, 1029–1047 (2013).
Giorgetti, L. et al. Structural organization of the inactive X chromosome in the mouse. Nature 535, 575–579 (2016).
Chen, F., Li, G., Zhang, M. Q. & Chen, Y. HiCDB: a sensitive and robust method for detecting contact domain boundaries. Nucleic Acids Res. 46, 11239–11250 (2018).
Stadhouders, R. et al. Transcription factors orchestrate dynamic interplay between genome topology and gene regulation during cell reprogramming. Nat. Genet. 50, 238–249 (2018).
Phanstiel, D. H. et al. Static and dynamic DNA loops form AP-1-bound activation hubs during macrophage development. Mol. Cell 67, 1037–1048.e6 (2017).
Rubin, A. J. et al. Lineage-specific dynamic and pre-established enhancer-promoter contacts cooperate in terminal differentiation. Nat. Genet. 49, 1522–1528 (2017).
McLean, C. Y. et al. GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 28, 495–501 (2010).
Huang da, W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44–57 (2009).
Ramirez, F. et al. High-resolution TADs reveal DNA sequences underlying genome organization in flies. Nat. Commun. 9, 189 (2018).
Durand, N. C. et al. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 3, 99–101 (2016).
Kim, P. et al. ChimerDB 2.0–a knowledgebase for fusion genes updated. Nucleic Acids Res. 38, D81–D85 (2010).

Acknowledgements

We thank Drs Meng Lv, Yingjun Chang, and Yan Chang for sample collection, and Dr Cheng Li of Peking University for critically reviewing the manuscript. We also thank Yan Liu, Fei Wang, and Xuefang Zhang from the National Center for Protein Sciences Beijing at Peking and Tsinghua Universities for assistance with FACS. This project was supported by the Peking-Tsinghua Center for Life Sciences, Beijing Advanced Innovation Center for Genomics at Peking University for H.W. and the National Natural Science Foundation of China (81602254 for L.Y., 31871343 for Y.C., 31671384 and 81890994 for Y.C. and M.Q.Z.). F.C. and W.W. were supported by the Postdoctoral Fellowship of Peking-Tsinghua Center for Life Sciences.

Author information

Fengling Chen
Present address: Center for Stem Cell Biology and Regenerative Medicine, MOE Key Laboratory of Bioinformatics, Tsinghua University, Beijing, China
Fengling Chen
Present address: Tsinghua-Peking Center for Life Sciences, Beijing, China
Haichuan Zhu
Present address: Institute of Biology and Medicine, College of Life and Health Sciences, Wuhan University of Science and Technology, Hubei, China
Yang Chen
Present address: Department of Biochemistry and Molecular Biology, The State Key Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, School of Basic Medicine, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
These authors contributed equally: Lu Yang, Fengling Chen, Haichuan Zhu
These authors jointly supervised this work: Xiaojun Huang, Michael Q. Zhang, Hong Wu.

Authors and Affiliations

The MOE Key Laboratory of Cell Proliferation and Differentiation, School of Life Sciences, Peking University, Beijing, China
Lu Yang, Haichuan Zhu, Bingjie Dong, Weitao Wang & Hong Wu
Peking-Tsinghua Center for Life Sciences, Beijing, China
Lu Yang, Haichuan Zhu, Bingjie Dong, Weitao Wang, Xiaojun Huang & Hong Wu
MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, Bioinformatics Division, BNRist, Department of Automation, Tsinghua University, Beijing, China
Fengling Chen, Yang Chen, Minglei Shi & Michael Q. Zhang
School of Medicine, Tsinghua University, Beijing, China
Minglei Shi & Michael Q. Zhang
Peking University Institute of Hematology, National Clinical Research Center for Hematologic Disease, Beijing, China
Qian Jiang, Xiaojun Huang & Hong Wu
Department of Pediatrics, Peking University People’s Hospital, Beijing, China
Leping Zhang
Department of Biological Sciences, Center for Systems Biology, The University of Texas, Richardson, TX, USA
Michael Q. Zhang

Contributions

L.Y., H.Z., and H.W. conceived the project; Y.C. designed the Hi-C and RNA-seq experiments; L.Y., H.Z., M.S., and W.W. performed the Hi-C and RNA-seq experiments; F.C. designed the bioinformatic pipelines and performed the Hi-C and RNA-seq integrated analyses, while B.D. conducted the survival analysis. Q.J., L.Z., and X.H. contributed the clinical samples and data. L.Y., F.C., and B.D. generated the figures and tables. L.Y., F.C., and H.W. wrote the manuscript with help from all authors. X.H. was in charge of the clinical study; M.Q.Z. and Y.C. oversaw the bioinformatics analyses, and H.W. supervised the entire project.

Corresponding authors

Correspondence to Xiaojun Huang, Michael Q. Zhang or Hong Wu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Jean-Christophe Andrau, Musa Mhlanga, David Vetrie, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Yang, L., Chen, F., Zhu, H. et al. 3D genome alterations associated with dysregulated HOXA13 expression in high-risk T-lineage acute lymphoblastic leukemia. Nat Commun 12, 3708 (2021). https://doi.org/10.1038/s41467-021-24044-5

Received: 10 March 2020
Accepted: 01 June 2021
Published: 17 June 2021
DOI: https://doi.org/10.1038/s41467-021-24044-5
