Introduction

The ATR-X syndrome (OMIM no. 301040) is a severe X-linked mental retardation (XLMR) syndrome associated with a characteristic facial appearance, urogenital abnormalities and α-thalassemia.1, 2 Mutations in the ATRX gene have been identified as the cause of the ATR-X syndrome,3 other XLMR disorders with similar features but lacking thalassemia,4 and the acquired form of α-thalassemia myelodysplasia syndrome (ATMDS).5, 6, 7, 8

The ATRX gene encodes a chromatin remodeling protein with an ATPase/helicase domain of the SNF2 family and a plant homeodomain (PHD) zinc finger.9 ATRX mutations are predominantly missense mutations with most identified in the PHD (60%) or SNF2 domains (25%).10 While it is presumed that ATRX mutations are associated with reduced function, few studies have addressed structure/function relationships.

Subcellular localization and initial biochemical studies with ATRX indicate that it likely has a repressive role on nuclear processes. It is localized to pericentromeric heterochromatin and can physically interact with proteins that reside within heterochromatin, including HP1 and EZH2.11, 12, 13, 14 The ATRX protein also localizes to PML-NBs where it interacts with Daxx, a transcriptional and apoptotic regulator.12, 15, 16 ATRX is also involved in the regulation of methylation at ribosomal DNA (rDNA) repeats.17 Given that ATRX acts at multiple locations within the nucleus, some mutations are likely to alter subnuclear localization thereby reducing function.

In this regard, immunoblots of protein isolated from a panel of ATR-X patients with unique missense mutations have shown that many samples have reduced ATRX protein levels.13, 18 Nonetheless, other experiments have demonstrated a more direct role of mutations on protein function. Baculovirus expressed ATRX protein containing engineered patient mutations within the SNF2 domain were shown to have attenuated ATPase activity compared to the wild-type (WT) protein.15 Similarly, expression of the PHD domain in bacteria demonstrated that patient mutations impaired DNA-binding activity.18 Collectively, these studies suggest that some mutations may interfere directly with function while others may have an indirect effect through altered protein stability or aberrant subnuclear targeting of the protein.

Here, we utilized GFP–fusion proteins to identify the domains responsible for nuclear localization, heterochromatin binding and subnuclear targeting to the PML-NBs. Moreover, we demonstrate that patient mutations within one domain reduces the efficiency of targeting to PML-NBs.

Materials and methods

Construction of GFP-tagged protein subdomains

The full-length ATRX-EGFP construct was generated by cutting the HA-ATRX-pCAGGS construct19 with XhoI and SstII and ligating to EGFP-C2. EcoRI, SacI-PstI and PstI-SstII fragments were cut from HA-ATRX-pCAGGs and ligated to the EGFP-C2 linearized vector to create the L16, 85/30 and 8R/4 constructs, respectively. The 8R4-EGFPC2 construct was put back in frame by cutting with XhoI and filling the ends with Klenow enzyme. The L3, L5, L6 and L8 constructs were created by cutting 85/30-C2 with SacI-HindIII, EcoRI-PstI, SacI-AflII and HindII-EcoRI, respectively. Construct L9 was generated by deleting the BamHI fragment from the 85/30+8R4 vector. Constructs L18 and L19 were obtained by isolating the BamHI and EcoRI-BamHI fragments from 8R4-C2, respectively and religating downstream of the nuclear localization signal in the linearized L8 construct. We obtained L20 by cutting a SacI fragment from L19 and religating to the EGFP-C3 vector. L24 was generated by removing an AflII-SalI fragment from the L16 construct. Finally, L25 was obtained by cutting the L16 construct with AflII and XhoI, treating with Klenow followed by religation.

Site-directed mutagenesis

Mutant constructs were generated by a PCR-based site-directed mutagenesis method (Stratagene QuickChange kit) using the L16, L2 and L19 GFP-tagged constructs as templates. Template DNA (20 ng) was amplified using the appropriate primers (100 ng each forward and reverse, see Table 1) using the PCR in a 50-μl reaction with the following conditions: 95°C for 30 s, then 95°C for 30 s, 55°C for 1 min and 68°C for 14 min for 12 cycles. The samples were then incubated for 1 h at 37°C with DpnI to digest parental non-mutated double-stranded DNA. The product (1 μl) was transformed in XL-2 Blue competent cells. Plasmid DNA was obtained from the resulting colonies and screened using the new or abolished restriction enzyme sites introduced by silent mutations (see Table 1 below).

Table 1 Primers used to establish ATRX-GFP constructs with patient mutations

Transfections and immunofluorescence

HeLa cells were grown in EMEM supplemented with 10% fetal bovine serum in 5% CO2 at 37°C. Plasmid DNA for each GFP deletion construct was isolated using the Midi-Preparation kit from Qiagen and verified by sequence analysis. Transient transfection was performed using the Lipofectamine 2000 reagent (Life Technologies Inc.) according to the manufacturer's protocol. Depending on the level of cell death observed with some of the constructs, cells were processed after 16–24 h by fixing in cold ethanol/methanol (3:1) and counterstaining with DAPI. Immunofluorescence images were captured using a Zeiss Axioplan 2 microscope outfitted with an AxioCam camera and AxioVision software. For analysis of mutated constructs, at least 300 cells were counted and assessed for localization pattern.

Results

Nuclear localization of ATRX deletion constructs

To identify the domains responsible for nuclear and subnuclear localization we generated a panel of GFP–ATRX fusion constructs. A full-length GFP–ATRX fusion protein recapitulated the staining pattern of the endogenous ATRX protein upon transfection into HeLa cells and demonstrating that the tag does not interfere with localization (Figure 1a). The PredictNLS program (http://cubic.bioc.columbia.edu/predictNLS/) identified putative nuclear localization signals (NLSs) in the ATRX protein sequence encompassing amino acids 685–697, three overlapping sites within 1126–1196, and individual sites at 1420–1426 and 1928–1938. As such, we divided the ATRX gene into three fragments – L16, 85/30 and 8R/4 for a preliminary analysis of nuclear localization. We observed that both L16 and 85/30 independently localized to the nucleus with distinct patterns. The L16-staining pattern was diffuse throughout the nucleus with 6–10 intense staining ‘bundles’ (Figure 1b) consistent with previous reports that this region targets the protein to heterochromatin.13, 20 The 85/30 fusion protein was localized to 20–30 distinct and smaller nuclear speckles, reminiscent of PML nuclear body staining and consistent with previous reports that ATRX localizes to these structures.12, 15, 16 In contrast, the C-terminal fragment 8R/4 was restricted to the cytoplasm in a pattern containing one or two large regions of intense perinuclear staining that most likely represents protein aggregates (Figure 1b). It also suggested that the predicted NLS (1928–1938) located within this fragment does not support nuclear localization on its own. As such, we fused an NLS-containing region of ATRX (924–1214) in front of the C-terminal fragment (construct L18), resulting in a homogeneous nuclear staining pattern. Further reduction of L18 to generate the C-terminal fragment (L19) demonstrated a speckled nuclear staining pattern upon transfection (Figure 1b) indicating that the larger fusion protein may have masked the domain responsible for speckle formation.

Figure 1
figure 1

Subnuclear mapping of ATRX–GFP fusion proteins. (a) Expression of full-length ATRX–GFP fusion protein results in a punctate nuclear-staining pattern that is similar to the endogenous ATRX protein expression profile in HeLa cells. (b) Schematic diagram of the ATRX full-length construct and the various deletion constructs used to identify the domains responsible for the punctate nuclear-staining pattern. The PHD domain is shown in red; the HP1-binding domain is outlined in blue; the putative nuclear localization signal (NLS) sites are shown in gray and the SNF2 domain in yellow. The 85/30 and L19 constructs localized to the nucleus and demonstrated a speckled-staining pattern.

The NLS sequences between 685–697 and 1126–1196 can function independently to target ATRX to the nucleus. The NLS sequences between residues 1420–1426 (within construct L5; Figure 2) and 1928–1938 were unable to support nuclear localization of the fusion protein. Moreover, we defined two regions within ATRX that generate nuclear speckles and could potentially target ATRX to PML nuclear bodies.

Figure 2
figure 2

Deletion mapping of the nuclear speckle targeting domains TD1 and TD2. (a) Schematic representation of the different ATRX-GFP constructs used for transfection. The specific amino acids of each construct are delineated in brackets. The domain color scheme is similar to Figure 1 with the nuclear speckle targeting domains (TD1 and TD2) shown in black. (b) Representative cells showing the typical GFP fluorescence patterns of the constructs depicted in A. DAPI-counterstained nuclei (blue) are also shown.

Refined mapping of the nuclear speckle domains

To refine further the domain within 85/30 that results in speckle formation, we generated additional deletion constructs (Figure 2a). Constructs L3 and L5 did not enter the nucleus (Figure 2b and data not shown), thereby refining the NLS to amino acids 924–1214. The L9 and L2 constructs each retained the nuclear speckle-staining pattern in HeLa cells and in 293 cells (Figure 2b and data not shown). We occasionally observed significant staining in the cytoplasm of HeLa cells with the L2 construct that was not observed in 293 cells (data not shown). However, upon longer incubation before fixation, the cytoplasmic staining in the HeLa cells dissipated suggesting that high expression levels of L2 may overwhelm the transport machinery in this cell type. The L8 and L6 fusion proteins always showed a nuclear-staining pattern with only a few large dots of intense staining. Taken together, these findings demonstrate that the region between amino acids 1498–1651 contains a domain essential for the formation of nuclear speckles, which we named as targeting domain 1 (TD1).

Similarly, we digested the L19 fusion construct with SacI resulting in the removal of the region encoding the last 253 amino acids (L20). The L20 construct demonstrated the nuclear speckle pattern thereby delimiting the C-terminal speckle domain between residues 1965–2239 (TD2). We utilized these smaller constructs to examine whether they targeted the ATRX protein to PML nuclear bodies.

The C-terminal end of ATRX target PML-NBs

The nuclear speckles that we observed with the L2 and L19 constructs are similar in size and frequency to that observed for PML nuclear bodies. Moreover, we and others have shown that ATRX localizes to PML nuclear bodies.12, 15, 16, 21, 22 To determine whether TD1 and TD2 both can target ATRX to PML nuclear bodies, we transfected HeLa cells with the L2 and L19 constructs and examined their colocalization with endogenous PML and Daxx (Figure 3a), or with ectopically expressed PML tagged with the fluorescent DsRed protein (Figure 3b). The L2 fragment showed almost complete exclusion from endogenous or overexpressed PML (Figure 3b and c). In contrast, the L19 fragment always showed a high incidence of colocalization with endogenous or overexpressed PML indicating that the C-terminal end of the ATRX protein confers its ability to associate with PML nuclear bodies (Figure 3b and c).

Figure 3
figure 3

ATRX and the L19 but not the L2 constructs colocalize with promyelocytic leukemia (PML)-NBs. (a) Cells transfected with the L19 or L2 construct (GFP) are shown co-labeled with antibodies to endogenous PML (red). The far right panel is the Merged images showing colocalization of L19 with PML but not the L2 construct. (b) Overexpression of dsRed PML demonstrates significant colocalization with endogenous ATRX (top panel) and the L19 construct (bottom panel) but not the L2 expression plasmid (middle panel).

C-terminal ATRX mutations disrupt the association with PML-NBs

The L19 fragment is large, comprising 534 amino acids and encompassing several functional domains of the ATRX protein including motifs IV–VI of the ATPase domain, the P domain which has weak homology to the SWI/SNF family, and a stretch of glutamine residues named the Q domain.9 A number of naturally occurring ATR-X patient mutations have been identified in the L19 fragment containing the PML-interacting C terminus of the ATRX protein (Figure 4a), which we have renamed targeting domain 2 (TD2). To assess the functional importance of clinical mutations for the interaction of ATRX with PML, we introduced four specific mutations in TD2 in different conserved domains (Figure 4a). Ped 23 (D2035V) and Ped 3 (Y2084H) missense mutations flank motif IV of the ATPase domain. Ped 17 (R2386X) is a nonsense mutation that is located within the P element and truncates the last 106 amino acids of the ATRX protein. The mutation identified in Ped 26 was a 2-kb genomic deletion that helped localize the ATRX gene.23 The deletion removed the acceptor splice site and the first 2 kb of the last exon and we recapitulated that deletion by creating a nonsense mutation (L2401X) that deletes the remaining 91 amino acids.

Figure 4
figure 4

Patient mutations within the L19 construct lead to cells with either a homogenous or speckled nuclear-staining pattern. (a) Schematic diagram of the L19 construct showing the location of the different ATRX patient mutations generated. (b) Representative transfected cells showing either the homogeneous or speckled-staining patterns for the different L19 mutant constructs. The second panel depicts promyelocytic leukemia (PML) staining for the transfected cell, the third panel is the merged image and the far panel depicts the DAPI counterstaining.

Transfection of each mutated L19 construct was analyzed for speckle formation by fluorescence microscopy (Figure 4b). While all cells transfected with L19 displayed a speckled pattern of GFP fluorescence, this was not the case for the mutated TD2 constructs which primarily presented with a uniform nuclear-staining pattern (Figure 4b). To quantify this result 300 transfected cells (from five independent transfections) for each mutant construct were scored for uniform or speckled pattern. Overall, the uniform pattern of localization was observed in approximately 83% of the cells whereas only 17% of the cells transfected with mutant constructs had the characteristic speckled pattern (Table 2), indicating that these mutations interfere with localization to PML nuclear bodies or, alternatively, affect retention of the protein within these domains. Nonetheless, the dramatic effect of ATR-X clinical mutations on PML targeting suggests that these interactions are crucial for the normal function of ATRX in the nucleus.

Table 2 Frequency of punctate and uniform staining patterns of GFP-ATRX fusion proteins

To determine whether the changes in subnuclear localization were not restricted to the truncated ATRX proteins we introduced two patient mutations into the full-length GFP-ATRX construct, namely the Ped 23 and a PHD zinc-finger mutation that should not affect localization to the PML nuclear body. In addition, we generated a construct containing a mutation in the active site of the ATPase domain (K1600R; named Mu) that we have previously shown to represent an ATPase-dead mutation.15 As shown in Figure 5 and Table 2, we observed a predominant speckled pattern in cells transfected for the WT and PHD mutant constructs, whereas there was an increased frequency of cells with a uniform staining pattern when the Ped 23 and Mu mutants were introduced. Taken together, these results suggest that the SNF2 domain may have a role, in addition to nucleosome remodeling, in facilitating targeting to the PML-NB.

Figure 5
figure 5

Staining pattern of full-length ATRX-GFP constructs. A wild type (WT) ATRX-GFP expression plasmid or constructs with an engineered mutation in either the plant homeodomain (PHD) (PHD), the SNF domain (Ped 23), or an ATPase dead mutant (Mu) were transfected into HeLa cells and scored for nuclear speckle formation. Representative cells are shown and the relative frequencies are quantitated in Table 2.

Discussion

Defective chromatin remodeling is now recognized as the cause of numerous mental retardation disorders, including the ATR-X syndrome.24, 25 For a few syndromes, namely Rett syndrome and Rubinstein-Taybi syndrome, specific neuronal target genes have been identified and their dysregulation shown to contribute to disease pathophysiology.26, 27, 28, 29, 30 Despite these successes, our understanding of the developmental role of XLMR-associated chromatin remodeling proteins is limited. In this regard, ATRX is a prime example. It localizes to pericentromeric heterochromatin and PML-NBs and it can physically interact with heterochromatin protein 1 or with Daxx in a protein complex.11, 12, 13, 15, 16, 31 However, it is not clear what the functional implications of these interactions are and, more importantly, how mutations affect these interactions or lead to deficits in neural development. The goal in the present study was to identify the domains responsible for nuclear and subnuclear localization and to determine whether patient mutations interfered with targeting.

The ATRX protein contains four regions that fulfill the criteria for an NLS motif (amino acids 685–697, three motifs between amino acids 1126–1196, amino acids 1420–1426 and residues 1928–1938). We demonstrated that two of these regions could function in isolation to target GFP fusion proteins to the nucleus. L16 which contained the first putative NLS located at amino acids 685–697 was correctly targeted to the nucleus. The 85/30 construct contained two putative NLS sequences but only the one within L8, at residues 1126–1196, conferred nuclear localization to GFP. Moreover, this fragment could localize other C-terminal ATRX fragments to the nucleus. There are no reported mutations in the two functional NLS sequences we describe here or in the other putative NLS regions. While NLS mutations may be identified as more patients are characterized, it is equally probable that the outcome of such a mutation is a lethal phenotype since ATRX ablation in mice results in early embryonic lethality.32

Two different regions of the ATRX protein, TD1 and TD2 resulted in a punctate nuclear-staining pattern, yet only TD2 colocalized with PML nuclear bodies. The TD1 containing construct L2 encompasses motifs I and Ia of the SNF2 domain and contains the Daxx interaction domain.9, 15 While we previously demonstrated that the Daxx interaction domain could target ATRX to PML-NBs this was not the case with L2 in the present study. The difference between the two studies was that in Tang et al15 we overexpressed both PML and Daxx in addition to the ATRX fragment (F14). When these experiments were repeated using endogenous levels of PML and Daxx we did not observe localization of F14 to PML-NBs; such localization required PML and Daxx overexpression (data not shown). Importantly, our present results are consistent with findings for ATRXt, an endogenous ATRX isoform that truncates prior to the SNF2 domain and does not associate with PML-NBs.33 Nonetheless, some controversy remains as a different study has shown that Daxx is necessary to recruit ATRX to PML-NBs in mouse fibroblasts.12 While the nature of the L2 speckles remains to be determined they could represent genes targeted for transcriptional regulation by a Daxx/ATRX complex. Alternatively, it could be a cell type specific difference that promotes ATRX targeting to PML-NBs. Indeed, a recent study has shown that MeCP2 is necessary to target ATRX to heterochromatin in mature but not immature neurons.34

In contrast, the L19 construct did colocalize with PML-NBs suggesting that the TD2 domain is sufficient for localization of the ATRX protein to this subnuclear structure. These findings indicate that ATRX could bind directly to other components of PML-NBs, distinct from Daxx, through the TD2 region for targeting to these foci. The TD2 region also overlaps with the MeCP2 interaction domain suggesting that binding of different factors to this domain may determine the subnuclear localization of ATRX.34

Interestingly, patient mutations resulted in fewer cells with PML colocalization suggesting that alteration of the TD2 domain impairs targeting efficiency or retention within this structure. How do patient mutations interfere with localization to the PML-NBs and how could this lead to mental retardation? The Ped3 and Ped23 mutations were previously shown to attenuate ATPase activity15 and they showed reduced targeting to PML-NBs when introduced into the L19 construct or within a full-length ATRX–GFP fusion protein. Similarly, the ATPase dead mutant (Mu) showed a reduced number of cells with speckle formation. This was not simply an effect of mutating ATRX since the PHD mutant had no effect on nuclear speckle formation. The Mu mutation lies outside the TD2 domain within ATPase motif I of the SNF2 domain thereby suggesting that chromatin-remodeling activity may also be important for PML-NB localization/retention. Indeed, it has been postulated that the association of satellite DNA with ATRX, Daxx and HP1 within PML-NBs at the G2 phase may facilitate re-establishment of condensed heterochromatin at satellite DNA.35 Certainly, one could speculate that proper targeting of ATRX to PML-NBs is essential for ATRX-directed chromatin remodeling of specific target genes tagged for silencing during neural development. Mutations located within the SNF2 domain may attenuate (but not abolish) the chromatin-remodeling activity and/or the localization to the PML-NB such that the ATRX-specific target genes are derepressed. Certainly, the inappropriate regulation of gene expression in the nervous system during development could result in a mental retardation phenotype and thus, the identification of such targets remains a high priority of investigation.