Structural basis for ELL2 and AFF4 activation of HIV-1 proviral transcription

Qi, Shiqian; Li, Zichong; Schulze-Gahmen, Ursula; Stjepanovic, Goran; Zhou, Qiang; Hurley, James H.

doi:10.1038/ncomms14076

Article
Open access
Published: 30 January 2017

Shiqian Qi^1,2,
Zichong Li²,
Ursula Schulze-Gahmen²,
Goran Stjepanovic ORCID: orcid.org/0000-0002-4841-9949^2,3,
Qiang Zhou² &
…
James H. Hurley^2,3

Nature Communications volume 8, Article number: 14076 (2017)

Abstract

The intrinsically disordered scaffold proteins AFF1/4 and the transcription elongation factors ELL1/2 are core components of the super elongation complex required for HIV-1 proviral transcription. Here we report the 2.0-Å resolution crystal structure of the human ELL2 C-terminal domain bound to its 50-residue binding site on AFF4, the ELLBow. The ELL2 domain has the same arch-shaped fold as the tight junction protein occludin. The ELLBow consists of an N-terminal helix followed by an extended hairpin that we refer to as the elbow joint, and occupies most of the concave surface of ELL2. This surface is important for the ability of ELL2 to promote HIV-1 Tat-mediated proviral transcription. The AFF4–ELL2 interface is imperfectly packed, leaving a cavity suggestive of a potential binding site for transcription-promoting small molecules.

Introduction

Curing AIDS is a major global health goal. AIDS is caused by the human immunodeficiency virus (HIV), which has proved exceptionally difficult to eradicate^1,2. The principal obstacle to HIV eradication is the persistence in patients of a reservoir of cells harbouring latent provirus integrated within the genome³. Clinical interest in the reactivation of latent HIV^1,2 has brought renewed attention to the mechanism of transcriptional regulation of the HIV provirus. Latency is regulated at the levels of epigenetic silencing, and transcription initiation and elongation⁴. Transcription elongation, which is promoted by the HIV Tat protein and TAR RNA sequence, is the best understood of these mechanisms. The functions of HIV Tat and TAR in promoting elongation are completely dependent on their ability to hijack the host super elongation complex (SEC)^5,6,7.

The SEC consists of the Cyclin-dependent kinase CDK9 and Cyclin T (CycT1 or T2), together known as P-TEFb⁸; one of either of the intrinsically disordered (ID) scaffold proteins AFF1 or AFF4 (refs 5, 6); one of either ENL or AF9; and one of either of the RNA polymerase elongation factors ELL1 or ELL2 (refs 5, 9, 10). The reason that Tat is such a powerful activator of HIV-1 transcription lies in its ability to pack two distinct transcriptional elongation factors P-TEFb and ELL1/2 into a single SEC complex, where the two factors can synergistically stimulate a single RNA Pol II elongation complex^5,7. AFF1/4 is >1,100 residues long and is the principal scaffold that holds the SEC together¹¹. AFF1/4 consists almost entirely of predicted ID regions (IDRs). AFF1 and AFF4 function in transcription elongation by virtue of various peptide motifs interspersed throughout their sequences, much like many other ID signalling and regulatory proteins that have come under intensive study^12,13. The AFF1- and ELL2-containing version of the SEC is the most important in the promotion of proviral elongation, despite its low abundance¹⁴.

The structure of P-TEFb lacking the C-terminal IDR of CycT1 has been determined in complex with HIV-1 Tat¹⁵ and the N-terminal 60 residues of AFF4 (refs 16, 17). This structure shows that AFF4 residues 32–67 bind as an extended strand followed by two α-helices to the CycT1 surface. Nuclear magnetic resonance studies showed that AFF4 residues 761–774 fold into a β-strand that combines with two strands of the AF9 AFF4-binding domain to generate a three-stranded β-sheet¹⁸. The structures of the P-TEFb and AF9 complexes with AFF4 revealed two of the three known interfaces used by AFF4 in assembly of the SEC. In this study, we set out to visualize the last of the three known interfaces critical for AFF4 function, its binding site for ELL1/2.

Progress in characterizing the AFF4 interface with ELL2 has been slower than for the P-TEFb and ENL/AF9 interfaces, in part because the AFF4-binding domain of ELL2 is poorly soluble and prone to aggregation. Here we work with a fusion construct in which ELL2 and AFF4 form an obligate tethered complex that is stable and soluble enough to be crystallized. The crystal structure confirms that the AFF4-binding domain of ELL2 has an occludin fold, as predicted from the sequence homology¹⁹. It shows that the IDR consisting of AFF4 residues 301–351 (hereafter referred to as AFF4^ELLBow for ELL1/2 binding) folds up into a helix and elbow joint arrangement that makes extensive contacts with the occludin domain of ELL2 (hereafter ELL2^Occ). These results complete the structural picture of how AFF1/4 engages its three known partners in the SEC.

Results

Mapping the AFF4^ELLBow and ELL2^Occ interaction

Following the initial mapping of the AFF4 and ELL2 interaction sites to approximately residues 318–337 of the former and 519–640 of the latter²⁰ (Fig. 1a), we sought to isolate a stable form of this monomeric complex for crystallization (Supplementary Fig. 1a). It was difficult to obtain diffraction-quality crystals of ELL2^Occ constructs with AFF4^ELLBow fragments because of the propensity of the ELL2 fragment to aggregate over time. We reasoned that fusion of AFF4^ELLBow and ELL2^Occ fragments might protect the AFF4-binding epitope on ELL2^Occ from aggregation. Constructs were generated for both AFF4^ELLBow–(Gly-Ser)₄–ELL2^Occ and ELL2^Occ–(Gly-Ser)₄–AFF4^ELLBow. The ELL2^Occ–(Gly-Ser)₄–AFF4^ELLBow dimerized in solution, while AFF4^ELLBow–(Gly-Ser)₄–ELL2^Occ was monomeric (Supplementary Fig. 1b). Given that the unfused fragments were monomeric, we concluded that the dimerization of ELL2^Occ–(Gly-Ser)₄–AFF4^ELLBow represented a domain-swapping artifact (Supplementary Fig. 1c) and focused efforts on AFF4^ELLBow–(Gly-Ser)₄–ELL2^Occ.

**Figure 1: Crystal structure of the AFF4 ELLBow in complex with the occludin homology domain of ELL2.**

Structure of the AFF4^ELLBow complex with ELL2^Occ

The structure of the AFF4^ELLBow–ELL2^Occ complex was determined by selenomethionyl (SeMet) multiwavelength anomalous dispersion (MAD) phasing (Fig. 1b; Supplementary Fig. 2; Table 1). Helix α1 (residues 538–578) of ELL2 bends inward at Tyr552 by 30° such that the C-terminal part of α1 (553–578) packs against α2 (Fig. 1c). Helices α2 (584–602) and α3 (607–638) of ELL2 are oriented at an angle of ∼100° with respect to each other such that both pack along the length of the long, bent helix α1 (Fig. 1c). The structure confirms that ELL2^Occ has an arch-shaped three-helix fold similar to that of the C-terminal domain of occludin¹⁹. The ELL2^Occ and occludin C-terminal domain (PDB entry 1XAW) structures can be superimposed with a root-mean-square deviation of 4.0 Å for 104 residue pairs (Fig. 1d). The main differences are in the α2–α3 connector and in the mutual orientation of these two helices. The α2–α3 angle is steeper in ELL2^Occ than in occludin. A minor difference is that ELL2^Occ has an extra single-turn helix, denoted α0, at its N terminus.

Table 1 Statistics of crystallographic data reduction and refinement.

AFF4^ELLBow is ordered over residues 314–349 and buries 1,535 Å² of solvent-accessible surface area. Fully 37% of the entire solvent-accessible surface area of AFF4^ELLBow is buried in the contact. The AFF4^ELLBow sequence folds into several distinct regions. It begins with helix α1 (315–324), is followed by an extended hydrophobic sequence (325–327), a polyproline segment (328–330), an extended region that doubles back on itself in what we refer to as the ELLBow joint (331–343), and a second extended hydrophobic sequence (344–349) (Fig. 2a). The fusion construct contains 17 residues that are not visualized in electron density. These include AFF4 350–351, followed by 8 Gly-Ser linker residues and ELL2 residues 519–524. These 17 residues are more than adequate to span the 15 Å gap between the carbonyl carbon of AFF4 residue 349 and the amide nitrogen of ELL2 residue 525 in the structure. Hydrophobic side chains of AFF4^ELLBow α1, including Val316, Ile319, Leu320 and Met323, are buried in a hydrophobic groove formed by the C-terminal half of ELL2^Occ α1 and α2 (Fig. 2b). These helices of ELL2^Occ contribute residues Val565, Phe569, Ile570, Leu572, Asp573, Val589, His590, Tyr596, Leu594 and Ile599 to the AFF4 α1-binding site (Fig. 2b). ELL2^Occ buries 1,315 Å² of solvent-accessible surface area, corresponding to 15% of its total surface area.
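The statement that the unseen residues can comfortably bridge the gap can be checked with a back-of-envelope calculation. The sketch below assumes a rise of about 3.5 Å per residue for a fully extended polypeptide, which is a textbook approximation rather than a value from this study; the residue count and the 15 Å gap are the figures quoted in the text.

```python
# Rough check that the disordered tether can bridge the gap seen in the crystal.
# ASSUMPTION: ~3.5 Å per residue for a fully extended chain (textbook
# approximation; not a value reported in this study).
RISE_PER_RESIDUE_A = 3.5


def max_span(n_residues, rise=RISE_PER_RESIDUE_A):
    """Upper bound on the end-to-end distance (Å) of an extended chain."""
    return n_residues * rise


unseen_residues = 17  # disordered residues quoted in the text
gap_a = 15.0          # Å, AFF4 residue 349 carbonyl C to ELL2 residue 525 amide N

span = max_span(unseen_residues)
print(f"extended span ~{span:.1f} Å vs gap {gap_a:.1f} Å "
      f"({span / gap_a:.1f}x longer)")
```

Under this assumption the tether could reach roughly four times the required distance, so the linker is unlikely to strain the observed interface.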

**Figure 2: AFF4^ELLBow–ELL2^Occ interaction surfaces.**

AFF4^ELLBow is centred on Trp327, which forms extensive hydrophobic interactions with the side chains of ELL2 residues His559, Met562, Cys614 and Glu615. The Trp327 indole nitrogen also forms a water-mediated hydrogen bond with the Tyr607 hydroxyl. This cluster of residues is completed by the side chains of AFF4 Pro328, Phe345 and Phe347 (Fig. 2c). Collectively, this cluster forms an extensive interaction network, in which AFF4^ELLBow folds up not only against ELL2 but also against itself.

In the AFF4^ELLBow joint, the side chain of Leu331 inserts into a pocket comprising Tyr552, Tyr555, His618, Leu621 and Ala622 of the N-terminal half of ELL2^Occ α1 and α3. The side chain of Ile334 packs against the side chains of Lys545, Phe547, Lys625 and Leu628. At the distal end of the ELLBow joint, Pro342 falls into a shallow cavity composed of Ala622, Lys625 and Arg626 (Fig. 2d).

A number of hydrogen bonds are observed in the complex (Fig. 3a). In AFF4^ELLBow α1, the side chain of Asp317 forms a bidentate salt bridge with the side chain of Arg576 of ELL2 (Fig. 3b). Glu322 of AFF4 forms a 2.8 Å salt bridge with one of the two observed rotamers of His608 of ELL2 (Fig. 3b). In the central cluster, the carbonyl group of Pro328 forms a 2.7 Å hydrogen bond with the side chain of His559 of ELL2 (Fig. 3c). Moving into the ELLBow joint, the main-chain amide and carbonyl of AFF4 Leu331 form hydrogen bonds with the hydroxyl oxygens of Tyr552 and Tyr555, respectively, of ELL2. A 2.6 Å hydrogen bond is formed between Thr332 of AFF4 and Lys625 of ELL2 (Fig. 3d). The Ile334 carbonyl accepts a hydrogen bond from the side chain of Lys545. The Cys338 main-chain amide donates a hydrogen bond to the side chain of Asp632. The main-chain amide of Phe345 forms a 2.9 Å hydrogen bond with the side chain of Asn619 (Fig. 3d).

**Figure 3: Hydrogen bonding in the AFF4^ELLBow–ELL2^Occ complex.**

The AFF4^ELLBow–ELL2^Occ complex was screened for cavities using POCASA 1.1 (POcket-CAvity Search Application)²¹ with a probe radius of 3 Å. Of the five largest cavities located, one is an internal cavity at the AFF4^ELLBow–ELL2^Occ interface (Fig. 4a). The cavity is 36 Å³ in volume and is connected to the exterior by a narrow mouth (Fig. 4a). It is lined by the aliphatic part of Glu322, Met323, His325, Trp327, Phe347 and Pro348 of AFF4, and by Met562, Ala566, Tyr607 and the aliphatic part of Lys611 of ELL2 (Fig. 4b). These residues are in or adjoin the central cluster part of the interface.

**Figure 4: Cavity at the AFF4^ELLBow–ELL2^Occ interface.**

Function of the AFF4^ELLBow interface with ELL2^Occ in binding

To validate whether the observed structural interface corresponded to the mode of binding of AFF4^ELLBow and ELL2^Occ in solution, we carried out a series of mutant peptide binding assays using fluorescence polarization. We considered this particularly critical given the use of the fusion construct to obtain crystals. The assay monitored the displacement of fluorescently labelled wild-type AFF1^ELLBow peptide by unlabelled mutant peptides 301–351. The unlabelled wild-type peptide in this system has K_d=86 nM (Supplementary Table 1; Fig. 5a). The AFF4 hydrophobic residues Val316, Ile319, Leu320, Met323, Trp327, Leu331, Ile334 and Pro342 were mutated to Asp to maximally destabilize hydrophobic interactions. Consistent with expectation, mutation of multiple hydrophobic residues to Asp resulted in large decreases in affinity. The double mutant I319D/L320D reduced affinity by >25-fold (Supplementary Table 1; Fig. 5a). The K_d for the triple mutant I319D/L320D/M323D was immeasurable due to weak binding, but >3 μM, representing a loss of affinity of at least ∼35-fold relative to the 86 nM wild-type value (Supplementary Table 1; Fig. 5a). The same was true of two other triple hydrophobic mutants tested, M323D/L331D/I334D and W327D/L331D/I334D (Supplementary Table 1; Fig. 5b). The single mutant M323D has the largest effect of any single-amino-acid change, with a reduction in affinity of >25-fold (Supplementary Table 1; Fig. 5b). Moving closer to the centre of the AFF4^ELLBow, L331D and I334D reduce affinity by ∼20- and 8-fold, respectively (Supplementary Table 1; Fig. 5b). This highlights the role of hydrophobic residues in AFF4^ELLBow helix α1, and in the ELLBow joint immediately C terminal to the central cluster, as the critical anchor points and affinity determinants.

**Figure 5: Contributions of AFF4 ELLBow interactions to binding in solution.**

Hydrophobic residues of the central cluster make smaller contributions than those highlighted above. W327D reduces affinity fourfold, while F345D/F347D reduces it by less than twofold. P342D led to a similar threefold drop (Supplementary Table 1; Fig. 5b). These more modest contributions may reflect that these side chains are partially solvent accessible in the AFF4^ELLBow–ELL2^Occ complex. Moreover, their interactions are made in part with other residues within the AFF4^ELLBow such that they could potentially make residual hydrophobic interactions even in unbound AFF4. The polyproline helix does not seem to have a major role in affinity, with the double 328–329 Pro–Gly mutant reducing affinity only by a factor of three (Supplementary Table 1; Fig. 5b).

The interface has a significant polar component, with some hydrophilic residues contributing substantially to binding, and others less so. The AFF4^ELLBow α1 mutant D317P/E318P was designed to disrupt hydrogen bonding involving Asp317 and to introduce helix-breaking residues in α1. This mutation lowered affinity by 10-fold (Supplementary Table 1; Fig. 5a). The charge reversal mutation E322H reduced affinity by less than twofold (Supplementary Table 1; Fig. 5a).
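For readers who prefer free energies to fold changes, the losses in affinity quoted above can be converted with the standard relation ΔΔG = RT ln(K_d,mutant/K_d,wild type). The following is a minimal sketch; the temperature of 25 °C is an assumption, since the assay temperature is not stated here, and the function name is illustrative.

```python
import math

R_KCAL = 1.987e-3  # gas constant, kcal mol^-1 K^-1
TEMP_K = 298.15    # ASSUMPTION: 25 °C; the assay temperature is not stated


def ddg_from_fold(fold_loss, temp_k=TEMP_K):
    """Binding free-energy penalty (kcal/mol) for a given fold loss in affinity."""
    if fold_loss <= 0:
        raise ValueError("fold_loss must be positive")
    return R_KCAL * temp_k * math.log(fold_loss)


# Approximate fold losses quoted in the text for single AFF4 mutants.
fold_losses = {"L331D": 20.0, "I334D": 8.0, "W327D": 4.0}
for name, fold in fold_losses.items():
    print(f"{name}: {fold:.0f}-fold -> ddG ~ {ddg_from_fold(fold):.2f} kcal/mol")
```

Because the relation is logarithmic, a 20-fold loss corresponds to only about twice the free-energy penalty of a 4-fold loss, which is worth keeping in mind when comparing anchor residues with the central cluster.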

It proved impossible to purify hydrophobic to Asp mutants in the AFF4-binding site of ELL2^Occ because these proteins were insoluble when expressed in Escherichia coli. Presumably, this is because these hydrophobic residues also contribute to the hydrophobic core of the ELL2^Occ fold. It was, however, possible to purify ELL2^Occ polar mutants in the binding site. We examined the roles of ELL2 His559, His608, Asn619 and Lys625 by pull-down assay (Fig. 5c). Single mutants H559E, H608E, N619A and K625T had no apparent effect on binding by pull-down. However, the quadruple mutant H559E/H608E/N619A/K625T completely abrogated binding both in the pull-down assay (Fig. 5c) and in a fluorescence polarization binding assay (Fig. 5d). This validates the role of these residues in the interface in solution.

The AFF4^ELLBow and ELL2^Occ interface in vivo

It had previously been shown that the AFF4 sequence 318–337 was sufficient for ELL2 binding²⁰ and that AFF4 can heterooligomerize with AFF1 via its C-terminal domain²². To prevent the endogenous AFF1 from rescuing the mutant construct, function was tested in the context of a deletion of the C-terminal sequence 970–1,163. Double deletion of AFF4 residues 318–337 and 970–1,163 abrogated the interaction between AFF4 and ELL1/2 completely (Fig. 6a). To determine if single residues within AFF4^ELLBow contributed to binding and function in cells, point mutants were constructed in the context of AFF4 Δ970–1,163. ELL1 contains a C-terminal domain homologous to that of ELL2, hence binding to ELL1 was also tested. L320D was most effective, blocking both ELL1 and ELL2, consistent with its very strong effect on binding in vitro (Fig. 6a; Supplementary Table 1). E322H, P329G and I334D partially blocked ELL2 binding, but completely knocked out ELL1 binding, consistent with their intermediate effects on in vitro peptide binding. Both ELL1 and ELL2 bound robustly to the mutants P342D, F345D and F347D, consistent with their two- to threefold effects on binding in vitro (Fig. 6a; Supplementary Table 1).

**Figure 6: Role of the ELLBow in AFF4 interactions with ELL1/2 in nuclear extracts.**

To determine if the AFF4-binding site on ELL2 was functional in cells, polar mutants were inserted into ELL2 alleles and these were transfected into HeLa cells. We avoided testing hydrophobic mutants of ELL2, since we had previously found that these destabilized the ELL2 structure. HA-tagged ELL2^H559E/H608E and ELL2^{H559E/H608E/N619A/K625T} were expressed at essentially wild-type levels in HeLa cells (Fig. 6b). Wild-type HA-ELL2 pulled down AFF1, AFF4 and ELL1 from extracts. ELL2^H559E/H608E has sharply reduced binding to AFF1, AFF4 and ELL1. ELL2^{H559E/H608E/N619A/K625T} has only trace binding to AFF1 and AFF4 in extracts. These findings support the conclusion that the structural interface is responsible for the interaction of ELL2 with both AFF1 and AFF4 in cells.

Role of the interface in proviral transactivation

Overexpression of AFF4 stimulates proviral transcription by ∼5–9-fold and ∼26-fold in HEK 293T and HeLa cells, respectively (Fig. 7a). Deletion of the C-terminal domain (residues 970–1,163) almost completely blocked transactivation (Fig. 7a). The residual activity of AFF4 Δ970–1,163 was so low that meaningful results could not be obtained for the transactivation phenotypes of the AFF4 point mutants made in this background (Fig. 7a). The abundance of the SEC complex appears to be limiting for transactivation such that overexpression of ELL2 in the presence of extra AFF4 promotes transcription by a factor of 14 (Fig. 7b). Polar mutants in the AFF4-binding site of ELL2^Occ were tested for their effects on transcription. ELL2^H559E/H608E and ELL2^{H559E/H608E/N619A/K625T} had threefold and fivefold less transactivation activity, respectively, than wild type. Very similar three- to fourfold effects are seen in Jurkat 2D10 cells (Fig. 7c). These observations strongly support a functional role for the AFF4^ELLBow-binding site on ELL2^Occ in transactivation.

**Figure 7: The ELLBow-binding site of ELL2 is important for HIV-1 long terminal repeat (LTR) transcription.**

Discussion

The crystallization of the AFF4^ELLBow–ELL2^Occ complex rounds out our structural-level understanding of how the AFF4 scaffold recruits its three known partners in the SEC, P-TEFb, ENL/AF9 and ELL1/2. The limited solubility of ELL2^Occ made this a more challenging target for crystallization, hence the necessity for the fusion approach. When using protein chimeras as a basis for structure solution, it is particularly critical to validate the findings in solution and in functional assays. Binding assays in vitro, pull-downs from nuclear extracts, and proviral transactivation assays present a unified, consistent picture that validates the structural results. One area that remains to be further explored is the relationship between ELL1/2 binding to AFF1/4 and the putative heterodimerization mediated by the C-terminal domains of AFF1/4.

The structure confirms the decade-old prediction that the C-terminal domains of ELL1/2 would have the same fold as the occludin ZO-1-binding domain. Occludin is a transmembrane tight junction protein that has no known involvement in transcription. It is not clear why this protein and ELL1/2 should share a domain uniquely present in this small set of otherwise unrelated proteins. In the initial analysis of the occludin structure, it was proposed that another tight junction protein, ZO-1, bound to a basic patch at the concave centre of the arch¹⁹. This patch of occludin includes Lys504 and Lys511, which correspond structurally to the functionally important His618 and Lys625 in the AFF1/4-binding site of ELL2 (Fig. 1e). Subsequently, another report proposed that ZO-1 bound elsewhere, at one tip of the occludin domain arch. Despite these uncertainties, the structural similarities are extensive enough to suggest a common evolutionary origin and related protein-binding functions for the three-helical domains of occludin and ELL1/2.

The bromodomain and extraterminal protein inhibitor JQ1 (ref. 23) and related latency-reversing agents promote reactivation of HIV-1 from latency via P-TEFb^{24,25,26,27,28}. New classes of HIV-1 latency-reversing agents are being sought in the context of HIV eradication strategies². We observed a cavity at the AFF4–ELL2 interface that appears likely to be present also in the AFF1–ELL2 complex relevant to proviral activation¹⁴, on the basis of the complete identity of the AFF1 and AFF4 residues involved. If so, this could provide an avenue for the design of new SEC activators with JQ1-like effects on latency, but acting by an orthogonal molecular mechanism.

Methods

Cloning and protein purification

DNAs for ELL2 fragments and AFF4–ELL2 fusions were subcloned into pGST-parallel2, and DNAs for AFF4 peptide fragments were subcloned into pRSFduet-1 and pHis-parallel2. Plasmids expressing FLAG-tagged wild-type AFF4 and HA-tagged wild-type ELL2 were generated using the primers described in Supplementary Table 2 (ref. 5). The plasmids expressing mutant versions of AFF4 and ELL2 were generated by PCR mutagenesis. The mutant constructs were verified by DNA sequencing. All proteins were expressed in E. coli BL21-gold (DE3) cells (Agilent Technologies). After induction with 0.2 mM isopropyl-β-D-thiogalactoside overnight at 16 °C, the cells were pelleted by centrifugation at 4,000g for 10 min. Cell pellets were lysed in 25 mM Tris-HCl pH 8.0, 150 mM NaCl, 0.5 mM TCEP-HCl and 1 mM phenylmethylsulphonyl fluoride by ultrasonication. The lysates were centrifuged at 25,000g for 1 h at 4 °C. The supernatants for ELL2 and its fusions were loaded onto GS4B resin at 4 °C, target proteins were eluted and the eluate applied to a HiTrap Q HP column. Peak fractions were collected and digested with tobacco etch virus protease at 4 °C overnight. Tobacco etch virus protease and GST were removed by loading the solution onto Ni-NTA and GS4B columns, respectively. Target proteins were further purified on a Superdex 200 16/60 column equilibrated with 25 mM Tris-HCl pH 8.0, 150 mM NaCl and 0.5 mM TCEP-HCl. The peak fractions were collected and flash-frozen in liquid N₂ for storage. The supernatant of AFF4 was loaded onto Ni-NTA resin at 4 °C, eluted with an imidazole gradient, and applied to a Superdex 75 16/60 column equilibrated with 25 mM Tris-HCl pH 8.0, 150 mM NaCl and 0.5 mM TCEP-HCl. SeMet protein was expressed in E. coli BL21-gold (DE3) cells grown in M9 minimal medium supplemented with 5% LB medium. Isopropyl-β-D-thiogalactoside (0.2 mM) and 100 mg selenomethionine were added when the OD₆₀₀ reached 1.0. Cells were pelleted by centrifugation at 4,000g for 10 min after overnight induction at 16 °C. SeMet AFF4^301–351–(Gly-Ser)₄–ELL2^519–640 was prepared as above and SeMet incorporation verified by mass spectrometry.

Crystallization of the AFF4^ELLBow–ELL2^Occ fusion

The purified fusion construct AFF4 (301–351)–(Gly-Ser)₄–ELL2 (519–640) was concentrated to 10 mg ml⁻¹ with a 10 kDa centrifugal filter (Millipore). Crystals were grown by hanging-drop vapour diffusion at 19 °C. The protein solution was mixed with well buffer composed of 0.2 M NaCl, 10 mM MgCl₂, 0.3 M Na₃ citrate, 0.2 M Na thiocyanate and 0.1 M HEPES pH 7.4. Crystals appeared in 24 h and grew to full size in 5 days. Crystals were flash-frozen with liquid N₂ in well buffer. SeMet crystals were grown in the same condition as native crystals. Native data were collected on BL7-1 at Stanford Synchrotron Radiation Lightsource. Native crystals diffracted to 2.0 Å and data were collected at a wavelength of 1.1271 Å. The structure was solved using data from SeMet crystals as described in the main text.

Pull-down assays

Mutants of ELL2 (519–640) and AFF4 (300–351) were purified as described above. The concentration of proteins and peptides was determined by ultraviolet absorption at 260–280 nm. GST-ELL2 (9 μM) and His₆-AFF4 (20 μM) were incubated with GS4B resin at 4 °C for 2 h in 80 μl of 25 mM Tris-HCl pH 8.0, 150 mM NaCl and 0.5 mM TCEP-HCl. The resin was washed three times with the incubation buffer. The resin was then boiled in 30 μl 1 × SDS loading buffer at 95 °C for 5 min and the sample analysed by SDS–polyacrylamide gel electrophoresis.

Fluorescence polarization

Protein binding was measured using the fluorescence anisotropy of a 33-residue segment of AFF1 (residues 358–390), following procedures similar to those used previously to characterize AFF4 binding to P-TEFb¹⁷. AFF1 358–390 is almost identical to AFF4 318–350, with only three amino-acid changes between the two homologues. The AFF1 peptide C-FAM-GABA-EILKEMTHSWPPPLTAIHTPSTAEPSKFPFPTK-amide was synthesized (University of Utah DNA/Peptide Facility), where FAM indicates 5-carboxyfluorescein and GABA indicates a γ-amino-butyric acid spacer. Competition titration experiments with unlabelled His-tagged AFF4 protein 301–351 were performed using 2 μM Sumo-ELL2 in 25 mM HEPES pH 7.5, 100 mM NaCl, 10% glycerol, 0.05% NP-40, 0.5 mM TCEP and 5 nM fluorescent peptide. A Victor 3V (Perkin Elmer) multi-label plate reader was used to measure fluorescence anisotropy. Three experimental replicates were carried out for each curve. Binding curves were fit to a formula describing competitive binding of two different ligands to a protein²⁹ using Prism version 5.0c (Graphpad Software).
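To make the competition model concrete, the sketch below implements a commonly used exact cubic solution for two ligands competing for a single site on a protein. Whether it matches the formula of ref. 29 term for term is an assumption, the fitting itself was done in Prism rather than Python, and the K_d assigned to the labelled peptide is a placeholder because it is not given in this section.

```python
import math


def competitive_binding(p_total, a_total, b_total, kd_a, kd_b):
    """Return (free protein, [PA], [PB]) for ligands A and B competing for P.

    Exact cubic solution; all concentrations and Kd values share one unit.
    """
    a = kd_a + kd_b + a_total + b_total - p_total
    b = kd_b * (a_total - p_total) + kd_a * (b_total - p_total) + kd_a * kd_b
    c = -kd_a * kd_b * p_total
    root = math.sqrt(a * a - 3.0 * b)
    cos_arg = (-2.0 * a**3 + 9.0 * a * b - 27.0 * c) / (2.0 * root**3)
    theta = math.acos(max(-1.0, min(1.0, cos_arg)))  # clamp rounding error
    p_free = -a / 3.0 + (2.0 / 3.0) * root * math.cos(theta / 3.0)
    pa = a_total * p_free / (kd_a + p_free)
    pb = b_total * p_free / (kd_b + p_free)
    return p_free, pa, pb


P_TOTAL_NM = 2000.0       # 2 μM Sumo-ELL2 (from the text)
TRACER_NM = 5.0           # 5 nM fluorescent peptide (from the text)
KD_COMPETITOR_NM = 86.0   # unlabelled wild-type K_d quoted in Results
KD_TRACER_NM = 86.0       # ASSUMPTION: placeholder, not reported here

for competitor_nm in (0.0, 1e3, 1e4, 1e5):
    _, pa, _ = competitive_binding(
        P_TOTAL_NM, TRACER_NM, competitor_nm, KD_TRACER_NM, KD_COMPETITOR_NM
    )
    print(f"{competitor_nm:>8.0f} nM competitor: "
          f"{pa / TRACER_NM:.2f} of tracer bound")
```

In a fit, the bound fraction of labelled peptide would be mapped linearly onto the measured anisotropy and the competitor K_d adjusted to minimize the residuals. The exact solution matters here because the protein concentration (2 μM) is far above the K_d, so approximations that ignore ligand depletion would be unreliable.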

Co-immunoprecipitation

Approximately 2 × 10⁷ HEK 293T cells (UC Berkeley Cell Culture facility) in two 145-mm dishes were transfected with plasmids expressing the wild-type or mutant FLAG-AFF4 or ELL2-HA (20 μg each). Forty-eight hours after transfection, the cells were harvested and swollen in 4 ml hypotonic buffer A (10 mM HEPES-KOH (pH 7.9), 1.5 mM MgCl₂ and 10 mM KCl) for 5 min and then centrifuged at 362g for 5 min. The cells were then disrupted by grinding 20 times with a Dounce tissue homogenizer in 2 ml buffer A, followed by centrifugation at 3,220g for 10 min to collect the nuclei. The nuclei were then extracted in 400 μl buffer C (20 mM HEPES-KOH (pH 7.9), 0.42 M NaCl, 25% glycerol, 0.2 mM EDTA, 1.5 mM MgCl₂, 0.4% NP-40, 1 mM dithiothreitol and 1 × protease inhibitor cocktail) on ice for 30 min, followed by centrifugation at 20,800g for 30 min. The supernatant (nuclear extracts (NE)) was then mixed with 10 μl of anti-FLAG agarose (A2220 Sigma) or anti-HA agarose (A2095 Sigma) and rotated at 4 °C overnight. The beads were then washed three times with buffer D (20 mM HEPES-KOH (pH 7.9), 0.3 M KCl, 15% glycerol, 0.2 mM EDTA and 0.4% NP-40), and eluted with 30 μl 0.1 M glycine-HCl (pH 2.0). For western blot, 3% of the NE input and 50% of the immunoprecipitation eluate were loaded into each NE and immunoprecipitation lane, respectively. Primary antibodies used for western blots are: mouse anti-FLAG (F1804, Sigma), rat anti-HA (11867423001, Roche), rabbit anti-human ELL2 (A302–505A, Bethyl), rabbit anti-human ELL1 (A301–645A, Bethyl), rabbit anti-human AFF1 (A302–344A, Bethyl), mouse anti-human AFF4 (ab57077, Abcam) and mouse anti-human α-tubulin (CP06, EMD CHEMICALS). Secondary antibodies used for western blots are: goat anti-mouse-680 nm (A-21057, Invitrogen), goat anti-rabbit-680 nm (A-21076, Invitrogen) and goat anti-rat-800 nm (612-132-120, Rockland). For endogenous proteins except α-tubulin, the primary antibodies were diluted to 1 μg ml⁻¹; for FLAG/HA tags and α-tubulin, the primary antibodies were diluted 5,000-fold. Secondary antibodies were diluted 10,000-fold.

Luciferase reporter assay

Approximately 6 × 10⁵ HEK 293T cells or 4 × 10⁵ HeLa cells (UC Berkeley Cell Culture facility) in six-well plates were transfected in triplicate with plasmids expressing FLAG-AFF4 and/or ELL2-HA (1 μg each) together with the HIV-1 LTR-luciferase construct (0.1 μg). Forty-eight hours after transfection, the cells were collected and lysed in 1 × reporter lysis buffer (E3971 Promega), followed by centrifugation at 20,800g for 1 min. Luciferase activities in the supernatant were measured using the Luciferase Assay System (E1501 Promega) on a Lumat LB 9501 luminometer.

CRISPR/Cas9-mediated knockout of ELL2 gene in HeLa cells

The procedures and single-guide RNA sequence for generating the HeLa-based ELL2 knockout (KO) cell line dELL2 (ref. 14) were as follows. Briefly, forward (5′-CACCGAGCGCCCGGATCGCCGTCT-3′) and reverse (5′-AAACAGACGGCGATCCGGGCGCTC-3′) DNA oligos containing the single-guide RNA sequence targeting the first exon of ELL2 were synthesized, annealed and cloned into the pSpCas9(BB)-2A-Puro vector (Addgene plasmid ID: 48139), and transfected into HeLa cells, which were then selected by puromycin for 2 days, and diluted to single clones. The KO clone was initially identified by anti-ELL2 immunoblotting (Supplementary Fig. 3), and then verified by Sanger sequencing of the targeted genomic site.
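The relationship between the two oligos quoted above can be checked in a few lines. The sketch below takes the 20 nucleotides following the CACC overhang of the forward oligo as the guide sequence (an inference from the quoted oligos, not something stated explicitly) and rebuilds both oligos from it; the function names are illustrative.

```python
# Guide sequence inferred from the forward oligo in the text (bases after CACC).
GUIDE = "GAGCGCCCGGATCGCCGTCT"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def cloning_oligos(guide):
    """Add the 4-nt overhangs (CACC / AAAC) seen on the quoted oligos."""
    forward = "CACC" + guide
    reverse = "AAAC" + reverse_complement(guide)
    return forward, reverse


forward, reverse = cloning_oligos(GUIDE)
print("forward 5'-" + forward + "-3'")
print("reverse 5'-" + reverse + "-3'")
```

The reverse oligo is simply the reverse complement of the guide with its own overhang, so that the annealed duplex carries two different sticky ends and can only ligate into the cut vector in one orientation.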

Test of the effects of ELL2 mutants on HIV latency reversal

A total of 2 μg of plasmids expressing AFF1/4 alone or in combination with ELL2wt or its mutants was nucleofected into 1 × 10⁶ Jurkat 2D10 cells (gift of J. Karn, Case Western Reserve University)³⁰ using Amaxa kit V and the manufacturer’s protocol (X-005). GFP+ cells indicating the reversal of HIV latency were measured by flow cytometry 48 h post nucleofection. Three biological repeats were done for each group, with their percentages of GFP+ averaged and s.d.’s calculated to generate the error bars. An aliquot of cells from each group was lysed for immunoblotting with the indicated antibodies.
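The averaging step described above is simple to reproduce. The sketch below uses placeholder percentages, not data from this study, and assumes the sample (n − 1) standard deviation, since the text does not say which form was used.

```python
from statistics import mean, stdev

# HYPOTHETICAL %GFP+ values for three biological repeats per group.
# These numbers are placeholders for illustration, not data from the study.
gfp_positive = {
    "group A": [4.1, 3.8, 4.5],
    "group B": [12.3, 11.6, 13.0],
}


def summarize(groups):
    """Return {group: (mean, sample s.d.)} for each list of repeats."""
    return {name: (mean(vals), stdev(vals)) for name, vals in groups.items()}


summary = summarize(gfp_positive)
for name, (avg, sd) in summary.items():
    print(f"{name}: {avg:.1f} ± {sd:.1f} % GFP+")
```

With only three repeats per group the s.d. is itself a noisy estimate, so error bars of this kind are best read as a rough indication of spread.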

Data availability

Coordinates and structure factors for the structure reported here have been deposited in the Protein Data Bank under PDB code 5JW9. All additional experimental data are available from the corresponding author on reasonable request. The PDB code 1XAW and UniProt accession codes Q9UHB7 and O00472 were used in this study.

Additional information

How to cite this article: Qi, S. et al. Structural basis for ELL2 and AFF4 activation of HIV-1 proviral transcription. Nat. Commun. 8, 14076 doi: 10.1038/ncomms14076 (2017).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1. Archin, N. M., Sung, J. M., Garrido, C., Soriano-Sarabia, N. & Margolis, D. M. Eradicating HIV-1 infection: seeking to clear a persistent pathogen. Nat. Rev. Microbiol. 12, 750–764 (2014).
2. Margolis, D. M., Garcia, J. V., Hazuda, D. J. & Haynes, B. F. Latency reversal and viral clearance to cure HIV-1. Science 353, aaf6517 (2016).
3. Ruelas, D. S. & Greene, W. C. An integrated overview of HIV-1 latency. Cell 155, 519–529 (2013).
4. Mbonye, U. & Karn, J. Transcriptional control of HIV latency: cellular signaling pathways, epigenetics, happenstance and the hope for a cure. Virology 454, 328–339 (2014).
5. He, N. et al. HIV-1 Tat and host AFF4 recruit two transcription elongation factors into a bifunctional complex for coordinated activation of HIV-1 transcription. Mol. Cell 38, 428–438 (2010).
6. Sobhian, B. et al. HIV-1 Tat assembles a multifunctional transcription elongation complex and stably associates with the 7SK snRNP. Mol. Cell 38, 439–451 (2010).
7. Lu, H. et al. AFF1 is a ubiquitous P-TEFb partner to enable Tat extraction of P-TEFb from 7SK snRNP and formation of SECs for HIV transactivation. Proc. Natl Acad. Sci. USA 111, E15–E24 (2014).
8. Price, D. H. P-TEFb, a cyclin-dependent kinase controlling elongation by RNA polymerase II. Mol. Cell. Biol. 20, 2629–2634 (2000).
9. Biswas, D. et al. Function of leukemogenic mixed lineage leukemia 1 (MLL) fusion proteins through distinct partner protein complexes. Proc. Natl Acad. Sci. USA 108, 15751–15756 (2011).
10. Luo, Z. J. et al. The super elongation complex family of RNA polymerase II elongation factors: gene target specificity and transcriptional output. Mol. Cell. Biol. 32, 2608–2617 (2012).
11. Lin, C. et al. AFF4, a component of the ELL/P-TEFb elongation complex and a shared subunit of MLL chimeras, can link transcription elongation to leukemia. Mol. Cell 37, 429–437 (2010).
12. Tompa, P., Davey, N. E., Gibson, T. J. & Babu, M. M. A million peptide motifs for the molecular biologist. Mol. Cell 55, 161–169 (2014).
13. Csizmok, V., Follis, A. V., Kriwacki, R. W. & Forman-Kay, J. D. Dynamic protein interaction networks and new structural paradigms in signaling. Chem. Rev. 116, 6424–6462 (2016).
14. Li, Z., Lu, H. & Zhou, Q. A minor subset of super elongation complexes plays a predominant role in reversing HIV-1 latency. Mol. Cell. Biol. 36, 1194–1205 (2016).
15. Tahirov, T. H. et al. Crystal structure of HIV-1 Tat complexed with human P-TEFb. Nature 465, 747–751 (2010).
16. Gu, J. et al. Crystal structure of HIV-1 Tat complexed with human P-TEFb and AFF4. Cell Cycle 13, 1788–1797 (2014).
17. Schulze-Gahmen, U. et al. The AFF4 scaffold binds human P-TEFb adjacent to HIV Tat. eLife 2, e00327 (2013).
18. Leach, B. I. et al. Leukemia fusion target AF9 is an intrinsically disordered transcriptional regulator that recruits multiple partners via coupled folding and binding. Structure 21, 176–183 (2013).
19. Li, Y. H., Fanning, A. S., Anderson, J. M. & Lavie, A. Structure of the conserved cytoplasmic C-terminal domain of occludin: identification of the ZO-1 binding surface. J. Mol. Biol. 352, 151–164 (2005).
20. Chou, S. et al. HIV-1 Tat recruits transcription elongation factors dispersed along a flexible AFF4 scaffold. Proc. Natl Acad. Sci. USA 110, E123–E131 (2013).
21. Yu, J., Zhou, Y., Tanaka, I. & Yao, M. Roll: a new algorithm for the detection of protein pockets and cavities with a rolling probe sphere. Bioinformatics 26, 46–52 (2010).
22. Yokoyama, A., Lin, M., Naresh, A., Kitabayashi, I. & Cleary, M. L. A higher-order complex containing AF4 and ENL family proteins with P-TEFb facilitates oncogenic and physiologic MLL-dependent transcription. Cancer Cell 17, 198–212 (2010).
23. Filippakopoulos, P. et al. Selective inhibition of BET bromodomains. Nature 468, 1067–1073 (2010).
24. Banerjee, C. et al. BET bromodomain inhibition as a novel strategy for reactivation of HIV-1. J. Leukocyte Biol. 92, 1147–1154 (2012).
25. Bartholomeeusen, K., Xiang, Y. H., Fujinaga, K. & Peterlin, B. M. Bromodomain and extra-terminal (BET) bromodomain inhibition activate transcription via transient release of positive transcription elongation factor b (P-TEFb) from 7SK small nuclear ribonucleoprotein. J. Biol. Chem. 287, 36609–36616 (2012).
26. Zhu, J. et al. Reactivation of latent HIV-1 by inhibition of BRD4. Cell Rep. 2, 807–816 (2012).
27. Boehm, D. et al. BET bromodomain-targeting compounds reactivate HIV from latency via a Tat-independent mechanism. Cell Cycle 12, 452–462 (2013).
28. Li, Z. C., Guo, J., Wu, Y. T. & Zhou, Q. The BET bromodomain inhibitor JQ1 activates HIV latency through antagonizing Brd4 inhibition of Tat-transactivation. Nucleic Acids Res. 41, 277–287 (2013).
29. Wang, Z. X. An exact mathematical expression for describing competitive-binding of 2 different ligands to a protein molecule. FEBS Lett. 360, 111–114 (1995).
30. Pearson, R. et al. Epigenetic silencing of human immunodeficiency virus (HIV) transcription by formation of restrictive chromatin structures at the viral long terminal repeat drives the progressive entry of HIV into latency. J. Virol. 82, 12291–12303 (2008).

Acknowledgements

We thank Xuefeng Ren, James Holton and George Meigs for assistance with data collection and Bo Wan for the construction of the HeLa-based ELL2-KO cell line. This work was supported by NIH grants P50GM082250 (J.H.H.), and NIAID R01AI041757 and R01AI095057 (Q.Z.), and NSFC grant 81671388 (S.Q.). The Minstrel crystal farm was purchased with support from the NIH, S10 OD016268. Beamline 8.3.1 at the Advanced Light Source, LBNL, is supported by the U. C. Office of the President, Multicampus Research Programs and Initiatives grant MR-15-328599 and the Program for Breakthrough Biomedical Research, which is partially funded by the Sandler Foundation. The Advanced Light Source is supported by the Director, Office of Science, Office of Basic Energy Sciences, of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231. The Stanford Synchrotron Radiation Lightsource is supported by the U.S.D.O.E. under contract No. DE-AC02-76SF00515. The SSRL Structural Molecular Biology Program is supported by the DOE Office of Biological and Environmental Research and by NIH grant P41GM103393.

Author information

Authors and Affiliations

Department of Urology, State Key Laboratory of Biotherapy, West China Hospital, Sichuan University and National Collaborative Innovation Center, Chengdu, 610041, China
Shiqian Qi
Department of Molecular and Cell Biology and California Institute of Quantitative Biosciences, University of California, Berkeley, Berkeley, 94720, California, USA
Shiqian Qi, Zichong Li, Ursula Schulze-Gahmen, Goran Stjepanovic, Qiang Zhou & James H. Hurley
Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, 94720, California, USA
Goran Stjepanovic & James H. Hurley

Contributions

S.Q. performed the structural biological study. Z.L. performed the cell biological experiments. U.S.-G. performed the fluorescence polarization assay. G.S. performed hydrogen-deuterium exchange coupled to mass spectrometry (HDX MS) analysis used to optimize crystallization constructs. J.H.H. and S.Q. wrote the manuscript. All the authors discussed the results and commented on the manuscript.

Corresponding author

Correspondence to James H. Hurley.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary figures and Supplementary Tables. (PDF 12469 kb)

Peer review file (PDF 328 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

About this article

Received: 01 September 2016
Accepted: 28 November 2016
Published: 30 January 2017
DOI: https://doi.org/10.1038/ncomms14076
