Pre-mRNA splicing follows a pathway driven by ATP-dependent RNA helicases. A crucial event of the splicing pathway is the catalytic activation, which takes place at the transition between the activated Bact and the branching-competent B* spliceosomes. Catalytic activation occurs through an ATP-dependent remodelling mediated by the helicase PRP2 (also known as DHX16)1,2,3. However, because PRP2 is observed only at the periphery of spliceosomes3,4,5, its function has remained elusive. Here we show that catalytic activation occurs in two ATP-dependent stages driven by two helicases: PRP2 and Aquarius. The role of Aquarius in splicing has been enigmatic6,7. Here the inactivation of Aquarius leads to the stalling of a spliceosome intermediate—the BAQR complex—found halfway through the catalytic activation process. The cryogenic electron microscopy structure of BAQR reveals how PRP2 and Aquarius remodel Bact and BAQR, respectively. Notably, PRP2 translocates along the intron while it strips away the RES complex, opens the SF3B1 clamp and unfastens the branch helix. Translocation terminates six nucleotides downstream of the branch site through an assembly of PPIL4, SKIP and the amino-terminal domain of PRP2. Finally, Aquarius enables the dissociation of PRP2, plus the SF3A and SF3B complexes, which promotes the relocation of the branch duplex for catalysis. This work elucidates catalytic activation in human splicing, reveals how a DEAH helicase operates and provides a paradigm for how helicases can coordinate their activities.
Pre-mRNA splicing occurs in two transesterification reactions on the spliceosome, the sequential assembly and remodelling of which are driven by eight conserved ATP-dependent RNA helicases4,8,9,10. The branching reaction occurs between the branch site adenosine (BS-A) and the 5′ splice site (5′SS) during the formation of the post-branching complex (also known as the C complex). As a prerequisite for branching, the reactants are juxtaposed at the catalytic centre in a process known as catalytic activation, which occurs at the conversion of the Bact (also known as activated spliceosome) to the branching-competent B* (also known as catalytically activated spliceosome) complex1. The B* and the post-branching C complexes are very similar in structure and composition, differing primarily in the phosphodiester bond between the BS-A and the 5′SS10.
Catalytic activation requires the remodelling of Bact spliceosomes. As part of the U2–BS duplex (also known as the branch duplex), the BS-A is sequestered 50 Å away from the catalytic centre and is enclosed by the SF3B complex of the U2 small nuclear ribonucleoprotein (snRNP)2,3. This inactive conformation of Bact is stabilized by the RES complex and the proteins SF3B1 and PRP8. Additionally, the proteins SF3A2 and CWC24 (also known as RNF113A) shield the 5′SS from the catalytic centre. These contacts must be disrupted during catalytic activation to relocate the branch helix to the active centre for catalysis2,11,12,13.
In Saccharomyces cerevisiae (budding yeast), the DEAH helicase Prp2p drives catalytic activation by releasing RES, SF3a and SF3b complexes, and the proteins Cwc24p and Cwc27p; however, mechanistic details of this process remain largely unknown1,5,14. In humans, in addition to PRP2 (an orthologue of Prp2p), the helicase Aquarius is recruited to the spliceosome as part of a pentameric intron-binding complex (IBC), which contributes to the B-to-C transitions of the spliceosome. The spliceosome remodelling promoted by Aquarius and the time point of its action relative to PRP2 are unknown6,7. Aquarius also stands out as the only splicing helicase from the SF1 family.
Here we reconstitute a spliceosome stalled between the actions of PRP2 and Aquarius, which we refer to as the BAQR complex. The cryogenic electron microscopy (cryo-EM) structure of the BAQR spliceosome reveals the mechanism of Bact-to-B* remodelling and the mode of action of PRP2 assisted by interacting proteins. Our results also provide insight into how the two helicases coordinate their activities to promote the branching reaction.
Overview of the human BAQR spliceosome
After identifying the IBC as a complex that delivers Aquarius to the spliceosome6, we reconstituted the BAQR spliceosome on MINX pre-mRNA in HeLa nuclear extracts supplemented with IBC carrying the dominant-negative Aquarius mutant K829A (IBC(K829A)). This mutant stalls the spliceosome before branching6 (Extended Data Fig. 1a–d and Supplementary Fig. 1). The proteome of BAQR resembled that of Bact complexes2,13,15. However, within BAQR, PRP2 and PPIL4 were abundant, whereas the PRP2 cofactor GPKOW was detected in low amounts (Supplementary Data 1).
We reconstructed a cryo-EM map of the BAQR core complex at about 2.9 Å resolution. Focused 3D classification allowed us to resolve the peripheral regions of the complex and model in the BRR2 helicase, the PRP19 and IBC complexes, and the 3′ module of the U2 snRNP (Fig. 1a, Extended Data Figs. 1e–i, 2 and 3a–d, Extended Data Table 1, Supplementary Data 2 and Supplementary Video 1).
Compared with Bact complexes2,13, BAQR exhibited repositioning of PRP2, the SF3B complex, SKIP (also known as PRP45), the Jab1 domain and the RNaseH-like (RH) domain of PRP8, and the intron RNA downstream of the branch helix. More subtle changes were present for BRR2, CWC15 (also known as AD002) and CWC22. Furthermore, PPIL4 was recruited to the BAQR complex while the RES complex, SF3B6 (also known as p14), CWC27, SRRM1 and GPKOW were destabilized (Fig. 1 and Supplementary Video 1).
The position of Aquarius was primarily the same in Bact complexes and BAQR complexes2,13, but notably different in C complexes2,16, which indicates that inactivated Aquarius prevents the conversion of BAQR to C complexes (Extended Data Fig. 3e). By contrast, PRP2 occupied different locations in Bact and BAQR. This result indicates that BAQR is an intermediate spliceosome stalled after the translocation of PRP2 but before the action of Aquarius. We conclude that BAQR represents an intermediate of the splicing pathway that follows the Bact complex while preceding B* and C complexes1,11,12,17 (Fig. 1b).
Structure of PRP2 in the BAQR spliceosome
PRP2 is composed of a flexible N-terminal region (residues 1–387) and a conserved globular core (PRP2core, residues 388–1042), which is typical of all four spliceosomal DEAH helicases (Fig. 2a). Structures of fungal PRP2core have been reported for Chaetomium thermophilum and budding yeast in complex with ligands such as ADP, ADP-BeF3–, RNA oligonucleotides and the cofactor Spp2p (refs. 5,18). The PRP2core has also been resolved at 3.2 Å resolution in yeast Bact spliceosomes or docked as a homology model in human Bact complexes5,11,13.
The cryo-EM density map of human BAQR enabled the building of the PRP2core and of the accessory N-terminal domain (PRP2NTD, residues 265–295; Fig. 2b,c and Extended Data Fig. 3c,d). The PRP2core region exhibited the two RecA-like domains (RecA1 and RecA2) and a carboxy-terminal module (PRP2CTD) that encompassed the winged-helix (WH), oligonucleotide/oligosaccharide-binding (OB) and helix-bundle (HB) domains. The PRP2core from BAQR is in the open, post-ATP hydrolytic conformation. This conformation has previously only been observed for the ATPase-defective mutant Prp2p(K252A) in the yeast Bact spliceosome5.
The structure and function of PRP2NTD were unknown. In the BAQR spliceosome, PRP2NTD acquired a stage-defining fold that lacked a hydrophobic core and was organized into three distinct modules. Owing to their extended conformation, they interacted with several spliceosomal subunits. We refer to these elements as the hook, the clip and the pin (Fig. 2a,b and Extended Data Fig. 4a).
PRP2 translocates about 19 nucleotides
PRP2 has previously been observed at the periphery of human and yeast Bact complexes, residing on the convex side of the HEAT domain of SF3B1 (SF3B1HEAT)5,11,13. Notably, in the BAQR, PRP2 is no longer anchored at the periphery of the spliceosome. Instead, PRP2 has now moved around 85 Å deep towards the branch helix and the core of BAQR, filling a large cavity between SF3B1 and PRP8 within BAQR (Fig. 2d). In the new location, PRP2 shares interfaces with SF3B1HEAT and PRP8 through the RecA2 and HB domains, respectively (Extended Data Fig. 4b,c). The helicase rotated about 70° counterclockwise, and the RecA-like domains adopted the open conformation on top of SF3B1HEAT (refs. 5,18) (Fig. 2b,c). In this post-translocation state, PRP2 accommodated seven nucleotides of the intron, assigned as the 7–13 region downstream of BS-A, in a channel framed by the RecA-like domains on one side and PRP2CTD on the other side.
Within yeast Bact complexes, PRP2 binds RNA at positions 28–34 (refs. 5,19), which probably correspond to positions 26–32 in human Bact complexes according to our 3D alignment data (Extended Data Fig. 5a–d). A comparison between BAQR and Bact indicated that PRP2 advanced approximately 19 nucleotides towards the spliceosome core during the Bact-to-BAQR transition, which explains the stringent requirement for at least 32–34 nucleotides downstream of BS-A for efficient branching in human splicing15 (Figs. 2d and 3a and Extended Data Fig. 5e). Therefore, PRP2 translocates along the intron from its earlier position in the Bact complex rather than acting from a distance like a molecular winch that pulls the intron from the periphery of the spliceosome (Supplementary Video 2). Concomitantly, the exiting intron was relocated about 70 Å owing to the helicase rotation and exchange in intron–protein interactions following the formation of the BAQR complex (Fig. 2d and Extended Data Fig. 5f).
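The figure of approximately 19 nucleotides follows directly from the two footprints quoted above. The short Python sketch below only restates that arithmetic; the function and variable names are illustrative and are not part of the original analysis. Reading the gap between BS-A and the first nucleotide in the PRP2 channel as the stopping distance is an assumption made here for illustration.

```python
# Footprints of PRP2 on the intron, numbered as nucleotides downstream of BS-A.
# The ranges are those quoted in the text; all names are illustrative only.
BACT_HUMAN = (26, 32)  # inferred human Bact footprint (from the 3D alignment)
BAQR_HUMAN = (7, 13)   # post-translocation footprint in BAQR


def footprint_length(footprint):
    """Number of nucleotides covered by an inclusive (start, end) footprint."""
    start, end = footprint
    return end - start + 1


def translocation_distance(before, after):
    """Shift of the footprint towards BS-A, in nucleotides."""
    if footprint_length(before) != footprint_length(after):
        raise ValueError("footprints differ in length")
    return before[0] - after[0]


def gap_to_branch_site(footprint):
    """Nucleotides between BS-A and the first nucleotide of the footprint."""
    return footprint[0] - 1


print(footprint_length(BAQR_HUMAN))                    # 7
print(translocation_distance(BACT_HUMAN, BAQR_HUMAN))  # 19
print(gap_to_branch_site(BAQR_HUMAN))                  # 6
```

Under these numbers, the helicase covers seven nucleotides in both states, moves 19 nucleotides towards BS-A and leaves six nucleotides between BS-A and its channel.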
Mechanism of dissociation of the RES complex
The RES complex is required to convert B to Bact complexes and dissociates after PRP2 action at the Bact-to-B* transition20,21. The RES complex comprises the proteins BUD13, RBMX2 (also known as SNU17) and SNIP1 (also known as PML1), and resides in a cavity of the Bact spliceosome framed by SF3B1 and PRP8. In this cavity, the RES complex binds the proteins SF3B1, PRP8, SKIP and CWC22, whereas the RRM domain of RBMX2 binds the polypyrimidine tract (PPT; or the equivalent region in yeast)2,13.
In the BAQR structure, the translocation of PRP2 displaces several subunits, including the RES complex from the top of SF3B1HEAT (Fig. 3b,c and Supplementary Video 3). The dissociation of RES is primarily caused by the stripping of RBMX2 away from the intron (Fig. 3a and Extended Data Fig. 5e,f). As BUD13 and SNIP1 share interfaces with RBMX2, dislocation of the latter probably releases the entire RES complex. Moreover, the clip of PRP2NTD replaced SNIP1 by interacting with the α3 helix of SKIP, whereas new interactions between PRP2core, SKIP and PRP8 replace those between BUD13 and SKIP in the Bact complex (Fig. 3a and Extended Data Fig. 4c,d).
After destabilizing the RES complex from RNA and former protein contacts, only a short stretch of BUD13 (residues 530–557) is visible in BAQR, which is bound to a composite structure formed by the hook of PRP2NTD, PRP8 and the slightly relocated MA3 domain of CWC22 (Fig. 3a and Extended Data Fig. 4e,f). However, proteomics analysis showed that the entire RES complex remained bound to the spliceosome (Supplementary Data 1). Thus, BUD13(530–557) anchors the destabilized RES complex to the spliceosome, after being displaced from the intron. This flexible anchoring is conditioned by the hook of PRP2 binding to PRP8, the presence of which may serve as a signature for the occurrence of the translocation of PRP2. In this way, PRP2 might control the trajectory of the exiting RES complex, moving it away from the downstream intron region that must engage in new interactions.
SF3B1 unfastens the branch duplex
The remodelling and destabilization of the SF3B complex is a key event in the splicing pathway and a consequence of the translocation of PRP2. The largest subunit of SF3B is SF3B1, of which the HEAT domain exhibits a distinctive loose conformation in the BAQR complex. This conformation differs markedly from the open and closed conformations of SF3B1 from pre-A and from A-to-Bact complexes, respectively (Fig. 3d). For example, SF3B1(A514) and MINX(U136) are separated by 4, 48 and 42 Å in Bact, BAQR and pre-A complexes, respectively. Furthermore, interacting residues SF3B1(H550) and PHF5A(Y51) from pre-A complexes are separated by 16 Å in the BAQR complex.
The transition of SF3B1 from closed to loose following the conversion of Bact to BAQR seemed to be caused by PRP2, after stripping the PPT (nucleotides 13–18) away from the binding pocket. Because the pocket is also a hinge, SF3B1 opens widely to unclamp the branch duplex from one side, like an opening pair of tweezers (Figs. 2b,c and 3d, Extended Data Fig. 6a,b and Supplementary Video 3).
The transition mainly involves the H1–H10 repeats, whereas the contacts between BS-A and its binding pocket are almost identical in Bact and BAQR complexes (Fig. 3d and Extended Data Fig. 6c). Overall, the consequence of the translocation of PRP2 is the disruption of 60% (around 1,100 Å2) of SF3B1’s contact interface with RNA, facilitating the release of the branch duplex for relocation to the catalytic centre.
The opening of SF3B1HEAT by PRP2 triggers additional remodelling that weakens the interaction between the SF3B complex and the domains of PRP8 (Supplementary Video 3). In detail, the change in SF3B1HEAT curvature disrupts contacts between the repeats H10–H11 and PRP8RH, and perturbs the endonuclease-like domain of PRP8 (Extended Data Fig. 6a). In addition, transition to the loose conformation induces the release of SRRM1 and SF3B6 from SF3B1. De-structuring also occurs for SF3B1NTD (residues 1–488), an extended region that wraps around SF3B6 within the Bact complex (Fig. 3b,c and Supplementary Video 3). As SF3B6 and SF3B1NTD are connectors between the SF3B complex and PRP8 (Fig. 3c), their destabilization probably facilitates the subsequent release of SF3B and SF3A complexes, and the Aquarius-dependent relocation of the branch duplex at the transition from BAQR to B* and then C complexes (Supplementary Video 4).
Termination of the translocation of PRP2
The low complementarity between U2 and the branch sites of human introns requires tight regulation of the activity of PRP2 to prevent it from unwinding the branch duplex. Therefore, PRP2 needs to advance sufficiently far to dissociate the PPT from SF3B1 and open the SF3B1HEAT clamp, yet not too far to compromise the branch duplex. Although PRP8 acts like an initial physical barrier able to stop the movement of PRP2 (Fig. 2d), the helicase may keep pulling the intron like a winch to unwind or alter the branch duplex. Thus, a different mechanism for the termination of PRP2 translocation is required.
PRP2 in BAQR shares large interfaces with PPIL4 (1,040 Å2) and SKIP (1,850 Å2), which suggests that these proteins could regulate the helicase (Fig. 3c and Extended Data Fig. 4g–k). Indeed, flexible elements of PRP2NTD, PPIL4 and SKIP form a stable 3D assembly that bound the exiting RNA strand at the nucleotides 17–20, downstream of the BS-A (Fig. 4a and Extended Data Fig. 4g,h). This result indicates that these proteins have a collective role in the timely termination of helicase translocation.
PPIL4 belongs to the cyclophilin family of peptidylprolyl isomerases (PPIases) and was not assigned in other cryo-EM structures of spliceosomes, which left its role in splicing unknown. In the BAQR map, we built PPIL4, which included the PPIase and RRM domain separated by a linker containing an α-helix (Extended Data Fig. 4g,h). Notably, PPIL4RRM and the linker bound the exiting intron, covering the position occupied previously by RBMX2RRM of the RES complex in Bact (Figs. 3a–c and 4a,b). Therefore, this intron-binding site is only accessible for PPIL4 after PRP2 has dissociated RES and translocated to an upstream position. Consistent with the BAQR structure, recombinant PPIL4 formed a stable complex with PRP2 (137–1022) (delineated as the region built in the cryo-EM map; Extended Data Fig. 7a–c and Supplementary Fig. 2). Furthermore, PPIL4 was able to bind a model RNA substrate (for example, a poly(A)-rich 3′ single-stranded overhang) both in the presence and absence of PRP2 (Extended Data Fig. 7d,e and Supplementary Fig. 3).
Anchoring of PPIL4 to spliceosomes is facilitated by the NineTeen complex (NTC). The CWC15 component of NTC resides on the surface of PRP8, where it binds the PPIL4PPIase domain through a short stretch (Figs. 3c and 4c and Extended Data Fig. 4g). The defined distance between the PPIase and RRM domains of PPIL4 is maintained by the interdomain linker, which is structurally stabilized by helices from PRP2NTD and SKIP (Fig. 4a and Extended Data Fig. 4h).
SKIP is a largely unstructured protein, whose disparate elements interact with subunits of other spliceosomes8. The long characteristic α3 helix of SKIP (residues 282–340) binds PRP8 in Bact, BAQR and C complexes (Fig. 3b,c and Extended Data Fig. 8a). In BAQR, this helix stabilizes the post-translocation state of PRP2 through interactions with the clip of PRP2NTD and the linker of PPIL4. Furthermore, the residues 352–400 of SKIP are visible only in BAQR, in which the α4 helix binds the clip of PRP2NTD and the intron at nucleotides 17–20 (Fig. 4a,b and Extended Data Fig. 4h). Notably, the residues 371–400 bind PRP2core, such that a 310 helix intercalates like a wedge between the two RecA modules and the HB domain of PRP2 (Fig. 4a and Extended Data Fig. 4j). Superposition of our structure with the PRP2 structure from C. thermophilum in the closed state (Protein Data Bank (PDB) identifier 6zm2)18 indicates how the wedge element of SKIP might lock the helicase in the open conformation and therefore terminate translocation (Fig. 4d,e, Extended Data Fig. 4i and Supplementary Video 5). We suggest that PPIL4 and SKIP might act as negative regulators of PRP2, forming a context-specific molecular brake to terminate the translocation of the helicase at a defined time and location on the intron.
Mode of action of a DEAH helicase
The four DEAH splicing helicases (PRP2, PRP16, PRP22 and PRP43) were observed at relatively fixed positions from the periphery of spliceosomes, where they act by less understood mechanisms4. In particular, whether PRP2 moves along the RNA or pulls the substrate from a distance remained unclear3,4,19,22,23.
The structure of BAQR reveals how PRP2 can operate in several stages (Fig. 5 and Supplementary Videos 2, 3 and 5). Starting from the periphery of Bact complexes5,11,13, PRP2 translocates around 19 nucleotides to form the BAQR complex while stripping away the proteins RBMX2 and SF3B1 from the intron. A cascade of events follows, including destabilizations (RES proteins), dissociations (SF3B6, SRRM1 and CWC27), restructuring (SF3B1NTD, SKIP and PRP2NTD), rearrangement of globular domains (PRP8 and CWC22) and recruitment (PPIL4). Essentially, the signal from ATP hydrolysis is amplified to induce large-scale remodelling.
The translocation of PRP2 terminates six nucleotides downstream of the BS-A in a network of interactions that resemble a molecular brake constructed from PRP2NTD, SKIP and PPIL4. The brake binds the exiting intron like a brake shoe and intercalates a wedge element of SKIP between the RecA1, RecA2 and HB domains, immobilizing PRP2 in the open conformation (Fig. 4a,b,d and Supplementary Video 5). Thus, PPIL4 and SKIP might act as helicase cofactors from the class of negative regulators24. This termination mechanism differs from the constitutive inhibition of the DEAD-box helicase eIF4AIII in the exon–junction complex, whereby MAGO, Y14 and MLN51 appear to enclose and stabilize the RecA-like domains in the closed conformation25. Notably, CWC22 and NTC might contribute to the regulation of PRP2 by interacting with the RES complex and PPIL4, respectively. Finally, PRP2 dissociates during the spliceosome remodelling process driven by Aquarius.
Recombinant PRP2(137–1022) did not exhibit unwinding activity in vitro, either alone or in the presence of the cofactor GPKOW (Extended Data Fig. 7f–l and Supplementary Fig. 3). The same inactivity has been observed for the budding yeast counterpart Prp2p26,27. Whether this reflects a vital role of spliceosomes in activating translocation or an intrinsic inability of the helicase to use translocation for duplex separation remains unclear.
A question that is raised from these results is whether other splicing DEAH helicases also operate by translocation rather than winching22,28,29. PRP16 interacts with its targets in C complexes, the step I factors CCDC49 (Cwc25p in S. cerevisiae) and CCDC94 (Yju2 in S. cerevisiae), at the 3′ end of the branched intron before their release16,30. Reminiscent of the function of PRP2, PRP16 might translocate in a 3′-to-5′ direction to displace the intron-bound step I factors, thereby promoting the repositioning of the pre-mRNA substrate for exon ligation. Furthermore, although introns bound by PRP22 are not visible in cryo-EM maps, PRP22 slightly relocates between different C* complex variants31, which suggests that translocation is possible.
Identifying PRP2NTD as an accessory domain for multiple context-specific functions might provide suggestions on how NTDs function in other DEAH helicases. For instance, a fragment of PRP22NTD (residues 393–427) inserts between the RT and linker domains of PRP8, which indicates that it has a role in anchoring the helicase downstream of the 3′ exon in the human C* and P spliceosomes31,32,33. Overall, the mechanism of action of PRP2 captured within BAQR may provide a paradigm for other DEAH helicases and guide future investigations.
Two helicases drive catalytic activation
An important implication of this work is that the Bact to B* transition, also known as catalytic activation, is driven by two helicases—PRP2 and Aquarius, acting on the branch site from downstream and upstream locations, respectively (Figs. 1b and 5b).
Arrested between the action of the two helicases, the BAQR structure revealed how PRP2 operates at the Bact-to-BAQR transition and implicates Aquarius as the helicase that promotes the conversion of BAQR to B*. The requirement for two helicases is probably an adaptation for transferring the delicate branch duplex—which is often unstable owing to low complementarity34,35—to the catalytic centre while keeping it unaltered. Therefore, whereas PRP2 operates downstream of the branch site to unfasten the branch duplex from the constraints of the SF3B–RES complex, Aquarius acts upstream of the branch site to induce the removal of the remaining constraints and the relocation to the catalytic centre.
This two-step catalytic activation process could provide additional checkpoints to proofread and discard incorrectly selected introns, the length and sequence of which can vary substantially in metazoans36,37. This mechanism might also facilitate coordination between splicing and other intron-related events. Of relevance, Aquarius couples splicing to the biogenesis of intron-encoded box C/D small nucleolar RNP (snoRNP)38, which suggests that it coordinates separation in time and space between snoRNP biogenesis and branch duplex transfer for splicing.
Aquarius binds the intron at position −40 to −33 upstream of BS-A in C complexes38. Despite the moderate resolution around Aquarius (around 6.2 Å), a comparison between BAQR and C complexes showed a repositioning of the helicase, which suggests how it might operate (Extended Data Fig. 9). We have previously shown that Aquarius unwinds duplexes in vitro with apparent 3′-to-5′ polarity. As the mutant Aquarius(Y1196A) inactivates unwinding but not splicing, the relevance of this assay on splicing remains unclear6. However, putative translocation with 3′-to-5′ polarity might induce relaxation of the intron and perturb the contacts between the domains of Aquarius and adjacent subunits. This movement could propagate changes in interfaces between proteins interposed from Aquarius to the SF3B1–BS-A interface (primarily SF3A and SF3B subunits), which causes the release of BS-A and dissociation of the SF3A and SF3B complexes (Extended Data Fig. 9a–c).
Aquarius might also use 5′-to-3′ polarity, similar to the phylogenetically related SF1 helicase Upf1. By pulling the intron with this polarity, Aquarius could remotely eject BS-A from the primary pocket of SF3B1. Consequently, SF3B1 would switch from a loose to an open conformation, thereby restructuring HEAT repeats 16–20, which forms interfaces with PRP8, SF3B2 and SF3A2 (Extended Data Fig. 9d–g). Ultimately, the entire SF3A–SF3B complex would undock from the core components PRP8 and the snRNA U6, which accomplishes the conversion of BAQR to B* and C complexes (Fig. 5b, Extended Data Fig. 8 and Supplementary Video 4).
The arrest of PRP2 in interactions with the molecular brake indicates that Aquarius starts acting after the complete translocation of PRP2. However, some chronological overlap of the actions of the two helicases cannot be excluded. For instance, Aquarius might change the tension in the intron to induce additional loosening of SF3B1 while PRP2 facilitates the complete liberation of the branch duplex.
Significantly, Aquarius and all molecular brake components—PPIL4, elements of PRP2NTD and SKIP—are highly conserved among humans, plants and yeasts39 (Supplementary Fig. 4). This conservation supports the view that Aquarius and the molecular brake have coevolved to complement PRP2 across all kingdoms of eukaryotes, which indicates that catalytic activation in two ATP-dependent steps might be a universal mechanism.
Simplified catalytic activation in yeast
In contrast to fission yeast (Schizosaccharomyces pombe), budding yeast (S. cerevisiae) lacks Aquarius and the molecular brake, which raises the question of how catalytic activation can occur only by PRP2 (Prp2p in S. cerevisiae). Based on this work and a large body of published data about Prp2p, we propose a model for spliceosome remodelling in budding yeast (Fig. 5c).
Prp2p binds the 3′ tail of the intron, advancing in the 3′-to-5′ direction to disrupt interactions between SF3b1 (also known as Hsh155p) and the branch duplex while destabilizing SF3a, SF3b and RES complexes19,23,40,41. Consistent with the human model, this remodelling implies that Prp2p uses translocation rather than winching, as postulated in earlier models19,40 (Fig. 5c). Spp2p, the G-patch orthologue of human GPKOW, assists these remodelling steps1,5,27.
After destabilization of Res, we propose that Prp2p might transit into a BAQR-like state, arriving at approximately 6 nucleotides downstream of the branch duplex. This scenario could explain the genetic interaction between the cold-sensitive allele Prp2pQ548N (PRP2Q721 in humans) and the SF3b1(D450G) or SF3b1(V502F) mutant42,43. The human counterparts SF3B1(D781) and SF3B1(L833) are in physical proximity to PRP2 only within BAQR spliceosomes (Extended Data Fig. 4l).
Furthermore, the human BAQR structure might explain the effect of Cwc22p(454–491) on the ability of Prp2p to remodel the Bact spliceosome44. The equivalent region of CWC22, the hook of PRP2NTD (conserved in budding yeast) and the RES complex interact in BAQR, which suggests that the role of Cwc22p is conserved in facilitating the function of the helicase44 (Fig. 3b and Extended Data Fig. 4m–o). Notably, Prp2pNTD is dispensable for viability45, which suggests that neighbouring subunits might functionally compensate for its deletion.
In contrast to the human orthologue, Prp2p might advance beyond the BAQR-equivalent time point, as there is no molecular brake to terminate translocation. The unrestricted Prp2p could extract BS-A from the primary pocket, which triggers the transition of SF3b1 from the loose to the open conformation. In this way, Prp2p liberates the branch duplex and dissociates SF3a, SF3b and RES complexes, and the proteins Cwc24p and Cwc27p19,23,41 (Fig. 5c). We suggest that this simplified mechanism is tolerated in budding yeast primarily because the high complementarity between the intron and U2 snRNA prevents its alteration by the unidirectional action of Prp2p.
The complete cycle of SF3B1 transitions
Accumulating evidence outlines SF3B1 as a conformational switch adapted for the recognition and transfer of branch sites. This work provides new insight into the function and cyclic pathway of conformational transitions of SF3B1, mediated by RNA helicases (Extended Data Fig. 6d).
First, the ability of SF3B1 to switch conformations relies on two pockets—here referred to as primary and secondary pockets—that sequentially bind BS-A and PPT after the formation of the prespliceosome or release them in the reverse order during catalytic activation (Fig. 3d and Extended Data Fig. 6d). Both pockets are also conformational hinges, which thereby induce the progressive compaction or decompaction of the HEAT domain following the binding or release of cognate RNA elements. Consequently, SF3B1 transits stepwise to the open, half-closed and closed conformation after BS-A and PPT recognition during pre-A conversion to A complexes46,47,48. The open state facilitates the formation of the branch duplex by toehold-mediated strand invasion—a non-enzymatic mechanism captured in a human pre-A complex stalled using spliceostatin A47.
The closed conformation persists until the Bact complex, when PRP2 detaches the bound PPT from the secondary pocket, which then induces the loose conformation of SF3B1 within BAQR. The requirement of bound PPT for the closed conformation might explain the formation of Bact-like spliceosomes on substrates bearing at least 16 nucleotides downstream of BS-A, whereas 6 nucleotides are insufficient15. Notably, the loose conformation differs from the half-closed one, which is probably due to the contacts between the PRP2core and HEAT repeats (410 Å2 interfaces; Fig. 2b,c).
Next, Aquarius promotes the extraction of BS-A from the primary pocket and the release of SF3B, probably in the open conformation typical for the apo form46. The released SF3B might be recycled as a building block for the biogenesis of 17S U2 snRNPs assisted by the helicase SF3b125 (ref. 49), thus re-entering the splicing pathway. Overall, the structure of BAQR reveals the crucial importance of SF3B1 as a conformational switch in catalytic activation and how helicases can modulate the pathway of conformational transitions of SF3B1.
Recombinant proteins were produced from codon-optimized synthetic genes (GeneArt, ThermoFisher Scientific). Full-length wild-type Aquarius and its K829A mutant were fused to a C-terminal 8×His-tag and cloned into the pFL vector backbone as previously described6. For co-expression of IBC subunits, the master, 4-protein construct ∆ISCC, comprising SYF1 (also known as XAB2), ISY1, CCDC16 (also known as ZNF830) and PPIE (also known as CypE), was assembled by in vitro Cre–loxP recombination of the donor pSPL-CCDC16–PPIE and acceptor pFL-SYF1–ISY1 vectors6. The expression constructs were transformed into DH10MultiBacY Escherichia coli cells. The resulting bacmids were isolated using a High Pure Plasmid Isolation kit (Roche) and used for transfection into Sf9 (Spodoptera frugiperda) insect cells. Sf9 cells were transfected with the help of either X-tremeGENE HP (Roche) or FuGENE HD (Promega) reagents. Insect cell lines were not tested for mycoplasma contamination.
Human PRP2 (NCBI Reference Sequence identifier: NM_003587, transcript variant 1) and human PPIL4 (NCBI Sequence identifiers BC020986 and NM_139126.4) open reading frame clones were obtained from OriGene (RC202912) and Applied Biological Materials (373710120000), whereas human GPKOW and SKIP were codon-optimized for expression in insect cells and synthesized by GeneArt (ThermoFisher Scientific). All constructs were inserted by ligation-independent cloning into a modified pFastBac vector backbone in-frame with an N-terminal twin-StrepII affinity tag, which can be cleaved off with the HRV-3C protease. All recombinant constructs were verified by Sanger sequencing.
Purification of recombinant proteins
Recombinant proteins were produced in insect cells using the MultiBac system50. The initial V0 and V1 baculovirus stocks were produced in Sf9 cells. Large-scale protein co-expression was conducted in High Five (Hi5) insect cells (Trichoplusia ni, BTI-TN5B1-4) through the combination of two V1 stocks of baculovirus, coding for the C-terminal 8×His-tagged Aquarius (wild-type or mutant K829A) and the untagged ∆ISCC complex, in 1:1.5 ratio (V1AQR:V1ΔISCC). The purification of the wild-type and the K829A dominant-negative IBCs was performed according to a previous protocol established in our laboratory, but with some modifications6. Insect cells expressing IBC were disrupted by sonication in lysis buffer (50 mM HEPES-NaOH, pH 7.5, 400 mM NaCl, 10% (v/v) glycerol, 5 mM 2-mercaptoethanol (2-ME), 30 mM imidazole and protease inhibitors (complete EDTA-free, Roche)) and the cell debris was pelleted by centrifugation at 18,000 r.p.m. at 4 °C for 1 h (A27-8×50 rotor; Thermo Scientific). The supernatant was loaded on a 5 ml HisTrap HP Ni(II)-chelating column (GE Healthcare/Cytiva) equilibrated with the lysis buffer. The Ni(II)-bound protein species were eluted using a linear 30–400 mM imidazole gradient in Ni(II) elution buffer (50 mM HEPES-NaOH, pH 7.5, 200 mM NaCl, 10% (v/v) glycerol and 5 mM 2-ME) and analysed by SDS–PAGE. The IBC-containing fractions were loaded on an anion-exchange Q Sepharose HP column (GE Healthcare/Cytiva) equilibrated in 20 mM HEPES-KOH, pH 7.5, 200 mM KCl, 10% (v/v) glycerol and 5 mM dithiothreitol (DTT). The bound IBC was eluted off the resin with a linear 0.2–1 M KCl gradient. The peak fractions, corresponding to the stoichiometric IBC, were pooled, concentrated to 10 mg ml–1, aliquoted, flash-frozen in liquid nitrogen and stored at −80 °C.
Twin-StrepII-tagged human PRP2(137–1022), PPIL4 and GPKOW were expressed in Sf9 or Hi5 insect cells using recombinant baculoviruses, prepared as described above, in amounts sufficient to induce cell cycle arrest in 24 h. All recombinant proteins used in functional assays were purified at 4 °C using similar purification protocols. In brief, insect cells, expressing the target proteins, were briefly sonicated and lysed in lysis buffer (50 mM HEPES-KOH, pH 7.5, 150 mM KCl, 10% (v/v) glycerol, 0.2% (v/v) Triton X-100 RNase-free, 2 mM DTT and protease inhibitors (complete EDTA-free, Roche)), with the detergent added after sonication, and the cell debris was pelleted by centrifugation at 23,000 r.p.m. for 1 h in an A27-8×50 rotor (Thermo Scientific) or 40,000 r.p.m. (approximately 164,244g) in a type 70 Ti rotor (Beckman Coulter) for 45 min at 4 °C. The filtered supernatant was applied to a 5 ml Strep-Tactin XT 4Flow pre-packed column (IBA Lifesciences) or incubated in batch for 1 h with around 1 ml or 2.5 ml Strep-Tactin beads per 1 litre of Sf9 or Hi5 culture, respectively. The bound proteins were eluted by competition with 50 mM biotin in lysis buffer or with elution buffer containing 60 mM biotin (50 mM HEPES-KOH, pH 7.5, 100 mM KCl, 5% (v/v) glycerol, 1 mM EDTA, 2 mM DTT and 60 mM biotin). The Strep-Tactin eluates were concentrated and further purified by size-exclusion chromatography (SEC) on Superdex 200 HiLoad 16/600 200 pg (GE Healthcare/Cytiva) or Superdex 200 Increase 10/300 GL (GE Healthcare/Cytiva) equilibrated in 20 mM HEPES-KOH, pH 7.5, 150 mM KCl, 10% (v/v) glycerol and 2 mM DTT. Alternatively, the PRP2(137–1022) Strep-Tactin eluates were purified on Q Sepharose HP (GE Healthcare/Cytiva) and eluted from the 5 ml anion-exchange column using a linear 0–30% gradient formed over 40 ml between buffer A (20 mM HEPES-KOH, pH 7.5, 100 mM KCl, 5% (v/v) glycerol and 0.5 mM DTT) and buffer B (20 mM HEPES-KOH, pH 7.5, 1 M KCl, 5% (v/v) glycerol and 0.5 mM DTT). 
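The rotor speeds above are quoted in r.p.m., with a g-force given only for the type 70 Ti spin. As a minimal sketch of how the two quantities relate, the snippet below uses the standard relation RCF = 1.118 × 10^-5 × r × (r.p.m.)^2, with r in cm. The rotor radius is not stated in the text; the value used here is an assumption chosen only for illustration and should be replaced by the manufacturer's figure for the rotor in use.

```python
def rcf_from_rpm(rpm: float, radius_cm: float) -> float:
    """Relative centrifugal force (multiples of g) for a given speed and radius."""
    return 1.118e-5 * radius_cm * rpm ** 2


def rpm_from_rcf(rcf: float, radius_cm: float) -> float:
    """Inverse conversion: rotor speed needed to reach a given RCF."""
    return (rcf / (1.118e-5 * radius_cm)) ** 0.5


# ASSUMPTION: illustrative maximum radius in cm; not taken from the text.
ASSUMED_RADIUS_CM = 9.2

if __name__ == "__main__":
    rcf = rcf_from_rpm(40_000, ASSUMED_RADIUS_CM)
    print(f"40,000 r.p.m. at r = {ASSUMED_RADIUS_CM} cm is about {rcf:,.0f} g")
```

Because RCF scales with the square of the speed, the same r.p.m. value corresponds to very different forces in rotors of different radii, which is why reporting the rotor type alongside the speed matters.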
With the exception of PPIL4, all proteins used in functional assays comprised the N-terminal Twin-StrepII tag. The Twin-StrepII tag of PPIL4 was removed by on-bead cleavage with the HRV-3C protease, and the untagged protein was purified by SEC on Superdex 200 Increase 10/300 GL (GE Healthcare/Cytiva). Purified proteins were concentrated by ultrafiltration to their stock concentration (PRP2(137–1022), around 19–22 µM; GPKOW, around 165 µM; untagged PPIL4, around 47 µM; tagged PPIL4, about 67 µM), frozen in liquid nitrogen and stored at −80 °C or used directly in assays.
Reconstitution of PRP2–GPKOW and PRP2–PPIL4 complexes
Using a minimal in vitro system, we first tested by SEC whether PRP2(137–1022) interacts directly and forms stable complexes with its cofactors GPKOW and PPIL4. All SEC analyses were carried out in SEC buffer (20 mM HEPES-KOH, pH 7.5, 100 mM KCl, 1.5 mM MgCl2 and 5% (v/v) glycerol). The PRP2 complexes were reconstituted in vitro by mixing around 30 μg recombinant helicase with a 5-fold excess of cofactor (PPIL4 or GPKOW) in a 100 μl reaction volume. The samples were then incubated for 1 h on ice and applied to a Superdex 200 Increase 10/300 GL column (GE Healthcare/Cytiva) run at 0.5 ml min–1 using an Äkta Go system (Cytiva). SDS–PAGE analyses of the SEC fractions showed that in both cases, the peak profile of PRP2(137–1022) shifted to earlier fractions in the presence of a cofactor (fraction 9 in the presence of PPIL4 and fraction 7 in the presence of GPKOW) compared with the protein alone (fraction 11), which indicated the formation of a helicase–cofactor complex. As an independent means of probing the direct interaction of PRP2(137–1022) with PPIL4, we co-expressed the two splicing factors in Sf9 insect cells and affinity purified their complex from baculovirus-infected cultures. The cultured cells were lysed in lysis buffer (50 mM HEPES-KOH, pH 7.5, 150 mM KCl, 5% (v/v) glycerol, 2 mM DTT, 2 mM MgCl2, 0.1% (v/v) Triton X-100 and protease inhibitors (complete EDTA-free, Roche)) and the expressed factors were captured on Strep-Tactin XT 4FLOW affinity beads (IBA Lifesciences). The protein samples were eluted by competition with biotin in elution buffer (50 mM HEPES-KOH, pH 7.5, 150 mM KCl, 5% (v/v) glycerol, 2 mM DTT, 2 mM MgCl2 and 60 mM biotin), concentrated by ultrafiltration to about 500 μl and subjected to SEC in SEC buffer. As for the in vitro assembled PRP2(137–1022)–PPIL4 complex, the complex prepared from insect cells peaked in fraction 9 when eluting from the Superdex 200 column.
Overall, this result shows that the limited interaction interface between the NTD of PRP2 and PPIL4, observed in the BAQR cryo-EM structure, promotes the stable recruitment of the helicase cofactor in an RNA-independent manner. The PRP2(137–1022)–PPIL4 complex used in helicase activity assays was prepared by insect cell co-expression, purified in two steps, as described above, and concentrated to about 14.5 μM. The PRP2(137–1022)–GPKOW complex used in biochemical assays was assembled in vitro, purified by SEC from GPKOW excess and concentrated to about 40.3 μM. In vitro reconstitution of the complex between PRP2(137–1022) and PPIL4 was performed two times using two independent protein preparations. The PRP2(137–1022)–GPKOW complex was assembled in vitro at least three times.
Preparation of the nuclear extracts active in splicing
HeLa S3 cells, tested for mycoplasma, were obtained from GBF (Helmholtz Centre for Infection Research). The nuclear extract active in splicing was prepared according to the standard protocol from Dignam and used as described13. Cells were grown in a 30 litre fermenter (Applikon Biotek) to a density of 6.5 × 10⁶ cells per ml in DMEM/F12 (1:1) medium supplemented with 5% (v/v) newborn calf serum. After collection by centrifugation for 10 min at 2,000 r.p.m. in an 8 × 2,000 ml BIOS rotor (Thermo Scientific), the cells were washed twice with cold 1× PBS buffer. The cell pellet was re-suspended in MC buffer (10 mM HEPES-KOH, pH 7.6, 10 mM potassium acetate, 0.5 mM magnesium acetate, 0.5 mM DTT and 2 tablets of protease inhibitors (complete EDTA-free, Roche) per 50 ml of the buffer). After 5 min of incubation on ice, the cells were lysed with 18 strokes of a Dounce homogenizer at 4 °C. The nuclei were pelleted for 5 min at 10,000 r.p.m. in a F14-14×50cy rotor (Thermo Scientific) and were further lysed in Roeder C buffer (20 mM HEPES-KOH, pH 7.9, 0.2 mM EDTA, pH 8.0, 25% (v/v) glycerol, 420 mM NaCl, 1 mM MgCl2, 0.5 mM DTT and 0.5 mM PMSF) by 20 strokes of a Dounce homogenizer at 4 °C. The mixture was stirred slowly for 40 min at 4 °C and centrifuged at 12,300 r.p.m. in a F14-14×50cy rotor. The supernatant corresponding to the active nuclear extract was aliquoted and flash-frozen in liquid nitrogen and stored at −80 °C.
In vitro pre-mRNA splicing
A typical splicing reaction contained the following components: 20% (v/v) non-dialysed HeLa nuclear extract, 3 mM MgCl2, 2 mM ATP, 20 mM creatine phosphate and 10 nM m7G-capped MINX-3×MS2 pre-mRNA, fluorescently body-labelled with cyanine 5-uridine-5′-triphosphate (Cy5-UTP, Enzo). To monitor splicing of the Cy5-labelled pre-mRNA substrate, the splicing reactions were incubated for 15, 30, 60, 90 and 120 min at 30 °C (Extended Data Fig. 1c). To assess the effect of recombinant IBC(K829A) on the assembly of the spliceosome, the splicing reaction was supplemented with IBC(K829A) to a final concentration of 0.45 μM and pre-incubated for 20 min at 30 °C before the addition of the pre-mRNA substrate and ATP (Extended Data Fig. 1c). In all cases, RNA was recovered by phenol–chloroform–isoamyl alcohol extraction followed by ethanol precipitation. The recovered RNA samples were analysed on denaturing urea polyacrylamide gels (14%, 0.5× TBE). The fluorescently labelled pre-mRNA substrate, splicing intermediates and the final products were detected by in-gel fluorescence using a Typhoon FLA 9500 imaging system (GE Healthcare).
Affinity purification of the human BAQR spliceosome
Spliceosomal complexes stalled with recombinant IBC(K829A) were assembled in vitro on non-labelled m7G-capped MINX-3×MS2 pre-mRNA, synthesized by T7 RNA polymerase run-off transcription. Before initiating the reaction, pre-mRNA substrate (used at 10 nM, final concentration) was incubated with a 20-fold molar excess of MBP-MS2 fusion protein for 30 min at 4 °C. Before the addition of ATP and pre-mRNA-MBP-MS2, the large-scale splicing reactions were supplemented with 0.45 μM recombinant IBC(K829A) complex and incubated for 20 min at 30 °C. Therefore, a typical preparative splicing reaction, used for cryo-EM sample preparation, comprised (final concentrations) the following: 20 mM HEPES-KOH, pH 7.9, 2 mM ATP, 20 mM creatine phosphate, 3.2 mM MgCl2, 84 mM NaCl, 5% (v/v) glycerol, 20% (v/v) non-dialysed HeLa nuclear extract, 10 nM MINX-3×MS2, 200 nM MBP-MS2 and 0.45 µM IBC(K829A).
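The final concentrations listed above translate into pipetting volumes through the dilution relation C1 × V1 = C2 × V2. The following is a minimal sketch of that arithmetic; the stock concentrations and the reaction volume are hypothetical placeholders (they are not given in the text), whereas the final concentrations of pre-mRNA, MBP-MS2 and the 0.45 μM IBC(K829A) supplement are those stated above.

```python
# Final concentrations (nM) as stated in the text.
FINAL_NM = {"MINX-3xMS2 pre-mRNA": 10.0, "MBP-MS2": 200.0, "IBC(K829A)": 450.0}

# ASSUMPTION: hypothetical stock concentrations (nM), for illustration only.
STOCK_NM = {"MINX-3xMS2 pre-mRNA": 1_000.0, "MBP-MS2": 20_000.0, "IBC(K829A)": 25_000.0}


def molar_excess(component_nm: float, reference_nm: float) -> float:
    """Fold molar excess of one component over a reference component."""
    return component_nm / reference_nm


def stock_volume_ul(final_nm: float, stock_nm: float, reaction_ul: float) -> float:
    """Volume of stock needed so that C1 * V1 = C2 * V2 holds."""
    if stock_nm < final_nm:
        raise ValueError("stock is more dilute than the requested final concentration")
    return final_nm * reaction_ul / stock_nm


if __name__ == "__main__":
    reaction_ul = 1_000.0  # ASSUMPTION: hypothetical reaction volume
    for name, final_nm in FINAL_NM.items():
        volume = stock_volume_ul(final_nm, STOCK_NM[name], reaction_ul)
        print(f"{name}: {volume:.1f} ul of stock per {reaction_ul:.0f} ul reaction")
    excess = molar_excess(FINAL_NM["MBP-MS2"], FINAL_NM["MINX-3xMS2 pre-mRNA"])
    print(f"MBP-MS2 molar excess over pre-mRNA: {excess:.0f}-fold")
```

The last line reproduces the consistency check between the 20-fold excess of MBP-MS2 and the 10 nM and 200 nM final concentrations.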
To enable spliceosome assembly, the splicing mixture was slowly stirred for 120 min at 30 °C in a water bath. The non-incorporated pre-mRNA was cleaved by DNA-directed endogenous RNase H digestion for 20 min at 30 °C using a 30-fold molar excess of the cmd42 (5′-TCTTACCGTTCG-3′) and cmd43 (5′-CGGGTTTCCGAT-3′) antisense DNA oligonucleotides. To prevent precipitation of assembled spliceosomes, NaCl was slowly added to the splicing reaction to a final concentration of 120 mM. The aggregates were removed by centrifugation for 10 min at 12,300 r.p.m. in a F14-14×50 rotor. The supernatant was then applied onto a pre-packed 5 ml MBPTrap HP column (GE Healthcare/Cytiva) at 1 ml min–1, the column was washed with 20 column volumes of G120 buffer (20 mM HEPES-KOH, pH 7.9, 1.5 mM MgCl2 and 120 mM KCl) and the spliceosomal complexes were eluted with G120 buffer supplemented with 3 mM maltose. The peak fractions were analysed by denaturing 4–12% NuPAGE gels (Life Technologies). The gels were stained with SYBR Gold (Invitrogen) and Coomassie to detect RNA and protein species, respectively. The elution fractions containing IBC(K829A) spliceosomes (BAQR) were loaded onto a linear 10–30% (w/v) glycerol gradient prepared in G120 buffer and centrifuged at 20,500 r.p.m. for 16 h at 4 °C in a TST 41.14 rotor (Kontron). The gradients were divided into 23 fractions (500 μl each) and manually collected from the top. Peak fractions (14, 15 and 16) from 3 different gradients were pooled and crosslinked with 0.1% (v/v) glutaraldehyde (Electron Microscopy Sciences) for 1 h on ice. The unreacted glutaraldehyde was quenched with 100 mM aspartate (final concentration). The crosslinked BAQR was then incubated for 2 h on ice and concentrated to an absorbance at 280 nm of 0.6.
The buffer was exchanged by ultrafiltration to the sample buffer (20 mM HEPES-KOH, pH 7.9, 120 mM NaCl, 1.5 mM magnesium acetate and 2% (w/v) glycerol) using an Amicon 50 kDa MWCO (Millipore) spin concentrator and used directly for cryo-EM grid preparation.
Preparation of cryo-EM grids
Volumes of 4 μl of concentrated sample were applied to one side of glow-discharged UltrAuFoil 200 R2/2 grids (Quantifoil) in a Vitrobot Mark IV (FEI) operated at 4 °C and 100% humidity. The grids were blotted for 2 s with a blotting force of 5 and immediately frozen by plunging into liquid ethane cooled by liquid nitrogen.
Cryo-EM data acquisition and image analysis
The electron micrographs for all datasets were acquired on an FEI Titan Krios G2 transmission electron microscope operated at 300 keV in EFTEM mode, equipped with a Quantum LS 967 energy filter (Gatan), zero loss mode, 30-eV slit width and a K2 Summit direct electron detector (Gatan) in counting mode. Automated data acquisition for dataset 1 (untilted) and dataset 2 (tilted, 25°) was performed using EPU software (Thermo Fisher) at a nominal magnification of ×130,000 (1.05 Å per pixel (Å px–1)). Micrographs for these two datasets were collected as 40-frame movies at a dose rate of around 5 e– Å–2 s−1 over 9 s, which resulted in a total dose of about 45 e– Å–2.
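The exposure settings above are linked by simple arithmetic: the total dose is the dose rate multiplied by the exposure time, and dividing by the number of frames gives the dose per movie frame. A minimal sketch of that bookkeeping, using only the values stated in the text (the function names are illustrative):

```python
def total_dose(dose_rate_e_per_a2_s: float, exposure_s: float) -> float:
    """Accumulated dose (e- per square angstrom) over the whole exposure."""
    return dose_rate_e_per_a2_s * exposure_s


def dose_per_frame(total_e_per_a2: float, n_frames: int) -> float:
    """Dose carried by each movie frame, assuming equal frame durations."""
    return total_e_per_a2 / n_frames


if __name__ == "__main__":
    dose = total_dose(5.0, 9.0)        # about 5 e-/A^2/s over 9 s
    per_frame = dose_per_frame(dose, 40)  # 40-frame movies
    print(f"total dose: {dose:.1f} e-/A^2")
    print(f"dose per frame: {per_frame:.3f} e-/A^2")
```

The per-frame value is the quantity that dose-weighting schemes operate on; the equal-frame-duration assumption is ours and is not stated in the text.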
The two cryo-EM datasets of BAQR, collected at 0° and 25° tilt angles, were separately preprocessed on the fly with Warp (motion correction, CTF estimation and dose weighting), and particles were picked using a retrained convolutional neural network51. Each set of Warp-extracted particles was then subjected to three parallel 2D classification runs in cryoSPARC (v.0.65) using 50 classes and applying a class uncertainty factor of 1.5. The good classes were selected from the three runs, and all particles in these good classes were merged (while removing duplicates).
Initial attempts at processing the cryo-EM images in cryoSPARC revealed that this spliceosome complex exhibited a more structurally rigid core, which continued with more dynamic and poorly resolved peripheral regions (Extended Data Fig. 2). Hence, we systematically tested different EM data processing routines with the aim of improving the more peripheral densities of the spliceosome and to computationally resolve its structural heterogeneity. In brief, the Warp-picked, combined BAQR particle images (734,691 particles) were re-extracted in Relion 3.1 using a box size of 640/640 px (672 Å/672 Å) and then 2× binned before being subjected to 2D classification with the ‘Ignore CTFs until the first peak’ option switched on (Extended Data Fig. 2). The resulting ‘good’ particles subset (160,650 particles) was then refined in 3D using a 60 Å low-passed human Bact map as a reference volume (Electron Microscopy Databank identifier EMD-4236)13 and a spherical mask. The obtained consensus map of the complex (map M1) was then used as a reference in a global 3D classification round with 10 classes starting from the initial pool of BAQR particles (Extended Data Fig. 2). The cleaned subset of BAQR particles was then subjected to another round of global 3D classification with 8 classes and a 20 Å resolution limit. This second-round of 3D classification enabled us to resolve the two major compositional states of the complex (Extended Data Fig. 2), termed state A (29.2% particles) and state B (3.4% particles). In the state B complex, stronger density was observed for the PRP19 helical bundle and the step two splicing factor PRP17. Conversely, these map regions appeared to be poorly resolved in the state A complex, which is probably due to their flexibility and low occupancy, as observed for a previously described Bact complex13. 
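The box dimensions quoted above follow directly from the pixel size: a 640 px box at 1.05 Å px–1 spans 672 Å, and 2× binning halves the box in pixels while doubling the pixel size. The sketch below reproduces this arithmetic and also reports the Nyquist limit (twice the pixel size) of the binned data; the function names are illustrative and not part of Relion or cryoSPARC.

```python
PIXEL_SIZE_A = 1.05  # angstrom per pixel, from the acquisition settings
BOX_PX = 640         # extraction box edge in pixels, from the text


def box_size_angstrom(box_px: int, pixel_size_a: float) -> float:
    """Physical edge length of the extraction box."""
    return box_px * pixel_size_a


def bin_particles(box_px: int, pixel_size_a: float, factor: int):
    """Return (box edge in px, pixel size in angstrom) after integer binning."""
    if box_px % factor:
        raise ValueError("box size must be divisible by the binning factor")
    return box_px // factor, pixel_size_a * factor


def nyquist_limit_angstrom(pixel_size_a: float) -> float:
    """Finest resolution representable at a given sampling."""
    return 2.0 * pixel_size_a


if __name__ == "__main__":
    binned_box, binned_pixel = bin_particles(BOX_PX, PIXEL_SIZE_A, 2)
    print(f"unbinned box: {box_size_angstrom(BOX_PX, PIXEL_SIZE_A):.0f} A")
    print(f"binned box: {binned_box} px at {binned_pixel:.2f} A/px")
    print(f"binned Nyquist limit: {nyquist_limit_angstrom(binned_pixel):.1f} A")
```

The binned Nyquist limit is far finer than the 20 Å limit applied during 3D classification, so binning at this stage reduces computation without discarding information that the classification uses.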
In both cases, however, the peripheral building blocks of the complex (the IBC, U2 3′ core and the BRR2 helicase) were less resolved than the central part of the complex. To further improve the density of the state A map, we performed an additional global 3D classification without image alignment with 8 classes and a 20 Å resolution limit and then re-extracted and re-centred the BAQR particle images at their original sampling rate (1.05 Å px–1) in a 520 px box. The 3D refinement of this subset of particles (179,552 particles) with a soft mask was then followed by CTF refinement (per particle defocus, per micrograph astigmatism) and an additional round of classification in 2D. The subsequent refinement in 3D of this final subset of particles (146,157 particles) with loose or tight soft masks around BAQR resulted in map 2 and map 3 of the complex that reached global, gold-standard Fourier shell correlation resolutions of about 3.1 Å and 2.9 Å, respectively (Extended Data Figs. 2 and 3). Using these well-resolved consensus maps, we modelled the translocating PRP2 helicase, including its extended NTD domain, as well as the remodelled SF3B complex, and two out of the four BAQR PPIases (that is, PPIL2 and PPIL4).
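Gold-standard resolution values such as those quoted above are conventionally read from the Fourier shell correlation (FSC) curve of two independently refined half-maps, at the spatial frequency where the curve first drops below 0.143. The text does not state the threshold explicitly, so 0.143 is an assumption here, and the curve in the example is synthetic. A minimal sketch of the threshold read-out with linear interpolation between shells:

```python
from typing import Optional, Sequence


def fsc_resolution(
    freqs_inv_a: Sequence[float],
    fsc: Sequence[float],
    threshold: float = 0.143,  # ASSUMPTION: conventional gold-standard cutoff
) -> Optional[float]:
    """Resolution (angstrom) at the first downward crossing of the threshold.

    freqs_inv_a are shell spatial frequencies in 1/angstrom, in ascending order.
    Returns None if the curve never crosses the threshold from above.
    """
    for i in range(1, len(fsc)):
        if fsc[i - 1] >= threshold > fsc[i]:
            # Linear interpolation between the two neighbouring shells.
            t = (fsc[i - 1] - threshold) / (fsc[i - 1] - fsc[i])
            f_cross = freqs_inv_a[i - 1] + t * (freqs_inv_a[i] - freqs_inv_a[i - 1])
            return 1.0 / f_cross
    return None


if __name__ == "__main__":
    # Synthetic curve for illustration only; not data from this study.
    freqs = [0.10, 0.20, 0.30, 0.40]
    curve = [1.00, 0.90, 0.243, 0.043]
    print(f"estimated resolution: {fsc_resolution(freqs, curve):.2f} A")
```

In practice, the refinement software reports this value after correcting for the effect of the mask, which is one reason why loose and tight masks yield different nominal resolutions for the same particle set.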
Starting from the particle sets, which led to the higher resolution core maps, we also carried out local 3D classifications (Extended Data Fig. 4). Thus, by applying a mask covering the overall BAQR volume, we subjected these particles to 3D classification without image alignment with 6 classes. Particles exhibiting more pronounced densities for the U2 3′ core module and the IBC module (27,719 particles) were subsequently re-extracted and refined in 3D with soft and spherical masks. These alternative BAQR maps enabled the rigid body placement of the U2 3′ core and IBC modules at the periphery of BAQR. Local classifications were also performed on the particle set assigned to state B of the complex. In this case, a soft mask was applied to the region of the spliceosome where the BRR2 helicase resides, which was poorly resolved in the consensus map. The state B particle images were then subjected to 3D classification without image alignment and 4 classes. A minority subset of the overall particle set (12,395 particles) appeared to contain a more ordered BRR2, with the helicase being generally destabilized in BAQR cryo-EM images, which is probably because of the substantial change to the conformation of SF3B1 induced by the translocation of PRP2. These particles were then re-extracted and refined in 3D to obtain the complete map of the complex (map M4), which now enclosed the U2 3′ core, the IBC module, the PRP19 helical bundle, the BRR2 helicase and the splicing factor PRP17.
Density assignment, model building and refinement
To enable model building, the BAQR core maps (Supplementary Fig. 2) were sharpened using DeepEMhancer52 or locally scaled with LocScale53. The quality of the cryo-EM density map at the core of BAQR enabled careful model building and side-chain assignment, whereas modelling of the more solvent-exposed map regions was restricted to backbone tracing and rigid-body docking of known structures and computational models. Initial interpretation of the BAQR maps was facilitated by the available models of human Bact complexes obtained in several states (PDB identifiers 6FF4, 6FF7 and 5Z57). Thus, the model building of the BAQR complex was initiated by the docking of the human Bact models into the consensus maps (maps 2, 3 and 4) of the complex, followed by manual, residue-by-residue model adjustment in Coot54 and refinement with phenix.real_space_refine55. Although filtered, locally scaled maps were used for model interpretation, the real space refinement of the BAQR model was carried out exclusively against the original, unsharpened maps. The higher local resolution of our maps enabled us to improve and correct some of the available models for the Bact core region. After initial placement and refinement, several new (or reconfigured) BAQR density regions were observed compared with previous Bact complexes. The globular density element identified in the U2 snRNP region of the complex and surrounded by the HEAT domain of SF3B1 was assigned to the helicase domain of PRP2 (residues 388–1017). We modelled the helicase in an open state, consistent with it being trapped in a post-translocation state, and identified seven intron nucleotides accommodated by its RNA-binding tunnel. Compared with a previously published yeast Bact structure (PDB identifier 7DCO), the helicase domain of PRP2 (PRP2core) was no longer positioned on the convex side of SF3B1HEAT, but translocated along the intron towards the branch helix. 
The PRP2-bound RNA formed a continuous density stretch with the intron strand of the U2–BS duplex, showing that the helicase had translocated in a 3′-to-5′ direction from its position in the Bact complex. Supporting this model, density for the RES complex subunits RBMX2 and SNIP1, which bind intron regions (or are located close) downstream of the branch helix, could not be observed in BAQR, whereas only a short helical region belonging to the BUD13 subunit (residues 530–557) was identified in the proximity of the MA3 domain of CWC22. Besides modelling the helicase domain of PRP2, we also built de novo a large part of the PRP2NTD that was missing in previous Bact structures. The N-terminal most α-helical region (the pin, residues 161–193) was positioned at the periphery of the helicase domain, where it established interfaces with the RecA1, WH and HB domains of the helicase; two other long helices (the clip, residues 223–256 and residues 264–296) were engaged in tight interactions with the PPIL4 linker region and the long helix of SKIP (residues 286–340), respectively. The latter PRP2NTD helix extended further towards PRP8 and finally reached CWC22.
The new density of PRP2 in BAQR coincides with a significant change to the curvature of SF3B1HEAT, with its N-terminal HEAT repeats no longer in contact with the BS–U2 helix. However, the density of BS-A was still observed in the SF3B1–PHF5A binding pocket, with the hinge region of SF3B organized as in the closed state. Consistently, the SF3A2 and SF3A3 matrin-type zinc-finger domains were organized and positioned as in the Bact complex and still engaged the 5′-end of the U2 snRNA. We modelled SF3B1 by individually docking its consecutive HEAT repeats and manually adjusting their fit to the map. The resulting structure of the reconfigured SF3B1 differed from all other known conformations of SF3B1HEAT. Because of the PRP2-induced reconfiguration of SF3B1HEAT, its SF3B6 and SRRM1 binding partners were destabilized from their earlier locations on the N-terminal side of the HEAT superhelix and, compared with Bact, their densities were no longer observed in BAQR.
Facing the PRP2 density element and following the RNA density exiting the helicase cassette, a new small globular domain appeared to be recruited to BAQR. We assigned it to the C-terminal RRM region of PPIL4 on the basis of proteomics analysis and of its continuity, through an α-helical density, with the predicted cyclophilin-type PPIase domain that interacts with the long helix of SKIP(286–340). A similar SKIP-bound PPIase density has previously been observed at an almost identical position in the later C complex (PDB identifier 5YZG), where the authors assigned it to the PPIase domain of PPIG30. The latter lacks an RRM domain and is more abundant in C complexes, but is present only in trace amounts in BAQR. The other PPIase identified in the BAQR proteome that comprises both a PPIase and an RRM domain is PPIE, an IBC module component located at the periphery of the spliceosome.
In addition to PPIL4 and PPIE, we modelled two other PPIases: PPIL1, which interacts with and probably stabilizes the PRP19 helical bundle on the BAQR core, and PPIL2. The latter PPIase adopted an extended conformation, with its N-terminal tandem U-box motifs interacting with SNU114 and separated from its C-terminal PPIase domain by about 90 Å. An ordered linker region (residues 234–266) connected the two PPIL2 moieties, as previously observed in a Bact cryo-EM structure5. We did not observe density for the PPIase domain of CWC27, which is consistent with it being destabilized by the propagated changes induced by the remodelling of SF3B1. However, its interacting partner CWC24 was still present in BAQR, with densities observed for both its zinc finger motif (residues 190–238), which sequesters the 5′SS, and its C-terminal moiety (residues 262–309), which is bound to the BPB domain of SF3B3. The final model of the complex (Supplementary Data 2), including its more dynamic peripheral modules, consisted of four RNA molecules (the U2, U5 and U6 snRNA, and the MINX pre-mRNA substrate) and 45 individual polypeptide chains, among which 3 are splicing helicases (PRP2, BRR2 and Aquarius) and 4 are PPIases (PPIL1, PPIL2, PPIL4 and PPIE).
Morphing and generation of movies
The PRP2 helicase translocation trajectory was generated using ChimeraX (v.1.3), with the morph functionality starting from the Bact state of the SF3B module (that is, SF3B1 in a closed conformation with PRP2 positioned on the convex side of SF3B1HEAT). The NTD domains of SF3B1 (SF3B1NTD) and PRP2 (PRP2NTD) were not considered in the morphing analysis, and the U2 sequence was limited to the U2 stem-loop IIa structure and the BS-interaction region. The initial (Bact) and final (BAQR) trajectory snapshots were aligned using the PHF5A subunit of SF3B as a reference before morphing. The RNA-bound state of PRP2 in Bact was modelled based on the yeast Bact structure5. Supplementary Videos 1–4 were generated using ChimeraX (v.1.3), and Supplementary Video 5 was generated using PyMol.
The helicase assays were performed as previously described for other splicing helicases6,56,57,58,59. To assess the ability of human PRP2 to unwind the U2–BS helix in the presence of its different cofactors, we performed in vitro helicase assays on fluorescently labelled model substrates comprising a perfect double-stranded RNA duplex followed by a single-stranded 3′ overhang. The unwinding activity of PRP2 was monitored using either a gel-based readout6, in which the single-stranded Cy5-labelled product is separated from the helicase substrate on a native polyacrylamide gel, or a fluorescence-based readout58, in which the decrease in substrate fluorescence is recorded over time as the duplex is unwound. In the latter case, the dual-labelled RNA strand, displaced by the helicase activity, forms an intramolecular hairpin that brings the terminal Cy5 probe into proximity with its spectrally overlapping dark quencher (BHQ-2), thereby quenching fluorophore emission and causing the fluorescence to decay.
The helicase substrate for the gel-based assay was prepared by mixing 30 μM Cy5-labelled strand (5′-CACCAGCUCCGUAGGCGC-Cy5-3′) with 45 μM unlabelled RNA oligonucleotide (5′-GCGCCUACGGAGCUGGUGGCGUAGGCGCAAAAAAAAAAAAAAAAAAAA-3′; its first 18 nucleotides are complementary to the labelled strand) in 20 mM HEPES-KOH, pH 7.5. The RNA substrate used in the fluorescence-based helicase assay58 was prepared by mixing in a similar molar ratio the dual-labelled RNA oligonucleotide (5′-Cy5-GCGCCUACGCCACCAGCUCCGUAGGCGC-BHQ-2-3′) with the unlabelled strand (5′-GCGCCUACGGAGCUGGUGGCGUAGGCGCAAAAAAAAAAAAAAAAAAAA-3′; its first 28 nucleotides are complementary to the dual-labelled strand) in RNA annealing buffer (6 mM HEPES-KOH, pH 7.5, 50 mM KCl and 0.2 mM MgCl2). In both cases, the single-stranded RNA oligonucleotides were annealed by sequential incubation at 95 °C for 2 min, then at 80 °C for 10 min and then slowly cooled down to room temperature and stored on ice. The RNA and DNA oligonucleotides used in the helicase assays were obtained from IDT (Integrated DNA Technologies) or Microsynth.
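The design of these substrates can be checked computationally from the sequences alone. The sketch below, which uses only the oligonucleotide sequences given above (the variable and function names are ours), confirms that each labelled strand pairs with the 5′ end of the unlabelled strand, that the unpaired remainder forms a poly(A) 3′ overhang, and that the two ends of the dual-labelled strand are self-complementary, as required for hairpin formation after displacement.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Reverse complement of an RNA sequence written 5'-to-3'."""
    return rna.translate(COMPLEMENT)[::-1]


# Sequences copied from the text, 5'-to-3', without the dye and quencher labels.
CY5_STRAND = "CACCAGCUCCGUAGGCGC"
BEACON_STRAND = "GCGCCUACGCCACCAGCUCCGUAGGCGC"
UNLABELLED = "GCGCCUACGGAGCUGGUGGCGUAGGCGCAAAAAAAAAAAAAAAAAAAA"


def duplex_and_overhang(labelled: str, unlabelled: str):
    """Return (duplex length, 3' overhang) if the labelled strand pairs with
    the 5' end of the unlabelled strand; raise ValueError otherwise."""
    paired = reverse_complement(labelled)
    if not unlabelled.startswith(paired):
        raise ValueError("labelled strand is not complementary to the 5' end")
    return len(paired), unlabelled[len(paired):]


if __name__ == "__main__":
    for name, strand in (("gel-based", CY5_STRAND), ("beacon", BEACON_STRAND)):
        length, overhang = duplex_and_overhang(strand, UNLABELLED)
        print(f"{name}: {length} bp duplex, {len(overhang)} nt 3' overhang")
    # The beacon strand ends are self-complementary, allowing hairpin formation.
    print(reverse_complement(BEACON_STRAND[:9]) == BEACON_STRAND[-9:])
```

The single-stranded 3′ overhang provides the loading site for a helicase that translocates in the 3′-to-5′ direction.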
In a typical gel-based helicase assay, 50–100 nM fluorescently labelled RNA substrate was mixed in 20 μl on ice with increasing concentrations of PRP2 constructs or PRP2 complexes (1–10 μM) in helicase assay buffer (20 mM HEPES-KOH, pH 7.5, 50 mM KCl, 2 mM MgCl2, 0.1 mg ml–1 BSA (NEB), 0.08 U μl–1 RNasin (Promega), 5% (v/v) glycerol and 0.5 μM competitor DNA (5′-GCGCCTACGGAGCTGGTG-3′), final concentrations). The GPKOW cofactor was added in a 5-fold molar excess over PRP2(137–1022) or PRP2(137–1022)–PPIL4. After pre-incubation for 10 min at 37 °C (or 20 min at 30 °C), the unwinding reaction was initiated by the addition of 2 mM ATP, and the samples were incubated for 1 h at 37 °C (or 1 h at 30 °C). The reactions were stopped by the addition of 5 μl quenching buffer (0.83 mM Tris-HCl, pH 7.6, 5 mM EDTA, 5% (v/v) glycerol, 0.0025% bromophenol blue, 0.0025% xylene cyanol FF and 0.04 U μl–1 proteinase K (NEB), final concentrations) and incubated for 20 min at 37 °C. RNA duplex unwinding was assessed on a 14% polyacrylamide native gel prepared in 1× TBE or 1× Tris-glycine buffer and was run in 0.5× TBE (or 1× Tris-glycine) at 100 V for approximately 90 min at room temperature. The yeast Prp22p helicase, used as a positive control, was prepared as previously described27. The RNA gels were scanned at the Cy5 excitation peak using an iBright 1500 imaging system (Invitrogen). The gel-based helicase assays were repeated at least two times.
The molecular beacon helicase assays were performed using a Fluorolog spectrofluorometer (Horiba). In brief, 200 nM dual-labelled substrate was mixed with 1–5 μM purified helicase in reaction buffer (20 mM HEPES-KOH, pH 7.5, 50–150 mM KCl and 2 mM MgCl2) in the absence of ATP and preincubated for at least 10 min at room temperature. As for the gel-based unwinding assays, GPKOW was added in a 5-fold molar excess over PRP2(137–1022), whereas PPIL4 was added in a 3-fold molar excess. The reactions were subsequently transferred to a 1.5 mm ultra-microcuvette (105.252-QS, Hellma) and equilibrated at the measurement temperature (25 °C or 30 °C) in the spectrometer. The unwinding reaction was initiated by the addition of 2 mM ATP (final concentration), and the decay in Cy5 fluorescence was immediately recorded at an interval time of 1 s and an integration time of 0.1 s. The Cy5 fluorescent probe was excited at 645 nm (3 nm slit width) and its emission was measured at 667 nm (3 nm slit width) and corrected for detector noise. Data were analysed and plotted using OriginPro 2020 (18.104.22.168). The fluorescence-based helicase assay was repeated at least three times. Three independent preparations of PRP2(137–1022) were tested in these assays.
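Time courses of this kind are often summarized by an observed rate constant. The text does not specify a kinetic model, so the following is only a sketch under the assumption of a single-exponential decay, F(t) = F_plateau + (F_0 − F_plateau) × exp(−k × t), with a known plateau; it estimates k by linear regression on the log-transformed normalized signal. The trace in the example is synthetic and all names are illustrative.

```python
import math
from typing import Sequence


def observed_rate(times_s: Sequence[float], signal: Sequence[float], plateau: float) -> float:
    """Estimate k (per second) assuming single-exponential decay to a known plateau."""
    f0 = signal[0]
    xs, ys = [], []
    for t, f in zip(times_s, signal):
        fraction = (f - plateau) / (f0 - plateau)  # fraction of substrate signal left
        if fraction > 0:  # the logarithm is undefined at or below the plateau
            xs.append(t)
            ys.append(math.log(fraction))
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    return -slope


if __name__ == "__main__":
    # Synthetic, noise-free trace sampled every 1 s; not data from this study.
    k_true, plateau = 0.05, 0.2
    times = [float(t) for t in range(60)]
    trace = [plateau + 0.8 * math.exp(-k_true * t) for t in times]
    print(f"recovered k = {observed_rate(times, trace, plateau):.4f} per s")
```

With real, noisy data a nonlinear fit that also estimates the plateau would usually be preferable; the log-linear form is shown because it makes the underlying assumption explicit.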
Electrophoretic mobility shift assay
To investigate the RNA binding activity of the different PRP2 complexes tested in the unwinding assays, 50 nM fluorescent substrate was mixed with increasing concentrations of helicase or helicase cofactor samples (0.1–8 μM). The 20 μl binding reactions were prepared in RNA binding buffer (20 mM HEPES-KOH, pH 7.5, 50 mM KCl, 2 mM MgCl2, 0.1 mg ml–1 BSA (NEB) and 5% (v/v) glycerol, final concentrations) and incubated for 30 min at room temperature before being loaded on a 5% polyacrylamide native gel prepared in 1× Tris-glycine buffer. The native RNA gel was pre-run at 40 V for 30 min at room temperature and then run at 60 V in 1× Tris-glycine. The gels were scanned at the Cy5 excitation peak. The RNA binding assays were repeated at least two times, and similar results were obtained.
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
The coordinate files have been deposited into the PDB with the identifiers 7QTT (high-resolution core) and 8CH6 (overall composite model). The cryo-EM maps have been deposited into the Electron Microscopy Data Bank with the identifiers EMD-14146 (high-resolution core) and EMD-16658 (overall reconstruction). The mass spectrometry protein composition and model building of the BAQR spliceosomes are available at FigShare (https://doi.org/10.6084/m9.figshare.22047275).
Warkocki, Z. et al. Reconstitution of both steps of Saccharomyces cerevisiae splicing with purified spliceosomal components. Nat. Struct. Mol. Biol. 16, 1237–1243 (2009).
Zhang, X. et al. Structure of the human activated spliceosome in three conformational states. Cell Res. 28, 307–322 (2018).
Rauhut, R. et al. Molecular architecture of the Saccharomyces cerevisiae activated spliceosome. Science 353, 1399–1405 (2016).
De Bortoli, F., Espinosa, S. & Zhao, R. DEAH-box RNA helicases in pre-mRNA splicing. Trends Biochem. Sci. 46, 225–238 (2021).
Bai, R. et al. Mechanism of spliceosome remodeling by the ATPase/helicase Prp2 and its coactivator Spp2. Science https://doi.org/10.1126/science.abe8863 (2021).
De, I. et al. The RNA helicase Aquarius exhibits structural adaptations mediating its recruitment to spliceosomes. Nat. Struct. Mol. Biol. 22, 138–144 (2015).
De, I., Schmitzova, J. & Pena, V. The organization and contribution of helicases to RNA splicing. Wiley Interdiscip. Rev. RNA 7, 259–274 (2016).
Kastner, B., Will, C. L., Stark, H. & Luhrmann, R. Structural insights into nuclear pre-mRNA splicing in higher eukaryotes. Cold Spring Harb. Perspect. Biol. https://doi.org/10.1101/cshperspect.a032417 (2019).
Wan, R., Bai, R., Zhan, X. & Shi, Y. How is precursor messenger RNA spliced by the spliceosome? Annu. Rev. Biochem. 89, 333–358 (2020).
Wilkinson, M. E., Charenton, C. & Nagai, K. RNA splicing by the spliceosome. Annu. Rev. Biochem. 89, 359–388 (2020).
Galej, W. P. et al. Cryo-EM structure of the spliceosome immediately after branching. Nature 537, 197–201 (2016).
Wan, R., Yan, C., Bai, R., Huang, G. & Shi, Y. Structure of a yeast catalytic step I spliceosome at 3.4 Å resolution. Science 353, 895–904 (2016).
Haselbach, D. et al. Structure and conformational dynamics of the human spliceosomal Bact complex. Cell 172, 454–464.e11 (2018).
King, D. S. & Beggs, J. D. Interactions of PRP2 protein with pre-mRNA splicing complexes in Saccharomyces cerevisiae. Nucleic Acids Res. 18, 6559–6564 (1990).
Bessonov, S. et al. Characterization of purified human Bact spliceosomal complexes reveals compositional and morphological changes during spliceosome activation and first step catalysis. RNA 16, 2384–2403 (2010).
Bertram, K. et al. Structural insights into the roles of metazoan-specific splicing factors in the human step 1 spliceosome. Mol. Cell 80, 127–139.e6 (2020).
Wan, R., Bai, R., Yan, C., Lei, J. & Shi, Y. Structures of the catalytically activated yeast spliceosome reveal the mechanism of branching. Cell 177, 339–351.e13 (2019).
Schmitt, A., Hamann, F., Neumann, P. & Ficner, R. Crystal structure of the spliceosomal DEAH-box ATPase Prp2. Acta Crystallogr. D Struct. Biol. 74, 643–654 (2018).
Liu, H. L. & Cheng, S. C. The interaction of Prp2 with a defined region of the intron is required for the first splicing reaction. Mol. Cell. Biol. 32, 5056–5066 (2012).
Dziembowski, A. et al. Proteomic analysis identifies a new complex required for nuclear pre-mRNA retention and splicing. EMBO J. 23, 4847–4856 (2004).
Bao, P., Will, C. L., Urlaub, H., Boon, K. L. & Luhrmann, R. The RES complex is required for efficient transformation of the precatalytic B spliceosome into an activated Bact complex. Genes Dev. 31, 2416–2429 (2017).
Semlow, D. R., Blanco, M. R., Walter, N. G. & Staley, J. P. Spliceosomal DEAH-box ATPases remodel pre-mRNA to activate alternative splice sites. Cell 164, 985–998 (2016).
Bao, P., Hobartner, C., Hartmuth, K. & Luhrmann, R. Yeast Prp2 liberates the 5′ splice site and the branch site adenosine for catalysis of pre-mRNA splicing. RNA 23, 1770–1779 (2017).
Sloan, K. E. & Bohnsack, M. T. Unravelling the mechanisms of RNA helicase regulation. Trends Biochem. Sci. 43, 237–250 (2018).
Bono, F., Ebert, J., Lorentzen, E. & Conti, E. The crystal structure of the exon junction complex reveals how it maintains a stable grip on mRNA. Cell 126, 713–725 (2006).
Kim, S. H., Smith, J., Claude, A. & Lin, R. J. The purified yeast pre-mRNA splicing factor PRP2 is an RNA-dependent NTPase. EMBO J. 11, 2319–2326 (1992).
Warkocki, Z. et al. The G-patch protein Spp2 couples the spliceosome-stimulated ATPase activity of the DEAH-box protein Prp2 to catalytic activation of the spliceosome. Genes Dev. 29, 94–107 (2015).
Schwer, B. A conformational rearrangement in the spliceosome sets the stage for Prp22-dependent mRNA release. Mol. Cell 30, 743–754 (2008).
Chung, C. S. et al. Dynamic protein–RNA interactions in mediating splicing catalysis. Nucleic Acids Res. 47, 899–910 (2019).
Zhan, X., Yan, C., Zhang, X., Lei, J. & Shi, Y. Structure of a human catalytic step I spliceosome. Science 359, 537–545 (2018).
Zhan, X., Lu, Y., Zhang, X., Yan, C. & Shi, Y. Mechanism of exon ligation by human spliceosome. Mol. Cell 82, 2769–2778.e4 (2022).
Bertram, K. et al. Cryo-EM structure of a pre-catalytic human spliceosome primed for activation. Cell 170, 701–713.e11 (2017).
Fica, S. M., Oubridge, C., Wilkinson, M. E., Newman, A. J. & Nagai, K. A human postcatalytic spliceosome structure reveals essential roles of metazoan factors for exon ligation. Science 363, 710–714 (2019).
Gao, K., Masuda, A., Matsuura, T. & Ohno, K. Human branch point consensus sequence is yUnAy. Nucleic Acids Res. 36, 2257–2267 (2008).
Taggart, A. J. et al. Large-scale analysis of branchpoint usage across species and cell lines. Genome Res. 27, 639–649 (2017).
Berget, S. M. Exon recognition in vertebrate splicing. J. Biol. Chem. 270, 2411–2414 (1995).
Yeo, G., Hoon, S., Venkatesh, B. & Burge, C. B. Variation in sequence and organization of splicing regulatory elements in vertebrate genes. Proc. Natl Acad. Sci. USA 101, 15700–15705 (2004).
Hirose, T. et al. A spliceosomal intron binding protein, IBP160, links position-dependent assembly of intron-encoded box C/D snoRNP to pre-mRNA splicing. Mol. Cell 23, 673–684 (2006).
Yan, C. et al. Structure of a yeast spliceosome at 3.6-angstrom resolution. Science 349, 1182–1191 (2015).
Kim, S. H. & Lin, R. J. Spliceosome activation by PRP2 ATPase prior to the first transesterification reaction of pre-mRNA splicing. Mol. Cell. Biol. 16, 6810–6819 (1996).
Ohrt, T. et al. Prp2-mediated protein rearrangements at the catalytic core of the spliceosome as revealed by dcFCCS. RNA 18, 1244–1256 (2012).
Kaur, H., Groubert, B., Paulson, J. C., McMillan, S. & Hoskins, A. A. Impact of cancer-associated mutations in Hsh155/SF3b1 HEAT repeats 9–12 on pre-mRNA splicing in Saccharomyces cerevisiae. PLoS ONE 15, e0229315 (2020).
Carrocci, T. J., Zoerner, D. M., Paulson, J. C. & Hoskins, A. A. SF3b1 mutations associated with myelodysplastic syndromes alter the fidelity of branchsite selection in yeast. Nucleic Acids Res. https://doi.org/10.1093/nar/gkw1349 (2017).
Yeh, T.-C. et al. Splicing factor Cwc22 is required for the function of Prp2 and for the spliceosome to escape from a futile pathway. Mol. Cell. Biol. 31, 43–53 (2011).
Edwalds-Gilbert, G., Kim, D. H., Silverman, E. & Lin, R. J. Definition of a spliceosome interaction domain in yeast Prp2 ATPase. RNA 10, 210–220 (2004).
Cretu, C. et al. Molecular architecture of SF3b and structural consequences of its cancer-related mutations. Mol. Cell 64, 307–319 (2016).
Cretu, C. et al. Structural basis of intron selection by U2 snRNP in the presence of covalent inhibitors. Nat. Commun. 12, 4491 (2021).
Tholen, J., Razew, M., Weis, F. & Galej, W. P. Structural basis of branch site recognition by the human spliceosome. Science 375, 50–57 (2022).
Will, C. L. et al. Characterization of novel SF3b and 17S U2 snRNP proteins, including a human Prp5p homologue and an SF3b DEAD-box protein. EMBO J. 21, 4978–4988 (2002).
Bieniossek, C., Imasaki, T., Takagi, Y. & Berger, I. MultiBac: expanding the research toolbox for multiprotein complexes. Trends Biochem. Sci. 37, 49–57 (2012).
Tegunov, D. & Cramer, P. Real-time cryo-electron microscopy data preprocessing with Warp. Nat. Methods 16, 1146–1152 (2019).
Sanchez-Garcia, R. et al. DeepEMhancer: a deep learning solution for cryo-EM volume post-processing. Commun. Biol. 4, 874 (2021).
Jakobi, A. J., Wilmanns, M. & Sachse, C. Model-based local density sharpening of cryo-EM maps. eLife 6, e27131 (2017).
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D Biol. Crystallogr. 60, 2126–2132 (2004).
Terwilliger, T. C., Sobolev, O. V., Afonine, P. V. & Adams, P. D. Automated map sharpening by maximization of detail and connectivity. Acta Crystallogr. D Struct. Biol. 74, 545–559 (2018).
Studer, M. K., Ivanovic, L., Weber, M. E., Marti, S. & Jonas, S. Structural basis for DEAH-helicase activation by G-patch proteins. Proc. Natl Acad. Sci. USA 117, 7159–7170 (2020).
Christian, H., Hofele, R. V., Urlaub, H. & Ficner, R. Insights into the activation of the helicase Prp43 by biochemical studies and structural mass spectrometry. Nucleic Acids Res. 42, 1162–1179 (2014).
Hamann, F. et al. The structure of Prp2 bound to RNA and ADP-BeF3– reveals structural features important for RNA unwinding by DEAH-box ATPases. Acta Crystallogr. D Struct. Biol. 77, 496–509 (2021).
Felisberto-Rodrigues, C. et al. Structural and functional characterisation of human RNA helicase DHX8 provides insights into the mechanism of RNA-stimulated ADP release. Biochem. J. 476, 2521–2543 (2019).
Zhang, Z. et al. Molecular architecture of the human 17S U2 snRNP. Nature 583, 310–313 (2020).
Yan, C., Wan, R. & Shi, Y. Molecular mechanisms of pre-mRNA splicing through structural biology of the spliceosome. Cold Spring Harb. Perspect. Biol. 11, a032409 (2019).
Cretu, C. et al. Structural basis of splicing modulation by antitumor macrolide compounds. Mol. Cell 70, 265–273.e8 (2018).
Beusch, I. et al. Targeted high throughput mutagenesis of the human spliceosome reveals its in vivo operating principles. Preprint at bioRxiv https://doi.org/10.1101/2022.11.13.516350 (2022).
Zhang, J. et al. DHX15 is involved in SUGP1-mediated RNA missplicing by mutant SF3B1 in cancer. Proc. Natl Acad. Sci. USA 119, e2216712119 (2022).
Feng, Q. et al. Splicing quality control mediated by DHX15 and its G-patch activator, SUGP1. Preprint at bioRxiv https://doi.org/10.1101/2022.11.14.516533 (2022).
We thank C. Richardson (The Institute of Cancer Research, London) for technical support in computation; M. Raabe, S. König and A. Chernev (Max-Planck Institute, Göttingen) for mass spectrometry analysis; S. Bessonov (Max-Planck Institute, Göttingen) for technical advice in splicing biochemistry; and A. Hoskins (University of Wisconsin-Madison, USA) for critical reading of the manuscript. This work was supported by the German Research Foundation (grant DFG PE 2079/2-2 and DFG PE 2079/4-1), the Wellcome Trust (220300Z/20/Z) and the Institute of Cancer Research.
The authors declare no competing interests.
Peer review information
Nature thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Extended data figures and tables
Extended Data Fig. 1 Purification, biochemical analysis and cryo-EM image-processing workflow of BAQR.
a, Schematic depicting the MINX pre-mRNA substrate, fused to three downstream MS2 aptamers, which was used to assemble the BAQR spliceosome in the HeLa nuclear extract. b, Pre-mRNA splicing is arrested by incorporating the recombinant dominant-negative IBC/AquariusK829A in the human spliceosomes. Typical in vitro splicing reactions of the model MINX pre-mRNA substrate were set up in the presence (+IBCK829A) or absence (+buffer) of the recombinant complex added in trans, in excess over the endogenous IBC. The reactions were stopped after 0–120 min and the Cy5-labeled RNA substrate was extracted, analyzed on a denaturing polyacrylamide gel, and visualized by in-gel fluorescence. The experiment was repeated at least three times, with samples from distinct preparations, with similar results. c, Protein and RNA composition of the purified BAQR complexes. The SDS-PAGE gels were stained with Coomassie (left) and SYBR Gold (right). The purification was repeated at least three times, with similar results. For gel source data, see Supplementary Fig. 1. d, Representative cryo-EM micrograph of the stalled BAQR spliceosome and typical reference-free 2D class averages of BAQR particle images. We collected two independent datasets from two different sample preparations. e, Cryo-EM data processing schematic showing the most significant steps of the computational analysis. f–i, Global resolution of key BAQR maps (see also Extended Data Fig. 2). The FSC (Fourier Shell Correlation) analysis was carried out in RELION 3.1 (left). The angular distribution (right) of the BAQR particles contributing to the map shown on top is depicted in two different orientations, with the red color indicating a higher number of particles at a given projection angle.
Extended Data Fig. 2 Cryo-EM image-processing by local classification of the peripheral modules of the BAQR complex.
Cryo-EM data processing schematic showing the most significant steps of the computational analysis, the FSC (Fourier Shell Correlation) analysis of the obtained focused maps, and the angular distribution of the particles contributing to the final volumes.
a, Overall maps of state A and B complexes depicted together with the final model. b, Local resolution of the state A maps, estimated in RELION and visualized in ChimeraX. c, Cryo-EM density snapshots. Various subunits are colored and labeled. d, Selected snapshots of modeled BAQR subunits are depicted together with the unsharpened map of the core region of the complex (map M2). e, Structural comparison between Bact, BAQR, and C complexes. PRP2, Aquarius, SYF1, the U2 snRNA, and the pre-mRNA substrate are colored and labeled. Note the large-scale repositioning of the PRP2 RNA helicase during the conversion of Bact to BAQR, and of the helicase Aquarius during the transition of Bact/BAQR complexes to the C-stage spliceosome.
a–f, Interactions between PRP2, PRP8, SF3B1, CWC22 and SKIP. Interacting distances corresponding to polar and hydrophobic contacts are indicated as dashed lines. g, Overview of PRP2 and the composite molecular brake. PRP2’s conserved domains are color-coded. HB – helix-bundle; OB – oligonucleotide/oligosaccharide-binding fold; WH – winged-helix. h, Interactions between PPIL4, SKIP, and PRP2. Residues involved in contacts are depicted as spheres and shown in the same color as their interacting partner. i, Interactions between the wedge element of SKIP and PRP2core. j, The wedge element of SKIP appears to lock the RecA domains of PRP2. The ADP molecule is modeled by superposition with Ct PRP2 (PDB 6zm2) and is shown for the sake of orientation. k, Superposition between human PRP2 in the open conformation (observed in BAQR) and Ct PRP2 in the closed conformation (colored in teal) (PDB 6zm2). Equivalent residues of PRP2 interacting with SKIP in BAQR are shown for Ct PRP2. Structures from panels i and k are depicted in the same orientation. The red arrow indicates the movement of the RecA2-like domain during the helicase transition from the open to the closed conformation. l, Genetic interactions from yeast mapped on the structure of human BAQR. The cold-sensitive allele of Prp2pQ548N genetically interacts with the D450G and V502F substitutions of Hsh155p. Residues involved in genetic interactions in yeast (blue) are depicted as spheres and mapped on the BAQR model. m–o, Structural superposition of the human (Hs) and budding yeast (Sc) MA3 domain of CWC22, composed of HEAT repeats. The 10th HEAT repeat (residues 454–491) of Cwc22p is required for the productive function of yeast Prp2p and interacts with the “hook” motif of PRP2NTD and BUD13 in BAQR. This suggests that Prp2p, Cwc22p, and Bud13p may interact in a similar fashion in budding yeast spliceosomes, thus explaining the functional connection between Prp2p and Cwc22p.
Note that human CWC22 residues that interact with PRP2 and BUD13 are conserved in budding yeast Cwc22p (shown in n and o). The protein subunits, U2 snRNA and the pre-mRNA substrate are colored and labeled.
Extended Data Fig. 5 Estimation of the number of nucleotides translocated by PRP2 during the Bact-to-BAQR transition.
a,b, Superposition between budding yeast (PDB 7DCO) and human (PDB 5Z56) Bact complexes over equivalent residues from PHF5A. For clarity’s sake, only SF3B1, PRP2, the intron and U2 snRNA are depicted. All subunits are labeled accordingly. c, PRP2:RNA from human BAQR, as the only available structure of the human counterpart, was superimposed onto the Prp2p helicase from budding yeast Bact and shown in the same orientation as in b. Note that the conformation of the RNA strand bound by human PRP2 or budding yeast Prp2p is virtually unchanged. d, Assignment of nucleotides bound by human PRP2 in Bact based on the superposition. e, PRP2 translocates about 19 nucleotides towards the branch helix during the Bact-to-BAQR transition. PRP2, RBMX2 and PPIL4 positions on the intron in different spliceosomal complexes are depicted. f, PRP2 translocation results in a substantial change in the intron’s conformation, dissociation of RBMX2 and recruitment of PPIL4 to the intron.
a, Destabilization of SF3B6, de-structuring of SF3B1NTD, and the reorientation of PRP8’s EN (endonuclease-like, PRP8EN) and RH (RNase H-like, PRP8RH) domains upon PRP2 translocation. PRP2 is located on the plane above SF3B1HEAT and is not shown for clarity’s sake. The reactive BS-A, the guanosine of the 5’SS, and the catalytic metal ions are shown as spheres and labeled. b, Close-up view of SF3B1, the intron, and the U2 snRNA from BAQR (color-coded) and Bact (grey and black) after superposition of SF3B1’s equivalent residues. c, The binding pocket of the BS-A (primary pocket) is virtually unchanged in Bact and BAQR. d, Cycle of SF3B1 transitions in splicing. The conformational transitions of SF3B1 are depicted based on cryo-EM structures and biochemical analyses. Here we refer to the intermediates II and III as pre-A1 and pre-A2 for clarity. The key spliceosome complexes representative for the SF3B1 intermediates shown here are: 17S U2 snRNP (refs. 48,60), pre-A1 (A-like cross-exon complex bound by spliceostatin A; ref. 47), pre-A2 (ref. 48), A-to-Bact (reviewed in refs. 8,10,61), BAQR (this work), the SF3B complex (refs. 46,62). Helicases that facilitate the transitions are shown in red. The helicase DHX15, colored magenta, mediates the disassembly of kinetically-slowed complexes (e.g. formed on suboptimal introns, weak splice sites and PPTs, multiple branch sites or cryptic sites; refs. 63–65).
a, Size-exclusion chromatography (SEC) profiles of PRP2 (137–1022), PPIL4 and the mixture of the two proteins show that PRP2 (137–1022) forms a stable complex with PPIL4, in an RNA-independent manner. PRP2 (137–1022) comprises both the modeled PRP2NTD (i.e., the pin, clip, and hook elements) and the helicase core. b, SDS-PAGE gels corresponding to the fractions from a. The in vitro reconstitution experiments were repeated two times, using two independent preparations of PRP2 (137–1022) and PPIL4. c, SDS-PAGE gel of SEC fractions showing the reconstitution of a stable PRP2-PPIL4 complex in vivo by co-expression in insect cells. PRP2 (137–1022) and PPIL4 were co-expressed in Sf9 insect cells from individual baculoviruses. The complex was captured by Strep-Tactin affinity followed by SEC. The preparation was performed twice with similar results. d, The RNA-binding activity of human PRP2 in the presence of GPKOW and PPIL4. The ability of PRP2 (137–1022) and of the purified PRP2 (137–1022)-PPIL4 and PRP2 (137–1022)-GPKOW complexes to bind a Cyanine 5 (Cy5)-labeled RNA substrate was evidenced by EMSA. The RNA substrate comprised an RNA duplex, followed by a 30-nucleotide 3′ single-stranded overhang, mimicking PRP2’s spliceosome substrate observed in BAQR. The free RNA substrate was separated from PRP2-bound (or cofactor-bound) species on a 5% polyacrylamide native gel. The EMSA gels were imaged at the Cy5 excitation peak. The assays were repeated two times. e, The RNA binding activity of human PPIL4 in the absence and presence of PRP2 (137–1022). Compared to the EMSA shown in d, PPIL4 was added in a 5-fold excess over PRP2 (137–1022) and PRP2’s final assay concentrations are indicated above the last three gel lanes. Note the apparent increase in PRP2’s RNA affinity in the presence of PPIL4. The EMSAs were repeated three times. f, SEC profiles showing the formation of a stable complex between PRP2 (137–1022) and GPKOW, in an RNA-independent manner.
The SDS-PAGE corresponding to the complex purified by SEC is shown. g, The RNA-binding activity of human GPKOW assessed by an EMSA. The same RNA substrate was used as in d and the assay was repeated four times. h, The helicase activity of human PRP2 was investigated using a fluorescence-based assay. Compared to the gel-based helicase assays shown in i-l, the fluorescence-based assay employs a dual-labeled helicase substrate. Displacement of the labeled strand by the helicase leads to the formation of an intramolecular hairpin, which brings in proximity the Cy5 fluorophore and its spectrally overlapping quencher (BHQ-2). The decrease in substrate’s fluorescence upon unwinding is monitored as a function of time. Several representative fluorescence traces of PRP2, recorded under different experimental conditions and in the presence/absence of helicase cofactors, are shown together with the unwinding curves of Prp22p, used as a positive control. i, The helicase/unwinding activity of human PRP2 and of the PRP2-GPKOW complex. To assess the ability of PRP2 (137–1022) and of the purified, in vitro reconstituted PRP2 (137–1022)-GPKOW complex to unwind RNA-RNA duplexes, the purified protein samples were mixed with the Cy5-labeled helicase substrate (depicted on the right with the labeled strand colored in red) in the presence (or absence) of ATP and of a competitor DNA (green). Following a 1-hour incubation, the samples were analyzed on a native 14% polyacrylamide gel and imaged by in-gel fluorescence. Budding yeast Prp22p was used as a positive control. All gel-based helicase assays were repeated at least two times. j, The helicase activity of the in vivo reconstituted PRP2 (137–1022)-PPIL4 complex, in the absence or presence of GPKOW. k, The helicase activity of human PRP2 (137–1022) in the presence or absence of its cofactor GPKOW. Compared to the assay shown in i, the GPKOW cofactor was added to the purified helicase in a 5-fold excess. 
The concentrations indicated above the native gel represent the final assay concentrations of PRP2 (137–1022) or Prp22p. The RNA bands labeled with an asterisk represent, most likely, degradation products. l, Comparative helicase activities of the PRP2 (137–1022)-PPIL4 and PRP2 (137–1022)-GPKOW complexes. For gel source data, see Supplementary Figs. 2 and 3.
a, During the Bact (PDB 5Z57) to the BAQR transition, translocation of PRP2 in a 3′-to-5′ direction results in the destabilization of the RES complex (RBMX2, BUD13, SNIP1). In addition, SRRM1 and SF3B6/p14 are no longer observed in BAQR due to PRP2-induced SF3B1 opening. During the remodeling of BAQR and transition to the C complex (PDB 5yzg, PDB 6zym), the remaining SF3B/SF3A subunits and CWC24 are released from the branch helix and the 5’SS. The branch helix then moves to the catalytic center, bringing the BS-A and the 5’SS GU nucleotides in proximity for the branching reaction. The different human spliceosome states were structurally aligned by using the PRP8 subunit as a reference and are depicted in two different orientations. The spliceosome subunits are color-coded and shown in cartoon representation. Spliceosome subunits not undergoing significant rearrangement were omitted for the sake of simplicity. b, Repositioning of the branch helix (U2/BS) during the transition from Bact to BAQR and then to C complexes. The different spliceosome states, Bact (PDB 5Z57), BAQR (this work), and C complex (PDB 5yzg, PDB 6zym), were aligned using the PRP8 subunit as a reference. All protein subunits, except PRP8, were omitted and the RNA moieties were color-coded. The reactive BS-A and the 5’SS GU nucleotides are shown as spheres and colored red and light green, respectively.
Extended Data Fig. 9 Aquarius promotes the transfer of the BS-A to the catalytic center during BAQR-to-C complex transition, likely by inducing the “loose” to “open” conformational transition of SF3B1.
a, Lines of structural communication between Aquarius and the branch duplex in BAQR. Aquarius and PRP2 are in diametrically-opposed locations of the BAQR spliceosome. A continuous bridge of proteins is present between Aquarius and the SF3B1:branch duplex. Most of these proteins are SF3A/B subunits and PPIE. Another bridge, which primarily involves the RBM22 protein, connects Aquarius and the U6 catalytic core. The intron region not visible in the density is dashed. Aquarius is depicted in red (RecA-like domains) and light blue (accessory domains). The first nucleotide of the intron (G+1) is positioned in the catalytic center. Other subunits of the spliceosome, including SYF1 and ISY1, are not shown for the sake of clarity. b, Subunits of the C complex (PDB 5yzg) are shown in the same orientation as in a. The U6 snRNA was used as a reference for the superposition between the C and BAQR complexes. Note that PRP2 and the SF3A/B complex have dissociated and the BS-A has been relocated to the catalytic centre (see also c, below). RBM22’s orientation has remained virtually the same in BAQR and C complexes. The subunits are colored as in a and labeled. The BAQR and C complexes were superimposed over PRP8 (not shown) and U6 snRNA. c, The BS-A juxtaposed to the first nucleotide of the intron in the catalytic center of the C complex. The catalytic metal ions are shown as magenta spheres. d, Superposition between the “open” (observed in the apo SF3B complex, PDB 5IFE) and the “loose” states (observed in BAQR) of SF3B1. Except for SF3B1OPEN, all shown subunits belong to the BAQR complex. e, BAQR and the SF3B complex in its apo form (PDB 5IFE) were superimposed over equivalent residues from PHF5A (not shown in the figures d, f and g for clarity’s sake). Note that the HEAT repeats H16-H20 of SF3B1 adopt virtually identical conformations in the “loose” and “open” states, while the other helical repeats are substantially reorganized.
f–g, The BS-A’s release from the primary pocket of SF3B1 likely induces rearrangement of the HEAT repeats upon the “loose” to “open” conformational transition. Consequently, the contacts between HEAT repeats and PRP8, and those between SF3B2 (attached to the HEAT repeats) and the U6 snRNA might get disrupted, causing the complete dissociation of the SF3A/B complexes from the spliceosome.
Supplementary figures and legends for the supplementary figures, the supplementary data files and the supplementary video files.
About this article
Cite this article
Schmitzová, J., Cretu, C., Dienemann, C. et al. Structural basis of catalytic activation in human splicing. Nature 617, 842–850 (2023). https://doi.org/10.1038/s41586-023-06049-w