Main

DExH/D box proteins are defined by six conserved peptide motifs arrayed in a colinear fashion, of which the eponymous DExH/D box comprises motif II1,2,3. Most DExH/D box proteins can hydrolyze NTPs to NDPs in a reaction that is either stimulated by or entirely dependent on a nucleic acid cofactor. Some DExH/D box proteins also possess intrinsic RNA or DNA helicase activity. Mutational studies, in conjunction with the crystal structures of several helicases, implicate conserved motifs I (GxGKT), II (DExH/D) and VI (H/QRxGRxGR) in NTP hydrolysis whereas motif III (S/TAT) couples the chemical reaction to the mechanical steps of duplex unwinding4,5,6,7,8,9,10,11,12. Given that RNAs are often bound to proteins in the cell, the question arises whether DExH/D proteins are capable of disrupting protein–RNA interactions and what the biological ribonucleoprotein targets for DExH/D NTPases may be.

A DExH helicase can displace an RNA-bound protein

The exemplary DExH-box RNA helicase is vaccinia virus NPH-II. NPH-II is essential for virus replication and plays a role in mRNA transcription by enzymes encapsidated in the virion9,13. NPH-II can processively unwind long stretches of duplex RNA in a 3′ to 5′ direction with a kinetic step size of about one-half helical turn14,15. NPH-II is thought to unwind by translocating along the `template' RNA strand to which it initially binds and displacing in its wake either DNA or RNA strands annealed to the RNA. In a recent report, Jankowsky et al.16 ask whether NPH-II is also capable of dislodging proteins from RNA. They designed a substrate that contains a high-affinity binding site for a well characterized RNA-binding protein, the U1A protein, embedded within a duplex region (Fig. 1). NPH-II unwound the RNA whether or not U1A was present, showing that the helicase was not impeded by RNA loops (caused by the inclusion of the U1A binding site) or by the bound U1A. Was displacement of U1A an active process or simply a reflection of passive dissociation of U1A from the RNA? The dissociation rate for U1A was increased by several orders of magnitude when NPH-II plus ATP were present. Detailed kinetic analyses showed that although NPH-II appears to `stumble' when it encounters the U1A protein, it maintains its processivity.

Figure 1: Schematics of a synthetic RNP substrate16.
figure 1

The RNA binding protein is U1A, which is a component of the U1 small ribonucleoprotein particle (U1 snRNP) and it also binds its own pre-mRNA and inhibits polyadenylation of its own pre-mRNA. A high affinity binding site for the U1A protein, from the 3′ untranslated region of U1A pre-mRNA, is embedded within duplex regions16. U1A protein (indicated in blue) binds as a dimer to the two asymmetric loops55. A 3′ single-stranded overhang serves as a binding site for the RNA helicase NPH-II (indicated in purple). NPH-II binds to the 3′ ss tail and translocates on the RNA in a 3′ to 5′ direction, powered by ATP hydrolysis.

Even though the experimental scenario is not physiological (that is, NPH-II is unlikely to encounter U1A as a target in vivo), this study clearly demonstrates that a DExD/H box RNA helicase can function to displace proteins from RNA. Although often speculated upon17,18, such an `RNPase' function (a term coined by Jankowsky et al.16) had never been demonstrated for any DExH/D box protein. The RNPase activity of NPH-II provides clues to how other DExH/D box proteins may act upon complex ribonucleoprotein assemblies such as the spliceosome.

DExH/D box proteins and the spliceosome

The spliceosome is formed by the ordered assembly of U1, U2, and U4/U6/U5 snRNPs and various non-snRNP proteins onto the pre-mRNA (Fig. 2). Intron removal occurs by two sequential transesterification reactions. In step 1, the 5′ splice site is cleaved and a branched lariat intermediate is formed. In step 2, the 3′ splice site is cleaved and the two exons are joined19. Whereas the hypothesis that the chemical steps are catalyzed by snRNAs remains plausible, it is clear that proteins are required to assemble, modulate and disassemble the catalytic core20,21,22. Phylogenetic, genetic and biochemical studies illuminate a dynamic network of RNA–RNA and RNA–protein interactions necessary for splice site recognition and accurate positioning of the reactive nucleotides for catalysis18,23,24. The challenge now is to determine in molecular detail how these RNA–RNA and RNA–protein contacts are crafted and remodeled.

Figure 2: The splicing pathway.
figure 2

Stages of spliceosome assembly, the two transesterification steps and release of mRNA (disassembly) are shown. Small nuclear ribonucleoprotein particles (U1, U2, U4, U6, U5 snRNPs) assemble onto the pre-mRNA in an ordered manner. U1 binds to the 5′ splice site (5′ ss), U2 to the branchpoint sequence element. U4/U6/U5 enter the spliceosome as a tri-snRNP. U1 and U4 snRNPs are destabilized prior to step 1 transesterification. Please note that non-snRNP proteins, which associate with the pre-mRNA are omitted for simplicity (see refs 19 and 20 for more details). The DExH/D proteins that act at the various ATP-dependent stages, are indicated on the right. Prp43p is thought to play a role in the disassembly after mRNA is released56, but it is not clear whether ATP is required for this reaction.

Eight DExD/H box proteins are important for pre-mRNA splicing in yeast or mammalian systems19,20: Prp5p, Brr2p, Prp28p, UAP56/Sub2p, Prp2p, Prp16p, Prp22p and Prp43p. The DExH NTPases Prp2p, Prp16p and Prp22p act sequentially at distinct steps in the splicing pathway (Fig. 2). ATP hydrolysis by Prp2p propels the first transesterification reaction25,26. NTP hydrolysis by Prp16p is essential for the second transesterification step27. The ATP-dependent helicase activity of Prp22p is crucial for the release of mature mRNA from the spliceosome10. Conformational changes in the spliceosome accompany these steps. For example, Prp16p effects a rearrangement at the 3′ splice site (3′ ss) and Prp22p activity leads to the disassembly of the spliceosome28,29,30. However, the immediate molecular targets of Prp2p, Prp16p and Prp22p action within the spliceosome are not defined.

Four recent reports shed light on the functions of two DExD proteins, Prp28p and Sub2p, in early steps of spliceosome assembly31,32,33,34. They demonstrate that the essential functions of Prp28p and Sub2p can be bypassed in vivo by mutations in the U1C protein (or the U1 snRNA) or a deletion of the Mud2 protein, respectively, thereby implicating RNA-bound proteins as key components of the targets upon which Prp28p and Sub2p act31,32.

Change of pairing partners at the 5′ splice site. In a genetic screen for suppressors of the cold sensitive (cs) phenotype of the prp28-102 strain35, Chen et al.31 identified dominant mutations in the gene encoding the essential U1C protein. U1C contains an N-terminal zinc finger motif that is important for its function36,37. Each of three suppressing mutations mapped to a single leucine residue (Leu 13) in the zinc finger. Changing Leu 13 to all other 19 natural amino acids showed that all but six variants could suppress the cs phenotype of prp28-102. Remarkably, the suppressor strains readily lost prp28-102, arguing that the U1C[L13] mutants could bypass the essential function of the PRP28 gene product. U1C protein is important for the stable association of the U1snRNP with the 5′ splice site (5′ ss) sequence36,38. The U1C[L13] suppressors likely affect this stabilizing function because the requirement for Prp28p can also be bypassed by certain mutations in U1 snRNA that destabilize the U1/5′ ss duplex. The disruption of the U1/5′ ss helix allows for contacts of the 5′ ss with U6 snRNA, an interaction that is essential for accurate 5′ cleavage site selection23 (Fig. 3).

Figure 3: An RNA switch at the 5′ splice site.
figure 3

The consensus sequences are those from yeast19. The cleavage site is indicated by an arrow at the exon/intron border. Yeast U1 snRNA forms base-pair interactions with the 5′ ss. U1-C protein can be crosslinked to the pre-mRNA and the green star shows its relative position57. The U1/5′ ss interaction is disrupted at the expense of a mutually exclusive interaction between U6 snRNA and the 5′ ss23. ATP hydrolysis is necessary, and the two DExH/D box proteins Prp28p and Brr2p act in this switch. The relative position in the intron (+7/+8 with respect to the site of cleavage) to which hPrp28p can be crosslinked41 is indicated (pink flash).

Previous studies had implicated Prp28p in this switch of pairing partners with the 5′ ss39. However, because the switch from U1/5′ ss to U6/5′ ss is closely linked with unwinding of the extensively base-paired U4/U6 snRNAs and the necessity for another DExH protein, Brr2p, it had not been feasible to define the point of action of Prp28p exactly (Fig. 3). The study by Chen et al.31 suggests a direct role for Prp28p in disrupting the U1/5′ ss helix. But, how is Prp28p positioned for action? Does it interact with U1C, with RNA or both?

The human homolog hPrp28p is an integral component of the U5 and U4/U6/U5 snRNPs and thus likely to enter the spliceosome together with the tri-snRNP complex40. Ismaili et al.41 have used site-specific crosslinking to identify hPrp28p as a protein that binds to intron sequences at the 5′ ss. Kinetic analyses indicated that hPrp28p remains bound to the pre-mRNA throughout the splicing pathway. The site of the crosslink in the protein was mapped near motif III (TAT). In other DExH/D box helicases, including Prp22p, NPH-II, hepatitis C virus NS3 and mammalian eIF-4A, motif III has been shown to couple the energy of ATP hydrolysis to duplex unwinding4,8,9,10. Yeast Prp28p can hydrolyze ATP in an RNA-stimulated fashion42 and it will be interesting to determine if and how Prp28p uses ATP hydrolysis during the splicing reaction.

Handing off the branchpoint sequence to U2 snRNP. The 5′ splice site is recognized at least twice — initially by U1 snRNP and then U6 snRNA — prior to the first catalytic step. Similarly, the branchpoint sequence (BPS) and 3′ splice site (3′ ss) region are contacted initially by proteins, and then the BPS interacts with U2 snRNA (Fig. 4). The branchpoint binding protein BBP/SF1 binds to the BPS in cooperation with U2AF65 or its yeast homolog Mud2p, which bind to a pyrimidine-rich element between the BPS and the 3′ ss20,43,44,45. The bound proteins are replaced in the pre-spliceosome by U2 snRNA. The switch of pairing partners at the BPS requires essential DExH/D box proteins.

Figure 4: Switch of pairing partners at the branchpoint/3′ splice site region.
figure 4

Shown are the yeast branchpoint and 3′ ss consensus sequences19. The BPS/3′ ss region in the intron is recognized by BBP and Mud2p (SF1/mBBP and U2AF65, respectively in the mammalian system20). Interaction of the U2 snRNA with the BPS requires the DEAD ATPase Prp5p and the DExH protein Sub2p.

The yeast ATPase Prp5p is thought to modulate RNA structure in the U2 snRNP to make it accessible for interaction with the BPS46,47. The ATP requirement for U2 association (but not the requirement for Prp5p per se) can be bypassed by deletion of the non-essential U2 snRNP protein CUS2. Perriman and Ares48 suggest that ATP stimulates the rate of U2 snRNP recruitment to the spliceosome, in a manner that is controlled by CUS2 and the U2 snRNA structure. A mammalian counterpart for Prp5p has been identified (C. Query, pers. comm.), and additionally, mammalian UAP56, which was identified through its physical interaction with U2AF65, is essential for stable U2 snRNP association49.

Three groups32,33,34 have now identified the DExD box protein Sub2p as the yeast homolog of UAP56. Strong evidence that UAP56 and Sub2p are functional homologs is provided by the finding that UAP56 can support growth of a lethal sub2Δ mutant34. Kistler and Guthrie32 postulated that if Sub2p is involved in the change of pairing partners at the BPS, its essential function might be to antagonize Mud2p and/or BBP, which must be displaced before U2 snRNP can gain access (Fig. 4). They found that deletion of the non-essential MUD2 gene rescued the lethality of a sub2Δ strain. Given that Sub2p is a member of the DExD box family, one can speculate that Sub2p hydrolyzes ATP to dislodge Mud2p. Zhang and Green34 find that mutations of Sub2p at positions corresponding to amino acids essential for ATP hydrolysis in other DExH/D box proteins inactivate Sub2p in vivo. However, it remains to be demonstrated that Sub2p has NTPase activity and, if so, whether Sub2p requires its ATPase activity to displace Mud2p (and possibly BBP) or to effect any other conformational changes during spliceosome assembly. In vitro studies using extracts from different sub2 mutants suggest a role for the protein at several stages during spliceosome formation32,33,34.

Prp28p function is not rate-limiting in vivo when the U1/5′ ss interaction is weakened and Sub2p is not essential when the Mud2 protein is absent. However, the bypass strains fail to grow at low temperatures30,31. A detailed analysis of pre-mRNA splicing in the bypass strains may uncover intermediate steps that may otherwise not be apparent, much in the way in which conditional mutants have been instrumental to define the splicing pathway at its current level of resolution18,19. These strains also offer an opportunity to separate the ATP requirements of Prp28p and Sub2p from those of other DExH/D box proteins that act at nearby steps. For example, Prp5p functions temporally near Sub2p (Fig. 4) and Brr2p acts close to Prp28p (Fig. 3).

Thus, the recent findings by Chen et al.31 and Kistler and Guthrie32 provide the means to dissect the splicing reaction in greater detail and they clearly define specific ribonucleoproteins within the spliceosome as targets for DExH/D ATPases.

`Non-helicase' DExH box ATPases

Exemplary DExH ATPases that lack intrinsic helicase function but act nonetheless to remodel protein nucleic acid complexes include the chromatin remodeling factor SWI2/SNF2 and vaccinia virus transcription termination factor NPH-I. NPH-I acts on the RNA polymerase ternary complex to effect the release of nascent mRNA chains in response to an upstream termination signal in the RNA. The ATPase activity of NPH-I, which is absolutely dependent on single-stranded DNA, is essential for termination50,51. It is proposed that ATP hydrolysis is triggered by interaction of NPH-I with single stranded DNA in the transcription bubble.

SWI2/SNF2 enzymes, often as part of a large multiprotein complex, elicit structural alterations in the DNA–histone complex to make the DNA more accessible to transcription factors and the DNA repair machinery52. The DNA-stimulated ATPase activity of SWI2/SNF2 is crucial to its function and conserved residues in motifs that distinguish this DExH family are important for in vivo and in vitro activity53,54.

Common themes?

RNPase activity can now be added to the functional repertoire of DExH/D box proteins along with NTP hydrolysis (the common denominator) and, in some cases, RNA unwinding. Is there a unifying mechanistic theme to all DExH/D box proteins or are there fundamental differences in how energy is used? The question can be reduced to the issue of whether all DExH/D box NTPases can translocate along RNA, powered by NTP hydrolysis, and effectively act as molecular cowcatchers (as on a locomotive) to sweep proteins and/or RNA duplexes out of their way. Some DExH/D proteins may translocate, yet fail to score in a conventional in vitro RNA helicase assay, which typically demands unwinding of RNA duplexes of 15–30 base pairs. A translocation mechanism could accommodate the emerging models that implicate DExH/D box proteins as catalysts of conformational changes in the spliceosome18,31,32.

The NPH-II model of RNPase activity establishes proof-of-principle, but it does not resolve a fundamental question of whether protein displacement is necessarily coupled to RNA unwinding, because the U1A binding site is embedded within an RNA duplex16. The challenge for future studies is to devise substrates and assay systems to measure protein translocation along an RNA molecule independent of duplex unwinding.

An alternative model that invokes fundamental mechanistic differences among various DExH/D box proteins could entail direct binding of the DExH/D box NTPase to an RNA-binding protein (or next to it on RNA), leading directly to its displacement without translocation.

A biochemical characterization of each DExH/D box family member with respect to NTP hydrolysis and nucleic acid unwinding parameters (directionality, processivity, and so forth) is necessary to draw inferences about in vivo mechanisms. Valuable insights have been gleaned from the crystal structures of a helicase domain of the hepatitis C virus NS3, a DExH box protein11,12. Structural analyses of other members of the family may help resolve unanswered questions regarding cofactor and nucleotide specificity and directionality of unwinding/translocation.

In summary, functions other than NTP hydrolysis cannot be attributed to DExH/D box proteins based solely on sequence conservation. Even highly related proteins, for example Prp22p and Prp2p or NPH-II and NPH-I, can differ in their biochemical properties (helicase versus non-helicase)6,25,29,30,50. The ongoing identification of natural targets and cofactors for DExH/D box proteins will be critical to understand their biological functions.