The author is at the HIV Drug Resistance Program, National Cancer Institute-Frederick, P.O. Box B, 1050 Boyle Street, Building 535, Room 211, Frederick, Maryland 21702-1201, USA. rein@ncifcrf.go
Retrovirus particles contain a dimeric form of their genomic RNA. A recent study suggests a mechanism that might underlie the selective incorporation of dimeric RNAs into assembling virus particles.
The genomic RNA of retroviruses, like that of many RNA viruses, leads a double life. On the one hand, the retroviral RNA must be incorporated into nascent virus particles in order to carry the viral genetic information to the new host cell. On the other hand, it serves as the mRNA for the Gag and Gag-Pol proteins within the virus-producing cell. How are these disparate roles balanced and coordinated? An article in a recent issue of Nature1 describes structural studies on components of Moloney murine leukemia virus (MLV) that lead the authors to propose an elegant solution to this problem. The results show that a viral protein involved in particle formation binds selectively to a region of the RNA that is exposed in packaged RNA, but not in mRNA.
The assembly of a retrovirus particle is mediated by a single viral gene product, the Gag protein. Gag normally incorporates the genomic RNA into the nascent particle by recognizing a cis-acting signal, termed , present in the RNA. The structure of and the molecular details of this recognition are poorly understood at present, but is generally believed to consist of a small number of stem-loop structures near the 5' end of the viral RNA2.
One striking feature of the packaged genomic RNA is that it is dimeric within the virus particlethat is, it is present in two identical copies joined by a limited number of Watson-Crick base pairs. The redundancy of the information provides insurance against breaks or other damage that would be fatal in a single copy. When the virus enters a new host cell and the RNA is copied into DNA by reverse transcriptase, the enzyme can jump from one copy to the other during DNA synthesis. This template-switching during reverse transcription can result in genetic recombination, an important source of genetic variation for retroviruses. In essence, retroviruses are the simplest diploid organisms.
The exact structure of the linkage between the two RNAs in the dimer is not known. The situation is complicated by the fact that the RNA-RNA linkage is different in 'immature' particles, which have just been released from the cell, from that in 'mature' particles, in which the viral proteins have been cleaved by the viral protease3. Nevertheless, when partially denatured RNAs from mature particles were examined by electron microscope many years ago, they were found to be joined near their 5' ends; thus, at least in RNAs from mature virus particles, the most stable linkage is in this region of the RNA4,
5. This region has sometimes been called the dimer linkage site (DLS).
A common feature of DLS regions is the presence of stem-loops (SLs) with four- or six-base palindromic sequences in the loops ('kissing loops'), and there is extensive evidence that these structures contribute to the dimerization of short RNAs6,
7,
8. Thus, one simple, attractive scenario for dimerization is that the first step is base-pairing between the loops of two RNA monomers; this paired structure has frequently been referred to as a 'kissing complex' (Fig. 1). Such a complex might be converted during virus maturation to a more stable 'extended dimer,' in which bases in the stems, as well as those in the loops, are paired intermolecularly rather than intramolecularly (Fig. 1c).
(a) The secondary structure of MLV bases 278−374 in monomeric form. (b) The kissing complex, in which DIS-2 has undergone a register shift, exposing the palindrome AGCU in the loop (blue). DIS-2 from two monomers is shown with intermolecular base-pairing between the loops. Note that the register shift has also exposed bases 304−309 (red). (c) The extended dimer, in which the stems, as well as the loops, of the DIS-2 elements of the two monomers are base-paired intermolecularly.
The fact that and the DLS are in the same region of the viral RNA suggests the possibility that dimerization and packaging are intimately related to each other. In other words, perhaps dimerization of the RNA is a prerequisite for packaging. In fact, when packaging of viral RNA is reduced, either by actinomycin D treatment of the cells9 or by changes in the structure of the RNA10, the genomic RNA in the released particles is still dimeric, even when the majority of particles do not contain genomic RNA; this is strong evidence that the packaged genomic RNA molecules enter the assembling particles in pairs, rather than singly. Furthermore, one striking exception to this rule is found in HIV-1 RNAs that have been engineered to contain two tandem copies of the region: these RNAs are packaged into virus particles as monomers11. These results are all consistent with the idea that the real packaging signal is only present in dimeric RNAs (or in RNAs with a pseudo-dimeric three-dimensional structure11).
Extensive genetic analysis suggests that MLV contains three stem-loops, designated SL-B (dimerization initiation site 2 or 'DIS-2' in the current work), SL-C and SL-D. The loop in DIS-2 is palindromic, and this structure clearly plays an important role in the dimerization of short MLV-derived RNAs12. Some years ago, structures of short RNAs containing these three stem-loops were investigated by chemical structure-probing (in which the reactivity of individual bases toward various chemical agents was measured)13. It was found that the bases in the stem of DIS-2 shift register when the RNA dimerizes (Fig. 1a,b). This shift exposes the palindromic sequence, AGCU, which becomes the loop in the dimeric form; this conformation enables the two monomers to form the kissing complex (Fig. 1b). In addition, the shift moves the six bases UAUCUG (nucleotides 304−309 of MLV) from the 3' end of the stem, in which they are paired with nucleotides 278−282, to the region between DIS-2 and SL-C. In other words, bases 304−309 are paired in the monomeric RNA, but not in the dimer. This fact is a key element in the ideas now put forth by D'Souza and Summers1.
In a previous study, D'Souza and Summers and their colleagues determined the three-dimensional structure of a 101-base RNA molecule representing all three of these stem-loops by NMR14. Although the RNA molecule used in those studies was altered to prevent dimerization, it was designed to mimic the conformation of one monomer in a dimer (Fig. 1c). They found that SL-C and SL-D stack together as an extended helix; DIS-2 is connected to this helix by a flexible linker, consisting of the four bases UCUG (MLV nucleotides 306−309). Thus, the three-dimensional structure of this RNA completely confirms the unpaired nature of these bases in the dimer structure.
One of the cleavage products formed from Gag during virus maturation is nucleocapsid (NC), a very small, very basic protein with one or two zinc fingers that may mediate Gag interactions with nucleic acids. NC binds with high affinity and stoichiometrically to MLV sequences only when all three stem-loops are present15. To gain insight into the specific interaction of MLV Gag with the element in MLV genomic RNA, D'Souza and Summers have now determined the structure of the complex between MLV NC and the 101-base RNA previously described1. They found that NC binds specifically to the UCUG sequence linking DIS-2 with the SL-C−SL-D stack (Fig. 2). In fact, the binding of NC to the RNA could be completely ascribed to its affinity for the unpaired UCUG bases, as was confirmed first with a series of truncated RNAs and finally by isothermal titration calorimetry with the hexanucleotide rUAUCUG. The dissociation constants obtained from these titrations are in the range of 100 nM.
Figure 2. Structure of the complex between MLV NC and an RNA representing MLV bases 278−374.
NC (gray) binds to bases 306−309 (red), the flexible linker between DIS-2 and stem-loops C and D.
Because the UCUG is only unpaired and available to NC when the RNA is in the dimeric conformation, these observations suggest an elegant mechanism explaining the exclusive packaging of dimeric RNA. It will be of great interest to test, in biological systems, the possibility that these four bases are a critical element in . However, the authors acknowledge the likelihood that will not turn out to be localized in a single site. Attempts to define by genetic means have not been successful at pinpointing it to a small number of bases. In fact, the region contains several UCUG sequences; perhaps they will be recognized in concert by several Gag molecules at the first stage of RNA packaging. Furthermore, an additional stem-loop ('DIS-1' in the present work) in the 5' region of the MLV genome seems analogous in many ways to DIS-2: it can also shift register to expose a palindromic sequence in the loop (there is evidence that DIS-1 can contribute to dimerization of short RNAs in vitro and of packaged MLV RNA16,
17), and this register shift, like that in DIS-2, exposes the sequence UAUCUG.
It should be noted that a somewhat similar proposal has also been made for HIV-1 RNA18. Thus, it seems possible that a conformational switch between monomers and dimers, exposing a crucial sequence only in the dimeric RNA, will eventually be recognized as a general mechanism for the packaging of dimeric RNAs by retroviruses.