Previous Article | Next Article ![]()
Journal of Virology, February 2005, p. 1824-1835, Vol. 79, No. 3
0022-538X/05/$08.00+0 doi:10.1128/JVI.79.3.1824-1835.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Molecular Genetics and Microbiology, University of New Mexico Health Sciences Center, Albuquerque, New Mexico
Received 27 May 2004/ Accepted 21 September 2004
|
|
|---|
|
|
|---|
The initial steps in hantavirus infection require binding of the G1 and/or G2 protein to ß3 integrin (9) or other cell surface receptors, followed by virus entry and uncoating. After entry, L-protein mediates transcription of minus-strand RNA, resulting in plus-stranded mRNA in the cytoplasm, apparently with an orthomyxovirus-like cap-snatching mechanism (7, 8, 16). Following viral mRNA translation, transcription shifts from mRNA to plus-strand complementary RNA and de novo minus-strand viral RNA synthesis concomitant with the formation of ribonucleoprotein structures (7, 16, 37). The ribonucleoproteins appear to be composed of viral RNA, N protein, and presumably viral polymerase and accumulate on the cytoplasmic side of intracellular membranes, possibly through interaction with the G1 ands G2 proteins (10, 28). Additional evidence suggests that hantavirus assembly can occur at the plasma membranes of infected cells (10, 32). It has also been shown that transport of newly synthesized ribonucleoproteins to the plasma membrane, where Black Creek canal hantavirus is assembled, could be mediated through actin filaments (33).
The viral N protein is central to the hantaviral replication cycle. The activity and biological function of N protein in the viral life cycle are proposed to depend on differential interaction with minus-strand viral RNA, plus-strand complementary RNA, and the mRNA. Cell culture-based experiments with bunyamwera virus indicate that the formation of intracellular RNA-nucleocapsid complexes is specific for full-length viral RNA or complementary RNA molecules, since no mRNA molecules are observed in nucleocapsids (18), indicative of a role for N during both complementary RNA and viral RNA synthesis. The encapsidation process must necessarily result in the preferential packaging of viral RNA into nucleocapsids destined for particle exit.
Electron microscopic analysis indicates that the termini of the phlebovirus Uukuniemi virus interact to form circular structures that dissociate under denaturing conditions (14). Furthermore, such circular RNA structures for other members of the Bunyaviridae have also been observed in the presence of nucleoprotein complexes, further suggesting that intramolecular association of the viral RNA termini forms in vivo (25, 29, 31). All members of the Bunyaviridae family contain terminal nucleotide sequences that could facilitate the association of termini through the formation of short panhandles. Moreover, the termini of both viral RNA and complementary RNA may undergo substantial antiparallel base-pairing, so both the negative- and positive-sense RNAs could potentially form panhandles of similar stability.
Biochemical and genetic methods indicate that N protein forms trimers that are ostensibly subunits that form the capsid. Based on analysis of peptides derived from Sin Nombre hantavirus, the C-terminal half plus the N-terminal 40 amino acid residues of N protein are responsible for intermolecular N-protein association (1). Similarly, analysis of protein-protein interaction with truncated N mutants in conjunction with a mammalian two-hybrid assay indicates that N-terminal amino acids 1 to 43 are involved in and C-terminal amino acids 393 to 398 (VNHFHL) are essential for homotypic interactions in N protein homodimers and homotrimers in Tula hantavirus (20).
In vitro binding studies have been used to examine the specificity of the interaction between N and its RNA with bacterially expressed, hexahistidine-tagged N from various species from the hantavirus genus (12, 39, 40), the bunyavirus genus (26), and the tospovirus genus (35). Based on this work, hantavirus and bunyavirus N proteins appear to preferentially interact with a target site near the 5' end of the viral RNA. Recently, it was observed that trimeric glutathione S-transferase (GST)-tagged N preferentially binds to the panhandle, an RNA substrate composed of both the 5' and 3' ends (23). This specificity for the panhandle appears to be a characteristic of the trimer, as a heterogenous population of N composed of a population of N molecules of mixed subunit composition does not discriminate between the 5' end and the panhandle. The region of hantavirus N required for RNA binding appears to be situated in the central conserved region from amino acids 175 to 217 (43). However, for the tospovirus genus, the data are consistent with the presence of multiple RNA binding domains (35).
Here we examined the characteristics of the panhandle RNA required for recognition by hantavirus N protein. Recognition of the panhandle by purified trimeric N requires specific features of the panhandle, as reflected by both a requirement for secondary structure and conserved nucleotide sequence. Moreover, binding of the trimeric N protein to the panhandle is an entropy-driven process resulting in the recruitment of one trimer per panhandle and results in a conformational change in the protein. This specific recognition is likely to be a key step in the hantaviral RNA encapsidation process and general to members of the Bunyaviridae family.
|
|
|---|
Expression and purification of trimeric hantavirus GST-N protein. Sin Nombre virus nucleocapsid protein derived from strain 77734 (2) was expressed as an N-terminal GST fusion protein in Escherichia coli. This protein was denatured and renatured, and stable trimeric GST-N was purified on 10 to 60% sucrose gradients as described in detail previously (23). Freshly prepared trimeric GST-N protein was used for all experiments.
Preparation of RNA substrates. Wild-type S segment panhandle RNA containing 32 nucleotides from both the 5' and 3' ends of Sin Nombre virus S segment viral RNA separated by a loop of six uracil residues was synthesized by in vitro transcription with T7 polymerase. We used a single-stranded template sequence (5'-TAGTAGTAGACTCCTTGAGAAGCTACTCGACAAAAAATTCTTGTTGCTTTTCAAGGAGCATACTACTACTATAGTGAGTCGTATTAGCTAG-3') containing a T7 promoter juxtaposed to the sequence coding for S segment RNA from Sin Nombre virus strain R11 (4). This single-stranded sequence was amplified by PCR with the primers 5'-CTAGCTAATACGACTCACTATAGTAGTAG-3' and 5'-TAGTAGTAGACTCCTTGAGAAGCT-3'. The PCR product was gel purified before use in the T7 transcription reaction.
For the synthesis of the various mutants, we employed the same strategy, incorporating priming oligonucleotides with the appropriate nucleotide changes. The necessary primers or template containing the desired mutations was amplified with PCR, and the amplified DNA was gel purified and used directly in T7 transcription reactions. [32P]CTP-radiolabeled transcripts were produced from different templates according to the manufacturer's protocol with T7 transcription reagents (MBI Fermentase). After transcription, the reaction mixture was digested with DNase I to remove the DNA template. Purification of RNA was performed with Trizol reagent (Qiagen). Purified RNA was stored at 20°C in 20-µl aliquots for up to 2 weeks.
Filter binding assays. Binding reactions were carried out as described previously (23) in binding buffer at a constant concentration of RNA with increasing concentration of purified trimeric GST-N. The reaction mixture was incubated at room temperature for 45 min and filtered through nitrocellulose filters under vacuum. The filters were washed with 10 ml of binding buffer and dried, and the retained RNA was measured with a scintillation counter. Nonspecific retention of RNA was monitored by filtering the complete reaction mixture in the absence of protein. Dissociation constants were calculated by fitting the experimental data points in to a hyperbolic equation with Origin 6 (Microcal). The apparent dissociation constant (Kd) corresponds to the concentration of N protein required to obtain half saturation, assuming that complex formation obeys a simple bimolecular equilibrium.
Competition experiments were performed with filter binding assays. A constant concentration of hantavirus GST-N protein (100 nM) was incubated with a constant concentration of [32P]CTP-labeled wild-type panhandle RNA (0.01 nM) and increasing concentrations of competitor RNA. The reaction mixture was incubated at room temperature for 45 min and then filtered through nitrocellulose filters under vacuum as described for the filter binding assay above. The bound wild-type panhandle RNA was plotted against the competitor RNA concentration. Data points were fit by nonlinear curve fit analysis with the Origin 6 program (Microcal).
RNA secondary structure analysis.
We examined multiple suboptimal secondary structure plots to identify alternative structures that might be nearly as stable as those shown in Fig. 1 and 2. In addition, we determined the P-num values for each of the nucleotides in the RNAs. The P-num value of a nucleotide is a representation of the number of potentially stable pairing partners for that nucleotide elsewhere in the same RNA. In particular, P-num is a predictive measure of pairing fidelity at suboptimal free energy values (
G), and bases with low values can be used parametrically to identify motifs with the highest probability of assuming similar configurations among a series of energetically related structures (17). Thus, when viewed from the context of suboptimal folds, the structures composed of low P-num nucleotides still form and are still composed of bases with the fewest alternative pairing partners (low P-num) (27).
![]() View larger version (29K): [in a new window] |
FIG.1. RNA binding profile of trimeric GST-N with the viral and complementary panhandles corresponding to the S, M, and L segments. (A) The nucleotide sequence and highly probable secondary structure of each of the RNAs is shown at the left. For each of the panhandle sequences, the nucleotides common to the both the complementary RNA (cRNA) and its antecedent viral RNA (vRNA) are shown in grey, while the nucleotides of the complementary RNA that vary relative to the viral RNA are shown in black. To generate viral RNA (randomized), the 5' nucleotides of the S segment RNA were randomized and the 3' nucleotides were chosen such that the secondary structure of the RNA would be the same as that of the viral RNA panhandle. Nucleotides that vary relative to those of the viral RNA panhandle are shown in black. (B) The RNA binding profiles as a function of increasing purified trimeric GST-N are shown in the right part of the figure. Increasing concentrations of purified trimeric GST-N were incubated with a constant concentration of 32P-labeled RNA and protein-RNA complexes were isolated by filter binding as described in the Materials and Methods section. The data are plotted to show the binding profile at relatively high concentrations of trimeric GST-N and at 80 mM salt to display binding to complementary RNA and viral RNA (randomized). To determine the dissociation constants for the viral RNAs, the data were replotted to optimize visualization and analysis at lower trimeric GST-N concentrations. The data shown are from one experiment. However, each of the RNAs was analyzed in three separate experiments, and the measured Kds with standard deviations are shown in Table 1.
|
![]() View larger version (37K): [in a new window] |
FIG.2. Nucleotide variants in the S segment RNA panhandle. RNAs were synthesized as described in the Materials and Methods section. (A) Probable secondary structures of a set of RNAs containing nucleotide substitutions. Substitutions are shown in black. The RNAs shown in B are secondary structures of S segment viral RNA (vRNA) panhandles of decreasing length, as indicated. Panhandle RNAs containing additional nucleotides at the 5' or 3' terminus or both are shown in C. Additional nucleotides are indicated in black. The likely secondary structure of each of the RNAs was assessed by both P-num analysis and examination of alternative structures (Materials and Methods). The secondary structures of all of the RNAs except RNA 14 are highly probable. The three substitutions in RNA 14 would be expected to substantially disrupt base pairing such that the terminal two nucleotides would be unpaired in a substantial fraction of the RNA population.
|
G of approximately 30 kcal/mol, whereas the next most likely structure has a negative free energy that is more than 70% of this low minimum free energy. Moreover, the P-num values for most of the nucleotides that constitute the panhandle is either 1 or 0, where paired nucleotides have a value of 1 whereas unpaired nucleotides have a value of 0. Thus, it is highly unlikely that RNAs of alternative secondary structure to that shown constitute a significant proportion of the RNA population. Calculation of binding stoichiometry. The binding stoichiometry (expressed in terms of number of N molecules bound per RNA panhandle) was estimated from the intersection of two straight lines of a least square fit plot of the increase in bound RNA against the ratio of input concentrations of GST-N protein and RNA (24).
Fluorescence spectroscopic analysis. Structural alterations in the GST-N protein trimer due to its binding with different RNA molecules were monitored by quenching the tryptophan fluorescence of the N protein trimer with acrylamide in the presence and absence of RNA according to the Stern-Volmer equation (22) F0/F = 1 + Ksv[Q], where F0 and F are the fluorescence intensities in the absence and presence of quencher, respectively, [Q] is the concentration of quencher, and Ksv is the Stern-Volmer quenching constant. Fluorescence quenching measurements of tryptophan in N protein with acrylamide quencher were made by serial addition of small aliquots of concentrated acrylamide solution to 1 ml of sample in a cuvette. If the plot of F0/F versus [Q] is linear, which indicates a single class of fluorophores equally accessible to acrylamide, the value of Ksv can be calculated from the slope of straight line fit to the data points.
Analysis of thermodynamic parameters.
Thermodynamic parameters
H (van't Hoff enthalpy),
S (entropy), and
G (free energy) for the association of GST-N protein trimer with wild-type panhandle RNA were calculated with the equations ln Kapp =
H/RT +
S/R and
G =
H T
S, where R and T are the universal gas constant and absolute temperature, respectively, and Kapp (apparent equilibrium constant = 1/Kd) was calculated at three different temperatures (25, 30, and 37°C) by fluorometric titration of GST-N protein trimer with S segment panhandle RNA. The fluorescence spectra of GST-N protein trimer in binding buffer excited at 295 nm were recorded with a Hitachi spectrofluorometer at an excitation and emission band pass of 5 and 10 nm, respectively. Contribution from the binding buffer was subtracted whenever required. The apparent dissociation constant (Kd) corresponds to the concentration of N protein required to obtain half saturation, assuming that complex formation obeys a simple bimolecular equilibrium.
H and
S were calculated from the slope and intercept of a plot of ln Kapp against 1/T.
G was calculated from the third equation after incorporation of the values for
H and
S obtained from the second equation.
|
|
|---|
Previously we observed that a relatively small RNA composed of the S segment RNA panhandle is sufficient for high-affinity binding by trimeric GST-N protein (23). Thus, we examined the association of trimeric GST-N with small panhandle RNAs containing the terminal 33 nucleotides from both the 5' and 3' ends and separated by a 6-nucleotide-long spacer composed of U residues. Depending on sequence length and complexity, RNAs can potentially form multiple secondary structures. However, on the basis of both analysis of suboptimal RNA folds and P-num analysis (see above) RNAs of secondary structure significantly different from those shown in Fig. 1 are predicted to be highly unlikely.
The results of RNA filter binding experiments with these viral and complementary RNA substrates are shown in Fig. 1, and a summary of the binding data with these panhandle RNAs is shown in Table 1. Each of the viral RNAs corresponding to the three genome segments was recognized with similar high affinity by trimeric GST-N protein. Furthermore, this protein-RNA interaction was stable to increasing NaCl concentrations, indicative of specificity. Interestingly, the binding affinity of the plus-strand complementary RNA panhandles was markedly reduced relative to that of the viral RNAs and sensitive to increased salt.
|
View this table: [in a new window] |
TABLE 1. Dissociation constants for purified trimeric Sin Nombre virus GST-N with minus-strand viral RNA (vRNA) or plus-strand complementary RNA (cRNA)
|
Both secondary structure and primary sequence are required for panhandle-N interaction. The S, M, and L segment panhandles are double stranded but punctuated by single symmetrically opposed unpaired nucleotides at positions 9 and either 18 or 19 from the 5' and 3' termini. We constructed various mutations in the S segment viral RNA panhandle to examine the features of the viral RNA required for recognition by trimeric GST-N. RNA 14 contains three adjacent nucleotide substitutions near the 5' end that would disrupt the terminal stem structure (Fig. 2A). We carried out simple RNA binding analysis with trimeric GST-N, similar to that shown in Fig. 1, to measure binding affinity for these RNAs. The data from this experiment indicated that RNA 14 was not recognized with high affinity (Table 1).
In addition, we performed competition binding experiments in which increasing amounts of unlabeled competitor RNA were added to binding reactions containing purified GST-N trimer and a constant amount of radiolabeled wild-type panhandle RNA; effective competition by the unlabeled RNA would result in reduction of binding of the labeled RNA to GST-N. This experiment indicated that mutant 14 did not effectively compete for binding to GST-N protein (Fig. 3). Thus, both the binding and competition data are consistent with a requirement for terminal base-pairing for specific recognition by N.
![]() View larger version (14K): [in a new window] |
FIG. 3. RNA binding competition analysis of S segment viral RNA variants. As described in detail in the Materials and Methods section and the text, a constant amount of purified trimeric GST-N protein was incubated with a constant amount of labeled RNA corresponding to the wild-type viral RNA panhandle and increasing concentrations of unlabeled competitor RNA. The quantitative reduction of binding of trimeric GST-N to labeled RNA as a function of increasing competitor RNA was measured for each RNA. The figure depicts the competition curves for three of the variant RNAs and for the wild type. The competition curves for the other RNAs in Fig. 2 are not shown. However, the maximum extent of competition for all of the S segment panhandle variants is presented in Table 2. The data in the figure and in Table 2 are the averages of three separate experiments. The standard deviation for the measured values of all of the RNAs is presented in Table 2.
|
In some hantavirus strains related to Sin Nombre virus, such as NY1, Puumala, Tula, and Andes (6, 11, 15, 21), an A residue rather than G is present at position 10 from the 5' end of the RNA. This transition mutation would maintain the general panhandle configuration since there is a U residue at position 10 from the 3' end of the RNA. RNA 12 contains such a G-to-A substitution at position 10 from the 5' end (Fig. 2A). While viral RNA contains a single pair of unpaired nucleotides at position 9, in plus-strand Sin Nombre virus complementary RNAs, nucleotides 9 and 10 are both unpaired. Thus, RNA 13 contains a G-to-C transversion such that in the viral RNA, nucleotide 10 along with nucleotide 9 would both remain unpaired. Finally, RNA 15 contains two consecutive C-to-G substitutions at positions 13 and 14 that would disrupt the nine-nucleotide-long, base-paired segment that begins at nucleotide 10. RNA 12 was recognized at high affinity by GST-N protein (Table 1) and competed with wild-type RNA for binding to GST-N protein (Table 2), indicating that an alternative base pair at position 10 does not affect the interaction with N. However, disruption of the stem structure (in mutants 13 and 15) resulted in a corresponding reduction in specific recognition by trimeric GST-N and the inability to compete efficiently with wild-type RNA for binding to trimeric GST-N (Tables 1 and 2).
|
View this table: [in a new window] |
TABLE 2. Competition analysis of Sin Nombre virus GST-N protein-RNA interactionsa
|
The data from RNA binding experiments and binding competition experiments with these mutants are summarized in Tables 1 and 2. Interestingly, it appeared important for nucleotides 9 and 19 to remain unpaired for specific recognition by trimeric GST-N. Substitution of a nucleotide that would retain nucleotides 9 and 19 in unpaired form (RNAs 8 and 10, respectively) resulted in RNAs that were recognized by trimeric N with high affinity even at increasing salt concentrations. In contrast, the introduction of base pairs at nucleotides 9 and 19 (RNAs 7, 9, and 11) resulted in RNAs that could not stably interact with trimeric GST-N at increasing salt concentrations. Taken together, these data suggest that overall secondary structure is important for recognition by trimeric N protein. However, nonconservative nucleotide substitutions in base-paired regions can markedly reduce the affinity of GST-N for the panhandle, indicative of a contribution of a primary sequence requirement for recognition by N.
Minimum panhandle size and stoichiometry of N-panhandle interaction. Based on previous work (23) and the data described here, a hantavirus panhandle composed of the terminal 32 nucleotides from both the 5' and 3' ends is sufficient as a substrate for trimeric N recognition. To determine the minimum size of panhandle required for recognition by trimeric N, we constructed RNAs that contained fewer interior nucleotides and carried out RNA binding studies with these RNAs. RNA 5 contains the terminal 23 nucleotides and RNA 6 contains the terminal 18 nucleotides of the panhandle, respectively (Fig. 2B). The results of this experiment indicated robust interaction between trimeric GST-N and RNA 5 but not RNA 6 (Tables 1 and 2). Thus, a panhandle 23 nucleotides in length is sufficient for recognition by trimeric N.
We attempted to determine the binding stoichiometry for N with its substrate panhandle with the shortest RNA that bound to trimeric GST-N with high affinity. We replotted the data from the RNA binding experiments with RNA 5 to calculate stoichiometry (Fig. 4). The plot is consistent with the binding of a trimer-to-panhandle ratio of 1:1. Similar data were obtained from stoichiometry plots with other panhandle RNAs that interact with trimeric GST-N protein at high specificity (data not shown). We also examined the binding stoichiometry of RNAs that were not recognized by trimeric N with high affinity. These also indicated a 1:1 stoichiometry. For example, see RNA 2 (Fig. 4)
![]() View larger version (10K): [in a new window] |
FIG. 4. Binding stoichiometry of trimeric N-panhandle interaction. The results from binding RNA experiments were replotted to determine stoichiometry as described in the Materials and Methods section. Data for RNA 5, which is bound by trimeric GST-N at high affinity, and RNA 2, which is bound at lower affinity, are shown. The data indicate that three monomers are bound per panhandle. Since the binding experiments were carried out with purified trimeric GST-N, this corresponds to 1 trimer/panhandle. The binding data for all of the RNAs analyzed were similarly plotted and indicated similar stoichiometry independent of binding affinity (not shown).
|
Binding of trimeric N to the panhandle is an entropy-driven process.
We further investigated the interaction between trimeric N protein and S segment panhandle RNA with fluorescence spectroscopy. Essentially, we tried to examine the general conformation of trimeric GST-N in the absence and presence of panhandle RNA, as evidenced by the disposition of tryptophan as measured by excitation and emission spectra. N protein contains 10 tryptophan residues/polypeptide. Figure 5A (inset) shows the emission spectra of trimeric GST-N protein in the absence of RNA following excitation with light at a wavelength (
) of 295 nm. Whereas free L-tryptophan exhibits an emission maximum of 350 nm (data not shown), GST-N protein yielded a
max of 330 nm. This N-dependent shift in
max is approximately twice that observed for most monomeric proteins (3, 41) and is consistent with the shielding of N-associated tryptophan in a trimeric protein.
![]() View larger version (18K): [in a new window] |
FIG. 5. Fluorescence spectroscopic and thermodynamic analysis of N protein trimer and S segment viral RNA panhandle association. (A) As described in the Materials and Methods section, a fixed concentration of N protein trimer in binding buffer was excited at 295 nm, and fluorescence intensity at 330 nm was recorded. The experiment was carried out at 25°C. The fluorescence signal from free binding buffer in the absence of GST-N protein trimer was subtracted where appropriate. The fluorescence intensity of the N protein trimer at 330 nm was monitored at different input concentrations of S segment Sin Nombre virus RNA panhandle. A plot of fluorescence intensity (at 330 nm) as a function of Sin Nombre virus S segment RNA panhandle concentration was then plotted in panel A. The inset shows the fluorescence emission spectra ( ex = 295 nm) of N protein trimer alone ( ) and in the presence of S segment viral RNA panhandle at concentrations of 50 nM ( ) and 110 nM ( ). (B) The N protein bound at different input concentrations of S segment Sin Nombre virus RNA panhandle was calculated from panel A. A plot of the N protein bound at each input concentration of S segment Sin Nombre virus RNA panhandle is shown in panel B. The dissociation constant, corresponding to the concentration of S segment Sin Nombre virus RNA panhandle at which half of the N protein trimer is bound, was calculated from this binding profile (panel B). ln Kapp (1/Kd) at three different temperatures was calculated. A plot of ln Kapp versus 1/T (van't Hoff plot) is shown in the inset. H and S values were calculated from the van't Hoff plot as described in the Materials and Methods section. a.u., arbitrary units.
|
max (Fig. 5A, inset). A similar effect but varying in magnitude was observed with each of the panhandle RNAs listed in Table 1 (data not shown). The decrease in the fluorescence quantum yield at 330 nm with addition of different concentrations of wild-type minipanhandle RNA at 25°C is shown in Fig. 5A. The dissociation constants corresponding to the concentration of N protein at half saturation were calculated at three different temperatures. These data were then used to generate a van't Hoff plot (Fig. 5B) that was used to calculate the pertinent thermodynamic parameters
G,
H, and
S as described in the Materials and Methods section. This analysis indicated that
G = 10.4 kcal/mol,
H = 49.9 kcal/mol, and
S = 202.4 entropy units. From these data it is evident that binding between the N protein trimer and the wild-type minipanhandle RNA is an entropy-driven process. The positive entropy change compensates for an observed positive enthalpy value and drives the binding reaction by a total free energy change of 10.5 kcal/mol at 25°C. These are characteristics consistent with the correct interaction of a substrate RNA with an RNA binding pocket of N. To further compare the interaction of the panhandle RNAs with trimeric GST-N protein, we compared fluorescence quenching of N-associated tryptophan residues in the presence and absence of panhandle RNAs. Figure 6 depicts the effect of wild-type panhandle RNA on the fluorescence quenching of purified trimeric GST-N with increasing concentrations of acrylamide, an uncharged molecule that tends to quench accessible tryptophan residues. This analysis yielded a Stern-Volmer quenching constant (Ksv) of 4.6. The presence of panhandle RNA decreased the quenching of tryptophan by acrylamide, resulting in a Ksv of 2.5. This is consistent with a panhandle-dependent conformational change in trimeric N protein such that tryptophan residues are sequestered away from the surface of the protein following binding to the panhandle.
![]() View larger version (13K): [in a new window] |
FIG. 6. Stern-Volmer plots of tryptophan fluorescence quenching of N protein trimer by acrylamide. A fixed concentration of trimeric GST-N in binding buffer was excited at 295 nm, and emission was recorded at 330 nm. The fluorescence studies were carried out at room temperature. The fluorescence signal from free binding buffer in absence of GST-N protein trimer was subtracted whenever required. The fluorescence intensity at 330 nm was determined at different input concentrations of acrylamide. Plots of F0/F as a function of acrylamide concentration (Stern-Volmer plot) for free GST-N protein trimer in absence ( ) and presence ( ) of RNA of S segment Sin Nombre virus RNA panhandle (panel A), RNA 6 (panel B), and RNA2 (panel C) was used to calculate the Ksv values as described in the Materials and Methods section. The data shown are for viral RNA, RNA 6, and RNA 2 in graphs A, B, and C, respectively.
|
|
View this table: [in a new window] |
TABLE 3. Stern-Volmer quenching constants for the association of N protein trimers with different panhandle RNA molecules
|
|
|
|---|
Plus-stranded complementary RNA can form panhandle structures similar in overall conformation and stability to those formed by the viral RNA but are recognized at lower affinity by trimeric N. What characteristics of the viral RNA panhandle account for specific recognition? Mutational analysis of the panhandle indicates that secondary structure and primary sequence are both important for recognition by trimeric GST-N. The terminal 23 nucleotides of the panhandle are sufficient for interaction with N, and for the S and M genome segments, all but two of these nucleotide positions contain hydrogen-bonded nucleotides. As might be expected, this extensive base-pairing is important for interaction with N, as evidenced by the introduction of nucleotides that abolish base pairing. However, it is also important that the four unpaired nucleotides of the panhandle, at positions 9 and 19, remain unpaired, since nucleotide substitutions that enable pairing at these positions result in concomitant inhibition of binding by trimeric N. In contrast, substitutions that maintain these nucleotides in an unpaired state allow high-affinity interaction with trimeric N.
The primary sequence of the base paired nucleotides is also important for specific recognition by trimeric N, since substitution with alternative paired nucleotides results in a substantial reduction in binding affinity by trimeric N. This role of primary sequence likely accounts for the differential binding affinity of trimeric N for the minus and plus strands. It is formally possible that an alternative tertiary RNA structure was formed following the introduction of alternative base pairs in the panhandle. However, substantial differences in tertiary structure seem unlikely since the panhandle is marked by such extensive base-pairing. Double-stranded RNA typically assumes an A-form helix with a 28-Å pitch (42). With the exception of the nucleotide pairs at positions 9 and 19, the panhandle is double stranded, corresponding to an RNA that is approximately 650 Å in length. The introduction of alternative nucleotide pairs would be unlikely to alter this basic configuration. It is more likely that position-specific interaction between particular nucleotides and the amino acids in the RNA binding pocket of N is required for high-affinity interaction.
In addition to a probable role in encapsidation, recognition of the panhandle by N may function in RNA replication. Trimeric N might facilitate the formation of an initiation complex with the RNA-dependent RNA polymerase, the RNA panhandle, and a priming oligoribonucleotide derived by cap snatching. This would then enable the polymerase to initiate replication at the 3' end of the RNA template within the panhandle. Such a role for N can be readily envisioned for the initiation of plus-strand synthesis since trimeric N recognizes the minus-strand panhandle with high affinity and also in minus-strand initiation, although the interaction between N and the plus-strand panhandle is of lower affinity.
Previously published work indicated that N protein from viruses of the Hantavirus and Bunyavirus genera preferentially binds to a signal(s) near the 5' end of the S segment (26, 40). While the subunit composition of the bacterially expressed hexahistidine-tagged N used in these studies was not examined, it is likely that these protein preparations were composed of a mixture of monomeric, dimeric, and trimeric N. Comparison of the RNA binding affinity of unfractionated GST-N with the purified trimeric GST-N indicates that the trimer binds to the panhandle, while the mixed population binds to RNAs containing the viral RNA 5' end (23). It is possible that monomeric and dimeric N functions in a way that is distinct from that of trimeric N during virus replication. However, since trimeric N appears to be a fundamental subunit intermediate leading to capsid formation (1, 19), we favor the idea that recognition of the viral RNA panhandle by the trimer is likely to provide the basis for specific RNA encapsidation.
The presence of a GST fused to N, which was used in our N expression construct, or a hexahistidine tag, which was used in previous studies to examine the interaction between N and RNA (12, 26, 35, 39, 40), could potentially affect the properties of N. In the present instance, it is most pertinent to consider the possibility that the presence of the GST tag affected either the trimerization of N or the ability of purified trimeric N to recognize the viral panhandle RNA with specificity. We recently initiated experiments with N protein expressed from the bone fide start and stop codons of the Sin Nombre virus N gene. Thus, the resulting protein is devoid of additional amino acids. Preliminary sucrose sedimentation of this protein indicates a peak fraction at approximately 150 kDa, consistent with the formation of trimers, and preliminary RNA binding properties of the purified trimer with S and M segment viral RNA and complementary RNA yield Kd values virtually identical to those measured for purified trimeric GST-N (data not shown).
The panhandle RNAs that we analyzed were generated by T7 RNA polymerase. As with other RNA polymerases, initiation can typically occur at several nucleotides near the formal +1 position of the T7 promoter (13, 30). Thus, the RNAs that we examined are very likely to be composed of a population of RNAs with 5' termini that differ by one or several nucleotides. Given this likely heterogeneity, it is possible that only some of the molecules in the RNA population which initiate at a particular 5' nucleotide are preferentially bound by trimeric GST-N. However, the addition of nucleotides to the 5' end (RNA 17) did not affect recognition by trimeric N. In addition, the 5' end of viral RNA synthesized during virus infection contains heterogenous 5' ends that arise from initiation of cellular RNAs by a putative cap-snatching mechanism (7, 8, 16). Thus, it is likely that the termini of the panhandle need not be formed with precision for specific recognition by trimeric N. Nonetheless, without direct sequencing of all the RNAs, it is possible that the substitution of specific nucleotides close to the +1 position might have led to an unexpectedly large effect on the distribution of initiation sites for a specific RNA.
If one assumes that trimeric N protein is globular, the approximate theoretical diameter of trimeric N protein is 80 to 100 Å. Potentially, there would be space for several trimers along the length of the predominantly double-stranded RNA panhandle. However, the binding stoichiometry appears to be consistent instead with the binding of a single trimer per panhandle. Capsid formation would involve both trimer RNA interaction and interaction between multiple N molecules. It seems likely that the specific in vitro interaction between trimeric N and the RNA panhandle represents an initial step in RNA encapsidation. In vivo, such a complex between N and the panhandle may serve as a nucleation point for subsequent intermolecular interaction between multiple N molecules, resulting in capsid assembly.
Though the panhandle by definition is composed of the terminal nucleotides of the viral RNA, the structure need not be located at the RNA terminus for high-affinity recognition by trimeric N, since the addition of nucleotides to either the 5' end, the 3' end, or both did not measurably inhibit binding by N. However, the fact that trimeric N recognizes the terminal sequences with specificity likely reflects a role for N protein at that site in the RNA distinct from encapsidation. In particular, binding to the panhandle is consistent with a role for trimeric N protein in either initiation of plus-strand RNA at the 3' end of the viral RNA template, termination at the 5' end, or in both events.
Based on the thermodynamic events that occur during binding of the panhandle to the trimer, the protein-RNA interaction is an entropy-driven process. The most conventional interpretation of the data is that, in the absence of RNA, the amino acids constituting the RNA binding pocket of N are occupied by water molecules. Interaction of RNA with the binding pocket would then result in displacement of the water molecules, resulting in the measured increase in entropy. Based on the fluorescence quenching data, RNA-protein interaction also results in an apparent conformational change in N, as evidenced by the general redistribution of the tryptophan molecules to positions inaccessible to an uncharged quencher. Interestingly, the addition of nucleotides to either the 5' or 3' end of the panhandle did not diminish high-affinity binding by trimeric GST-N. However, the presence of these extra nucleotides abolished the conformational change in GST-N. Thus, it appears that the interaction between N and the panhandle results in initial high-affinity binding followed by protein rearrangement.
The simultaneous addition of complementary nucleotides to both the 5' and 3' ends of the panhandle such that a longer double-stranded RNA was created had no apparent effect on either binding affinity or subsequent conformational change in N. This differential effect of the addition of double-stranded versus single-stranded RNA on the conformational alteration of N is intriguing and suggests that rearrangement of N is dependent on interaction between double-stranded regions of the panhandle and the RNA binding site of N. It is likely that concurrent addition of no complementary nucleotides to the 5' and 3' ends would also abolish the rearrangement of N.
The formation of the panhandle and its recognition by trimeric N provide a simple and attractive model to account for specificity during viral RNA encapsidation. However, it is not clear how the panhandle forms. Thermodynamic calculations indicate that the highest negative free energy states for all three viral RNAs involve the formation of a panhandle. However, random intramolecular interaction is unlikely to drive panhandle formation, especially for an RNA as long as the hantavirus L segment. The efficient formation of a panhandle likely involves the activity of an RNA chaperone, a molecule that facilitates the rearrangement of RNA to allow the refolding of RNA to its most stable forms. Such a chaperone activity has been implicated in the correct folding of other viral RNAs and is associated with the viral nucleocapsid protein (5, 34). For hantaviruses and other members of the Bunyaviridae, such a chaperone activity most likely similarly resides in the N protein itself.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»