Previous Article | Next Article ![]()
Journal of Virology, January 2005, p. 1164-1179, Vol. 79, No. 2
0022-538X/05/$08.00+0 doi:10.1128/JVI.79.2.1164-1179.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
School of Animal and Microbial Sciences, University of Reading, Reading,1 Division of Molecular Biology, Institute for Animal Health, Compton,2 Molecular and Cellular Biology Research Group, Faculty of Biological Sciences,3 Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds,5 Waters Corporation, Micromass MS Technologies Centre, Wythenshawe, Manchester, United Kingdom4
Received 3 September 2003/ Accepted 5 July 2004
|
|
|---|
|
|
|---|
N protein, the virus RNA binding protein, is one of the most abundant viral proteins in an infected cell (31). Several functions have been postulated for the coronavirus N protein throughout the virus life cycle (31); primarily, it complexes with the genomic RNA to form a ribonucleocapsid structure (17) and associates with the M protein (19, 39) to form the viral core (48). While N protein is required in trans to rescue the full-length clone of IBV (8) and a porcine coronavirus transmissible gastroenteritis virus clone (82), it is not required for others (1, 71, 72). Certainly, in the case of the rescue of the full-length clone of severe acute respiratory syndrome coronavirus, the presence of N protein increases viral titers compared to rescue performed in the absence of N transcript (83), suggesting that N protein may be involved in the efficiency of replication but that it is not essential.
Based on amino acid sequence comparisons, three domains have been identified in the murine coronavirus, mouse hepatitis virus (MHV) N protein (46), of which the central domain (domain II) was identified as a potential RNA binding site (35, 40) capable of binding both coronavirus- and non-coronavirus-derived RNA sequences (35, 68). However, whether this binding occurs with equal or different affinity is uncertain (14, 35, 49). N protein has been shown to associate with several motifs on viral RNA, including the leader RNA sequence, with particular affinity for the core sequence of the TAS (2, 41), sequences at the 3' end of the genomic RNA (84), and the packaging signal (37). How these sequences promote N binding is unknown.
Several coronavirus N proteins have been shown to be phosphorylated, including IBV, MHV, and transmissible gastroenteritis virus N proteins, although the precise sites were not identified (31). The role of phosphorylation in the virus life cycle is unknown, although the phosphorylation state of N protein has been predicted to play a role in determining the affinity of binding of viral RNA (31, 41), and dephosphorylation of MHV N protein by cellular phosphoprotein phosphatase has been suggested to facilitate the infectious process (36). Siddell et al. (64) identified a virion-associated protein kinase activity that could transfer an additional phosphate from ATP to N protein.
We investigated the role of phosphorylation of IBV N protein in RNA binding by using surface plasmon resonance, which has not previously been used to study coronavirus N protein binding kinetics. To model nonphosphorylated and phosphorylated protein, we expressed and purified N protein from Escherichia coli and Sf9 cells, respectively. Mass spectroscopic analysis indicated that the sample expressed in Sf9 cells was phosphorylated proximal to RNA binding domains and had a phosphorylation pattern identical to that of N protein expressed in the model cell line for IBV infection. We used either an IBV-defective RNA to model the viral genome or synthetic RNAmers resembling the viral leader sequence as target RNAs. Our results showed that the phosphorylated form of N protein has decreased affinity for random RNA and that multiple high-affinity N protein binding sites were present on leader RNA.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Sequences of oligonucleotides used in this studya
|
![]() View larger version (28K): [in a new window] |
FIG. 1. Diagrammatic representation of the synthetic RNAmers used in this study to show delineation between IBV and a random sequence. Note that SL1 and SL2 are predicted structures only and not yet established by experimental work.
|
Infection and transfection of Vero cells. Vero cells (seeded at 105 in 9.6 cm2) were either transfected or infected at 70% confluence. Transfections were performed with 2 µg of pCI-Neo-N with 8 µl of Lipofectamine transfection reagent (Invitrogen) and incubated in Opti-MEM serum-free medium (Invitrogen) at 37°C for 4 h. The transfection medium was replaced with 2 ml of supplemented Dulbecco modified Eagle medium and left for an additional 20 h. Vero cells were infected with 1 ml of IBV Beaudette (multiplicity of infection of approximately 1) and incubated for 1 h at 37°C. The initial inoculum was replaced with 1 ml of media and incubated for a further 7 h postinfection.
Preparation of total protein extracts from Vero cells and Western blotting. Vero cells were washed with phosphate-buffered saline, detached by a cell scraper into 1 ml of phosphate-buffered saline, pelleted by centrifugation at 250 x g for 5 min before lysis by addition of 200 µl of cell lysis buffer (20 mM Tris-HCl [pH 7.5], 100 mM NaCl, 0.5 mM EDTA, 0.5% NP-40, and complete protease cocktail inhibitor [Roche]) for 10 min at 4°C. Samples were centrifuged at 10,000 x g for 5 min at 4°C, and supernatants were transferred into a new Eppendorf tube and stored at 80°C.
Preparation of N protein from E. coli, Sf9, and Vero cells. Expression and purification of His-tagged N protein from either E. coli or Sf9 cells has been previously described (12, 13). To obtain N protein from mammalian cells, Vero cells were transfected with pTriExIBVN as described previously (13), and purification was identical to that described for N protein from Sf9 cells. Western blotting was performed by using the ECL detection kit (Amersham Pharmacia) according to the manufacturer's instructions. Briefly, each sample (3 µg of total protein as quantified by Bradford assay) was separated on a 10% NuPage Bis-Tris precast polyacrylamide gel (Invitrogen) in morpholinepropanesulfonic acid (MOPS) running buffer. Protein was transferred onto a polyvinylidene difluoride membrane (Invitrogen) in Invitrogen transfer buffer for 1 h at room temperature. The membrane was blocked for 1 h in Tris-buffered saline (TBS) plus 0.05% Tween 20 and 5% milk. Primary rabbit anti-IBV polyclonal sera (a gift from D. Cavanagh, IAH Compton), diluted 1:10,000 in TBS plus 0.05% Tween 20, was incubated with the membrane for 2 h at room temperature to detect IBV N protein. The membrane was washed three times in TBS plus 0.05% Tween, and then the membrane was incubated with secondary goat anti-rabbit immunoglobulin G conjugated to horseradish peroxidase (diluted 1:1,000 in TBS plus 0.05% Tween) for 1 h at room temperature. The membrane was then washed three times, treated with ECL detection reagents (Amersham Pharmacia), and subjected to autoradiography.
Mass spectroscopy. Recombinant IBV N protein (from E. coli [NE. coli] or Sf9 [NSf9] cells) was dialyzed into digest buffer (100 mM ammonium bicarbonate, 2 M urea [pH 7.8]). Proteins were digested by the addition of approximately 5 to 10% (wt/wt) endoproteinase Glu-C (Sigma). Resulting proteolytic peptides were analyzed by online capillary HPLC-mass spectrometry (MS). Samples were preconcentrated and desalted by use of a self-packed protein trap, followed by separation by use of a 180-µm-inner-diameter capillary column, self-packed with Hichrom C18 reversed-phase sorbent (3.5-µm bead size, 150-Å pore size). Peptides were eluted from the column by an increasing gradient of solvent B (0.05% vol/vol trifluoroacetic acid in 5:95 water-acetonitrile), and solvent A was 0.05% vol/vol trifluoroacetic acid in 95:5 water-acetonitrile. The eluent was passed, via a U-Z View UV flow cell (LC Packings), to a Quattro II tandem quadrupole mass spectrometer (Micromass United Kingdom Ltd.). The mass spectrometer was equipped with a commercially available Z-spray source operated in continuous flow nanospray mode. Full-scan mass spectra were acquired in positive-ion mode from an m/z of 300 to 2,100 with a scan time of 4 s. The voltage applied to the extraction cone was ramped to improve detection of high-m/z ions. Recombinant IBV N protein expressed by Vero cells (NVero) was digested with trypsin and analyzed by HPLC-MS or automated HPLC-MS/MS at Micromass United Kingdom Ltd. Peptides were separated on a reversed-phase capillary HPLC column (75-µm C18 Pepmap; LC Packings), and the eluent was passed directly to the Z-spray source of a Micromass Q-Tof Ultima API (Waters Corp.) mass spectrometer. The mass spectrometer was operated in data-dependent acquisition mode to generate peptide sequence information automatically, and theoretical phosphopeptide ions were added to the include list to ensure that these peptides were sequenced.
Runoff transcription and hybridization of CD-61 RNA to biotinylated oligo(dT). pCD-61 containing the cDNA for the 6.1-kb minigenome of Beaudette IBV was made linear by digestion with NotI. RNA with a 3' poly(A) tail was in vitro synthesized from pCD-61 by using T7 RNA polymerase (Promega) as described previously (47). RNA (2.5 pmol) was denatured in 400 µl of 0.5 x SSC (1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate) at 65°C for 10 min. Biotinylated oligo(dT) (2.5 pmol) was annealed to the RNA by cooling to room temperature.
Surface plasmon resonance analysis of RNA-protein interactions. Surface plasmon resonance provides both kinetic and equilibrium data about intermolecular interactions and is an excellent tool for the study of RNA-protein interactions (27). In most nucleic acid protein binding studies, RNA is the ligand and is therefore anchored to a solid matrix and the protein is the analyte and is passed over this material (27). In our experiments, kinetic analysis was performed by using a Biacore 3000 (Uppsala, Sweden). Research grade sensor chips coated with streptavidin were obtained from Biacore. Unless otherwise stated, the the running buffer (also used for analyte dilution) was HBS-EP (0.01 M HEPES, 0.15 M NaCl, 3 mM EDTA, 0.005% surfactant P20 [pH 7.4]).
To reduce effects attributed to mass transport, low levels of ligand (given in relative units [RU]) are immobilized on the solid matrix, coupled with a fast flow rate of analyte (27). For example, Stockley et al. (67), in their study of binding of RNA polymerase to immobilized DNA, suggested that between 200 and 800 RU represented low levels of immobilized ligand, and they used a flow rate of 30 or 40 µl/min. Likewise, Waysbort et al. (76) immobilized on average 1,000 RU of ligand and used a flow rate of 5 µl/min of analyte in their study of pyrimidine tract binding protein. Blaesing et al. (4) immobilized between 100 and 700 RU of an oligonucleotide containing a DnaA box motif and a flow rate of 100 µl/min in an investigation of the DNA-binding domain of E. coli DnaA protein. In contrast, Park et al. (45) immobilized 30 to 50 RU of target RNA and used a flow rate of 30 µl/min. In a binding study, Kortt et al. (28) found that, with 1,260 or fewer RU of ligand and a flow rate of analyte between 20 and 50 µl/min, no mass transport was apparent and experimental data exhibited no marked deviation from the 1.1 Langmuir binding model used in the study. Given the apparent variation in RU of immobilized ligand used in different binding studies, the average RU of immobilized ligand in our study was approximately 500 RU (as recommended by BIAcoreAB, application note 301, Bianote). The average for CD-61 RNA was 649 RU. For RNAmers, the averages were 526 RU for 3leaderTAS, 490 RU for 3leaderSAT, 488 RU for random90, 488 RU for random90TAS, 435 RU for SL1SL2TASrand, 418 RU for SL1SL2rand, 514 RU for DSL1SL2TAS, 573 RU for DSL1SL2, 560 RU for DSL1SL2rand, 563 RU for SL1TASrand, 450 RU for antileader, 477 RU for 3leaderTAS, 614 RU for 3leaderSAT, 514 RU for random90, and 620 RU for random90TAS.
Prior to our kinetic analysis, we ran control experiments to assess the influence of mass transport on the ligand-analyte interaction and to investigate the use of high flow rates of analyte (N protein) to reduce possible mass flow effects. In brief, different concentrations of N protein were injected at a flow rate of 5, 15, or 75 µl/min over immobilized RNA. As an example, the average binding kinetics of nonphosphorylated N protein (at a concentration of 6.25 nM) to target RNA are shown in Fig. 2 and kinetic values are given in Table 2. The average Kd values obtained were 13.3, 8.44, and 6.97 nM, respectively, indicating that there was no significant difference between the data from a flow rate of 15 to 75 µl/min. Therefore, in subsequent work to study RNA-N protein interactions, and similar to that used by Park et al. (45), we used a flow rate of 30 µl/min to minimize the influence of mass transport.
![]() View larger version (11K): [in a new window] |
FIG. 2. Sensorgram showing the binding analyte (nonphosphorylated N protein) to target ligand with different flow rates to investigate mass transport effects. N protein (the analyte) at the concentration of 6.25 nM was injected at flow rate of 5, 15, or 75 µl/min. RU, response units.
|
|
View this table: [in a new window] |
TABLE 2. Average kinetic values for complexes of analyte (nonphosphorylated N protein) with ligand RNA to investigate the effects of mass transport and the use of different flow rates to reduce this effect
|
![]() View larger version (13K): [in a new window] |
FIG. 3. Sensorgram showing the binding of N protein to CD-61 to investigate whether any secondary binding occurred, 6.25 nM nonphosphorylated N protein was injected for 1, 3, or 20 min. RU, response units.
|
|
|
|---|
![]() View larger version (70K): [in a new window] |
FIG. 4. Western blot analysis of IBV N protein in infected cells (lane 1), cells transfected with a plasmid expressing N protein (lane 2), mock-infected cells (lane 3), and mock-transfected cells (lane 4). The appropriate molecular weight markers are indicated to the left of their lane (M). The position of the IBV N protein is indicated by an arrow.
|
IBV N protein contains internal disulfide bonds. IBV N protein expressed in E. coli, Sf9 cells, and Vero cells was subjected to proteolytic digestion and analysis by capillary HPLC-MS. A representative HPLC chromatogram from analysis of peptides released by tryptic digestion of NE. coli is shown in Fig. 5. Many proteolytic peptides were identified, which covered a large amount of the IBV-N protein sequence; peaks are labeled with the residue numbers that the peptides span.
![]() View larger version (27K): [in a new window] |
FIG. 5. Capillary HPLC-MS analysis of peptides released after tryptic digestion of NE. coli. Peptides were detected by both UV absorbance at 214 nm (upper trace) and mass spectrometric total ion current (TIC, lower trace). Peaks in the upper trace are labeled with the retention time of the peptides, and those in the lower trace are labeled according to the residue numbers of NE. coli tryptic peptides that were identified. Peaks marked with asterisks correspond to trypsin autolysis products.
|
![]() View larger version (19K): [in a new window] |
FIG. 6. Tandem mass spectrum of a putative disulfide bond-containing peptide. The peptide covers residues Ile318 to Lys331 and was released by tryptic digestion of NE. coli. The presence of both diagnostic b and y series fragment ions confirm the identity of this peptide. The mass of the peptide was observed to increase by 2 Da after reduction of disulfide bonds with dithiothreitol, confirming the existence of the single disulfide bond within this peptide.
|
![]() View larger version (22K): [in a new window] |
FIG. 7. Deconvoluted electrospray mass spectra of polypeptides released by endoproteinase Glu-C digestion of NE. coli (a) and NSf9 (b) proteins. (a) Peaks corresponding to residues 1 to 182 (theoretical mass, 19,531.7 Da) and 1 to 220 (theoretical mass, 23,641.3 Da) are evident. (b) Peaks corresponding to the same polypeptides have masses that have increased by around 40 Da. This mass difference was measured to be 44 Da from the original peak envelopes. In addition, the polypeptide spanning residues 1 to 220 is present in three different forms, with each separated by around 80 Da. This is consistent with up to two phosphorylations in this polypeptide. The absence of these signals in the peptide corresponding to residues 1 to 182 implies that the sites of phosphorylation are between residues 183 and 220.
|
Residues in both the N- and C-terminal domains are phosphorylated in both NSf9 and NVero. To locate the sites of phosphorylation, NSf9 was digested with trypsin to yield smaller fragments; the peptide pool was desalted by use of a ZipTip and was analyzed by offline MS in negative-ion mode. To specifically detect phosphopeptides, a precursor ion scan was used that detected peptides fragmenting to yield a moiety of an m/z of 79 (PO3). The resulting spectrum is shown in Fig. 8, and peaks are labeled with putative residue numbers of the peptides, assigned solely from the peptide mass. From these data, it appears that there are two sites of phosphorylation between residues 186 to 198 and a further two between residues 367 to 394. A fresh aliquot of tryptic peptide mixture was analyzed in positive-ion mode, and the relevant phosphopeptides were subjected to collision activation experiments by MS/MS. The MS/MS spectrum of the doubly phosphorylated peptide with a mass of 3,460.4 Da from NSf9 is shown in Fig. 9a. Fragment ions are labeled according to the convention of Roepstorff and Fohlman (50). Neutral losses of 98 Da (H2PO4) from the parent ion, characteristic of phosphopeptides, was observed. In addition, structural information was obtained from sequence-specific fragment ions, which are shown more clearly in the magnified inset of Fig. 9a. By comparison with corresponding MS/MS spectra of singly and unphosphorylated peptides, the sites of phosphorylation were distinguished. The data showed that the peptide spans residues 367 to 394 and that the sites of phosphorylation are Thr378 and Ser379. In the singly phosphorylated peptide, however, it is not clear which of these two sites is occupied.
![]() View larger version (23K): [in a new window] |
FIG. 8. Negative-ion mass spectrometric analysis of tryptic peptides of NSf9 by offline precursor ion scanning. The mass spectrum shows all ions that fragment to give a specific fragment ion with an m/z of 79 (PO3). Peaks are labeled with a letter to identify the peptide and a number corresponding to the charge state. Although many signals are evident, this is the result of heterogeneity in digestion and phosphorylation. All peptides span essentially the same residues in the N- and C-terminal domains, and these data suggest up to two phosphorylations between residues 186 and 198 and an additional two between residues 367 and 394.
|
![]() View larger version (40K): [in a new window] |
FIG. 9. Tandem mass spectra of phosphopeptides released after tryptic digestion of NSf9 (a) and NVero (b). The parent ion in panel a had an m/z of 866, corresponding to the putative diphosphorylated peptide of residues 367 to 394 with four charges. Characteristic neutral losses of 98 Da, m/z of 841.86 and 817.45, can be observed. The inset in panel a is a magnification of the region of the spectrum containing sequence-specific fragment ions and confirms the identity of the peptide and the sites of phosphorylation as Thr378 and Ser379. The parent ion in panel b had an m/z of 642.8, corresponding to the peptide of residues 188 to 198 with a single phosphorylation. A single neutral loss to an m/z of 593.8 is evident. The sequence-specific fragment ions confirm the identity of the peptide and conclusively indicate the site of phosphorylation as Ser190. In all cases, peaks are labeled according to the fragment ion nomenclature of Roepstorff and Fohlman (50).
|
To confirm that these phosphorylation sites were present in N protein expressed by mammalian cells, the protein was expressed in Vero cells (NVero), digested with trypsin, and analyzed by HPLC-MS. Since the amount of protein obtained from these cells was low, this analysis was performed by use of a more-sensitive mass spectrometer, a Q-ToF instrument. The tryptic digest of NVero was similar to that of NSf9, and putative phosphopeptides were observed with the same masses as measured for NSf9. Operating in data-dependent acquisition mode allowed peptides to be automatically subjected to MS/MS analysis as they eluted from the column, and this included putative phosphopeptides. The MS/MS spectrum of the singly phosphorylated peptide spanning residues 188 to 198 from NVero is shown in Fig. 9b. It is identical to that obtained from NSf9 (data not shown) and confirms that the initial site of phosphorylation in the N-terminal region is Ser192. We also obtained MS/MS spectra for the C-terminal phosphorylation sites. Our data indicated that the same phosphorylation sites were occupied in NVero as were found in NSf9, establishing that NSf9 is a relevant model for phosphorylated N protein and thus could be used for kinetic analysis. N protein produced in E. coli is subsequently referred to as Nnonphos, and N protein produced in Sf9 cells is referred to as Nphos.
Comparison of the affinity of binding of nonphosphorylated and phosphorylated forms of N protein with viral RNA by using surface plasmon resonance. We compared the binding of Nnonphos protein and Nphos protein with two models of the IBV genome. The first was RNA synthesized by in vitro transcription from pCD-61. The second was a synthetic biotinylated RNA, RNAmer 3LeaderTAS, which was identical to the 5' end of IBV mRNA 3 up to, and including, the translation initiation codon for gene 3a. CD-61 RNA was generated by runoff transcription from pCD-61 and hybridized to oligo(dT), which was immobilized on a streptavidin flow cell.
Various concentrations of either Nnonphos or Nphos proteins were passed over immobilized CD-61 RNA. The resultant sensorgrams are shown in Fig. 10, and kinetic analysis is shown in Table 3. The data suggested that there was no significant difference in the binding of Nnonphos protein (Kd, 8.89 ± 0.08 nM) to CD-61 than Nphos protein (Kd, 16.9 ± 0.53 nM), which indicated that both forms of N protein associated with viral RNA with high affinity. The preparation of the CD-61-oligo(dT) complex may also have contained free oligo(dT), which was available to bind to the streptavidin flow cell. However, we found that N protein did not bind to immobilized oligo(dT) (data not shown), suggesting that any binding characteristics of the CD-61-oligo(dT) complex were due to interactions with CD-61 RNA only.
![]() View larger version (16K): [in a new window] |
FIG. 10. Sensorgrams showing the binding of different concentrations (indicated above the appropriate nanomolar concentration) of either phosphorylated (A) or nonphosphorylated (B) N protein to CD-61 RNA by using surface plasmon resonance. RU, response units.
|
|
View this table: [in a new window] |
TABLE 3. Average kinetic values for complexes of phosphorylated and nonphosphorylated N protein with CD-61a
|
![]() ![]() View larger version (43K): [in a new window] |
FIG. 11. Sensorgrams showing the binding of phosphorylated N protein to various biotinylated RNAmers (indicated) by using surface plasmon resonance. RU, response units. The sensorgram for 3LeaderTAS (indicated by an asterisk) is superimposed for reference.
|
![]() View larger version (26K): [in a new window] |
FIG. 12. Sensorgrams showing the binding of nonphosphorylated N protein to various biotinylated RNAmers (indicated) by using surface plasmon resonance. RU, response units. The sensorgram for 3LeaderTAS (indicated by an asterisk) is superimposed for reference. The bottom sensorgram is for comparison of the binding of nonphosphorylated (Nnonphos) and phosphorylated (Nphos) N protein to both 3LeaderTAS and Random90.
|
|
View this table: [in a new window] |
TABLE 4. Average kinetic values for complexes of phosphorylated and nonphosphorylated N protein with target RNAmersa
|
Core sequence determines association rate. We investigated the effect of the core sequence on RNA binding by using two different approaches: by scrambling the core sequence from gene 3 (RNAmer 3LeaderSAT) and by replacing eight nucleotides of the Random90 sequence with the gene 3 core site (RNAmer Random90TAS) in the same location as the viral RNA, i.e., at nucleotide 57 of the leader sequence. The mutations introduced into the gene 3 core sequences in RNAmer 3LeaderSAT were the same mutations that abolished transcription in an IBV minireplicon (65) and were thus biologically relevant. Both biotinylated RNAmers were immobilized to the streptavidin flow cell, and then 12.5 nM Nphos protein was passed over these ligands for 360 s. The sensorgrams for 3LeaderSAT and Random90TAS are shown in Fig. 11. The kinetic analysis of the binding experiments is shown in Table 4. Abolition of the core sequence did not appear to significantly affect the affinity of binding of Nphos to the IBV gene 3 leader sequence. However, the addition of the core site to Random90 increased the affinity of binding of Nphos protein to RNA approximately 240 times, mainly due to the increased association rate, indicating that the core site affected the association rate between Nphos protein and RNA but not the dissociation rate. The same trend was observed for the binding of Nnonphos protein to these RNAmers (Table 4; Fig. 11), and the difference in binding affinity by Nnonphos protein and the different RNAmers was less than that for the Nphos protein.
Multiple N protein binding sites are present in leader sequence. The above data indicated that, although the TAS acted to promote high-affinity binding of N protein on a nonviral RNA, when the site was mutated in the leader sequence, there was no significant difference to binding, suggesting that other binding sites may be present. Analysis of the structure of the IBV leader sequence indicated that two predicted stem-loops were formed and that the sequences corresponding to these predicted structures were conserved in different strains (66). Where nucleotide differences occurred between strains, a corresponding covariant mutation was present to conserve the appropriate predicted stem-loop structure (66). Predicted stem-loop one (SL1) is formed between nucleotides 7 and 30, and predicted stem-loop two (SL2) is formed between nucleotides 34 and 48. Although a third stem-loop is formed when predicting the structure of the leader sequence, this feature is not present when the analysis is extended in the 3' direction (P. Britton, personal communication). Therefore, we investigated the role of the two conserved predicted stem-loop regions upstream of the core element in binding phosphorylated N protein. Appropriate regions of the leader sequence were replaced with the corresponding nucleotides from the random nonviral RNA. All subsequent experiments were conducted with Nphos protein, as this protein could distinguish between nonviral and viral RNA and probably represents the predominant form of N protein in the infected cell and virion (31).
Replacement of sequence downstream of the TAS with random RNA (RNAmer SL1SL2TASrand) did not affect the binding affinity of N protein when compared to 3LeaderTAS (Kd, 2.59 and 2.82, respectively) (Table 4; Fig. 11). The association rate was three times slower. The binding affinity of Nphos protein on SL1SL2rand was also less (Kd, 11.7) when compared to SL1SL2TASrand (Kd, 2.59), which is a reflection of the lower association rate, again indicating that the TAS contributes to association rate (e.g., Random90TAS has the fastest association rate for the binding of Nphos protein).
SL2 can act as a high-affinity N protein binding site. Replacement of SL1 with random RNA (RNAmer DSL1SL2TAS) did not appear to significantly affect the binding affinity (Kd, 2.42) when compared to the leader RNA (Kd, 2.82), although the disassociation rate was slower (Table 4; Fig. 11). The association rate of Nphos protein with RNA was lower with DSL1SL2 compared to the leader RNA, and the binding affinity was approximately five times less. Interestingly, the data indicated that SL2 by itself could act as a high-affinity binding region for N protein, as replacement of flanking sequences with random RNA gave a binding affinity with a Kd of 1.51.
Role of SL1 in N protein binding. SL2 and sequences downstream of the TAS were replaced with random RNA, forming RNAmer SL1TASrand, so the RNA contained only SL1 and a TAS. Nphos protein bound to this RNA with higher affinity (Kd, 0.63) than to the leader RNA (Table 4; Fig. 11). Unfortunately, an RNAmer containing SL1 only proved difficult to synthesize, and thus, we were unable to determine the binding affinity of Nphos protein to SL1. However, the binding affinity of Nphos protein to SL1TASrand is greater than to RandomTAS, suggesting that SL1 can promote high-affinity binding.
Binding of Nphos protein to antileader RNA. In coronavirus-infected cells, negative-sense copies of sgRNAs are present and contain antileaders (60). These negative-strand RNAs can act as templates for the synthesis of positive-strand sgRNAs (24, 54-57, 59). We investigated the binding of Nphos protein to antileader RNA. This RNAmer was the exact anticopy of 3LeaderTAS, except that it was one nucleotide shorter because the synthesis process could not terminate with a U (Invitrogen). The kinetic analysis (Table 4; Fig. 11) indicated that Nphos protein formed a high-affinity association with antileader (2.17 nM).
|
|
|---|
![]() View larger version (38K): [in a new window] |
FIG. 13. Amino acid sequence of the IBV Beaudette strain N protein (5) showing the sites of phosphorylation and disulfide bridge formation.
|
Although our principal aim with mass spectroscopy was to investigate the phosphorylation status of N protein, analysis revealed several other potential modifications. We found that the N-terminal region of N protein may be acetylated. Acetylation in the N-terminal region of both cellular and viral proteins is a common posttranslational modification, e.g., the phosphoprotein (P) of some paramyxoviruses (62). Although acetylation of proteins has a number of different functions, it is also involved in transcriptional regulation, e.g., acetylation of human immunodeficiency virus Tat protein (43) and CREB-mediated gene expression (33). Acetylation of the coronavirus N protein could therefore play a role in the transcriptional regulatory activity the protein may display. N protein from the three different sources had identical disulfide bridges, formed between residues Cys281 and Cys308 and Cys320 and Cys323. Analysis of N protein by sodium dodecyl sulfate-polyacrylamide gel electrophoresis analysis under native conditions indicated that the predominant species in solution were both monomeric and multimeric (but not dimeric), whereas under reducing conditions, N protein was present as a monomer only (12). Although these bridges may have formed during the extraction and purification of the protein, normally when this occurs, disulfide bridge formation occurs sequentially, and therefore, we would have predicted that disulfide bridges would form between Cys228 and Cys281 and Cys308 and Cys320. However, we did not observe this. Although N protein is cytosolic, non-endoplasmic reticulum-based redox pathways can contribute to disulfide bridge formation (61). Disulfide bridge formation may therefore be important in the correct folding of N protein. However, it should be noted that the severe acute respiratory syndrome coronavirus N protein does not contain any cysteine residues and therefore would not contain disulfide bridges (34).
Surface plasmon resonance (Biacore) has not previously been applied to the study of the interaction of coronavirus N protein with coronavirus RNA. Therefore, we used this technique to investigate the interaction of nonphosphorylated and phosphorylated IBV N protein with a variety of viral and nonviral target RNAs. In the case of the biotinylated RNAmers, we were able to examine the interaction of N protein with target RNAs at near physiological temperature (37°C). We examined the binding of nonphosphorylated and phosphorylated N protein on both the gene 3 leader sequence and CD-61. There was no significant difference in the binding of nonphosphorylated and phosphorylated N protein to leader RNA or a defective interfering RNA (DI-RNA). Our results are in contrast to those of Cologna et al. (14), who found that the bovine coronavirus (BCoV) N protein bound more efficiently to a BCoV minireplicon than to the BCoV leader sequence. This discrepancy may be due to the differences between filter binding assays used by Cologna et al. (14) and surface plasmon resonance (45) used in our study, or it may be an intrinsic property of the different coronavirus N proteins. However, our results suggested that phosphorylated N protein had 264 times greater affinity for the leader sequence than a nonviral RNA. N protein failed to bind to an oligomer with 25 T residues, indicating that sequence context may be important for nucleotide binding. The differences in the amount of N binding to RNA, the source of N protein, and the kinetics of this interaction may account for why previous studies could not resolve whether N bound either specifically to viral RNA or bound with equal affinity to both viral and nonviral RNA.
Nelson et al. (41) measured the binding affinity of MHV N protein (isolated from E. coli and therefore nonphosphorylated) to the MHV leader sequence and obtained a Kd of 14 nM. Although the binding affinity of IBV Nnonphos protein to the IBV leader (Kd, 0.66 nM) is greater than that of MHV N protein to MHV leader, this may be a reflection of the differences in recombinant N protein preparation (His tag in the case of IBV N protein versus glutathione S-transferase fusion for MHV N protein) and kinetic measurement (surface plasmon resonance versus filter binding assay). With that said, the binding affinity of MHV N protein to MHV leader (41) and IBV N protein to IBV leader is very strong and thus probably reflects a general pattern for coronaviruses. Consideration should be given to the fact that N protein in virions and the infected cell is exclusively phosphorylated and is probably not nonphosphorylated (31). A phosphoamino acid analysis of the arterivirus (a member of the Nidovirales) porcine reproductive and respiratory syndrome virus nucleocapsid protein revealed that this protein was also exclusively phosphorylated, and no nonphosphorylated species were found (80). Therefore, care should be taken when interpreting the results obtained with nonphosphorylated N protein and drawing biologically relevant conclusions, as this form of N protein is probably not present in virus-infected cells. We favor the hypothesis that the role of phosphorylation is to decrease the affinity of N protein for random RNA so that N protein associates with viral RNA in preference to cellular RNA.
Comparison of the binding affinity between IBV N protein and RNA and other RNA binding proteins also revealed that N protein had high-affinity binding. Using surface plasmon resonance, Park et al. (45) determined the binding affinity for human neuron-specific RNA binding (HuD) protein with AU-rich target biotinylated RNAs was approximately 0.7 nM. In this experiment, recombinant HuD protein was C-terminally His tagged and purified from E. coli and presumably nonphosphorylated. The binding affinity of IBV Nnonphos and Nphos protein to leader RNA is similar.
The principal function of N protein in terms of the infectious cycle of the virus can perhaps be viewed as binding to viral RNA to form the ribonucleocapsid structure, part of the core particle. In a number of viruses, viral capsids can form in the absence of RNA and instead can be replaced with either cellular RNAs or suitable polyanions; this has been perhaps best characterized in the encapsidation and assembly of nodavirus particles (29, 58, 73). The high affinity of phosphorylated N protein for viral RNA over nonviral RNA together with appropriate packaging signals (16, 18, 20, 37) may explain why virions are composed of viral RNA (48).
Using the phosphorylated form of N protein, we found that the IBV gene 3 core sequence contributed to the kinetics of RNA binding. Introducing a core sequence into nonviral RNA (Random90) increased N binding, with the major change being in the association rate. Scrambling the core sequence in the context of the leader RNA resulted in only a small decrease in binding affinity, suggesting that the core sequence was not the only N protein-binding site present on leader RNA. Stirrups et al. (66) described two conserved predicted stem-loops that were present upstream of the core sequence in the leader sequence. By replacing these sequences with random RNA (from Random90), that did not promote high-affinity binding, we showed that the sequences that form predicted SL1 and predicted SL2 both contributed to high-affinity binding of phosphorylated N protein to leader RNA. However, it should be noted that optimum RNA ligand binding sequences are not necessarily the same as those found in vivo (78). The high-affinity binding of N protein to the stem-loop structures may suggest that N protein could be a general double-stranded RNA binding protein. However, structural analysis of RNAmer Random90 indicated that this molecule formed stem-loop structures (data not shown), and in the case of the phosphorylated N protein, we observed low-affinity binding with this target compared to viral RNA, suggesting that, in general, N protein did not specifically recognize double-stranded RNA.
Such stem-loop structures have also been identified in the 5' end of several other coronaviruses, including BCoV DI-RNA (10, 11) and MHV (75), and these sites may therefore also act as high-affinity sites for N protein. However, with that said, in their kinetic analysis, Stohlman et al. (68) located only one high-affinity binding site on the MHV leader RNA that bound MHV N protein, and this site encompassed the MHV core element. But discrepancies between our findings with IBV leader sequence and IBV N protein and the MHV leader sequence and MHV N protein, may be a reflection of the inherent properties of the molecules themselves or of the different kinetic analysis used.
The high-affinity association of phosphorylated N protein with the leader sequence and TAS may have implications for viral translation and possibly replication. If coronavirus mRNAs are translated in a cap-dependent manner, in that they initiate translation at the 5' end of the leader sequence, then the ribosome may be in conflict with the N protein when it scans along the leader sequence for the translation start codon. However, Tahara et al. (70) have speculated that N protein enhances the translation of MHV mRNAs. In this case, one may hypothesize that by binding to the leader RNA, N protein may reduce the secondary structure to increase the efficiency of translation.
N protein may also bind, with high affinity, to the TAS elements located along the coronavirus genome, and one may therefore speculate that this interaction is involved in the efficiency of transcription of (minus) sgRNAs. Also, N protein may bind to sites that are very similar in sequence to the core element (44) to facilitate transcription at these sites. Although the caveat is that N protein is not required to rescue all of the full-length coronavirus clones so far described, the presence of N protein does increase the rescue efficiency. An alternative is that the TAS sites act as nucleation sites for N protein to facilitate the encapsidation of virus RNAs. Such binding sites may act separately from the packaging signal that promotes a specific interaction with the N protein and the genomic RNA (18, 37) and presumably is required for the efficient packaging of the genomic RNA, but not sgRNAs, into core particles. MHV genomic RNA has been reported to be incorporated into the virion in the absence of N protein (38).
In their review, Laude and Masters (31) described how several workers speculated that phosphorylation of N protein may determine its binding affinity with RNA. Our data indicated that phosphorylation of N protein decreased the affinity of N protein for random RNA. Two possible mechanisms may account for this regulation. Phosphorylation of N protein may alter its conformation and thus affect the RNA binding site(s). Stohlman et al. (69) suggested that phosphorylation of MHV N protein could lead to substantial conformational changes in protein structure, and phosphorylation of human immunodeficiency virus type 1 Rev protein was shown to alter its conformation and RNA binding properties (21). Our mass spectroscopic data would support the hypothesis of Stohlman et al. (69), in that phosphorylation occurs proximal to RNA binding domains. Alternatively, phosphorylation of N protein could add negative charge to the RNA binding domains and thus reduce electrostatic interaction with target RNA. A similar mechanism has been proposed to regulate binding of potato virus A coat protein to RNA (26). While this would explain the overall slight binding difference between Nnonphos and Nphos with viral RNA, such an interaction probably does not account for the large difference in binding between Nphos protein and random RNA. Although the role of the conserved core sequence in N protein binding has been known for sometime, the mechanism by which this is achieved was not. Our kinetic study would suggest that the core sequence determines the specificity of IBV N protein binding with viral RNA by increasing the association rate. However, this is not the only site within the IBV leader that promotes high-affinity binding with N protein, and the possibly exists that these structures act in concert to promote optimal binding.
We thank Paul Britton for help in design of the leader RNAmers and Dave Cavanagh (IAH Compton) for the generous donation of pCD-61.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»