Previous Article | Next Article ![]()
Journal of Virology, June 2005, p. 6620-6630, Vol. 79, No. 11
0022-538X/05/$08.00+0 doi:10.1128/JVI.79.11.6620-6630.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Burkhard Ludewig,2
Stuart G. Siddell,3 and
Volker Thiel2*
Institute of Virology and Immunology, University of Würzburg, Würzburg, Germany,1 Research Department, Kantonal Hospital St. Gallen, St. Gallen, Switzerland,2 Department of Pathology and Microbiology, School of Medical Sciences, University of Bristol, Bristol, United Kingdom3
Received 11 November 2004/ Accepted 25 January 2005
|
|
|---|
|
|
|---|
The coronavirus N protein has been described as a multifunctional protein displaying RNA-binding activity, protein-protein interaction (specifically with the M protein), and the ability for self-association (22, 29, 30). Clearly, many of these features will reflect the structural role of the N protein in the virus particle. However, there are also some observations that suggest the N protein may have a role in viral RNA synthesis. First, several studies have provided evidence for the binding of N protein to coronaviral RNA sequences that are involved in the regulation of RNA synthesis. These sequences include the coronavirus leader sequence, transcription regulatory sequences (TRS), and sequences corresponding to the 3' end of coronavirus genomes (2, 10, 11, 24, 34, 48). Second, it has been shown that in addition to a cytoplasmic distribution within the host cell, at least a fraction of the coronavirus N protein colocalizes with replicative proteins at the sites of viral RNA synthesis early in infection (12, 45, 55). Third, it has been demonstrated that there is a requirement for sustained translation of the N protein in trans or in cis for optimal replication of bovine coronavirus DI RNA and transmissible gastroenteritis virus-derived replicons (1, 9). Fourth, we and others have reported that the recovery of recombinant coronaviruses can be facilitated under conditions that allow sustained N protein expression (7, 59). And fifth, heterologous gene expression from coronavirus-based multigene vectors is greatly improved by cotransfection of vector RNA with a synthetic mRNA encoding the N protein (52).
Coronavirus RNA synthesis can be divided formally into two processes: replication of full-length virus genomes and transcription of subgenomic mRNAs. Genome replication is mediated through a full-length minus-strand copy of the genome that serves as a template for the production of progeny virus genomes. Coronavirus transcription is a complex process involving the discontinuous synthesis of up to eight minus-strand RNAs of subgenomic size which contain sequences corresponding to the 5' and 3' ends of the genome and serve as templates for the synthesis of a 5'- and 3'-coterminal set of subgenomic mRNAs (40, 47). According to the current model of coronavirus discontinuous extension, the synthesis of nascent minus-strand RNA initiates at the 3' end of the plus-strand genomic template and proceeds to TRS elements that are located at various sites within the virus genome (39, 41, 61). At these sites, a proportion of the nascent minus-strand RNA is translocated to the 5' leader sequence of the genome, and subsequently a minus-strand copy of the leader is added (42, 57, 61). Thus, the TRS elements determine the fusion sites of leader and "body"-derived sequences (5'- and 3'-end-derived sequences, respectively) of the subgenomic RNAs, and the number of TRS elements correlates with the number of subgenomic mRNAs produced by a particular coronavirus.
In both replication and transcription, proteins encoded by the replicase gene are critically involved. The coronavirus replicase proteins are synthesized as large polyproteins that are extensively processed by virus-encoded proteinases to produce a functional replicase/transcriptase complex (60). It has been suggested that his complex comprises so-called scaffolding proteins, together with proteins that have enzymatic functions, including RNA-dependent RNA polymerase, helicase, and uridylate-specific endoribonuclease (NendoU), and putative exonuclease and 2'-O-ribose methyl transferase activities (44). In addition, the active replicase/transcriptase complex may contain other proteins of viral or cellular origin (17, 21, 26, 27, 43, 44). These proteins might include the N protein. On the other hand, Snijder and colleagues have shown that all structural proteins of the closely related arterivirus equine arteritis virus, including the N protein, are dispensable for genome replication and subgenomic RNA synthesis (25); we have shown that the viral replicase gene products alone suffice for coronavirus transcription (50).
In this paper, we used a reverse genetic approach to study N protein function in the context of viral RNA synthesis. Our data suggest a role for the coronavirus N protein in virus replication. We show that efficient, sustained replication of coronavirus-based vector RNAs only takes place if the N protein is expressed by the vector RNA. We also show that the level of N protein expression determines the level of vector RNA-mediated reporter gene expression, which again is consistent with a role for N in vector RNA replication. And finally, by constructing and monitoring the replication of recombinant, mutated coranavirus genomes, we provide genetic evidence that sustained expression of the N protein facilitates replication of the genomic RNA.
|
|
|---|
Cloning of plasmid DNAs and recombinant vaccinia viruses. The isolation of recombinant vaccinia viruses vHCoV-vec-1, vHCoV-vec-CLG, and vHCoV-vec-GN has been described previously (15, 50, 52). The isolation of recombinant vaccinia virus vHCoV-SfiI is based on the recombinant vaccinia virus vHCoV-inf-1, which contains the full-length cDNA of the human coronavirus 229E (HCoV-229E) genome (49). The first step, the generation of recombinant vaccinia virus vRec-1, has been described previously (15). The genome of vaccinia virus vRec-1 contains a cDNA insert corresponding to HCoV-229E nucleotides (nt) 1 to 21145, followed by a 2.1-kbp DNA fragment containing the Escherichia coli guanine phosphoribosyltransferase gene (gpt) under the control of a vaccinia virus promoter and a cDNA fragment corresponding to the 3' end of the HCoV-229E genome, starting at nt 24646. The second step involved homologous recombination of vRec-1 DNA with pHCoV-SfiI DNA and the isolation of vaccinia virus vHCoV-SfiI using gpt-negative selection in D980R cells. To clone the plasmid pHCoV-SfiI, a PCR product corresponding to HCoV-229E nt 20478 to 24091 was generated using primers SacI-20500up (5'-ACGTGAGCTCGGTGCTTAGTCTTGTTAGGAGTGG-3') and StopS-Sndown (5'-ACGTGCGGCCGCGGCCATTACGGCCTTACTGTATGTGGATCTTTTCAACG-3') and vHCoV-inf-1 DNA as a template. The PCR product was cleaved with SacI and NotI and cloned in pBluescriptII KS(+) (Stratagene). The resulting plasmid DNA was cleaved with NotI and HindIII, and a NotI-HindIII DNA fragment derived from NotI-HindIII-cleaved vaccinia virus vNotI/tk (23) genomic DNA (corresponding to the DNA sequence located downstream of the 3' end of the HCoV-229E genome in the vaccinia virus vRec-1 genome) was inserted. The identity of pHCoV-SfiI was confirmed by sequencing analysis and used for homologous recombination with vRec-1 DNA to generate the recombinant vaccinia virus vHCoV-SfiI. Note that the HCoV-229E-derived sequence of vHCoV-SfiI encompasses the HCoV-229E nt 1 to 24091 (nt 24089 to 24091 represent the stop codon of the HCoV-229E S gene), followed by a SfiI restriction site which is unique in the vHCoV-SfiI genome.
To construct the plasmid pMHV-N, cDNA was amplified by RT-PCR using primer Sac-MN-ATG (5'-ACGTAGACTCACCATGTCTTTTGTTCCTGGGCAAGAAAATGC-3'), primer Bam-MN-stop (5'-ACGTGGATCCTTACACATTAGAGTCATCTTCTAACC-3'), and vMHV-inf-1 DNA (9a). as a template. The resulting PCR product, which contains the mouse hepatitis virus (MHV) N gene, was cleaved with SacI and BamHI and cloned into pBluescriptII KS(+).
To construct the plasmid pTRE-HN, the HCoV-229E N gene was amplified by PCR using the oligonucleotide primer SacII-HN-ATG (5'-ACGTAGAGCTCACCATGGCTACAGTCAAA TGGGCTGATGC-3', primer Bam-HN-Stop (5'-ACGTGGATCCTTAGTTTACTTCATCAATTATGTCAG-3'), and vHCoV-inf-1 DNA as a template. The PCR product was cleaved with SacII and BamHI and ligated into the SacII- and BamHI-cleaved plasmid pTRE DNA.
Generation of synthetic N protein cDNA templates for in vitro transcription. Plasmid pME (49) was used as template to generate a PCR product containing the HCoV-229E N gene using primers Nup (5'-ACGTAATACGACTCACTATAGGGACGAAACCATGGCTACAGTC AAATGGGCTG-3') and Ndown (5'-TTTTTTTTTTTTTTTTTTTTCAAACAACACAGTGGCA TGTTTAG-3'). The PCR product was used as template for in vitro transcription of a mRNA encoding the HCoV-229E N protein. Plasmid pMSN was a kind gift of M. Ackermann and K. Tobler, Department of Veterinary Medicine, University of Zürich, Zürich, Switzerland, and contains the porcine epidemic diarrhea virus (PEDV) N gene located downstream of a bacteriophage T7 RNA polymerase promoter. pMSN was linearized with NotI prior to in vitro transcription of a synthetic mRNA encoding the PEDV N protein. Plasmid pMHV-N was linearized with XhoI prior to in vitro transcription of a synthetic mRNA encoding the MHV N protein.
Generation of a stable cell line expressing the HCoV-229E N protein. To generate a stable cell line expressing the HCoV-229E N protein, BHK-21 cells were transfected with 5 µg of plasmid pcEFTet-On/NEO (37) and a stable cell line, designated BHK-Tet/On, which expresses the Tet-activator protein rtTA, was selected using G418 (400 to 800 µg/ml). BHK-Tet/On cells were transfected with 2.5 µg of plasmid pTRE-HN and 2.5 µg of plasmid pTK-Hyg (Clontech); a stable cell line, BHK-HCoV-N, expressing the HCoV-229E N protein in the presence of doxycyclin, was selected with hygromycin B (300 µg/ml). N protein expression in BHK-HCoV-N cells was analyzed by Western blotting using murine monoclonal antibody NG12 (16) as primary antibody and NA 931 (sheep anti-mouse immunoglobulin G, peroxidase-linked; Amersham) as secondary antibody in combination with ECL Western blotting detection reagents (Amersham), according to the manufacturer's instructions.
Generation of recombinant HCoV-229E cDNA containing randomized sequences. A DNA fragment containing the HCoV-229E-derived nt 24097 to 27277 with a randomized sequence at HCoV-229E nt 25671 to 25673 was generated. Two PCRs were carried out with plasmid pME as template and primers (i) Oli 17 (5'-TGTGGTGAGTATGTTGCT-3') and EspTRSNr-down (5'-ACGTCGTCTCTTCAGNNNAGAAAAAATGAAGCAATCTTTCGTTTTCTGTC-3'; N represents any nucleotide) or (ii) EspTRSNr-up (5'-ACGTCGTCTCCCTGAACGAAAAGATGGCTACAGTCAAATGGGC-3') and Hend-polyA (5'-TTTTTTTTTTTTTTTTTTTTGTGTATCCATATCGAAACCGTTCC-3'). Both PCR products were cleaved with Esp3I, then ligated in vitro, and used as a template for another PCR with primers Sfi4a-up (5'-ACGTGGCCGTAATGGCCCTAGGTTTGTTCACATTGCAACTTGTG-3') and Hend-polyA. The resulting PCR product contains HCoV-derived nt 24097 to 27277 (with randomized nt 25671 to 25673) plus a SfiI cleavage site and a synthetic poly(A) sequence encoded upstream and downstream, respectively. After cleavage with SfiI, the DNA fragment was ligated in vitro with Sfi-cleaved genomic DNA from purified recombinant vaccinia virus vHCoV-SfiI, and the resulting ligation product was used as template for bacteriophage T7 RNA polymerase-driven in vitro transcription.
In vitro transcription and electroporation. In vitro transcription using bacteriophage T7 RNA polymerase in the presence of an m7G(5')ppp(5')G cap analog was done as described previously (49). Vector RNA (15 µg) or full-length recombinant HCoV-229E RNA was electroporated as previously described (50) into BHK-HCoV-N or BHK-21 cells with or without 5 µg in vitro-transcribed RNA encoding the HCoV, PEDV, or MHV N proteins. A synthetic mRNA encoding a truncated E. coli ß-galactosidase protein has been described previously (54), and 5 µg was used as a negative control for electroporation.
Reporter protein analysis. Reporter protein expression was analyzed 3 days postelectroporation. Green fluorescent protein (GFP) expression was analyzed by fluorescence microscopy with a Leica DM IL fluorescence microscope and flow cytometry was analyzed wtih a FACSCalibur and CellQuest software (BD Biosciences). Sorting of green fluorescent BHK-21 cells that were transfected with HCoV-vec-1 RNA and HCoV-229E-N mRNA was done using a FACSVantage Instrument (BD Biosciences). Firefly luciferase expression was analyzed using the luciferase reporter gene assay (Boehringer, Mannheim, Germany), as described by the manufacturer.
RNA preparation, Northern blotting, RT-PCR, and sequencing analysis. Poly(A)-containing RNA was isolated from BHK-21 cells using oligo(dT)25 Dynabeads (Dynal, Oslo, Norway) as previously described (53). Northern blot analysis involved electrophoresis and transfer to nylon membranes as previously described (51). To detect HCoV-229E-specific RNAs, a 32P-labeled probe corresponding to the HCoV-229E nucleotides 26297 to 27273 was produced using the Multiprime DNA-labeling system (Amersham). Reverse transcription-PCR (RT-PCR) was done with Superscript II reverse transcriptase (Invitrogen) as described previously (53). To amplify the region containing the HCoV-229E TRS-N region by RT-PCR, in vitro transcripts or poly(A)-containing RNA from transfected cells was used as a template for reverse transcription with primer 25950-down (5'-GCATCTTTATGGGGTCCTGTGCC-3'). The RT products were used as templates to amplify the TRS-N region corresponding to the HCoV-229E genome and subgenomic N protein mRNAs using (semi)nested PCR protocols. To amplify the TRS-N region corresponding to recombinant HCoV-229E genomes, primers 25500-up (5'-CATGACAGTTGCCGTGCCGAGCAC-3') and 25900-down (5'-TGACAAATCCACCCGTTTGCCCT-3') were used for the first PCR; primers 25630-up (5'-CTGCAGTGAGCTCTCCCATGAGCA-3') and 25820-down (5'-GTCTTTCTTGTTGATGGGTACC-3') were used for the nested PCR. To amplify the leader-body junction of N protein mRNAs, the leader-specific primer 25-up (5'-CTTAAGTACCTTATCTATCTACAGATAG-3') and primer 25900-down were used for the first PCR, and primers 25-up and 25820-down were used for the seminested PCR. The resulting PCR products were analyzed directly by sequencing using primer 25820-down or cloned into plasmid pGem-T (Promega). The resulting plasmid clones have been sequenced to determine the sequence of individual genomic or subgenomic mRNAs at the TRS-N region. Sequencing analysis of plasmid DNA, RT-PCR products, and the recombinant vaccinia virus cDNA inserts was done by standard cycle sequencing methods with an ABI 310 Prism Genetic Analyzer. Computer-assisted analysis of sequence data was facilitated by the Lasergene bio-computing software (DNASTAR).
|
|
|---|
![]() View larger version (19K): [in a new window] |
FIG. 1. Structure of recombinant HCoV-229E and vector RNAs. (A) The structural relationship and sizes of HCoV-229E and vector RNAs used in this study are shown. Open reading frames are indicated as boxes designated by encoded gene products. L, leader RNA sequence; An, poly(A) sequence. (B) The structure of recombinant HCoV-229E genomes with a randomized N gene TRS is shown (bottom). Also shown are the cDNAs that have been ligated in vitro (at the SfiI site) to produce a full-length HCoV-229E cDNA template for in vitro transcription.
|
![]() View larger version (16K): [in a new window] |
FIG. 2. Transfection of vector RNAs. Vector RNA HCoV-vec-GN or HCoV-vec-1 (each, 15 µg) was transfected into BHK-21 cells with or without synthetic mRNAs (5 µg) encoding the HCoV-229E, PEDV, or MHV N proteins or a truncated ß-galactosidase protein as indicated. Three days posttransfection, the percentage of green fluorescent cells was analyzed by flow cytometry. Each column represents the mean value of three independent experiments.
|
Next, we asked whether the N proteins of other coronaviruses could provide the same trans-active function(s) as the HCoV N protein in the system described above. Thus, we produced two further synthetic mRNAs encoding the MHV and PEDV N proteins, respectively. As shown in Fig. 2, cotransfection of HCoV-vec-1 RNA with mRNAs encoding the MHV or PEDV N proteins resulted in <0.1 and 0.9% green fluorescent cells, respectively. We conclude that only the PEDV N protein mRNA gave rise to an increased number of green fluorescent cells, although less effectively than the HCoV-229E N protein mRNA. Notably, PEDV is a group I coronavirus and closely related to HCoV-229E (6, 20), whereas MHV is a group II coronavirus and more distantly related to HCoV-229E.
Detection of HCoV-229E vector-specific RNAs by Northern blot analysis. In addition to measuring the levels of GFP expression, we also used a 32P-labeled probe, corresponding to the 3' end of the HCoV-229E genome, to visualize the vector-specific RNAs produced in transfected cells. Despite using probes of high specific activity, we were not able to detect any RNAs from cells that were transfected with HCoV-vec-1 or HCoV-vec-GN vector RNA alone. After cotransfection of HCoV-229E N mRNA with HCoV-vec-1 RNA, we could detect a faint signal for a subgenomic mRNA encoding GFP (data not shown). This contrasted with the cotransfection of HCoV-229E N mRNA with HCoV-vec-GN RNA, where we could easily detect the vector RNA and two mRNAs encoding GFP and N protein (Fig. 3, lane 11). In order to increase the sensitivity of RNA analysis from cells that have been transfected with HCoV-229E N mRNA and HCoV-vec-1 RNA, we sorted the green fluorescent cells prior to the isolation of poly(A)-containing RNA. In this case, an RNA that corresponds to a subgenomic mRNA encoding GFP was easily detectable (Fig. 3, lane 7), but we were unable to detect full-length HCoV-vec-1 RNA (Fig. 3, lane 10), even after prolonged autoradiography.
![]() View larger version (47K): [in a new window] |
FIG. 3. Northern blot analysis. Poly(A)-containing RNA was isolated from HCoV-229E-infected MRC-5 cells (lane 1), BHK-21 cells (lane 6), BHK-21 cells that have been transfected with HCoV-229E N protein mRNA and HCoV-vec-1 RNA (lanes 7 and 10), BHK-21 cells that have been transfected with HCoV-229E N protein mRNA and HCoV-vec-GN RNA (lane 11), or BHK-21 cells that have been transfected with HCoV-229E N protein mRNA, HCoV-vec-CLG RNA, and HCoV-229E genomic RNA (lane 12). The RNA was analyzed by Northern blotting as described in Materials and Methods. Lane 2, no RNA; lanes 3 and 8, 1 ng of in vitro-transcribed HCoV-vec-1 RNA; lanes 4 and 9, 1 ng of in vitro-transcribed HCoV-vec-CLG RNA; lane 5, 1 ng of in vitro-transcribed HCoV-229E N protein mRNA. Lanes 8, 9, and 10 represent lanes 3, 4, and 7, respectively, after prolonged autoradiography. Genomic and subgenomic RNAs of HCoV-229E (RNA1 to -7) and HCoV-229E vector RNAs (arrows) are indicated.
|
These results show that vector RNAs encoding a functional N gene were able to transcribe and replicate RNA, whereas vector RNAs lacking the N gene were able to transcribe RNA but replication of full-length vector RNA was not detectable. However, these results cannot rigorously rule out the possibility that the N gene provides a cis-acting RNA signal that enhances replication. To address this question, we coelectroporated the HCoV-vec-CLG vector RNA (lacking the N gene) with recombinant full-length virus genome and an mRNA encoding the N protein into BHK-21 cells. In this case, rapidly available N protein was supplied by translation of the cotransfected N protein mRNA and sustained N protein expression was mediated by the full-length genomic RNA in trans. Again, 3 days posttransfection we analyzed poly(A)-containing RNA from transfected cells by Northern blotting. As shown in Fig. 3, lane 12, the full-length HCoV-vec-CLG RNA is readily visible, indicating that replication of vector RNA in the presence of replicating HCoV-229E virus had occurred. In summary, our data indicate that initially, after transfection of vector RNAs, the N protein (provided by cotransfected N protein mRNA) may be important for the establishment of a replicase/transcriptase complex; however, sustained N protein expression (e.g., if the N protein was encoded by the vector RNA itself or provided in trans by replicating virus) was required for efficient replication.
The level of HCoV-229E vector-mediated reporter gene expression correlates with the level of HCoV-229E N protein expression. The experiments described above demonstrate that cotransfection of an HCoV-229E N protein mRNA with HCoV-229E vector RNAs increases the number of green fluorescent cells. Even if the N protein is encoded by the vector RNA, cotransfection of an N protein mRNA is required, indicating that the N protein must provide a function that facilitates the establishment of a functional transcriptase or replicase complex early after transfection. To determine whether this effect was dose dependent, we produced a stable, regulatable BHK-21-derived cell line expressing the HCoV-229E N protein. This cell line, designated BHK-HCoV-N, is based on the Tet/ON system. The analysis of N protein expression by Western blotting revealed that the HCoV-229E N protein can be expressed to different levels with different concentrations of doxycyclin in the tissue culture medium (Fig. 4A). We have used this cell line for the transfection of 15 µg in vitro-transcribed HCoV-vec-CLG RNA and analyzed the vector-mediated expression of GFP and firefly luciferase 3 days posttransfection. The average percentage of green fluorescent cells in these experiments was approximately 3%, independent of the doxycylin concentration used in the medium (data not shown). This indicates first that cotransfection of an mRNA encoding the N protein with the vector RNA is no longer needed in N protein-expressing cells, and second, that the level of N protein expression did not influence the number of green fluorescent cells. However, as shown in Fig. 4B, the level of N protein expression did influence the level of reporter protein expression. At doxycyclin concentrations of 0.1 and 1 µg/ml, the firefly luciferase expression was about 30-fold higher than levels observed with HCoV-vec-CLG RNA-transfected cells in the absence of doxycyclin. Thus, although the overall number of green fluorescent cells did not change, the level of reporter protein expression increased at higher levels of N protein expression.
![]() View larger version (16K): [in a new window] |
FIG. 4. Firefly luciferase reporter protein analysis in BHK-HCoV-N cells. (A) Western blot analysis of BHK-HCoV-N cells. Each lane corresponds to a cytoplasmic lysate derived from 2 x 105 BHK-HCoV-N cells that were cultivated for 2 days in medium without or with 0.01, 0.1, or 1 µg/ml doxycyclin (Dox), as indicated. For comparison, a Western blot analysis of cytoplamic lysate derived from 2 x 105 HCoV-229E-infected MRC-5 cells is shown. (B)A total of 15 µg of HCoV-vec-CLG vector RNA was transfected into 107 BHK-HCoV-N cells. After transfection, the cells were split into four wells containing medium without or with 0.01, 0.1, or 1 µg/ml Dox. Luciferase activity was analyzed 3 days posttransfection. Results are shown for three independent experiments. In each experiment, luciferase activity of transfected BHK-HCoV-N cells that were cultivated without Dox were set as 1 relative unit. Bars indicate 95% confidence intervals.
|
The fusion of leader and body-derived sequences of the subgenomic HCoV-229E N protein mRNA has been determined to occur at the TRS-N sequence 25668UCUAAACU25675 (the numbers indicate HCoV-229E nucleotide positions) within the HCoV-229E genome (14). We made use of our full-length cDNA clone of HCoV-229E and modified the TRS-N to 25668UCUNNNCU25675, where N represents any nucleotide (Fig. 5A). The stretch of three random nucleotides within the TRS-N was introduced into a full-length cDNA of HCoV-229E by PCR and in vitro ligation as described in Materials and Methods. Following in vitro transcription, a population of (theoretically) 64 different full-length recombinant HCoV-229E genomes was produced. These RNA molecules were then transfected into BHK-21 cells (which are not susceptible to HCoV-229E infection), and 3 days later poly(A)-containing RNAs were isolated. We then compared the sequences of the input RNA genomes with those of the reisolated RNA genomes at the TRS-N region. To do this, DNA fragments containing the TRS-N region were amplified by RT-PCR from the RNA genomes, and a consensus sequence was determined. As shown in Fig. 5B, the reisolated RNA genomes have clearly undergone selection, since the nucleotides at the randomized positions had shifted to a predominance of adenines. However, there was also a dominant uridine peak detectable, corresponding to HCoV-229E nt 25673. To determine the sequences of individual RNA genomes, we cloned the RT-PCR products and sequenced a total of 44 plasmid DNAs derived from the input RNA population and 41 plasmid DNAs derived from the reisolated RNA population. The result of this analysis is summarized in Fig. 5C. HCoV-229E genomes that contained the authentic TRS-N sequence were represented at 4.5% in the input RNA population, and the percentage of these molecules increased to 9.8% in the reisolated RNA population. Similarly, the percentage of genomes that contained a uridine at position 25673 was 20.5% of the input RNAs and 41.5% of the reisolated RNAs. Thus, these two groups had obviously undergone positive selection during amplification in BHK-21 cells. Sequences that contained one nucleotide difference, compared to the authentic TRS-N (25668UCUAAACU25675) or leader-TRS (62UCUCAACU69), remained approximately at the same level (31.8% and 29.3% for input and reisolated RNAs, respectively). Other sequences that did not match to the groups mentioned above had presumably undergone negative selection in BHK-21 cells, since their percentage dropped from 43.2% in the input RNAs to 19.5% in the reisolated RNAs.
![]() View larger version (28K): [in a new window] |
FIG. 5. Sequencing analysis of recombinant HCoV-229E genomes with a randomized N gene TRS. (A) HCoV-229E nt 25663 to 25680 containing the core sequence of the N gene TRS (boxed) are shown together with the structure and size of the HCoV-229E genome. The HCoV-229E nt 25671AAA25673 within the TRS-N core sequence (underlined) was changed to a random sequence indicated as NNN. (B) An RT-PCR sequencing analysis of in vitro-transcribed HCoV-229E-based genomes with randomized HCoV-229E nt 25671 to 25673 is shown (input genomes) (left). These genomes were used for transfection of BHK-21 cells. Three days later, the poly(A)-containing RNA was isolated and analyzed by RT-PCR sequencing (reisolated genome) (right). Sequencing results are shown for HCoV-229E nt 25663 to 25680, and the region corresponding to the randomized sequence is underlined. (C) RT-PCR products obtained from input genomes or reisolated genomes were cloned and sequenced. The analysis comprised 44 individual plasmid clones corresponding to input genomes and 41 individual plasmid clones corresponding to reisolated genomes. On the basis of the sequence determined at randomized nt 25671 to 25673, the recombinant genomes were placed in one of four groups: group 1, recombinant genomes with the HCoV-229E wild-type sequence (AAA); group 2, recombinant genomes with a 1-nt change compared to the TRS-N or leader-TRS sequence; group 3, recombinant genomes containing a U nucleotide at HCoV-229E position 25673 (NNU); group 4, sequences that do not match to groups 1 to 3. The percentages of each group in the population of input and reisolated genomes are indicated.
|
![]() View larger version (50K): [in a new window] |
FIG. 6. Sequence analysis of subgenomic N protein mRNAs. (A) The structure and size of the recombinant HCoV-229E genome with randomized nt 25671 to 25673 are shown together with the core sequences of the leader TRS and TRS-N. Numbers indicate nucleotide positions within the HCoV-229E genome. (B) The leader-body junction sequences of subgenomic N protein mRNAs are shown, determined after leader-body-specific RT-PCR sequencing analysis. Leader and body-derived nucleotides are shown in gray and black letters, respectively. For each subgenomic N protein mRNA, the region of the TRS-N core sequence (box) and the sequence corresponding to the randomized nt 25671 to 25673 (shaded in gray) are indicated. For comparison, HCoV-229E nt 55 to 78 encoding part of the leader RNA and downstream sequences (top left) and the HCoV-229E nt 25623 to 25684 containing the TRS-N and flanking regions are shown (bottom). Also highlighted are two stretches of 8 nt encoded downstream of the leader TRS (nt 70 to 77) and upstream of the TRS-N (nt 25668 to 25674) that might be involved in base pairing during discontinuous extension (see Discussion). Alternative leader-body fusion sites are underlined.
|
|
|
|---|
In addition to the main conclusion, our analysis of the replication of mutated HCoV-229E genomes led to a number of observations that may be relevant to the mechanism of coronavirus transcription. First, it is evident that when the authentic N gene TRS is rendered nonfunctional, the virus is able to use alternative body leader fusion sites. The analysis of alternative leader-body junctions revealed that a stretch of 8 nucleotides located upstream of the TRS-N element (25645AAACGAAA25653) could also be found immediately downstream the leader TRS (70AAACGAAA77), which suggests that base pairing might have taken place at or near to these sequences. The actual leader-body junction sites were determined at body RNA-derived nucleotides 25640AC25642, 25634AAC25636, or 25645AAAC25648 (Fig. 6B). Thus, the leader-body junction was either within a putative base-pairing region (as in natural TRS elements) or at adjacent upstream sequences containing AC or AAC nucleotides. In this context, it is interesting that a common feature of the order Nidovirales is an open reading frame 1b-encoded protein that has NendoU activity with a strong preference for cleavage at GU(U) sequences in double-stranded RNA substrates (18, 46). It has been suggested that GU(U) sequences at the 3' terminus of nascent minus-strand RNAs, which correspond to conserved AAC nucleotides in the core of the HCoV-229E TRS element, might be substrates of this activity; therefore, NendoU activity might be involved in the transcription of subgenomic mRNAs (18). Our data support the functional importance of the AAC sequence in HCoV-229E TRS elements, but further studies are required to provide a direct link to the activities of enzymes such as the uridylate-specific endoribonuclease.
The data we have presented provide substantial evidence for a functional role of a structural protein in coronavirus RNA synthesis. To our knowledge, this is the first example of a structural protein that is involved in the RNA synthesis of a nonsegmented positive-strand RNA virus. Among all positive-strand RNA viruses, there is only a single group of plant viruses, namely, alfamoviruses and ilarviruses from the Bromoviridae family, for which a similar phenomenon has been reported (5, 19, 56). These viruses contain a tripartite positive-strand RNA genome with RNA1- and RNA2-encoding proteins involved in viral RNA replication and RNA3 giving rise to a subgenomic mRNA4 encoding the coat protein. It has been shown that a mixture of three genomic RNAs of alfamoviruses and ilarviruses is not infectious to plants unless the coat protein or mRNA4 is added to the inoculum. This phenomenon has been termed genome activation and appears to take place prior to minus-strand RNA synthesis, most probably through binding of coat protein to conserved RNA structures at the 3' end of the genomic RNAs, resulting in an enhancement of translation (32, 33). In addition, the coat protein is also involved in plus-strand RNA synthesis ("asymmetric plus-strand RNA accumulation"); interestingly, the coat protein is in fact part of the alfamovirus and ilarvirus replication complexes. Although there are striking similarities to the data presented here, further studies are required to elucidate the mechanism(s) of N protein function in coronavirus replication.
In contrast to positive-strand RNA viruses, an involvement of structural proteins in virus replication and transcription has been described for a number of negative-stand RNA viruses, particularly viruses with nonsegmented genomes. Vesicular stomatitis virus (VSV; order Mononegavirales, family Rhabdoviridae) is one of the most advanced experimental systems, and a number of studies related to replication and transcription have been reported (3, 58). For example, it has recently been proposed by Banerjee and colleagues (35) that two distinct polymerase complexes carry out transcription and replication of VSV genomic RNA. The virus proteins L and P (for large protein catalytic subunit and the essential phosphoprotein cofactor, respectively) are part of the transcriptase complex while the L, P, and N proteins are part of the replicase complex. It has also been proposed that the VSV N protein may promote read-through at transcription signals and that VSV replication may require a significant amount of N protein for the encapsidation of the nascent strand. Unlike positive-strand RNA viruses, the template for replication in the case of VSV is a ribonucloprotein complex composed of genome-sized RNAs and the N protein (4). Thus, it appears that in many negative-strand RNA viruses the processes of replication and transcription are tightly regulated by a structural protein. Whether distinct enzyme complexes also distinguish coronavirus replication and transcription and whether the coronavirus N protein has a regulatory role to play in these complexes remain to be determined.
In summary, the present study complements and extends our current understanding of coronavirus replication and transcription. We have identified a structural protein as a factor that facilitates coronavirus genome replication; to the best of our knowledge, this is unprecedented among nonsegmented positive-strand RNA viruses. The functional importance of coronavirus N proteins in fundamental aspects of the coronavirus life cycle, namely encapsidation and replication of virus genomes, suggests that the N protein provides an attractive target for antiviral intervention aimed at combating coronavirus infections.
This study was supported by the Swiss National Science Foundation; the Gebert-Rüf Foundation, Switzerland; and the German Research Council (DFG).
Present address: Rudolf-Virchow Center for Experimental Biomedicine, 97078 Würzburg, Germany. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»