Previous Article | Next Article ![]()
Journal of Virology, December 2006, p. 11598-11609, Vol. 80, No. 23
0022-538X/06/$08.00+0 doi:10.1128/JVI.01758-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Institutes of Molecular Biology,1 Infectology,2 Diagnostic Virology, Friedrich-Loeffler-Institut, 17493 Greifswald-Insel Riems, Germany,5 Centre for Cancer Research and Cell Biology, School of Biomedical Sciences, The Queen's University of Belfast, 97 Lisburn Road, Belfast BT9 7BL, United Kingdom,3 Institute of Virology and Immunology, University of Würzburg, Versbacher Straße 7, 97078 Würzburg, Germany4
Received 14 August 2006/ Accepted 12 September 2006
|
|
|---|
|
|
|---|
Recently, significant progress was made in the functional and structural characterization of the nidovirus replication/transcription complex, with Severe acute respiratory syndrome coronavirus (SARS-CoV) being the most extensively studied nidovirus to date (1, 2, 4, 19, 30, 31, 34, 42, 43, 48, 57, 58, 68-70). It is now generally accepted that the enzymology involved in nidovirus replication is significantly more complex than that of other plus-strand RNA viruses. Most probably, these additional enzymes are required to replicate the exceptionally large genomes of nidoviruses and synthesize the nested set of sg RNAs from which the structural and, in some cases, several accessory proteins are expressed (37, 47). Previous studies identified conserved proteins and mechanisms mediating the replication cycle of viruses from the various nidovirus genera, but they also revealed interesting differences. Thus, for example, most, but not all, nidovirus sg RNAs contain a 5' leader sequence derived from the 5' end of the genomic RNA (13, 17, 56, 65). Furthermore, the number of replicase gene-encoded enzymes varies between different nidovirus genera or even between different groups of the same genus (23, 54), and also, the functional and structural properties of several key replicative proteins, including the main proteases of the various genera, are remarkably diverse (1, 3, 53, 72). Taken together, the studies have made it clear that more information is needed to understand the biological meaning of the differential conservation of specific proteins and/or enzymatic activities among nidoviruses (and other plus-strand RNA virus families). In this context, additional sequence information, particularly from nidoviruses prototyping previously unknown and distantly related genera, can be expected to provide new insights into the evolution of the Nidovirales and, potentially, even provide clues for a better understanding of the mechanisms and driving forces that have governed the separation of the Nidovirales from the bulk of plus-strand RNA viruses featuring smaller genomes and less-complex genome replication and expression strategies.
Here, we report the complete genome sequence of White bream virus (WBV), a novel plus-strand RNA virus isolated from fish (Blicca bjoerkna L.) (27). The sequence, organization, and expression of the 26.6-kb genome qualify WBV as a new member of the order Nidovirales. In common with other nidoviruses, the WBV genome contains two large ORFs (1a and 1b) in the 5' region of the genome, with ORF1b probably being expressed by 1 ribosomal frameshifting, occurring at a putative UUUAAAC heptanucleotide "slippery" sequence located upstream of the ORF1a stop codon and a putative RNA pseudoknot structure. WBV is predicted to encode three structural proteins, spike (S) protein, membrane (M) protein, and nucleocapsid (N) protein, which are expressed from 5' leader-containing sg RNAs as shown by Northern blotting and sequence analysis. Upstream of each of the S, M, and N ORFs, a conserved nonanucleotide sequence, CA(G/A)CACUAC, was identified which is also present near the 5' end of the genome. By analogy with other nidoviruses, we predict this sequence to be the conserved core of the transcription-regulating sequence (TRS) elements that facilitate the template switch required to attach the complement of the 5' leader sequence to the 3' ends of subgenome-length minus strands, thereby producing the templates for the synthesis of 5' leader-containing plus-strand RNAs. The phylogenetic relationship between WBV and nidoviruses is further corroborated by the presence of a conserved array of putative functional domains in pp1a, including an ADRP domain, three TMs, and the presumed viral main protease, 3CLpro, whose activity was established in this study. Furthermore, an RNA-dependent RNA polymerase domain featuring the nidovirus-specific SDD signature, as well as ZBD, helicase, exoribonuclease, NendoU, and ribose-2'-O-methyltransferase domains, were identified in the C-terminal part of pp1ab. Sequence comparisons and phylogenetic studies lead us to conclude that toroviruses (followed by coronaviruses) are the closest relatives of WBV. In terms of classification, we think the phylogenetic position of WBV would be best reflected if the virus was assigned to a yet-to-be-established new genus.
|
|
|---|
Virus purification and RNA isolation. WBV was harvested by two cycles of freezing and thawing of infected cells. Cell debris was removed by centrifugation at 4,000 rpm (SW28 rotor; Beckman) for 10 min, and the supernatant was layered onto a 15% sucrose cushion and centrifuged at 20,000 rpm for 90 min. The resulting virus pellet was resuspended in STE buffer (0.15 M NaCl, 0.01 M Tris-HCl, pH 8.3, 0.01 M EDTA, pH 8.0) and applied on a continuous gradient of 5 to 50% sucrose in STE. After centrifugation at 20,000 rpm (SW28 rotor; Beckman) for 30 min, the virus-containing band was isolated and dialyzed against STE. The virus was concentrated by sedimentation at 40,000 rpm (SW60; Beckman) for 4 h, and viral RNA was extracted using guanidinium thiocyanate, followed by centrifugation through a 5.7 M cesium chloride-EDTA cushion according to standard protocols (44).
RNA transfection. EPC cells (5 x 105) were transfected with purified WBV genome RNA. To do this, 3 µg RNA was diluted in 50 µl diethyl pyrocarbonate-treated water containing 60 U RNasin (Promega) and mixed with 200 µl of serum-free OptiMEM (Invitrogen) containing 12.5 µl of Lipofectamine transfection reagent (Invitrogen). After incubation on ice for 5 min, the transfection mix was layered onto nearly confluent EPC cells maintained in OptiMEM. After incubation for 2 hours at 20°C, the medium was replaced by Eagle's minimal essential medium supplemented with 10% fetal bovine serum. Transfected cells were incubated for 6 days at 20°C in a 2.5% CO2 atmosphere.
Electron microscopy. For negative staining, infected cell cultures were scraped off from the cell culture dish, pelleted by low-speed centrifugation, and resuspended in phosphate-buffered saline. Formvar-coated grids were placed for 7 min on drops of cell culture supernatant or resuspended pellet. Negative staining was performed with 2% phosphotungstic acid (pH 7.4) for 7 min. Stained grids were examined with a Philips electron microscope 400T (Eindhoven, The Netherlands).
Cloning and sequence analysis of the WBV genome. Purified WBV genome RNA was used as a template for the construction of cDNA libraries. Viral RNA (0.5 to 1 µg) was incubated with 0.5 µg of oligo(dT)12-18 primers or 0.037 µg of random pd(N)6 hexadeoxynucleotides (TimeSaver cDNA synthesis kit; Amersham Biosciences). In the course of the study, additional cDNA libraries were generated by "primer walking," using WBV-specific oligonucleotides (DF14, DF20, DF16, DF29, and DF5) and previously described methods (49; data not shown). To further corroborate the WBV genome sequence analysis, a series of reverse transcription (RT)-PCRs were performed to amplify overlapping fragments covering the entire genome RNA (see Fig. 2). The amplicons obtained were cloned using pGEM-T Easy vector system II (Promega) according to the manufacturer's protocols. RT-PCR 17 (see Fig. 2) was performed using the one-step RT-PCR (QIAGEN) and SuperScript III one-step RT-PCR (Invitrogen) systems. For RT-PCR 28, poly(A)-containing RNA from WBV-infected EPC cells was isolated using previously described methods (60). The poly(A) RNA was reverse transcribed using Transcriptor reverse transcriptase (Roche) and oligonucleotide JZ562 (55°C, 60 min). PCR amplification was done using the TripleMaster PCR system (Eppendorf) and oligonucleotides JZ561 and JZ564. The WBV 5' and 3' genome termini were determined by rapid amplification of cDNA ends (RACE) techniques as described by Hoffmann et al. (29). The sequences of the oligonucleotide primers used for cDNA synthesis and RT-PCR are available upon request. Plasmid DNAs containing cDNA and RT-PCR inserts (see Fig. 2) were sequenced using standard procedures. At least three independently derived clones were analyzed on both strands to ascertain the obtained sequence. The product from RT-PCR 28 was sequenced directly. To determine the 5' and 3' genome ends, at least six clones from independent RACE reactions were sequenced. Analysis of sequencing data was done using Lasergene biocomputing software (DNAStar). Multiple-sequence alignments were generated using ClustalW v1.83 (http://www.ebi.ac.uk/clustalw/) (63) and ClustalX v1.8 (62) and used as input for the ESPript (version 2.2) program (http://espript.ibcp.fr/ESPript/cgi-bin/ESPript.cgi). Transmembrane domains were predicted using the hidden Markov model-based program TMHMM v.2.0 (http://www.cbs.dtu.dk/services/TMHMM-2.0/) (32), and signal peptides were identified using SignalP v3.0 (http://www.cbs.dtu.dk/services/SignalP/) (5). Phylogenetic trees were generated from multiple-sequence alignments by the neighbor-joining method in ClustalX v1.8 or MEGA 3.1 (http://www.megasoftware.net), and bootstrap values were determined by 1,000 replicates. Phylogenetic trees were plotted with TreeView v1.6.6 (36) and manually edited.
![]() View larger version (13K): [in a new window] |
FIG. 2. Structural organization and sequence analysis of the WBV (strain DF24/00) genome. (A) Given are the sizes and positions of cDNA clones from WBV genomic libraries that were used to determine the WBV genome sequence. Also shown are the 5'- and 3'-terminal amplicons generated by RACE. (B) Given are the sizes and positions of RT-PCR products used to ascertain the sequence derived from the cDNA clones shown in panel A. (C) Predicted functional ORFs in the WBV genome. Numbers indicate the 5' and 3' nucleotides, respectively, of predicted translation start and stop codons. Note that translation of ORF1b is predicted to involve a 1 ribosomal frameshift occurring just upstream of the ORF1a translation stop codon (and downstream of the most 5'-terminal AUG codon that is used here to indicate the ORF1b 5' end) (for further details, see the text and Table 1). (D) WBV-specific RNAs as determined in this study. The little black box at the 5' end of the genome indicates the 42-nt leader sequence, which is also present at the 5' ends of the three subgenome-length RNAs (Fig. 3 and 4). The available evidence from other nidoviruses (37, 47) suggests that attachment of the leader sequence to the coding (body) sequences of WBV subgenome-length RNAs is due to discontinuous extension of subgenome-length minus-strand RNAs. In this process, nascent minus strands switch their template at TRSs located upstream of the S, M, and N genes and bind then to an identical sequence called leader TRS near the 5' end of the genome, after which the leader sequence is copied to complete negative-strand synthesis (see the text and Fig. 4).
|
-32P-multiprime-labeled DNA probe specific for WBV nts 25992 to 26582. RNAs were analyzed by autoradiography. Poly(A)-containing RNAs from human coronavirus 229E (HCoV-229E)-infected cells (60) and Rep-1 RNA (28) were used as RNA size markers in this Northern blotting experiment. pRep-1 RNA (24.4 kb) was produced by in vitro transcription as described previously (28). HCoV-229E-specific RNAs and pRep-1 RNA were visualized by including in the hybridization buffer a second
-32P-multiprime-labeled DNA probe specific for the HCoV-229E nts 26857 to 27277 (34). To determine the leader-to-body fusion sites of WBV subgenomic RNAs, reverse transcription of poly(A)-containing RNA from WBV-infected cells was primed using oligonucleotides JZ590_RT-N (5'-26628GGTAATTTTATATCATACATGGATAGTCAT26599-3'), JZ618_RT-M (5'-25419CCGACCATTGGTAATCTACCAACG25396-3'), and JZ620_RT-S (5'-21940CAGTATTGTTCGGCTGTTGTTAAC21917-3'). PCR amplification was done using the WBV 5'-end (leader)-specific oligonucleotide JZ598 (5'-1GAGATTATTACGAATTTCGTTTTAACACAC30-3') in combination with body-specific oligonucleotides JZ593-N (5'-26582AGAAACGACATACACGCGAAATTG26559-3'), JZ619-M (5'-25390AATGTAGTAGACCAGAGTATTCAC25357-3'), and JZ621-S (5'-21790TTGTCATAGTTGAGTGTGTACGAC21767-3'). PCR products were sequenced directly using primers JZ593-N (amplicon JZ598/JZ593-N), JZ619-M (amplicon JZ598/JZ619-M), and JZ621-S (amplicon JZ598/JZ621-S). In addition, the PCR products were cloned into the unique EcoRV restriction site of pBluescript II KS(+) plasmid DNA (Stratagene). Leader-body junctions of the three subgenomic RNAs were then further analyzed by sequencing the relevant regions of individual plasmid clones, using T3 and T7 promoter-specific oligonucleotides. Cloning, expression, and activity of the WBV 3C-like protease domain. The predicted coding sequence of WBV 3CLpro, together with short flanking sequences, was amplified by RT-PCR using poly(A)-containing RNA isolated from WBV-infected EPC cells as a template and oligonucleotides JZ559 (5'-TCAGCATCAGAATGCATTCTGTAT-3') and JZ560 (5'-AAAGAATTCTTATTGCATGTTGTTAACTGGTGATGTG-3'). The 961-bp PCR product was treated with T4 DNA polymerase, phosphorylated with T4 polynucleotide kinase, cleaved with EcoRI, and inserted into XmnI/EcoRI-digested pMal-c2 plasmid DNA (New England Biolabs). The resulting plasmid, pMal-WBV-3CL_559-560, encoded the WBV pp1a/pp1ab residues Ser3424 to Gln3726 fused to the Escherichia coli maltose-binding protein (MBP). As a negative control, a mutant derivative, pMal-WBV-3CL_S3589A, was generated by site-directed mutagenesis of pMal-WBV-3CL_559-560, using PCR-based methods (73). E. coli TB1 cells transformed with pMal-WBV_559-560 and pMal-WBV-3CL_S3589A were grown at 37°C in Luria-Bertani (LB) medium containing 100 µg of ampicillin per ml until they reached a culture density (A595) of 0.6. Expression of the recombinant proteins was induced by the addition of 1 mM isopropyl-ß-D-thiogalactopyranoside (IPTG) for 3 h at 24°C. For analysis of recombinant protein expression, aliquots of the cell cultures were suspended in 2x Laemmli sample buffer and incubated at 94°C for 3 min, and the lysates were analyzed by electrophoresis in sodium dodecyl sulfate-polyacrylamide gels as described previously (72).
|
|
|---|
![]() View larger version (220K): [in a new window] |
FIG. 1. Purified WBV genome RNA is infectious: evidence for viral particle formation and release. The cell culture supernatant from EPC cells transfected with purified WBV genome RNA was analyzed at 6 days posttransfection by electron microscopy (negative staining). (C and D). For comparison, electron micrographs taken from purified WBV virions (27) are shown in panels A and B. Representative pictures of both intact (A and C) and partially opened (B and D) virions are shown. Bar, 150 nm.
|
|
View this table: [in a new window] |
TABLE 1. Predicted proteins expressed from WBV genomic and subgenomic RNAsa
|
Computer-aided sequence analysis further suggested that the WBV ORF3 gene product is a 227-residue, triple-membrane-spanning glycoprotein. Both the size of this protein and the predicted topology and length of the transmembrane regions (data not shown) link this protein to the triple-membrane-spanning M proteins of corona- and toroviruses (14, 52), even though at the primary structure level, the relationship is weak and the generation of reliable sequence alignments proved to be impossible. However, this distant relationship is not really surprising as the coronavirus and torovirus S and M proteins are only poorly conserved, with similarities being restricted mainly to general features, such as the numbers and positions of transmembrane domains, protease cleavage sites, and the general domain organization of these proteins (52).
Based on its 3'-terminal position, which in corona- and toroviruses is generally occupied by the N gene, and the quite reliable functional assignments for all of the other WBV gene products, it seemed reasonable to suggest that ORF4 specifies the WBV N protein (161 residues). The size of this protein corresponds well to that of the torovirus (but not coronavirus) N proteins, and also, some of the most conserved sequence signatures of torovirus N proteins appear to be partly conserved in the putative WBV N protein (data not shown). However, to unambiguously establish a structural relationship between the WBV and torovirus N proteins, further evidence has to be obtained. In support of this potential relationship, we note that the intracellular nucleocapsids formed by WBV and toroviruses share a rod-like structure (55) and also that the straight or slightly bent (kidney-like) rod-like morphology described for some (but not all) torovirus particles resembles the rod-like structure of WBV particles remarkably well (Fig. 1) (27). Taken together, the analysis suggests that WBV encodes three structural proteins, S, M, and N. The virus does not encode a homolog of the coronavirus E protein or a hemagglutin in esterase protein, the latter being conserved in toroviruses and several coronaviruses (52).
WBV-specific RNAs.
All previously characterized nidoviruses produce a 3'-coterminal nested set of sg RNAs to express their structural and, in some cases, several accessory proteins. Therefore, the identification of three putative structural protein genes in the 3'-terminal region of the genome led us to predict that WBV produces sg RNAs to express these downstream ORFs. To confirm this hypothesis, we isolated poly(A)-containing RNA from WBV-infected cells and used a 3' terminus-specific probe to detect WBV-specific genome-length and subgenome-length RNAs. The data we obtained in Northern blotting experiments (Fig. 3 and data not shown) demonstrate that WBV produces four RNAs (genome RNA and three sg RNAs) to express its genome. To determine the approximate sizes of the WBV-specific RNAs, we used two RNA markers, namely, (i) HCoV-229E genomic and sg RNAs and (ii) a 24.4-kb HCoV-229E-derived replicon RNA called Rep-1 (28). These RNA markers were detected by including an HCoV-229E-specific probe in the hybridization buffer. The observed sizes of the three sg RNAs strongly support the idea that RNAs 2, 3, and 4 are used to express the viral S, M, and N proteins, respectively. The size of RNA 1 was confirmed to be between 27.3 kb (HCoV-229E genome) and 24.4 kb (Rep-1 RNA), providing additional evidence for a WBV genome size of
26.6 kilobases which we had determined by genome sequence analysis.
![]() View larger version (23K): [in a new window] |
FIG. 3. Detection of WBV genome- and subgenome-length RNAs in virus-infected cells. Northern blot analysis of poly(A)-containing RNA isolated from WBV-infected EPC cells (lane 2). Poly(A) RNAs isolated from HCoV-229E-infected MRC-5 cells (lane 1) and HCoV-229E-derived replicon RNA Rep-1 (lane 3) (28) were used as RNA size markers in this experiment. To detect both the HCoV-229E- and WBV-specific RNAs, a mixture of -32P-multiprime-labeled probes specific for the 3'-terminal regions of HCoV-229E (nucleotides 26857 to 27277) and WBV (nucleotides 25992 to 26582) was used for hybridization. HCoV-229E genome- and subgenome-length RNAs and the in vitro-transcribed HCoV-229E Rep-1 RNA are indicated by black arrowheads, with sizes given in kilobases. White arrowheads indicate the four WBV-specific RNAs detected in this experiment. The longer exposure presented above shows the size of the WBV genomic RNA more clearly and allows its size to be compared with those of the 27.3- and 24.4-kb marker RNAs. The calculated sizes (Table 1 and Fig. 4) of the sg RNAs are 5,162 nts (RNA 2), 1,475 nts (RNA 3), and 774 nts (RNA 4) [including the 5' leader but excluding the poly(A) tail].
|
![]() View larger version (28K): [in a new window] |
FIG. 4. Leader-body junctions in WBV subgenomic RNAs. Shown are the junction sites between a short sequence, called leader, that is derived from the 5' end of the genome, and the coding (so-called body) sequences of subgenomic RNAs 2 to 4. The gray box highlights a sequence, CA(G/A)CACUAC, and its negative-strand complement that we predict to act as a core TRS element in WBV. As in corona- and arteriviruses (37, 47), this leader sequence is conserved near the 5' end of the genome (nts 44 to 52) and upstream of the translation start codon of each of the downstream ORFs specifying the viral structural proteins, S, M, and N. For each of the core TRS elements, the flanking sequences in the WBV genome RNA and the corresponding minus-strand sequence are given. Possible base-pairing interactions between the minus strand and the proposed leader TRS are indicated, and the leader-body (L-B) junction in the respective mRNA, as determined by RT-PCR and sequencing, is given below. Sequences derived from the 5' end of the genome (leader sequence) are boldfaced and underlined. Translation start codons of the S (RNA 2), M (RNA 3), and N (RNA 4) genes are boldfaced and italicized. Please note that, with respect to the minus-strand sequence, the actual fusion appears to occur slightly downstream of the fully complementary sequence rather than within this particular sequence.
|
Taken together, these data confirm and extend the previously established models for coronavirus and arterivirus sg RNA transcription (37, 47). At the same time, they make it clear that there is no simple relationship between the number of sg RNAs produced by specific groups of nidoviruses and the presence of a 5' leader.
Identification of a putative ribosomal frameshifting element at the ORF1a/1b junction. We identified a putative slippery sequence, 14549UUUAAAC14555, just upstream of the WBV ORF1a translation stop codon, and the sequence downstream of the slippery sequence could be modeled into an RNA pseudoknot structure (Fig. 5). Over the past years, coronavirus RNA pseudoknot structures have been studied extensively, both structurally and functionally, and their critical role in mediating a shift into the 1 reading frame during translation has been firmly established (8, 35). Although our study does not provide formal evidence for that, it seems reasonable to predict that, as in other nidoviruses, WBV ORF1b expression (generating the viral RdRp and other key replicative proteins) is regulated at the translational level by ribosomal frameshifting.
![]() View larger version (16K): [in a new window] |
FIG. 5. Model of the WBV ribosomal frameshifting element. By analogy with other nidoviruses, the element is proposed to consist of a putative RNA pseudoknot structure (comprised of two stems and two loops) and a slippery sequence (14549UUUAAAC15555) at which the actual frameshift is predicted to occur. The sequences boxed in gray indicate the predicted slippery sequence and the ORF1a translation termination codon.
|
![]() View larger version (21K): [in a new window] |
FIG. 6. Proteolytic activity of WBV pp1a/pp1ab amino acid residues Ser3424 toGln3726. Total cell lysates from E. coli TB1 cells transformed with pMal-WBV-3CL_559-560 (WT) (lanes 1 and 2) and pMal-WBV-3CL_S3589A (S3589A) (lanes 3 and 4) were separated by sodium dodecyl sulfate-polyacrylamide gel electrophoresis in a 12.5% polyacrylamide gel and stained with Coomassie brilliant blue R-250. The bacteria were mock induced (lanes 1 and 3) or induced with 1 mM IPTG for 3 h (lanes 2 and 4). The positions of the fusion protein and cleavage product are indicated by arrowheads. The molecular masses of marker proteins are given in kDa to the left.
|
![]() View larger version (26K): [in a new window] |
FIG. 7. Domain organization of nidovirus replicase polyproteins. (A) Comparison of the WBV pp1ab domain organization with those of representative viruses from other nidovirus genera: HCoV-229E (genus Coronavirus and family Coronaviridae), bovine torovirus (BToV; genus Torovirus and family Coronaviridae), and gill-associated virus (GAV; genus Okavirus and family Roniviridae). The polyproteins are processed by viral proteases that are part of the polyprotein. The coronavirus pp1ab proteolytic processing has been characterized in considerable detail (71, 75) and is illustrated here for HCoV-229E. To produce a total of 16 nonstructural proteins, three cleavages are carried out by papain-like proteases (PL) in the N-proximal region of the polyprotein (indicated by white arrowheads), and 11 cleavages are carried out by the 3C-like protease (3CL) in the central and C-terminal parts of the polyprotein (indicated by black arrowheads). For the genera Torovirus and Okavirus, only limited information on proteases and their cleavage sites is available (53, 72). The putative proteases of BToV have not been characterized, and only a few 3C-like protease cleavage sites (not shown) have been identified for GAV (72). Proteases and other conserved enzymatic activities are indicated by black boxes. A, ADP-ribose 1"-phosphatase (ADRP) related to cellular macro domain proteins (20, 40); Z, zinc-binding domain (51); HEL, helicase domain (50); ExoN, 3'-to-5' exoribonuclease (34); MT, putative ribose-2'-O-methyltransferase domain (21, 54); C, putative cyclic nucleotide phosphodiesterase (54); RFS, ribosomal frameshift site. Regions with predicted transmembrane domains (see Materials and Methods) are indicated by gray boxes. Note that the expression of the C-terminal part of pp1ab requires a ribosomal frameshift into ORF1b, which is predicted to occur just upstream of the ORF1a translation stop codon (Fig. 5). The sizes and positions of the polyproteins and functional domains are not precisely drawn to scale. (B) Partial sequence alignment of ADRP domains from SARS-CoV and HCoV-229E, whose activities have been characterized previously (20, 40, 41, 43), and the predicted ADRP domains from BToV (18) and WBV (this study). The alignment was generated using the ClustalX program (version 1.8). The secondary structure information was derived from the published SARS-CoV ADRP crystal structure (Protein Data Bank no. 2ACF) (43) and, together with the alignment, used as input for the ESPript program, version 2.2 (http://prodes.toulouse.inra.fr/ESPript/cgi-bin/ESPript.cgi). Sequences of the proteins were derived from the DDBJ/EMBL/GenBank database accession numbers NC_002645 (HCoV-229E pp1a/pp1ab residues Phe1299 to Lys1398), AY291315 (SARS-CoV pp1a/pp1ab residues Val1034 to Lsy1135), AY427798 (BToV [strain Breda-1] pp1a/pp1ab residues Y1668 to Ser1775), and DQ898157 (WBV pp1a/pp1ab residues Phe1667 to Lys1796). Black boxes, identical residues; white boxes, similar residues.
|
![]() View larger version (20K): [in a new window] |
FIG. 8. Phylogenetic analysis of WBV helicase and polymerase core domains. Phylogenetic trees were generated from multiple-sequence alignments of the most conserved regions of nidovirus RNA-dependent RNA polymerase (residues Thr4723 to Gln5396 in the WBV pp1ab sequence) and helicase domains (residues Ala5644 to Cys5924 in the WBV pp1ab sequence), using the neighbor-joining algorithm as implemented in the ClustalX 1.8 program (for details, see Materials and Methods). The WBV sequences were compared with those from Gill-associated virus (GAV, accession no. AF227196), Equine torovirus Berne (EToV, X52374), and Bovine torovirus Breda-1 (BToV, AY427798) as well as from viruses representing the three coronavirus groups, including the recently introduced subgroups 1a, 1b, 2a, and 2b (26). Group 1a, Transmissible gastroenteritis virus Purdue-115 (TGEV, accession no. Z34093) and Feline infectious peritonitis virus WSU 79/1146 (FIPV, DQ010921); group 1b, HCoV-229E (NC_002645), Human coronavirus NL63 Amsterdam I (HCoV-NL63, AY567487), and Porcine epidemic diarrhea virus CV777 (PEDV, AF353511); group 2a, Bovine coronavirus LUN (BCoV, AF391542), Human coronavirus OC43 serotype Paris (HCoV-OC43, AY585229), Mouse hepatitis virus A59 (MHV, NC_001846), and Human coronavirus HKU1 (HCoV-HKU1, NC_006577); group 2b, Severe acute respiratory syndrome coronavirus Frankfurt 1 (SARS-CoV, AY291315); group 3, Avian infectious bronchitis virus Beaudette (IBV, NC_001451).
|
The identification of nidoviruses in a very large number of mammalian species (39, 52, 59, 67) as well as in invertebrates (12) and fish (this study) suggests that nidoviruses, which in several respects are distinct from the huge variety of plus-strand RNA viruses (24), have managed to adapt to a remarkable diversity of biological niches. The continued sampling and characterization of nidoviruses, of which this study is a part, are anticipated to increasingly fill the major gaps that still exist between the individual clusters of nidoviruses. Additional sequence information and functional studies will help to identify the major forces and constraints that shape the evolution of nidoviruses. This information will also be required to unravel the basis for the differential conservation of specific replicase gene-encoded proteins among the various nidovirus families, genera, and species (24, 54) and help to relate these proteins to specific metabolic pathways and molecular mechanisms. More sequences and phylogenetic studies are also needed to determine the position of the WBV cluster within the nidovirus tree more precisely.
The work of R.U., B.S., S.B., and J.Z. was supported in part by the Deutsche Forschungsgemeinschaft (Zi 618/3 and SFB 479-A8).
Published ahead of print on 20 September 2006. ![]()
|
|
|---|
5' exoribonuclease that is critically involved in coronavirus RNA synthesis. Proc. Natl. Acad. Sci. USA 103:5108-5113.This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»