| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Previous Article | Next Article ![]()
Journal of Virology, January 2008, p. 311-320, Vol. 82, No. 1
0022-538X/08/$08.00+0 doi:10.1128/JVI.01240-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.

Blood Systems Research Institute, San Francisco, California 94118,1 University of California, San Francisco, San Francisco, California,2 Centre for Infectious Diseases, University of Edinburgh, Summerhall, Edinburgh EH9 1QH, United Kingdom,3 Division of Infection Diseases, Stanford University, Stanford, California,4 BioReliance Corporation, 14920 Broschart Road, Rockville, Maryland 20850,5 Department of Fisheries and Oceans, Central and Arctic Region, 501 University Crescent, Winnipeg, Manitoba R3T 2N6, Canada6
Received 6 June 2007/ Accepted 6 October 2007
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Ringed seals (Phoca hispida), one of the most abundant marine mammals species in the Arctic, are hunted by Canadian Inuit. Fluctuations in the ringed seal population in the Beaufort Sea of the Northwest Territories have been previously documented (8, 37) and are thought to be associated mainly with climatic conditions during the breeding season. No information on the number of seals in the Beaufort Sea population is available for the years 2000 to 2002, although ice conditions were normal (Lois Harwood, personal communication).
A recent analysis of lungs, lymph nodes, and nasal swabs from ringed seals hunted in 2000 to 2002 from Ulukhaktok (formerly known as Holman) on the shore of the Beaufort Sea demonstrated the presence of a virus causing strong cytopathic effects (CPE) in Vero cells. The causative agent of CPE passed through a 0.45-µm filter was resistant to detergent inactivation and was therefore thought to be a nonenveloped virus. Here, we analyzed this virus using sequence-independent PCR amplification and sequence similarity searches. We report the full genome sequence of a novel picornavirus with a deep root on the Picornaviridae family phylogenetic tree. Consistent with the nomenclature for other picornaviruses, we suggest the name seal picornavirus 1 (SePV-1) for this new virus and propose that it represent the prototype of a new picornavirus genus.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Random amplification, subcloning, and sequencing. Viral RNA was mixed with 50 pmol of primer RA01 (GCCGGAGCTCTGCAGATATCNNNNNNNNNN), denatured at 75°C for 5 min, and chilled on ice. A reaction mix of 9 µl containing 4 µl of 5x first-strand buffer (Invitrogen), 1 µl of 100 mM dithiothreitol (DTT), 1 µl solution containing each deoxynucleoside triphosphate (dNTP) at 10 mM, 8 units of recombinant RNase inhibitor (Promega), and 200 units of SuperScript II reverse transcriptase (RT) (Invitrogen) was added. The reaction mixture was incubated at 25°C for 10 min and then at 42°C for 50 min. After a denaturation step at 94°C for 3 min and chilling on ice (to reanneal primer RA01 to cDNA), 2.5 units (0.5 µl) of 3'-5' Exo– Klenow DNA polymerase (New England Biolabs) was added to extend RA01 and incubated at 37°C for 1 h, followed by an enzyme inactivation step at 75°C for 10 min. A total of 7.5 µl of the RT-Klenow-treated product was then used as a template in a subsequent 50-µl PCR mixture consisting of 1x AmpliTaq Gold PCR buffer II (100 mM Tris·HCl [pH 8.3], 500 mM KCl) (Applied Biosystems), 3 mM MgCl2, each dNTP at 0.3 mM, 50 pmol of primer RA02 (GCCGGAGCTCTGCAGATATC), and 2.5 units of AmpliTaq Gold DNA polymerase LD (Applied Biosystems). An initial denaturation step for 5 min at 95°C was followed by 40 cycles of PCR (95°C for 1 min, 55°C for 1 min, and 72°C for 2 min). Random PCR products were then separated on a 1.5% agarose gel, and DNA smears ranging in size from approximately 400 to 1,500 bp were excised and extracted using the QIAquick gel extraction kit (Qiagen). Five microliters of the eluted, purified PCR product was ligated into the pGEMT-Easy vector (Promega Inc.) and introduced into chemically competent Escherichia coli TOP-10 cells (Topo One Shot; Invitrogen). Bacteria were plated onto LB agar plates containing ampicillin, X-gal (5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside) and IPTG (isopropyl-β-D-thiogalactopyranoside). Ninety-six white colony inserts were sequenced using the T-7 forward primer.
Sequence analysis. Sequence data for all clones were imported into Sequencer 4.1 (Genecode) and trimmed of vector and primer (RA02) sequences. The remaining sequences were then assembled into contigs using an assembly parameter of a minimum 90% base identity with at least a 30-nucleotide overlap. A sequence similarity search was performed using tBLASTx (http://www.ncbi.nlm.nih.gov/BLAST/).
SePV-1 genome sequencing. Sequence contigs of sequences were assembled using Sequencher software. Contigs showing significant tBLASTx hits to picornaviruses (E value of <0.001) were then linked using PCR. To acquire the 3' end of viral genome, 10 µl of extracted RNA was mixed with 10 pmol of primer DT-01 (ATTCTAGAGGCCGAGGCGGCCGACATGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTVN, where V is A/C/G and N is A/C/G/T), denatured at 75°C for 5 min, and chilled on ice. A reaction mix of 9 µl containing 4 µl of 5x first-strand buffer (250 mM Tris·HCl [pH 8.3], 375 mM KCl, 15 mM MgCl2) (Invitrogen), 2 µl of 100 mM DTT, a 1-µl solution containing each dNTP at 10 mM, 8 units (0.2 µl) of recombinant RNase inhibitor (Promega), and 200 units of SuperScript II reverse transcriptase (Invitrogen) was then added and incubated at 45°C for 30 min, followed by 75°C for 10 min. Two units of RNase H was added, and the reaction mixture was further incubated for 10 min at 37°C. PCR was performed using a virus-specific primer, RdRp-1 (CTGTGCCTGATTTCCCTGAATCT), and DT-02 (ATTCTAGAGGCCGAGGCGGCC). PCR consisted of an activation step of 5 min at 95°C followed by 35 cycles of amplification at 95°C for 1 min, 60°C for 30 s, and 72°C for 2 min. To acquire the 5' end of the SePV-1 genome, 10 µl of extracted RNA was mixed with 10 pmol of virus specific-primer VP-R-1 (AGCCATACCCCCTTGGTCTT), denatured at 75°C for 5 min, and chilled on ice. An RT reaction mix similar to that used for 3' rapid amplification of cDNA ends was added, and the reaction mixture was incubated at 52°C for 30 min, followed by 75°C for 10 min. Two units of RNase H was then added, and the reaction mixture was further incubated for 10 min at 37°C. cDNA was purified using a Qiagen PCR purification kit, and a poly(C) tail was added using terminal deoxynucleotide transferase and dCTP. PCR was performed using the SePV-1-specific primers VP-R-2 (GCGACACGCACACAACTACA) and PPC01 (GGCCACGCGTCGACTAGTACGGGIIGGGIGGGGIGG, where I is deoxyinosine). PCR cycles consisted of an enzyme activation step for 5 min at 95°C followed by 35 cycles of amplification 95°C for 1 min, 60°C for 30 s, and 72°C for 2 min. PCR products were then directly sequenced using the virus-specific primers to acquire viral sequences of both extremities.
Detection of SePV-1 by RT-PCR. One hundred forty microliters of the cell culture supernatants was extracted using a viral RNA extraction kit (Qiagen) according to the manufacturer's instructions. The RNA was eluted with 50 µl of RNase-free distilled water containing 40 U of RNase inhibitor (RNasin; Promega). Fifty picomoles of a random octamer oligonucleotide was added to 10 µl of RNA, denatured at 75°C for 5 min, and then chilled on ice. A reaction mix of 9 µl containing 4 µl of 5x first-strand buffer, 2 µl of 100 mM DTT, a 1-µl solution containing each dNTP at 10 mM, 8 units (0.2 µl) of recombinant RNase inhibitor (Promega), and 200 units of SuperScript II reverse transcriptase (Invitrogen) was then added and incubated at 45°C for 30 min, followed by 75°C for 10 min. PCR was performed using primers VP1F1 (TGATGATGTGTTGGGAAGACTCCA) and VP1R1 (TACCCAGGAAACAAATTTGGCAAT), targeting positions 1844 to 3341 of the SePV-1 genome. PCR consisted of an AmpliTaq Gold (Applied Biosystems) activation step for 5 min at 95°C followed by 35 cycles of amplification at 95°C for 1 min, 55°C for 30 s, and 72°C for 2 min. The PCR products including the entire VP1, 2A, and 2B sequences were separated in 1% agarose gels and purified with a QIAquick gel extraction kit (Qiagen). Nucleotide sequences that were 1,428 nucleotides long were determined by using a 373A DNA autosequencer (PE-Applied Biosystems) using primers VP1R1 and VP1F1.
Phylogenetic analysis. Translated sequences from the coding region of SePV-1 and other picornaviruses were aligned using ClustalW with default settings (10). Amino acid sequence divergence of SePV-1 from other picornaviruses was determined for the 3D (pol) region and by sliding-window analysis of the nonstructural region using the program Sequence Distance within the Simmonic2005 v1.6 sequence editor (34).
Phylogenetic analysis on aligned regions within the NS region was carried out by neighbor joining of translated amino acid distances implemented in the program MEGA2 (24). Trees were rooted by their longest branch in the absence of a naturally occurring outgroup for the picornavirus sequences; the most similar positive-stranded RNA viruses, such as members of Comoviridae and Sequiviridae, were too divergent for defensible alignments in the regions analyzed. Confidence in phylogenetic groupings was obtained by generating consensus trees derived from 100 sets of bootstrap-resampled sequence data.
RNA structure determination. Prediction of RNA secondary structure in the 5'-untranslated region (5'UTR) and 3'UTR was carried out using a standard minimum free-energy method (MFOLD) (46) and PFOLD (21). For estimation of mean folding energies (MFEs) of fragments of the SePV-1 genome and control sequences, the program ZIPFOLD was used with default settings (46). Consecutive fragments of 498 bases overlapping by 249 bases were generated from aligned complete genome sequence data sets of each of the four virus groups (sequences listed below). MFEs were determined by MFOLD (46), and the sequence order-dependent component to folding was determined by comparison with sequence order-scrambled copies of each native sequence. The program NDR (34) was used for sequence order randomization to retain any biases in dinucleotide frequencies that might exist among native sequences. MFE results were expressed either as MFE differences (MFEDs), i.e., the percentage difference between the MFE of the native sequence from that of the mean value of the 50 sequence-order-randomized controls, or as a Z score, which is the position of the MFE of the native sequence within the distribution of MFEs of the randomized sequences expressed as the number of standard deviations from the mean value of the randomized sequences; thus, values between –2 and +2 fall within the range of 95% of their MFE value (43).
Picornavirus sequences. For determination of sequence divergences between SePV-1 and those of other picornaviruses, two or more representative serotypes from each species within each of the nine currently classified genera of picornaviruses were aligned. Sequences used for the comparison comprised the following: for the genus Aphthovirus, GenBank accession numbers NC_011450, FMDVALF, FDI320488, NC_004004, NC_002554, FMV7572, NC_004915, AY687334, NC_003992, FDI251473, AY593843, NC_011451, AY593853, and NC_011452 for foot-and-mouth disease virus (FMDV) and accession numbers NC_003982 and ERVPOLY for equine rhinitis A virus; for the genus Enterovirus, accession numbers NC_003988 for simian enterovirus, NC_002058, HPO132960, POL2LAN, POL544513, POL3L37, HPO293918, NC_001428, and CXA24CG for human enterovirus species C, NC_002347, NC_001360, NC_000881, NC_001657, NC_002601, NC_001342, NC_001656, and NC_001472 for species B, NC_001612 and ETU22522 for species A, NC_001430 and AY426531 for species D, BEVVG527 and AY508697 for bovine enterovirus, and NC_001617 and NC_001490 for human rhinovirus; for the genus Hepatovirus, accession numbers NC_003990, AJ225173, AY517471, and AY275539 for avian encephalomyocarditis virus, SHVAGM27 for simian hepatitis A virus (HAV), and HAVRNAGBM and NC_001489 for human HAV; for the genus Cardiovirus, accession numbers NC_001479 and XXEVCG for encephalomyocarditis virus and NC_001366 and AB090161 for Theiler's virus; for the genus Erbovirus, accession numbers NC_003077 and NC_003983; for the genus Parechovirus, accession numbers AF538689, AF327921, and AF327920 for Ljungan virus (LV) and L02971, NC_001897, AF055846, NC_008286, AB252582, NC_003976, AF538689, AF327922, AF327921, EF051629, DQ315670, AM235749, and AB084913 for human parechoviruses (HPeV); for the genus Kobuvirus, accession number NC_004421 for bovine kobuvirus and NC_001918 and DQ028632 for Aichi virus; for the genus Teschovirus, accession numbers PEN011380 and AF296117; for currently unclassified viruses, accession numbers EF382778, DQ249301, DQ249300, DQ249299, and EF093502 for duck hepatitis virus (DHV) and DQ641257 for Seneca Valley virus (SVV); and for members of the proposed "Sapelovirus" genus, accession numbers NC_003987 for porcine enterovirus A, AY064708 for simian picornavirus 1, and NC_006553 for duck picornavirus.
For RNA structure comparisons, the following complete genome sequences were used: for the genus Aphthovirus, GenBank accession numbers AF154271, FMDVALF, NC_002554, NC_003982, NC_003992, NC_004004, and NC_002527 for FMDV; for the genus Cardiovirus, accession numbers NC_001479 and MNGPOLY for encephalomyocarditis virus and NC_001366 for Theiler's virus; for the genus Enterovirus, accession numbers NC_001428, NC_001430, NC_001472, NC_001490, NC_001612, NC_001617, NC_001752, NC_001859, NC_002058, NC_003986, NC_003988, POL3L37, and SVDMPS; for the genus Erbovirus, accession number NC_003983; for the genus Hepatovirus, accession numbers NC_003990 for avian encephalomyocarditis virus, SHVAGM27 for simian HAV, and NC_001489 for human HAV; for the genus Kobuvirus, accession numbers NC_004421 for bovine kobuvirus and NC_001918 for Aichi virus; and for the genus Teschovirus, accession numbers AB038528, AF231769, AF296087, AF296091, AF296093, AF296115, AF296119, and NC_003985.
Nucleotide sequence accession number. The annotated genome sequence reported here has been deposited in GenBank under accession number EU142040.
| RESULTS |
|---|
|
|
|---|
The viral RNA-derived sequences revealed the presence of four contigs showing sequence similarities to picornavirus proteins with amino acid identities ranging from between 22 and 41%. Except for the extremities of the contigs, each base of the contigs was sequenced at least twice from different overlapping subclones. The three gaps between the four contigs were amplified from the randomly primed cDNA using primers located within contigs followed by direct PCR product sequencing. The 5' and 3' extremities of the viral genome were acquired using rapid amplification of cDNA ends (see Materials and Methods).
Relationship to other picornaviruses. The SePV-1 genome was 6,693 nucleotides long, excluding the poly(A) tail, and encoded a 2,027-amino-acid-long polyprotein. The genome showed a relatively low G+C content of 44% [excluding the poly(A) tail] compared to other picornaviruses. Its composition was most similar to those of HPeV and LVs in the Parechovirus genus (40% and 42%, respectively), DHV (44%), teschoviruses (44% to 45%), and equine rhinitis A virus equine rhinitis A virus (46 to 48%); greater than that of HAV (38%); and generally lower than those of other picornaviruses (e.g., FMDV [52 to 55%], cardioviruses [48 to 49%], enteroviruses [39 to 49%], and kobuviruses [55% to 60%]).
To investigate its sequence relationship with other picornaviruses, alignments of the SePV-1 polyprotein were made in the P1 and 2C-3D regions of the genome with representative members of each picornavirus genus (listed in Materials and Methods). Regions of amino acid sequence similarities were most apparent in the nonstructural region of the SePV-1 genome, such as the region surrounding highly conserved motifs in the active site of the RNA-dependent RNA polymerase. Alignment of the P1 region was more problematic, with several regions with no detectable homology between genera and a frequent necessity to introduce long gaps to align homologous sites. Although it is possible that both the 3D and P1 alignments could be optimized further, this would have only a small effect on sequence similarity values (Table 1 and Fig. 1A) or phylogenetic relations (Fig. 1B).
|
|
In the 2C-3D structural gene region, pronounced dips in amino acid divergence between SePV-1 and other genera corresponded to conserved motifs within 2C, the end of 3C, and four regions in 3D (Fig. 1A). SePV-1 showed no close sequence relatedness to any of the picornavirus genera or the currently unclassified SVV or DHV sequences, with the possible exception of marginally greater similarity to parechoviruses and DHV (6, 17, 38, 39). In the P1 region, SePV-1 showed a more apparent greater degree of sequence similarity with parechoviruses and DHV (Fig. 1A, black and gray lines, respectively) than other genera.
For analysis of sequence relationships between SePV-1 and different picornavirus genera in the P1 and nonstructural gene regions, the following regions were identified as being reliably alignable and showing unequivocal amino acid similarities. These comprised sequences between positions 912 and 1229 (P1 region A) (all genome positions are numbered according to the SePV-1 sequence), 1599 and 1995 (P1 region B), 3603 and 4044 (2C region C), 5001 and 5165 (3C region D), 5253 and 5423 (3D region E), 5670 and 5879 (3D region F), 5913 and 6167 (3D region G), and 6263 to 6521 (3D region H) (Fig. 1A). Phylogenetic trees constructed from picornavirus sequences from each region demonstrate consistent branching patterns with generally good differentiation of the nine currently classified genera, although the proposed "Sapelovirus" genus showed a substantially greater similarity to the Enterovirus genus than to other picornaviruses in both structural and nonstructural regions (Fig. 1B). An exception to the otherwise consistent branching patterns of the trees was SVV, which generally grouped within or as an outlier to the cardioviruses but fell within the Aphthovirus genus in region E. In this region, members of the proposed "Sapelovirus" genus also became interspersed with enteroviruses (Fig. 1B).
The phylogenetic relationship between SePV-1 and other picornavirus genera was similarly highly consistent where all regions except E grouped closest to the parechovirus genus and DHV (Fig. 1B). In regions A, B, F, G, and H, these sequences grouped in bootstrap-supported clades. Given the anomalous position of SVV in region E, the lack of grouping of SePV-1 with parechoviruses in this region may be the result of a lack of phylogenetic resolution in this relatively short fragment. Indeed, analysis of regions D and E combined restored the SePV-1/parechovirus/DHV grouping (data not shown). The relationships between SePV-1 with DHV and parechoviruses differed in different genomic regions. In the P1 region, the three virus groups formed a much more distinct, well-defined clade than in nonstructural regions of the genome (trees C to H), a difference also apparent from the similarity scan (Fig. 1A) (see above). Within the SePV-1/parechovirus/DHV grouping, DHV split earlier in the lineage in P1, while in the 2C-3D region, SePV-1 was consistently ancestral to the other viruses.
Despite the frequent bootstrap-supported grouping of SePV-1 with parechoviruses and DHV in the nonstructural 2C-3D region and the existence of identifiable homology elsewhere in the capsid-encoding region, the newly discovered virus was nevertheless highly divergent from each of these virus groups and indeed similar to sequence divergences that exist between other picornavirus genera, supporting the possibility that a new picornavirus genus was identified.
Polyprotein. A methionine codon starting at nucleotide position 507 was found in a standard Kozak context (RNNAUGG) and used to deduce the start of the polyprotein (22). Picornavirus structural and nonstructural proteins are typically generated by cleavage with virus-encoded proteinases. The hypothetical cleavage map of the SePV-1 polyprotein was derived from an alignment with other picornaviruses whose experimentally determined or hypothetical protease cleavage sites have been reported (Fig. 2). The presence of cleavage sites at interdomain junctions was sought based on the preference of picornaviruses for Q and E at the P1 position and a small amino acid residue (e.g., G, S, R, M, A, and N) at the P1' position (2, 14). Prediction of the cleavage sites in Fig. 2 were therefore determined by scanning for pairs of amino acids fitting this patterns around the interdomain regions inferred based on the amino acid alignment. The predicted cleavage sites result in a typical picornavirus gene order of VP0-VP3-VP1-2A1-2A2-2B-2C-3A-3B-3C-3D (Fig. 1A and 2).
|
An analysis of the N terminus of the 2A protein of SePV-1 revealed a sequence corresponding to the canonical cotranslational cleavage site DxExNPGP (Fig. 2). Cleavage at this site is cotranslationally mediated in cis by 2A and is present in cardioviruses, erbovirus, teschovirus, and aphthoviruses as well as in LV and DHV but not in HPeV, hepatoviruses, and the kobuvirus Aichi virus (7, 12, 18, 44, 45). Proteolytic cleavage at this site would therefore release a small 2A1 protein (Fig. 2). Based on the hypothesized cleavage sites, SePV-1 therefore appears to contain two structurally unrelated 2A proteins in a manner analogous to that of LV (13, 14, 17) but unlike HPeV, which contains a single 2A protein (11). Additionally, DHV may encode a third 2A protein absent in both LV and SePV-1 (6, 17, 38).
Untranslated terminal regions. The length of the SePV-1 5'UTR until the polyprotein initiation AUG was unusually short at 506 nucleotides and included two terminal uracils required to covalently link the RNA to the VPg (3B) protein, a characteristic common to all picornaviruses (3). The secondary structure of the 5'UTR RNA was predicted using a combination of a thermodynamic folding energy minimization algorithm (MFOLD) and a stochastic context-free grammar method (PFOLD), independent algorithms that produced generally concordant results for the main structural features (Fig. 3). The analysis carried out is necessarily limited by the availability of only a single SePV-1 sequence from the 5'UTR; the following therefore presents provisional predictions that will have to be confirmed once more sequences from this region become available. For this reason, the presentation of results has concentrated on the more obvious structural features or those supported by similarities to structurally similar internal ribosome entry sites (IRESs) of other viruses.
|
Although neither MFOLD nor PFOLD can predict tertiary RNA structure interactions, sequences at the 3' end of the 5'UTR can be convincingly modeled onto the previously proposed type IV IRES (9). Specifically, the predicted structure was most closely similar to IRES structures of members of the Sapelovirus and Teschovirus genera and to DHV, together classified as type IVB IRES elements. SePV-1 thus contains the highly conserved IIIe stem-loop (with the unpaired GAYA sequence), a CpG dinucleotide pairing (IIIf), a longer-range interaction to form stem 1, and, finally, a pseudoknot pairing between positions 491 and 495 and upstream positions 467 to 471). As well as structural conservation, there were also several regions of sequence identity between SePV-1 and other viruses with a type IVB IRES (Fig. 4), including both paired and unpaired bases, and a large number of covariant sites between the six sequences analyzed that lend further bioinformatic support for the proposed structure.
|
The 3'UTR was also the shortest one reported for a picornavirus, with a length of 34 nucleotides, the next biggest being from rhinoviruses, at 40 nucleotides. No folded RNA structures were detected.
Myristylation and leader peptide. Myristylation plays a crucial role in virion morphogenesis of most picornaviruses and involves the covalent linkage of myristic acid to an N-terminal glycine residue in a canonical GxxxT/S motif. SePV-1 contains a putative myristylation sequence starting at position 16 relative to putative methionine amino termini. This location for myristylation most resembles that of HPeV at position 13, while DHV has a canonical sequence at position 31 and LV has one at position 3. The non-near-terminal position of the putative myristylation site of SePV-1 indicates that, as proposed for DHV and as shown for HPeV, myristylation may not occur and that alternative modifications at the amino terminus of the polyprotein may direct it to a lipid environment (14, 17, 36).
Leader peptides of variable length are found in the Cardiovirus, Aphthovirus, Teschovirus, Kobuvirus, and Erbovirus genera, for which various roles have been proposed (18, 32, 33, 44, 45). HPeV and LV are not thought to encode a leader peptide, while DHV may encode a short leader peptide (6). SePV-1, like HPeV and LV, is not expected to encode a leader peptide.
GORS. Previous analyses of RNA structure formation among the different picornavirus genera indicated that members of the parechoviruses were unstructured, with no significant differences in MFEs between native and sequence order-scrambled sequences (34). The lack of genome-scale ordered RNA structure (GORS) in the Parechovirus genus contrasted with the high levels of sequence order-dependent RNA structure in the genera Aphthovirus, Kobuvirus, and Teschovirus. Since the possession of GORS varies between genera, we conducted a more detailed comparison of RNA structure between human parechoviruses and LV (Parechovirus genus), DHV, and SePV-1 (Fig. 5). Each of the four parechovirus-like virus groups showed low MFEDs averaged over the length of the genome, similar to values previously determined for the "unstructured" enteroviruses and hepatoviruses and distinct from those genera with the Picornaviridae previously shown to possess GORS. Analysis of subgenomic regions revealed that the sequence order-dependent RNA structure within SePV-1 and other parechovirus-like viruses was in the 5' and 3' noncoding regions (data not shown). Recomputation of MFEDs for sequences confined to the coding region produced mean values of 0.06% for SePV-1 and similarly lower values for HPeV, DHV, and LV (–0.57% to 0.95%).
|
| DISCUSSION |
|---|
|
|
|---|
Picornaviruses have traditionally been identified and classified based on biophysical/antigenic properties. Viral genome sequences have also been used for taxonomy purposes. The family Picornaviridae is currently divided into nine genera (Aphthovirus, Cardiovirus, Enterovirus, Hepatovirus, Parechovirus, Rhinovirus, Erbovirus, Kobuvirus, and Teschovirus) (18, 35). Rhinoviruses have recently been reclassified as a new species within the Enterovirus genus, while sapeloviruses, SVV, and DHV may become new Picornaviridae genera. The nearest genetic neighbors of SePV-1 are the parechoviruses (HPeV and LV) and the currently unclassified DHV, which has been recently proposed as the prototype member of a new picornavirus genus (6, 17, 38). Relative to the parechoviruses, SePV-1 has a more basal phylogenetic root than DHV1 in the nonstructural 2C and 3D regions, while DHV1 was more basal in the P1 region. SePV-1 is therefore divergent enough from other picornaviruses in both regions to qualify as the prototype of another picornavirus genus.
SePV-1 contains some but not all the characteristics of HPeV, LV, or DHV1. All four viruses show similar G+C contents (40 to 44%), but unlike the parechoviruses HPeV and LV, which have type II IRESs, SePV-1 and DHV1 contain 5'UTR structures similar to those of sapeloviruses and are classified as type IVB (9). Alternative relationships between these viruses existed in other attributes. For example, like HPeV and DHV but unlike LV, its canonical myristylation site appears to be too far from the C terminus of VP0 to be functional. The two 2A proteins of SePV-1 resemble the genetic organization of LV 2A1 and 2A2 proteins and are distinct from HPeV's single 2A or DHV's three 2A proteins. Like DHV1 and LV but unlike HPeV, its 2A1/2A2 boundary contains the canonical site required for cotranslational cleavage. Like HPeV and LV but unlike DHV1, the SePV-1 genome appears to be missing a leader protein. SePV-1 therefore also exhibits a unique mixture of picornavirus genetic characteristics.
Microarrays consisting of highly conserved viral sequences have been used for the identification of both known and novel viral species (20, 40-42). The level of sequence similarity between SePV-1 and the most closely related oligonucleotides on the latest version of the microarray was lower than that observed when the severe acute respiratory syndrome coronavirus was identified by cross-hybridization with preexisting sequences (42) (data not shown). It is therefore unclear if such divergent picornaviruses as SePV-1 would be detected using microarrays based on preexisting viral sequences, but the inclusion of SePV-1 sequences on future microarrays will allow further searches for new viruses in this region of viral sequence space.
Picornaviruses can replicate in numerous mammals and birds, and picornavirus-like viruses have been found in insects (18). A recent study (4) using degenerate PCR primers targeting conserved amino acids in the highly conserved RNA-dependent RNA polymerase gene (26) also found highly diverse picornavirus-like viral sequences in marine seawater, including a virus causing the lysis of a toxic bloom-forming alga (25) and one found in clams (19). The new picornavirus described here is distinct enough from those already known to infect mammals and birds to represent a novel genus clearly anchored in the Picornaviridae family. To our knowledge, it is also the first sequenced picornavirus shown to infect a marine mammal.
| FOOTNOTES |
|---|
Published ahead of print on 17 October 2007. ![]()
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| J. Bacteriol. | Mol. Cell. Biol. | Microbiol. Mol. Biol. Rev. |
|---|
| Clin. Vaccine Immunol. | ALL ASM JOURNALS |
|---|