Previous Article | Next Article ![]()
Journal of Virology, January 2007, p. 1013-1021, Vol. 81, No. 2
0022-538X/07/$08.00+0 doi:10.1128/JVI.00584-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
Çi
dem H. Williams,1,
Pamela J. Hughes,1
David P. Schnurr,2 and
Glyn Stanway1*
Department of Biological Sciences, University of Essex, Colchester CO4 3SQ, United Kingdom,1 California Department of Health Services, Richmond, California2
Received 23 March 2006/ Accepted 15 September 2006
|
|
|---|
|
|
|---|
Five HPeV and 4 Ljungan virus complete nucleotide sequences are available and reveal that parechoviruses have a number of distinctive features (3, 9, 15, 16, 33). For instance, whereas the single-stranded positive-sense RNA genome of most picornaviruses is enclosed in a capsid made of 60 copies of 4 proteins VP1 to 4, parechoviruses have only 3 proteins because the cleavage of VP0 to VP4 and VP2 seen in other picornaviruses does not occur (43). In addition, in common with that of members of another picornavirus genus Kobuvirus, the 2A protein is of an unusual form, sharing motifs with a family of cellular proteins involved in control of cell proliferation (14). Studies on entry, translation, and replication have started to underscore the significance of molecular features revealed by sequence analysis, but the relative paucity of sequence information hinders a fuller understanding of HPeV molecular biology, evolution, and epidemiology (4, 20, 21, 22, 30, 31, 39). To address this issue, we have analyzed 7 HPeV strains isolated in California between 1973 and 1992. We found that, on the basis of VP1 sequences, these fall into two genetic groups, one of which has not been previously observed, bringing the number of known groups to five. In view of the growing importance of molecular techniques in diagnosis, we suggest that genotype is an important criterion for identifying viruses, and we propose that these groups should be termed human parechovirus types 1 to 5. Complete nucleotide sequence analysis of two of the Californian isolates, representing two types, confirmed the identification of a new genetic group and suggested for the first time the occurrence of recombination in parechovirus evolution. It also allowed the identification of a putative HPeV1 CRE (cis-acting replication element) as well as the refinement of previously predicted RNA structures.
|
|
|---|
![]() View larger version (33K): [in a new window] |
FIG. 1. (a)
Phylogenetic tree, based on the partial amino acid sequence of VP1,
showing the relationship between the 54 available HPeV sequences for
this region. Sequences were aligned with CLUSTALW, and the tree was
constructed by the PHYML program, using the JTT model of amino acid
substitution. The corresponding VP1sequences of two more distantly
related parechovirues, Ljungan virus M1146 and 87-012, are included as
out-groups, and the branches to the Ljungan virus nodes have been
truncated for space reasons, as indicated by the dotted lines.
Bootstrap values from 100 pseudoreplicates are shown for the major
nodes. The HPeVs cluster into 5 genoypes, labeled types 1 to 5. (b)
Alignment of the C-terminal region of VP1 showing the RGD motif, seen
in 4 of the types, and well-conserved flanking residues in boldface
type.
|
Cell and virus propagation. Monolayers of A549 (lung carcinoma) cells were maintained in minimal essential medium containing 10% heat-inactivated fetal calf serum, 1% nonessential amino acids (all from Invitrogen), and 100 µg/ml gentamicin (Sigma) in 25-cm2 tissue culture flasks (Nunc). HPeV isolates were adsorbed to the cells for 1 h at room temperature in 1 ml of growth medium, then 4 ml of medium was added, and the cells were incubated at 37°C for 48 h. Virus was liberated by freeze-thawing, and the supernatant was used directly for RNA isolation.
RNA isolation and reverse transcription (RT)-PCR. To isolate viral RNA from cell cultures, a NucleoSpin RNA II kit (Macherey Nagel), allowing purification of up to 20 µg of highly pure total RNA, was used according to the manufacturer's instructions. Briefly, 100 µl of virus-containing cell culture supernatant and 350 µl of RA1 lysis buffer were mixed vigorously to lyse residual cells. The lysate was mixed once again with 350 µl of ethanol (70%) to adjust RNA binding conditions. The suspension was transferred to a NucleoSpin RNA II column and centrifuged (11,000 rpm, 30 s). The supernatant was passed through a silica membrane, then desalted by addition of 350 µl membrane desalting buffer, and centrifuged (14,000 rpm, 1 min) to dry the membrane. Genomic DNA was eliminated by applying 95 µl of DNase reaction mixture directly on the silica membrane. After a 15-min incubation at room temperature, the column was washed with 200 µl of buffer RA2 to inactivate DNase and centrifuged (11,000 rpm, 30 s). Then the column was washed with 600 µl and 250 µl of buffer RA3, respectively, to remove salts and other cellular components. The membrane was then completely dried by centrifugation for 2 min at 14,000 rpm. Total RNA was finally eluted with 40 µl RNase-free water and centrifuged at 14,000 rpm for 1 min.
Extracted RNA and two oligonucleotides complementary to the opposite ends of the region of interest were subjected to RT-PCR using a SuperScript one-step RT-PCR kit (Invitrogen) according to the manufacturer's instructions. For the RT-PCR, 5 µl of template RNA, 50 to 100 pmol of each primer, 1 µl of RT-Taq mix, and 25 µl of 2x reaction mix were mixed together in a total volume of 50 µl. Thermal cycling was carried out automatically using a thermal cycler (Perkin Elmer 2400 or Appligene Crocodile III). The specific amplicon was isolated by agarose gel electrophoresis and cloned into the pGEM-T Easy vector (Promega). The DNA was sequenced from both ends, and the sequence was completed using primers specific for the newly generated sequences. The sequences obtained were then used to design a second set of primers for RT-PCR. As these matched the viral RNA sequences exactly, they gave improved PCR products which could be sequenced directly. All of the sequences were determined directly in both orientations from these products.
Computer analysis. Sequences were aligned using ClustalW and phylogenetic trees produced using the maximum likelihood approach implemented in the program PHYML (11, 48). Potential recombination was studied with SimPlot version 3.5.1, using the Kimura 2 parameter (25), and RNA folding was predicted using MFOLD (28, 51).
|
|
|---|
All of the Californian isolates contain a C-terminal proximal RGD motif, shown to be essential for infectivity in HPeV1, as do all the HPeVs where this region has been sequenced, except those from HPeV type 3 (Fig. 1b). The HPeVs conform to the previously observed RGDM/L context seen in most other picornaviruses with a functional RGD motif (5, 6, 9), and the downstream amino acid, usually A, is also strongly conserved. There is weaker conservation of the +4 position than in other picornaviruses, where RGDXXXL is frequently seen.
Sequences of T75-4077 and T92-15. The virtually complete sequences of T75-4077 (type 4) and T92-15 (type 5), representatives of each of the two genetic groups including the Californian isolates, were determined by RT-PCR and sequencing of overlapping cDNA fragments of the genome. The first 20 nucleotides of T75-4077 were not determined, as this region was used for priming the PCR. In common with previous analyses of HPeVs, it proved difficult to amplify the first few nucleotides of T92-15, and a downstream primer was used, meaning that the unique sequence starts at position 46. Both T75-4077 and T92-15 possess previously identified HPeV-specific sequence features, including the N-terminal extension to VP3, relative to any other picornavirus, which contains several basic amino acids and the conservation of 2A motifs previously shown to be shared with a family of cellular proteins involved in the control of cell proliferation (14, 43) (data not shown).
A phylogenetic tree showing the relationship between the P1 amino acid sequences of completely sequenced HPeVs confirms the existence of the same 5 types defined by the VP1 analysis (Fig. 2a). In contrast, when the 3Dregion nucleotide sequences were analyzed, the isolates grouped differently (Fig. 2b). Here, T75-4077 clusters with the two type 3 viruses, which were distinct from other HPeVs in the P1 region. T92-15 is more diverse from its fellow type 5 virus CT86-6760 than in P1. Apart from the HPeV3 prototype and Can82853, which are 96% identical, there is no clear grouping in the 5' untranslated region (UTR) (data not shown).
![]() View larger version (12K): [in a new window] |
FIG. 2. Genetic
relationships between completely sequenced HPeVs. The designation of
each of the types, as defined in Fig.
1a, is indicated by the
number (1 to 5) on the right hand side of each tree. (a) Tree based on
amino acid sequences of the P1 region. Sequences were aligned with
CLUSTALW, and the tree was constructed by the PHYML program, using the
JTT model of amino acid substitution. The corresponding P1 sequences of
two more distantly related parechoviruses, Ljungan virus M1146
(Ljungan M) and 87-012 (Ljungan 87), are included as
out-groups, and the branches to the Ljungan virus nodes have been
truncated for space reasons, as indicated by the dotted lines.
Bootstrap values from 100 pseudoreplicates are shown for each node. (b)
Tree based on nucleotide sequences of the 3D region. Sequences were
aligned with CLUSTALW, and the tree was constructed by the PHYML
program, using the HKY model of nucleotide substitution and a
transition/transversion ratio of 4. The corresponding 3D sequences of
two more distantly related parechoviruses, Ljungan virus
M1146 (Ljungan M) and 87-012 (Ljungan 87), are included as
outgroups,
and the branches to the Ljungan virus nodes have been truncated for
space reasons, as indicated by the dotted lines. Bootstrap values from
100 pseudoreplicates are shown for each
node.
|
![]() View larger version (38K): [in a new window] |
FIG. 3. SimPlot
analysis of the relationship between completely sequenced HPeVs, based
on nucleotide similarities of other HPeVs to T92-15 (top panel) and
T75-4077 (lower panel). The Kimura 2 correction was applied, with a
Ts/Tv ratio of 3, and a window of 600 nucleotides and a step of 10
nucleotides were
used.
|
Identification of a putative HPeV CRE. The CRE has been shown to be critical for RNA replication in several other picornaviruses and to consist of an RNA stem-loop, generally but not exclusively located in the coding region (10, 24, 27, 38, 50). Alignment of the seven complete and six partial HPeV sequences revealed a section of the VP0-encoding gene with a high degree of nucleotide sequence identity (Fig. 4a). This sequence is not constrained by the occurrence of methionine or tryptophan codons or by a preponderance of twofold degenerate codons (data not shown). MFOLD analysis of the region shows that it can be predicted to fold into a stem-loop (Fig. 4b). Covariance in this stem provides evidence that the structure is significant. Several of the HPeV sequences differ from those of HPeV1 Harris at position 1392 and its predicted partner 1423, and there is also variation at the base pair predicted between positions 1395 and 1420, in each case, the stem structure being maintained by the pairs of differences. Within the predicted loop there is an AAA sequence motif common to other picornavirus CREs, and alignment of representative CREs with that predicted in HPeVs shows strong conservation of the larger motif CAAAC (Fig. 4c). The other Parechovirus species Ljungan virus (18) does not have a putative CRE at the corresponding location, but a similar analysis (data not shown) reveals a potential structure in the VPg-encoding region. Again a CAAAC motif is seen in the loop structure (Fig. 4c). Only one covariant position is seen, but the limited number of Ljungan virus sequences available probably reduces the possibility of observing covariance.
![]() View larger version (41K): [in a new window] |
FIG. 4. (a)
Alignment of the putative CRE region of HPeV strains. < and
> denote nucleotides participating in the predicted stem, and
covariant residues differing from the HPeV1 sequence are shaded. U/C
differences maintaining the structure are underlined. (b) MFOLD-derived
structure of the HPeV1 CRE. (c) Alignment of the loop regions of CREs
in picornaviruses, showing the strong conservation of the motif
CAAAC.
|
3' UTR structure. The 3' UTRs of other picornaviruses contain secondary and, in the case of enteroviruses, tertiary structure elements important in virus replication (29). Analysis of the HPeV sequences shows that in each case much of the 3' UTR is made up of two highly conserved repeats of the sequence AUUAGAACACUAAUUUG arranged in tandem (Fig. 5). The 3' UTR structure of HPeVs was analyzed using MFOLD, and in accord with a recent report (1), it is predicted to be composed of a single stem-loop with extensive base pairing between the poly(A) tail and the highly U-rich 5' part of the 3' UTR (predicted base paired regions shown in Fig. 5). The low degree of variability between the sequences means that there is little covariance support for the structure, but differences are largely concentrated in predicted loops, which is consistent with this being the structure of the region.
![]() View larger version (48K): [in a new window] |
FIG. 5. Alignment
of the 3' UTRs of HPeVs showing the tandem repeats (shaded).
< and > denote nucleotides participating in the
predicted stem-loop
structure.
|
|
|
|---|
In enteroviruses, there is a good correlation between VP1 sequence and serotype, and this has led to several new enteroviruses being defined entirely on the basis of sequence identity (32, 34, 37). These are called types, rather than serotypes, as neutralization data are now frequently not obtained. The relationship between HPeV genotype and serotype appears to be less clear than in enteroviruses, since HPeV2 Williamson and HPeV2 CT86-6760 represent two different genotypes (Fig. 1a). Thus, we are not suggesting that the new HPeV genotype necessarily represents a new serotype, although it should be noted that it has been shown that viruses belonging to this genotype are not neutralized by antisera against HPeV1 and HPeV2 reference strains (41). As virus detection is now mainly being performed using PCR, molecular features become increasingly important in defining differences between viruses. We therefore propose that, by analogy with enteroviruses, the 5 genetic groups we have defined should be considered HPeV types. Each of these types contains viruses which cluster relatively tightly, although the recent type 1 viruses are relatively different from the HPeV1 prototype. At around 90% amino acid identity in the region of VP1 used, these type 1 isolates would be on the margin of defining separate types in enteroviruses, and there seems to be no compelling case to separate them into different HPeV types. The analysis of more type 1 HPeVs, isolated during the long period (nearly 50 years) between the prototype and the recent isolates, may help to resolve this issue.
Our results show that there were at least two HPeV types circulating in California between 1973 and 1992. There is no clear-cut correlation between date of isolation and type. The type 4 viruses were isolated in the 1970s and 1980s and the type 5 viruses in the 1980s and 1990s, but more isolates would need to be analyzed to clarify whether they were cocirculating or whether type 4 was superseded by type 5. Currently, the situation from Japanese, Canadian, and Dutch studies seems to be that HPeV types 1 and 3 are the predominant viruses, with no recent isolations of the other types among more than 50 viruses studied (1-3, 16).
In the P1
region, type 3 appears to be relatively diverse from the other types
(Fig. 2a), and it is
interesting that these viruses lack the RGD motif seen in VP1 in the
other viruses compared here (Fig.
1b). This has been shown
to be essential for replication of the HPeV type 1 prototype Harris,
presumably being required for interaction with
vß1 and
vß3, the RGD-dependent ligands reported to be its
receptors (4). In another
picornavirus, coxsackievirus A9 (CAV9), an RGD-less mutant is viable
and grows efficiently in some cells but shows altered pathogenicity in
a mouse model (12,
13). There is some
evidence that HPeV type 1 and 3 infections differ in age profile and
symptoms, and receptor tropism could be a factor in this
(2). It would be
interesting to determine whether HPeV type 3 recognizes these molecules
by an RGD-independent interaction, or whether another, unknown receptor
is used. In all of the HPeV isolates studied here, the RGD motif is
followed by the amino acid M or L (Fig.
1b), also well conserved
in the other picornaviruses with VP1 RGD motifs (foot-and-mouth disease
virus, CAV9, and echovirus 9) and presumably involved in RGD function
(5,
6). Integrin
vß6 appears to play a role in infection of both
foot-and-mouth disease virus and CAV9, and the similarity in RGD
context could imply that this integrin is also recognized by HPeV1
(17,
49). Interestingly, there
is considerable conservation of the RGD region between the viruses. For
instance, the final 11 VP1 amino acids of the HPeV type 1 prototype
Harris and the type 5 viruses T92-15 and T82-659 are identical. This
conservation is consistent with the observation that the RGD region is
not a major determinant of antigenicity
(21).
The diverse nature of HPeV type 3 in the P1 region makes surprising the observation that both fully sequenced type 3 viruses are comparatively closely related to the type 4 virus T75-4077 in the nonstructural protein region (Fig. 2b). This raises the possibility that recombination plays a role in the evolution of these viruses. SimPlot analysis (Fig. 3) shows that T75-4077 is much more closely related to type 3 than to other HPeVs throughout the final 3,000 nucleotides of the genome, particularly the final 1,500, although this similarity is not great enough to prove that it is due to recombination (Fig. 3). It is also interesting that the other strain sequenced, T92-15, is highly related to CT86-6760 in the capsid region, but the degree of similarity falls off rapidly after position 4500, suggesting acquisition of 3' sequences with a different evolutionary history by recombination, rather than gradual drift. These observations could suggest that frequent recombination in HPeVs leads to the generation of mosaic viruses which do not necessarily have a clear linkage between the P1 region and P2/3, but more complete HPeV sequences, which may include those with genomic regions with more clearly identifiable origins, are required to investigate this aspect of parechovirus evolution. As recombination has been well documented in another genus of important human picornaviruses, Enterovirus, it is not surprising that it should occur in parechoviruses, given that these are common pathogens and so there is potentially a high incidence of dual infections.
The availability of significantly more HPeV sequence information gives insights into the molecular biology of these viruses. In addition to the conservation of the RGD motif in most HPeVs, alignment of the sequences shows the presence of the N-terminal, basic extension to VP3 relative to other picornaviruses and conservations of the 2A motifs shared with cellular proteins involved in the control of cell proliferation (data not shown) (14, 43-45). The new data are also consistent with the previously predicted cleavage sites. The sequences are particularly useful in defining RNA structural features which are likely to play a role in RNA replication and/or translation. Mapping of nucleotide sequence differences suggests that the previously predicted 5' UTR structure is essentially correct (9). The derived structure is similar to those present in picornaviruses with a type 2 internal ribosome entry site, and HPeVs share functional similarity with these viruses (30, 47). We have also presented a predicted structure for the HPeV 3' UTR (Fig. 5). An interesting feature of the 3' UTR is an apparent tandem duplication of 17 nucleotides, which is highly conserved in all of the HPeVs studied. The 3' UTR is also highly U-rich following the stop codon, a feature seen in a number of other picornaviruses. Much of the 3' UTR has the potential to fold into a single, imperfect stem-loop. The prediction of the 3' UTR gives a framework for the analysis of the role of this region in HPeV replication, which will be useful, as the picornavirus 3' UTR is still relatively ill understood.
One of the most important aspects of the structural predictions is the probable identification of the HPeV CRE, an element which plays a critical role in positive-sense RNA synthesis in other picornaviruses through VPg uridylylation (38). The element was detected through the identification of a region of HPeV with high nucleotide sequence identity (Fig. 4a and b). This contains an AAA motif and can be folded to give a simple stem-loop (Fig. 4b). There is strong phylogenetic support for this stem-loop, as two different pairs of covariant nucleotide substitutions maintain the structure among all the HPeVs analyzed. Although the predicted loop is larger than that seen in the CREs of other picornaviruses, this structural support and presence of the AAA strongly suggests that this is the parechovirus CRE. The putative CRE is located in VP0, which is also the location of the CRE seen in cardioviruses, but the two locations are not analogous, as the cardiovirus CRE is in the ßA1 strand, as opposed to the ßG strand for HPeVs. Interestingly, the predicted CRE in Ljungan virus is largely in the VPg-encoding region. The identification of a CRE in different positions in two parechovirus species is not surprising, as the two rhinovirus species have CREs in VP1 (Human rhinovirus B) and 2A (Human rhinovirus A) (50). An analysis of the loop region of all known picornavirus CREs shows little overall conservation, except that the AAA motif, which is the site of VPg uridylylation, is part of a strongly conserved CAAAC motif (Fig. 4c).
Human parechoviruses, members of the Parechovirus genus of the Picornaviridae, are frequent pathogens but have been comparatively poorly studied. The analysis we have presented here greatly extends the information available on parechoviruses and gives important insights into their molecular biology as well as significantly improving our understanding of their diversity and evolution.
Published
ahead of print on 27 September 2006. ![]()
M.A.-S.
and C.H.W. contributed equally to this work. ![]()
|
|
|---|
vß6 is an RGD-dependent receptor for
coxsackievirus A9. J. Virol.
78:6967-6973.This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2010 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»