Department of Virology, Haartman Institute, FIN-00014 University of Helsinki, Finland,1 Microbiology and Tumour Biology Center, Karolinska Institutet, S-17177 Stockholm,2 Swedish Institute for Infectious Disease Control, S-17182 Stockholm, Sweden3
Received 24 March 2003/ Accepted 28 May 2003
| ABSTRACT |
|---|
|
|
|---|
G and U
C transitions, suggested that the population of L RNA molecules is represented by quasispecies. The mutation frequency in the L segment quasispecies appeared to be similar to the corresponding values for the S and M quasispecies. Analysis of the cDNA clones with the complete S segment sequences from passage 20 confirmed our earlier conclusion that the cell-adapted genotype of the virus is represented mostly by variants with mutated S segment noncoding regions. However, the spectrum of the S segment quasispecies appeared to be changing, suggesting that, after the initial adaptation (passages 1 to 11), the viral population is still being driven by selection for variants with higher fitness. | INTRODUCTION |
|---|
|
|
|---|
Why only some hantaviruses are pathogenic for humans remains unclear. Among the factors that might determine or influence the pathogenicity of hantaviruses, the nature of their rodent hosts is considered one of the most important (33, 34). To gain more insight on molecular markers associated with the host specificity and potentially the virulence of hantaviruses, we invented a model based on adaptation of wild-type Puumala virus from its natural rodent host, the bank vole, to primate cells in culture. We showed that when the wild-type Puumala virus passaged in colonized bank voles was passaged serially in Vero E6 cells, its ability to infect bank voles decreased rapidly, i.e., the virus became adapted to primate cells (24).
Sequence analysis of the complete S and the M RNA segments of the wild-type and Vero E6 cell-adapted variants revealed that the adaptation to the new host cells was accompanied by the accumulation of point mutations in the NCRs of the S segment but not the M segment. Notably, S RNA molecules carrying mutations in the 5' and 3' NCRs of the S segment (positions 26 and 1577) were accumulating gradually in the genetic (quasispecies) spectrum of the Vero E6 cell-adapted variant, starting from passage 3. At passage 11, the last passage monitored in that study, the mutant S RNA molecules represented the majority of the population. Sequencing of the S segment of another cell-adapted variant, Vero E6-II, obtained in an independent adaptation experiment, revealed a mutation at position 1580, only three nucleotides downstream of the mutation found earlier in cell culture-adapted variants of Puumala virus (24).
Comparative sequence analyses performed in the above-mentioned study did not include the L segment encoding the viral polymerase-replicase. As shown for Hantaan virus (8) and other members of the Bunyaviridae (11, 15), the L segment and the L protein can carry mutations associated with host range restriction and attenuation. To complete the genetic comparison of the wild-type and cell culture-adapted variants of Puumala virus, we present here the results of sequence analysis of their L segments. In addition, we performed a follow-up analysis of the mutations in the 5' and 3' NCRs of the S segment found earlier.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Sequence analysis of L segment. Total RNA was extracted from samples of lung or kidney tissue of infected colonized bank voles (wild-type Puumala virus strain Kazan) or from Puumala virus-infected Vero E6 cells (strain Kazan E6-I), approximately 2 x 106 cells, and purified by acidic guanidine thiocyanate-phenol-chloroform extraction (5). RNA was dissolved in 30 µl of RNase-free water. Half of each RNA preparation was precipitated with ethanol, dissolved in 4 µl of water, and used for RNA ligations. The L segments of both the wild-type and Vero E6-I viruses from passage 11 variants were reverse transcribed and amplified in three overlapping parts of 2,253, 2,176, and 2,344 nucleotides (nucleotides 1 to 2253, 2157 to 4332, and 4207 to 6550, respectively) (Fig. 1).
|
Sequencing of S segment. Cells infected with the Puumala virus Kazan Vero E6-I variant were harvested at passage 20, and reverse transcription (RT)-PCR for the complete S segment was performed as described previously (31). The PCR products were cloned into the pGEM-T plasmid and sequenced automatically as described above.
RNA folding. RNAstructure program version 3.71 was used for modeling of secondary RNA structures (26). The program is an implementation of the Zuker algorithm to predict RNA secondary structures from the sequence, based on the principle of minimizing free energy.
| RESULTS |
|---|
|
|
|---|
Initially, two cDNA clones were sequenced for each of the three overlapping parts of the L segment of the Puumala virus wild-type and Vero E6-I variants. Two mutations that distinguished the cDNA clones of the Vero E6-I variant from those of the wild-type virus were found, a silent transition, C1053U, and a transition, C6194U, that led to a Ser2053Phe substitution in the deduced L protein sequence. Both mutations were further confirmed by sequencing of a third cDNA clone for each of the two variants and independent RT-PCR for nucleotides 539 to 1140 and nucleotides 6112 to 6550, followed by direct sequencing. Thus, the master L segment sequences of the two Puumala virus variants differed at only two positions, and the deduced L protein sequences differed at one position only.
The substitution Ser2053Phe resides in the C-terminal part of the L protein, within a short, moderately conserved region spanning amino acid residues 2029 to 2072 and outside the highly conserved polymerase domains A to E (28). Most of the amino acid substitutions observed between different hantaviruses within this region are homologous, and only four nonhomologous substitutions occurred, at positions 2035 (in Tula virus and Sin Nombre virus), 2053 (in Sin Nombre virus), 2061 (in Dobrava virus and Saaremaa virus), and 2067 (in Tula virus and Sin Nombre virus). The amino acid mutation that accompanied the adaptation of wild-type Puumala virus Kazan to cell culture placed the nonpolar phenylalanine residue at position 2053, where all other hantaviruses except Sin Nombre virus have polar serine or cysteine residues. Since no differences in the sequences of the N protein and the G1-G2 proteins were observed earlier (24), the Ser2053Phe substitution in the L protein is the only amino acid substitution found after complete comparison of all proteins of the Puumala virus wild-type and Vero E6-I variants.
L RNA folding. The overall modes of folding of L viral RNA (minus-sense) and cRNA (plus-sense) of the wild-type and Vero E6 cell-adapted Puumala virus variants were similar (data not shown). Most important, both substitutions found in the cell-adapted variant of the virus led to local changes in L viral RNA folding. The C1053U substitution observed in cDNA clones (G5498A, in viral RNA) causes transformation of a double helix region into a loop (Fig. 2C and D). The C6194U substitution (G357A, in viral RNA) induces changes in another small double helix, not disrupting the structure totally but weakening it (Fig. 2A and B). Together, these two events led to a lower level of free energy for the folding of the viral RNA of the cell-adapted variant, -1,263.2 kcal instead of -1,266.9 kcal for the wild-type variant.
|
L segment quasispecies.
In addition to the mutations at position 1053 and 6194, 17 other mutations were observed in individual cDNA clones generated from the wild-type variant and 16 mutations in those generated from the Vero E6-I variant. All positions where the mutations occurred were rechecked with an additional cDNA clone for each variant. As none of the 33 mutations was found in more than one clone, they were considered not relative to the master sequences but instead representing L segment quasispecies, each occupying only a minor portion of the genetic swarm. Similar to what was observed for the S segment quasispecies, mutations in the L segment quasispecies were mostly A
G and U
C transitions (30% and 21%, respectively). The average frequency of nucleotide substitutions observed in the L segment quasispecies, 0.6 x 10-3, was comparable to those in the S segment and the M segment quasispecies found earlier (24). As the level of RT-PCR errors determined in the previous study was 0.2 x 10-3, some of the mutations in the L segment might have originated from nucleotide misincorporations during the RT or PCR steps. The majority, however, seem to represent "genuine mistakes" of the viral RNA-dependent RNA polymerase.
Follow-up of mutations found in L segment and S segment. To follow up on the two nucleotide changes observed in the L segment during the adaptation, the corresponding regions of the L segment (nucleotides 6112 to 6550) were also recovered from passages 12 and 20 by RT-PCR and sequenced directly. Both substitutions found in the Vero E6-I variant on passage 11 (positions 1053 and 6194) were seen on the following passages, showing that these mutations are firmly associated with the Vero E6-I geno- and phenotype.
To follow up on the nucleotide changes at positions 26 and 1577 that were observed earlier in the 5' and 3' NCRs of the S segment of the Vero E6-I variant (24), cDNA clones containing complete S segment sequences were recovered from passage 20 and analyzed (Fig. 3). It was found that the mutant variants again, as for passage 11, represented a majority of the S segment sequences. However, the structure of the S segment quasispecies seemed to be different from that observed on passage 11. First, the G26U transversion that was previously found in 50% of cDNA clones from the cell-adapted variant (passage 11) now appeared in only one of eight cDNA clones. Instead, a new nucleotide replacement, a U25C transition, was found in five of eight cDNA clones analyzed. Second, only two of the eight cDNA clones carried the U1577G transversion found in the majority of clones (seven of nine) at passage 11. Notably, both cDNA clones with the U1577G mutation carried the U25C transition as well. The likely reasons for the observed differences are discussed below.
|
| DISCUSSION |
|---|
|
|
|---|
We have previously developed a model for studies on hantavirus host adaptation and initiated genetic analysis of Puumala virus variants passaged in colonized bank voles and in cultured Vero E6 cells (24). With the data presented in this paper, the sequence comparison of the wild-type and Vero E6-adapted variants of Puumala virus strain Kazan has been completed. No mutations were found in the coding regions of the S and M segments, suggesting that the amino acid sequences of the nucleocapsid protein and surface glycoproteins G1 and G2 were not changed during the adaptation process. The only amino acid substitution that distinguished the two virus variants was found in the L protein, Ser versus Phe at position 2053. Not surprisingly, it was located outside the highly conserved motifs shared by RNA-dependent RNA polymerases of different origins (28, 37). An intriguing feature of this substitution is that the nonpolar phenylalanine appeared at a position where all other known hantaviruses except Sin Nombre virus have polar serines or cysteines (Fig. 1).
Another mutation found in the L segment, the silent transition C1053U, could result from the selection of a variant with altered (and more favorable for Vero E6 cell culture) L RNA folding. Random genetic drift at this position cannot be totally excluded. Notably, both mutations were found at passage 20 as well and thus were firmly associated with the Vero E6-I variant of the virus and considered advantageous for its survival in cell culture. They could lead to local changes in the L viral RNA folding (Fig. 2), increasing the efficiency of transcription and/or replication. The amino acid substitution in the L protein could contribute to more efficient replication in cell culture as well, e.g., by supporting a conformation which works better on the mutated RNA templates and/or in concert with some as yet unknown cellular factor(s).
Our findings of mutations in the polymerase-encoding gene during the cell culture adaptation of Puumala virus are in line with data accumulated earlier for other bunyaviruses (11, 15, 43). As for hantaviruses, it was recently shown that one mutation in the L segment contributes to a better replication of Hantaan virus in suckling mice and thus increases its virulence (8). The authors, however, suggested that a single amino acid substitution observed in the G1 protein was primarily responsible for the altered phenotype of the virus, while the mutation in the L segment was considered less crucial.
The nucleotide substitutions that we observed in individual L cDNA clones, most of them A
G and U
C transitions, suggested that the population of L RNA molecules is represented by quasispecies. Although calculated from a limited number of nucleotides sequenced (approximately 27,000 altogether), the mutation frequency in the L segment quasispecies appeared to be similar to the corresponding values for the S and M quasispecies (and well above the level of RT-PCR errors) (24). Such a similarity confirms their common origin, base misincorporation occurring during virus replication. This is the mechanism known to operate in RNA viruses in general (7, 17) and in hantaviruses in particular (13, 32).
It should be noted that the very terminal sequences of all three genome segments of the wild-type and Vero E6 cell-adapted variants, which were used as annealing sites for primers in reverse transcription and PCR amplification, could not be compared in the same way as the rest of the Puumala virus genome. We therefore performed this comparison by RNA ligation followed by RT-PCR through the ligation point, an approach exploited earlier to recover terminal sequences of other hantaviruses (4, 23). Not surprisingly, we found that all three genome segments of the wild-type and cell-adapted variants had deletions in the 3' termini, while the 5' termini were mostly intact. However, none of the 12 cDNA clones representing the termini of the Vero E6-I S segment from passage 11 were shown to carry the G26U mutation that was earlier observed in half of the cDNA clones with complete S segment sequences. Similarly, no mutations at position 25 or 26 were seen in 10 cDNA clones representing termini of the Vero E6-I S segment from passage 20. These observations led to the conclusion that cDNA clones prepared via ligation followed by RT-PCR through the ligation point and those obtained via RT-PCR for the complete S segment represented different (sub)populations of S RNA molecules from the infected cells. It would be of interest to study this phenomenon in greater detail, perhaps exploiting other methodologies as well (see, e.g., reference 27). This, however, is beyond the scope of the current study.
Analysis of the cDNA clones with complete S segment sequences from passage 20 confirmed our earlier conclusion (24) that the cell-adapted genotype of the virus is represented mostly by variants with mutated S segment NCRs. However, the structure of the S segment quasispecies (as can be seen within the modest number of the cDNA clones sequenced) appeared to be changed. The two mutations described earlier for the cell-adapted variant at passage 11 (U1577G and G26U) remained in the population but were seen in only a small portion of the cDNA clones. Instead, S RNA molecules with a U25C mutation became dominant. One can hypothesize that the S RNA molecules with the U25C mutation (and, consequently, the viral particles that carry them) possess a higher fitness for growth in cell culture than the molecules carrying the mutation at position 26. Although no new mutations appeared at passage 20 in the 3' NCR, the U1577G substitution was found in only two out of eight cDNA clones. A likely reason for the "stepping down" of this mutation might be that the U25C substitution alone produces a better-fit virus that is able to maintain stable growth in cell culture. Alternatively, having only a portion of the S RNA molecules with mutated nucleotide 1577 might be enough for the virus to survive in cell culture. Looking from this angle, it is of interest that both cDNA clones with the U1577G substitution also carried the U25C substitution.
The noncoding regions of many RNA viruses are known to carry important molecular determinants of virulence (reference 21 and references therein). For instance, the poliovirus type 3 vaccine strain consistently reverted to a neurovirulent phenotype by acquiring the same single nucleotide substitution in the 5' NCR (12). In alphaviruses, modifications of the 3' and 5' NCRs had an impact on their virulence, and double mutants carrying mutations in both NCRs were more attenuated than single mutants (22). It was suggested that these mutations could change the viral phenotypes through modification of the secondary structure of genome RNA and/or alteration of binding sites for cell-specific protein factors. In a study of two Hantaan virus variants with different virulence for suckling mice (8), one mutation was found in the 3' NCR of the L segment. Some heterogeneity of complementary sequences at the 5' and 3' termini in the L and M segments was also observed. It has already been mentioned that the authors hold the single amino acid substitution in the G1 protein primarily responsible for the altered phenotype of the virus.
Taken together with our earlier observations (24), the data obtained in this study suggested that the cell-adapted genotype of Puumala virus Kazan is associated with mutations in the L segment and S segment NCRs. These findings naturally led to the question of how functionally sound the observed mutations are. Unfortunately, we were not able to adapt the cell-adapted virus back into bank voles. The few animals that showed the presence of viral nucleocapsid antigen in their lungs (24) failed to produce enough virus for further passaging; our attempts to recover viral sequences from them were not successful either (our unpublished observations). For many RNA viruses, the answer to the question above was found from the fruitful field of reverse genetics (16). Manipulations of the hantavirus genome, however, have proven to be a formidable task, and even minigenomes (which would allow detailed mutational analysis of hantavirus promoters) remained elusive until very recently (14). This hampered functional dissection of individual mutations observed via comparative sequence analyses, such as the one reported here for Puumala virus or elsewhere for the Dobrava and Saaremaa viruses (29). More hope for progress in this direction also rises from recent findings on transfection-driven recombination in cell culture, which yields functionally competent and genetically stable hantavirus (35).
| ACKNOWLEDGMENTS |
|---|
This work was supported by grants from the Academy of Finland, the Sigrid Jusélius Foundation, Helsinki, Finland, the Swedish Medical Research Council (no. 12177 and 12642), and the European Community (no. QLK2-1999-01119 and QLRT-2001-01358) and by the Nordic Academy for Advanced Study (guest professorship for A.P.).
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| J. Bacteriol. | Mol. Cell. Biol. | Microbiol. Mol. Biol. Rev. |
|---|
| Clin. Vaccine Immunol. | ALL ASM JOURNALS |
|---|