Previous Article | Next Article ![]()
Journal of Virology, October 2006, p. 9850-9860, Vol. 80, No. 19
0022-538X/06/$08.00+0 doi:10.1128/JVI.00715-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada,1 University of Iowa, Iowa City, Iowa,2 University of Manitoba, Winnipeg, Manitoba, Canada,3 University of Victoria, Victoria, British Columbia, Canada4
Received 7 April 2006/ Accepted 7 July 2006
|
|
|---|
|
|
|---|
125 kb of linear double-stranded DNA containing approximately 71 open reading frames (ORFs). The viral structure is similar to that of other alphaherpesviruses, consisting of two unique regions, unique long and unique short, each flanked by inverted repeats; short repeats termed terminal repeat long and internal repeat long border the unique long region, while larger repeats termed terminal repeat short (TRS) and internal repeat short (IRS) border the unique short region. A number of factors likely contribute to the overall stability of the VZV genome, one of which is the efficient proofreading activity of the DNA polymerase, which exhibits 3' to 5' exonuclease activity as documented in herpes simplex virus type 1 (HSV-1) (2). The synonymous and nonsynonymous mutation rates among the herpesviruses have been estimated at 1 x 107 and 2.7 x 108 mutations/site/year, respectively, based on the highly conserved gB gene (31). These mutation rates are an estimated 10 to 100 times faster than the equivalent sections of their host genomes but still very low relative to other viral families. As the alphaherpesviruses are so stable genetically with such a low rate of nucleotide substitution, recombination is thought to have played a crucial role in the evolution of different branches of the phylogenetic tree.
Initial attempts to examine genetic variation among isolates primarily used scattered restriction polymorphisms (ORF 38 PstI, ORF 54 BglI, and ORF 62 SmaI) and differences in the reiteration regions in the genome (1, 26, 27, 29, 51). A primary goal of these studies was to distinguish between circulating strains and the attenuated Oka vaccine strain, and rapid methods for differentiation of wild-type and vaccine strains have been developed based on these studies (53). In recent years, the expanding knowledge of VZV genetics has led to a number of groups proposing sequence-based typing schemes. These areas have both been the focus of recent proposals for a common worldwide VZV typing scheme, revealing evidence for geographical segregation of isolates. The development of a robust VZV genotyping scheme is useful from a molecular epidemiology perspective (linking cases/outbreaks within and between geographic regions), and phylogenetic analysis can also provide insights into VZV molecular evolution.
The increase in sequencing data available from geographically diverse isolates has enabled the identification of a large number of single-nucleotide polymorphisms (SNPs) spread across the genome, and strengthened our knowledge of the phylogenetic relationships between circulating VZV isolates. Studies on genetic variation and classification into different subgroups have been described for all other human herpesviruses, including HSV-1 (39) and HSV-2 (45), Epstein-Barr virus (47), cytomegalovirus (6, 7), and human herpesviruses 6 (9), 7 (14), and 8 (34), although none of these studies have included multiple complete genomic sequences of genetically unrelated strains in their analyses.
One such genotyping scheme for VZV was proposed by Faga et al. (13) and Wagenaar et al. (57), which amplified and sequenced 6 genes, gB (ORF 31), gE (ORF 68), gH (ORF 37), gI (ORF 67), gL (ORF 60), and IE62 (ORF 62), encompassing nearly 13 kb of the genome. Strains were then placed in a phylogenetic tree to determine relationships, with 4 major clades identified by this glycoprotein/IE62 scheme. The arbitrarily designated clade A is comprised primarily of European/North American isolates and includes the prototype VZV-Dumas, clade B is an Asian (primarily Japanese) cluster and includes VZV-Oka, clade C is an Asian-like cluster that shares some characteristics with European/North American strains, and clade D is also a European/North American cluster.
Barrett-Muir et al. (3, 4) and Quinlivan et al. (43) used heteroduplex mobility assays to study VZV isolates from the United Kingdom, Africa, Asia, and Brazil to locate a number of informative SNPs across the genome (in ORFs 1, 21, 50, and 54). Using this scattered SNP scheme, strains were assigned a genotype based on shared alleles into one of four major groups. Genotype A1/A2 strains are ubiquitous in Africa, Asia, and the Far East, genotypes B and C (containing VZV-Dumas) are chiefly European but have also been described in North America and Brazil, and genotypes J1/J2 are Japanese isolates (including VZV-Oka) that differ from genotype A strains at 2 SNPs. A third strategy proposed by Loparev et al. (30) involved sequencing a 447-bp portion of ORF 22 and, based on information from a limited number of variable SNPs, assigning strains one of three genotypes, either genotype E (European), genotype J (Japanese), or genotype M (mosaic). Genotype M is comprised of a mixture of E- and J-like alleles, and a number of variants have been observed, allowing further subclassification of strains as M1 or M2.
To our knowledge, no study has examined the relationships between VZV genotypes at a full-genome level; indeed, no study with such breadth of coverage has been carried out for any individual herpesvirus to our knowledge. In this study, we sequenced 11 full VZV genomes, bringing the number of complete genomic sequences publicly available to 18, encompassing each of the distinct VZV clades from Wagenaar et al. (57). With this newfound information, it is the goal of this study to compare and incorporate results of previous genotyping studies and formulate a consolidated VZV genotyping scheme, one which is able to distinguish between isolates from different clades as well as within an individual clade. As an important consequence of this genomic analysis, we also present evidence for recombination having occurred between the major circulating VZV clades, as Canadian strain VZV-8 from clade C contains a mixture of characteristics from European/North American clade A and Japanese clade B. Recombination events have already been demonstrated between various vaccine and wild-type pseudorabies virus (PRV, suid herpesvirus 1) strains replicating in pigs and other animals (16, 21, 22, 24). If recombination events occur during VZV infection in a frequency sufficient that they can be documented by sequencing less than 20 genomes, this observation would challenge current concepts about the inherent stability of the VZV genome. Also, this observation may have implications for immunization strategies with live attenuated human herpesvirus vaccines.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Strains used in the complete genome alignment and phylogenetic analysis
|
Several of the clinical isolates were found to replicate with very low titers, and as a result, it was difficult to obtain sufficient amounts of pure VZV DNA for the above approaches to be successful without resorting to extremely large scale virus production. In these cases, whole-genome amplification was carried out using the GenomiPhi DNA amplification kit (Amersham Biosciences/GE Healthcare, Baie d'Urfé, Québec, Canada) prior to construction of the shotgun libraries. This amplification approach is based on an isothermal amplification of the starting material using random primers with a strand displacing DNA polymerase and can generate microgram quantities of DNA from nanogram levels of input template. The initial assessment of libraries constructed using DNA amplified in this manner indicated that the terminal regions (approximately 5 to 7 kb from the ends of the genome) as well as segments of the internal repeat and reiteration regions were severely underrepresented. These effects are known problems or deficiencies with this approach. To overcome this, two sets of linkers were utilized, one for EcoRI and one for NgoMIV-SphI, and enzymes were chosen based on their cleavage patterns in the Dumas sequence and intended to either facilitate recovery of terminal sequences or otherwise split up the underrepresented regions into smaller segments that would potentially be amplified more efficiently. These linkers were then ligated onto blunt-ended nucleocapsid DNA. The resulting DNA was digested with the appropriate enzyme, and the fragments were subjected to conditions favoring intramolecular ligation. These circularized fragments, along with undigested nucleocapsid DNA, were used as templates in the GenomiPhi amplification prior to shearing and shotgun cloning.
Prior to sequencing, plasmid DNA was prepared using Wizard SV96 kits (Promega, Madison, Wis.), and sequencing reactions were carried out using either Big Dye v3.1 (Applied Biosystems, Inc., Foster City, CA) or BigDye containing enhancer A (Invitrogen Canada, Inc., Burlington, Ontario, Canada). Sequencing reactions were run on ABI3100 or ABI3730 DNA/genetic analyzers, and the data were assembled using SeqmanII (DNAStar, Madison, WI) or the Staden Package (49). The sequence data were assembled against the reference sequence VZV-Dumas, and all polymorphisms identified were confirmed as authentic by being present in two or more independent/overlapping clones. In cases where regions were identified as being variable within a strain and/or multiple clones were not available, the sequences were amplified, the PCR products were cloned, and several independent clones were sequenced to arrive at a consensus sequence for that region. Because of the nature of the shotgun approach, it was not always possible to specifically identify sequences originating from the inverted repeats as belonging to either the internal or terminal repeat. As a result, the data for this region were assembled en masse. Sufficient sequence data were obtained which spanned into the unique regions flanking both inverted repeats so that the boundaries of the repeat unit could be identified. The sequence for this region was then transposed and inserted into the sequence data to arrive at the complete sequence.
In addition, PCR primers were also designed to generate amplicons spanning the junction of the genome during its replicative phase (ORF 0 to ORF 62). The terminal sequences were further characterized using a technique commonly employed to obtain cDNA ends (rapid amplification of cDNA ends). Nucleocapsid DNA was blunt ended with T4 and Klenow DNA polymerases, a custom adaptor was ligated on, and PCR was conducted using adaptor-specific and VZV-specific primers targeting either end. In all cases where direct sequencing of the PCR products indicated that multiple sequences were present, these amplicons were cloned and several independent clones were subsequently sequenced. All modifying and restriction enzymes used in this study were purchased from New England Biolabs (Pickering, Ontario, Canada).
Sequence analysis. In addition to the VZV genome sequences derived in the current study, the analysis included the sequences of the prototype strain VZV-Dumas (10), VZV-Oka, parental and vaccine strains (17), VZV-MSP and VZV-BC (19), and the sequences from the commercially produced VZV vaccine strains VarilRix and VariVax (56). The Virus Bioinformatics Resource at http://www.virology.ca (12) was used extensively in the analysis of the data. The annotated genomes were entered into the Viral Orthologous Clusters module to analyze the gene products, while the aligned genomes were analyzed in the Base-By-Base (BBB) module to identify polymorphic sites and amino acid changes. The typing strategy proposed by Wagenaar et al. (subsequently referred to as the glycoprotein/IE62 method) (57), was used to evaluate the phylogenetic relationship of these strains, and the data for strains described previously (13, 57) were included to provide a background population for comparison. Alignments and phylogenetic analyses were conducted using MEGA v3.1 (25) and ClustalW, employing the Kimura two-parameter model with complete deletion of gaps or missing data. All strains were also examined using the typing strategies of Loparev et al. (subsequently referred to as the ORF 22 method) (30) and Barrett-Muir et al. (subsequently referred to as the scattered SNP method) (4), using the descriptive locations identified to assign genotypes. SimPlot (version 2.5; distributed by the author, Stuart C. Ray, Division of Infectious Diseases, John Hopkins University School of Medicine) in conjunction with PHYLIP (Phylogeny Inference Package, version 3.6; distributed by the author, J. Felsenstein, Department of Genome Sciences, University of Washington, Seattle) was used in the study of potential recombination events.
Nucleotide sequence accession numbers. The genomic sequences for the newly sequenced isolates described by this work were deposited in GenBank under accession numbers DQ479953 to DQ479963.
|
|
|---|
|
View this table: [in a new window] |
TABLE 2. Differentiation of VZV strains based on currently accepted typing schemes
|
The genotypes that have been described by groups previously that do not have any fully sequenced representatives in this study were genotype M1 from the ORF 22 study (30) and genotype A from the scattered SNP method (4). These both appear to be prevalent in Africa and India and may represent a fifth distinct genotype. For convention's sake, the glycoprotein/IE62 designations for genotyping VZV strains have been retained in describing strains herein (57). These were chosen due to the greater amount of sequence examined per isolate and its ability to distinguish between each of the isolates examined, including those in the same clade. All coordinates used herein refer to the equivalent nucleotide in VZV-Dumas.
Phylogeography and clades. Based on the hypothesis that VZV has coevolved with humankind, clades representative of different geographic regions should be evident (32). In examining the full-genome alignment, some general observations regarding the clades can be made. Clade A isolates included the prototype VZV-Dumas, with other members of the clade displaying a 99.93 to 99.97% identity to Dumas at the nucleotide level, discounting insertions and deletions, which corresponded to 37 to 43 total site differences (not including VZV-32 P72 with 65). Clade B in the full-genome alignment had only pOka as a wild-type isolate, with a 99.83% identity to Dumas at the nucleotide level consisting of 188 total site differences. Clade C was only represented by VZV-8 in the alignment, which may not be fully representative of the clade as a whole; regardless, VZV-8 had 165 total site differences compared to Dumas, showing a 99.85% identity at the nucleotide level. Clade D strains, all Canadian clinical isolates in the alignment, ranged from 134 to 146 total site differences from Dumas for the three strains, showing a 99.87 to 99.88% identity at the nucleotide level. The similarities suggest that clades A and B were the two most divergent clades studied. A more in-depth characterization of sequence variations between the isolates examined in this full-genome alignment is being prepared as a separate study.
The phylogenetic relationship of all 18 strains based on the full-genome alignment is presented in Fig. 1A. The resulting dendrogram revealed a distinct clustering of the strains as anticipated; however, the placement of VZV-8 in relation to the Japanese isolates was somewhat surprising, as this strain was isolated from a patient in New Brunswick, Canada. The epidemiological information on this isolate was limited to the source being a 73-year-old zoster patient. To determine if this placement was the result of some unique feature found in this isolate, all of the strains were further assessed using the Wagenaar et al. glycoprotein/IE62 scheme (57). As can be seen in Fig. 1B, the clustering of the strains was similar to what was obtained based on the full-genome comparison. Using this typing scheme, the majority of our strains fell within clades A and D, indicative of European/North American origin.
![]() View larger version (27K): [in a new window] |
FIG. 1. Phylogenetic trees of VZV strains based on full-genome sequence (A), aligned sequences of five glycoprotein genes and IE62 (13, 57) (B), and origin of replication region (C). Strains in boldface type are those represented in the full-genome alignment, while strains in italics were taken from previous studies.
|
Only by full-genome sequencing can unpredicted variation be uncovered. At least 17 variable locations have been identified as a result of insertions/deletions and SNPs in the VZV origin-associated region, not including variable copy numbers of the TA and GA subunits in the actual ORI; these variable positions are summarized in Fig. 2. For the newly sequenced isolates, the region was an exact duplicate between the two inverted repeats, but variation existed between the two regions in two sequences, VarilRix and VariVax. It is known that the vaccine preparations are a mixture of distinct genetic subtypes (42, 56). As a result, the sequences reported for the three vaccine strains would likely not fully encompass the genetic variability that would be present in the mixed vaccine batches. Certain portions of the genome have been identified as variable within an isolate, for example, homopolymer stretches and the GA/TA subunits within the ORI, but little information is available on potential sequence differences between the IRS/TRS inverted repeats. The sequencing strategies used in this study did not allow for the distinction of sequences covering the inverted repeats as originating from IRS or TRS.
![]() View larger version (12K): [in a new window] |
FIG. 2. Variable locations within the VZV origin-associated region located between reiteration region R4 and ORF 63/70. As there was noted variability between the copy numbers of TA and GA subunits within the origin of replication between virus sequences within a clade, the differences in copy numbers are not shown. Dumas Location refers to the coordinates of the features in the VZV-Dumas IRS copy of the origin of replication; these locations are duplicated in TRS. The nucleotide at each of these variable locations is listed below the coordinates for the four clades examined. Poly-G, Poly-A, and Poly-C refer to homopolymer stretches that have displayed variability in length both between and within clades. A dash () indicates that the clade does not have a base at this location due to an insertion or deletion event. The bases listed for clade C are based on the sequence of VZV-8 only. *, VZV-11 from clade D was the only strain reported to contain a G at location 109994.
|
|
View this table: [in a new window] |
TABLE 3. Clade-specific SNPs based on the full-genome alignmenta
|
In addition to the 50 clade D-specific SNPs described in Table 3, 7 shared SNPs restricted to clade D strains have been located, all shared between VZV-22 and VZV-03-500. The 45 polymorphic sites identified in a single clade D isolate are not summarized but may be of eventual use in further classifying clade D isolates. If a greater number of clade D strains were sequenced, there would undoubtedly be more shared SNPs specific to clade D. As the full-genome alignment contained only one strain classified as clade C (VZV-8) and one wild-type clade B strain (VZV-pOka), there was limited information available on potential SNPs that may be localized to these clades. Further sequencing of isolates from these clades should refine the list of SNPs and allow for better differentiation between subgroups within a clade.
The nucleotide identity between strains from clades A and B suggested that these were the two most divergent clades in this study. In the genomic alignment, all polymorphisms shared by clade A and clade B strains were also shared with either clade C or D isolates. No SNPs were located that were shared by clades A and B alone, confirming that these two groups were the most divergent of the clades examined. Interestingly, the 50 clade D-specific SNPs in the full-genome alignment were clustered in three portions of the genome, with 24 located from ORFs 0 to 22, 16 spread between ORFs 35 to 50, and 10 spread between ORFs 62 to 71 in the inverted repeats. This was a relatively constant rate of approximately 1 SNP/1.4 kb in these particular regions. The intervening regions in which no SNPs specific to clade D have been located (ORFs 23 to 34 and 51 to 61) do contain SNPs shared by all clade D strains, but in all instances, these polymorphisms were shared with clade B and, in some cases, with clade C as well.
Evidence for recombination in strain VZV-8. When the dendrogram from the full-genome alignment is examined (Fig. 1A), VZV-8 was found to be a distinct outlier on the branch containing the Japanese isolates. By using the glyocoprotein/IE62 typing scheme, VZV-8 was found to loosely cluster with a group of isolates from Singapore in clade C, which is distinct from clade B harboring the Japanese strains (Fig. 1B). To better understand the factors underlying this divergence, a condensed full-genome alignment in which only the variable sites were represented was created (Fig. 3A). In this alignment, singleton SNPs (polymorphisms occurring in a single isolate) and other variations that were deemed to be uninformative (reiteration regions, homopolymer stretches, and insertions/deletions) were removed.
![]() View larger version (28K): [in a new window] |
FIG. 3. Evidence for recombination in VZV. (A) Alignment of variable sites. The full-genome alignment was condensed to only the variable sites by removing the high-passage isolates derived from a single strain (i.e., 32p22, 32p72, vOka, VarilRix, and VariVax) as well as by removing noninformative variable sites, such as singletons and other strain-specific variations. The Dumas coordinates of the variable sites are indicated above the alignment to provide a relative reference. The boxed regions indicate the segments proposed to be involved in the recombination events and correspond to Dumas coordinates bp 20656 to 25067 for the first segment and bp 38714 to 44835 for the second. Positions in these regions which are conserved between the clade B and D strains are highlighted in red, and the locations in which clade A strains are proposed to have developed mutations after the recombination event are highlighted in yellow. (B) SimPlot analysis of the variable site alignment. Analysis was conducted by grouping the strains based on their phylogenetic designations. Clade A was selected as the query group using a 20-bp window and a 10-bp step value. Similarities to clade D are indicated in green, those to clade C are indicated in blue, and those to clade B are indicated in red. (C) Phylogenetic analysis of the recombination region. The segment of the full-genome alignment between Dumas coordinates bp 13000 to 52000 was assessed using the neighbor-joining algorithm with 500 bootstrap replicates (bootstrap values are indicated).
|
To test the recombination hypothesis, consensus sequences from clades A and D along with the pOka (clade B) and VZV-8 (clade C) sequences from the modified alignment were compared using the Bootscan function of SimPlot. The software works in conjunction with the phylogenetic analysis suite PHYLIP and plots the results of successive bootstrap analyses on a group of sequences by moving a sliding window across the alignment in incremental steps until the entire alignment has been assessed. SimPlot was developed to study recombination in human immunodeficiency virus (28, 46), and it has been used to study recombination of other viruses (20, 59, 61), including clinical isolates of HSV-1 (39). The SimPlot Bootscan analysis, presented as Fig. 3B, clearly indicated a shift in similarity between the sequences, indicative of a recombination between clades A and C in two regions.
To discount the possibility that editing the alignment had introduced bias into the SimPlot analysis, we performed bootstrap analysis of the unedited alignment covering the region of the putative recombination events (using MEGA3.1). The results, shown in Fig. 3C, also confirm the clustering of VZV-8 with clade A in this region. The region studied, from 13 to 52 kb (ORFs 10 to 29), included a change back to the A/D and B/C homology and three variable reiteration regions (R1, R2, and R3), but the overall influence of the SNPs in this region supported the observations seen with the condensed alignment. The recombination events proposed to have occurred to shape the genetic profile of VZV-8 are presented as Fig. 4.
![]() View larger version (19K): [in a new window] |
FIG. 4. Schematic diagram depicting the recombination events proposed to have shaped the genetic profile of the Canadian clinical isolate VZV-8. Open arrows depict ORFs, and reiteration regions R2 and R3 are depicted by solid black boxes. The superimposed vertical red lines depict locations where VZV-8 shares an SNP with either the clade A or clade B progenitor strain. SNPs which are specific to a single clade, as indicated in Fig. 3A, are not shown.
|
|
|
|---|
Each of the previously reported SNP/sequence-oriented typing schemes (4, 13, 30, 57) has its advantages and its drawbacks. The ORF 22 method is unable to distinguish between the European/North American clades A and D, with both classified as genotype E, but is advantageous in ease of use, requiring sequencing of a single 447-bp amplicon to genotype a strain. The scattered SNP method requires sequencing of 4 different ORFs to assign a genotype, which can be somewhat unclear, i.e., VZV-8 and pOka are both classified as genotype J1 but are clearly distinct based on the full-genome alignment, while vOka, VarilRix, and VariVax, clearly related to pOka, are classified as genotype J2. The scattered SNP method is advantageous in that, by studying a large number of geographically separated strains, it is able to identify useful markers for subtyping genotypes A and J in outbreaks. The glycoprotein/IE62 scheme requires the most sequencing of the three, examining 6 genes and nearly 13 kb of the genome. It suffers due to studying a relatively limited number of isolates for comparison between clades, with a focus on laboratory strains. The benefits of this scheme are that, by examining a larger amount of the genome, it is better able to distinguish between closely related isolates within a clade and provides a clearer view of the phylogenetic relationships between strains.
Fifty SNPs specific for clade D have been identified from the full-genome alignment in conjunction with data from previously described strains. It is unclear at this time whether the clustering of clade D SNPs is a result of divergent evolution, recombination between heretofore uncharacterized progenitor strains, or evolutionary artifacts due to selective pressure. Based on our observations, it does not appear as though the intervening regions are less variable on the whole in VZV when the overall polymorphic sites by ORF are examined, though different portions of clade D genomes have likely evolved via different mechanisms, similar to the case in VZV-8.
Variation in the VZV origin of replication. The variability in the VZV origin of replication was the single most striking observation because of the lack of precedence in HSV sequence analyses. Although others have noted variation in the VZV origin of replication both within and between isolates (17, 19, 50), the extent of the variability observed in the full-genome alignment was unexpected. We have located a number of clade-specific SNPs in the region between R4 and ORFs 63/70 encompassing the ORI which are sufficient to distinguish between isolates from each of the clades studied (Fig. 2). When the phylogenetic tree created using this region (Fig. 3C) is compared to that from the full-genome alignment, it is observed that all strains are classified in a similar manner. With the amount of variation observed between isolates representing the four clades characterized by Wagenaar et al. (57), it is likely that the origin region of any previously uncharacterized genotypes, i.e., African and Indian isolates, would exhibit unique profiles relative to currently classified clades.
An origin-based typing scheme appears to be a useful typing mechanism, but we acknowledge that there are some potential drawbacks to such a method that may limit its ease of use. Aside from the technical issues inherent with amplifying and sequencing a portion of the genome that contains variable homopolymer stretches and a variable number of subunit copies in the actual origin of replication, the region is duplicated in VZV, increasing the potential for variation. At the current time, a limited number of strains have been studied, with the majority of North American isolates classified as clade A. A greater number of clade B and C representative strains would enhance the analysis by refining the number of clade-specific SNPs, along with providing better background data for the construction of phylogenetic trees.
Nevertheless, the cumulative data from this report and earlier genomic analyses described herein have clearly established that there are VZV clades found predominantly in humans of European ancestry (including immigrants to North America) and humans of Asian ancestry. In turn, this observation in more contemporary times extends the hypothesis of McGeoch and coworkers that all mammalian herpesviruses have coevolved with their hosts through geologic times dating back to millions of years ago (31-33). In other words, our data are compatible with the hypothesis that, as humankind migrated out of Africa, not only did humans undergo divergence in Asia and Europe but also their accompanying VZV strains.
Recombination in the alphaherpesviruses. As the alphaherpesvirus family is so genetically stable with very low rates of nucleotide substitution, recombination may play a crucial role in the evolution of different branches of the phylogenetic tree (32). Interspecific recombinant viruses rarely survive in the natural environment, and the mechanism is inefficient relative to intraspecific recombination (35, 48). However, a natural interspecies recombinant virus between equine herpesvirus 1 (EHV-1) and EHV-4 has recently been reported (40), the first such naturally occurring recombinant described in the alphaherpesviruses. The recombination described was in the ICP4 gene, homologue to IE62. No recombination has been documented in this region of VZV, but it is interesting that one end of the recombination region between EHV-1 and EHV-4 is composed of reiterated 12-bp and 15-bp subunits. This seemingly suggests that reiteration regions can play a role in recombination events between viruses, and the proximity of R2 and R3 to the recombination in VZV-8 may be significant. Interspecific recombinants have been previously generated in vivo between HSV-1 and HSV-2 (60) as well as between bovine herpesvirus 1 (BoHV-1) and BoHV-5 (35).
Intraspecific homologous recombination has been noted both in vitro and in vivo for a number of alphaherpesviruses, including HSV-1 and HSV-2 (23, 38, 55), PRV (16, 21, 22, 24), BoHV-1 (48), and feline herpesvirus 1 (15). In HSV-1, recombination has played an important role in the evolution of genotypes. It has been noted that most full-length genomes consist of a mosaic pattern of fragments from different genotypes, with a number of isolates containing putative recombination points that are located within a gene (39). The BoHV-1 in vivo recombination studies involving coinfection of calves by a natural route of infection have indicated that recombination occurs as frequently in vivo as it does in vitro and that resulting recombinants are persistent and stable after reactivation from latency (48). Putative wild-type intraspecific recombinants have been described for HSV-1 (5, 39), PRV (8), Epstein-Barr virus (36, 58), and human herpesvirus 8 (41), demonstrating the commonality of recombination events of herpesviruses in nature.
Recombination events in the evolutionary history of VZV have likely been underestimated due to the relative genetic stability of worldwide genotypes, and full-genome sequencing discriminating between widely spaced SNPs is the only definitive method of identifying recombinant viruses. The most important prerequisite for recombination to occur is coinfection of a host by the different progenitor strains, with a high degree of genetic similarity between these strains another important factor in homologous recombination (52). The remarkably stable nature of VZV genomes in general suggests that sufficient homology between two strains would not be a hindrance in regard to recombination.
Recombination in VZV is not a new phenomenon. An early study by Dohner et al. (11) noted putative in vitro recombination between viral strains in coinfected cells by mixing two viruses with unique restriction profiles, resulting in hybrid plaque isolates containing neither parental virus in its original form. These hybrids appeared to show differing patterns of recombination, including complete crossover recombination and a partial recombination in which a segment of one parent was inserted into the other. Although the potential recombination regions did not coincide with the regions of VZV-8 associated with recombination, such an event can occur at any point which shows sufficient identity between genomes (52). Additionally, a probable recombinant Brazilian VZV isolate as well as two potential recombinant isolates from the United Kingdom have been described previously (4, 37). The Brazilian genotype A/C recombinant virus matches the profile for genotype C (clade A) throughout the loci studied in ORFs 1 and 21, but at informative sites studied in ORFs 50 and 54, it was found to contain the genotype A (African/Asian clade) markers. It is possible that a single recombination event covering ORFs 50 to 54 has resulted in the mosaic profile observed, but the divergence from its putative progenitor clades may indicate that other factors have played a role in creating its genetic profile.
Recently, the analysis of the complete sequence of herpesvirus papio 2 (54), a simian alphaherpesvirus similar to HSV, has shown evidence of recombination between progenitor viruses. Interestingly, the regions involved in the putative recombination contain genes homologous to those involved in the recombination events described for VZV-8 in this work, consisting of the UL41 to UL44 genes (homologous to VZV ORFs 14 to 17) and a localized portion of the UL36 gene (homologous to VZV ORF 22).
The observations regarding VZV-8 seemingly indicate that there has been at least one recombination event during the evolution of the virus. Such an event is not likely to have been a recent occurrence, as the putative recombination regions have undergone evolution in developing a number of polymorphisms distinguishing them from their progenitor strains. Since VZV-8 shares a number of characteristics with VZV-pOka, it may represent an Asian-like lineage that was involved in recombination with a clade A progenitor that has led to two segments of the genome having a clade A-like profile. Reciprocal (homologous) recombination among precursor viruses during the evolution of the clades is a potential explanation for the observed SNPs. An alternative hypothesis for the SNP clustering could be that certain polymorphisms have arisen independently in a number of different lineages of VZV, but based on the similarity between the different clades and the number of sites involved, this possibility seems unlikely. Further genomic analyses will clarify these issues and likely provide greater insight into the phenomenon of recombination in VZV. If recombination can occur between wild-type isolates, recombination presumably could occur also between wild-type and vaccine strains. Thus, recombination events may need to be considered in strategies to immunize large populations with live attenuated varicella vaccines as well as other herpes vaccines.
We thank the DNA Core Facility, National Microbiology Laboratory, for oligonucleotide synthesis and DNA sequencing, which was invaluable for the completion of this project. We also thank Richard Garceau and Kevin Fonseca for providing strains used in the sequencing and Vasily Tcherepanov with the Virus Bioinformatics Resource (TVBR) at the University of Victoria, Victoria, BC, Canada, for bioinformatics assistance.
Supplemental material for this article may be found at http://jvi.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»