Previous Article | Next Article ![]()
Journal of Virology, April 2008, p. 3509-3516, Vol. 82, No. 7
0022-538X/08/$08.00+0 doi:10.1128/JVI.02646-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.

P14 Molekulare Genetik und Epidemiologie von Herpesviren, Robert Koch-Institut, D-13353 Berlin, Germany,1 Wildlife and Emerging Diseases Section, Centre for Tropical Veterinary Medicine, Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Roslin, Midlothian EH25 9RG, United Kingdom,2 Laboratoire Interactions Hôtes-Virus, Institut Pasteur de la Guyane, B.P. 6010, 97306 Cayenne, French Guiana,3 Centre for Fish and Wildlife Health, Institute of Animal Pathology, Vetsuisse Faculty, University of Berne, CH-3001 Berne, Switzerland,4 Friedrich-Loeffler-Institut, Bundesforschungsinstitut für Tiergesundheit, Institut für Neue und Neuartige Tierseuchenerreger, D-17493 Greifswald-Insel Riems, Germany,5 Medical Research Council Virology Unit, Institute of Virology, University of Glasgow, Church Street, Glasgow G11 5JR, United Kingdom6
Received 13 December 2007/ Accepted 15 January 2008
|
|
|---|
|
|
|---|
Despite the tremendous accumulation of knowledge on the existence of hitherto unknown herpesviruses, only limited sequence information (i.e., a few hundred base pairs) became available in most of the cases. This information is sufficient to assess whether a virus is already known or novel and allows for assignment to a herpesvirus subfamily. However, a more precise phylogenetic analysis is often not possible, and more extensive sequence data are therefore desirable.
In the present study, we wanted to further extend insight into gammaherpesvirus (GHV) evolution by analyzing mammalian hosts from different orders and various geographic sites, either living in the wild or held in captivity. Members of mammalian orders for which no GHV was known were of particular interest. For the detection of unknown herpesviruses, we envisaged a bigenic PCR approach targeting two conserved genes, the DPOL and the glycoprotein B (gB) genes, separately. The advantage was that these genes are adjacent in GHV genomes, and long-distance PCR (LD-PCR) could be used to close the gap between the generated gB and DPOL sequence information, resulting in a final sequence of approximately 3.4 kbp. This bigenic approach had been used already for the amplification of novel betaherpesviruses and GHVs of primates, artiodactyls, bats, and rodents (3, 11, 25, 33). Here, we used it for a broad analysis of mammalian Gammaherpesvirinae and discovered novel GHVs in eight different host species from six mammalian orders.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Mammalian species and novel GHVs
|
For the amplification of the gB genes of as-yet-unknown GHVs, two degenerate, deoxyinosine-containing primer sets (GH1 and GH2) (Table 2) were used. The primer sequences were deduced from the gB gene sequences of equine herpesvirus 2 and rhesus monkey rhadinovirus (Table 3), respectively. PCR was carried out at an annealing temperature of 46°C under the conditions described for the DPOL gene.
|
View this table: [in a new window] |
TABLE 2. Primers for amplification of the gB gene
|
|
View this table: [in a new window] |
TABLE 3. Viruses, abbreviations, and accession numbers
|
Phylogenetic analyses. Analyses were carried out at the level of encoded amino acid sequences. Sets of amino acid sequences for DPOL and gB from partial gene sequences were aligned using ClustalW (30). Positions in alignments that had missing characters in any sequence and regions considered too divergent for justified aligning were removed before the alignments were used for phylogenetic analyses.
Measures of amino acid sequence divergence used the matrix of Jones et al. (15). The PHYLIP package (12) was applied in a preliminary evaluation of trees by the neighbor-joining (NJ) method with bootstrap values (1,000 replicates). The PAML package (version 3.15) (34) was employed in making maximum-likelihood (ML) evaluations of amino acid sequence alignments for sets of candidate trees to examine aspects of branching patterns, with a discrete gamma distribution of five classes of substitution rates across sites and with the scoring of output trees by the RELL method (16).
Trees were also derived from alignments by Bayesian analysis using Monte Carlo Markov chains (BMCMC) with MrBayes version 3.1 (26). DPOL and gB sequences were treated in separate partitions with independent parameters, the rate model specified a discrete gamma distribution of four classes of substitution rates plus a class of invariant sites, and runs were for one million generations with sampling every 100th generation. The program's defaults were used for other parameters, including the specification of priors and two independent runs of one cold and three heated chains. At least 5,001 (out of 10,001) samples were discarded as burn-in.
Provisional nomenclature and abbreviations. The viruses from which the de novo detected sequences originated were named after the host species and the herpesvirus subfamily to which the virus was tentatively assigned, for example, Sorex araneus gammaherpesvirus 1. They are listed with their abbreviations and corresponding GenBank accession numbers in Table 3, together with the names of the previously described viruses and the accession numbers of the sequences that were analyzed for comparison.
Nucleotide sequence accession numbers. The nucleotide sequences of the novel herpesviruses have been deposited in GenBank under the following accession numbers: Crocuta crocuta gammaherpesvirus 1, DQ789371; Diceros bicornis gammaherpesvirus 1, AY197560; Hexaprotodon liberiensis gammaherpesvirus 1, AY197559; Panthera leo gammaherpesvirus 1, DQ789370; Rupicapra rupicapra gammaherpesvirus 1, DQ789369; Saimiri sciureus gammaherpesvirus 2, AY138584; Sorex araneus gammaherpesvirus 1, EU085380; and Tupaia belangeri gammaherpesvirus 1, AY197561.
|
|
|---|
Before assays for herpesviruses, the authenticity of the specimens was checked by the amplification and sequencing of roughly 0.3 kbp of the cytB gene. Species identification by means of cytB gene sequencing is a commonly used technique (20). Here, the morphological species determination was confirmed for all animals (data not shown).
To detect herpesvirus sequences, we first performed a pan-herpesvirus DPOL PCR. In blood and/or tissue samples of each mammalian species (except the gorilla and Asian elephant), a novel DPOL sequence was found (data not shown), and by comparison with known herpesvirus sequences, each sequence was determined to originate from an unknown GHV (Table 1). In the gorilla and elephant samples, sequences were found which had been published recently by others (18, 32).
For the detection of the gB gene of each novel GHV, two degenerate PCR sets were used (sets GH1 and GH2) (Table 2). With either the GH1 set or the GH2 set, gB sequences from all GHV-positive specimens were amplified (data not shown). The primer sets GH1 and GH2 were also used to amplify gB sequences of four GHVs for which we had already published short DPOL sequences (Table 1). These GHVs originated from the following hosts: Equus zebra and Tapirus terrestris (order Perissodactyla) (7) and Sus barbatus and Babyrousa babyrussa (order Artiodactyla) (10).
In total, partial gB and DPOL sequences for 14 GHVs were now available. To confirm that the two members of each of the 14 gB-DPOL sequence pairs did in fact originate from the same virus genome, we chose specific nested primers for each virus, with the sense primers located in the gB gene and the antisense primers located in the DPOL gene. By using the TaKaRa-Ex PCR kit, fragments of 2.7 to 3 kbp from all specimens were obtained and sequenced by primer walking (primers are not listed). The sequences of these fragments overlapped the initial gB and DPOL sequences without mismatches. Finally, contiguous sequences of about 3.4 kbp for all 14 GHVs were successfully compiled. The sequences were subjected to open reading frame and subsequent BLAST analyses. Partial open reading frames of gB and DPOL genes in each sequence were identified (Table 1).
The 14 GHVs were detected in specimens from hosts from seven orders (Primates, Proboscidea, Artiodactyla, Perissodactyla, Carnivora, Scandentia, and Eulipotyphla), and 8 GHVs were previously unknown. The gB genes of five novel GHVs (EmaxGHV-1, Tupaia belangeri gammaherpesvirus 1, Sorex araneus gammaherpesvirus 1, Panthera leo gammaherpesvirus 1, and Hexaprotodon liberiensis gammaherpesvirus 1) were only roughly 60% or less identical to those GHV genes which were used for the design of the degenerate gB primers. These findings demonstrate for the first time that a limited number of degenerate gB primers is sufficient for the universal detection of mammalian GHVs and that the primers can be designed on the basis of a small number of GHV sequences. In particular, the primer set GH1 (primers 2759 to 2762) was a powerful tool in the present study, as well as in several earlier investigations (10, 25, 33).
In previous studies using the same techniques of gB and DPOL gene amplification, LD-PCR was sometimes not successful (11, 33). This failure may have resulted from the reduced sensitivity of LD-PCR compared to that of a short-fragment standard PCR, sometimes causing a negative LD-PCR result for specimens with low virus copy numbers. In addition, a negative LD-PCR result can be attributed to the fact that a gB sequence and a DPOL sequence, although amplified from the same specimen, may not originate from the same virus genome, because individuals are often (latently) infected with more than one herpesvirus. Such complex situations, i.e., infections of single animals with similar GHVs, were observed during a previous analysis of multivirus-infected chimpanzees and macaques. In that study, oligonucleotides with locked nucleic acid substitutions were used to specifically inhibit the PCR amplification of a GHV sequence, thereby enabling the amplification of a second, different GHV sequence from the same specimen with the same degenerate primers (25). In the present study, the nested LD-PCR was successful for all gB-DPOL sequence pairs analyzed. This may indicate that additional GHVs were not present in these samples.
Phylogenetic relationships in the Gammaherpesvirinae. Novel sequence data were available for the 14 viruses described above plus 31 other GHVs for which information has been reported previously in the literature. Alignments of partial sequences of DPOL and gB, which contained 664 and 285 amino acid residues, respectively, after the removal of loci with gaps and unalignable regions, were made for these 45 species. Based on previous experience (21, 22), we regarded this data set to be of adequate size to resolve most details of GHV phylogeny. In order to locate the GHV root position, a second pair of alignments was made that additionally contained data for 17 alphaherpesviruses and 12 betaherpesviruses. Final DPOL and gB lengths for this pair were 523 and 238 residues, respectively. DPOL and gB alignments were concatenated for phylogenetic analyses. Phylogenetic trees were derived by two separate routes: first by the NJ method, followed by ML analysis to examine particular aspects, and second by BMCMC.
The NJ method with bootstrap values was applied to define a preliminary tree. In conjunction with previous work (23), this approach served to identify 11 clades with good confidence (Table 4). Three of these (clades 1, 3, and 6) correspond to genera currently defined (Lymphocryptovirus) or awaiting approval by the International Committee on Taxonomy of Viruses (Macavirus and Percavirus), and four additional clades (7 to 10) contain representatives of the genus Rhadinovirus. The remaining four clades (2, 4, 5, and 11) apparently do not belong to any of the currently defined genera. Specific aspects of branching patterns, concerning mostly deep branches, were then pursued by ML analysis as described previously (23). This process gave a top-scoring tree, together with runner-up variants that indicated uncertainty about certain branching details. Separately, the BMCMC method was employed and gave a single tree with high posterior probabilities associated with all branches. The NJ-plus-ML approach and the BMCMC approach both yielded the same top-scoring tree, and this tree is shown in Fig. 1A. It is known that BMCMC can yield overpositive results (1, 6, 29), and we have dealt with this possibility by producing an alternative tree (shown in Fig. 1B) in which four loci where the ML analyses gave uncertain branching assignments have been collapsed into multifurcations. We regard the latter tree as the more conservative, secure interpretation of the analyses. Two of the unresolved branching loci, in the Macavirus (alcelaphine herpesvirus 1)-like clade (clade 3) and in the HHV8-like clade (clade 8), we consider to reflect simply the relatively short alignment length available plus the closeness of the species involved and to be likely to be fully resolvable with more input data. The other two are deeper in the tree and may represent fundamental properties of the phylogeny.
|
View this table: [in a new window] |
TABLE 4. Major clades of GHVs
|
![]() View larger version (15K): [in a new window] |
FIG. 1. Phylogenetic trees for the Gammaherpesvirinae. Panel A shows the phylogenetic tree as obtained by BMCMC (equivalent to the top-scoring tree from the NJ-ML analysis). The root position was determined by analyses using alpha- and betaherpesvirus sequences to provide an out-group. Three closely related viruses (rhesus monkey rhadinovirus 2695, Macaca fascicularis rhadinovirus 1 [MfasRHV-1], and MfasRHV-2) are shown together as rhesus monkey rhadinovirus (RRV) isolates. The major lineages are indicated with numbers 1 to 11. Panel B shows the same tree collapsed to form multifurcations at four locations of uncertainty in the NJ-ML analysis. The divergence scale at the bottom, indicating the number of substitutions per site, applies to both panels. EBV, Epstein-Barr virus; RLV, rhesus monkey lymphocryptovirus; CalHV-3, callitrichine herpesvirus 3; OvHV-2, ovine herpesvirus 2; RrupGHV-1, Rupicapra rupicapra gammaherpesvirus 1; CpRHV-2, caprine rhadinovirus 2; AlHV-1, alcelaphine herpesvirus 1; PLHV-1, porcine lymphotropic herpesvirus 1; HlibGHV-1, Hexaprotodon liberiensis gammaherpesvirus 1; McerRHV-1, Mus cervicolor rhadinovirus 1; BsavRHV-1, Bandicota savilei rhadinovirus 1; CcroGHV-1, Crocuta crocuta gammaherpesvirus 1; EzebGHV-1, Equus zebra gammaherpesvirus 1; EHV-2, equine herpesvirus 2; BadHV, badger herpesvirus; SaraGHV-1, Sorex araneus gammaherpesvirus 1; HVA, herpesvirus ateles; TbelGHV-1, Tupaia belangeri gammaherpesvirus 1; PtroRHV-1, Pan troglodytes rhadinovirus 1; GgorRHV-1, Gorilla gorilla rhadinovirus 1; BoHV-4, bovine herpesvirus 4; DbicGHV-1, Diceros bicornis gammaherpesvirus 1; BbabRHV-1, Babyrousa babyrussa rhadinovirus 1; SbarRHV-1, Sus barbatus rhadinovirus 1; PleoGHV-1, Panthera leo gammaherpesvirus 1; AflaRHV-1, Apodemus flavicollis rhadinovirus 1; BindRHV-4, Bandicota indica rhadinovirus 4; CglaRHV-1, Clethrionomys glareolus rhadinovirus 1; SsciGHV-2, Saimiri sciureus gammaherpesvirus 2.
|
A detailed examination of phylogenetic relationships in the Gammaherpesvirinae was previously carried out using alignments of amino acid sequences corresponding to up to 28 genes from 12 virus species (23), that is, far fewer species than were included in the present study but much longer input alignments. Figure 2A shows a summary tree from that investigation, with seven major lineages, of which four were placed in a clade whose branching details could not be confidently resolved (this clade was termed the multifurcated clade [MF clade]). At the time of that analysis, additional species for which smaller but usable sets of sequence data were available all fell into one of these seven major lineages. The present analysis assigned more species to all of these seven lineages, except for the LCV lineage (clade 1). Figure 2B shows the equivalent tree for the present analysis, with four additional major lineages (EmaxGHV-1, Mus musculus rhadinovirus 1 [MmusRHV-1] like, bat gammaherpesvirus 1 [BatGHV-1] like, and TterGHV-1 like). The branching pattern of the older tree remains unchanged, except that we now assign additional structure to what was the MF clade, with the herpesvirus saimiri (HVS)-like lineage branching first. Also in this region, we note that the previously uniquely extended terminal branch for murid herpesvirus 4 (MuHV-4) (Fig. 2A) is now unremarkable within a heterogeneity of branch end points (Fig. 1, clade 10).
![]() View larger version (9K): [in a new window] |
FIG. 2. Major lineages in the Gammaherpesvirinae. Panel A shows major features of the GHV tree derived by McGeoch et al. (23), with seven lineages. Panel B reduces the tree in Fig. 1B to 11 major lineages (equivalent to the clades in Table 4), with the mammalian orders to which the hosts of each lineage belong listed. In both panels, the extents of regions of multiple branching near tips are shown as heavy lines. Clades whose branching details could not be confidently resolved are termed MF. EHV-2, equine herpesvirus 2; BoHV-4, bovine herpesvirus 4; Perisso., Perissodactyla; Artio., Artiodactyla; Carni., Carnivora; Eulipo., Eulipotyphla.
|
In earlier analyses of Herpesviridae phylogeny, it was possible to discern within each of the subfamilies a substantial degree of congruence between the tree branching pattern and the corresponding pattern for lineages of the mammalian hosts, and this congruence was taken to indicate extensive cospeciation of herpesviruses and hosts (see reference 24). In these interpretations, the Gammaherpesvirinae presented the most complicated (and, thus, the least satisfactory) case, and the applicability of cospeciational interpretation declines further with the extensive detail now available for the GHV tree. First, among the 11 lineages defined, only those for which the host species come solely or predominantly from a single mammalian order can be of primary utility. Second, the deeper branching details of the tree prove rather unproductive for constructing any unified coevolutionary correspondence across host lineages. In particular, the two deepest distinct lineages, i.e., those of LCV and EmaxGHV-1, are not simultaneously compatible with a single cospeciational scheme, and in the MF2 clade, the unresolved nodes for major lineages do not enable any compelling interpretation. On the other hand, clear dispersed examples of cospeciation can be seen in the terminal branchings within major lineages. Most notably, these include the LCVs (clade 1), the macaviruses (clade 3), two lineages of rodent GHVs (MmusRHV-1 like and MuHV-4 like; clades 4 and 10, respectively), and the two distinct sublineages of HHV-8-like viruses (clade 8) in the MF1 clade. The Macavirus clade was previously used as a primary calibration point in importing a time scale from host divergence dates (23), with the divergence of swine and ruminant viruses dated as 63.8 million years before the present, and we regard that application as remaining valid. In summary, there are substantial indications in the GHV tree of evolution both by cospeciation with host lineages and by transfer between widely distinct hosts.
Overall, our analysis makes Gammaherpesvirinae phylogeny the most extensively characterized among those of the three subfamilies in terms of the number of viruses included and also the most complex, considering the number of distinct deep lineages and the details of relationships among them.
T.L. was supported by the Royal (Dick) School of Veterinary Studies, University of Edinburgh, and the joint National Institutes of Health/National Science Foundation Ecology of Infectious Diseases Program under grant no. NSF/DEB0225453 (field work).
Published ahead of print on 23 January 2008. ![]()
|
|
|---|
2-herpesvirus of the rhadinovirus 2 lineage in chimpanzees. Genome Res. 11:1511-1519.
-Herpesvirinae. Philos. Trans. R. Soc. Lond. B 356:421-435.This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»