Previous Article | Next Article ![]()
Journal of Virology, August 2005, p. 10487-10497, Vol. 79, No. 16
0022-538X/05/$08.00+0 doi:10.1128/JVI.79.16.10487-10497.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Zoology, University of Oxford, South Parks Road, Oxford, OX1 3PS, United Kingdom,1 Laboratoire de la Rage, Institut Pasteur, 25-28, Rue du Dr Roux, 75724 Paris, France,2 Animal Sciences Group, Wageningen University Research, P.O. Box 65, NL-8200 AB Lelystad, The Netherlands,3 Department of Virology, Danish Institute for Food and Veterinary Research, Lindholm, DK-4771 Kalvehave, Denmark4
Received 3 January 2005/ Accepted 9 May 2005
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Currently, there are seven recognized genotypes of lyssavirus defined on the basis of their genetic similarity (6, 17): rabies virus (genotype 1), Lagos bat virus (genotype 2), Mokola virus (genotype 3), Duvenhage virus (DUVV; genotype 4), European bat lyssavirus type 1 (EBLV-1; genotype 5), European bat lyssavirus type 2 (EBLV-2; genotype 6), and Australian bat lyssavirus (genotype 7). All of the genotypes except Mokola virus have bat reservoirs, hinting that lyssaviruses originated in these mammals (4). Additionally, four new lyssavirus genotypes that infect bats in central and southeast Asia have been proposed: Aravan virus (24), Khujand virus (24), Irkut virus (5), and West Caucasian bat virus (5). Genotype 1 is responsible for classical rabies in terrestrial mammals throughout the world and in bats in North and South America, as well as for most rabies-related human deaths worldwide (39). In Europe, lyssaviruses isolated from bats belong to EBLV-1 and EBLV-2 (2).
More than 600 cases of EBLV infection have been reported throughout Europe, most recently in Scotland (7). EBLV-1 infection has also been reported in a small number of terrestrial mammals, including five sheep in Denmark (32) and a stone marten in Germany (28). These animals exhibited clinical signs consistent with rabies and later died or were euthanized. In addition, four human deaths due to EBLV have been reported (15). The overall rarity of EBLV-associated mortality in terrestrial animals may be because the cross-species transmission of EBLV from bats to other mammals is infrequent, because cases go undetected, or because the virulence of EBLV is low compared to that of genotype 1 lyssavirus. Furthermore, it is possible that bats develop persistent EBLV infection, manifested as the long-term carriage of viral RNA without overt disease. In particular, some wild-caught bats have antibody to EBLV for up to 3 years, and viral RNA has been detected in the tissues of apparently healthy bats (14, 31, 36, 40), although it is also possible that bats experience an unapparent acute infection and are subject to continual reinfection.
Like other lyssaviruses, the negative-sense EBLV genome encodes five proteins: the nucleoprotein (N), phosphoprotein (P), matrix protein (M), glycoprotein (G), and RNA polymerase (L) in the order 3'-N-P-M-G-L-5'. Evolutionary studies of lyssaviruses have tended to focus on the N protein, a well-conserved structural protein, and the G protein, which contains domains responsible for host cell receptor recognition and membrane fusion and which is the major target for the host neutralizing-antibody response (3).
Although several molecular epidemiological studies of genotype 1 lyssaviruses have been undertaken, to date there has been only a single such analysis of EBLV (2). This study, which utilized the first 400 bp of the N gene, showed that EBLV-1 can be subdivided into two distinct lineages or subtypes, designated EBLV-1a and EBLV-1b. However, nothing is known about the forces shaping EBLV evolution, particularly compared to genotype 1 lyssaviruses, even though this may provide crucial insights into the propagation of EBLV in bat populations and the evolutionary interaction between the virus and its mammalian hosts. Previous studies of genotype 1 and 7 lyssaviruses have shown that these viruses are subject to stronger selective constraints than many other RNA viruses (18, 21). Whether these constraints reflect a lack of immune-driven selection pressure or are due to the adaptation of lyssaviruses to replicate in a broad range of cell types, producing complex fitness trade-offs, is unknown (21).
Here, we present the first comprehensive evolutionary analysis of EBLV using a variety of analytical methods employed on sequence data encompassing the G and N genes, as well as noncoding gene regions of the viral genome. In particular, we explored whether the selection pressures and rates of evolutionary change observed in these viruses might reflect the peculiarities of their epidemiology in bats, including the possibility that they establish persistent infections and have low virulence.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Phylogenetic analysis.
We estimated
phylogenetic trees for a variety of data sets, including (i) 63 N gene
sequences combining EBLV-1, EBLV-2, and DUVV; (ii) 54 G gene sequences
combining EBLV-1, EBLV-2, and DUVV; (iii) 55 N gene sequences from
EBLV-1; (iv) 47 G gene sequences from EBLV-1; (v) 38 concatenated
noncoding region sequences from EBLV-1; (vi) 38 concatenated G, N, and
noncoding sequences from EBLV-1; and (vii) 49 concatenated G and N gene
sequences from EBLV-1. All phylogenetic trees were estimated using a
maximum-likelihood (ML) method under the general time-reversible (GTR)
model of nucleotide substitution, with the rate of each substitution
type estimated from the data and using successive rounds of tree
bisection reconnection branch swapping. The ML base
frequencies were also estimated from the data, as were the proportion
of invariable sites (I) and a gamma distribution of rate variation
among sites (
) with four rate categories. Parameter values are
available from the authors on request. The robustness of each tree was
assessed using a bootstrap resampling method, with all 1,000 replicates
estimated using the neighbor-joining procedure but with the input
genetic distances produced under the ML substitution model. All trees
were estimated using the PAUP* package
(38).
Recombination tests.
Two approaches were
employed to determine whether recombination had occurred in the
evolutionary history of the EBLVs. First, the N and G gene sequence
alignments were split into nonoverlapping fragments of 250 bp.
Neighbor-joining trees (incorporating the HKY85-
substitution
model) were then estimated for each region. Large-scale phylogenetic
movements, indicative of recombination, were assessed using bootstrap
resampling. Second, Sawyer's run test (GeneConv program)
(35) was used to search
for gene conversion events between pairs of sequences (that is, runs of
sequence similarity that conflicted with correct phylogenetic
relationships).
Analysis of selection pressures. To determine the selection pressures acting on the EBLVs, we estimated the numbers of nonsynonymous (dN) and synonymous (dS) substitutions per site for individual codons and lineages using a maximum-likelihood method that compares various models of sequence evolution (43). To assess the selection pressures at individual codons, we compared the M7 and M8 models. M7 assumes that codons take 1 of 10 dN/dS values, each estimated from the data, although no dN/dS category is allowed to exceed 1.0, so that the model specifies neutral evolution. In contrast, M8 adds an 11th category of codons at which dN/dS can exceed 1, thereby allowing positive selection. Consequently, positive selection is supported if M8 rejects M7 under a likelihood ratio test and contains a category of codons with dN/dS values of >1. The selected codons can then be identified using a Bayesian approach. To assess the selection pressures acting on individual lineages of the EBLV tree, we used a likelihood ratio test to compare the M0 model, which estimates a single dN/dS value for all branches and codons, with the "free-ratio" model, which allows a different dN/dS ratio for each lineage of the EBLV trees. Finally, M0 was also compared with a "two-ratio" model in which separate dN/dS values are estimated for the internal and external branches of the EBLV trees. All of these methods were implemented using the CODEML program of the PAML package (42).
Estimating substitution and population dynamics.
Rates of nucleotide substitution per
site were estimated using the Bayesian Markov Chain Monte Carlo method
available in the BEAST package
(http://evolve.zoo.ox.ac.uk/).
This method utilizes the differences in branch lengths between viruses
sampled at different time points and explores various models of
sequence evolution whose parameters include tree topology, substitution
model, and substitution rate. Uncertainties in the data are reflected
in the 95% high-probability density (HPD) values and hence account for
variation in the molecular clocks among lineages
(13). As before, this
analysis utilized the GTR-I-
model of nucleotide substitution
with parameters optimized during multiple runs. Three population
dynamics models were compared: constant population size, exponential
population growth, and logistic population growth. The models were
compared with Akaike's Information Criterion, with the constant and
exponential and the exponential and logistic models differing by a
single parameter and the constant and logistic models differing by two
parameters. Finally, for the best-fit model, we estimated the rate of
population growth (r), expressed as the number of new
infections per individual per year, and also the time taken for the
epidemic to double in size (
) in years using
= ln(2)/r.
Analyses of viral dispersion. The longitude and latitude for each EBLV strain were determined using the Alexandria Digital Library Gazetteer (http://middleware.alexandria.ucsb.edu/client/gaz/adl/index.jsp). The degree of linear association between pairwise geographic and genetic distances was evaluated using the standard Pearson correlation coefficient, with significance determined using Mantel permutation tests with 10,000 row-column permutations (37) and genetic distances estimated under the ML model described previously. To determine the geographical location of the putative common ancestor (PCA) of EBLV-1a and EBLV-1b in the N and G gene data sets, we first inferred the ancestral sequence for each data set using a parsimony reconstruction procedure (in PAUP*) on the ML trees estimated previously. This PCA sequence was then added to the original data set, and a new ML tree was inferred so that the pairwise distance of each viral isolate (of known geographical location) from the PCA could be determined.
Nucleotide sequence accession numbers. The new sequences determined in this study have been submitted to GenBank and assigned accession numbers AY863220 to AY863408 (EBLV) and AY996321 to AY996324 (DUVV).
| RESULTS |
|---|
|
|
|---|
A maximum-likelihood phylogenetic tree of 63 complete N gene sequences is shown in Fig. 1 and has a number of significant features. In particular, the Duvenhage, EBLV-1, and EBLV-2 genotypes were each identifiable with 100% bootstrap values, and there was clear evidence for the existence of two subtypes of EBLV-1 (EBLV-1a and EBLV-1b), although in this tree there was only weak bootstrap support for the EBLV-1b clade. Interestingly, although an EBLV-2a clade was clearly visible, the EBLV-2b group was not monophyletic in this tree, with the EBLV-2b strains from Switzerland clearly related to the EBLV-2a strains from Holland. Unfortunately, the small sample size of the EBLV-2 data set precludes further analysis of these viruses. Also notable was the fact that EBLV-1 and EBLV-2 did not form a monophyletic group, as EBLV-1 and DUVV were more closely related to each other than EBLV-1 was to EBLV-2, although this assumes midpoint rooting. As expected given the lack of recombination, similar tree topologies were inferred for the G gene and the noncoding sequences and when various data sets were combined (all trees are available from the authors on request).
|
|
|
Geographic spread of EBLVs. To investigate the pattern of EBLV dispersal in Europe, we examined the association between geographic and genetic distances for the combined N, G, and noncoding data sets. We found a significant positive correlation between geographic and genetic distances for both EBLV-1a (Mantel test; r = 0.69; P < 0.0009; n = 23) and EBLV-1b (r = 0.71; P < 0.0001; n = 13). This association is particularly strong for EBLV-1b; based on the current sample of sequences, the virus appears to have spread along a north-south axis in the west of Europe (Fig. 4). Although the point of origin is difficult to identify, particularly given the uncertainty in the phylogenetic analysis, it is notable that the French isolates tend to fall deep in the tree and have the shortest distances to the putative common ancestor.
|
Rates of population growth in EBLV-1.
The
best-fit population dynamic model for EBLV-1a and EBLV-1b was that of
exponential population growth (Table
2). Moreover, the estimated rates of population growth were both similar
and low for the G and N genes, and we found no clear difference in
growth rates between EBLV-1a and EBLV-1b. For the G gene, the mean
growth rate in EBLV-1a was 0.049 new infections per individual per
year, which gives an average epidemic doubling time of
14
years, which was faster than that observed in EBLV-1b at 0.010/year
(mean doubling time,
69 years). However, for the N gene the
situation was reversed and the growth rate was higher in EBLV-1b
(0.053/year; mean doubling time,
13 years) than in EBLV-1a
(0.038/year; mean doubling time,
18 years). Given this mixed
pattern and the fact that the HPD values overlap between EBLV-1b and
EBLV-1a, we conclude that these two virus populations have grown at
similar and relatively low rates. Because EBLV-1a and EBLV-1b are
phylogenetically distinct and do not present a single panmictic
population, it is inappropriate to estimate rates of population growth
for a combined EBLV-1a and EBLV-1b data
set.
|
Given these rates of nucleotide substitution, the most recent common ancestor of the genetic diversity sampled in EBLV-1 was estimated to have existed approximately 500 to 750 years ago (Table 2). Estimates of the time of the most recent common ancestor for EBLV-1a were similar for the N and G genes; for G, the mean age was 239 years, and for N, the mean value was 204 years. The values for EBLV-1b were more variable. For the G gene, the value was high compared to that of EBLV-1a (mean age = 422 years), although the HPD was extremely wide (52 to 1,168 years), making it difficult to infer a date. For the N gene, the mean estimate was extremely low at 72 years, but the HPD was narrower than for G (26 to 152 years). The genetic diversities observed within EBLV-1a and EBLV-1b are therefore of roughly the same age.
Selection pressures in EBLV. For both the N and G genes of EBLV-1 and EBLV-2, we found no evidence for positive selection (dN/dS > 1) acting at any codon or lineage (the full results are available from the authors on request). The mean dN/dS values (estimated using the M0 model) were low in all cases, revealing the action of relatively strong selective constraints. In EBLV-1, the mean dN/dS value for the G gene, 0.088, was nearly twice that observed in the N gene, 0.049. Mean dN/dS values for EBLV-2 were slightly higher than for EBLV-1 at 0.097 and 0.068 for G and N, respectively, but followed the same pattern. Interestingly, the dN/dS value was consistently higher in EBLV-1a than in EBLV-1b for both the N and G genes: 0.065 and 0.034 for N and 0.096 and 0.079 for G. Finally, a comparison of the ratio of dN/dS in the internal and external branches of the tree (using the "two-ratio" model) revealed more nonsynonymous substitutions on the external branches (tips) of the trees than on the interior branches; the ratios of dN/dS on the external branches to that on the internal branches were 4.95 for G and 5.75 for N in EBLV-1.
| DISCUSSION |
|---|
|
|
|---|
Of particular importance was the observation that there was more population mixing in EBLV-1a than in EBLV-1b. In EBLV-1a, the phylogenetic homogeneity of isolates across geographic regions suggests that there is an established viral traffic among bat populations in northern Europe. This process cannot be explained by the behavior of Eptesicus serotinus bats, which are not a migratory species (12) and suffer mortality from EBLV infection. It is therefore possible that long-distance transmission is facilitated by migratory bat species that roost with E. serotinus. The large roost sizes and high densities of many bat species make them well suited to the sustained transmission and exchange of RNA viruses (26), as well as a variety of other pathogens (16). Under these conditions, the transmission of EBLV is most likely maintained through the transfer of infectious saliva during licking and biting (16). Although 95% of all EBLV-positive bats submitted for diagnostic tests are E. serotinus, this could result from ascertainment bias toward anthropophilic species, those that are not experiencing a decline in population size, or those that suffer fatal infections. However, the bat species involved in this presumed viral traffic in northern Europe are unknown. One migratory bat species that is widely distributed in northern Europe and which has been proposed to play a role in EBLV transmission is Pipistrellus nathusii, although there is currently no evidence for its involvement (8, 36).
In contrast, the greater spatial structure in the EBLV-1b tree indicates that there has been less contact between bat populations from diverse regions in Europe. In particular, the Dutch strains form a discrete phylogenetic group, implying that contact networks between northern and southern Europe are not well established. This may, in part, be due to a decline in bat populations, as reported in countries like Germany (1, 27), for reasons that include human intervention. If roosts in intermediary locations disappear, thereby reducing contact between bat communities, then viral population subdivision will also be enhanced. Similarly, there may be an absence of migratory species able to connect the foci of EBLV-1b in southern Europe (Spain and southern France) with those in more western (Brittany) and northern (northern France and The Netherlands) locations. As a case in point, Tadarida teniotis and Miniopterus schreibersii, which are present in Mediterranean regions, can fly long distances, and are known to carry EBLV-1 (36), are not encountered in northern and western France.
Our analysis of the place of origin for EBLV-1a and EBLV-1b was more complex, particularly given the small number of samples available from southern Europe. In the case of EBLV-1b, a combination of methods suggested that the sampled lineages most likely arose in France and then spread northward and southward, although an initial entry into Europe via Spain and southern France cannot be excluded. The spread of EBLV-1a was even harder to infer, and we were unable to locate a consistent place of origin. However, the observation that the virus occupies an east-west axis across northern Europe implies that the virus spread in a directed manner across the region.
Evolutionary processes in EBLV. A key aspect of our work was the observation that EBLV is subject to strong constraints against amino acid change. In particular, the dN/dS values estimated for EBLV are relatively low for RNA viruses (41), even in comparison to the G gene of genotype 1 lyssaviruses in canids (21). Similarly strong selective constraints have previously been documented in arthropod-borne RNA viruses, presumably because of the fitness trade-offs that are inherent when replicating in cell types as different as those from mammals and insects (41). However, the stronger purifying selection against the N gene relative to the G gene was expected, as envelope glycoproteins, which often interact with host cell receptors and are the main targets of the immune response, show greater amino acid diversity than nucleocapsid proteins in most RNA viruses (41).
The
constrained evolution of EBLV-1 was even more apparent in our analysis
of rates of nucleotide substitution. Specifically, the substitution
rates we estimated for EBLV-1 are not only among the lowest seen in RNA
viruses (19,
23), but lower than those
previously reported for genotype 1 lyssaviruses. These
estimates range from relatively high rates of
1 x
103 substitutions/site/year
(19,
22) to lower rates of 3.1
x 104 to 5.5 x
104 substitutions/site/year
(4,
6). The lowest
evolutionary rate thus far recorded for genotype 1 lyssaviruses was
5.06 x 105 substitutions/site/year at
nonsynonymous sites (21).
However, as the synonymous rate was an order of magnitude greater, at
4.10 x 104 substitutions/site/year, the
overall substitution rate will be higher than that reported here for
EBLV-1.
There are a variety of plausible explanations for the low rates of evolutionary change in EBLV-1. First, it is possible that EBLV-1 replicates slowly in its bat hosts so that few mutational errors are produced per unit time. Very low levels of EBLV RNAs have been detected in several tissues (brain, blood, lung, heart, tongue, and esophagus-larynx-pharynx) and saliva of apparently healthy bats, suggesting either that there is a nonproductive infection in these species or that these RNAs are remnants of an earlier lyssavirus infection (36, 40). Nevertheless, some hosts do develop symptoms and succumb to disease (40). Why this is so is not known. As such, it seems premature to conclude that EBLV is characterized by especially low rates of replication.
A second possibility is that peculiarities of the bat immune system have altered the selection pressures faced by the EBLVs, in turn putting a brake on rates of evolutionary change. Although studies of fruit bats have revealed well-developed immune systems (30), lower levels of agglutinating, hemagglutinating, and complement-fixing antibodies are produced in response to various antigens than in conventional laboratory animals (20), and the peak of primary antibody response after antigenic challenge is delayed (9). Additionally, the activation of T lymphocytes is significantly delayed in bats compared to mice (10, 30). Despite these differences, a previous study of genotype 1 lyssaviruses reported similar dN/dS values in bats and terrestrial vertebrates (21), and as yet there is no evidence for reduced rates of nucleotide substitution in rabies viruses isolated from bats (4). This suggests that immune selection is not especially weak in bats compared to other mammals, although this clearly needs to be explored in more detail.
Whatever the explanation for the low rate of evolutionary change in EBLV-1, our study suggests that the virus has reached an adaptive peak, so that most amino acid changes reduce fitness and are therefore removed by purifying selection. This interpretation is supported by a number of observations. First, as discussed above, rates of nucleotide substitution, especially at nonsynonymous sites, were low in EBLV-1. More striking was that the dN/dS ratios on the internal branches of the EBLV-1 trees were far lower than the equivalent values seen on external branches. This means that very few nonsynonymous changes have reached fixation during the evolutionary history of EBLV-1, as expected if fitness benefits are difficult to generate. Hence, the nonsynonymous mutations we see in EBLV-1 most likely represent transient deleterious and slightly deleterious amino acid changes that are eventually removed from the population by purifying selection; this explains why they fall at the tips of phylogenetic trees. Intriguingly, unusually low rates of nucleotide substitution have also been recorded in another rhabdovirus, vesicular stomatitis virus (19). The fact that vesicular stomatitis virus induces a persistent infection in its mammalian hosts (25) hints that this form of virus-host relationship might often be associated with reduced rates of nucleotide substitution. Finally, although the population sizes of EBLV in European bats are increasing, the measured growth rates are very low, suggesting that the virus exists in a near-equilibrium state in many bat species.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| J. Bacteriol. | Mol. Cell. Biol. | Microbiol. Mol. Biol. Rev. |
|---|
| Clin. Vaccine Immunol. | ALL ASM JOURNALS |
|---|