Previous Article | Next Article ![]()
Journal of Virology, September 2006, p. 8503-8509, Vol. 80, No. 17
0022-538X/06/$08.00+0 doi:10.1128/JVI.00585-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Department of Zoology, The University of Hong Kong, Hong Kong, China,1 Department of Computer Science, University of Auckland, Auckland, New Zealand,2 Department of Zoology, The University of Oxford, Oxford, United Kingdom,3 Department of Mathematics, The University of Hong Kong, Hong Kong, China4
Received 23 March 2006/ Accepted 7 June 2006
|
|
|---|
|
|
|---|
The epidemiological event leading to the emergence and expansion of vvIBDV is an open question. Phylogenetic analyses have revealed independent evolutionary histories of the two genome segments (17, 44), suggesting that a reassortment event may have played a role in the emergence of vvIBDV. Previous reports suggested that both the major capsid protein (VP2) and the RNA-dependent RNA polymerase (VP1), which are located on genome segments A and B, respectively, contribute to the virulence of IBDV (1-3, 26, 43). Questions have been raised regarding the phylogenetic origins and roles of the unique mutations of the two genome segments in relation to the expansion of vvIBDV.
Temporally spaced sequence data from rapidly evolving RNA viruses provide opportunities to address problems regarding their evolutionary behaviors, e.g., past population dynamics and evolutionary rates (7, 9, 10). Here we investigated the phylogenetic origins and demographic behavior of vvIBDV, as well as the time of emergence of the most recent common ancestor (TMRCA) of its genomes. Several approaches, including linear regression (LR), maximum likelihood (ML), and Bayesian Monte Carlo Markov chain (BMCMC), under the assumption of both strict and relaxed molecular clocks, were employed for TMRCA estimations. An alternative technique for estimating the TMRCA of the VP1 data set, site-stripping clock detection (SSCD) (24), was also used, in an attempt to accommodate deviations from the molecular clock. Comparison of TMRCAs inferred with these approaches provides comprehensive estimations of the time of emergence of both genome segments of vvIBDV. The past population dynamics of vvIBDV were reconstructed by using coalescence approaches in both the ML and BMCMC frameworks (10, 35), allowing us to estimate the time of onset of vvIBDV expansion. By reconstructing the chronological order of these epidemiological events, this report provides insights into the possible mechanism leading to the emergence and expansion of vvIBDV.
|
|
|---|
To investigate the phylogenetic origin of genome segment B of vvIBDV, all available nonidentical VP1 sequences of all birnaviruses were retrieved from GenBank. The codons were aligned, and the sites with gaps were removed. A BMCMC tree was constructed for this VP1 data set as described above. The vvIBDV clades of each data set, designated vvfVP2, vvpVP2, and vvVP1, were defined as the clades containing typical vvIBDV isolates, including DV89, UK611, and OKYM (42), with Bayesian posterior clade probability (BP) support of at least 0.85.
Molecular clock analysis and SSCD. The sampling years of taxa in vvIBDV clades were obtained from references or personal communication with the authors. The alignments are available on request. The phylogenies of the vvVP1, vvfVP2, and vvpVP2 clades were reestimated with PAUP* (Sinauer Associates, Inc.) using an ML heuristic search with a tree bisection-reconnection branch-swapping algorithm under the nucleotide substitution models suggested by MODELTEST. Due to the large number of taxa (n = 156) of the vvpVP2 clade and the computational intensiveness of ML analysis, phylogenetically similar taxa were discarded manually (n = 48 [after removal]). Branch lengths of the phylogenies were reestimated in an ML framework under a generally unconstrained model (no clock [M0]) or the assumption of a single substitution rate (global clock [M1]), which are implemented in PAML (45). To test for the presence of a molecular clock, a likelihood ratio test (LRT) was used to compare the performance of both models in each data set.
In the vvVP1 clade, since M0 showed a significantly better fit to the data than M1 (P = 0.03), i.e., the molecular clock hypothesis was rejected, an SSCD procedure based on likelihood ratio reduction (24) was applied to recover sites exhibiting clocklike behavior. Briefly, the natural log likelihoods (lnL) of each site under both M0 and M1 were estimated, and the relative contribution to the likelihood ratio statistics, i.e., 2
lnL, was then calculated. Those sites that contributed most to the rejection of the molecular clock hypothesis, i.e., sites with the largest 2
lnL, were removed (five by five) until M0 no longer fit the data significantly better than M1, i.e., P was >0.05 under the LRT. Random site stripping was used as a control to demonstrate that the insignificant result is not simply the result of lack of power due to the removal of data. The JAVA codes of the SSCD procedure can be found in the PAL package (8).
Estimation of TMRCA. Under the assumption of a molecular clock, TMRCAs of vvVP2 and the site-stripped vvVP1 data set were estimated with ML, BMCMC, and LR approaches. The ML estimations were carried out with PAML by rescaling the branches of the ML topologies into a calendar time scale under a strict global molecular clock (M1) as described above. The BMCMC estimations were carried out with BEAST (http://evolve.zoo.ox.ac.uk/beast/) under the general time-reversible substitution model with gamma-distributed rate variation (four categories) and a proportion of invariant sites. Posterior distribution of TMRCAs was then summarized across four independent MCMC chains with TRACER (http://evolve.zoo.ox.ac.uk/software.html?id = tracer). In TMRCA estimation by the LR method, ancestral states of the vvIBDV clades were first inferred under the ML criterion, and then the pairwise genetic distances (with the optimal substitution model suggested by MODELTEST) of taxa from their inferred TMRCA, which are implemented in PAUP*, were calculated. Correlations of the estimated pairwise genetic distances and sampling years were examined by LR analysis implemented in GraphPad PRISM (San Diego, CA).
To evaluate the possible biases of the SSCD procedure on TMRCA estimations of the vvVP1 clade, its TMRCA was also estimated in a Bayesian framework under a relaxed molecular clock model using PAML and MULTIDIVTIME (22), according to the procedures described by Rutschmann (http://www.plant.ch), which take into account both uncertainty in branch length estimation and lineage-specific rate variation. The mean normally distributed prior substitution rate was set at 0.7 x 103 substitutions per site per year with a standard deviation (SD) of 0.1 x 103, and the mean normal prior TMRCA was set at 1980 with an SD of 15 years, based on the posterior distribution of the site-stripped vvVP1 data set under the assumption of a molecular clock. It is noted that the SSCD-estimated mutation rate of the site-stripped vvVP1 data set, i.e., 0.7 x 103 substitutions per site per year, is within the general mutation rate range of other RNA viral genes (20) and is comparable to that of other viral RNA-dependent RNA polymerases (40). Different prior TMRCAs, including 1985 ± 10 years, 1965 ± 30 years, and 1945 ± 50 years, were used in order to assess the influence of priors on posterior distribution. The posterior distribution of the TMRCA was summarized across four independent MCMC chains.
Inference of past population dynamics. To describe the epidemic history of vvIBDV in terms of change of effective population size through time, i.e., Ne(t), a Bayesian skyline plot (BSP) was performed on the vvpVP2 data set (n = 153). Ne(t) can be thought of as the number of infections contributing to new infections rather than the total number of prevalent infections (16). BSP is a nonparametric coalescence analysis for estimating past population dynamics through time without dependence on a prespecified parametric model; it has proven very useful as a demographic model selection tool (10). The BSP of the vvpVP2 data set was jointly estimated with the BMCMC TMRCA inference using BEAST, as described above. To select a parametric model that best describes the past population dynamics of vvIBDV, the goodness-of-fit of five demographic models in the time-scaled genealogy was evaluated in terms of likelihoods and LRT, which were calculated with GENIE (35, 36). These models include the piecewise expansion, logistic, constant, piecewise logistic, and exponential models (35). The demographic parameters of the best-fit model were estimated with confidence intervals, in both the BMCMC and ML frameworks, by BEAST and GENIE, respectively. The BMCMC and ML parametric estimates of the chosen demographic model were then evaluated by superimposing its nonparametric estimates through time, i.e., Bayesian and classical skyline plots (CSP) for BMCMC and ML estimations, respectively. CSP is a piecewise-constant model of population size and typically produces "noisy" plots that display the stochastic variability inherent in the coalescence process, while BSP is an advanced version of CSP that naturally produces a smoother estimate due to the "averaging" effect of MCMC sampling (10). Bayesian and classical skyline plots were estimated with BEAST and GENIE, respectively.
|
|
|---|
![]() View larger version (17K): [in a new window] |
FIG. 1. Phylogenetic origins of both genome segments of vvIBDV. (A) BMCMC tree of full-length VP2 of serotype I IBDV, rooted with serotype II strain OH. The branch length of the root (dotted line) is not drawn to scale. (B) BMCMC tree of the full-length VP1 of all birnaviruses. For clarity, only the IBDV lineage is shown, and the branch length of its root to other birnaviruses (dotted line) is not drawn to scale. Asterisks represent clades with BP support of at least 0.85. The scale bars are in units of nucleotide substitutions per site. The vvIBDV clades in both trees are highlighted in gray.
|
|
View this table: [in a new window] |
TABLE 1. Molecular clock and TMRCA estimations of both genome segments of vvIBDV based on various approaches
|
![]() View larger version (31K): [in a new window] |
FIG. 2. Estimation of vvVP1 TMRCA under both strict and relaxed molecular clock assumptions. (A) Reduction of relative contribution to the likelihood ratio statistics (2 lnL) by SSCD. The upper limit of 2 lnL for not rejecting the molecular clock hypothesis in the vvVP1 data set is approximately 14. (B) ML estimations of vvVP1 TMRCA with different numbers of sites stripped. (C) Comparison of posterior distributions of the vvVP1 TMRCA estimated under both strict and relaxed molecular clocks.
|
![]() View larger version (28K): [in a new window] |
FIG. 3. Estimations of TMRCA of both genome segments of vvIBDV and onset of its expansion under different approaches. (A) TMRCA estimations of vvfVP2 and the site-stripped vvVP1 data set with the LR approach. The dotted lines and the solid lines represent 95% CIs and best-fit values, respectively. (B) TMRCA estimations of vvfVP2 and the site-stripped vvVP1 data set, as well as the onset of vvIBDV expansion with the ML approach. The dotted lines and the solid dots represent the 95% CIs and best-fit values, respectively. (C) TMRCA estimations of vvVP2 and vvVP1, as well as the onset of vvIBDV expansion, with the BMCMC approach. The posterior distribution of vvVP2 TMRCA represents pooled BMCMC samples of the vvfVP2 and vvpVP2 data sets (BEAST), while that of vvVP1 TMRCA represents pooled BMCMC samples of the original vvVP1 data set under a relaxed clock (MULTIDIVTIME) and the site-stripped vvVP1 data set under a strict clock (BEAST). The time axes of the three panels are drawn to the same scale.
|
)/r, where r is the exponential growth rate and
is the population size prior to change, as a proportion of the population size at present (Fig. 5A). It is noted that the piecewise expansion demographic model is not implemented in BEAST, version 1.3, and the codes for this particular model were developed by the authors; they are available on request. Superimposing the ML and Bayesian parametric estimates under the piecewise expansion model onto their nonparametric skyline plots revealed a good fit to the demographic signal within the vvpVP2-95 data set (Fig. 5B and C). The ML estimation of exponential growth phase start time is 1983 (1970 to 1987), while the BMCMC estimation yielded consistent results with more precise CIs, i.e., 1988 (1986 to 1990). Although ML estimation of demographic parameters based on a single genealogy ignores possible phylogenetic errors, overlapping results were produced with the BMCMC estimation, which takes phylogenetic uncertainty into account (10), reflecting the reliability of our estimations of the onset of vvIBDV expansion.
![]() View larger version (17K): [in a new window] |
FIG. 4. Nonparametric estimation of past population dynamics of vvIBDV with BSP. The BSP was estimated from the vvpVP2 data set and is presented with its 95% high-probability density (HPD) values (gray area).
|
![]() View larger version (17K): [in a new window] |
FIG. 5. Estimation of the demographic parameters of vvIBDV under the piecewise expansion model. (A) Schematic representation of the piecewise expansion model of population growth. (B) BSP (nonparametric) and BMCMC parametric estimation of Ne of the vvpVP2-95 data set under the piecewise expansion model by using BEAST. (C) CSP (nonparametric) and ML parametric estimation of Ne of the vvpVP2-95 data set under the piecewise expansion model by using GENIE. The BSP is presented with its 95% high-probability density (HPD) values (gray area). Only the mean or best-fit values are presented in the BMCMC and ML parametric estimations, respectively.
|
|
|
|---|
The epidemiological events related to the initiation of vvIBDV expansion have not been investigated systematically. While most of the interest regarding virulence and attenuation of the virus was focused on genome segment A, as it encodes neutralizing epitopes and the putative receptor binding protein (43), our estimations of the vvfVP2 TMRCA indicate that genome segment A of vvIBDV emerged more than 20 years before the first documented case of vvIBDV, suggesting that the vvIBDV expansion may not be directly initiated by the emergence of vvVP2. This observation argues against the hypothesis that mutation of genome segment A is the major contributing factor in the emergence and expansion of vvIBDV (33). Alternatively, the vvVP1 TMRCA was estimated to around the early to mid-1980s, which coincides with the estimated onset of IBDV expansion in a period of approximately 5 years, suggesting a possible role of vvVP1 in the initiation of vvIBDV expansion. The origin of vvVP1 is unknown, but its phylogenetic independence from all other known IBDV strains suggests that it may be reassorted from an unidentified reservoir. A serotype II IBDV strain was successfully isolated from African blacked-footed penguins (Spheniscus demersus) (19), implying the presence of other possible natural hosts of IBDV. Moreover, serological surveys of aquatic fowl (12, 15) and free-living wild birds (13, 31) suggested that several migratory and sedentary avian species, including cattle egrets (Ardeola ibis), pigeons (Columba livia), carrion crows (Corvus corone), and Antarctic penguins (Pygoscelis adeliae), etc., may also be carriers or reservoirs of IBDV based on their seroprevalence.
A number of reports have demonstrated the drastic effects of mutations of VP2 residues on attenuation of the virulence of vvIBDV (3, 43); these residues were proposed to be related to the protein's receptor binding efficiency (5), but they are not the sole determinants of virulence (2). On the other hand, VP1 was also demonstrated to be a determinant of virulence in vivo, possibly by altering the replication efficiency (1, 26). A recent study reported the isolation of a naturally reassorted IBDV with genome segment A of very virulent origin but genome segment B of non-very virulent origin; this isolate showed significantly reduced pathogenicity compared with typical vvIBDVs (25), demonstrating that the determinants on genome segment A may not be enough to induce the hypervirulence of typical vvIBDVs. These observations suggest that the hypervirulence of vvIBDV may be a synergistic effect of mutations on both of its genome segments, although experimental evidence is needed for further elucidation. In addition, examples of hypervirulence of other RNA viruses caused by mutations on their polymerase genes are well documented (6, 21, 27, 41). Taken together with our date estimates, the phylogenetic origin of vvIBDV genomes, the possible roles of both VP1 and VP2 in hypervirulence, and the reported evidence of the existence of possible reservoirs, we hypothesize that vvIBDV expansion may have been initiated by the reassortment of its genome segment B with a mutant VP2 background, which caused a sudden increase in virulence and hence expansion in the mid-1980s, rather than being directly related to the emergence of vvVP2 in the 1960s, as vvVP2 alone may not be enough to induce the hypervirulence of vvIBDV (25). The presence of rare IBDV isolates carrying vvVP2 and non-vvVP1 further supports this hypothesis (25), as these isolates can be considered the descendants of those rare "ancient" vvIBDVs that appeared after the estimated TMRCA of vvVP2 in the mid-1960s but before the estimated reassortment of vvVP1 and the expansion of vvIBDV in the mid-1980s.
Genome reassortment of multisegmented RNA viruses plays important roles in the emergence or expansion of new virus strains with altered antigenicity or pathogenicity (29, 32, 37), best demonstrated by the devastating pandemics of influenza A (39). Investigations of possible evolutionary mechanisms leading to the emergence and expansion of novel pathogenic virus strains would certainly provide insight into the scope of surveillance and prevention efforts (14, 28). In the case of IBDV, while most of the sequencing efforts have been focused on genome segment A of viruses in chickens, larger-scale surveillance in other avian species and more sequence data on genome segment B would certainly provide important clues toward the understanding of the diversity and origin, as well as the evolutionary epidemiology, of IBDV.
C.-C.H., F.Z., and F.C.C.L. designed the research; C.-C.H. performed all ML analyses and wrote the paper; T.-Y.L. performed SSCD analyses; A.D. and A.R. performed all BMCMC analyses; C.-W.Y. and P.-Y.L. performed LR analyses; and Y.-F.L. and P.T.W.N. performed molecular clock tests. We declare that there were no conflicts of interest relevant to this study.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»