Previous Article | Next Article ![]()
Journal of Virology, December 2005, p. 15123-15130, Vol. 79, No. 24
0022-538X/05/$08.00+0 doi:10.1128/JVI.79.24.15123-15130.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Virology, U.S. Army Medical Component, Armed Forces Research Institute of Medical Sciences, Bangkok, Thailand,1 Queen Sirikit National Institute of Child Health, Bangkok, Thailand,2 Center for Infectious Disease Dynamics, Department of Biology, The Pennsylvania State University, Mueller Laboratory, University Park, Pennsylvania 168023
Received 3 December 2004/ Accepted 20 September 2005
|
|
|---|
|
|
|---|
In recent years, there has been considerable interest in describing the genetic structures of DENV populations and determining their underlying evolutionary processes (13, 27). These studies have revealed that each serotype contains phylogenetically distinct clusters of viruses that have been christened "genotypes" or "subtypes" (13, 39). More-revealing observations are that (i) DENV strains from humans and nonhuman primates often fall into different genotypes, indicating that the human and sylvatic cycles are usually independent; (ii) some genotypes have widespread (or "cosmopolitan") geographic distributions, while others are restricted to particular localities; and (iii) viral genetic diversity is greatest in Southeast Asia, suggesting that it is the "source" population for many dengue outbreaks, particularly those recently observed in the Americas (although our sampling from Africa remains sparse).
An increasingly common observation in phylogenetic studies of DENV is that of lineage turnover. This takes two forms. First, most viral lineages sampled from a particular time point fail to generate any daughter lineages, resulting in phylogenetic trees with a strong temporal (ladder-like) topology (4, 15). This most likely occurs because these lineages are defined by viruses that contain deleterious or slightly deleterious mutations that are eliminated by purifying selection (12, 15) and/or because fully functional viruses are not sampled in later years due to regular bottlenecks in viral population size, perhaps due to seasonal and spatial fluctuations in vector population size and density (30). A second, more dramatic mode of lineage turnover occurs when an entire clade of viruses that has persisted in a particular locality for a number of years is not evident on subsequent sampling, indicating that it has dropped dramatically in frequency, perhaps even experiencing extinction, and is sometimes replaced by a new clade of viruses. A number of occurrences of such dramatic clade replacement have been reported (4, 32, 41), although the evolutionary processes controlling these events remain uncertain. In particular, this large-scale phylogenetic pattern could reflect the action of either dramatic population bottlenecks or natural selection, such that clades with enhanced fitness out-compete those that are less fit.
There is growing experimental evidence that strains of DENV (that is, either entire genotypes or smaller groupings of viruses within them) differ in fitness, as required for selectively driven clade replacement. Most notably, the "American" genotype of DENV-2 appears to be less well adapted to Aedes aegypti from the Americas than are Asian genotypes of this virus (2, 3) and also does not replicate as well in certain human cell types (6). Together, these genetic differences are thought to have lowered the fitness of American genotype viruses in comparison to those viruses imported from Southeast Asia. However, such fitness differences will be manifest only at times of selective competition because the American genotype is still highly transmissible under suitable epidemiological conditions (40).
Well-characterized and archived dengue viruses serially collected within Thailand provide a unique opportunity to explore the evolutionary processes which underlie clade replacement. In particular, (i) all four DENV serotypes cocirculate with varying relative predominance from year to year, (ii) dramatic changes in clade frequency have previously been observed in DENV-2 and DENV-3, and (iii) epidemiological and clinical data are available from 1973 to the present from the metropolitan area of Bangkok (24). Herein, we undertake a detailed analysis of the temporal dynamics of DENV-1 and DENV-3 evolution over a 30-year sampling period in Thailand. We include viruses isolated from patients experiencing a variety of disease manifestations and also explore the spatial diffusion of both serotypes in geographical locations other than Bangkok. In particular, we noted a number of cases of clade replacement in these viruses and explored whether stochastic or selective forces were the cause of these major phylogenetic events. As Bangkok is considered a potential epicenter of DENV transmission throughout Thailand (7) and potentially beyond, a better understanding of the forces influencing dengue virus evolution and epidemiology within Bangkok has potential widespread implications within Southeast Asia.
|
|
|---|
RNA extraction and RT-PCR. Virus RNA was extracted from cell culture supernatant (samples before 2000) or serum (samples after 2000) using the QIAamp viral RNA mini kit (QIAGEN, Germany) according to the manufacturer's instructions. Oligonucleotide primers for RT-PCR amplification and sequencing were designed using the Primer3 program (http://frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi. The sequences and positions of primers used to determine the complete genomes of DENV-1 and DENV-3 are listed in the supplemental material. All specimens used for sequencing were pretested using RT-PCR and nested PCR to confirm assignment to DENV-1 and DENV-3 and to adjust virus concentration. Genomic RNA was converted to cDNA using random hexamer oligonucleotides with the SuperScript first-strand synthesis system (Invitrogen) according to the manufacturer's instructions. All DNA fragments of the envelope (E) gene and the overlapping DNA fragments which covered the entire genome for 10 strains of DENV-1 and 6 strains of DENV-3 were amplified by PCR with Proofstart (QIAGEN, Germany) and Taq DNA polymerase (Roche Ltd.). The PCR-amplified DNA fragments were purified using QIAquick PCR purification kits and the QIAquick gel extraction kit (QIAGEN, Germany) according to the manufacturer's instructions. Purified DNA fragments were used for sequencing.
Sequencing and analysis. Cycle sequencing reactions were performed using the DYEnamic ET dye terminator sequencing kit (Amersham Pharmacia Biotech), according to the manufacturer's instructions, with sequencing primers described in the supplemental material. The sequencing products were cleaned by standard precipitation before sequencing on a MegaBACE 500 automated DNA sequencer (Amersham Pharmacia Biotech). Overlapping nucleic acid sequences were combined for analysis and edited with the aid of Sequencher software (Gene Code Corp.).
Phylogenetic analysis.
For both DENV-1 and DENV-3, a sample of 120 complete E gene sequences was used in the phylogenetic analysis. This included all the Thai DENV data newly generated here, other than four DENV-1 and two DENV-3 sequences that were identical to another sequence in the data set (ThD1-PT0214/96, ThD1-0120/00, ThD1-0049/01, ThD1-0261/01, ThD3-0007/87, and ThD3-1283/98). These Thai sequences were then combined with a sample of viruses representing the global genetic variability of DENV-1 and DENV-3. Sequence alignment was undertaken manually, which resulted in alignments of 1,485 bp and 1,479 bp for DENV-1 and DENV-3, respectively. To explore the issue of clade replacement in more detail, we also conducted a phylogenetic analysis on the complete coding regions of 65 DENV isolates representing all four serotypes. In this case, sequence alignment was undertaken using the ClustalX program (35) and then checked manually, resulting in a final data set 10,185 bp in length. All data sequences used in this study are listed in the supplemental material. Maximum likelihood (ML) phylogenetic trees were then estimated for all three data sets. In all cases, we used the most general GTR + I +
4 model of nucleotide substitution (where GTR is general time reversal, I is invariant sites, and
is rate parameters) available in the PAUP* package (33) and undertook successive rounds of branch swapping. All parameter values are available upon request. To determine the support for particular groupings on the phylogenetic trees, we performed a bootstrap resampling analysis using 1,000 replicate neighbor-joining trees estimated under the ML substitution model.
Estimating selection pressure. We used a variety of computational methods to explore the selection pressures acting on DENV-1 and DENV-3. We also analyzed two different types of data: (i) the complete coding regions from the 10 Thai DENV-1 strains determined here and nine complete DENV-3 genomes, comprising the six Thai viruses determined here plus three other complete genomes available in GenBank, and (ii) a sample of 80 E gene sequences representative of DENV-1 and DENV-3 in Thailand. To match these selection analyses with the occurrence of clade replacement, we restricted the E gene analysis to DENV-1 strains assigned to genotype I and DENV-3 strains assigned to genotype II.
Mean ratios of nonsynonymous to synonymous substitutions per site (dN/dS) for each data set were measured using the pairwise method of Nei and Gojobori (23) as implemented in the MEGA2 package (17). To obtain codon- and lineage-specific measures of selection pressure, we employed the maximum likelihood method in the CODEML program (43). This involved the use of four models of codon evolution (44). To analyze selection pressures at individual codons, we compared the M7 and M8 models; in the former codons, we took 1 of 10 categories of dN/dS ratios, all estimated from the data but where no category had a dN/dS ratio of >1.0 so that the M7 model specified only neutral evolution. M8 allowed positive selection by adding an 11th category of codons at which dN/dS can exceed 1.0. Evidence for positive selection is obtained if M8 significantly rejects M7 in a likelihood ratio test and has at least one category of codons with a dN/dS ratio of >1. To analyze selection pressures along each lineage of the DENV-1 and DENV-3 phylogenies, we compared M0, in which each lineage has the same dN/dS ratio, with the "free-ratio" model, in which lineages are allowed to take on different values of dN/dS. For the M0 and "free-ratio" models, dN/dS is estimated from the data, and their support is again assessed using a likelihood ratio test.
We also examined the biological characteristics of the amino acid changes that were associated with the major clade replacement event in DENV-1. First, we mapped amino acid changes onto the complete genome ML phylogenetic tree for DENV-1 using the parsimony approach available in the MacClade package (19) and noted all those that occurred on the branches associated with clade replacement. To determine whether amino acid changes occurring on branches associated with clade replacement were common or rare in dengue virus evolution, we recorded the frequency of all types of amino acid changes on the ML tree of the 65 complete genomes from all four viral serotypes. This allowed us to characterize 1,909 unambiguous amino acid changes in all genes across the DENV phylogeny. We then counted the frequency of amino acid replacements from each amino acid and determined whether the types of amino acid changes that occurred on the branches associated with clade replacement were more or less common than expected, given their overall frequency in DENV evolution. For example, of the 1,909 unambiguous amino acid changes, 150 were from the amino acid Gly to another amino acid, so that the expected frequency of these types of amino acid changes is 7.9 (150 Gly amino acid changes divided by 19 possible amino acid changes). The major clade replacement event in DENV-1 is associated with a Gly-to-Ala change. As the observed frequency of this change is 9/150, we conclude that this particular amino acid change occurs at its expected frequency and, hence, is not rare in DENV evolution.
Nucleotide sequence accession numbers. All sequences produced here have been submitted to GenBank (accession numbers AY732378 to AY732483 for DENV-1 and AY676348 to AY676421 for DENV-3).
|
|
|---|
![]() View larger version (33K): [in a new window] |
FIG. 1. ML tree of 120 strains of DENV-1, including 92 newly sampled strains from Thailand. Thai strains sampled from regions other than Bangkok are denoted as follows: A, Ayuthaya; CN, Chai Nat; K, Kamphaeng Phet; NB, Nonthaburi; P, Pichit; PT, Pathum Thani; and S, Samut Prakan. Key clades are indicated. Bootstrap support values are shown for major nodes on the tree. The tree is midpoint rooted for purposes of clarity only, and all horizontal branch lengths are drawn to scale.
|
Molecular epidemiology of DENV-3. Our analysis of 120 DENV-3 isolates identified the four genotypes that have been recognized previously in this serotype (18, 38) (Fig. 2). Briefly, genotype I comprises viruses from Southeast Asia and the Pacific (although none have been sampled since 1992); the large genotype II is entirely Asian in composition, containing viruses from Malaysia, Myanmar, and Thailand; genotype III, which first invaded the Americas in the early 1990s (38), comprises viruses sampled from Asia (Sri Lanka, India), the Pacific (Samoa), and Latin America (Brazil, Martinique, Mexico, Venezuela); and genotype IV, which may have gone extinct, consists of two viruses sampled from Puerto Rico in the 1960s and 1970s. Finally, two viruses did not fall into a clear genotype: one sampled from the Philippines in 1956 and another sampled from Thailand in 1962. All the Thai DENV-3 sequences generated here fell into genotype II. Furthermore, clear phylogenetic groups associated with time of sampling were present within this genotype, with the majority of those strains sampled post-1991 (and none earlier) falling into a well-supported group. This is referred to as the "1991 to 2002" clade.
![]() View larger version (29K): [in a new window] |
FIG. 2. ML tree of 120 strains of DENV-3, including 66 newly sampled strains from Thailand. Thai strains sampled from regions other than Bangkok are denoted as follows: A, Ayuthaya; C, Chachoengsao; N, Nakhon Pathom; NB, Nonthaburi; KB, Kanchana Buri; PT, Pathum Thani; S, Samut Prakan; and SB, Suphan Buri. Key clades are indicated. Bootstrap support values are shown for major nodes on the tree. The tree is midpoint rooted for purposes of clarity only, and all horizontal branch lengths are drawn to scale.
|
Clade replacement in DENV-1 and DENV-3. The most striking pattern in our phylogenetic analysis was that of clade replacement. This is most clearly seen within genotype I of DENV-1, in which the frequency of the 1980 to 1994 clade drops dramatically and may even have experienced extinction (although this will need to be confirmed with larger sample sizes), to be replaced by the 1990 to 2002 clade. Specifically, during the period from 1990 to 1994, these two clades of DENV-1 cocirculated in Thailand, but after 1994, one of them (the 1990 to 2002 clade) dominated. There is also some evidence for another replacement event in DENV-1 in Thailand during the early 1980s, in which the genotype III viruses were replaced by those assigned to genotype I. Similarly, in genotype I of DENV-3, a clade replacement took place during the early 1990s, resulting in the 1991 to 2002 clade.
To help determine whether the major clade replacement event in DENV-1 was caused by differences in viral fitness (that is, by the positive selection of one clade over the other), we examined selection pressures among all genes and lineages. We found no evidence for adaptive evolution acting on any individual gene, site, or lineage of the DENV-1 data sets (Table 1). Indeed, the overwhelming evolutionary pressure was that of purifying selection; the mean dN/dS ratios across genes were extremely low, ranging from 0.010 to 0.129, although there was a 7-fold variation in dN among genes and a 2.7-fold variation in dS. A similar picture of selective constraint across all genes was observed in DENV-3, with a 3-fold variation in dN and a 3.4-fold variation in dS (Table 2). Even more notable was the lack of compelling evidence for positive selection accompanying the lineage replacement events in DENV-1. In no gene did dN exceed dS on the branches separating the 1980 to 1994 and 1990 to 2002 clades, and in most cases, the dN/dS ratios on these branches were no greater than the means across all branches of the DENV-1 tree, further suggesting that they are not especially elevated. The exceptions to this were in the E, NS2A, and NS4A genes (Table 1), although the dN/dS values were still sufficiently low (reaching a maximum of 0.203 in NS2A) to imply that purifying selection was again the dominant evolutionary process acting on this lineage. A similar picture was seen in DENV-3. Here, the branch associated with clade replacement in 1991 had slightly elevated dN/dS ratios in the E, NS2A, NS3, and NS4A genes, although none suggested a major change in selection pressures (Table 2).
|
View this table: [in a new window] |
TABLE 1. Selection pressures in DENV-1 on a gene-by-gene basisa
|
|
View this table: [in a new window] |
TABLE 2. Selection pressures in DENV-3 on a gene-by-gene basisa
|
|
View this table: [in a new window] |
TABLE 3. Amino acid changes associated with the major clade replacement event in DENV-1
|
|
|
|---|
A potentially more important observation is that patterns of intraserotype genetic diversity in DENV vary greatly on a temporal scale. Most notably, through the analysis of longitudinally sampled data, it has become clear that individual lineages or entire clades of viruses frequently arise, persist for a period of time, and then disappear. Consequently, DENV phylogenetic trees often have a ladder-like structure, although it is less pronounced than that observed in the hemagglutinin gene of human influenza A virus (9), interspersed with episodes of clade replacement. In principle, clade replacement events could be produced by three contrasting evolutionary processes: (i) the clades in question differ in fitness, so the "winning" clade has a fitness advantage over the preceding clade, as suggested for influenza A virus (5); (ii) replacement events are due entirely to stochastic factors, most likely involving major fluctuations in mosquito population size and density; and (iii) clade replacement is dependent on the composite "herd" immunologic profile (antibody and T-cell priming) to each of the four serotypes. We employed a variety of computational methods in an initial attempt to tease apart the respective contributions of natural selection and genetic drift to dengue virus evolution. Strikingly, we found no evidence for positive selection in DENV-1 and DENV-3 in Thailand acting on the gene, codon, or lineage levels. Indeed, dN/dS values were usually <0.2, and frequently <0.05, indicating that the vast majority of nonsynonymous mutations that arise in DENV are slightly deleterious, fall onto the external branches (that is, the tips) of molecular phylogenetic trees, and are eventually removed from the population by purifying selection. The importance of transient deleterious mutations has been increasingly recognized in DENV evolution (12, 15, 41) and supports a "slightly deleterious" model of molecular evolution in which most mutations lower fitness and which can be fixed by genetic drift only in relatively small populations (25, 26).
Despite the preponderance of transient deleterious mutations in DENV evolution, we are unable to rule out the possibility that positive selection has acted on a small subset of amino acid changes that cannot be detected by most analytical methods, particularly where substitutions are isolated to one branch of the tree or interact epistatically. Consequently, computational methods, such as those employed here, should be used only as a way of identifying those sites of interest for future experimental analysis. This is most pertinent for those branches associated with lineage replacement. In particular, the major lineage replacement event in DENV-1, which took place in the mid-1990s, was associated with 13 amino acid changes across the viral genome. Of these, we reasoned that the changes most likely to change viral fitness were those that (i) could be classified as nonconservative in terms of their physicochemical effect; (ii) occurred at sites that rarely change in DENV evolution, suggesting that they are of great functional importance; and (iii) involved types of amino acid replacements that occur rarely in DENV evolution, again suggesting that they usually have important effects on viral protein structure. In the case of DENV-1, only one amino acid change fulfilled all these criteriathat from Gln to His at position 466 in NS3; this amino acid change is nonconservative, occurs relatively rarely in DENV evolution, and is at a site that is invariant in the other three serotypes. NS3 encodes the critical RNA helicase and has previously been shown to contain T-cell epitopes, including those that are cross-reactive for all serotypes (20, 22, 28, 45, 46), although site 466 does not fall into an epitope region defined to date. However, without further experimental verification, we are unable to determine the fitness consequences of this mutation.
The current lack of definitive evidence for selectively driven clade replacement in dengue virus has led to the reasonable conclusion that such events are due entirely to chance processes (32). Although possible, this hypothesis is difficult to reconcile with the observation that the major clade replacement event in DENV-1 took place over a number of years in the mid-1990s rather than at a single point in time, as expected with a sudden population bottleneck. Moreover, it seems unlikely that changes in vector population size or public health measures would affect clades of DENV-1 in such a different manner. We therefore considered whether the selection pressures imposed by different serotypes cocirculating in the population, which will shape patterns of cross-immunity, also had a bearing on these events. A number of observations suggest that this is indeed the case. First, there is a striking inverse correlation between the prevalence of DENV-1 and that of DENV-4 recorded in the patients attending the QSNICH in Bangkok; DENV-1 tends to peak in prevalence when DENV-4 is at low levels and vice versa (Fig. 3). More striking is that the genetic (clade) diversity within DENV-1 also peaks at times of high prevalence of this serotype and that clade replacement is associated with periods of low prevalence. Hence, the small peak in DENV-1 prevalence during the early 1980s is associated with the cocirculation of genotype I and III viruses and low levels of DENV-4. When DENV-1 then declines in prevalence, DENV-4 increases in frequency and the genotype III viruses are not further sampled. The effect was even more dramatic during the 1990s. Again, the peak of DENV-1 prevalence is associated with very low levels of DENV-4 and the cocirculation of the two genotype I clades. The subsequent disappearance of the 1980 to 1994 clade and replacement by the 1990 to 2002 clade are associated with a trough in DENV-1 prevalence and the rise of DENV-4. From this, we propose that the DENV-1 clades that survived a decline in serotype prevalence did so because they were more antigenically distinct from DENV-4 than were the DENV-1 clades that declined in frequency, which were subject to some cross-protective immune activity. A similar proposal has recently been made to explain the replacement of genotype III by DENV-1 genotype I in neighboring Myanmar (36). Consequently, although our selection analysis suggests that the amino acid changes that distinguish the clades within DENV-1 were fixed by chance at a time of relative serotype abundance, their effect on viral fitness is not realized until the wider serologic landscape changes and selection pressures become stronger. Although this hypothesis will require rigorous experimental testing, it is notable that all clade replacement events documented to date involve viruses with amino acid differences in the immunogenic E gene, which is compatible with the notion that the viruses experience different levels of cross-protection. For example, of the four amino acid changes in the E gene that distinguish the 1980 to 1994 and 1990 to 2002 clades of DENV-1, two, at positions 8 and 324, make the latter clade more genetically different from DENV-4 than the former clade.
![]() View larger version (46K): [in a new window] |
FIG. 3. Relative sampling frequency of each DENV serotype over 30 years as identified from patients admitted to the QSNICH in Bangkok, Thailand, from 1973 to 2002. The predominant dengue serotype varied through time: DENV-2, 1973 to 1986; DENV-3, 1987; DENV-2, 1988 to 1989; DENV-1, 1990 to 1992; DENV-4, 1993 to 1994; DENV-3, 1995 to 1999 (24); and DENV-1, 1999 to 2002 (A. Nisalak, unpublished observations). Note the inverse relationship between the prevalence of DENV-1 and that of DENV-4, while DENV-2 and DENV-3 cocirculated relatively constantly throughout the 30 years and only DENV-3 was associated with severe dengue epidemic years. Levels of genetic diversity and clade replacement in DENV-1 are described in the context of relative DENV-1 and DENV-4 prevalence (see Discussion).
|
We thank the nursing staff of QSNICH and AFRIMS and the previous chiefs of the Department of Virology at AFRIMS, particularly Bruce Innis, David Vaughn, and Timothy Endy, for managing the service specimen collection. We thank two anonymous referees for useful comments.
The opinions or assertions contained herein are the private views of the authors and are not to be construed as official or reflecting the views of the U.S. government.
Supplemental material for this article may be found at http://www.jvi.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»