Previous Article | Next Article ![]()
Journal of Virology, July 2007, p. 7620-7628, Vol. 81, No. 14
0022-538X/07/$08.00+0 doi:10.1128/JVI.00083-07
Copyright © 2007, American Society for Microbiology. All Rights Reserved.

INSERM U 552, Université Paris 7 - Denis Diderot, Faculté de Médecine Xavier Bichat, Paris, France,1 INSERM U 722, Université Paris 7 - Denis Diderot, Faculté de Médecine Xavier Bichat, Paris, France2
Received 12 January 2007/ Accepted 30 April 2007
|
|
|---|
|
|
|---|
Following changes in antiretroviral treatment, recombination could influence the viral evolution in two distinct ways. First, recombination may contribute to the development of drug resistance. Consistent with this idea, several groups have shown that when cell lines are infected in vitro with two viruses carrying distinct resistance mutations and cultured in the presence of antiretroviral drugs, double mutants produced by recombination that express an improved drug resistance profile can readily be selected (21, 29, 36, 57). However, this process has not been clearly documented in vivo.
Recombination also has the potential to influence viral diversity during the development of drug resistance. Following a change in antiviral therapy, the emergence of viruses expressing mutations required for resistance to the new regimen has been shown to be associated with a decrease in viral diversity (14, 24, 30, 37, 47, 58). Consistent with the occurrence of a genetic sweep, this loss of diversity can affect both the region where novel resistance mutations have emerged and regions not directly subjected to the change in selective pressure, such as the env gene. Although a decrease in viral diversity in env has been described for this setting, such reductions in diversity may be transient (14, 30, 58) and have not been identified for all patients (4, 16, 24, 30, 47). A potential explanation for the preservation of genetic diversity outside the region under selection following a genetic sweep is the occurrence of recombination between the newly emerging virus and the preexisting majority viral population (30, 37).
Despite its potential importance, the direct demonstration that recombination either contributes to the evolution of drug resistance or influences viral diversity in individual patients has been hampered by technical problems inherent in the evaluation of viral sequences amplified by RT-PCR from clinical samples. First, it is difficult to establish whether putative recombination events observed in sequences amplified from clinical samples reflect events occurring in vivo or were introduced during the amplification reaction (27, 35). Second, sequences from widely separated regions must be amplified in separate reactions. Thus, it is impossible to directly compare diversity in such widely separated genomic regions from the same viruses. Recently we described an approach based on the analysis of contemporaneous clonal viral populations that overcomes these limitations (8). The analysis of such clones from several patients demonstrated that extensive recombination is occurring in vivo, confirming studies performed using limiting-dilution PCR (49).
Using this approach, we have evaluated clonal virus populations obtained from the same patient before and after a treatment change that, while unsuccessful in controlling viral replication, led to the emergence of viruses expressing an alternative pattern of resistance mutations. We observed that (i) the pol gene of the viruses that emerged after the treatment change was created through recombination; (ii) the emergence of the recombinant viruses was associated with a loss of diversity in the regions of the pol gene that were under selective pressure, but diversity was preserved in other genomic regions; and (iii) recombination events between the emerging recombinant virus and the prior majority viral population made a contribution to this preservation of diversity in regions not under selective pressure by antiviral treatment.
|
|
|---|
![]() View larger version (13K): [in a new window] |
FIG. 1. Treatment history and evolution of viral load and CD4+ T-cell counts in the study subject. The log10 viral load (solid symbols) and CD4+ T-cell counts (open symbols) are shown as a function of time after initiating treatment with antiretroviral agents. The agents received by the patient are indicated below the graph. Abbreviations: ABC, abacavir; APV, amprenavir; ddI, didanosine; EFV, efavirenz; 3TC, lamivudine; LPV, lopinavir; NFV, nelfinavir; NVP, nevirapine; RTV, ritonavir; SQV, saquinavir; d4T, stavudine; TDF, tenofovir; ddC, zalcitabine; AZT, zidovudine. The open triangles indicate the times at which clonal viral populations were obtained (left and right, M77 and M90, respectively).
|
Sequencing of viral genomes. DNA was extracted from infected cell pellets using a QIAmp viral DNA mini kit (QIAGEN, Valencia, CA) and used to amplify proviral DNA corresponding to the following nucleotide regions: (i) HBX2 nucleotides 1147 to 2549, coding for the C-terminal portion of Gag and all of protease; (ii) nucleotides 2670 to 3308, coding for amino acids 41 to 253 of RT; and (iii) nucleotides 6246 to 7568, coding for the C1, V1/V2, C2, V3, C3, and a portion of the V4 regions of envelope. Amplification products were directly sequenced in both directions using an ABI automated sequencing platform (Applied Biosystems, Foster City, CA). Sequences were aligned using CLUSTAL X (version 1.81) (53), and alignments in regions with insertions were verified manually. All chromatograms were visually inspected, and none contained sequences with ambiguous or polymorphic bases.
Standard genotypic resistance testing was performed on bulk plasma viral RNA using the consensus technique of the ANRS Resistance study group (42) or the TruGene HIV-1 genotyping kit (Bayer Healthcare, Eragny, France).
Analysis of nucleotide sequences. All viral nucleotide positions correspond to the HXB2 reference strain. For the sake of this study, a resistance mutation is defined as mutation that satisfied either or both of the following two criteria: (i) a resistance mutation identified as such by the expert panel of the International AIDS Society-USA (26) and/or (ii) a nonpolymorphic treatment-selected mutation identified in the Stanford HIV drug resistance database (44). The following antiretroviral drugs were used: abacavir, amprenavir, didanosine, efavirenz, lamivudine, lopinavir, nelfinavir, nevirapine, ritonavir, saquinavir, stavudine, tenofovir, zalcitabine, and zidovudine.
Neighbor-joining phylogenetic trees and bootstrap scores (1,000 replicates) were obtained using the Mega package (version 3.1) (31). Maximum-likelihood (M-L) phylogenies were obtained using the HKY85 model; nucleotide substitution and gamma distribution were estimated from the original data using PAUP* (version 4.0b10) (50). Topological incongruence of the M-L topologies was assessed using the incongruence length difference test as implemented in PAUP*.
S-H tests. To evaluate the relatedness of different portions of the pol sequence of clonal viruses obtained at 90 months to viral sequences obtained earlier in the patient's course, the following approach was used. For each region of the pol sequence considered, an unconstrained M-L phylogeny was generated and compared to tree topologies in which the M90 sequences were constrained to be monophyletic with each of the earlier pol sequences. These comparisons were made using the Shimodaira-Hasegawa (S-H) test (20, 48) as implemented in PAUP*. A significant result in this test (P < 0.05) indicates that the constrained topology is significantly worse and therefore incongruent with the unconstrained topology.
Nucleotide diversity and spectrum analysis. Nucleotide distance was determined using the Tajima-Nei method (52), as implemented in MEGA; gaps, when present, were handled by pairwise deletion. Statistical comparisons of diversity were performed by one-way analysis of variance; posttest comparisons, performed only if P values were <0.05, were made using Bonferroni's multiple comparisons test. Tajima's D statistic was calculated without an outgroup as described previously (51) using DNA-SP (46). Fay and Wu's H statistic for different genomic regions of the month 90 clones was determined using the consensus sequence for month 77 clones as an outgroup (17). Coalescent simulations to determine the statistical significance of the H statistic were performed using DNA-SP.
|
|
|---|
![]() View larger version (29K): [in a new window] |
FIG. 2. Graphical representation of the protease (top) and RT (bottom) genotypes obtained for the study subject. Each bar represents a codon that was polymorphic in at least one of the sequences obtained from the patient. The bars are color coded as follows: red, resistance mutation as defined in Materials and Methods; green, alternative resistance mutation found at the same position; dark blue, other nonsynonymous polymorphism; light blue, alternative nonsynonymous polymorphism found at the same position; violet, synonymous polymorphism. For the genotypes obtained by sequencing bulk cDNA obtained from plasma by RT-PCR (M40, M45, M64, M66, M69, and M71), the consensus sequence is shown. For sequences representing a single clonal virus (M77 clone D11D and M90 clone C8G), all polymorphisms are shown. For sequences representing groups of clones (the group of 17 M90 clones [M90 clones x17], M90 clones x16, and M77 clones x28, etc.), only polymorphisms expressed by the majority of the sequences in the given group are shown, although most polymorphisms were present in all clones in a given group. Resistance mutations were expressed by all the clones in a given group with the following exceptions: for protease, M77 clones x28, one clone expressed the V82I resistance mutation, not the V82A mutation, and for RT, M77 clones x24, one clone did not express the K70R mutation. Boxes surround genotypes obtained at earlier times that are most similar to the sequences of the M90 clones in the regions coding for amino acids 15 to 99 of protease (region 1), 41 to 100 of RT (region 2), and 101 to 236 of RT (region 3). The positions of all resistance mutations encountered in these sequences are also identified.
|
The evaluation of nucleotide sequences obtained at earlier times in the patient's treatment history indicated that viruses expressing the combination of resistance mutations in protease and RT identified at M90 had not previously been identified. We noticed, however, that a pattern of resistance-associated mutations very similar to that observed at M90 could be produced by associating the C-terminal portion of the protease observed at M69 (a time when the patient was undergoing a structured treatment interruption), the N-terminal portion of the RT observed at M66 or M71, and the C-terminal portion of the RT observed at M40 (boxed regions in Fig. 2). The pattern of resistance mutations of this putative recombinant virus would be identical to that observed at M90, with the exception of the T215I mutation in lieu of the T215F mutation. The T215F resistance mutation requires two nucleotide changes and is often preceded by the T215I mutation. In addition, all neutral and silent polymorphisms present in the putative M69+M66/M71+M40 recombinant virus are also present in the M90 sequence.
To further evaluate the possibility that the pol region of M90 viruses was produced through recombination events, the sequences were divided into the three regions indicated in Fig. 2, and phylogenetic studies were performed. As shown in Fig. 3, all M90 clones were monophyletic with the M69 nucleotide sequence in the neighbor-joining tree covering region 1 (C-terminal portion of protease), and all M90 clones except for clone C8G were monophyletic, respectively, with the M66/M71 sequences for region 2 (N-terminal portion of the RT) and the M40 sequence for region 3 (C-terminal portion of RT). As expected, the M90 clone C8G clustered with the M77 clones in regions 2 and 3 but not region 1. Compatible with the occurrence of recombination events between these regions, the phylogeny for each of these three regions was incongruent with those of the other two regions according to the incongruence length difference test (P
0.02 for each comparison).
![]() View larger version (24K): [in a new window] |
FIG. 3. Neighbor-joining phylogenetic trees for the three genomic regions identified in Fig. 2 and covering the sequences coding for amino acids 15 to 99 of protease (A), 41 to 100 of RT (B), and 101 to 236 of RT (C). The genotypes obtained by sequencing bulk cDNA obtained from plasma by RT-PCR (Geno) and sequences from individual clones obtained at M77 and M90 are shown. The names of the bulk genotypes that grouped most closely in each region with the clones obtained at M90 have been boxed. Selected bootstrap scores are also shown. The scale line at the bottom of each tree represents 0.5% divergence.
|
|
View this table: [in a new window] |
TABLE 1. S-H tests of topological incongruence for regions of the pol genea
|
Impact of the emergence of viruses expressing a novel pol sequence on viral diversity. Nucleotide diversity in gag, pol, and env at M77 and M90 is shown in Fig. 4A. The diversity of the viruses expressing a new pattern of resistance mutations that emerged at M90 was low in both the protease and RT regions, and nucleotide diversity in the RT region was significantly lower than that observed at M77. In contrast, diversity in the gag and env regions was significantly higher than that observed for protease and RT (P < 0.01 for all comparisons), and no significant difference in viral diversity was observed for gag and env when sequences obtained at M77 and M90 were compared. If the low nucleotide diversity seen for protease and RT and the reduction in diversity observed for RT between M77 and M90 resulted from the occurrence of a selective sweep, a significant excess of low-frequency allelic variants would be expected, producing negative values for Tajima's D statistic. Indeed, in both the protease and RT, D was significantly negative at M90 and had decreased compared to the value observed at M77 (Fig. 4B). In the gag region, however, D was not significantly negative, and for the env region, the value had increased compared to the value obtained at M77.
![]() View larger version (14K): [in a new window] |
FIG. 4. Comparison of nucleotide diversities and Tajima's D statistic values for different genomic regions of clones obtained at months 77 (open bars) and 90 (solid bars). (A) Nucleotide distance was calculated pairwise for all clones using the method of Tajima and Nei (52), and results are expressed as the mean ± standard error of the mean. (B) Tajima's D statistic was determined as described previously (51). Significant departure from neutrality is indicated by asterisks (*, P < 0.05; **, P < 0.01).
|
![]() View larger version (15K): [in a new window] |
FIG. 5. Fay and Wu's H statistic values for different genomic regions of clonal viruses obtained at month 90. The H statistic was determined using the consensus sequence of M77 clones as an outgroup. In the upper panel, a sliding window representation of the H statistic is shown (window length, 100; step size, 25); the corresponding regions of the genome are shown in boxes below. The H statistic was also determined for each of the genomic regions, and the significance of the value compared to neutral expectations was determined by coalescent simulation, as implemented in DNA-SP.
|
In addition, if polymorphisms were recovered by recombination with viruses in the preexisting majority population, polymorphisms with a higher allele frequency in the preexisting viral population should have a higher probability of being present following the evolutionary sweep. Polymorphisms were identified at 161 positions in envelope sequences obtained at M77. For polymorphisms present in only a single sequence, 26/107 (24%) were also polymorphic at M90, whereas 28/54 (52%) of polymorphisms present in two or more sequences at M77 were also present at M90 (P < 0.001 using Fisher's exact test).
Consistent with extensive recombination, the comparison of the amino acid sequences for individual envelope regions identified numerous examples where two or more alternative haplotypes present in the viral population before the evolutionary sweep had been preserved in viruses obtained after the sweep. For example, of the five alternative V4 region haplotypes present at M77, three were identified at M90 (Table 2).
|
View this table: [in a new window] |
TABLE 2. Preservation of alternative haplotypes in the envelope V4 region after an evolutionary sweep in pola
|
|
|
|---|
Recombination and drug resistance. Studies performed in vitro have clearly shown that following simultaneous infection of cells with two viral strains expressing distinct drug resistance mutations, recombinant viruses that express both mutations and exhibit improved drug resistance can be selected (21, 29, 36, 57). The extent to which this process contributes to the development of drug resistance in vivo is less clear. Mathematical modeling indicates that whether recombination would accelerate or retard the appearance of such double mutants in vivo is dependent on numerous factors, including viral population size, the rates of mutation and recombination, and epistatic effects (1), none of which are accurately defined. Thus, although recombination events have been demonstrated to occur in vivo (8, 33, 42, 49), the contribution of this process to the evolution of drug resistance has not previously been clearly documented.
For the patient evaluated here, however, a treatment change was associated with the emergence of recombinant viruses carrying gene segments derived from distinct viral populations previously identified in the patient, indicating that recombination can participate in the evolution of drug resistance. The sequence of segments forming the protease and the N-terminal portion of the RT of the viruses identified at M90 were essentially identical to the sequences of these segments in viruses identified at M69 and at M66/M71. The segment most similar to the C-terminal portion of the RT was identified at M40, but viruses present at this time expressed the T215I resistance mutation, not T215F. Thus, the actual donor sequence contributing the C-terminal portion of the RT, although closely related to the M40 sequence, may have already undergone the additional mutational event required to produce T215F. Alternatively, this event could have occurred after the formation of a recombinant virus. It is also noteworthy that M90 viruses carried five synonymous base changes that had not been identified in the consensus sequences of viral populations previously identified in the patient and were not present in the M77 clonal viruses. The fixation of silent mutations is to be expected during a genetic sweep, such as would occur during the emergence of a recombinant virus.
Several factors helped us identify these viruses as true recombinants. First, the evaluation of biological clones, instead of viral sequences obtained by RT-PCR, assured that the sequences obtained were colinear in the same virus. Thus, artifacts due to recombination occurring during PCR were not a concern. Second, nucleotide sequences of the pol region had been obtained at multiple times during the development of resistance in this patient, allowing us to identify potential donor fragments involved in recombination. Third, the magnitude and nature of the changes in the pattern of resistance mutations observed was helpful in excluding the possibility that the M90 viruses arose through convergent evolution. In particular, only 8 months after discontinuation of treatment with protease inhibitors, many but not all of the resistance mutations in the protease were no longer detected at month 90. Studies evaluating patients primarily infected with multidrug-resistant viruses have repeatedly found that resistance mutations are very slowly lost through back-mutation (2, 3, 6, 13, 19, 41), excluding the possibility that the virus identified at M90 could have arisen through convergent evolution from a virus expressing resistance to protease inhibitors. Conversely, the persistence of protease resistance mutations in the N-terminal portion of the protease, as well as the pattern of resistance mutations and other polymorphisms present in the RT, was not consistent with convergent evolution from a virus not previously expressing drug resistance. Taken together, these arguments support the idea that recombination was responsible for creating the novel pattern of resistance mutations observed for the viruses that emerged at M90 following the change in treatment. Interestingly, a number of mutations associated with resistance to RT inhibitors have been shown to directly increase the rate of viral recombination (38). Thus, the development of drug resistance may promote the emergence of such recombinant viruses.
Impact of recombination on viral diversity. The emergence of viral variants carrying a novel pattern of resistance mutations following changes in antiretroviral therapy, such as the recombinant viruses described here, can lead to a loss of diversity throughout the viral genome, even when selective pressure is exerted only upon the pol region. Indeed, reductions in viral diversity in env associated with the emergence of drug-resistant viruses have been seen in some studies (14, 24, 30, 37, 47, 58), but such changes can be transient and are not detected for all patients (4, 16, 24, 30, 47). The recent study by Kitrinos et al. (30) using heteroduplex analysis found that losses in diversity in env were most marked when treatment changes produced an initial large reduction in viral load, whereas little or no loss of heterogeneity in envelope regions was seen for patients whose viral load did not initially decrease. Based on these findings, these authors suggested that an initial drop in viral load reduces the likelihood of recombination between the emerging virus and the prior majority population, which otherwise preserved diversity in env. Our results provide strong genetic support for this model. In the patient evaluated here, a marked fall in viral load following the treatment change did not occur. The continued replication of quasispecies in the prior majority population permitted recombination events to occur between this population and viruses expressing the new pattern of resistance mutations in pol that subsequently became dominant. Consistent with this interpretation, the envelope sequences present after the selective sweep in pol appeared to be derived through recombination from those expressed by viruses present before the sweep, as indicated by the extensive sharing of the same polymorphisms in the two populations and the preferential retention of polymorphisms present at higher frequency before the genetic sweep in the viral population emerging after the sweep.
Previous studies have found that viral populations expressing different resistance profiles, including those with suboptimal drug resistance, can continue to replicate in "sanctuary sites" (7, 15, 40, 43, 54, 55). It is noteworthy that treatment changes may influence the repertoire of resistant viruses available to participate in recombination events. In our patient, the discontinuation of protease inhibitors was associated with striking changes in the pattern of RT resistance mutations (including the loss of resistance mutations V118I, Y181C, G190A, L210W, and 218E), despite continued treatment with the same RT inhibitors. These mutations, which had accumulated during treatment with other RT inhibitors, including nonnucleoside RT inhibitors, were not required for optimal resistance to the currently used regimen. The replication of viruses not expressing these RT mutations (e.g., the viruses present at M40) may have been suppressed by treatment with amprenavir, because protease resistance was less developed in these viruses. Following the discontinuation of protease inhibitors, the improved replication of these viruses would favor their participation in recombination events, thereby permitting the elimination of RT resistance mutations not required for resistance that exerted a fitness cost. These findings raise the possibility that when treatment changes are anticipated for patients failing a current regimen, a reinforced regimen including drugs effective against viruses expressing patterns of resistance mutations previously detected in the patient might be useful until viral load has fallen to low levels. Even if these drugs are ineffective against the current majority viral population, the enhanced suppression of the replication of viruses expressing alternative genotypes might reduce the risk that preexisting resistance mutations can be recovered through recombination, thereby limiting one avenue for rapid viral evolution.
We gratefully acknowledge the help of Florence Damond and Diane Descamps in performing these studies.
Published ahead of print on 9 May 2007. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»