Previous Article | Next Article ![]()
Journal of Virology, March 2006, p. 2472-2482, Vol. 80, No. 5
0022-538X/06/$08.00+0 doi:10.1128/JVI.80.5.2472-2482.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Tamara Nora,1,
Olivier Tenaillon,2
François Clavel,1 and
Allan J. Hance1*
INSERM U 552,1 INSERM U 722, Université Paris 7René Diderot, Faculté de Médecine Xavier Bichat, Paris, France2
Received 4 August 2005/ Accepted 7 December 2005
|
|
|---|
|
|
|---|
The advantages that accrue through recombination and the impact of recombination on HIV-1 pathogenesis in individual patients are not clearly defined (45). The shuffling of polymorphisms found in distinct viral quasispecies could play a role in generating viral diversity (33, 45). The ability to maintain extensive diversity may be extremely important for viral pathogenesis, because it ensures the availability of viral quasispecies able to escape changes in the selective pressures exerted by the immune response or by antiretroviral therapy. Indeed, results obtained with other models support the idea that diversity generated by recombination can be beneficial for adaptation to changing evolutionary pressures (15, 40).
In addition to generating diversity, viral recombination could also be useful in preserving existing diversity. In the course of infection in an individual patient, diversity can be threatened by evolutionary bottlenecks. Although reflective of an initial selective advantage, the emergence of a genetically homogeneous population could prove disadvantageous, because the progeny might later become susceptible to elimination by an immune response focused against shared antigenic determinants. An evolutionary bottleneck resulting from the emergence of a unique viral species with high resistance to antiretroviral agents would pose this risk. If, however, during or subsequent to emergence, such strains could recombine at high rates with preexisting strains, much viral diversity could be maintained in regions outside those responsible for the bottleneck.
Despite its potential importance, the role of recombination in generating and preserving viral diversity in vivo has been difficult to define for technical reasons. The analysis of sequences obtained by amplification and cloning of viral genomes from plasma cannot address this question, because recombination is known to occur during amplification by PCR (23, 32), and individual sequences can be over- or underrepresented in such bulk amplification products due to differences in the efficiency of amplification of individual sequences with a given pair of primers. Recombination during amplification can be avoided by performing limiting-dilution PCR, but this technically demanding approach also has certain liabilities. In particular, the possibility of recombination occurring during the reverse transcription reaction is difficult to exclude, some sequences may not be amplified using any given pair of primers and therefore would be lost from the analysis, and the amplification of a single fragment containing widely separated genomic regions (e.g., pol and env) is not feasible.
To further evaluate the extent that recombination contributes to the generation and preservation of the diversity of HIV-1 in the course of infection, we developed a technique that permits the isolation of contemporaneous clonal viral populations resulting from single infectious events by plasma-derived viruses and which therefore allows the assessment of recombination between distant regions of viral genomes from individual patients. Using this technique, we generated such clones from patients experiencing virological treatment failure, and sought evidence that (i) high rates of recombination occur throughout the viral genome in vivo, (ii) the restriction of diversity resulting from evolutionary bottlenecks in regions coding for the enzymes targeted by antiretroviral drugs need not extend to other genomic regions, and (iii) the rearrangement between gene segments directly contributes to genetic diversity.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Clinical characteristics of patients at the time of study and summary of the number of clones obtaineda
|
Sequencing of viral genomes.
DNA was extracted from infected cell pellets using a QIAamp Viral DNA mini kit (QIAGEN, Valencia, CA) and used to amplify four segments of proviral DNA, two adjacent segments in gag-pol (total of
1,300 bp) and two adjacent segments in env (total of
1,300 bp). For several patients, the testing of several sets of primers was required to identify pairs that amplified DNA from all clones of a given patient. In difficult cases, a fragment surrounding the area of interest was cloned and sequenced, permitting the design of suitable primers for amplification of proviral DNA. Amplification products were directly sequenced in both directions using an ABI automated sequencing platform (Applied Biosystems, Foster City, CA). All chromatograms were visually inspected to exclude the presence of sequences with ambiguous or polymorphic bases. Sequences were aligned using CLUSTAL X (version 1.81), and alignments in regions with insertions were verified manually.
Detection of multiple infectious events. To evaluate the relationship between the proportion of infected wells in limiting-dilution experiments and the proportion of wells infected with more than one virus, a suspension containing approximately equal numbers of two distinct clonal viruses (A and B) was prepared, and MT4-R5 cells were infected with serial dilutions of this suspension. DNA was isolated from cell pellets from infected wells, and a 670-bp fragment of RT and/or a 579-bp fragment of env were amplified by PCR. Amplified fragments were purified and digested with PstI (RT) or HindIII (env), which recognized sites present only in the fragments from virus A or virus B, respectively, thereby allowing the identification of wells infected with one or both viruses. This approach could reliably detect both sequences in mixtures in which the minority sequence represented 10% of total viral DNA. Exploring the two regions using DNA from the same infected well always gave concordant results (data not shown).
Evaluation of the evolution of viral sequences during culture. To examine whether the viral sequence, as determined by bulk sequencing, was subject to change during culture, MT4-R5 cells were infected under limiting-dilution conditions with viruses obtained by transfection of HeLa cells with an infectious proviral molecular clone derived from pNL4-3, but carrying protease resistance mutations M46I and I54V and V82A or I54V and V82A, combinations of mutations known to severely impair viral replicative capacity (29). DNA from 36 infected wells was extracted, proviral DNA encompassing the protease region was amplified, and the bulk products were sequenced.
Assessment of recombination. Phylogenetic trees were obtained by maximum likelihood using the DNAML program in the Phylip package (version 3.6). The congruence between phylogenetic trees was evaluated using the incongruence length difference (ILD) test as implemented in PAUP* (version 4.0).
Rmin was determined using the RecMin software designed by Myers and Griffiths (34). The output of this program includes the position of gene segments with obligate recombination events (in the absence of homoplasy), permitting an assessment of the distribution of recombination events.
To estimate recombination rates, the Pairwise program in the LDhat package was used (31). This program is a modification of that proposed by Hudson (18). It estimates recombination rates by combining the coalescent likelihoods of pairwise comparisons of segregating sites and has been extended to take into account the occurrence of recurrent mutations at a single site. In implementing the test, the population mutation parameter (
= 4Neµ, where Ne is the effective population size and µ is the per-site mutation rate) was determined for each sample set (52), and confidence limits for
were determined by coalescent simulation using DnaSP (48). Likelihood matrices were then constructed for each sample using the estimated value of
for population recombination parameters (
= 4Ner, where r is the per-site recombination rate) ranging from 0 to 1,000. To assess confidence intervals of
, values whose log probability was ±2 of the value with maximum probability were determined.
Assessment of nucleotide divergence. To evaluate nucleotide divergence between sequences for a given segment of the viral genome, the number of synonymous substitutions per potential synonymous site (dS) was calculated pairwise for all clones by using the modified Nei-Gojobori method (36). In this analysis, a transition/transversion ratio of 2.0 was used, the Jukes-Cantor correction was applied, and gaps, when present, were handled by pairwise deletion. The mean distance of each clone relative to all other clones from the same patient and the overall mean divergence were determined. We were interested in comparing the nucleotide divergence of different regions of the viral genome for clones from the same patient. It is known, however, that a variety of factors, such as codon usage, amino acid composition, and RNA secondary structure, can influence dS, independent of differences in mutation rate (1). To control for such differences, a training set of 11 HIV-1 subtype B sequences from the Los Alamos database was used to identify regions in gag, protease, RT, and env which met the following criteria: (i) all were coding regions, (ii) insertions and deletions did not prevent unambiguous alignment, (iii) mean dS was similar among the regions. The following regions were identified (positions correspond to the HXB2 reference strain): gag, 1963 to 2160; protease, 2253 to 2327 and 2412 to 2486; RT, 2865 to 3203; envelope C1, 6324 to 6617; envelope C3, 7110 to 7379. The overall similarity in dS of these regions was confirmed (see Results) by analyzing all HIV-1 subtype B viruses in the Los Alamos alignment database for which sequences of all regions were available (n = 43). These regions were then evaluated for all clones from each patient.
Evaluation of viral tropism. The tropism of envelope sequences was evaluated using the algorithm described by Jensen et al. (20). Variants with predicted X4 tropism were detected for 3 of 6 patients (patient 1, 5 of 25 clones; patient 3, 5 of 12 clones; patient 4, 1 of 19 clones). For a given patient, the sequence of the V3 region was identical in the different clones. The tropism of representative clones bearing these V3 sequences was evaluated by generating recombinant viruses expressing the envelopes (gp120 plus extracellular domain of gp41) and evaluating their ability to infect U373-CD4-CCR5 cells and U373-CD4-CXCR4 cells in a single-cycle colorimetric assay. The recombinant viruses from patients 3 and 4 showed exclusive X4 tropism. The recombinant viruses from patient 1 had low infectivity in this assay, and predicted tropism could not be confirmed.
Data presentation and statistical methods. All viral nucleotide positions correspond to the HXB2 reference strain. Resistance mutations are as defined by the IAS-USA Drug Resistance Mutations Group (22). Statistical comparisons of diversity (dS) were made using the Kruskal-Wallis test; posttest comparisons, performed only if P was <0.05, were made using the Dunn test.
|
|
|---|
|
View this table: [in a new window] |
TABLE 2. The proportion of wells infected with single or multiple viruses depends on the proportion of infected wells, as predicted by the Poisson distributiona
|
Clonal viruses generated from plasma of HIV-1-infected patients.
Clonal viral populations were generated from five chronically infected patients who were failing antiretroviral therapy (Table 1, patients 1 to 5), as well as one treatment-naïve patient (patient 6). For all plasma samples studied, the large majority of infectious events were detected between 13 and 29 days after initiation of culture (81% of 165 total clonal viruses). To characterize the genome of clonal viruses, DNA was extracted from infected cell pellets, and genomic regions of
1,300 bp in gag-pol and env were amplified from proviral DNA and sequenced. For the 127 clonal viruses evaluated (25, 19, 12, 19, 34, and 18 clones, respectively, for patients 1 to 6), base calls were unambiguous at all positions sequenced. In the course of the studies, proviral DNA from only two infected wells produced sequences in which ambiguous bases were identified, and these samples were excluded from analysis. Clonal viruses from a single patient grouped closely together in phylogenetic trees but were widely separated from the viruses obtained from other patients or laboratory strains, regardless of the region evaluated (data not shown). A comparison of the results of population-based genotyping of plasma viruses (available for the protease and RT for four patients) and the sequences of the clonal viruses indicated that the clonal viruses were representative of the those in the total population. For positions that were monomorphic in the plasma genotype, the consensus sequence of the clonal viruses was, without exception, identical (3,622 total bases). For the 62 positions that were polymorphic in the plasma genotype, the same polymorphisms were identified among the sequences from clonal viruses in 59 of 62 cases. In contrast, many polymorphisms identified in low proportions among the sequences of the clonal viruses were not detected by bulk genotyping of plasma viruses.
Evidence for recombination in contemporary viruses. Visual inspection of aligned sequences from contemporary clonal viruses revealed numerous examples consistent with recombination. To systematically evaluate the recombination between different regions of the viral genomes, a phylogenetic method was employed (14). For each patient, phylogenetic trees corresponding to different parts of the genome were generated using the maximum likelihood method, and the topologies were compared using the incongruence length difference test. Rejection of the null hypothesis indicates significant differences between the topologies and strongly supports the occurrence of recombination. An example is shown in Fig. 1, in which sequences that clustered together in the phylogenetic tree of the envelope C2-V4 region are indicated by different colors. In the phylogenetic tree for the envelope C1-V2 region of the same viruses, some clones remain closely associated (for example, the pairs of sequences marked by an asterisk). More commonly, however, viruses that are clustered in the C2-V4 tree are dispersed throughout the C1-V2 tree, and the topologies of these two trees are strongly incongruent by the ILD test (P < 0.001). Similarly, viruses that are clustered in the C2-V4 tree are also dispersed in the tree comparing the gag-protease region of the same viruses (P was <0.001 using the ILD test). As shown in Table 3, phylogenetic trees for different genomic regions were usually incongruent (P < 0.05) when viruses from all six patients were evaluated using this approach, a result compatible with extensive recombination between all genomic regions evaluated. For several cases, the analysis was repeated after systematic removal of single sequences or even pairs of sequences, but no such deletions resolved the incongruence between regions, indicating that incongruence was not determined by only a small number of recombination events (data not shown). The ILD test showed significant incongruence between RT and protease for only three of the six patients evaluated. It is noteworthy that the three patients for whom this test was not significant for these regions are the three patients found to have very low diversity in the protease region (see below), and limited diversity is known to impair the ability of the ILD test to detect recombination (44).
![]() View larger version (35K): [in a new window] |
FIG. 1. Phylogenetic trees corresponding to different parts of the genome of clonal viruses from a single patient are incongruent. Genomic regions corresponding to protease + RT (2253 to 3338), C1-V2 envelope (6221 to 6871), and C2-V4 envelope (6872 to 7567) for 25 clonal viruses obtained from patient 1 were sequenced, and phylogenetic trees were generated using the maximum-likelihood method. Clonal viruses that clustered together in the C2-V4 envelope tree are highlighted with the same color. Although certain clones remain clustered in the other trees (e.g., those marked with an asterisk in the middle panel), clonal viruses that cluster together in one tree are generally dispersed throughout the other trees. The trees were incongruent using the ILD test (P < 0.001 for all pairwise comparisons).
|
|
View this table: [in a new window] |
TABLE 3. Evaluation of the congruence between phylogenetic trees obtained using nucleotide sequences of clonal viruses from different regions of the viral genomea
|
Estimations of the rates of mutation and recombination.
To estimate the contribution of mutation and recombination to sequence diversity, the coalescent-based technique described by McVean et al. (31) was used to determine the population recombination parameter (
= 4Ner), where r is the per-site recombination rate per generation and Ne is the effective population size. By comparison with the population mutation parameter (
= 4Neµ), where µ is the per-site mutation rate per generation, the relative rate of recombination compared to mutation (r/µ) can be determined. As shown in Fig. 2, the population mutation parameters were not strikingly different in a comparison of the six patients (median, 0.021; range, 0.018 to 0.032). Considerably greater variation was seen in the population recombination parameters (median, 0.03; range, 0.002 to 0.11). Thus, the rates of recombination relative to the rates of mutation were 4.1, 1.8, 0.8, 0.5, 5.7, and 0.1, respectively, for the six patients. The recombination parameter did not correlate with nucleotide diversity. No obvious correlations were observed between the clinical characteristics of the patients and the population recombination parameter, although this parameter was considerably lower for the single untreated patient studied. More sequences were available from the two patients with the highest population recombination parameter (patients 1 and 5), but when analyses were repeated using randomly selected subsets of these sequences similar in number to those available for the other patients, no significant effect on the recombination parameter was observed, and the recombination parameter remained elevated (data not shown).
![]() View larger version (22K): [in a new window] |
FIG. 2. Contribution of mutation and recombination to sequence diversity. For each patient, the population recombination parameter and the population mutation parameter were determined using the coalescent-based approach described by McVean et al. (31).
|
22%), the population mutation parameters was somewhat higher than those obtained when the entire nucleotide sequence was considered but remained similar among the patients studied (median, 0.047; range, 0.033 to 0.053). Restriction of the analysis to silent mutations had only a small effect on the population recombination parameters (median, 0.034; range, 0.008 to 0.123). In this analysis, the rates of recombination relative to the rates of mutation were 2.0, 0.7, 0.7, 1.0, 2.7, and 0.2, respectively, for the six patients. We conclude that the rate of recombination among contemporary viruses is often high and can occur at a rate that approaches or surpasses the mutation rate. Diversity in different genomic regions. To evaluate sequence divergence between clones from a given patient, the dS was determined for different regions in the genome. In this analysis, the mean distance of each clone relative to all other clones from the same patient and the overall mean divergence were determined. To permit comparisons between different regions in the genome, this analysis was restricted to segments in the genome for which the extent of diversity due to silent mutations was shown to be similar when sequences of viruses infecting 43 unrelated patients were evaluated using the same approach (Fig. 3A). As shown in Fig. 3B, the extent of diversity due to the presence of synonymous substitutions could be quite different in different portions of the genome for viruses from the same patient. When the patterns of sequence divergence in different portions of the genome for all the patients are compared, however, several common features emerge (Fig. 4). In particular, sequence diversity in the protease region was often lower than that observed in other regions, and it was strikingly low in three cases (patients 1, 2, and 4). This pattern was not observed for some patients, including the individual who had not received antiretroviral therapy (patient 6). As shown above, the diversity observed in the protease region was not different from that of other regions when the sequences from 43 heterologous viruses from the Los Alamos database were compared (Fig. 3A). Similarly, the mean pairwise diversity between the consensus sequences for the six patients was, as expected, considerably higher than that seen between clones from a single patient, but no significant difference was found in a comparison of the different genomic regions (data not shown). These findings argue against the existence of selective pressure operating on synonymous sites in the segment of the protease that reduces diversity that was analyzed in these studies. Because all the patients for whom reduced diversity in the protease region was observed had been treated with protease inhibitors, the results are compatible with the possibility that this region of the genome had passed through one or more bottlenecks associated with the emergence of viruses carrying resistance mutations. To further test this idea, a similar analysis was performed using sequences in the protease region obtained from the same patient prior to and during the evolution of protease resistance (patient C in reference 5). Phylogenetic analysis suggested the occurrence of bottlenecks associated with the emergence of viruses carrying the I54V and A71V mutations between 3 and 23 months and the fixation of the I93L mutation between 24 and 34 months. As shown in Fig. 5, considerable diversity was observed in the protease region prior to and immediately following treatment with protease inhibitors but, at later times, a significant decrease in the diversity in the protease region was observed. Taken together, these findings suggest that evolutionary bottlenecks can limit diversity in the protease region but that such reductions need not extend to adjacent genomic regions.
![]() View larger version (20K): [in a new window] |
FIG. 3. Sequence diversity in different portions of the viral genome. The dS was determined for selected regions in genes coding for the C-terminal portion of gag (gag), protease (prot), RT, envelope C1 (C1), and envelope C3 (C3) using the modified Nei-Gojobori method (36). Each point represents the average pairwise distance of each sequence to all other sequences analyzed, and the bar represents the overall mean divergence. Panel A shows the analysis of all HIV-1 subtype B viruses in the Los Alamos alignment database for which sequences of all regions were available (n = 43), and panel B shows results of the analysis of the 19 clonal viruses from patient 4. The small arrows in panel B indicate groups of clones whose gag sequences were similar at all silent positions but whose sequences differed from those of the other groups by 3 or 4 silent mutations.
|
![]() View larger version (21K): [in a new window] |
FIG. 4. Patterns of sequence diversity in different portions of the viral genome observed for clonal viruses. For each patient, the dS was determined for selected regions in genes coding for the C-terminal portion of gag (gag), protease (prot), RT, envelope C1 (C1), and envelope C3 (C3) as described in the legend to Fig. 3. The result for each of the six patients is presented as the mean ± standard deviation of the average pairwise distance of each clonal virus compared to those of all other clonal viruses obtained from that patient.
|
![]() View larger version (13K): [in a new window] |
FIG. 5. Selection of protease resistance mutations is associated with a loss of sequence diversity. The dS was determined for the region of the protease gene described in the legend to Fig. 3 by using sequences obtained at different intervals after initiation of treatment with protease inhibitors. Previously published phylogenetic studies indicated that this patient experienced bottlenecks associated with the emergence of viruses carrying the I54V and A71V mutations between 3 and 23 months and fixation of the I93L mutation between 24 and 34 months (5). Results are presented as means ± standard deviations of the average pairwise distances of each sequence compared to those of all other sequences obtained at the same time point.
|
Factors influencing apparent diversity between clones. Although dS is commonly used as a measure of "evolutionary distance" resulting from the accumulation of synonymous mutations, a second factor that influenced this parameter in our data was identified. In the example shown in Fig. 3B, the mean value of dS for the gag fragment analyzed was 0.03, suggesting moderate diversity. When the distribution of values for each clone relative to all other clones is examined, however, the values fall into three distinct groups with one outlier (Fig. 3B, arrows). Within each of these groups, the nucleotide sequence of the gag fragment was identical but differed from that of the other groups by 3 or 4 silent mutations. Thus, the finding of a low value for dS was useful in identifying regions that were essentially homogeneous due to prior bottlenecks, but apparent high diversity often, but not always, reflected situations in which a small number of alternative sequences were used repeatedly by different clones (see below).
Recombination between gene segments contributes to the diversity of env. The findings presented above indicate that the diversity within short segments of the HIV-1 genome could be either variable, restricted to a relatively small number of alternative choices, or even essentially homogeneous. Nevertheless, considerable diversity could be maintained by splicing together these segments in novel combinations. To test this idea further, the amino acid sequences of the various envelope domains for clones from a given patient were compared in order to determine the number of distinct genotypes that were present in each domain and the extent to which clones that shared a similar genotype in one domain did or did not also express similar genotypes in adjacent domains. For this analysis, genotypes that differed by a single amino acid change not identified in another clone were considered to be similar.
The results obtained for patient 3 are shown in Fig. 6A. Three clearly distinct genotypes were observed in the V3 domain, one group with exclusive X4 tropism and two distinct groups with R5 tropism, and the clones have been separated into groups on this basis. The number of distinct sequences within a given domain ranged from two (C2 domain) to nine (V1 domain). For all domains, clones sharing similar genotypes were always present. Some tendency for coevolution of env domains was observed for X4 viruses, which shared closely related C3 domains (data not shown) and often used V4 domains not identified in R5 viruses (Fig. 6A). This association was not absolute, however, and one X4 virus (clone 11) used a V4 domain also identified for both of the R5 populations. In contrast, the same V1 or V2 domain could be used by viruses expressing different V3 domains, and the same V3 domain could be associated with a variety of different V1, V2, and C2 domains. As a consequence of this recombinatorial diversity, no two clones expressed the same combination of V1-to-V4 segments.
![]() View larger version (47K): [in a new window] |
FIG. 6. Recombination between envelope domains contributes to the diversity of env. At the top of each panel, the consensus amino acid sequences of the V1, V2, V3, and V4 regions of envelope for the 12 clonal viruses from patient 3 (panel A) and the 25 clonal viruses from patient 1 (panel B) are shown. Only amino acid changes different from the consensus sequence are shown for each clone. For each domain, sequences that are identical or that differ by a single amino acid substitution not identified in another sequence are highlighted with the same color. The tropism of the clonal viruses predicted by the algorithm of Jensen et al. is also indicated.
|
The results for the other four patients were generally similar to those presented in Fig. 6. The number of distinct genotypes within a given domain ranged from 1 (V3 domain, patients 2, 5, and 6; V1 domain, patient 2) to 12 (C2 domain, patient 5). The proportion of clones using the same combination of envelope segments was 0/19 (patient 2), 0/18 (patient 4), 3/35 (patient 5), and 4/18 (patient 6).
Some, but not all, of the apparent diversity within a given domain appeared to result from recombination events. For example, the V4 genotype of clone 7 from patient 3, although scored as "distinct," could be produced by recombination between clone 1 and clone 10. Many other examples were present and were observed in all domains except for the V3 domain. The V3 domain is short and often showed the least diversity; both of these factors would reduce the likelihood of identifying recombination events. Nevertheless, it is possible that recombination events that change the amino acid sequence in this domain often have a negative impact on viral infectivity and tend to be underrepresented when infectious viral clones are evaluated.
It is also noteworthy that some clones used a sequence for one domain that was markedly different from that of the other viruses from that patient (e.g., V1 domain in clone 7 from patient 3, and V4 segment in clone 1 from patient 1), but other domains of these viruses were identical to those of other viral clones, suggesting that these "outlier" segments may have been derived from recombination events with minority or archived viral populations.
Drug resistance genotypes and recombination. As shown above, diversity, as measured by the accumulation of silent mutations, was often limited in the protease and could also be low in the RT. Consistent with this finding, only limited diversity in the resistance genotypes was observed. For three treated patients (patients 1, 2, and 4), diversity was restricted to the gain or loss of one or two resistance mutations by single clones. For example, for the 25 clones of patient 1, one clone had gained the F53L mutation in protease, while other clones had lost either the V118I or L100I and K103N mutations in RT. Otherwise, all clones expressed the same 8 protease resistance mutations and the same 11 RT resistance mutations.
For two of the patients studied, two resistance mutations in protease and two resistance mutations in RT were expressed by only a subset of clones. For these four pairs of mutations, some clonal viruses expressed each mutation individually, while others expressed both mutations simultaneously (data not shown). This pattern cannot be explained by the sequential appearance of the mutations alone, and either recombination or the occurrence of more than one mutation or reversion at a single site (homoplasy) must be invoked. Considerable shuffling of protease and RT resistance genotypes was also observed for these two patients. Protease genotypes represented by more than a single clone were usually found in association with several different RT genotypes. Thus, when the protease and RT regions were considered together, the most abundant resistance genotype was expressed by only 25% (patient 3) and 38% (patient 5) of total clonal viruses. Clonal viruses isolated from patient 3 had both R5 and X4 tropism. Clonal viruses in both populations could express the same resistance genotype, and several different genotypes were identified among X4 and R5 viruses. Thus, tropism and resistance genotype did not cosegregate in this patient.
|
|
|---|
The analysis of clonal viruses proved to be a useful method to evaluate recombination among contemporaneous viruses from infected patients. Control experiments indicated that under limiting-dilution conditions, most wells had been infected with a single virus, and the wells in which more than a single infectious event had occurred could be identified and excluded after sequence analysis. In studies evaluating the replication of viruses carrying protease resistance mutations that impaired viral replicative capacity, no evidence for the reversion of mutations or the appearance of compensatory mutations was found, suggesting that the time in culture necessary to produce viral clones was not sufficient for variants occurring in vitro to emerge to the extent that their presence was perceptible in bulk amplification products obtained by amplifying proviral DNA. The sequencing of proviral DNA from infected wells in the course of these studies further supported these conclusions. In only 2 of the 129 wells evaluated were ambiguous bases identified in any of the genomic regions that were sequenced. The low frequency of sequences containing ambiguous signals further supports the idea that most wells had been infected with a single virus and that the evolution of viral sequences during culture had little or no impact on the results. By using this approach, any segment of the genome is accessible to study, and evidence of recombination or linkage between distant regions can be evaluated.
Our results confirmed studies performed using limiting-dilution PCR that contemporaneous viruses show evidence of extensive recombination. For the six patients studied here, estimates of the rate of recombination relative to the rate of mutation obtained using a coalescent approach ranged from 0.1 to 5.7, and the recombination rate exceeded the mutation rate for three individuals. Shriner et al., analyzing a different genomic region using sequences obtained by limiting-dilution PCR from a single patient, found that the recombination rate was 5.3-fold higher than the mutation rate (51). The method used in these studies assumes neutral evolution, and extensive homoplasy could result in an overestimation of the recombination rate. Analyses restricted to the evaluation of synonymous mutations, however, gave similar results, supporting the conclusion that this bias did not strongly influence the results. Further analysis of our clones suggested that recombination could have an impact on viral diversity in several distinct but important ways.
Recombination allows regeneration of viral diversity following bottlenecks. The extent of diversity in different portions of the genome, as measured by the accumulation of silent mutations, could be quite variable. In particular, low levels of diversity were often encountered for the protease region in patients receiving antiretroviral treatment. The presence of such bottlenecks is consistent with the idea that during the evolution of drug resistance, the occurrence of specific combinations of resistance mutations that give a decisive fitness advantage is followed by strong selection of viruses carrying this genotype. Previous studies evaluating the impact of treatment-induced bottlenecks occurring in the protease or RT on diversity in other genomic regions have given varying results, with evidence of reduced diversity, transient reductions in diversity, or no effects having been observed (4, 7, 10, 13, 19, 24, 37, 53). Our results suggest that because of the extensive recombination between viral quasispecies, bottlenecks in one region did not preclude the preservation of considerable diversity in other genomic regions. It should be emphasized, however, that none of the patients we evaluated had marked reductions in viral load resulting from treatment. Thus, during the time necessary for selection of an optimal protease sequence, ample opportunity was available for recombination between other loci. In contrast, patients in whom reductions in env diversity has been noted have usually demonstrated marked reductions in viral load prior to the emergence of resistant strains (10, 19, 24, 37), thereby limiting potential partners for recombination. We also observed evidence that viruses had recovered env segments from highly divergent strains (possibly from archived viruses or minority populations replicating in protected sites) through recombination, suggesting a mechanism through which env diversity could be reconstituted, even in cases where transient bottlenecks had occurred. Findings consistent with this scenario have recently been presented (24). Thus, our results suggest that recombination can help restrict evolutionary bottlenecks to minimal segments of the genome carrying the selective advantage while preserving diversity in adjacent regions.
Recombination contributes to the generation of diversity. Our results also emphasize that recombination plays an important role in the generation of diversity. When individual envelope domains were evaluated, we found that the diversity of a given domain was often limited to a relatively small number of distinct genotypes. For example, no more than four distinct V3 loop sequences were detected for any patient; for several patients, only a single V3 loop sequence was identified. Most patients studied here were known to have been infected with HIV-1 for more than 10 years and therefore had probably passed the time when peak diversity in envelope is seen (50). Recombination between envelope domains, however, was extensive. Thus, even when only limited diversity was observed in individual domains, envelope sequences, when considered in their entirety, showed considerably greater diversity. When all variable and constant envelope regions were included in the analysis, each clone from patients 1 to 4 expressed a unique combination of envelope domains and, although clones using the same combination of envelope domains were observed for patients 5 and 6, these populations represented only 3 of 34 and 4 of 18 total clones, respectively.
Similarly, recombination contributed to the generation of diversity in resistance genotypes, as has previously been observed in vitro (33). Viruses from a given treated patient expressed in common most of the protease and RT resistance mutations identified, consistent with the existence of prior bottlenecks in these regions. In most patients, however, some positions associated with resistance were polymorphic. In cases where such polymorphic sites were expressed by more than a single clone, considerable shuffling of mutations, both within the protease or RT and between these regions, was also observed. Thus, patient 5 had two polymorphic sites in RT and two in protease, but nine distinct resistance genotypes were observed, and the most common genotype was expressed by only 38% of the viral clones studied.
It should be emphasized that our analysis was restricted to viral clones that were infectious for the MT4-R5 target cells. As previously described by others (11, 25, 43), we found that the proportion of viruses in plasma that gave rise to infectious events was extremely low, varying from 0.002 to 0.016% for the six patients studied here. Both viruses using CCR5 and CXCR4 coreceptors were identified, suggesting that tropism did not restrict infectivity in MT4 cells. Numerous other factors may contribute to the apparent low infectivity of plasma viruses, including the presence of lethal mutations in the genome, structural defects in the particles, neutralization by antibodies, and loss of infectivity in the interval between venipuncture and infection of target cells. Viral recombination may also have a deleterious effect on infectivity of plasma viruses by associating incompatible gene segments in individual virions; given the high rates of recombination observed in this study, this factor may play an important role. In this regard, it is noteworthy that among the infectious clonal viruses studied here, no example of recombination in the envelope V3 loop that modified the amino acid sequence of this region was identified, whereas apparent recombination within other envelope segments was frequent. Studies comparing the envelope sequences of infectious clonal viruses and sequences derived by limiting-dilution PCR from the same sample of plasma coupled with evaluation of the functional properties of the corresponding gene segments would help define the extent that recombination may impair the infectivity of viruses in vivo.
The reasons for the prevalence of recombinogenic "sexual" reproduction in biology, a characteristic shared by HIV-1, have not been defined. Currently, two general models are often invoked: recombination could increase the rate of adaptive evolution, and/or improve the efficiency of eliminating the effects damaging mutations (15, 40). Recently, an evaluation of the impact of combinations of drug resistance mutations on fitness in the absence of drug pressure suggested that epistasis between individually deleterious mutations in HIV-1 is positive (antagonistic) (2). Although the epistasis between beneficial mutations was not examined, these findings do not offer support for the second alternative. Our findings demonstrate that recombination in HIV-1 makes an important contribution to the diversity of viral quasispecies, both by restricting evolutionary bottlenecks to gene segments and by generating novel genotypes in pol and env. This increased diversity resulting from recombination may be critical to adaptive evolution in the face of changes in immune pressure or antiretroviral therapy. Further studies evaluating the contribution of recombination to the sequential evolutionary changes observed for individual patients will be helpful in exploring this possibility.
C.C. and T.N. contributed equally to this work. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»