| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Previous Article | Next Article ![]()
Journal of Virology, November 2006, p. 10591-10599, Vol. 80, No. 21
0022-538X/06/$08.00+0 doi:10.1128/JVI.00644-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
,
Haynes W. Sheppard,3 and
Julie A. E. Nelson1*
Department of Molecular Genetics, Biochemistry and Microbiology,1 Department of Medicine, University of Cincinnati, Cincinnati, Ohio,2 California Department of Health Services, Richmond, California3
Received 30 March 2006/ Accepted 21 August 2006
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The V1-V2 and V4-V5 regions of gp120 are highly variable in sequence and length within an infected individual and between infected individuals (19, 37, 41). The domains are heavily glycosylated, and changes in the location and number of N- and O-linked glycosylation sites in the V1-V2 and V4-V5 regions are associated with escape from neutralization (reviewed in reference 29). Removal of some of the glycosylation sites from the V1 and V2 regions results in increased immunogenicity of the domains (31) and has been shown in one study to redirect the immune response toward V3 rather than V1-V2 (7). Sequence changes observed in V1-V2 and V3 to V5, including insertions, deletions, and point mutations, have been linked to escape from neutralizing antibodies. The accumulation of nonsynonymous substitutions was associated with escape in two separate studies (11, 12), while changes in the glycosylation sites of gp120 have been described in terms of an evolving glycan shield that leads to escape from neutralizing antibodies in a third study (42). It has also been shown that mutations in env lead to escape from neutralizing antibodies through conformational masking of epitopes (21). Mutations in env have also been shown to affect cytotoxic T-lymphocyte (CTL) epitopes (11, 17), indicating that both neutralizing antibodies and CTL apply selective pressure on HIV-1 in vivo. High levels of env sequence diversity have been linked with both slower disease progression and more-effective immune responses against the virus, both in simian immunodeficiency virus and HIV (3, 8, 14, 16, 26), thereby implying a link between strong immune selection and slower disease progression.
Heteroduplex assays are powerful methods for displaying the number and variety of variants within a viral population (reviewed in reference 4) without the potential selection bias inherent to sequence-based analyses. Heteroduplex mobility assays reveal the diversity within a sample through the visualization of heteroduplexes formed between different variants within the sample, while heteroduplex tracking assays (HTA) use a radiolabeled probe to display differences between the probe and the sample. We have previously used HTA to examine the diversity and changes in V1-V2 in monthly samples from subjects with low CD4 counts (18). In 12 of 21 of these subjects, at least one V1-V2 variant population was gained or lost over 5 to 9 months. Sequence analysis of the V1-V2 regions for several of the subjects revealed point mutations, recombination events, and deletions as the main mechanisms of sequence change. Delwart et al. analyzed the V3 to V5 variant populations in semiannual samples from subjects with different rates of progression, showing that subjects with faster CD4+ T-cell decline had slower diversification of the region from V3 to V5 and that progression of disease was correlated with reduced diversity (8). These latter results support the idea that the level of sequence diversity directly correlates with immune selection.
In the present study, we have used HTA analysis to examine the V1-V2 and V4-V5 regions of HIV-1 env in semiannual plasma samples from subjects in the San Francisco Men's Health Study. Subjects with different rates of progression, as defined by rate of CD4 cell loss, were compared for rates of change in the env viral population using a newly developed HTA index algorithm that highlights periods of increasing env diversity.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Viral RNA isolation and RT-PCR. Viral RNA was isolated from 140 µl of plasma using the QIAamp viral RNA mini kit (QIAGEN) following treatment with heparinase I (Sigma) or from 200 µl plasma using the QIAamp UltraSens virus kit (QIAGEN). In both cases, RNA was eluted with 60 µl buffer. For samples with less than 20,000 copies/ml, 0.5 ml of plasma was pelleted prior to RNA isolation. V1-V2 reverse transcription-PCR (RT-PCR) conditions were modified from those of Kitrinos et al. (18) as follows. Primers for RT-PCR were V1new (HXB numbering 6548 to 6577) (5'-AATCAGTTTATGGGATCAAAGCCTAAAGCC-3') and V2 (HXB numbering 6951 to 6980) (5'-CTTAATTCCATGTGTACATTGTACTGTGCT-3') or V4 (HXB numbering 7349 to 7378) (5'-TTTTAATTGTGGAGGGGAATTTTTCTACTG-3') and V5 (HXB numbering 7647 to 7676) (5'-ATATAATTCACTTCTCCAATTGTCCCTCAT-3') for V1-V2 and V4-V5 amplification, respectively. The V4-V5 primers were anchored on invariant positions in the subtype B consensus sequence from the HIV Sequence Database (http://hiv-web.lanl.gov) and bounded the V4 and V5 coding regions. RT reactions were generated via a two-step method as follows. (i) Five microliters of viral RNA (at least 200 copies of RNA), 2 mM (each) deoxynucleoside triphosphates, and 15 pmol primer V2 or V5 were mixed, heated to 85°C for 5 min, and then cooled on ice. (ii) Eight microliters of an RT mix (1x RT buffer, 2.5 mM MgCl2, 6.5 mM dithiothreitol, 20 U RNase OUT, and 100 U Superscript III reverse transcriptase [Invitrogen]) was added to each tube. RT reactions were incubated first at 55°C for 1 h and then at 70°C for 15 min to inactivate the enzyme. After a brief cooling of the reaction mixtures, 2 U of Escherichia coli RNase H (Invitrogen) was added to each reaction mixture and incubated at 37°C for 20 min to degrade residual RNA. The reaction mixtures were cooled, and 35 µl of a PCR mix (1x High Fidelity buffer [Invitrogen], 1.25 mM MgSO4, 2 mM deoxynucleoside triphosphate mix, 7.5 pmol primer V2 or V5, 15 pmol primer V1new or V4, and 2.5U High Fidelity Platinum Taq DNA polymerase [Invitrogen]) were added. The PCR mixtures were amplified in a Stratagene Robocycler 40 with the following program: 1 cycle of 95°C for 3 min, 55°C for 1 min, and 72°C for 2 min; 9 cycles of 95°C for 1 min, 55°C for 1 min, and 72°C for 2 min; 10 cycles of 95°C for 1 min, 55°C for 1 min, and 72°C for 3 min; 10 cycles of 95°C for 1 min, 55°C for 1 min, and 72°C for 4 min; 4 cycles of 95°C for 1 min, 55°C for 1 min, and 72°C for 5 min; and a final cycle of 95°C for 1 min, 55°C for 1 min, and 72°C for 10 min. Expected product lengths were approximately 420 bp for V1-V2 and approximately 350 bp for V4-V5. Each RT-PCR was performed at least twice.
Heteroduplex tracking assays.
The HTA probes were generated using the same RT-PCR primers as described above. For each region, two probes from molecular HIV-1 clones were generated. The Ba-L (35) and JR-FL (20) sequences were selected for V1-V2 analysis, while the NL4-3 (2) and YU-2 (25) sequences were selected for V4-V5 analysis because of the availability of the sequences and their length differences in these regions. Initial RT-PCR products from a subject were analyzed with both probes to determine which probe resulted in the best separation of the bands; this probe was used for subsequent HTA analysis for that subject. The V1new and V2 primers were used to amplify the V1-V2 region from the Ba-L and JR-FL envelope clones (from Nathaniel Landau and Irvin Chen, respectively). The V4 and V5 primers were used to amplify the V4-V5 region from the NL4-3 and YU-2 envelope clones (from Malcolm Martin and the NIH AIDS Research and Reference Reagent Program). All of the PCR products were cloned using the Perfectly Blunt cloning kit (pT7Blue-3 for the V1-V2 probes and pT7Blue for the V4-V5 probes, both vectors from Novagen). The downstream EcoRI site was filled in using Klenow fragment of DNA polymerase I for each V1-V2 probe. Probes were end labeled essentially as described previously (28) by digesting the plasmids with EcoRI, adding [
-35S]dATP and Klenow fragment to radioactively label one end of the probe sequence, and digesting the plasmids with PstI (V1-V2) or NheI (V4-V5) to release the probe. Radiolabeled probes were purified using ProbeQuant columns (Amersham).
Heteroduplex reactions contained 8 µl RT-PCR product, 1x annealing buffer (0.1 M NaCl, 10 mM Tris-HCl [pH 7.5], 2 mM EDTA), 0.5 to 1 µl labeled probe, and 0.1 µM V1 or V4 primer. Reaction products were denatured at 95°C for 2 min and then cooled at room temperature for 5 min before loading onto 6% (V1-V2) or 7% (V4-V5) polyacrylamide gels. Gels were run at 20 mA, vacuum dried, and exposed to both X-ray film and a PhosphorImager plate (Molecular Dynamics).
Duplicate RT-PCR products were analyzed separately by HTA to compare the patterns. If the two products from the same RNA sample did not have very similar patterns, a third RT-PCR was run to determine whether one of the first products should be excluded. Two products with very similar patterns were obtained for all RNA samples from which RT-PCR was successful. Missing lanes in the HTA are due to a lack of amplification from that sample.
Cloning of V1-V2 RT-PCR products. RT-PCR products were cloned into the pT7Blue-3 vector according to the manufacturer's instructions (Perfectly Blunt cloning kit; Novagen). Clones were screened by colony PCR and HTA analysis as described previously (18). Clones of interest were prepared and sequenced by the University of Cincinnati DNA Core Facility.
Data analyses.
The phosphorimaging data were used to quantify the relative abundance of each band within a sample using FragmeNT Analysis software (Molecular Dynamics). Individual bands/variants were identified as previously described (18). Bands were compared for migration distance using plots generated using ImageQuant software (Molecular Dynamics). Plots were aligned using the single-stranded probe band. The relative migration distance and relative abundance of the bands were used to generate matrices representing the HTA patterns. The difference between lanes representing 6-month intervals was calculated as total change using the formula:
![]() |
Ai =
Bi = 1. A band that is present in only one lane is assigned a frequency of zero in the lane from which it is absent. The absolute value of each difference was taken to allow for both increases and decreases in relative abundance. The sum of all differences is divided by two to reflect the fact that each increase in relative abundance results in a concomitant decrease. Thus, two lanes with no bands in common (reflecting a complete turnover in the virus population) are assigned a total change of 100%. Shannon entropy (S) was calculated for each lane using the following equation:
![]() |
A major limitation of the total change and entropy measures previously described is that they do not highlight the emergence of new bands in the context of other band pattern changes and do not account for differences in time intervals between samples, limiting the intervals that could be compared. Thus, we developed a new mathematical algorithm that emphasizes the emergence of new bands, termed the HTA index, or
, for each lane B in a gel as a comparison of lanes A and B. It is described by the following equation:
![]() |
), the amount of variability in lane B (
), and the difference in the time interval from the standard 6 months (
). The HTA index and its derivation are described in more detail in the supplemental material. The HTA index algorithm can be downloaded at http://www.hepato-site.net/evoindex/.
Statistical analyses.
To confirm the extent to which each predictor variable affected the overall index for each patient, simulations were generated by varying each predictor by a predetermined amount within reasonable ranges given the observed data. All components of the HTA index were evaluated in this systematic manner. For regression of HTA index simulation results on predictor variables, dummy variables were created for changes in viral load (stable versus changing, increasing versus not increasing, decreasing versus not decreasing) and new band count. Predictor variable contribution to index variability was assessed by the adjusted R2. Goodness of fit was assessed by assessing normality and homoskedasticity of residuals. Due to observed heteroskedasticity, robust standard errors were generated as suggested by Davidson and MacKinnon (7a). Manual backwards stepwise multiple linear regression was performed on all predictor variables examined in univariate analysis, first fitting the full model and then eliminating factors that were insignificant or colinear. All analyses were performed in Stata version 8.0 (Stata Corp., College Station, TX). In all analyses, a two-tailed alpha of
0.05 was considered statistically significant.
Total change, entropy, and HTA indices were transformed and compared for each region (V1-V2 and V4-V5) in Stata (version 8.0; Stata Corp., College Station, TX) using Student's t test with the Satterthwaite correction for unequal variance. Because total change values (expressed as a decimal) fall between 0 and 1, the logit transformation was used. The HTA index was normalized via log10 transformation.
The outcomes of percent change over time and entropy were assessed for predictor variables, including rapid/slow CD4 loss, CD4 count (continuous) and category (above/below 150 cells/µl), and HIV viral load, via a generalized estimating equation (GEE) model in SAS (version 8.0; SAS Institute, Cary, NC) with an inverse Gaussian distribution and identity link, and exchangeable correlation structure. The GEE method was used because it is highly robust to deviations from normality and correlation structure can be specified. This model is able to parse within-subject variability from between-subject variability with great flexibility, meaning that the model accounts for differences in sampling time or number and differences in magnitude of within-subject correlation (10). Model results are reported as increase/decrease in total change compared to referent group, P value for the difference, and 95% confidence interval (95% CI). Model fit was assessed by comparing naïve and robust standard errors, which did not differ by more than 10%.
Pairwise correlations were used to assess correlations between (i) V1-V2 and V4-V5, (ii) total change and entropy, (iii) HTA index and entropy, and (iv) HTA index and total change.
Nucleotide sequence accession numbers. The V1-V2 sequences from subjects 1048 and 778 have been deposited in GenBank under the accession numbers DQ885201 to DQ885217.
| RESULTS |
|---|
|
|
|---|
|
|
Because of the variable number of intervals per subject, we examined the relationship between the CD4 count to the degree of band changes in V1-V2 and V4-V5 over time using GEE models including total change values for all 6-month intervals and entropy values for all samples. The models indicated that the CD4 count was predictive of total change, with each increase in CD4 count of 10 cells/µl predicting 0.4% increases in both total V1-V2 and total V4-V5 changes (for V1-V2, 95% CI of 0.1 to 0.7, P = 0.006; for V4-V5, 95% CI of 0.2 to 0.6, P < 0.0001). However, when intervals with CD4 counts less than 150/µl were excluded from the data set, CD4 count was no longer predictive of total change (P = 0.80 for V1-V2 and P = 0.21 for V4-V5). Other variables including viral load and presence of X4 variants (J. A. E. Nelson, unpublished data) did not show any significant correlation with total change for either region. GEE analysis did not demonstrate a significant association between CD4 count and changes in entropy over time for either V1-V2 or V4-V5 regions (P = 0.30 and 0.23, respectively). Therefore, the low CD4 count intervals (mostly from subjects 778 and 1048) were responsible for the predictive value of CD4 count on total change; there was no difference in the amount of change over the course of infection in the other subjects.
A novel index of viral evolution reveals the emergence of new variants. A limitation of the calculation of total change in the variant populations using the data from this cohort was that the length of time between samples was not considered. Although the San Francisco Men's Health Study involved 6-month visits, in many cases samples were not available such that there were time intervals significantly longer than 6 months. In addition, neither total change in HTA pattern or entropy highlighted the emergence of new bands as an indicator of env evolution. Therefore, we developed a new algorithm, the HTA index, that combines quantitative aspects of the total change calculation and the band distribution of entropy. Terms for the emergence of new bands and a time interval correction were also added (see supplemental material for a full discussion). Importantly, the HTA index is not dependent on the total number of samples per subject, and thus, the index is valid whenever there are two chronologically adjacent lanes. Simulations of the HTA index in which single variables were changed indicated that the baseline band count (in the first lane of adjacent lanes) and the number of new bands in the second lane had significant positive effects on the HTA index, while the distribution of the bands within the second lane and the length of the time interval had significant negative effects on the HTA index (Table 2). Other variables including viral load, lane differences, and differences in band count between adjacent lanes were not significant or were colinear with either baseline band count or number of new bands. This analysis demonstrated that high numbers of new bands led to high HTA indices and that long time intervals and poor band distribution decreased the HTA index despite high numbers of new bands. A high baseline band number also had a small positive effect on the HTA index.
|
The total change and HTA index values did not show the same trend in some cases, i.e., the total change value increased while the HTA index stayed low. Examples include subject 1048 (V1-V2 and V4-V5 at 7.5 years), subject 778 (V4-V5 at 4.5 years), subject 179 (V1-V2 at 1.3 years), and 822 (V1-V2 at 3.2 years) (Fig. 1). These examples illustrate the difference between these two measures of sequence evolution, in that there can be a great deal of change in the HTA pattern due to changes in band intensity or loss of bands, but the HTA index remains low because there are few if any new bands emerging. We are most interested in new bands emerging because they are due to sequence evolution and virus escape. These differences between the two measures were the exceptions, however, because the HTA index correlated strongly and significantly with total change for both regions (V1-V2, r = 0.48; V4-V5, r = 0.41; P < 0.001 in both cases). Similar correlations were observed between the HTA index and entropy (V1-V2, r = 0.54; V4-V5, r = 0.60; P < 0.001 in both cases). This suggests that the trends observed in our novel index follow the trends seen in more-standard, yet less-sensitive, measures of change.
V1-V2 and V4-V5 evolution occurs at lower rates in subjects with rapid loss of CD4 cells. To determine whether there were differences between the subjects with slow and rapid loss of CD4 cells, we compared mean total change and mean HTA index values between these two groups (Fig. 2). The mean total change for V4-V5 was significantly higher for subjects with slow CD4 loss, but the mean total change for V1-V2 was not different between the two groups (Fig. 2A). This was surprising because we observed the same stabilization of bands in subjects 778 and 1048 in both V1-V2 and V4-V5. However, the mean HTA index was significantly different between the two groups for both env regions, with higher mean HTA indices in the subjects with slow CD4 loss (Fig. 2B). Therefore, the subjects with rapid CD4 loss had lower new-band emergence (measured by HTA index) than the subjects with slow CD4 loss but equivalent changes in the relative abundance of existing bands (measured by both methods). Since long intervals (>0.8 year) could have skewed this analysis with excess low HTA indices, the long interval HTA index values were removed and the data were reanalyzed. The difference between the two groups of subjects was still significant for both V1-V2 and V4-V5 (not shown).
|
|
| DISCUSSION |
|---|
|
|
|---|
Analysis of total change in HTA pattern revealed significantly more change in V4-V5 HTA patterns among subjects with slow CD4 loss, while analysis with a new HTA index algorithm, with its emphasis on the emergence of new bands, showed significantly higher indices for both V1-V2 and V4-V5 regions in the subjects with slow CD4 loss. Statistical analyses showed that CD4 count was correlated with total change and HTA index values only when low CD4 counts were included in the data set, indicating that low CD4 counts under 150/µl were associated with reduced changes in the HTA pattern and reduced emergence of new variants. Sustained low CD4 counts in two subjects with rapid CD4 loss were associated with stabilization of V1-V2 and V4-V5 variant populations as reflected by both low total change and low HTA index values.
The present study is the first to use HTA to follow the evolution in two separate regions of env during the advanced stages of chronic infection. Most previous HTA and sequencing studies have focused on only one region of env (8, 18, 24, 36, 37). A recent study by Frost et al. (13) analyzed more than one region of env by bulk sequencing full-length env in samples from acute infection and 1 year later. The sequence changes were compared to neutralization activity against autologous virus measured over the first 3 years of infection and found that multiple amino acid substitutions were associated with neutralization escape. Our study complements the findings of Frost et al. by showing that sequence turnover continues until late in infection. Thus, immune selection against HIV appears to continue until late in infection.
Our observation that both V1-V2 and V4-V5 populations stabilized at sustained low CD4 counts is evidence that immune selection against Env wanes at low CD4. The rapid turnover of env variant populations when CD4 counts are above 150/µl suggests that the immune system continues to be able to develop new responses against escape variants. Previous studies have shown evidence of both selection and genetic drift in driving HIV evolution (1, 13, 18, 24, 39), but the stabilization of env variants in two subjects in the present study argues against genetic drift as a significant force in env evolution and in favor of continually emerging immune responses that decline at persistently low CD4 counts. Our data also suggest, however, that these continually emerging immune responses against Env are largely ineffective in reducing viral burden, since the virus is constantly escaping as the viral loads remain high. It remains to be determined whether the selective pressure on the env gene plays a role in the equilibrium between HIV-1 and its human host and whether this selective pressure can be directed to be more effective in controlling the virus.
| ACKNOWLEDGMENTS |
|---|
We thank Ronald Swanstrom, Claire Chougnet, and Jason Blackard for advice and critical readings of the manuscript; Kathryn Kitrinos for advice on V1/V2 HTA and V4/V5 RT-PCR; and Dale Dondero and Brent Sugimoto for sample shipments. The Ba-L, JR-FL, and NL4-3 molecular clones were obtained from Nathanial Landau, Irvin Chen, and Malcolm Martin, respectively. We received the following reagent from the NIH AIDS Research and Reference Reagent Program: pYU-2 from Beatrice Hahn and George Shaw.
| FOOTNOTES |
|---|
Published ahead of print on 6 September 2006. ![]()
Supplemental material for this article may be found at http://jvi.asm.org/. ![]()
Present address: Department of Pediatrics, Tulane University School of Medicine, New Orleans, LA. ![]()
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| J. Bacteriol. | Mol. Cell. Biol. | Microbiol. Mol. Biol. Rev. |
|---|
| Clin. Vaccine Immunol. | ALL ASM JOURNALS |
|---|