Previous Article | Next Article ![]()
Journal of Virology, January 2006, p. 999-1014, Vol. 80, No. 2
0022-538X/06/$08.00+0 doi:10.1128/JVI.80.2.999-1014.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Departments of Pathobiology,1 Microbiology, University of Washington, Seattle, Washington 98195,2 Theoretical Biology and Biophysics Group, Los Alamos National Laboratories, Los Alamos, New Mexico 87545,3 Viral Vaccines Program, Seattle Biomedical Research Institute, Seattle, Washington 981094
Received 23 June 2005/ Accepted 10 October 2005
|
|
|---|
|
|
|---|
In recent years, insight has been gained from structural analyses of the bound (as a ligand) and unbound gp120 core (12, 36) as well as from several studies looking at the dynamics of the glycan shield in both simian immunodeficiency virus (SIV) (11) and HIV (15, 17, 61). Such studies have documented that the acquisition or rearrangement of carbohydrates can allow for neutralization escape mutants (8, 13, 52), while many of the individual glycans play important roles in protecting the CD4 binding site (CD4bs) and gp41 ectodomain from neutralizing antibodies (42).
Neutralizing antibodies are known to influence the evolution of HIV within a host and to play an important role in disease progression. NAb titers correlate with a lack of disease progression in some HIV-infected long-term nonprogressors (10), and viral persistence may depend on the selection of NAb escape variants (16, 61). In some patients, the NAb response can show evidence of affinity maturation and recognition of common determinants conserved among distinct isolates (55). However, this maturation is a multiyear process and is not universal, occurring in a minority of patients (43). Thus, an understanding of the mechanism(s) governing this response would be valuable for vaccine design.
A major question that remains unanswered involves the boundaries of variation that exist as the virus evolves in individual hosts and the relationship between these changes and the adaptive immune response. This has been difficult to address for HIV-1 infection of humans due to the distinct viruses present in each individual. This question can be addressed in part by comparing the sequences from different individuals over time during disease progression and when there are alterations in Env functionality, such as its coreceptor usage. Such a comparison was performed using published sequences found in the Los Alamos National Laboratory (LANL) HIV database. An analysis of 49 published full-length HIV-1 gp160 sequences in the LANL database revealed that approximately half of these glycans are in conserved regions (66). For those that are not conserved, there appear to be small windows, or "hot spots," where the glycans can vary in relative position and number as coreceptor switching occurs and immune responses develop. However, this type of analysis cannot directly link Env mutations to the development of antibody or cytotoxic T lymphocyte (CTL) responses.
Another approach to addressing this question is to determine whether key changes are similar in different individuals infected with the same virus. Using molecularly cloned or low-diversity virus isolates to infect nonhuman primates, it is possible to monitor the fate of viral sequences over months or years postinfection in multiple individuals with greater or lesser degrees of control of viremia. These types of studies have been valuable for understanding viral mutations leading to immune escape from cytotoxic T cells (1). Much of the data available from the analysis of mutations in Env in nonhuman primates have utilized SIV, where changes to PNGs can directly affect both resistance to neutralization by sera from infected macaques and the pathogenicity of the virus itself (11, 31). SIV is closely related to HIV-2 and differs significantly in Env from HIV-1; until recently, SIV and HIV-1 were not known to share any neutralization determinants. A recent report demonstrated that CD4-induced antibodies cross-neutralize primate lentiviruses in the presence of soluble CD4 (20). Despite this shared epitope, it is likely that few specific escape mutations in SIV Env will be directly informative regarding HIV-1 in humans. In contrast, chimeric simian/human immunodeficiency viruses (SHIVs) bearing HIV-1 env genes have the potential for direct comparison with HIV-1 env genes in their native context.
For this study, we performed an analysis of 11 macaques that were infected with SHIV-89.6P (49) for approximately 1 year. The Env proteins in seven of these macaques underwent significant divergence from that in the inoculum, while four had minimal divergence. SHIV-89.6P was constructed with the HIV-89.6P (a subtype B virus) env gene (50). Subtype B viruses, unlike A and C viruses, do not seem to accumulate PNGs over time from infection, while A and C viruses tend to have reduced numbers of PNGs in early infection and gradually acquire more (14, 22). Despite this difference among subtypes, the population of subtype B viruses in the HIV database tends to have comparable numbers of PNG sites and levels of diversity to those in the other subtypes. In this study, using a subtype B env gene in an SIV background (SHIV-89.6P), there was no inherent tendency for the total number of PNG sites to gradually increase, although Env proteins from macaques with divergent viruses did acquire significantly more PNGs than those from macaques with minimally divergent viruses. We observed a remarkable degree of conservation in patterns of change in Env PNGs and alterations in V2 and V5 in all seven macaques with divergent viruses, despite variations in overall amino acid sequences that were unique to each macaque. The majority of these changes made the SHIV-89.6P virus more similar to the typical patterns seen for HIV-1 clade B. The data from this study suggest that the glycan shield and certain other key regions of HIV-1 Env may be under considerable constraints during viral divergence.
|
|
|---|
![]() View larger version (15K): [in a new window] |
FIG. 1. Phylogenetic analysis of individual HIV-1 envelope gp120 proviral clones from macaques infected with SHIV-89.6P at 32 weeks postinfection. The maximum likelihood tree is rooted on the consensus of the inoculum. Sequences from each macaque are clustered together and uniformly colored. Branches were collapsed if the bootstrap value was below 50; bootstrap values over 50 are labeled on the tree.
|
|
View this table: [in a new window] |
TABLE 2. Convergent PNG changes in gp120 at 32 wpia
|
Nested PCR was performed on a 1:25 dilution of first-round PCR product, using the primers 3'gp120 89.6 (CGATTCATCTTTTTTCTCTTTGCACTGT) and EO (TAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTA). The following cycling conditions were used: denaturation at 94°C for 2 min and 25 cycles of 94°C for 15 s, 64°C for 3 min, and 68°C for 3 min, with a 5-s increase in each cycle, followed by a final extension at 68°C for 7 min. PCR products were purified using a QIAGEN QIAquick PCR purification kit according to the manufacturer's instructions. For each sample, the purified nested PCR products were pooled and ligated into the 2.1 TOPO-TA vector (Invitrogen, Carlsbad, CA), using 4 µl of PCR product and 1 µl of vector. Two microliters of ligation mix was transformed into TOP10F' chemically competent cells (Invitrogen). Transformed cells were selected with ampicillin and X-Gal (5-bromo-4-chloro-3-indolyl-ß-D-galactopyranoside). Ten clones were obtained from each sample, and DNAs were prepared using a QIAGEN miniprep kit according to the manufacturer's instructions. Inserts were confirmed by digestion with EcoRI. Cloned V1-C5 was sequenced using the primer 319 (CATGGTAGAACAGATGCATGAGG) and the TOPO-TA cloning primers M13F and M13R. Sequencing was performed with Big Dye chemistry, version 3, on an Applied Biosystems 3100 genetic analyzer. Sequences giving missense predicted proteins were removed from analysis, as were duplicate sequences, resulting in an average of nine clones analyzed per animal per time point. The proviral env gene of the inoculation virus was amplified from the PBMC stock from which it was prepared in accordance with the same methods described above. A consensus was created from the eight sequenced clones obtained. The consensus sequence was identical to the published sequence of SHIV-89.6p KB9 (accession no. U89134).
Sequence analysis.
Sequences were assembled using Lasergene DNAstar Seqman and Editseq (2) and aligned with ClustalX (58). Nucleotide alignments were manually edited using BioEdit 5.0.9 (27). A maximum likelihood tree of the nucleotide sequences was constructed with PAUP*4.0 (57). Maximum likelihood bootstrapping was performed with 100 replicates, using the HKY85 model with a subtree-pruning-regrafting-branch-swapping algorithm. The starting tree was obtained by neighbor joining, and the starting branch lengths were obtained using the Rogers-Swofford approximation method. Branches with a bootstrap value of <50 were collapsed, and the tree was rooted on the consensus of 89.6P KB9 (accession no. U89134). For an analysis of diversity, nucleotide alignments were analyzed in MEGA3.0 (34), using the Kimura two-parameter model with pairwise deletions (transition-to-transversion ratio of 2). For an analysis of divergence, a consensus sequence from eight clones of the inoculating strain was obtained using the LANL Seqpublish program (http://www.hiv.lanl.gov/content/hiv-db/SeqPublish/seqpublish.html). The consensus of the inoculation virus was compared to each clone sequence, and divergence was measured with MEGA3.0, using the Kimura two-parameter model with pairwise deletions (transition-to-transversion ratio of 2). The average divergence at each time point for each animal is reported. All sequences were compared to the published sequence to produce the percent difference from the published sequence. All of the clones from the inoculum were then compared to the clones from each animal in order to test whether or not the inoculum was significantly different from the animal's quasispecies by using a Mann-Whitney test (http://eatworms.swmed.edu/
leon/stats/utest.html). In order to compute selective pressure, the ratio of synonymous (dS) to nonsynonymous (dN) rates of change was calculated with the program PAML (65), using maximum likelihood analyses. Calculations utilized the tree topology shown in Fig. 1. A dN/dS ratio of 1 denotes neutral selection, ratios of >1 denote positive selection, and ratios of <1 denote purifying selection. N-linked glycosylation sites were tracked and tallied using the LANL N-Glycosite program (http://www.hiv.lanl.gov/content/hiv-db/GLYCOSITE/glycosite.html). The frequency of use of each PNG was determined by manually creating a matrix of PNG locations in Excel (Microsoft, Redmond, WA). The nucleotide sequences were translated to amino acids using JavaScript DNA Translator 1.1 (https://deathstar.bcc.washington.edu/public/dnatranslater/index.html), aligned with ClustalX, and manually edited with BioEdit 5.0.9. The reference sequence HIV-HXB2 (accession no. K03455) was included in the alignment in order to standardize the numbering position of each amino acid residue, as recommended by Korber et al. (33). Briefly, each aligned residue was numbered according to the corresponding position in HXB2, and insertions were given the number of the residue prior to the insertion, with a letter to denote each inserted residue. Each clone was then scored for the presence or absence of the canonical PNG motif (Asn-X-Ser/Thr, where X represents any residue other than proline). The presence or absence of a given PNG motif was then compared to the consensus of the inoculum and reported as a percentage of use for a given PNG site. The prevalence of a PNG motif within an animal's quasispecies was compared to the prevalence of the same PNG motif within the inoculum, using Fisher's exact t test (http://www.unc.edu/
preacher/fisher/fisher.htm). P values of <0.05 are reported as significant.
Three-dimensional mapping of PNG sites. The PNG sites were mapped on a model structure of gp120. The core structure corresponded to the X-ray structure of YU2 (35) (pdb code 1RZK). For completeness and for the visualization of all PNG sites, model structures for the V1, V2, and V3 loops corresponding to YU2 were added to the core structure. Added loops were minimized while restraining the rest of the atoms of gp120 with a large force constant that was gradually relaxed to 500 kcal/mol-Å2 for backbone atoms of the crystal structure. Molecular dynamic simulations were carried out with flexible loops on a rigid core in a vacuum at high temperature. The resulting structure was solvated with 16,307 water molecules and minimized with restrained core backbone atoms. Next, all-atom molecular dynamic simulations were carried out, with periodic boundary conditions of 300 K for 2 ns. In the simulations, only water molecules and V1/V2/V3 loops were allowed to equilibrate on a rigid core. This provided an efficient sampling of loops on the solvated core structure. The above calculations were carried out using a modified version of AMBER software, and simulation settings and conditions conformed to the program defaults (46). We used these modeled loops on the CD4-bound structure of gp120 to guide us in modeling the relative locations of the N-glycosylation sites. Three-dimensional images were made using VMD software.
Construction of homologous pseudovirus.
In order to test for homologous neutralizing antibodies, a single round of infection with a SHIV-89.6P pseudovirus was utilized. The viral backbone plasmid Q23
Env was kindly provided by Julie Overbaugh (40). The 89.6P Env-encoding plasmid was constructed in pEMC*. The proviral 89.6P gp160 env gene was first cloned into the 2.1 TOPO-TA vector as described above. The first-round PCR primers were EO (TAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTA) and EO1 (TCCAGTCCCCCCTTTTCTTTTAAAAA), and the cycling conditions used were as follows: denaturation at 95°C for 5 min and 35 cycles of 94°C for 40 s, 61°C for 30 s, and 72°C for 3 min 15 s, with a final elongation step of 72°C for 10 min. The second-round primers were 89.6p 3' gp41 ClaI (GGCGGCGGCATCGATTCACAAGAGAGTGAGCTCAAGC) and 89.6p 5' gp120 NheI (GCGGCGGCGGCTAGCACAGAAAAATTGTGGGTCACAG), and the cycling conditions used were as follows: 92°C for 5 min, 40°C for 1 min, 68°C for 5 min, 92°C for 1 min, 45°C for 1 min, 68°C for 5 min, 2 cycles (92°C for 1 min, 50°C for 1 min, and 68°C for 5 min), 2 cycles (92°C for 1 min, 55°C for 1 min, and 68°C for 5 min), 25 cycles (92°C for 1 min, 60°C for 1 min, and 68°C for 5 min), and 68°C for 10 min. The second-round primers inserted an in-frame NheI restriction site 3' of the env leader sequence and a ClaI restriction site at the 3' end of gp41. Transformants were generated as described above. The purified plasmid was digested with NheI (Roche, Indianapolis, IN) and ClaI (Roche, Indianapolis, IN), and the gp160 fragment was ligated into pEMC* with T4 DNA ligase (NEB, Beverly, MA). Total plasmid DNA (5.5 ng) was transformed into 50 µl of Max Efficiency DH10B competent cells (Invitrogen, Carlsbad, CA) per the manufacturer's instructions and grown at 30°C for 24 h. The plasmid was purified with a QIAGEN miniprep kit as described above and sequenced for verification of the insert. 293T cells were transfected with Fugene 6 (Roche, Indianapolis, IN) per the manufacturer's instructions. Cells were plated to 50% confluence in a T75 flask, and 4 µg total DNA with a 2:1 backbone-to-Env ratio was prepared in a mixture of 12 µl Fugene and 188 µl Dulbecco's modified Eagle's medium. Virus was harvested 48 h later, spun at 2,000 rpm for 10 min, and stored at 70°C until use. The pseudovirus was titrated on Tzm-bl cells, and 200 50% tissue culture infective doses were added to each neutralization assay.
Neutralization assay. Serum from each animal was tested at 0, 12, 20, 32, and 40 weeks postinfection. Sera were heat inactivated at 56°C for 1 h and spun at 14,000 rpm for 5 min to remove coagulants. The Tzm-bl neutralization assay was performed in 96-well plates (60). Briefly, 200 50% tissue culture infective doses of virus were added to serial dilutions of sera in the presence of 7.5 µg/ml DEAE-dextran and incubated in a total volume of 150 µl medium (Dulbecco's modified Eagle's medium, 10% fetal calf serum, 1% L-glutamine, 1% penicillin-streptomycin) for 1 h at 37°C. Each well received 100 µl of Tzm-bl cells resuspended in medium at 1 x 105 cells/ml. Forty-eight hours later, cells were lysed for 2 min directly in the neutralization plate, using 100 µl of Bright-Glo luciferase assay substrate (Promega, Madison, WI), and immediately analyzed for luciferase activity on a luminometer. The reciprocal dilution of serum necessary to achieve 50% neutralization is reported. Note that the collection of late sera ranged from week 39 to week 53 but is referred to as week 40 for ease of reporting. Neutralization values greater than three times the prebleed background level are considered significant. The lowest dilution of prebleed sera tested (1:100) never reached 50% neutralization; therefore, values above 300 were considered significant. Any sample that did not neutralize at a dilution of 1:100 is reported as 50. All values were calculated with respect to virus only [(value for virus only minus cells only) minus (value for serum minus cells only)] divided by (value for virus minus cells only).
Nucleotide sequence accession numbers. All sequences included in this study have been deposited in GenBank (accession numbers DQ105094 to DQ105195).
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Characteristics of SHIV-89.6P-infected Macaca nemestrina
|
Further analysis of the aligned env nucleotide sequences allowed for quantification of the horizontal distance depicted on the phylogenetic tree. Divergence is defined as viral evolution away from the original sequence, while the quasispecies or sequence variation present within an animal at a given time defines diversity. As expected, there was only slight heterogeneity (0.3%) within the inoculum. Sequences obtained from 4 of the 11 animals showed little divergence (mean, 0.35%; range, 0.3% to 0.4%) and little diversity beyond those of the inoculum (mean, 0.58%; range, 0.5% to 0.7%). These animals included K97107, K98099, L97191, and J97168 and are referred to as "minimally divergent" from here on. In contrast, the sequences obtained from each of the remaining seven animals showed marked divergence (mean, 1.5%; range, 0.9% to 1.9%) and diversification (mean, 1.4%; range, 1.1% to 1.6%). These animals included T98098, A98069, J98071, J97156, K97246, T98108, and L98152 and are referred to as "divergent" from here on (Table 1). The average divergence of all 11 animals at 32 wpi was 1.1%, giving an average rate of 1.8% divergence in gp120 per year. This is comparable to the 1 to 2% per year rate of divergence reported for HIV-1-infected patients (5, 54).
Several clones within the four minimally divergent quasispecies and the inoculum had a limited number of unique changes from the consensus sequence of SHIV-89.6P. These are represented as individual short terminal branches in the phylogenetic analysis (Fig. 1). In contrast, the seven macaques with divergent sequences each formed unique branches. This supports the conjecture that the clustered sequences were derived from the same macaque and that each quasispecies is unique to an animal. None of the sequences obtained from the inoculum cluster matched with any of the seven divergent quasispecies, suggesting that the divergence does not represent the common outgrowth of a minor preexisting sequence.
The rate of nonsynonymous to synonymous change was calculated for each clone, and the average for each animal is reported in Table 1. All four minimally divergent animals had dN/dS ratios of <1, while six of the seven divergent animals had evidence of selective pressure, with dN/dS ratios of >1.
Relationship between viral load and Env divergence. Six of the seven macaques with high viral set points (>104 RNA copies/ml) developed quasispecies that diverged from the inoculum (Table 1). The one exception was animal L97191, which had the highest viral load set point but whose viral population was only 0.3% divergent from the inoculum at 32 wpi. In contrast, only one of the four animals with low viral set points (<104 RNA copies/ml) developed quasispecies that diverged from the inoculum. The control animals and those that received ineffective vaccines (DNA-DNA or DNA-protein) tended to be the animals with higher viral loads and greater divergence of their quasispecies. This trend did not reach significance in a two-tailed t test (P value = 0.09).
Total number of predicted glycans increases in macaques with divergent Env proteins. The published 89.6P KB9 sequence (accession no. U89134) has 23 total PNGs in gp120. This is less than the median number of 26 PNGs seen in gp120 of HIV-1 clade B viruses (66). Therefore, it was of interest to analyze if the total number of PNGs in 89.6P increased over time to become more similar to the median for HIV-1 clade B viruses. Sequences from each animal were obtained from V1 through the end of gp120 and did not span a known PNG in C1 (N88). Therefore, the sequenced region in this study included 22 PNGs present in the consensus of the inoculum (and, accordingly, 25 in HIV-1 clade B). Env sequences from macaque T98098 did not span the complete V1 region and were therefore not considered for this analysis. When sequences from all remaining macaques were included in the analysis, we found moderate variation in the total number of PNGs per Env but no significant trend overall to suggest that the Env of 89.6P becomes more heavily glycosylated by 32 wpi. However, there is a difference in total PNGs when the sequences are grouped according to divergence levels. Of the six divergent macaques analyzed, five had sequences with a net gain of PNGs (A98069, J98071, J97156, K97246, and T98108), while only one demonstrated a net loss (L98152). In contrast, three of four minimally divergent macaques demonstrated a net loss of PNGs (K98099, L97191, and J97168), while one (K97107) had sequences that remained the same as the inoculum (Table 1). As Env diverges in these animals, the number of potential N-linked glycosylation sites increases; Env proteins from minimally divergent macaques show an average of 21.7 PNGs, while those from divergent macaques show an average of 23.4 PNGs (unpaired two-tailed t test with unequal variances; P < 0.001).
Many predicted glycans are highly conserved in gp120 even as Env diverges. The SHIV-89.6P consensus contained 22 PNGs from V1 through the end of gp120 (Fig. 2A). Nineteen were present at the same locations in over 90% of all sequences analyzed as a group (Fig. 2B, blue lines). Seven of the highly conserved PNGs were located within the variable loops, including four in V1 (N130, N136, N139, and N156), one in V3 (N301), and two in V4 (N392 and N406) (Fig. 2B). The remaining 12 PNG motifs that are highly conserved (N197, N230, N234, N241, N262, N289, N295, N332, N339, N356, N362, and N448) lie within the conserved regions, or core, of gp120 (Fig. 2B). When the quasispecies of each macaque were analyzed individually, 11 of the 19 PNGs described above were found to be well conserved in each of the macaques tested (N156, N197, N230, N241, N289, N295, N301, N392, N406, N332, and N339).
![]() View larger version (44K): [in a new window] |
FIG. 2. Graphical representation of potential N-linked glycosylation sites in SHIV-89.6P Env and HIV-1 clade B Env. N-Glycosite was used to quantify the locations and prevalence of PNGs within the SHIV-89.6P inoculum (A), pooled SHIV-89.6P variants from all macaques (B), and an HIV-1 clade B reference alignment (C). The amino acid number of each PNG is labeled on the x axis. The prevalence of each PNG is graphed as a fraction of the total on the y axis. Variable regions are denoted as a bar at the bottom of each figure; shaded boxes within the variable regions represent hot spots of PNG variation. PNGs that are conserved (>90% prevalent) are shown in blue, PNGs that are heterologous within the inoculum are shown in orange, convergent PNG additions are shown in green, and convergent PNG shifts are shown in purple. The PNGs that are conserved in both SHIV-89.6P and HIV-1 clade B are denoted with asterisks in panels B and C.
|
Convergent PNG changes occur in the quasispecies of diverged Envs from independent macaques. An analysis of the published HIV-1 env sequences revealed that the majority of PNG changes fall within limited windows, or hot spots, of the HIV envelope. These hot spots include the N-terminal side of V1, a small region on the C-terminal side of V2, most of V4, and a small region of V5. These hot spots, first noted by Zhang et al. (66), are highlighted at the bottom of Fig. 2C. It is interesting that although all of these hot spots fall within variable regions, there are many PNGs within the variable loops that remain relatively conserved (Fig. 2C).
For each macaque's quasispecies, an average of nine gp120 clones were analyzed for changes in PNGs. The sequences were aligned with that of the reference strain HXB2 (accession no. K03455), and each clone was scored for the presence or absence of a given PNG. For each macaque and the inoculum, the prevalence of each PNG site is reported as a percentage. Each PNG location that underwent a significant level of change from the inoculum (Fisher's exact t test; P < 0.05) is reported in Table 2. Within the SHIV-89.6P data set, the variation in PNGs within independent macaques was shown to occur in remarkably similar ways. We found a total of six convergent PNG changes that occurred independently in multiple macaques, including four additions (N141 [8/11 animals], N276 [6/11 animals], N386 [6/11 animals], and N397 [6/11 animals, although this site was a mixture in the inoculum]) and two shifts (N186 to N188 [6/11 animals] and N460 to N462/3 [4/11 animals]). Five of these six PNG changes in the SHIV-89.6P model fall within the hot spots defined in the HIV-1 sequences (Fig. 2B). The addition of PNGs at all four of the convergent sites was predominantly due to simple G-to-A transitions common in HIV infection, although some sequences did acquire a new PNG by a relatively rare transversion event. These nucleotide changes are described in Table 3. For each PNG addition described, the prevalence of the site within each macaque quasispecies is shown in Table 2, while the location of the site and how it relates to HIV-1 clade B are shown in Fig. 2B and C. PNG N141 in V1 was significantly added (Fisher's exact t test; P < 0.05) in five of the seven divergent animals and increased as a minor species in three additional macaques. This site lies within a hot spot of HIV-1 clade B, and PNG additions within this region are common. PNG N276 in the outer region of C2 was significantly added (Fisher's exact test; P < 0.05) in four of the seven divergent animals and increased in two additional macaques. This site has been defined as part of the CD4bs (36) and is well conserved in HIV-1 clade B. PNG N386 on the N-terminal side of V4 was significantly added (Fisher's exact t test; P < 0.05) in five of the seven divergent animals, while trending towards an increase in one additional macaque. This site is part of the 2G12 epitope (53) and is commonly present in HIV-1 clade B. PNG N397 on the N-terminal side of V4 was present in 38% of the 89.6P inoculum clones used for this study. The prevalence of this site significantly increased (Fisher's exact test; P < 0.05) in four of the seven divergent animals. This site lies within a hot spot of HIV-1 clade B, and PNG additions within this region are common. It is interesting that sequences from several macaques also showed a decreased prevalence of this site compared to the inoculum. The observed mixture in the inoculum suggests that stochastic events at transmission rather than convergent selection may have impacted the observed pattern of occurrence for this particular PNG.
|
View this table: [in a new window] |
TABLE 3. Mutations resulting in recurrent additions of PNG sites
|
|
View this table: [in a new window] |
TABLE 4. Mutations resulting in recurrent shifting PNG sites
|
![]() View larger version (29K): [in a new window] |
FIG. 3. PNG combinations within the Env quasispecies members of each macaque at 32 weeks postinfection. Each horizontal bar corresponds to a combination of PNG changes that exists within the quasispecies of each animal. The number of clones with a given combination is shown to the left of the bars. The position of each PNG is shown along the top. If the PNG is present, it is shown as a vertical line. Identical combinations found within different animals are denoted with symbols on the right.
|
![]() ![]() View larger version (77K): [in a new window] |
FIG. 5. Accumulation of Env sequence changes over time in two macaques. The variable regions of representative longitudinal sequences from two macaques with significant divergence are shown. The consensus sequence of Env in the SHIV-89.6P inoculum is shown along the top. A charged 40-residue region from positions 152 to 192 is highlighted in V1/V2.
|
Mapping of PNG residues onto the gp120 structure. PNGs were mapped onto a gp120 structure that was created by modeling the variable loop sequences of HIV-1 YU2 onto the core crystal structure of CD4-bound YU2 gp120 (35, 36). This three-dimensional mapping facilitates interpretation of the PNG patterns in the SHIV 89.6P Env proteins that we observed within the context of the modeled protein structure (Fig. 4A). All six convergent PNG changes described above map to the neutralizing face of gp120 and are proximal to the CD4bs (Fig. 4B). In contrast, the PNGs on the silent face are predominantly conserved (Fig. 4C, blue residues). The only PNG within the V3 loop (N301) is highly conserved in both 89.6P and HIV-1 clade B; it is in close proximity to the chemokine receptor binding site and can influence coreceptor usage (44).
![]() View larger version (76K): [in a new window] |
FIG. 4. Three-dimensional structural illustration of locations of N-glycosylation sites. Locations of PNGs from the SHIV-89.6P clones were mapped onto a model of gp120 based on the X-ray structure of the CD4-bound YU2 gp120 core (35) (PDB code 1RZK). The V1, V2, and V3 loops (pink) of YU2 are modeled onto the core (red) for completeness and identification of N-glycosylation sites. In the gp120 orientation shown, the viral and target membranes would be located above and below the protein, respectively. The yellow lines show the disulfide bridges. The N-glycosylation sites that are stable (blue), added (green), shifted (purple), and heterologous within the inoculum (orange) are shown as balls and labeled with the corresponding amino acid numbers. The predicted CD4bs is shown in gray; residues were included based on the contacts identified within 5 Å of CD4 in the CD4-bound YU2 crystal structure (35). (A) Cartoon diagram showing the locations of all PNGs in 89.6P. The inset highlights a subset of PNGs proximal to the disulfide bridges. (B) Space-filling model of the inner face. The predicted CD4bs (gray) is partially occluded by the modeled V1V2 loop (pink). All variant PNGs (underlined) lie on the inner face of Env. (C) Space-filling model of the outer face. The PNGs on the outer domain of gp120 are predominantly conserved.
|
Longitudinal changes in PNGs occur at different rates. The viral quasispecies from a subset of four macaques were analyzed longitudinally, comprising two animals with rapid PNG changes (K97246 and J97156) and two animals with delayed PNG changes (J98071 and L98152); sequences derived from one animal in each group are shown as representatives in Fig. 5. The addition of N386 is an example of how some PNG changes occur more rapidly in some macaques than in others; this addition began to occur in clones from macaque K97246 by 12 wpi, while it was still only seen in one clone from macaque J98071 at 32 wpi. All four macaques underwent a steady accumulation of consistent changes over time, yet all of the changes within an animal did not occur together temporally. This is demonstrated with the quasispecies of macaque K97246, which underwent a complete PNG shift in V2 from N186 to N188 by 12 wpi, while the addition of N276 in C2 did not occur until 32 wpi.
Changes in charge and potential changes in structure are conserved in multiple macaques. Although the majority of common sequence changes occur at PNG sites, we observed two additional types of changes in the hypervariable regions. There is a 40-residue stretch of V1V2 that has a strong net positive charge in the consensus sequence of the SHIV-89.6P virus used to challenge the macaques. In the cloned inoculum, 88% of the sequences have a net charge of +4 or +5. The four animals with viral sequences that did not diverge from the inoculum shared this charge profile (91% of sequences had a net charge of +4 or +5). In all seven macaques with divergent quasispecies, this region of V1V2 became more acidic by 32 wpi, with only 17% of the clones having a net charge of +4 or +5 (Fig. 6). Also, a proline was added to V5 by 32 wpi in five of the seven macaques with divergent quasispecies (mean, 66%; range, 38% to 100%). This mutation occurred at either residue 462 or 465. This region of V5 is flanked by residues that are predicted to be involved in the CD4bs (36).
![]() View larger version (31K): [in a new window] |
FIG. 6. Net charge of the V2 region becomes more acidic at week 32 in animals with diversified quasispecies. A 40-residue region from positions 152 to 192 had a median charge of +4 in the inoculum. The four macaques with minimal divergence (K97107, K98099, L97191, and J97168) maintained this charge. The seven animals with viruses that diverged from the inoculum (A98069, T98098, J97156, K97246, L98152, T98108, and J98071) had mutations in this region that resulted in a more acidic (more negative) net charge. The vertical line divides animals that do not have any significant PNG changes from those that do. Macaque numbers on the x axis are arrayed in order of increasing levels of divergence.
|
![]() View larger version (32K): [in a new window] |
FIG. 7. Development of homologous NAbs as a function of time postinfection. Sera from the SHIV-89.6P-infected macaques were tested against pseudotyped SHIV-89.6P in a Tzm-bl cell assay. Reciprocal titers for 50% neutralization are shown. The y axis for animal K98099 is different (higher values) from the rest. Each sample was run in triplicate, and most data points represent the geometric means of three independent assays. Macaques K97107, L97191, J97168, and L98152 had no detectable NAbs at any time point tested (not shown).
|
|
|
|---|
The remarkable level of glycan conservation seen in this study could result from several different selective pressures. The monomeric form of gp120 consists of an inner domain, an outer domain, and a bridging sheet that connects the two (36). In the trimeric form, the outer domain of gp120 is theoretically the region most accessible to neutralizing antibodies, yet few antibodies are known to recognize this heavily glycosylated "silent face" (63, 64). Nevertheless, removal of carbohydrates from this region renders the virus more susceptible to neutralization (42). Mapping the conserved PNGs identified in this study onto the HIV-1 gp120 core demonstrated that most of the 19 highly conserved sites are located on the outer domain of gp120. The removal of the conserved glycans on the silent face may be selected against since exposure of the epitopes beneath may readily elicit neutralizing antibodies. Many of the conserved PNG sites also lie within close proximity to the disulfide bonds that define the variable loops. These strategically placed PNG sites may define the conformational space of loops and influence their flexibility and orientation. Alternatively, these sites may be conserved due to cotranslational folding requirements; for example, it has been proposed that N-glycans either protect the cysteine residue pair or recruit the necessary cellular machinery to the correct location for the formation of proper disulfide bonds in influenza virus hemagglutinin (18). In support of this, N-linked glycosylation plays a role in ensuring the proper folding of newly synthesized gp120 in the endoplasmic reticulum (ER) (39), and carbohydrates are known to prevent aggregation, allow indirect interactions through the resident lectin chaperons calnexin and calreticulin of the ER (37), and provide spatial constraints that facilitate folding (18). Thus, the conserved PNGs near cysteines in HIV-1 Env may guide the formation of correct disulfide bonds during cotranslational folding and direct the loops to functionally relevant spatial regions in folded gp120.
SHIV-89.6P contains a clade B envelope that is less glycosylated than those of the average HIV-1 clade B viruses in the LANL database. Surprisingly, we observed four additions and two shifts of PNGs that resulted in more "typical" representations of the clade B glycosylation profiles. Two of the PNG additions that occur in SHIV-89.6P are at locations outside the hot spots that are lacking in the inoculum yet highly conserved in HIV-1 clade B (N276 and N386). The other two additions (N141 and N397) occur within hot spots identified on HIV-1 clade B. Two PNG shifts (N186 to N188 and N460 to N462) occur within HIV-1 clade B hot spots, in each case shifting the PNG N-terminally to a region within the hot spot with a higher prevalence of PNGs. Although the four PNG additions resulted from simple G-to-A transitions, the recurrent shifts in V2 and V5 were much more mutationally complex, involving transversions and multiple base substitutions, and were realized in multiple ways in different macaques. When considering these six PNG changes, it is noteworthy that although multiple changes occurred within the same clone, the same combination of changes rarely occurred in clones from different macaques, and the temporal development of the changes was not consistent. In the two incidents where the same combination of PNG changes was found in two macaques, these PNGs occurred in different sequence backgrounds. This suggests that the sequon changes were arrived at in an independent manner and did not arise from the outgrowth of a minor variant from the inoculum. There is a scattering of unique PNG changes that occur in only one animal or at low levels; these may reflect positions that are not vital to but are generally favored for the function of the virus. Other studies have documented that specific PNGs may be less important than the density of PNGs in a given region; the unique PNG changes seen in this model may lend further evidence to this phenomenon (45, 48).
It has been shown that the relative positions and number of PNG sites can vary without sacrificing infectivity (28, 45); however, these sites do have functional roles. Several of the convergent PNG changes occur within hot spots proximal to the CD4bs, a conformational determinant of Env that is conserved among the primate lentivirus family. At least some of the broadly neutralizing activity in human serum has been shown to map to this region (55), and human MAbs such as immunoglobulin G1b12 have confirmed that some antibodies targeted to this region can neutralize divergent isolates (9). An interesting study of HIV resistance to plant lectins hints at an interpretation for the PNG shift seen in V2. Plant lectins have specificity for high-mannose oligomers and thus can inhibit the entry of HIV into its target cell (3). Intriguingly, under selective pressure by high-mannose-specific lectins, the PNG at site 186 of HIV-1 (IIIb) shifted two residues downstream to position 188 (4), comparable to the shift we see here. It is possible that the shifting PNG site at 186/188 is involved in the entry of HIV into target cells. It is unlikely, however, that a unique functional role can be assigned to each PNG change seen in this study. The PNGs may collectively perform a function, or alternatively, a single PNG may participate in many functions. For example, the addition of a PNG at N386 was seen both in the SHIV-89.6P Envs in this study and in clade B viruses (Fig. 2C). N386 is proximal to the coreceptor binding site as well as a disulfide bond, mutations at N386 influence antibody accessibility to the CD4 binding site (32), and changes at N386 can impact mutational pressure at the tip of the V3 loop (30). A mass spectroscopic approach that assigned 25 of 26 consensus glycosylation sites of HIV-1 SF2 gp120 found that high-mannose carbohydrates and complex carbohydrates each cluster together, with little overlap between the two forms (67). An examination of the fully glycosylated SIV gp120 crystal structure also shows clustering of high-mannose carbohydrates (12). In that structure, two of the three clusters arise from a group of three PNGs where the Asn residues are within 12 Å of each other (C-alpha distance). The mapping of PNGs in 89.6P Env gp120 also revealed two groups of three PNGs that are structurally close to each other, within a 12-Å distance according to this model. The close proximity of these Asn residues does not necessarily imply that they form a glycan cluster, but it is interesting to speculate whether they might interact. Interestingly, these groups appear on opposite faces of gp120, and a residue from each group is also part of the epitope of the MAb 2G12.
Since the coreceptor binding site is only available after conformational changes induced by CD4 (12) and since the gp41 ectodomain is only thought to be available after conformational changes induced by coreceptor binding (12), it could be argued that the CD4bs is the most vulnerable region of the viral Env and thus that mutations that shield it from antibodies would provide a fitness advantage to the virus. It has been proposed that the V1V2 and V3 loops interact to form a protective barrier around the CD4 binding site (19, 69). The V3 loop is thought to play the predominant role in physical interactions with the coreceptor, and its strong net positive charge can facilitate binding to the negatively charge CCR5 coreceptor (68). Due to this requirement, the polarity of the V3 loop is relatively constrained. However, if the V1V2 loops adopt a more negative charge, this could influence the packing of the variable loops around the CD4bs (25, 47). In this study, all seven animals with divergent quasispecies had sequences that had evolved a more acidic V1V2. Both the N- and C-terminal amino acids in the V5 loop are known to comprise part of the CD4bs, yet little is known about how the V5 region changes over time. In this study, in five of the seven animals with divergent quasispecies a proline was added in V5; such a change could alter the shape of the V5 loop and may in turn alter the CD4bs. Additional analyses of the binding properties of individual Env proteins bearing these mutations are needed to test this hypothesis directly.
A major goal of this study was to determine how nonsynonymous changes in Env are related to the development of homologous and heterologous NAbs. The calculation of dN/dS ratios suggested that the changes that were observed in these macaques are the result of selective pressure. Irrespective of vaccine treatment, the macaques with higher viremia were typically the ones to develop persistent homologous neutralizing antibodies as well as divergent quasispecies. Two exceptions to this trend were observed, i.e., macaques K98099 and L98152. Both macaques received a vaccine consisting of DNA expressing Env from both 89.6 and 89.6P and vaccinia virus expressing Env from 89.6 prior to challenge, and both animals maintained low viremia. K98099 mounted a strong homologous neutralizing antibody response after challenge yet underwent minimal divergence. We speculate that the limited antigen in this macaque resulted in the rapid decline of homologous NAbs and little to no divergence of the viral Env. In contrast, L98152 failed to develop detectable neutralizing antibodies, yet its virus diverged. If the vaccine primed a response that controlled viremia, it did not prevent divergence. This result suggests that divergence per se does not require high levels of virus in the periphery, nor does it imply a poor clinical outcome. The development of persistent homologous NAbs may be both antigen dependent and require exposure to the diverging quasispecies, while divergence of the quasispecies may be driven partially by viral fitness and partially by immune escape. Evidence for viral replication driving antibody maturation has been observed in another system, namely, influenza virus vaccination (7). Five of the seven divergent macaques received vaccines that were ineffective in generating significant B- or T-cell responses prior to challenge, and after challenge these SHIV-specific responses and control of viremia were indistinguishable from those in mock-vaccinated controls. We thus consider it likely that these vaccines had no effect on the development and selection of viral variants after challenge.
It is interesting that several of the animals maintained steady virus loads and that the homologous NAbs were either maintained or continued to increase despite the changes occurring in the viral Env. However, further work is necessary to determine whether any of the mutations that we have sequenced encode escape variants. In the macaques studied here, the majority of homologous NAbs may be directed against the immunodominant V3 loop, which did not undergo change in the time frame of this study. It is possible that mutations in Env sequences contribute to the development of higher-avidity NAbs or to NAbs effective against heterologous isolates. An investigation of this possibility is currently in progress, utilizing the cloned Envs to make pseudovirions and to determine the neutralization profiles of each variant. This study ended after 1 year of observation, at which point none of the plasma samples showed significant evidence of heterologous neutralization. This finding is consistent with studies in HIV-infected humans, who do not typically develop heterologous NAbs prior to 3 years of infection (43, 51).
There is precedent for convergent patterns of change as HIV evolves within independent hosts. It is well documented that the HIV subtype B viruses often undergo defined changes as the virus switches from an R5 to an X4 phenotype (29, 56). Hirsch et al. have shown with the SIV model that convergent changes in the viral Env can affect viral fitness by leading to a CD4-independent virus (21). These examples highlight patterns in the way that the virus evolves to enhance viral fitness. Data from this work demonstrate additional mechanisms of convergent change in the viral envelope that involve glycosylation, the V1V2 charge, and the V5 structure. These changes correlate with both higher viral loads and higher homologous NAb titers. Further work is under way to define whether these predicted convergent changes are utilized and how they affect viral fitness or viral escape. As we continue to deepen our understanding of the intricacies of HIV evolution, we may be able to better inform the design of effective vaccines capable of generating the elusive broadly reactive NAbs.
Env was kindly provided by Julie Overbaugh. Virus stocks and cell lines used in neutralization assays were obtained from the NIH AIDS Research and Reference Reagent Program. This study was supported by PHS grant P01-AI54564 (N.L.H.), a Center for AIDS Research predoctoral fellowship (P30-AI27757) (W.B.), a grant from the Murdock Charitable Trust (N.L.H.), and donations from the James B. Pendleton Charitable Trust.
|
|
|---|
2 mannose residues on the outer face of gp120. J. Virol. 76:7306-7321.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»