Previous Article | Next Article ![]()
Journal of Virology, June 2007, p. 5658-5668, Vol. 81, No. 11
0022-538X/07/$08.00+0 doi:10.1128/JVI.00257-07
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
2 Amphipathic Helix and Acquisition of Length in gp120 Hypervariable Domains Are Associated with Resistance to Autologous Neutralization of Subtype C Human Immunodeficiency Virus Type 1
Jeffrey N. Sfakianos,8
John L. Mokili,5,
Mark Muldoon,9
Joseph Mulenga,10
Susan Allen,3
Beatrice H. Hahn,6,7
George M. Shaw,6,7
Jerry L. Blackwell,2,4
Bette T. Korber,5,11
Eric Hunter,1,2 and
Cynthia A. Derdeyn1,2*
Department of Pathology and Laboratory Medicine,1 Yerkes National Primate Research Center,2 Department of International Health,3 Division of Infectious Diseases, Emory University, Atlanta, Georgia,4 Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico,5 Department of Medicine,6 Department of Microbiology, University of Alabama at Birmingham, Birmingham, Alabama,7 Department of Cell Biology, Yale University School of Medicine, New Haven, Connecticut,8 Department of Mathematics, University of Manchester Institute of Science and Technology, Manchester, United Kingdom,9 Zambia Blood Transfusion Service, Lusaka, Zambia,10 The Santa Fe Institute, Santa Fe, New Mexico,11
Received 6 February 2007/ Accepted 7 March 2007
|
|
|---|
2) located on the gp120 outer domain. High nonsynonymous-to-synonymous substitution (dN/dS) ratios, indicative of positive selection, were also found at these five positions in subtype C sequences in the database. Nevertheless, exchange of the entire
2 helix between resistant donor Envs and sensitive recipient Envs did not alter the NAb phenotype. The combined mutual information and dN/dS analyses suggest that unique mutational patterns in
2 and insertions in the V1-to-V4 region are associated with NAb resistance during subtype C infection but that the selected positions within the
2 helix must be linked to still other changes in Env to confer antibody escape. These findings suggest that subtype C viruses utilize mutations in the
2 helix for efficient viral replication and immune avoidance. |
|
|---|
The HIV-1 genome encodes two glycoproteins; the transmembrane subunit gp41 anchors the Env complex in the viral membrane and mediates fusion with the host cell membrane, while the surface subunit gp120 facilitates interactions with receptor molecules (30). HIV-1 gp120 is heavily glycosylated and contains five "hypervariable" domains (V1 to V5) that tolerate sequence heterogeneity (47). The V1V2 and V4 domains, in particular, can also accommodate dramatic insertions, deletions, and variation in the pattern of N-linked glycosylation (N-Gly) sites. The V1V2 domain has been shown to influence sensitivity to NAb in studies of subtype B viruses (6, 13, 26, 33, 35, 36, 49-51, 54, 59, 63, 70, 71, 74) and, more recently, subtype C viruses (66). The V1V2 domain is thought to regulate neutralization sensitivity by masking conserved neutralization targets (39, 40, 44, 59, 66, 76) and can also present type-specific neutralization epitopes (20, 21, 23, 28, 32, 53, 58, 73). The mechanism by which V4 could influence neutralization is less clear, although this region could modulate glycan packing on the outer surface of the Env trimer (74).
There is evidence that distinct selective pressures are active against Env during infections with different subtypes (14, 22). In an analysis of subtype C sequences from the HIV Databases (http://www.hiv.lanl.gov), a high density of positions with nonsynonymous-to-synonymous substitution (dN/dS) ratios of >1 was located in the third conserved region (C3) of gp120, which is downstream in the linear sequence from the third hypervariable domain (V3) (22). High dN/dS ratios are indicative of strong diversifying selection (77). In contrast, high dN/dS ratios in subtype B sequences were concentrated within the boundaries of V3, which, in contrast, is relatively conserved in subtype C viruses (14, 22, 38). These findings suggest that the nature of NAb targets in the V3/C3 region differs between subtypes B and C (7, 24, 25, 46, 48, 74). Furthermore, we recently demonstrated that an amphipathic helix (
2) encoded in C3 exhibits mutational and structural differences in subtypes B and C (22a). Thus, the distributions of amino acid residues under selective pressure in the V3/C3 region are dramatically different in subtypes B and C, but the relationships between mutational patterns, viral subtypes, and sensitivity to autologous NAb have not been elucidated.
We previously demonstrated that Envs from chronically infected (donor) transmission partners exhibited a range of resistance against contemporaneous donor NAb in a pseudovirus assay, but the Envs from their newly infected recipient partners were uniformly sensitive to NAb in this plasma sample (18). Using sequence and NAb data generated in the previous study from 48 donor and 25 recipient Envs, we applied two statistical approaches to probe the determinants of neutralization sensitivity to donor plasma in the present study and used a biological assay to evaluate specific contributions. The results suggest that acquisition of length in the hypervariable domains and mutation of the
2 helix domain track with NAb resistance, but they demonstrate that the positions within the
2 helix must be linked to still other domains in Env that confer antibody escape. These findings suggest that the Env trimers of subtype C viruses utilize mutations in the
2 helix to escape NAb. This could be a pathway unique to subtype C viruses and is therefore important for development of a vaccine that induces potent and cross-reactive NAb responses.
|
|
|---|
Determination of BL.
A maximum likelihood (ML) tree was constructed previously for each transmission pair (18), with gapped regions (including sequences in the first, second, and fourth hypervariable domains [V1V2 and V4]) stripped out of the analysis. Modeltest was used to identify the optimal evolutionary model (60), and for each tree, the model selected included base frequencies, a general reversible model for base substitution, and differences in rate variation at different sites assigned according to a gamma distribution with four categories of rates and invariant sites. A further comparison was made with a REV model with ML assigned rate variation at different sites, but the additional parameters were not justified in a likelihood ratio test for these data, and thus the Modeltest model was used. The specific value for each of the model parameters was estimated for each of the eight trees separately. PAUP was then used to construct a likelihood tree for each transmission pair. BranchLength.pl (B. Korber [www.santafe.edu/
btk/science-paper/bette.html]; a web-based version of this code is now available at http://hcv.lanl.gov/content/hcv-db/BRANCHLEN/branchlength.html) was used to calculate the branch length (BL) to the patient's ancestral node for each donor and recipient sequence.
Correlation analysis. A nonparametric Kendall tau rank correlation test was applied to previously described paired sequence and neutralization data (18) to determine whether the 50% inhibitory concentration (IC50) of plasma was correlated with the following different parameters: BL to the patient's ancestral node on the ML tree, amino acid length of the V1-to-V4 region, and the number of N-Gly sites in V1-to-V4. P values of <0.05 were considered statistically interesting for hypothesis forming as a foundation for further experimental exploration. The correlation tests were performed with the donor plasma IC50 and pooled plasma IC50 against each sequence attribute.
Mutual information analysis. Shannon entropy is a measure of uncertainty used in communication theory (64). It can be applied to other problems where character states are considered, such as sequence analysis (37). For example, for position i in a protein alignment, the Shannon entropy function H(i) is defined by the probability of different amino acids, s, in that position, as follows: H(i) = S·P(si)·log P(si). This is a measure of the uncertainty in the column of data; if only one amino acid appears in a column, it has zero entropy. Maximal entropy occurs when the frequencies of all of the characters are equal. Here we considered j a character state assigned according to the level of neutralization sensitivity associated with a V1-to-V4 sequence (a, IC50 of <50; b, IC50 of 50 to 249; c, IC50 of 250 to 999; d, IC50 of 1,000 to 4,999; and e, IC50 of >5,000). We asked whether j covaries with amino acid patterns found in any given position in an alignment. Given the joint probability distribution P(si,sj), H(i,j) = S·P(si,sj)·log P(si,sj) and the mutual information distribution M(i,j) = H(i) + H(j) H(i,j) (37), we measured how much the data covaries, given that the mutual information is 0 when either i and j are completely independent or there is no variation. To test the significance of these results, 10,000 randomizations of neutralization phenotype assignments to viral sequences were performed, and the number of times the mutual information for the random data was greater than or equal to the original data was tallied and divided by 10,000 to estimate the P value. This statistical approach is based on the assumption that the data are drawn from a random pool; however, the sequence data have strong underlying phylogenetic correlations (2). To partially compensate for this, the randomization of neutralization assignments to sequences was done within donor-recipient pairs; however, the associations identified must therefore still be considered with this limitation in mind and thus viewed in a hypothesis-forming framework to guide experiments and for comparison with other results, such as the dN/dS ratios. Hypervariable loops were omitted from this analysis because we could not exclude the possibility that alignment artifacts could occur in the association patterns.
Codon-specific dN/dS ratios. Previously described data regarding codon-specific dN/dS ratios were adapted (22). Nonsynonymous mutations become fixed with greater or lesser probability than synonymous mutations depending on whether a site is subject to positive or purifying selection, and thus the dN/dS ratio can be used to characterize specific codons and regions of proteins evolving under positive selection (77). In viruses, regions under strong positive selection occur where immune escape confers a selective advantage to new variants that arise within a host. Accordingly, codon-based models of molecular evolution can be used in conjunction with sequence data and ML parameter estimation to identify sites of potential immunological interest. The dN/dS ratio is indicative of the selection pressure at the protein level, as follows: a dN/dS ratio of <1 is indicative of purifying selection and amino acid conservation because of structural and functional constraints, and a dN/dS ratio of >1 is indicative of diversifying, positive selection where amino acid substitutions confer an advantage. In this study, we asked whether or not the precise positions under positive selective pressure may be lineage specific for HIV-1, comparing subtype B and subtype C, as representatives of both lineages are under consideration as vaccine candidates in trial populations where the HIV-1 C clade dominates. The program CODEML (S8 [http://abacus.gene.ucl.ac.uk/software/paml.html]) was used to determine the regional distributions of dN/dS ratios in the V3 loop and flanking regions for two sets of 25 HIV-1 sequences, one for subtype B and the second for subtype C. The subtype C sequences used did not include those from the Zambian transmission pairs analyzed for neutralization sensitivity here. For each subtype, a three-rate prior with flexible locations and weights was fitted to the data. Bayes's formula was then used to calculate the posterior distribution of the rate ratio at each codon. The posterior mean was taken as a site-specific estimate of the dN/dS ratio.
Three-dimensional mapping. The core structure used for mapping corresponds to the X-ray structure of CD4-bound YU2 gp120 (43) (PDB code 1RZK). Variable loops V1V2 and V3 were modeled for clarity as described previously (5). Discussions on an unliganded gp120 core structure were based on the fully glycosylated simian immunodeficiency virus (SIV) structure (12) (PDB code 2BF1). Signature positions were mapped onto this structure based on the alignment of sequences. Modeling calculations were performed using a modified version of AMBER (56). Three-dimensional images were generated using VMD (29).
Molecular dynamics simulations.
Comparative all-atom molecular dynamics simulations were carried out on the modeled gp120 to capture the local differences in the V3-C3-V4 regions of gp120 between B and C subtypes. The following three sets of simulations were carried out under identical conditions: (i) gp120 from YU2 (subtype B), (ii) YU2 gp120 with the
2 helix consensus sequence from subtype C, and (iii) gp120 with the subtype C consensus sequence. In all cases, the gp120 protein was solvated in
16,000 water molecules. The starting structure for subtype C gp120 was based on the YU2 model described above. The subtype C consensus was homology modeled from the subtype B structure, using MODELLER software (69). All molecular dynamics simulations were performed using the force field described by Cornell et al. (15) and the AMBER 6.0 suite of programs (9). A rectangular box of TIP3P water molecules (31) was added to solvate the complex, with the final system containing
55,000 atoms within a box dimension of
79 by 79 by 89 Å. Periodic boundary conditions were applied, and the particle mesh Ewald approach (16) was used to accurately treat electrostatic interactions. Both systems were minimized and equilibrated with the same protocol. The temperature was maintained at 300 K, using the Berendsen temperature algorithm (3). A 2-fs time step was used, and all bonds involving hydrogen atoms were constrained using the SHAKE algorithm (68). Simulations were carried out for approximately 4 ns, and the last 2 ns were considered for analysis. Additional details of the simulations and the results are provided in a separate publication (22a). The relative solvation between subtypes was obtained by calculating the number of water molecules in the first solvent shell surrounding a secondary structure (e.g., the
2 helix) or a specific residue. The contact distances between C-alpha carbons were used to identify interactions between regions or differences between the clades. A 10-Å cutoff between C-alpha carbons was used as a measure for contact.
Construction of chimeric Envs.
PCR amplification and cloning of the donor and recipient Envs from pairs 55 and 135 have been described previously (18). The env genes are in the cytomegalovirus-driven expression vector pCR3.1 (Invitrogen), which is used to generate viral pseudotypes. Each of the donor
2 helix domains shown in Fig. 3A was inserted into the corresponding recipient Env by using a strategy similar to that described previously (66). A DNA duplex corresponding to each donor
2 helix was synthesized as a double-stranded oligonucleotide by IDT Technologies (Coralville, IA) and gel purified to ensure that each fragment was of the correct length. These fragments were each blunt end ligated to an
8-kb PCR amplicon corresponding to the recipient Env, excluding the
2 helix and including the pCR3.1 vector sequences. The single-stranded DNA sequences corresponding to the 135 F and 55 M donor
2 helices (HXB2 nucleotides [nt] 7227 to 7279) were as follows: for pair 135, F42a (5'-AAAAAGCTATGGAATAACACTTTAGACGGGATAAAGGAAAAATTAGCAAGATAC-3'), F48a (5'-AAACAAAATTGGACTAAAACTTTAGAAGAGGTAAAGACAGAATTAAGAAAATAC-3'), and F67a (5'-AGACAGAAATGGAATAGCACTTTAGACGGGATAAAGGAAAAATTAGCAAGATAC-3'); and for pair 55, M7a (5'-GAACAAAAATGGAGTACAACTTTAAAAAGGGTAGAGAAAAAATTAAAAGAGCAC-3'), M4.1 (5'-GGAAAAGAATGGAATACAACTTTAACAAGGGTAAAGGAAAGATTAAAAAAGCAC-3'), and M2.13 (5'-AGAAAAAAATGGAATATAACTTTAGAAAGGGTAAAGGAAAGATTAAAAGAGCAC-3').
![]() View larger version (36K): [in a new window] |
FIG. 3. Neutralization sensitivities of donor-recipient 2 helix chimeras. (A) An alignment of the predicted amino acid sequences for the 2 helix region is shown for Envs from two transmission pairs (55 [male to female] and 135 [female to male]). The consensus subtype C sequence is shown at the top of each set of sequences. Red arrows show NAb signature residues 335, 336, 337, 343, and 350. The recipient sequences are shown below the consensus (55 F4a, 55 F28a, and 135 M1a). Two recipient Envs with different degrees of neutralization sensitivity were evaluated for pair 55. (B) The neutralization sensitivity of the parental and chimeric Env pseudotypes was evaluated using contemporaneous plasma from donor 55 M or 135 F. The percent virus infectivity relative to the control (lacking donor plasma) is plotted along the vertical axis, and the reciprocal plasma dilution of donor plasma (from 1:20 or 1:100 to 1:62,500) is plotted along the horizontal axis on a log10 scale. Each panel contains infectivity curves for the parental donor (green lines) and recipient (blue lines) Envs and the 2 helix-chimeric recipient Envs (dashed black lines). Infectivity curves for pair 55 and 135 chimeras are shown on the left and right, respectively.
|
PCR amplification conditions for the recipient Env backbones were as follows: 1 cycle of 95°C for 3 min; 35 cycles of 95°C for 1 min, 62°C for 30 s, and 72°C for 10 min; 1 cycle of 72°C for 15 min; and storage at 4°C. Pfu Turbo DNA polymerase (Stratagene) was used to generate a blunt-ended PCR amplicon, which was gel purified from an agarose gel by using a QIAquick gel extraction kit (QIAGEN) prior to ligation. The 25-µl PCR mix contained 50 ng of each primer, 10 ng of the plasmid template, 2.5 mM MgCl2, a 0.2 mM concentration of each deoxynucleoside triphosphate, and 1x reaction buffer. Each synthesized
2 helix DNA fragment was ligated to the purified recipient env backbone by using T4 DNA ligase (5 U/µl; Roche) at 4°C overnight. A portion of the ligation reaction mix was transformed into maximum efficiency XL2-Blue supercompetent cells (1 x 109 CFU/µg DNA; Stratagene). The entire transformation mix was plated onto LB-ampicillin agar plates, generally resulting in
50 colonies per ligation reaction. These colonies were screened by PCR to identify colonies in which the fragments ligated together in the correct orientation, using forward primer EnvA and a reverse primer that anneals to the 3' end of the helix. The colony screening primers were as follows: forward, 5'-CAGCACAGTACAATGTACACATGGAA-3' (HXB2 nt 6950 to 6975); and reverse, 5'-TTCCTTTATCCCGTCTAAAGT-3' (HXB2 nt 7254 to 7274) for 135 F42a or 135 F67a
2 helix screening, 5'-TGTCTTTACCTCTTCTAAAGT-3' (HXB2 nt 7254 to 7274) for 135 F48a
2 helix screening, 5'-TTTTAATCTTTCCTTTACCCT-3' (HXB2 nt 7254 to 7274) for 55 M4.1 or 55 M2.13
2 helix screening, or 5'-TTTTAATTTTTTCTCTACCCT-3' (HXB2 nt 7254 to 7274) for 55 M7a
2 helix screening. Colonies that were positive by PCR screening were inoculated into LB-ampicillin overnight cultures, and the plasmids were prepared using a QIAprep Spin miniprep kit (QIAGEN). Plasmids were then screened by (i) restriction digestion, (ii) biological function assay (described below), and (iii) nucleotide sequencing of the
2 helix and flanking regions to confirm that they contained the correct sequences.
Neutralization assay.
All neutralization assays were performed using a pseudotyped virion assay as described previously (17, 18, 48, 66, 74). In this assay, the NAb activity of plasmas collected contemporaneously with the Envs from donors 135 F and 55 M was evaluated against virions pseudotyped with recipient Envs containing chimeric
2 helix regions in parallel with the parental donor and recipient Envs. For the present study, to evaluate the neutralization sensitivities of the donor, recipient, and chimeric Envs together, it was necessary to conserve limited amounts of donor plasma. Therefore, the highest plasma dilution tested was 1:100 for donor 55 M and 1:20 for donor 135 F.
|
|
|---|
![]() View larger version (31K): [in a new window] |
FIG. 1. Correlation between sequence adaptation and neutralization sensitivity. Each correlation plot (A to F) contains data points generated from NAb (at the IC50)-Env combinations. Viruses were pseudotyped with either a donor (green) or recipient (blue) Env protein, and neutralization was performed with the linked donor plasma (A, C, and E) or a heterologous subtype C plasma pool (B, D, and F). Data for 48 donor and 27 recipient Envs from five previously described heterosexual transmission pairs in Zambia are shown (18). The P value for correlation between each set of parameters was calculated using Kendall's tau rank correlation test. P values of <0.05 are considered significant. Correlation plots show reciprocal NAb IC50 titers (A to F), BLs from the patients' ancestral nodes on ML trees (A and B), amino acid lengths for V1 to V4 (C and D), and the numbers of N-Gly sites in V1 to V4 (NXS or NXT, where X is any amino acid except proline) (E and F). A reciprocal NAb IC50 titer of 0.1 corresponds to a 1:10 dilution of plasma, a titer of 0.01 corresponds to a 1:100 dilution, and a titer of 0.001 corresponds to a 1:1,000 dilution. As previously described (18), reciprocal NAb IC50 titers were rounded to the nearest two decimal places to facilitate statistical analysis; NAb-sensitive Envs with reciprocal titers of <0.005 were therefore plotted at 0.
|
Accumulation of N-Gly sites is not correlated with NAb resistance. Tandem repeat sequences inserted within the longer V1V2 and V4 domains often encoded N-Gly sites (66; data not shown), and in four of the five donors, the number of N-Gly sites was correlated with V1-to-V4 length, using a Spearman rank correlation test (data not shown). Nevertheless, the number of N-Gly sites was not significantly correlated with neutralization by either type of plasma (Fig. 1E and F). This finding could reflect a lack of statistical power for the number of samples analyzed, given the relatively small window of N-Gly sites (17 to 23 sites for all but one Env). Alternatively, the relative positions and packing of the glycans could be more important than net gains or losses for neutralization escape (74).
Individual positions in V1 to V4 are associated with the neutralization phenotype.
A mutual information analysis based on Shannon entropy was applied to the V1-to-V4 sequence and NAb data for the donor and recipient Envs to search for individual positions in the sequence that were predictive of a neutralization-resistant phenotype. Nine "signature" positions were significantly associated with the NAb phenotype after a Bonferroni correction for multiple tests (uncorrected P value, <0.0001) and were mapped three-dimensionally onto a gp120 model that was based on the X-ray structure of the CD4-bound YU2 gp120 core with modeled variable loops (Fig. 2A). Together, the signature residues form a linear display along the immunologically silent face of gp120 (75) and are proximal to the
2 helix (Fig. 2B). Five of the nine residues are located within the
2 helix, and these five positions overlap those previously demonstrated to be under strong positive selection, with dN/dS ratios of >3, using subtype C sequences in the database (Fig. 2B) (22). The convergence of these two independent analyses supports a role for these sequences in neutralization escape. Also, since the database sequences were drawn from a broader sampling that did not include the Envs studied here, these findings could be generally relevant to subtype C viruses and not limited to those circulating in Zambia. Importantly, the positions of the NAb escape signature residues are not subject to the same degree of positive selection in subtype B sequences, where the highest dN/dS ratios are concentrated in V3 (Fig. 2C). Taken together, these data suggest that substitutions in this region could be driven by the autologous NAb response and that changes in this region, in conjunction with alterations in the hypervariable domains, could contribute to neutralization escape during subtype C infection.
![]() View larger version (47K): [in a new window] |
FIG. 2. Three-dimensional structural illustration of positions associated with NAb escape and positive selection. Locations were mapped onto a model of gp120 based on the X-ray structure of the CD4-bound YU2 gp120 core (43) (PDB code 1RZK). V1V2 and V3 loops were modeled onto the core for completeness. In the orientation shown, the viral and cellular membranes would be located above and below the protein, respectively. (A) Space-filling model showing locations that were significantly associated with the neutralization phenotype in the mutual information analysis (red balls; HXB2 positions 236, 305, 332, 335, 336, 337, 343, 350, and 393). Positions 335, 336, 337, 343, and 350 are located in the 2 helix. (B and C) Cartoon diagrams showing locations under positive selection, as determined by dN/dS ratios for subtype C sequences (B) and subtype B sequences (C) from the Los Alamos Database (22). Red indicates strong positive selection (dN/dS ratio of >3). Residues associated with the neutralization phenotype in the mutual information analysis are shown as spheres in panel B. In the orientation shown in panels B and C, the viral and cellular membranes would be located above and below the protein, respectively. The orientation in panel A roughly corresponds to rotating the structure in panel B about an axis normal to the viral membrane 90 degrees counterclockwise.
|
2 helix does not directly alter the neutralization phenotype.
The
2 helix is located on the outer domain of gp120, and as an amphipathic helix, the outer face could be partially exposed (Fig. 2A). Based on the mutual information analysis and dN/dS ratios from the database analysis, five positions on the outer surface of this structure appear to be under strong positive selection in subtype C viruses. To investigate whether determinants of neutralization resistance are actually contained within the
2 helix, we selected Envs from transmission pairs 55 (male to female) and 135 (female to male) for further study (18). In pair 55, only 6 of the 18 positions in the helix were completely conserved (Fig. 3A), and these were located on the inner surface facing the gp120 core (22a). Residues in all but one of the five NAb signature positions were heterogeneous in the pair 55 Envs (Fig. 3A). Likewise, for pair 135, the five internal positions of the helix were completely conserved, and all NAb signature residues were heterogeneous (Fig. 3A). Chimeric Envs were created in which the entire helix of a recipient Env was replaced with the corresponding sequences from three different donor Envs. In the case of pair 55, two recipient Envs with different degrees of neutralization sensitivity determined by a charged residue in V2 were used (66). The entire
2 helix domain was exchanged instead of performing targeted site-directed mutagenesis due to the limited quantities of donor plasma available and the extensive sequence heterogeneity in this region (Fig. 3A). Despite the strong association between neutralization sensitivity and the amino acid residues in this region that emerged in the mutual information analysis, exchange of this region resulted in only slight changes in NAb sensitivity between the recipient and chimeric Envs for both transmission pairs (Fig. 3B). The reciprocal constructs were created and evaluated for pair 135 (recipient helix into donor Envs), and again, only minimal changes in neutralization phenotype were observed (data not shown). These results argue that despite the strong selective pressure on the
2 helix and its association with NAb resistance, this structure does not directly modulate resistance to autologous NAb. |
|
|---|
Increasing sequence divergence, as measured by BL on ML trees, showed a strong association with decreased sensitivity to neutralization by donor plasma and a heterologous plasma pool. This result suggests that point mutations in regions outside the hypervariable domains V1V2 and V4, which were excluded from the trees, facilitate escape from antibodies that target some cross-neutralizing epitopes and is consistent with the correlation observed previously between the IC50 titers for neutralization by donor and pooled plasmas (18) and V1V2-independent pathways to NAb resistance (66). Despite the inherent limitations, given that the analysis was based on multiple sequences from five transmission pairs, a striking relationship between signature sites was apparent, in that all nine neutralization "signature" residues identified in the mutual information analysis cluster together on the outer face of the gp120 molecule, nestled among glycans that were resolved on the unliganded SIV gp120 structure (Fig. 4A and B). Variation in one residue identified in the mutual information analysis, i.e., residue 236 (based on HXB2 numbering) in C2, can alter glycosylation at N234, and its effect appeared to differ among the transmission pairs. For example, the presence of a glycan at N234 was associated with neutralization resistance in pair 135, while the loss of the same glycan was associated with resistance in pair 53; this sort of patient-specific gain or loss of a glycan illustrates why there may be no discernible association between the net number of N-Gly sites and the resistance phenotype. Notably, glycosylation at the same position was previously associated with cumulative escape from autologous neutralization in a subtype B-infected subject (74).
![]() View larger version (73K): [in a new window] |
FIG. 4. Three-dimensional structural illustration of positions associated with NAb escape on the unliganded SIV gp120 structure. Locations were mapped onto a model of unliganded gp120 based on the fully glycosylated X-ray structure of SIVmac32H gp120 (12) (PDB code 2BF1). The V1V2 and V3 loops were truncated in the crystallized protein and replaced with a GAG sequence spanning the positions where the loop would have been. N-linked carbohydrate moieties are shown in purple. In the orientation shown, the viral and cellular membranes would be located above and below the protein, respectively. (A) Space-filling model showing locations that were significantly associated with the neutralization phenotype in the mutual information analysis (in red; HXB2 positions 236, 332, 335, 336, 337, 343, 350, and 393). Positions 335, 336, 337, 343, and 350 are located in the 2 helix. Position 305 is missing from the structure. (B) Cartoon diagram showing residues associated with the neutralization phenotype in the mutual information analysis as red spheres.
|
2, and one in V4) showed a trend of association with transmission status (P. Hawkins et al., unpublished data), and none of these overlapped with the nine residues shown here to track with NAb resistance. Thus, the positions we have identified in the current study as being associated with neutralization resistance appear to be independent of transmission status, given the sample size under consideration.
In the trimer model of gp120, based on the structure of HXB2 (45), the 18-residue
2 helix is located on the outer surface of gp120. Even though this model is based on CD4-bound gp120, a comparison of liganded and unliganded gp120 structures does not indicate major changes in the outer domain upon CD4 binding (12). Five of the nine NAb signature residues were located within the
2 helix (Fig. 2B), and the same residues were under strong positive selection in subtype C database sequences (22); the coincidence of the two independent results raised the interesting hypothesis that substitutions in this region could be related to antibody escape. Nevertheless, exchange of the
2 helix between donor and recipient Envs did not significantly alter the neutralization sensitivity of the related recipient Env, suggesting that it alone does not contain direct determinants of NAb resistance. It is not surprising that an exchange of the
2 helix failed to produce direct changes in patterns of neutralization resistance, since this region of Env is likely protected from antibody recognition by the glycan shield and thus, in itself, is unlikely to represent a neutralization epitope (74). More likely, this region of the Env glycoprotein is responsible for contributing to the three-dimensional structure of the protomer and the quaternary structure of the trimer that confers neutralization escape by mechanisms in addition to epitope variation (e.g., epitope masking, steric hindrance, conformation energetics, and glycan packing) (42, 74). Molecular dynamics simulation studies provided evidence for an extensive interaction between the
2 helix and a flexible V4 loop in subtype C, with certain residues in the helix associated with restricting V4 loop length (22a). The nonpolar face of the
2 helix is predicted to interact with structural domains ß10, ß11, ß14, and ß24, which could help to stabilize the gp120 core structure. It is therefore possible that to see a phenotypic effect of the
2 helix on neutralization sensitivity, other domains must be present. Perhaps the outer face of the
2 helix compensates structurally for the adaptations that are directly involved in NAb escape of subtype C viruses. If so, mutations in the helix would track with a resistant phenotype without physically defining or altering an epitope.
Sequence and structural studies revealed at least two major differences in the
2 helix between subtypes B and C. Even though the helix of subtype C exhibits high sequence entropy, the amphipathicity of the helix is preserved to a higher degree than that of subtype B (22a). The helix of subtype C is also more solvated than that of subtype B (S. Gnanakaran, unpublished data), with high-sequence-entropy positions occurring at the solvent-exposed face. All five signature residues identified here occupy solvent-exposed positions of the helix, while residues that face toward the core are well conserved (22a). Thus, in subtype C viruses, the
2 helix may act as a flexible interface to maintain the integrity between NAb escape mutations occurring on the outer domain and the more highly conserved gp120 inner core. Alternatively, these mutations could accommodate escape mechanisms occurring in the hypervariable regions. In subtype C, the N-terminal half of the helix interacts extensively with the V4 loop, while the C-terminal half interacts with the V5 domain.
Finally, mutations at some of the signature positions could affect the local stability of the
2 helix. Alpha helices represent a common secondary structural motif in proteins and are stabilized by intramolecular hydrogen bonding between the i and i + 4 residues on parallel turns of the helix. However, the first four and last four residues of an
helix lack this intramolecular hydrogen bonding and are often stabilized by alternative hydrogen bonding partners and hydrophobic interactions. This leads to differences in the propensities of amino acids found in the N-terminal and C-terminal capping motifs. Several systematic studies on the positional frequency of each of the amino acids found in helices in protein structures show statistical deviations from a random distribution (1, 19, 41, 65). For example, asparagine has a higher preference of occurring at the N-terminal cap position, whereas the structurally similar glutamic acid does not. Further studies have established a relationship between capping motifs and the stability of an
helix. Interestingly, five of the nine signature positions identified here have the potential to play a structural role as capping residues of the
2 helix. Four of them coincide with the N-terminal capping positions, and one coincides with a C-terminal capping position in the
2 helix. Furthermore, the first turn of the N-terminal capping sequence is located adjacent to the antiparallel beta strand just downstream from the V3 domain. Thus, any local unfolding of the N-terminal region of the
2 helix could potentially impact the projection of the V3 domain out from the core. Thus, the
2 helix of gp120 appears to be a novel and important source of structural adaptation during subtype C infection, and its structural role in the context of the entire gp120 molecule needs to be explored further.
The present study combines statistical, biological, and molecular dynamics simulation analyses to elucidate the poorly understood pathways to neutralization resistance in subtype C virus infection. NAb are clearly a strong evolutionary force during chronic infection with subtype C HIV-1, driving sequence divergence in structural domains and also dramatic length changes in the hypervariable regions of Env. Moreover, mutations in a purely structural domain (in a region that historically has been considered well conserved) track with NAb resistance, supporting a novel and potentially subtype-specific mechanism for neutralization escape. Further investigations will be required to understand the selective forces directed at the
2 helix and to determine why this region is differentially targeted depending on the viral subtype, as these questions have direct implications for global vaccine design.
This work was supported by NIH grants R01-AI-58706 (C.D.) and R01-AI-51231 (E.H.), by The Bill & Melinda Gates Grand Challenges Program (G.M.S.), and by Yerkes National Primate Research Center base grant RR-00165, awarded by the National Center for Research Resources of the National Institutes of Health.
Published ahead of print on 14 March 2007. ![]()
Present address: La Jolla Institute for Allergy and Immunology, La Jolla, California. ![]()
Present address: Department of Statistics, Oxford University, Oxford, United Kingdom. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»