Previous Article | Next Article ![]()
Journal of Virology, March 2003, p. 3838-3845, Vol. 77, No. 6
0022-538X/03/$08.00+0 DOI: 10.1128/JVI.77.6.3838-3845.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Department of Microbiology and Immunology,1 Department of Medicine, Pennsylvania State University College of Medicine, The Milton S. Hershey Medical Center, Hershey, Pennsylvania 170332
Received 9 September 2002/ Accepted 19 December 2002
|
|
|---|
2 helix of the central domain of the human immunodeficiency virus type 1 integrase, affected the choice of nonviral target DNA sites. We have now extended these findings to the integrases of a nonprimate lentivirus and a more distantly related alpharetrovirus. We found that substitutions at the analogous positions in visna virus integrase and Rous sarcoma virus integrase changed the target site preferences in five assays that monitor insertion into nonviral DNA. Thus, the importance of this protein residue in the selection of nonviral target DNA sites is likely to be a general property of retroviral integrases. Moreover, this amino acid might be part of the cellular DNA binding site on integrase proteins. |
|
|---|
![]() View larger version (27K): [in a new window] |
FIG. 1. Integrase assays. The names of the assays are shown above the horizontal arrows, and the key aspects of the readouts (e.g., the position of the radiolabel or the pairing of PCR primers) are shown below the arrows. The CA bases near the 3' ends of viral DNA are shown in boldface, the terminal nucleotides are indicated by NN, asterisks or solid circles indicate 32P groups, and P1, P2, and P3 refer to primers. (A) Processing, during which integrase makes a site-specific nick to form a labeled product two nucleotides shorter than the substrate. (B) DNA joining or strand transfer, during which integrase inserts processed viral DNA ends into any of various sites on either DNA strand to yield labeled products longer than the substrate. (C) Insertion, during which integrase inserts processed viral DNA ends into any of various sites in plasmid DNA (which is shown in its linear form); the joined products are then identified by PCR with primers P1 and P2, followed by a nested runoff reaction with 5'-radiolabeled primer P3. (D) Nonspecific alcoholysis, during which integrase uses certain nucleophilic alcohols (shown as ROH) to nick nonviral DNA at any of various sites; the labeled products will be simple oligonucleotides if the substrate is labeled at the 5' end but will have attached R groups if the substrate is labeled near the 3' end.
|
Within the central domain of integrase, many amino acids can be replaced without affecting target site preferences (2, 13). However, we recently identified residue 119, the second amino acid in the
2 helix in the central domain of HIV-1 integrase, as strongly affecting the choice of nonviral target DNA sites (13). This residue was identified by a novel approach that involved screening a large set of patient-derived HIV-1 integrase variants for alterations in nonviral target site selection, comparing the sequences of proteins that exhibited similar target site preferences, and using this information to guide mutagenesis of a laboratory HIV-1 integrase (13). In fact, HIV-1 integrases with any of five different amino acids (Ser, Thr, Gly, Ala, or Lys) at position 119 exhibited five different patterns of target site selection in nonviral DNA. To test the hypothesis that these results are generalizable to other retroviral integrases, we have now assessed the role of the analogous protein residue in the integrases from a nonprimate lentivirus (visna virus) and a more distantly related alpharetrovirus (Rous sarcoma virus [RSV], formerly classified in the Oncovirinae retrovirus subfamily). Because the preferred sites of viral DNA insertion can differ depending on whether Mn2+ or Mg2+ is present during reactions (13), it was important that each of these divalent metal cations be used for these analyses.
To introduce amino acid substitutions into integrase, we used the QuikChange site-directed mutagenesis system (Stratagene, La Jolla, Calif.) with pQE-30 plasmids (Qiagen, Chatsworth, Calif.) that encoded the wild-type integrases, as described previously (13, 41). The entire integrase-coding region for all proteins was confirmed by DNA sequencing, and proteins were purified from M15[pREP4] bacteria (Qiagen) (13). The purified proteins were tested under conditions known to optimize activity for visna virus and RSV integrase, including the use of oligonucleotides derived from the U3 end of viral DNA (21, 22, 42). Conditions for the standard oligonucleotide-based assays (Fig. 1A, B, and D) were as described previously (13) but included either 10 mM Mn2+ or 5 mM Mg2+. In the case of visna virus integrase and its derivatives, reactions that included Mg2+ were supplemented with 30% dimethyl sulfoxide (DMSO) because of our previous demonstration that this maneuver enhances the Mg2+-dependent activity of visna virus integrase (31). For the plasmid insertion assays (Fig. 1C) (26, 34), double-stranded 30/32-mers representing the preprocessed U3 end of viral DNA were incubated under the same reaction conditions but in the presence of 0.5 µg of
X174 DNA (Invitrogen, Carlsbad, Calif.) that had been linearized with SstII. The insertion reactions were stopped by fivefold dilution into 40 µl of 10 mM Tris-HCl (pH 8.0)-1 mM EDTA, and 7 µl was transferred to a tube for PCR. PCR was conducted in 25-µl reaction mixtures that contained 10 mM Tris-HCl (pH 8.3), 50 mM KCl, 0.2 or 0.4 mM each deoxynucleoside triphosphate, 2 mM MgCl2, 1 U of Taq polymerase (Fisher, Pittsburgh, Pa.), and 15 pmol each of primer P1, which matched the viral donor DNA strand (for visna virus, 5'CAGGGTAGGCATTTGTTCTCTGTCCTGACA3'; for RSV, 5'AAGACTACAAGAGTATTGCATAAGACTACA3'), and primer P2, which was derived from
X174 (5'GGCGACCATTCAAAGGATAAACAT3'). The reaction mixtures underwent 35 cycles at 95°C for 45 s and 65°C for 3 min, with a final extension at 72°C for 10 min. Subsequently, 3 µl of each PCR was transferred to a 10-ul nested runoff reaction that contained 0.24 pmol of 5' 32P-labeled
X174 primer P3 (5'GGCAGTCGGGAGGGTAGTCGG3') and 1 U of Taq polymerase under the same buffer conditions as for PCR but for one cycle of 95°C for 2.5 min, 55°C for 4 min, and 72°C for 20 min. All reactions were analyzed by autoradiography after electrophoresis on denaturing polyacrylamide gels (20% gels for the processing, joining, and alcoholysis assays and 6% gels for the PCR-based insertion assay).
Effects of amino acid substitutions in visna virus integrase.
Visna virus integrase shares 89 (32%) of its 281 amino acids with HIV-1 integrase (22). Although there is no structural information available for the ovine protein, there is particularly strong sequence homology near the region that aligns with the
2 helix of HIV-1 integrase, with four of the six amino acids at or near this helix being identical (Fig. 2). We therefore made single-amino-acid substitutions at visna virus integrase position 121, which corresponds to HIV-1 integrase position 119, to examine the effects of these changes on target site selection. In particular, we changed the proline at position 121 to alanine (P121A) or glycine (P121G), both of which were included in the original set of five amino acids at HIV-1 position 119 that we had studied earlier.
![]() View larger version (22K): [in a new window] |
FIG. 2. Amino acid numbering for the central region of different integrases. A schematic of the central domain of HIV-1 integrase is shown with the locations of the 3 acidic residues of the active-site D,D-35-E motif indicated (the number 35 refers to the spacing between the last two conserved residues of this motif). Amino acid sequences of the indicated region from HIV-1, visna virus, and RSV are shown, with dashes indicating identity to the HIV-1 sequence. Numbers that mark landmarks, or are referred to in the text or in this legend, are provided. Boxes indicate the ß4 strand, 2 helix, and 3 helix as defined in the original papers that described the HIV-1 and RSV crystal structures (4, 9). The dashed arrows highlight HIV-1 residues D116 and S119; the latter also is marked with an asterisk. Solid circles indicate amino acids in the visna virus and RSV proteins for which substitutions were made. The HIV-1 sequence shown and used for these studies is that of BH10, which is related to HXB2 but has an S instead of G at position 123 and a K instead of R at position 127 (35, 36). It differs from the NL4-3 strain used for the HIV-1 crystal structure described by Dyda et al. (9) by having an I rather than V at position 113 and an A rather than T at position 124. The source of the visna virus and RSV integrases have been described previously (41), and the sequences presented are invariant between known strains.
|
![]() View larger version (102K): [in a new window] |
FIG. 3. Target site selection by visna virus integrases. Autoradiograms of denaturing polyacrylamide gels are shown. (A) Processing assay. Double-stranded 18-mers derived from the U3 end of visna virus DNA were 5' labeled on the strand that contains the conserved CA and incubated with protein buffer (lanes 1 and 5), wild-type (WT) visna virus integrase (lanes 2 and 6), or visna virus integrases that contained the indicated amino acid substitutions (lanes 3, 4, 7, and 8). Reactions 1 to 4 were conducted with 10 mM Mn2+, and reactions 5 to 8 contained 5 mM Mg2+ and 30% DMSO. Nucleotide sizes are shown at the right, and the 18-mer substrate and 16-mer product are indicated. (B) Joining assay. Double-stranded 30/32-mers representing the preprocessed U3 end of visna virus DNA were 5' labeled on the strand that contains the conserved CA and incubated with the same proteins and conditions as described for panel A. The 30-mer substrate and strand-transfer products are indicated. Lanes 3 and 4 were loaded with 50% as many counts per minute as the other lanes. (C) Insertion assay. Double-stranded 30/32-mers representing the preprocessed U3 end of visna virus DNA were incubated with plasmid DNA and the joined products were revealed by PCR and a nested runoff reaction. Two PCR results are shown for each of the indicated proteins but only from reactions conducted with Mg2+; similar results, but different patterns, were obtained in reactions with Mn2+. (D) Nonspecific alcoholysis assay. Double-stranded 23-mers of nonviral sequence were 5' labeled on one strand and incubated with the same proteins as described for panel A in reaction mixtures that included 10 mM Mn2+ and 40% glycerol. Lane M has sequence-specific markers (20); lengths in nucleotides are shown at the left, and the sizes of prominent products created by the wild-type visna virus integrase are circled. Bands indicated by dashes or open or filled circles or squares in panels B, C, and D are discussed in the text.
|
To complement the above results, in which the enzymes used different target DNA molecules but similar nucleophilic donor molecules, we assessed the target site preferences of the proteins under conditions in which they use different nucleophilic donor molecules (Fig. 1D). As previously established (23), wild-type visna virus integrase uses various exogenous alcohols to nick nonviral DNA at every site except those near DNA ends. However, this integrase creates a signature pattern on the 23-mer substrate used in the assay such that the most prominent bands are detected at the 5, 15, and 19 positions (Fig. 3D, lane 2) (these bands are indicated by the circled numbers in the figure). When the P121A and P121G proteins were tested on this substrate, the preferred cleavage patterns were reproducibly different from each other and from that of the wild-type enzyme. For example, these proteins produced bands with greater intensity at the 11 position (Fig. 3D, lanes 3 and 4 [filled circle]) and lower intensities at the 8, 9, and 17 positions (open circles). The 5'-labeled products shown in Fig. 3D arose from the unique alcoholysis activity of integrase and not from contaminating nucleases, because attachment of the attacking nucleophilic alcohol to the appropriate sites was documented in assays that used a 3'-labeled substrate (Fig. 1D and data not shown) (23). The distinct target site preferences of the three visna virus integrases were further confirmed in similar assays that used a different nonviral DNA substrate that was 42 bp in length (data not shown). Although these assays were conducted only with Mn2+ because of the inefficiency of nonspecific alcoholysis with Mg2+, the results strengthen the conclusion that visna virus integrases with any of three amino acids at position 121 exhibit three different patterns of target site preference in nonviral DNA.
Effects of amino acid substitutions in RSV integrase.
To extend these results to a nonlentivirus, we assessed the target site preferences of avian retroviral integrases containing single-amino-acid substitutions. RSV integrase shares 72 (25%) of its 286 amino acids with HIV-1 integrase (1), and there is particularly strong sequence homology near the region that aligns with the
2 helix of HIV-1 integrase, with 8 of the 9 amino acids at or near this helix being identical (Fig. 2). By this alignment, HIV-1 integrase residue 119 corresponds to RSV integrase residue 124. However, unlike the situation with visna virus, crystal structures of the RSV integrase central domain are available (4). Surprisingly, some secondary structures are slightly misaligned relative to the sequence homology (Fig. 2) (1, 4, 9). In particular, HIV-1 residues 118 to 121 align by sequence with RSV amino acids 123 to 126 but RSV amino acids 124 to 127 comprise its
2 helix. Thus, HIV-1 integrase residue 119 also corresponds to RSV residue 125, because each is the second amino acid within this helix. We therefore made single-amino-acid substitutions at each of the RSV positions that correspond to HIV-1 position 119 by changing the serine at position 124 to alanine (S124A, similar to one of the substitutions made for HIV-1 and visna virus) and the cysteine at position 125 to asparagine (C125N [which makes this stretch of nine amino acids identical to those of HIV-1]). We also changed the phenylalanine at RSV position 126 (which is identical in the alignments of all three sequences) to a tyrosine (F126Y).
As previously established, RSV integrase removed two or three nucleotides from the 3' ends of RSV viral DNA ends in the presence of Mn2+ (converting an 18-mer to a 16-mer or 15-mer, as shown in Fig. 4A, lane 2) but specifically removed the terminal two nucleotides in the presence of Mg2+ (Fig. 4A, lane 7) (41). We found that each of the new proteins with single-amino-acid substitutions (S124A, C125N, and F126Y) exhibited a level of processing activity comparable to that of the wild-type RSV integrase, whether reactions were conducted in the presence of Mn2+ or Mg2+ (Fig. 4A). To compare the patterns of joined DNA products, we used a longer, preprocessed substrate derived from the 32 bp at an RSV DNA end but lacking the two nucleotides after the CA (Fig. 4B). As expected, RSV integrase created distinct strand-transfer patterns in reactions with either of the two divalent cations (Fig. 4B, lanes 2 and 7). However, of the three singly substituted proteins, only the S124A protein created novel patterns that clearly differed from those of the wild-type protein. Under Mn2+-dependent conditions, the most prominent band created by the S124A protein (Fig. 4B, lane 3 [open circle]) differed from the most prominent band created by the wild-type, C125N, and F126Y proteins (Fig. 4B, lanes 2, 4, and 5 [filled circle]). This difference was underscored by comparing the intensities of the band at the filled circle to that at the open circle for each protein, which yielded a ratio of only 0.8 for the S124A protein but greater than 1.5 for the other three proteins. Differences in intensities of minor bands near the top of the lanes (Fig. 4B [filled triangles]) also reproducibly distinguished the S124A protein from the wild-type enzyme. In contrast, the C125N and F126Y mutants created patterns of joined products that were very similar to that of the wild-type RSV except for slight enhancement of one minor band with C125N (Fig. 4B, lane 4 [dash]). For reactions performed with Mg2+, differences for the S124A protein again were clearly evident, with certain bands having greater or lesser intensity than those seen with the wild-type pattern (Fig. 4B, lane 8 [some examples are indicated by the filled and open squares, respectively, which mark three sets of adjacent bands]). The distinctiveness of the S124A pattern is underscored by comparing the intensities of the slower-migrating band and the faster-migrating band in each of these sets of bands, yielding ratios of 2.4, 5.6, and 1.8 for the pairs near the top, middle, and bottom, respectively, of lane 8; in contrast, the relative intensities for these three pairs of bands were 0.4, 1.2, and 0.6, respectively, for the wild-type protein. The C125N and F126Y proteins had ratios for these pairs of bands very similar to those for the wild-type enzyme. In fact, the C125N and F126Y proteins (Fig. 4B, lanes 9 and 10) exhibited patterns of target site preferences that were superimposable with that of the wild-type enzyme. The strand-transfer patterns of the four RSV enzymes, both with Mn2+ and with Mg2+, were highly reproducible between multiple experiments.
![]() View larger version (106K): [in a new window] |
FIG. 4. Target site selection by RSV integrases. Details and reaction conditions were as described in the legend to Fig. 3, except that the viral DNA substrates were derived from RSV and reactions conducted with Mg2+ did not contain DMSO. (A) Processing assay. Reactions were incubated with protein buffer (lanes 1 and 6), wild-type RSV integrase (lanes 2 and 7), or RSV integrases with the indicated amino acid substitutions (lanes 3 to 5 and 8 to 10). (B) Joining assay. Lanes 4 and 10 were loaded with approximately twofold more counts per minute than the other lanes. (C) Insertion assay. Two PCR results are shown for each of the indicated proteins but only from reactions conducted with Mg2+; similar results, but different patterns, were obtained in reactions with Mn2+. (D) Nonspecific alcoholysis assay. The prominent products created by the wild-type RSV integrase are circled at the left. Bands indicated by the dash or geometric symbols in panels B, C, and D are discussed in the text.
|
The S124A protein also exhibited altered target site preferences in the nonspecific alcoholysis assay. The signature pattern for wild-type RSV integrase includes frequent nicks at positions that are 8, 12, 15, and 19 nucleotides from the 5' end of the 23-mer substrate used in this assay (Fig. 4D, lane number 2 [prominent bands are indicated by the circled numbers]). However, the S124A mutant reproducibly exhibited much less preference for the 8 and 12 positions (Fig. 4D, lane 3 [open circles]) but enhanced cleavage at the 7 and 11 positions (lane 3 [filled circles]). In contrast, the nicking patterns created by the C125N and F126Y proteins were indistinguishable from that of the wild-type enzyme (Fig. 4D, lanes 4 and 5). The 5'-labeled products in Fig. 4D were confirmed to have arisen from the unique alcoholysis activity of integrase by the use of a 3'-labeled substrate, as described earlier (data not shown). Similar results were found when the enzymes were tested on a 42-bp nonviral substrate; i.e., the S124A protein made a novel pattern and the C125N and F126Y proteins were indistinguishable from the wild-type enzyme (data not shown). Thus, the S124A substitution altered the patterns of target site selection in several assays (Fig. 4B [with either metal], Fig. 4C [with either metal], and Fig. 4D), whereas the C125N substitution only had a minor effect in one assay (Fig. 4B with Mn2+) and the F126Y protein was indistinguishable from the wild-type protein in all of the assays.
Implications. In the absence of structural data from crystals of integrase with bound DNA (either viral or nonviral), other approaches are necessary to identify parts of the enzyme that interact with substrate DNA. Accumulating evidence supports a role for the central domain of integrase in interactions with cellular DNA (2, 10, 12, 14, 15, 24, 25, 38). In particular, we recently found that HIV-1 integrases with Ser, Thr, Gly, Ala, or Lys at position 119 exhibited five different patterns of target site selection in nonviral DNA. These results were obtained in oligonucleotide joining assays conducted with Mn2+ or Mg2+ and in the Mn2+-dependent nonspecific alcoholysis assay. It has subsequently been found that these proteins also exhibit five different patterns of viral DNA insertion into plasmid DNA in the PCR-based assay in the presence of both Mn2+ and Mg2+ (A. L. Harper and M. Katzman, unpublished data). HIV-1 integrase with a Pro substitution at position 119 also has been shown in oligonucleotide and plasmid insertion assays to have altered target site preferences (2). The new data demonstrated in the present report extend these findings to other integrases. In particular, visna virus integrases with Pro, Ala, or Gly at the analogous position in its central region exhibited three different patterns of target site preferences in nonviral DNA when tested in all five assays that monitor target site selection (i.e., joining of oligonucleotides in the presence of Mn2+ or Mg2+, insertion of DNA ends into plasmid DNA in the presence of Mn2+ or Mg2+, and nonspecific alcoholysis in the presence of Mn2+). Moreover, changing RSV integrase amino acid 124, which aligns by sequence with HIV-1 position 119, also altered the target site preferences in the five assays. Thus, this position in the central domain of various integrases either interacts with, or influences interactions with, nonviral DNA during catalysis. That these data were obtained with two lentiviruses and an alpharetrovirus suggests that this finding is a general property of retroviral integrases and similar enzymes.
The idea of the importance of the residue analogous to HIV-1 position 119 is strengthened by the finding that substitutions at RSV integrase amino acids 125 and 126 had minimal or no effect on the choice of nonviral target sites. RSV integrase with an F126A substitution was also shown to have wild-type target site preferences by another group (28). In fact, substitutions at more than one-quarter of the HIV-1 integrase central domain residues, including simultaneous replacement of most or all of the amino acids in the
3 or
4 helices, have been shown not to affect target site selection (2, 13). The significance of finding similar roles for HIV-1 and RSV amino acids that align by sequence (i.e., HIV-1 position 119 and RSV position 124), but not for amino acids that align by structure (i.e., HIV-1 position 119 and RSV position 125, each of which is the second amino acid in the
2 helix), is unclear. Perhaps other substitutions at RSV position 125 would reveal a greater influence on target site selection. Alternatively, it should be noted that the available structural information comes from crystals of only parts of these proteins. It is possible that structural elements in the full-length proteins, especially when bound to DNA and in the context of the milieu within a cell nucleus, more closely match the sequence alignment. Additional crystallographic information should help resolve this issue.
Our data show that integrases tolerate several amino acids at the position analogous to that of HIV-1 residue 119 without impairing the efficiency of their catalytic activities, even though their nonviral target site preferences are altered. Future studies should correlate these in vitro findings with the target site preferences of preintegration complexes that mediate integration in vivo. However, it is worth noting that natural HIV-1 isolates contain several amino acids at integrase position 119 (18, 27, 37, 40), and our preliminary data indicate that RSV virions with the S124A substitution replicate at levels similar to that of wild-type virus (Harper and Katzman, unpublished). Indeed, viral fitness is unlikely to be adversely affected by nonviral target site preferences that do not cause frequent integration into sites critical for cell survival.
It is significant that the results demonstrated in this report were obtained with both Mn2+ and Mg2+. Although all integrases exhibit greater activity in standard in vitro assays conducted with Mn2+ and most are inactive with Mg2+ (the RSV and related avian myeloblastosis virus integrases being exceptions), it is generally presumed that Mg2+ is the more relevant divalent metal cation within cells. Moreover, target site preferences in nonviral DNA depend upon which metal is available to integrase. In contrast, specificity for nicking viral DNA during processing generally does not change as a function of the divalent cation (31), although the avian integrases frequently nick between the conserved C and A as well as 3' to the CA in reactions that use Mn2+. In this regard, it is interesting that the RSV S124A protein was reproducibly more specific at removing the terminal two nucleotides under Mn2+-dependent conditions than was the wild-type enzyme (e.g., Fig. 4A, lane 3 [note the greater selectivity for making the 16-mer rather than 15-mer product]). Thus, under certain conditions, side chains at this protein position can also influence the choice of target sites in viral DNA. This observation underscores the fact that processing and joining use related mechanisms, despite being distinct catalytic events that are separated in time and space (i.e., processing can occur in the cell cytoplasm and joining occurs in the nucleus). Perhaps the amino acid at this position influences the flexibility of the enzyme as a function of the divalent metal in the active site. Such a possibility is likely, given the relative flexibility of the RSV
2 helix (29) and the proximity of RSV residue 124 to residue 121, an active-site residue implicated in binding the metal cofactor (3, 5).
Although other parts of integrase might affect interactions with cellular DNA (12, 22, 24, 25, 38), our data show that one particular residue in the central region strongly affects nonviral target site selection. Whether this effect involves direct binding to nonviral DNA or is mediated indirectly is unknown. However, other data are consistent with the idea that this amino acid is part of the cellular DNA binding site on the integrase protein. In particular, cross-linking studies suggest that the nearby residue 117 in HIV-1 integrase contacts the phosphodiester backbone of cellular DNA (15). Moreover, analyses of crystal structures indicate that residue 119 is very close to the three conserved residues of the D,D-35-E active-site motif and that all four of these residues are on the surface of the protein (9, 11, 30). Thus, the interaction between HIV-1 integrase residue 119 and cellular DNA might provide a new target for antiretroviral agents.
|
|
|---|
helices within the core domain of retroviral integrase in selecting target DNA sites for integration. J. Biol. Chem. 276:45848-45855.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»