Previous Article | Next Article ![]()
Journal of Virology, February 2003, p. 1904-1915, Vol. 77, No. 3
0022-538X/03/$08.00+0 DOI: 10.1128/JVI.77.3.1904-1915.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Department of Pharmacology,1 Gene Therapy Center,2 Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 275993
Received 26 July 2002/ Accepted 4 November 2002
|
|
|---|
|
|
|---|
The AAV genome is composed of linear single-stranded DNA
4.7 kb long (42, 49), and while AAV2 is the most thoroughly characterized AAV, studies of other serotypes are under way (10, 12, 38). These include monkey-specific AAV4, which was recently cloned and found to have a high degree of homology to AAV2 (12). The AAV4 genome has two open reading frames, like that of AAV2, and codes for similar Rep proteins (Rep78 [78 kDa], -68, -52, and -40). For example, the AAV4 Rep68 protein has 90% amino acid identity to AAV2 Rep68, with 5% of the divergent amino acids being similar in hydrophobicity or structure (12). There is more divergence in capsid proteins (Vp1, -2, and -3), with only 60% homology at the amino acid level. In the inverted terminal repeats (ITRs), similarity between the putative hairpin structures in the two viruses is retained, but the AAV4 ITR sequences diverge at two specific regions important in Rep binding. The AAV4 ITR Rep binding element (RBE) is expanded, with four direct GAGY tetramer repeats (GAGTGAGTGAGCGAGC), not three perfect GAGC tetramers as seen in type 2 (12, 30). Additionally, type 4 virus diverges at another possible Rep contact site, the flip hairpin secondary structure, having a 5'-CTCTG-3' at the apex (12, 40). The exact importance of these changes and their impact on the virus life cycle are still unknown. However, the interaction of the AAV2 Rep proteins with the RBE, spacer, and terminal resolution site (trs) of both AAV2 and human ch19 has been previously characterized. DNase I footprinting of type 2 Rep68 binding to linear AAV2 demonstrated protection of 32 nucleotides (nt), which included the RBE, the 13-nt spacer, and the trs (11), and evidence of Rep-specific interaction with nucleotides of these sites was confirmed by chemical interference (40). Similar to the linear ITR, DNase I footprinting of the ch19 site demonstrated 33 nt (non-trs strand) protected by type 2 Rep68 (24). Additionally, other studies suggest that stable binding is required for Rep to be functionally active (8, 31). In vivo studies have determined the biological importance of these sequences in targeting AAV2 to ch19 (13, 14, 27, 28, 59). Therefore, it is not clear how changes in these important sequences will affect targeted integration. Also, sequence divergence is not restricted to type 4 virus. The type 5 ITR has four direct GAGY repeats, one being imperfect (GAGCCAGCGAGCGAGC). The trs of this serotype is also distinct, as is the spacer between the RBE and the trs, which has an additional 5 nt. The impact of these sequence changes on AAV replication has been measured by the inability of either type 2 or 5 Rep to cleave the other's ITR substrate (10). No studies have determined the impact of these divergent ITR elements on site-specific integration.
Central to AAV2 site-specific integration is the presence of these elements (RBE, spacer, and trs) on human ch19 (41, 50). Not only are they present, but each component is required for type 2 targeted integration (13, 14, 27, 28, 59). Similar to type 2 Rep cleavage activity, evidence for nicking and covalent linkage on ch19 during targeted integration has been reported (22, 23). Almost all experiments characterizing AAV latency have been performed using the AAV2 serotype and human cell lines (6, 13, 14, 20, 26, 28, 39, 48). Demonstration of AAV2 site-specific integration in nonhuman targets has generally employed "knock-in" animal models carrying the human ch19 sequences (39, 58). The biology of the monkey-specific AAV4, with its expanded RBE, has not been thoroughly studied, nor has the role of the simian chromosomal site-specific integration locus. Only Southern analysis, which demonstrated the presence of sequences with homology to the ch19 targeting locus in the monkey genome, has been carried out (45). To date, no information about the suitability of this sequence for AAV site-specific integration has been obtained. With the known divergence of the type 4 ITR, it was of specific interest to determine the exact sequence present in the monkey genome and, more importantly, to test the abilities of these elements to carry out site-specific integration as seen with type 2 in human cells.
In this report, we have identified an AAV integration locus in CV-1 cells from the African green monkey and characterized its interaction with the AAV4 Rep68 protein. The AAV4 integration site has perfect homology to the human trs and spacer; however, the simian RBE diverges, adding two additional GAGC tetramer repeats, creating five perfect direct repeats. We constructed an AAV4 Rep68-histidine fusion protein (AAV4 Rep68H) and demonstrate that this protein will specifically bind the simian RBE with a sixfold increase in affinity compared to the human RBE. Analysis of binding by methylation interference (MI) suggested that the increase is due in part to specific interaction of Rep68H with the extra guanine (G) nucleotides of the expanded RBE, as well as an increased interaction overall. Electron microscopy (EM) analysis demonstrated sequence-specific protein-DNA complexes indistinguishable from those generated using type 2 Rep and human DNA substrates. These in vitro studies were followed up by in vivo analysis, which demonstrated AAV site-specific integration into the simian Cos-7 cells, establishing that the simian RBE, spacer, and trs function analogously to the human ch19 integration locus. It is interesting that we consistently observed integration frequencies in Cos-7 cells that were twofold greater than those in HeLa cells. Although further studies are required to rule out the potential roles of additional genomic elements, variability in type 2 and type 4 Rep proteins, or host proteins that may influence targeted integration, the expanded simian RBE was the only obvious difference. Overall, these results establish AAV site-specific integration in the African green monkey and are consistent with previous studies that determined that the RBE, spacer, and trs DNA elements individually influence site-specific integration.
|
|
|---|
Cloning the African green monkey AAV integration site. pMkRBE contains the RBE, spacer, and trs from the African green monkey genome. The plasmid was constructed by TA cloning a PCR-amplified fragment (302 bp) from CV-1 cells into pCR2.1 (Invitrogen). CV-1 cells were maintained in Dulbecco's modified Eagle's medium (DMEM) supplemented with 10% fetal calf serum and penicillin-streptomycin. PCR was performed as described by the manufacturer with TaKaRa Ex Taq (PanVera Corp.) and 100 ng of CV-1 genomic DNA in a standard reaction mixture supplemented with 30 mM MgCl2 and deaza-dCTP. Temperature cycling consisted of 2 cycles of 97°C for 30 s, 58°C for 30 s, and 72°C for 1.2 min, 35 cycles of 94°C for 1 min, 58°C for 30 s, and 72°C for 1 min, and 1 cycle of 94°C for 1 min, 58°C for 30 s, and 72°C for 7 min. PCR was performed initially with the primers 5'-GACATCGCACCGCCCGCCCG-3' and 5'-TACTTACTTACTTACCC-3'. Then a second round of PCR was performed with the primers 5'-GACATCGCACCGCCCGCCCG-3' and 5'-TGACTTCAGGCGTCGA-3'. The clones were sequenced at the University of North Carolina (UNC) Sequencing Facility.
Viral propagation and purification. AAV4 was purchased from the American Type Culture Collection and amplified in Cos-7 cells as previously described (12). The Cos-7 cells were maintained in DMEM supplemented with 10% fetal calf serum and penicillin-streptomycin. The cells were grown at 37oC in a 5% CO2 humidified environment. The virus was purified using a cesium chloride (CsCl2) gradient, and titers were determined by dot blotting (3).
AAV4 Rep68H protein production. AAV4 pRep68H contains the gene for the AAV4 Rep68 protein inserted into pQE-70 (Qiagen). This vector places a six-histidine tag on the C terminus of the protein. To clone the AAV4 Rep68 gene, PCR was performed using Takara Ex Taq polymerase in a standard reaction mixture containing AAV4 genomic DNA (100 ng) with the oligonucleotides 5'-TCTATCTAGACTTGGCCACTCCCTCTCTGCGCGC-3' and 5'-AGGCCTTAAGAGCAGTCGTCCACCACCTTGTTCC-3'. The unspliced gene was then digested with SphI and BglII (all restriction enzymes were purchased from New England BioLabs) and ligated into pQE70 according to the manufacturer's instructions. To construct the intron minus AAV4 Rep68H, the clone was digested with AatII and BglII and ligated to the correctly spliced C-terminal synthetic DNA. The oligonucleotides used to construct the C-terminal synthetic DNA were 5'-GATCTGAGAGTTGTCCTCTAGCCAACCTGTCCGCGTAGTCCACCGGAGCTTCCGCGTCTGACGT-3' and 5'-CAGACGCGGAAGCTCCGGTGGACTACGCGGACAGGTTGGCTAGAGGACAACCTCTCA-3'.The AAV4 Rep68H gene was sequenced at the UNC Sequencing Facility. The amino acid sequence was wild type except for the six C-terminal histidines and a proline-to-alanine change at amino acid 536. The AAV4 and AAV2 Rep68H proteins were expressed and purified from SG13009 (Qiagen) as previously described (58).
EMSAs.
Competition electromobility shift assays (EMSAs) were performed as previously described (31). Briefly, DNA substrates (200 bp) were made by digesting pMkRBE with the restriction enzymes HindIII-AatII and digesting pRE2 with PvuII-AatII; then, the fragments were gel purified following standard procedures (3). For experiments using synthetic DNA as the substrate, oligonucleotides (41 bp) were annealed. Rep68H (1 pmol) was placed in a 20-µl binding reaction mixture containing cold competitor DNA, 10 mM HEPES-KOH [pH 7.5], 8 mM MgCl2, 40 mM KCl, 0.2 mM dithiothreitol, 8% glycerol, and 0.1 µg of poly(dI-dC)/µl. Next, a [
-32P]ATP end-labeled DNA substrate (6,000 cpm) was added, and the reaction mixture was incubated for 20 min. Separation was done at 90 V for 1 to 2 h using nondenaturing 4% polyacrylamide gel electrophoresis (PAGE). The free and bound fractions were quantified using a PhosphorImager (Molecular Dynamics).
DNase I protection assay.
Protection assays were performed as previously described (3, 31). Briefly, DNA substrates were made by restriction enzyme digestion of pMkRBE and pRE2 with BspE1-DraIII and BspE1-EcoRI, respectively. Rep68H (15 pmol) was incubated with the [
-32P]ATP end-labeled DNA substrate (50,000 cpm) in the EMSA binding buffer for 20 min at room temperature. Then, DNase I (diluted to 0.5 U/µl) was added and allowed to hydrolyze the DNA for 30 to 60 s at room temperature. The sample was separated using 10% denaturing PAGE and run at 1,600 V for approximately 1 h, and then the gel was exposed to film (Kodak).
MI.
For the MI assay, 300-bp substrates were produced by restriction enzyme digestion of pMkRBE and pRE2 with BspE1-AatII and BspE1-AccI, respectively. MI assays were performed as previously described (3, 40). Briefly, after being [
-32P]ATP end labeled, the substrate was treated with dimethyl sulfate (Sigma). The Rep protein was incubated with the methylated substrate (100,000 cpm) and electrophoresed as in the EMSA protocol. The free and bound fractions were gel purified (3); then, Maxam-Gilbert chemical sequencing was performed (3). The bands were quantified using a PhosphorImager, and the relative band intensity ratio for each G residue was plotted as previously described (40).
EM. EM analysis was performed at the UNC EM Core Facility as described previously (58). Purified AAV4 Rep68H was incubated with a 2.2- or 2.3-kb DNA fragment containing the African green monkey or human ch19 RBE, respectively. The binding reaction used a 1:1 mass ratio of protein-DNA, and the conditions for binding were the same as for the EMSA.
Detection of latent AAV. Cos-7 and HeLa cells were transfected with either pDD-neo, pDD-puro, or pDD-GFP alone or with AAV4 or AAV2 pHIVRep68 using Superfect reagent (Qiagen). All cells were maintained in DMEM supplemented with 10% fetal calf serum and penicillin-streptomycin. Colonies transfected with either pDD-neo or pDD-puro were selected with G418 and puromycin, respectively (3, 56). After selection, genomic DNA was extracted following standard protocols (3). For GFP-expressing cells, a modular flow cytometer (MoFlo; Cytomation Inc.) at the UNC Flow Cytometer Facility was used to detect fluorescence and to place GFP-expressing cells into individual wells. To demonstrate the presence of latent AAV, 100 ng of the DNA was analyzed by PCR as previously described (52). Briefly, Deep Vent Exo(-) polymerase (New England Biolabs) and the primers 5'-CGGGGAGGATCCGCTCAGAGGACA-3' and 5'-CGGCCTCAGTGAGCGAGCGAGC-3' were used along with genomic DNA in standard PCR. The temperature cycle was 99°C for 1 min, followed by 35 cycles of 99°C for 10 s and 72°C for 4 min. A fraction of the PCR product was blotted and probed with a ch19 probe. As a negative control, each primer was run separately in a reaction with genomic DNA to quantify the background signal. Colonies were positive for integrated AAV if a signal greater than that of background was obtained for the PCR (53).
|
|
|---|
![]() View larger version (30K): [in a new window] |
FIG. 1. Sequence alignment of the AAV integration site from human ch19 and the 302-bp DNA fragment identified in the African green monkey genome (Monkey) as cloned from CV-1 cells (Vector NTI software; InforMax). The three elements of the AAV integration locus, the RBE, spacer, and trs, are underlined. The human ch19 RBE has three direct GAGC tetramers (dark shading), while the simian RBE is expanded, having two additional GAGC tetramers (light shading). The simian trs is also darkly shaded, and an 8-bp spacer is located between this element and the simian RBE, just as in the human ch19. Sequence homology is indicated by dots, dashes represent gaps in the sequence, and nonconsensus bases for the monkey sequence are given (bottom line).
|
![]() View larger version (62K): [in a new window] |
FIG. 2. (A) Reciprocal DNase I protection analysis of AAV2 and AAV4 Rep68H proteins on the human ch19 and African green monkey DNA substrates containing the RBE (underlined), spacer, and trs complement (underlined). The [ -32P]ATP end-labeled substrate (non-trs strand) was incubated with 15 pmol of Rep before DNase I digestion and denaturing PAGE analysis. For lanes G+A, substrate DNA was chemically sequenced using formic acid. For lanes Substrate, substrate DNA was treated with DNase I without any addition of Rep. (B) DNase I results are summarized for human (top) and simian (bottom) sequences. The bar indicates the bases fully protected by both AAV2 and AAV4 Rep68H proteins. The RBE and nicking site are both underlined.
|
70% (16 of 23 molecules) of the fragments had single AAV4 Rep68H bound to the simian RBE. All of the remaining 30% (7 of 23 molecules analyzed) had multiple Rep proteins, generally two, with one Rep complex binding the RBE and another bound nonspecifically to the DNA and not in contact with the RBE-bound Rep. Similar results were observed for the AAV4 Rep68H protein bound to the human ch19 RBE (data not shown).
![]() View larger version (76K): [in a new window] |
FIG. 3. EM analysis of AAV4 Rep68H binding to the AAV site-specific integration loci from African green monkeys (A) and human ch19 (B). The simian fragment was 2.2 kb long and contained the 302-bp simian site from CV-1 cells, with the expanded simian RBE located 0.4 kb from one terminus. The human ch19 DNA fragment was 2.3 kb in length, with the RBE located 0.8 kb from one terminus. For both substrates, the AAV4 Rep68H protein covered an average of 60 bp as previously reported (58).
|
Rep68H relative affinity to human and simian chromosomal DNAs.
Next, we determined the relative affinities of the proteins to the human and simian RBEs by performing competitive EMSAs. The 302-bp fragment containing the simian RBE was [
-32P]ATP end labeled, and the binding of either AAV2 or AAV4 Rep68H to this substrate was competed away by cold (either the simian or human RBE) competitor DNA (Fig. 4A). In these reciprocal assays, the concentrations of Rep68H proteins and the lengths of the DNA fragments, for the substrate and competitor, were held constant. In addition, to accurately compare affinities, the competition curve was performed over a broad range of cold-competitor concentrations. Using this range, we could perturb binding to the labeled DNA only slightly or completely abolish it, providing an accurate determination of the relative equilibrium dissociation constants (15, 46). The AAV4 and AAV2 Rep68H proteins demonstrated a 6- and 10-fold increase in relative binding affinity, respectively, for the simian RBE compared to the human RBE (Fig. 4B). To further explore the possibility that the increased affinity was due to a greater number of tetramer repeats, EMSA experiments were conducted using synthetic DNA substrates. For these, the AAV2 Rep68 protein was used to bind substrates containing either three (3x), four (4x), five (5x), or seven (7x) direct repeats of the GAGC tetramer. The results are represented graphically in Fig. 4C and suggest that relative affinity increases as the number of repeats increases, up to five tetramers. At that point, the relative affinities of Rep for the 5x and 7x substrates were essentially equivalent (Table 1). The fivefold increase in relative affinity of the AAV2 Rep protein binding the 5x versus the 3x substrate was less than the 10-fold increase demonstrated above. This may be due to the short fragment length (41 bp) of the synthetic DNA substrates destabilizing binding. For both experiments, however, the basis for the increase in affinity for substrates having a greater number of tetarmers was not apparent, considering that the simian and human substrates produced nearly identical footprints. Next, we compared the base-specific interactions of the Rep proteins on the human ch19 and the simian DNA counterpart.
![]() ![]() View larger version (69K): [in a new window] |
FIG. 4. (A) Reciprocal competition EMSAs were performed to determine the relative affinities of AAV4 Rep68H to the simian (left) and human (right) DNA fragments. The African green monkey substrate (200 bp) was [ -32P]ATP end labeled and titrated with cold competitor DNA (200 bp), either the simian fragment or the human fragment, before the addition of the AAV4 Rep68H protein (1 pmol). For lanes Rep-, no Rep protein was added. Separate experiments using the AAV2 Rep68H protein were conducted in an identical manner. (B) After the bound and free fractions were quantified (using a PhosphorImager), the bound fraction was graphed as a function of the cold-competitor concentration, simian fragment (open squares) and human fragment (solid squares). (C) Additionally, AAV2 Rep68 was used in competition EMSAs with synthetic DNA as the substrate, and the results were graphed. The synthetic substrates (41 bp) contained either three (3x), four (4x), five (5x), or seven (7x) direct repeats of the GAGC tetramer. The substrates and their relative affinities to cold competitor 3x are given in Table 1. For all experiments, the competitor concentrations required to achieve a 50% reduction in the fraction bound were compared for each protein, and the relative binding affinity is the ratio of these competitor concentrations.
|
|
View this table: [in a new window] |
TABLE 1. Synthetic RBE substrates
|
![]() View larger version (61K): [in a new window] |
FIG. 5. (A) Reciprocal MI analysis of the AAV4 and AAV2 Rep68H proteins binding the human ch19 (left) and simian (right) sequences. The substrates (non-trs strand) were [ -32P] end labeled and methylated before being incubated with either of the Rep proteins (2 pmol). Then, the bound and free fractions were purified and cleaved before separation by denaturing PAGE. For lanes Substrate, methylated substrate DNA was run without any addition of the Rep protein. For lane G+A, substrate DNA was chemically sequenced using formic acid. The long and short vertical lines next to lanes Substrate indicate the GAGC tetramers and the trs location, respectively. Because of variable amounts of substrate in each lane, the concentration of substrate was normalized by quantification of bands outside the Rep binding site before graphical analysis. (B) MI graphical representation of AAV4 Rep68H and AAV2 Rep68H binding either the methylated simian substrate (solid bars) or the methylated human substrate (shaded bars). The curved lines on the graph indicate symmetrical patterns produced by the interaction of the Rep proteins with the substrate. The sequences for the simian and human RBE, spacer, and trs are given beneath the graphs, and the GAGC tetramers are in boldface. F/B, free/bound ratio.
|
![]() View larger version (60K): [in a new window] |
FIG. 6. AAV4 Rep68 mediates the site-specific integration of an AAV2 ITR construct targeting the African green monkey RBE, spacer, and trs in Cos-7 cells. The cells were cotransfected with AAV4 pHIV-Rep68 and pDD-neo and selected for G418 resistance before PCR analysis. Primers ch19/AAV represent a PCR using one primer (ch19) that anneals to the simian genome and a second primer (AAV) that anneals to the AAV2 ITR sequence. After PCR, the reaction product was blotted and hybridized to a ch19 probe. The AAV and ch19 primers were used alone as negative controls in separate PCRs.
|
![]() View larger version (41K): [in a new window] |
FIG. 7. Cos-7 and HeLa cells were cotransfected with pHIV-Rep68 and pDD-GFP and monitored for GFP expression for 105 days. Three to 5 days posttransfection, GFP-expressing cells were sorted by fluorescence-activated cell sorter into individual wells. Cells transfected with pDD-GFP alone were also monitored. The data are graphed as the fraction of green fluorescent cell colonies to the total number of colonies as determined on different days.
|
|
|
|---|
Architectures of human and simian integration sites. In the African green monkey, the spacer and trs sequences are identical to the human ch19 sequence. However, the simian RBE has five rather than three direct repeats of the GAGC tetramer. This expansion of the RBE sequences has also been noted in the AAV4 ITR, where there are four rather than three GAGY repeats, as seen in AAV2. The expansion of the RBE sequence in both the simian chromosome and the AAV4 ITR suggests coevolution between virus and host. Although the simian targeting locus and the type 4 viral ITR have diverged from their human counterparts (ch19 and the AAV2 ITR, respectively) with respect to the RBE, the spacer and trs are conserved. The AAV2 and -4 ITRs, compared to the corresponding targeting loci (monkey and human ch19), contain different spacer sequences (13 versus 8 nt, respectively). It is not clear why both the human and monkey chromosomal loci have 13-nt spacers while AAV2 and -4 have evolved to carry only 8 nt. Surprisingly, AAV5, which was isolated from humans, has an 18-nt spacer which is distinct from those of all other AAV ITRs and from those of the human and simian chromosomal counterparts. Of the AAV serotypes, type 5 virus is the only example where the spacing between the RBE and trs is not conserved. In addition, Rep protein derived from serotypes 1 to 4 will utilize all other AAV ITRs except type 5 ITR for replication substrates, and type 5 Rep is unable to utilize type 1 through type 4 ITRs for replication (unpublished observations and reference 10). These observations beg the question of how the AAV origin evolved (human to virus or virus to human) (8). In fact, using information concerning the number of RBEs present in a serotype-specific ITR, one can strongly suggest the species of origin. When we compared all of the ITR sequences available (serotypes 1 to 7) against type 2 with the hope of identifying the origin of monkey versus human isolates, type 5 appeared similar to type 4 in that the ITR also carries an expanded repeat (10). If this is correct, type 5 maybe the result of a nonhuman primate virus that jumped into humans. More importantly, the newly described types 6 and 7 have sequence identity to type 1 ITRs. This may represent human isolates that have infected monkey colonies (16). Recently, it was demonstrated that a viable nicking site on the viral substrate is not necessary for site-specific recombination (59). It will be interesting to determine if type 5 can carry out targeted integration, as well as to determine what cis-acting elements may be required. In addition, to these sequence differences, we demonstrated that both AAV4 and AAV2 Rep68H proteins have greater affinity for the expanded RBE that is at least in part due to the greater interaction with the guanine bases of the added tetramers. We also documented that the simian locus could serve as a suitable target for site-specific integration in the monkey genome. Based on our in vitro and in vivo data, the expansion of the simian chromosomal RBE did not hamper the type 2 targeting frequency and may have influenced integration in a positive way (a twofold increase in targeting in Cos-7 versus HeLa cells). Although not specifically addressed in this study, it would be interesting to determine if an expansion of the RBE in the AAV2 ITR, such as occurs in AAV4, will also affect targeting. Elucidation of these factors will contribute to clarifying the integration mechanism and the complex relationship between the AAV ITR and chromosomal sequences.
Rep68-DNA stoichiometry. The AAV2 and AAV4 Rep68H proteins demonstrated greater affinity for the expanded simian RBE, and initially we presumed that a greater number of Rep molecules may be binding the five directly repeating GAGC tetramers. Recent studies, using a Rep protein mutated in the multimerization domain, suggested that the ability of Rep to form multimers is critical to its function (8, 22, 32, 53, 55). Rep forms a multimeric protein complex when bound to the RBE, spacer, and trs, and evidence for Rep78 forming dimeric to hexameric structures on binding the AAV2 ITR has been reported (36, 47). Our DNase I protection and EM data demonstrated that the footprints Rep produced on binding either simian or human sequences were almost identical in size, suggesting that the numbers of Rep molecules forming the protein complex are equivalent. This is in agreement with our laboratories' EM results, which suggested that equivalent numbers of molecules form a Rep complex whether binding the AAV2 ITR, the human RBE, the simian RBE, or nonspecific DNA (2, 57, 58). Previously, Ryan et al. examined Rep interaction with mutated RBEs and suggested that the RBE, spacer, and trs contain a single Rep recognition site (40). Together, the data suggest that Rep does not regulate its action by forming different multimeric protein structures at its various DNA binding sites. Despite this, most investigators performing EMSAs on Rep-RBE interactions have noted the appearance of multiple (generally three to six) bands or laddering in the bound fraction. McCarty et al. suggested these bands corresponded to either multiple DNA molecules bound to a single Rep complex, multiple Rep complexes bound to a single DNA molecule, or different numbers of Rep molecules forming the Rep complex. Based on DNase I protection and EM data (2, 57, 58), a likely cause of this laddering, which occurs readily at high concentrations of Rep, may be multiple Rep complexes (with identical stoichiometries) binding a single DNA molecule. Additionally, DNA conformation transitions, such as bending or looping, may be occurring (15). Under certain conditions, the presence of Rep-induced DNA bending can be observed by EM analysis. It is unknown whether these DNA conformation changes are artifacts or have some biological significance (2, 57, 58). Protein-induced DNA bending is known to be associated with many types of protein-DNA interactions, including recombination, and additionally is known to increase a protein's binding affinity for DNA (15, 51). In the case of Rep, an increase in binding affinity for the simian RBE may positively relate to integration frequency in African green monkey cells. It is also possible that the looping previously observed by EM analysis (57, 58) affects AAV targeting by altering Rep-ch19 interaction in some manner. We suspect that these DNA conformational changes are Rep induced and as such warrant further investigation.
Rep68-RBE interaction. In agreement with previous studies, we demonstrated that Rep has greater affinity for substrates with increased numbers of the GAGY tetramer (55). Our competition EMSAs demonstrated that AAV4 and AAV2 Rep68H proteins have 6- and 10-fold-greater affinity, respectively, for the expanded simian RBE than for the human ch19 RBE. Interestingly, we performed EMSAs with synthetic DNA containing as many as seven copies of the GAGC tetramer and determined that with more than five directly repeating tetramer copies, the relative affinity does not significantly increase (Fig. 4C). We believe that this result is due to the inability of Rep to contact more than five GAGY tetramers in series, as demonstrated by DNase I protection analysis, and its inability to form protein complexes with variable stoichiometries. This observation may indicate the importance of three, four, and five direct repeats located in the type 2 and 4 viruses and the simian targeting locus, respectively. It is of interest to note that the AAV2 ITR and human ch19 retain similar numbers of GAGY tetramers (three each), whereas the AAV4 ITR and the simian targeting locus are different (four versus five, respectively). Compared to humans, monkeys as a group are evolutionarily more divergent from each other, and additional sequence analysis of targeting loci from related and unrelated primates, as well as ITR analysis of new monkey serotypes, should facilitate the investigation of the importance of these RBE expansions.
cis-acting sequences and emerging integration models. The basis for the greater affinity of Rep for the simian RBE was demonstrated by MI, where both AAV2 and AAV4 Rep68H proteins specifically contacted all five GAGC tetramers, producing a quantitatively stronger interaction than with the human RBE. The interaction was symmetrical about the RBE as previously described (40) and demonstrated significant sequence contacts within the spacer element. This increased interaction at the chromosome spacer may indicate the importance of Rep contact within the spacer region, as recently indicated by the in vitro integration assay (34). In that study, an RBE, trs, and spacer of human ch19 were present in an Epstein-Barr virus shuttle vector, and it was demonstrated that mutating or removing the spacer significantly decreased or abolished integration. A developing model for Rep-ch19 interaction proposes that each elementthe RBE, spacer, and trshas a particular function in AAV site-specific integration. In summary, the number of GAGC tetramers contained in the RBE affects the affinity of Rep and its ability to locate the target site, while the spacer may be important in orienting the Rep complex toward the trs.
AAV integrates site specifically, targeting the simian RBE. Our study clearly demonstrated that the simian sequence could serve as a locus for site-specific integration when transfection assays were carried out with AAV plasmid integration substrates. Site-specific recombination in the simian genome (using Cos-7 cells) was demonstrated using AAV2 ITR neo constructs with AAV4 Rep68 and AAV2 ITR puro or GFP constructs with AAV2 Rep68. Using these substrates, we demonstrated an approximately twofold increase in targeting in Cos-7 versus HeLa cells (Table 2). Comparatively, targeting efficiency in Cos-7 cells was consistently greater than in HeLa cells. Additionally, the 42 and 36% targeting efficiencies in HeLa cells are consistent with previous reports (reviewed in reference 43). Although in this study we compared the efficiency and frequency of targeting for a relatively small number of cells (Fig. 6 and Table 2), all of the cells (HeLa and Cos-7) were treated in an identical manner to provide the most accurate comparison, and in the case of ITR-GFP constructs, there was no drug selection; instead, cell sorting was employed to normalize for transfection differences.
|
View this table: [in a new window] |
TABLE 2. Summary of AAV targeting frequencies in Cos-7 and HeLa cells as determined by PCR analysis
|
90%, and additionally, we enriched for positive GFP-expressing cells at day 5 postinfection by sorting. Therefore, diluting out cells carrying AAV proviral DNA appeared unlikely. We also noted a slow decline over months in cells expressing GFP, which is inconsistent with the dilution of episomal templates. Cells lost fluorescence intensity, suggesting a low turnover rate for the GFP protein within the cell (known half-life, 3 to 4 weeks). Regardless, our experiments were designed to compare relative integration frequencies in the Cos-7 and HeLa cells, and the results from these studies suggest that frequencies of integration in both cell lines are relatively low. This is in agreement with the results of other laboratories, which have also reported low integration frequencies in human cell lines (
3%) with recombinant AAV2 constructs (reviewed in reference 43). Previously in our laboratory, low integration frequencies were also observed when long-term expression from the LacZ reporter gene was monitored in HeLa cells for >18 months (AAV2 Rep68-mediated targeting). In that study, 1% of the HeLa cells stably expressed the LacZ gene product 70 days after transfection, and this percentage continued unchanged for the entire 18 months of the experiment (56). In the present study, we did not have access to the AAV4 ITRs, and the integration analysis was derived from type 2 and 4 Rep68 proteins using AAV2 ITRs. This combination may have affected the true targeting frequency in simian cells (type 4 in Cos cells). unique The experimental results were consistent for the three constructs (pDD-neo, pDD-puro, and pDD-GFP), and integration in the simian genome was consistently twofold greater than in the human genome. A similar relationship has been recognized in transposon integration, where higher affinity of the transposase for its DNA binding site correlates to greater targeting frequency (29). Comparing recombinant AAV integration in human and simian cells, other than an increased Rep binding affinity for the expanded simian RBE, we are unaware at this point of any mechanisms that may be contributing to the increase in targeting frequency, although many other factors (e.g., host proteins) could be at play. However, the RBE expansion in the African green monkey may make the site inherently more distinctive in the genome, providing more information for Rep and, we suspect, giving Rep68 a greater functional specificity (4, 5, 54). It will be interesting to investigate the AAV integration sites in other nonhuman primates to determine if the RBE expansion is universal in this diverse group. In addition, testing of artificially generated targeting loci whose RBEs, as well as other critical cis-acting elements, vary will eventually define the importance of each of these sequences in facilitating AAV site-specific recombination and the future development of AAV-targeting vectors.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»