Department of Biochemistry, Case Western
Reserve University School of Medicine, Cleveland, Ohio
44106-49351; Washington State
University, Pullman, Washington 991642; and
Department of Microbiology and Immunology, Northwestern
University School of Medicine, Chicago, Illinois
606113
We have described a reconstituted avian sarcoma virus (ASV)
concerted DNA integration system with specially designed mini-donor DNA
containing a supF transcription unit, a supercoiled plasmid acceptor, purified bacterially expressed ASV integrase (IN), and human
high-mobility-group protein I(Y). Integration in this system is
dependent upon the mini-donor DNA having IN recognition sequences at
both ends and upon both ends of the same donor integrating into the
acceptor DNA. The integrated DNA product exhibits all of the features
associated with integration of viral DNA in vivo (P. Hindmarsh et al.,
J. Virol., 73:2994-3003, 1999). Individual integrants are
isolated from bacteria containing drug-resistant markers with amber
mutations. This system was used to evaluate the importance of sequences
in the terminal U5 and U3 long terminal repeats at positions 5 and/or
6, adjacent to the conserved CA dinucleotide. Base-pair substitutions
introduced at these positions in U5 result in significant reductions in
recovered integrants from bacteria, due to increases in one-ended
insertion events. Among the recovered integrants from reactions with
mutated U5 but not U3 IN recognition sequences were products that
contain large deletions in the acceptor DNA. Base-pair substitutions at positions 5 and 6 in U3 mostly reduce the efficiency of integration of
the modified donor. Together, these results indicate that sequences directly 5' to the conserved CA dinucleotide are very important for the
process of concerted DNA integration. Furthermore, IN interacts with U3
and U5 termini differently, and aberrant end-processing events leading
to nonconcerted DNA integration are more common in U5 than in U3.
 |
INTRODUCTION |
The retrovirus encodes an enzyme,
integrase (IN), that is both necessary and sufficient to catalyze the
integration of viral DNA into the host chromosome. This enzyme forms a
homodimer or a higher-order multimer, which recognizes the terminal
sequences in the U3 and U5 long terminal repeats (LTRs). In most
retroviruses, these sequences are related to one another in that they
are nearly perfect inverted repeats. The minimum number of base pairs
required for specific IN recognition varies from virus to virus and is between 10 and 20 (8, 9, 12, 15). Integration occurs as a
concerted reaction, which is dependent on bringing both the U3 and U5
LTR ends from a single viral DNA into a complex with IN and acceptor
DNA. Concerted DNA integration reactions have been reconstituted from
purified components (1, 6, 7, 16). The simplest in vitro
system utilizes purified avian sarcoma virus (ASV) IN, a host protein,
high-mobility-group protein 1 (HMG-1) or HMG-I(Y), an acceptor DNA, and
a small 300-bp donor DNA containing blunt ends with only 15 bp of the
ASV U3 and U5 LTR termini (1, 10). A comparable system has
been developed for human immunodeficiency virus type 1 IN
(10). The products from these reactions exhibit all of the
hallmarks of in vivo DNA integration. These include 3' end processing,
joining, and dependency on both LTR termini from a single donor DNA
molecule, relative sequence-independent integration into the acceptor
DNA, and small base-pair duplications of the acceptor DNA at the site
of integration (1, 10). Vora et al. (16)
described a separate ASV reconstituted integration system which
utilized purified IN, an acceptor DNA, and a 480-bp DNA substrate
containing NdeI-modified ends with two-base 5' overhangs
resembling processed U3 and U5 LTR termini. Concerted DNA products with
this system arose from integration of two donor DNA molecules into the
acceptor (16). In addition, complexes capable of
catalyzing concerted DNA integration can be isolated from
virus-infected cells. Such complexes, referred to as preintegration
complexes, were shown to contain IN and the host protein, HMG-I(Y)
(5).
Within the ASV U3 and U5 LTR terminal sequences, as with all
retroviruses, there is a highly conserved CA dinucleotide at positions
3 and 4 relative to the 3' termini. Previous in vivo studies
demonstrated that base substitutions (underlined) in viral RNA placed
adjacent to and including position 4 in the ASV U5 LTR
(CTTCATT to GAAGATT) resulted in a delay in
propagation of virus (4) most likely due to defects to
initiation of reverse transcription and integration. Surprisingly, a
smaller substitution of only two nucleotides at positions 5 and 6 (CTTCATT to CAACATT) caused a more
pronounced growth defect than the four-base substitution. Analysis of end processing and joining reactions in vitro using duplex
oligodeoxyribonucleotide substrates representing the terminal 15 bp of
wild-type and mutated U5 LTRs also suggested that 2-bp substitutions
caused a stronger defect in integration reactions than 4-bp
substitutions (4, 12). The 4-bp substitution mutant was
examined using the concerted DNA integration assay described by Aiyar
et al. (1) in reactions stimulated by HMG-1 and was found
to cause a small decrease in the efficiency of integration in vitro.
More recently, Vora et al. (17) examined the effect of the
U5 2-bp substitution using the NdeI-treated donor DNA and found that in contrast to the results in vivo, the 2-bp substitution appeared to increase the efficiency of integration in vitro. However, these and the above in vivo studies (4, 12, 17) provide little information about the effects of these mutations on the mechanism of integration.
We have now examined the effects of U5 base-pair substitutions at
positions 5 and 6 on the process of integration by using the previously
described concerted DNA integration assay (1, 10). We
found that the 2- but not 4-bp substitutions increased the apparent
frequency of integration when products were analyzed by agarose gel
electrophoresis, thereby confirming the results of Vora et al.
(17). However, combining the 2-bp U5 substitution mutation
with a donor DNA that lacks the U3 IN recognition sequence resulted in
an apparent efficiency of integration at least comparable to that of a
wild-type donor substrate. Since a donor lacking the U3 IN recognition
sequence is a poor substrate for two-ended concerted DNA integration
(1), the above finding indicates that the U5 2-bp
substitution promoted a significant increase in one-ended nonconcerted
insertion events, which resulted from disrupting the normal integration
complex. Similar but even more severe defects were obtained with a 1-bp
substitution at U5 position 5 or 6. When comparable U3 2- and 4-bp
substitutions at positions 5 and 6 and 4 to 7, respectively, were
introduced into the U3 LTR, significant decreases in the efficiency of
integration were detected and very few integrants were recovered after
biological selection. This indicates that comparable mutations in U3
and U5 behave differently.
 |
MATERIALS AND METHODS |
Bacterial strains and growth conditions.
Escherichia
coli DH5
(Bethesda Research Laboratories, Bethesda, Md.) and
MC1061/P3 (Invitrogen) were used for these studies. MC1061/P3 is a
derivative of MC1061 containing the male episome, P3, which can be
selected for the presence of an encoded kanamycin resistance gene. In
addition, P3 possesses amp(Am) and tet(Am) genes,
the expression of which can be rescued by the supF amber suppressor tRNA. Under these conditions, MC1061/P3 can be selected for
ampicillin, tetracycline, and kanamycin resistance.
Reagents.
ASV IN was prepared by G. Merkel (Fox Chase Cancer
Center, Philadelphia, Pa.) as described by Jones et al.
(11). HMG-I(Y) was purified as described by Nissen et al.
(14). HMG-1 was purified as described by Chow et al.
(3). Proteinase K (30 U/mg) and glycogen were from
Boehringer Mannheim Biochemical. Vent DNA polymerase (2 U/µl) was
from New England Biolabs (Beverly, Mass.). Oligodeoxyribonucleotides were synthesized on an Applied Biosystems synthesizer (purchased from
Midland Certified Reagent Company, Midland, Tex, or Genosys Biotechnologies Inc., The Woodlands, Tex.). Oligodeoxyribonucleotides were purified by denaturing polyacrylamide gel electrophoresis followed
by reverse-phase chromatography as previously described (1).
The oligodeoxyribonucleotides used in this study were U5(WT) (5'
AATGAAGCCTTCTGCTGGGCGGAGCCTATG 3'),
U5(CTTC
CAAC) (5'
AATGTTGCCTTCTGCTGGGCGGAGCCTATG 3'),
U5(CTTC
CTAC) (5'
AATGTAGCCTTCTGCTGGGCGGAGCCTATG 3'),
U5(CTTC
CATC) (5'
AATGATGCCTTCTGCTGGGCGGAGCCTATG 3'),
U5(CTTC
GAAG) (5'
AATCTTCCCTTCTGCTGGGCGGAGCCTATG 3'), U3(WT) (5'
AATGTAGTCTTATGCGTTGCCCGGATCCGG 3'),
U3(CTAC
GATG) (5'
AATCATCTCTTATGCGTTGCCCGGATCCGG 3'),
U3(CTAC
CATC) (5'
AATGATGTCTTATGCGTTGCCCGGATCCGG 3'),
U3(CTAC
CAAC)
(5'AATGTTGTCTTATGCGTTGCCCGGATCCGG 3'),
U3(CTAC
CTTC) (5'AATGAAGTCTTATGCGTTGCCCGGATCCGG 3'),
U3
(5'AGCAATGGCAACAACGTTGCCCGGATCCGG 3'), U5seq (5'
TTCAAAAGTCCGAAA 3'), and U3seq (5' AGAATTCGGCGTTGC 3').
U5(WT) and U3(WT) were used to prepare the wild-type donor DNA
substrate; U5(CTTC
GAAG),
U5(CTTC
CAAC),
U5(CTTC
CTAC),
U5(CTTC
CATC), U3(CTAC
GATG)
U3(CTAC
CATC),
U3(CTAC
CATC), and
U3(CTAC
CATC) were used to prepare donor
substrates with mutations in the U5 or U3 terminus, as indicated. In
each case, the sequence refers to the 3' cleaved strand of the U3 or
the U5 LTR IN recognition sequence. Sequences at the U3 terminus in the
U3 construct differ substantially from wild type. U5seq
(complementary to plasmid
vx nucleotides 116 to 130) and U3seq
(complementary to plasmid
vx nucleotides 326 to 312) were used as
sequencing primers. Donor DNAs lacking the U3 IN recognition sequence
was prepared as previously described (1).
Plasmid constructions and preparations.
Plasmid Sp2, used as
a template to amplify donor DNA, is a variation of pBCSK+
in which a wild-type ASV donor DNA PCR product was inserted into pBCSK+. This plasmid was propagated in E. coli
MC1061/P3 under the conditions described above. The integration
acceptor was plasmid pBCSK+ (Stratagene, La Jolla, Calif.),
which was propagated in E. coli DH5
. Plasmids were
purified with Qiaprep columns (Qiagen, Chatsworth, Calif.) according to
manufacturer's instructions. The growth of E. coli DH5
containing pBCSK+ was selected for by addition of
chloramphenicol (35 µg/ml).
Preparation of donor DNAs.
Integration donors were amplified
by using thermostable Vent DNA polymerase and the primers listed above.
For each PCR, 25 pmol of each primer and 50 ng of Sp2 DNA as the
template were used. Vent DNA polymerase was used according to
manufacturer's instructions. A total of 20 rounds of amplification
were performed in each reaction: 3 rounds at 94°C for 2 min, 50°C
for 1 min, and 72°C for 1 min, followed by 17 rounds at 94°C for 2 min, 57°C for 1 min, and 72°C for 45 s. The resultant product
donor DNA was isolated after electrophoresis on 2% agarose gels run in
0.5× Tris-borate-EDTA (10). The purified DNA (600 ng) was
recovered using Qiaex-II resin (Qiagen) and then precipitated with
ethanol. The recovered DNA was suspended in either TE buffer (10 mM
Tris-HCl [pH 8.0], 1 mM EDTA) or deionized distilled water. The
integration donors, which were approximately 300 bp in length, were
internally labeled during the PCR by the inclusion of
[
-32P]dCTP (3,000 Ci/ mmol, 10 mCi/ml; New England
Nuclear, Boston, Mass.). The final concentrations of
deoxyribonucleoside triphosphates during amplification reactions were
0.25 mM each unlabeled dATP, dGTP, and dTTP. The final dCTP
concentration was 0.0502 mM (12 Ci/mmol, 0.6 mCi/ml).
Standard integration reaction conditions.
The integration
reaction conditions were similar to those described by Aiyar et al.
(1). Briefly, 15 ng (0.15 pmol of ends) of donor was mixed
with 50 ng of acceptor (0.02 pmol) and 180 ng of ASV IN (6 pmol) in a
26-µl preincubation reaction mixture containing, at final
concentrations, 20 mM Tris-HCl (pH 7.5), 166 mM NaCl, 5 mM
dithiothreitol, 10% dimethyl sulfoxide, 0.05% Nonidet P -40, 1%
glycerol, 1.6 mM HEPES (pH 8.0), and 3.3 mM EDTA. The IN was diluted in
a buffer containing 30% glycerol, 0.5 M NaCl, 50 mM HEPES (pH 8.0), 1 mM dithiothreitol, and 0.1 mM EDTA. Where specified, either HMG-1 or
HMG-I(Y) was added to the reaction mixtures in the concentrations
indicated. The preincubation reaction mixtures were placed on ice
overnight. The volume of each preincubation mixture was then increased
to 30 µl with the addition of MgCl2 to a final
concentration of 6.7 mM, and the integration assay mixture was
incubated at 37°C for 90 min. The reactions were stopped by
increasing the volume to 150 µl by the addition of EDTA (final
concentration of 4.25 mM), sodium dodecyl sulfate (final concentration
of 0.44%), and proteinase K (final concentration of 0.06 mg/ml). After
digestion for 60 min at 37°C, the reaction mixtures were extracted
with phenol followed by phenol-chloroform-isoamyl alcohol (25:24:1
mixture); 17 µl of 3 M sodium acetate (pH 5.2) was added along with 1 µl of glycogen (10-mg/ml stock solution). The reaction products were
precipitated by the addition of 400 µl of 100% ethanol and washed
twice with 70% ethanol prior to electrophoresis and autoradiography.
The reaction products were separated on a 1% agarose gel run in 0.5×
Tris-borate, EDTA, and ethidium bromide at 10 V/cm for 2 h.
Following electrophoresis, gels were submerged in 5% trichloroacetic
acid for 20 min or until the bromophenol blue dye turned bright yellow.
After being washed with water, the gels were dried on DE-81 paper
(Whatman) in a Bio-Rad slab gel dryer at 80°C for approximately 2 h
under vacuum. The dried gels were exposed to autoradiographic film
overnight at
80°C in a film cassette with GAFMED TA-3 or Kodak
midspeed screens.
Preincubation of IN with donor DNA.
In general, conditions
for preincubation of IN with donor DNA were identical to those of our
standard ASV integration reactions outlined above except that the
acceptor DNA was omitted from the incubation on ice and the incubation
volume was 25 µl. The reaction volumes were increased to 30 µl with
the addition of 50 ng of acceptor (0.02 pmol) and MgCl2 as
above, and the reaction mixture was incubated at 37°C for 120 min.
Cloning and sequencing of integrants.
In all experiments,
the integration products were used directly for transformation of
bacteria after pooling several separate integration reactions.
Integration products were introduced into E. coli MCI061/P3
by electroporation in a Bio-Rad electroporator with 0.1-cm
electroporation cuvettes, 1.8-kV voltage, 25-mF capacitance, and
200-ohm resistance. The P3 episome is maintained at a low copy number.
Therefore, only 40 µg of ampicillin, 15 µg of kanamycin, and 10 µg of tetracycline per ml were required for selection. Under these
conditions, we detected no colonies after supF selection when the donor, acceptor, or donor and acceptor were electroporated into cells in the absence of IN. Plasmid DNAs were recovered from individual clones, and integration junctions were sequenced by using
primers U3seq (for sequencing the U3 junction) and U5seq (for
sequencing the U5 junction). Sequencing was performed using a Sequenase
or Thermo-Sequenase kit as instructed by the manufacturer (U.S.
Biochemical, Cleveland, Ohio).
 |
RESULTS |
Reconstitution of ASV IN-dependent integration in
vitro.
The ASV reconstituted concerted DNA integration system used
in this study (Fig. 1A and B) utilizes a
mini-donor DNA containing a supF transcription unit, a
supercoiled plasmid acceptor replicative form I (RF I), purified
E. coli-expressed ASV IN, and human HMG-I(Y) (1,
10). Products from these reconstituted reactions and their
relative positions of migration as analyzed by gel electrophoresis are
shown in Fig. 1C. The wild-type donor integrates into the acceptor DNA
(Fig. 1C, RF II product) via a concerted mechanism using mostly a
single donor DNA as depicted diagrammatically in Fig. 1A (product a).
Less than 10% of the integrants found in the RF II products were
previously shown to arise by a nonconcerted mechanism through either
single or multiple donor one-ended insertion events (Fig. 1A, products
c and e) (1, 10). Products c and d would migrate at the
same position as product a, because a single donor has been inserted;
product e would migrate more slowly since two donors are inserted into
the target. Product d is a hypothetical intermediate in which a single
donor integrates into a target, but with the ends of the donor being
inserted at distant sites. The RF III product (1) (Fig.
1C) probably arises by concerted DNA integration of two donor DNAs
integrating at the same site in the acceptor DNA as depicted in Fig. 1A
(product b). When the products from a standard integration reaction are
introduced into bacteria containing a P3 plasmid containing drug
resistance markers with amber mutations, which can be suppressed in the
presence of the supF tRNA transcription unit derived from
the donor, individual integrants can be isolated and sequenced.

View larger version (33K):
[in this window]
[in a new window]
|
FIG. 1.
Reconstitution of ASV IN-dependent DNA integration with
wild-type donor DNA. (A) A diagram of donor DNA showing the U3 (left)
and (U5) LTR sequences is shown at the top. The highly conserved CA
dinucleotide is underlined; the closed rectangle represents a
supF tRNA transcription unit. The donor and acceptor DNAs
were incubated on ice with IN and HMG proteins as described in
Materials and Methods, and the integration reaction was initiated at
37°C with the addition of MgCl2. Below are diagrammatic
representations of concerted DNA integration products resulting from
use of both LTR termini from a single donor (product a) and from use of
different LTR termini from two donors (product b) and of nonconcerted
integration products resulting from one-ended integration from a single
donor (product c), using both ends from a single donor with insertion
at different sites on the acceptor DNA (product d), and using one-ended
integration from two donors at different sites on the acceptor DNA
(product e). (B) Modified integration reaction conditions where the
acceptor DNA is introduced into the assay after preincubation overnight
at 4°C. (C) Gel electrophoresis analysis of integration products
formed with a wild-type donor DNA. Positions of RF II and RF III forms
of the acceptor DNA, the RF II and RF III products from integration of
the wild-type donor into the acceptor DNA, and possible intermediates
as shown in panel A are indicated. The radiolabel is in the donor DNA
whose migration position is at the bottom of the gel.
|
|
Analysis of U5 base-pair substitutions at positions 5 and 6.
Using this in vitro system, we analyzed the effects of introducing
base-pair substitutions into the donor at U5 positions 5 and 6 (CTTCATT
CAACATT) near the conserved CA
dinucleotide. The ability of the modified donor to undergo integration
in vitro, stimulated by HMG-I(Y), was 1.5- to 2-fold greater (Fig.
2A, lane 2) than observed with the wild
type (Fig. 2A lane 1). This is similar to the results obtained by Vora
et al. (17). However, if the U5 position 5 and 6 base-pair
substitution mutation was combined with a donor DNA lacking the U3 IN
recognition sequence, there was little or no decrease in the amount of
integration product compared to a donor with a U5-5A6A and a wild-type
U3 (Fig. 2B, lane 6). A donor DNA which contains only the wild-type U5
IN recognition sequence is considerably less efficient in supporting
integration (Fig. 2B, lane 4) than a donor with both IN recognition
sequences (Fig. 2B, lane 2) (1). A donor with a U5-5A6A,
in the absence of the wild-type U3 termini, integrates at a greater
efficiency than a donor with only a wild-type U5 terminius. Taken
together, these results indicate that the base-pair substitutions at
positions 5 and 6 alter concerted DNA integration, resulting in an
increase in single or multiple U5-mediated one-ended donor insertion
events. Note also that in each case the presence of the HMG-I(Y)
protein stimulated integration of donors containing only the one LTR IN recognition sequence (Fig. 2B, lanes 3 to 6).

View larger version (51K):
[in this window]
[in a new window]
|
FIG. 2.
Gel electrophoresis analysis of integration products
formed with wild-type and U5-substituted donor DNA. (A) Integration
reactions under the standard conditions as described in the legend to
Fig. 1A were carried out with 6 pmol of IN and 4 pmol of HMG-I(Y) with
wild-type (WT) donor DNA (lane 1) or donor DNAs containing U5-5A6A
(lane 2), U5-5A (lane 3), or U5-6A (lane 4) base-pair substitutions.
Integration reactions were also carried out with 6 pmol of IN and 4 of
pmol HMG-1 with wild-type donor DNA (lane 5) or donor DNAs containing
U5-5AS6A (lane 6), U5-5A (lane 7), or U5-6A (lane 8) base-pair
substitutions. (B) Integration reactions with (lanes 2, 4, and 6) or
without (lanes 1, 3, and 5) HMG-I(Y) using a wild-type (WT) donor
(lanes 1 and 2) or a donor DNA lacking the U3 IN recognition sequence
but maintaining a wild-type U5 (lanes 3 and 4) or U5-5A6A (lanes 5 and
6) IN recognition sequence.
|
|
One-ended or multiple one-ended donor insertion products would be lost
when introduced into bacteria. When integrants having the 5A6A
substitution were electroporated into bacteria, the number of colonies
recovered after supF selection was reduced to 25% relative
to the wild-type donor (Table 1, standard
reaction conditions with the acceptor DNA present during the
preincubation). In addition, when DNA recovered from individual clones
derived from HMG-I(Y)-stimulated reactions was analyzed by agarose gel
electrophoresis, about 20% of the RF I DNA migrated faster than an
acceptor DNA containing a donor inserted by a concerted mechanism (data
not shown). This suggested that deletions had been introduced into
these acceptors, which was confirmed by direct nucleotide sequencing as
summarized in Table 2. Sizes of the
deletions ranged from 196 to 2,736 bp (Fig.
3). The deletion products lacked
base-pair duplications at the sites of donor DNA insertion but
contained characteristic 2-bp deletion from the ends of the LTRs. When
wild-type donors were used, no such acceptor DNA deletions were
detected (10).

View larger version (10K):
[in this window]
[in a new window]
|
FIG. 3.
Deletions introduced into the acceptor DNA by
nonconcerted DNA integration of mutated donors. The solid horizontal
lines represent deletions from individual integrants as depicted above
or below the plasmid map. The plasmid acceptor DNA is as described in
the legend to Fig. 4.
|
|
Of the concerted DNA integration products recovered and sequenced, the
mutations were still present, and as with the wild type, there were
duplications introduced into the acceptor at the site of donor
insertion and 2 bp removed from the ends of the LTRs (Table 2A). Sizes
of the base-pair duplications in the acceptor and the distribution of
the sites of insertion of the U5-5A6A donors were similar to those for
the wild type (Fig. 4). When the same
U5-5A6A mutated donor DNA was analyzed in reactions stimulated by
HMG-1, an increase in total integration products was also noted (Fig.
2A, compare lanes 5 and 6). However, after biological selection, the
number of recovered integrants was reduced to 4% of the wild-type
value (Table 1, standard conditions), and those few integrants
recovered arose by a nonconcerted DNA integration mechanism introducing
deletions into the acceptor DNA (Table 2).

View larger version (26K):
[in this window]
[in a new window]
|
FIG. 4.
Sites of concerted integration of wild-type and mutant
donor ASV DNA. The locations and lengths of flanking duplications of
plasmid DNA for integrants using wild-type (A) and mutant ASV donor
DNAs with base pair substitutions U5-5A6A (B), U5-5A (C), U5-6A (D),
U3-5T6A (E), and U3-6A (F) as described in Tables 2 to 5 are presented.
Data for wild-type ASV are from Aiyar et al. (1) and
Hindmarsh et al. (10). The plasmid acceptor data are drawn
in a linear representation to scale. The genes of the plasmid and
origin of replication (ori) are indicated by open boxes. MCS, multiple
cloning site. The thick vertical lines represent numbers of nucleotides
in the plasmid DNA. Each thin vertical line represents a separate
sequenced integration event. Lengths of the thin vertical lines
represent 4-, 5-, 6-, or 7-bp duplications of the acceptor DNA as
indicated.
|
|
Analysis of U5 base-pair substitutions at position 5 or 6.
A
similar in vitro analysis was performed with donor DNAs that contained
single-base-pair substitutions at position 5 or 6. In contrast to the
results described above, there was no difference in the total
integration products compared to wild type in HMG-I(Y)-stimulated reactions analyzed by agarose gel electrophoresis (Fig. 2A, lanes 3 and
4, respectively). Nevertheless, when introduced into bacteria, the
number of recovered colonies compared to wild type was reduced to 15%
(Table 1, standard conditions) for both mutant donors, and the majority
of those integrants recovered from bacteria and sequenced contained
deletions in the acceptor DNA (Table 3). We detected only one concerted DNA integration product with a donor DNA
containing a base-pair substitution at position 5 and one with a
base-pair substitution at position 6, though in the latter instance
there was a concomitant 5-bp deletion in the U5 IN recognition
sequence.
Preintegration incubation of IN with donor DNA.
We considered
the possibility that formation of one-ended or other nonconcerted DNA
integration products with U5-modified donor DNA (Fig. 2B) may be caused
by a failure of IN to form multimeric complexes that bring the two LTR
termini together. Since IN is a strong nonspecific DNA binding protein,
we tried to correct the defect by increasing the relative specific
activity of IN to the donor DNA ends by omitting the acceptor DNA from
the preincubation on ice (Fig. 1B). It was added instead after the
preincubation period along with MgCl2 to start the reaction
at 37°C as described in Materials and Methods. Under these modified
conditions using a wild-type donor, the total number of colonies
recovered from bacteria was reduced to about one-third of the number
found under standard conditions, which included the acceptor DNA in the
first preincubation on ice. All of the recovered integrants sequenced were formed by a concerted integration reaction (Table 1). When integrants were derived from reactions with donors containing base-pair
substitutions at positions 5 and 6 or at either position 5 or 6 of the
U5 IN recognition sequence, under the modified preincubation conditions, the recovery of colonies approached wild-type levels. Also,
most sequenced integrants arose by a concerted mechanism; this was
particularly noticeable in reactions with U5-5A6A-substituted donor and
HMG-1, U5-5A-substituted donor and HMG-I(Y), and U5-6A-substituted donor and HMG-I(Y) (Table 4). The sites
of insertion in the acceptor DNA were again widely distributed (Fig.
4), and the sizes of the base-pair duplications were similar to those
obtained with a wild-type donor DNA.
View this table:
[in this window]
[in a new window]
|
TABLE 4.
Sites of integration, in the presence of HMG-I(Y) or
HMG-1, with donor DNA containing U5-5A6A, U5-5A, or U5-6A
substitutions preincubated without acceptor
DNAa
|
|
Analysis of U3 base-pair substitutions at positions 5 and 6 and at
positions 4 to 7.
The U3 and U5 LTR termini are related to one
another by being nearly perfect inverted repeats, and position 5 is one
of three mismatched base pairs. We therefore prepared a series of
U3-modified LTR substrates, which inverted the base pairs at position 5 and/or position 6, to determine whether this caused a similar defect observed with comparable substitutions introduced into U5. As seen
after agarose gel electrophoresis (Fig.
5), for reactions with HMG-I(Y) using the
standard assay conditions that included the acceptor in the
preincubation, there is a substantial decrease in integration compared
with wild type to less than 20% with the 2-bp substitutions (U3-5T6A)
(compare lanes 1 and 2). Base-pair substitutions of positions 4 to 7 also caused a significant decrease in the efficiency of integration in
vitro (compare lanes 1 and 3). In contrast, a single-base-pair
substitution at position 5 in U3 caused about a 50% reduction in
integration (lane 7) relative to use of a wild-type donor (lane 9). A
single-base-pair substitution at position 6 had little detectable
effect on the efficiency of integration (lane 8). Similar results were
obtained when HMG-1 was used instead of HMG-I(Y) (lanes 4 to 6).

View larger version (61K):
[in this window]
[in a new window]
|
FIG. 5.
Effects of U3 LTR base-pair substitutions on integration
into acceptor DNAs. Integration reactions as described in the legend to
Fig. 2 were stimulated by HMG-I(Y) (lanes 1 to 3 and 7 to 9) or HMG-1
(lanes 4 to 6). Lanes 1, 4, and 7, wild-type (WT) donor DNA. Donor DNAs
contained U3-5T6A (lanes 2 and 5), U3-4G5T6A7G (lanes 3 and 6), U3-5T
(lane 7), or U3-6A (lane 8) base pair substitutions.
|
|
When integration products from HMG-I(Y)-stimulated reactions with
mutant U3 donors were separately introduced into bacteria, individual
integrants were recovered only for the 2-bp substitutions at positions
5 and 6 (U3-5T6A) and the 1-bp substitution at position 6 (U3-6A). The
numbers of colonies recovered were reduced to approximately 11 and 35%
relative to wild type (Table 1) for the U3-5T6A and U3-6A donors,
respectively. No integrants were recovered for the single-base-pair
substitution at position 5 (U3-5T donor). Because of the very low
efficiency in collection of colonies, IN reactions were scaled up
fourfold to obtain the clones whose sequences are shown in Table
5. Of the recovered integrants that were
sequenced, all integrated by a concerted mechanism (Table 5). This was
evidenced by the characteristic duplication of the acceptor DNA at the
site of donor insertion. The distribution of integration sites in the acceptor and the sizes of the base-pair duplications at the sites of
insertion for the U3-5T6A mutant were similar to wild-type results
(Fig. 4). Similar results were obtained with HMG-1-stimulated reactions
except that fewer colonies were recovered (Tables 1 and 5).
View this table:
[in this window]
[in a new window]
|
TABLE 5.
Sites of integration, in the presence of HMG-I(Y) or
HMG-1, with donor DNA containing U3-5T6A or U3-6A
substitutionsa
|
|
Base-pair deletions introduced into the donor DNA.
We
had previously reported that when a 4-bp substitution was introduced
into U5, a small number of concerted DNA integrants sequenced contained
deletions in the donor DNA that were not observed with a wild-type
substrate (1, 10). Such integrants all arose by a
concerted DNA integration mechanism since they had the characteristic base-pair duplication in the acceptor DNA. In this study, of the 97 concerted DNA integrants that were sequenced, 14% contained deletions
in the donor DNA (Tables 2 to 5). When base-pair substitutions were
placed in position 5 or 6 or at positions 5 and 6 in U5, 6 of 67 concerted integrants contained the same 7-bp deletion in U3, to use the
first internal GA dinucleotide for integration. One integrant contained
a U5 deletion of 5-bp and used the first internal CA dinucleotide that
was introduced by the base-pair substitution at position 6. Of the 25 integrants sequenced with U3 base-pair substitutions at positions 5 and
6 or position 6 alone, 5 contained base-pair deletions in the U3 region
of the donor. One had the same 7-bp deletion as seen with U5 mutations using the GA dinucleotide, another had a 12-bp deletion to utilize an
internal CA dinucleotide, and three had a 16-bp deletion using a CG
dinucleotide for integration (Table 5). Thus, even though the CA
dinucleotide is a highly conserved feature of an IN recognition sequence, it is not absolutely required for integration in this reconstituted system. For all but one LTR deletion integrant described above, the deletions appeared in the U3 IN recognition sequence regardless of whether the original site of mutation was in U3 or in U5.
The reason for this asymmetry is not known. These LTR deletions were
not observed in experiments using a wild-type donor.
 |
DISCUSSION |
IN, in the form of a dimer or larger multimer, forms a complex
with the host cell acceptor DNA and the viral donor DNA. The positioning of IN on the acceptor allows for staggered breaks to be
introduced, which determines the length of the base-pair duplications
at the sites of integration. By juxtaposing both ends of the donor DNA
together into the complex, the processing of the ends is closely linked
(13) and the mechanism of integration is concerted.
Introduction into either LTR terminus of mutations that alter the
interaction with IN and that disturb the formation of multimer
complexes that bring the two LTR ends together might affect the
coordination of end processing and result in a significant increase in
one-ended and other nonconcerted DNA integration events. Base-pair
substitutions introduced at positions 5 and/or 6 in the U5 LTR termini
have this phenotype. Integration in vitro with these modified U5 donors
is largely independent of a second LTR terminus on the same donor,
consistent with a disruption of the normal integration complex
resulting in a preference for one-ended insertion events. This explains
the increase in total RF II integrated products detected by gel
electrophoresis analysis concomitant with a significant reduction in
recovered integrants when introduced into bacteria. The gel analysis
does not discriminate between one- or two-ended insertion events (see
Fig. 1C), while only the latter would be recovered from bacteria. Thus,
the 2-bp positions in the U5 IN recognition sequence adjacent to the
conserved CA are very important for the mechanism of concerted DNA integration.
The concerted integration phenotype in vitro can be rescued by
increasing the molar ratio of IN to donor DNA ends in the preincubation on ice. While we can drive the formation of the presumed multimeric complex of IN and donor DNA, it is at a cost of sharply reducing the
efficiency of the in vitro integration reaction to a third of the
wild-type level. Nevertheless, under these conditions almost all of the
integrants recovered arose by a concerted mechanism.
Another consequence of introducing substitutions at positions 5 and/or
6 in the U5 IN recognition sequence can be seen among the integrants
recovered from the standand reaction conditions. Approximately 20% of
the sequenced integrants contained deletions in the acceptor. These
events were not observed with a wild-type donor. The deletions observed
were sizable, occasionally excising the majority of the acceptor
plasmid, maintaining only segments of the ColE1 origin of replication
(Fig. 3). This is explained by the fact that deletions introduced into
the origin of replication would result in the loss of plasmid in the
bacteria and hence loss of integrants. This is consistent with the
finding that we also do not detect donor insertions into this region
(Fig. 4). Deletions in the acceptor DNA could arise by several
mechanisms. For example, both ends from a single donor DNA could
integrate into the same acceptor DNA but at distal sites (Fig. 1A,
hypothetical product d). Alternatively, two different donors could
integrate into the same acceptor also at distal sites (product e). How
the deletions are physically introduced is not known, but the relative positioning of the divergent insertion sites probably defines the size
of the deletion.
In this study, we have found that single-base-pair substitutions at U5
position 5 or 6 caused more of an integration defect in vitro than the
combination of the 2-bp substitutions in the same donor. This is
evidenced by the percent decrease in recovery of colonies after the
biological selection for each mutant donor reaction under the standard
reaction conditions and by the sequences of recovered integrants. Among
those sequenced, we found only nonconcerted DNA products with large
acceptor deletions from reactions with the U5 donor base-pair
substitutions at position 5 or 6. In contrast, we found both
nonconcerted, with large acceptor deletions, and concerted DNA
integration products from reactions with the U5 donor base-pair
substitutions at positions 5 and 6. For U5 donor base-pair
substitutions at positions 4 to 7, we previously reported that we
detect only concerted DNA integration products (1). Thus,
the four-base U5 substitution leaves the mechanism of integration
mostly intact, while the overlapping two-base substitution at positions
5 and 6 causes a significant change in integration mechanism. These
results are consistent with in vivo analysis of the comparable
mutations introduced into the ASV genome, where the substitutions at
positions 5 and 6 caused a greater delay of virus growth than the
substitutions at positions 4 to 7 (4).
Similar substitutions at positions 5 and/or 6 in the U3 IN recognition
sequence under the standard assay conditions caused reductions both in
total integration products observed on the gels and in recovered
integrants from the bacteria. In fact, the reductions caused by the U3
mutations were greater than found for comparable substitutions placed
in the U5 IN recognition sequence. Since it is known that the wild-type
U3 IN recognition sequence is processed by ASV IN more efficiently than
the U5 IN recognition sequence, this result is not surprising. This
finding does highlight the difference in recognition of the terminal U3
and U5 sequences by IN, but an explanation for these differences awaits
structural information of IN with a bound substrate.
The in vitro reconstituted system used in this study employs HMG
proteins as cofactors. Several HMG proteins, including HMG-1, -2, and
-I(Y), stimulate integration in vitro (1, 10). In this
study, we have found that there is a quantitative difference among the
different HMG proteins in response to the base-pair substitutions at
positions 5 and 6 in U5. If there is a tendency of an in vitro
integration system to favor nonconcerted integration in the presence of
HMG-1 or HMG-2, this tendency can be partly reversed by the presence of
HMG-I(Y) (reference 10 and this study). Coupled with the
finding of HMG-I(Y) in preintegration complexes (5), these
results are consistent with HMG-I(Y) rather than HMG-1 serving as a
cofactor for integration in vivo. The DNA binding domains of the HMG-1
and HMG-I(Y) proteins have markedly different three-dimensional
structures and somewhat different DNA binding properties
(2). HMG-1 proteins interact in a sequence-independent manner with the minor groove of DNA, whereas HMG-I(Y) proteins bind
preferentially to the minor groove of AT-rich regions of B-form DNA.
Therefore, one possible explanation for the above differences could be
that HMG-I(Y) but not HMG-1 can recognize and preferentially bind
AT-rich stretches, which are abundant in the donor termini
(2). We have previously shown that HMG-I(Y) and IN do not
interact directly in a ternary complex in which each is bound to the
DNA (10). We suspect that HMG proteins function in vivo
and in vitro as either DNA bending proteins, aiding in bringing the two
ends of the donor together, or alternatively by melting the ends of the
LTR termini and facilitating the end-processing step of integration.
The finding that HMG-I(Y) proteins stimulate integration of donor DNAs
lacking one of the two LTR IN recognition sequences suggests that the
mechanism by which HMG proteins act is more through facilitating end
processing and joining than by bending DNA.
This work was supported in part by National Institutes of Health
research grants CA-38046 (J.L.) and GM46352 (R.R.).