Previous Article | Next Article 
Journal of Virology, August 1999, p. 7061-7064, Vol. 73, No. 8
0022-538X/99/$04.00+0
Copyright © 1999, American Society for Microbiology. All rights reserved.
The Integration Machinery of ZAM, a Retroelement
from Drosophila melanogaster, Acts as a
Sequence-Specific Endonuclease
Pascal
Leblanc,
Bernard
Dastugue, and
Chantal
Vaury*
Unité INSERM U384, 63001 Clermont-Ferrand, France
Received 18 December 1998/Accepted 7 May 1999
 |
ABSTRACT |
Retroviruses and retrotransposons insert into the host genome with
no obvious sequence specificity. We examined the target sites of the
retroelement ZAM by sequencing each host-ZAM junction in the genome of
Drosophila melanogaster. Our overall data provide compelling evidence that ZAM integration machinery recognizes and leads
to ZAM insertion into the sequence 5'-GCGCGCg-3'. This unique property of ZAM will facilitate the development of new tools to
study the integration process of retroelements.
 |
TEXT |
Integration of proviral DNA into
host chromosomes is an essential part of the retroviral life cycle.
This step in the replication cycle is mediated by integrase, a key
enzyme encoded by the element itself (10).
During the early stages of infection, the integrase protein processes
the termini of viral long terminal repeats (LTRs) by endonucleolytic
cleavage of the 3'-terminal dinucleotides. Integration of this
processed proviral DNA occurs by staggered transesterification of the
recessed 3' ends and the phosphodiester backbone of the target DNA. It
has been demonstrated for some retroviruses that the cleavage reaction
is performed within a high-molecular-weight preintegration complex
(2, 18). Preintegration complexes are transported to the
nucleus, and the proviral DNA is integrated into the host chromosome.
Retrotransposons and retroviruses are structurally related and have
similar modes of integration into host DNA. They display various
degrees of bias in selecting target sites for integration in vivo. The
mechanism and factors governing these preferences are not completely
documented, but host proteins are known to affect selection of the
insertion site through changes in DNA structure or by interacting
directly with the integration machinery. For example, yeast
retroelements such as Ty1 and Ty3 target genomic regions lying
immediately upstream of genes transcribed by polymerase III, and Ty5
targets transcriptionally silent regions of the yeast genome (6,
14-16). Furthermore, sequence data from large numbers of
integration sites for Ty1 (3), human immunodeficiency virus (HIV) (12), and Gypsy (4) have revealed weak
consensus sequences, but no retroelement that displays strict sequence
specificity in its target has been described.
The ZAM retroelement is an 8,435-bp retrovirus-like element present
within the genome of Drosophila melanogaster
(17). On the basis of sequence, structural, and functional
similarities, ZAM displays a striking resemblance to vertebrate
retroviruses (for a review, see reference 7). It has
three open reading frames analogous to the retroviral gag,
pol, and env genes surrounded by LTRs. The ZAM
pol gene is subdivided into three regions, which encode
typical retrovirus-like enzymes: protease, reverse transcriptase-RNase H, and integrase (IN). The latter polypeptide displays all the characteristics of canonical retroviral IN (10). It contains three domains: a zinc finger amino-terminal motif (HHCC), a core or
catalytic domain characterized by the DD35E motif, and a
carboxy-terminal part of the protein which displays a high basicity
similar to the DNA binding domain of retroviral integrases. All of
these characteristics suggest that the mechanism of ZAM integration resembles that of retroviruses.
In this paper, we report that ZAM is highly sequence specific in its
integration. By exhaustive analyses of ZAM insertions in a defined
strain of flies, we show that the target sequence chosen by nearly
every ZAM element is 5'-GCGCGCg-3' (lowercase "g"
indicates a <50% occurrence of that base).
In a previous work, we identified two independent ZAM insertions
flanked by the duplicated sequence CGCGC (17), suggesting that ZAM could require a specific target site for integration. Interestingly, this sequence is also recognized by two restriction endonucleases, HhaI and ThaI, which cleave the
sequences GCGC and CGCG, respectively. Thus, we postulated that if ZAM
copies present in a genome were indeed inserted into a CGCGC sequence, then HhaI and ThaI endonucleases must cleave
within the 5' and 3' end duplications of ZAM insertions.
Since the ZAM genome itself contains three HhaI restriction
sites (Fig. 1 [restriction map below the
autoradiograph]), cleavage of genomic DNA with HhaI should
reveal bands of 475 bp and 3.2 kb when probed by sequences
complementary to the ZAM LTR (Fig. 1A). Additionally, bands of 1.4 and
3.3 kb are expected after hybridization with the internal E6 probe
(Fig. 1B). This is indeed what is seen when DNA from strains bearing a
high copy number of ZAM elements, such as Charolles and
wIR6Rev, or strains bearing a low copy number of ZAM
elements, such as Canton S and wIR6, are subjected to
HhaI digestion (Fig. 1). Most of the restriction fragments
produced were of the expected sizes in all of the strains studied. By
using a Bio-Rad GS-525 phosphorimager, we estimated the copy numbers of
ZAM elements detected in this experiment. We found that the 3.2-kb or
475-bp bands obtained in wIR6Rev and Charolles contain 6 to
8 more copies of ZAM than any faint band visualized on the blot or a
band revealed by a probe of the white gene, taken as an
internal standard to estimate the intensity of a sequence that is
unique in the genome (data not shown). If a single copy of ZAM is
present in faint bands, then we estimate the total copy number to be 15 to 20, which is in good correlation with previous data estimated
through in situ and Southern blot analyses (5, 17).
Identical results were obtained when a similar experiment was performed
with the ThaI endonuclease (Fig. 2).

View larger version (41K):
[in this window]
[in a new window]
|
FIG. 1.
(Upper panel) ZAM target duplication sequences contain
an HhaI restriction site. Genomic DNAs of high- and
low-copy-number strains were treated with the HhaI
restriction endonuclease and hybridized to the LTR (A) or E6 (B) probe.
(Lower panel) ZAM HhaI restriction map. Probes used for
Southern analysis (LTR and E6 probes) and expected digested fragments
are depicted below the restriction map. The HhaI restriction
sites in shaded boxes are located in ZAM target duplication
sequences.
|
|

View larger version (42K):
[in this window]
[in a new window]
|
FIG. 2.
(Upper panel) ZAM target duplication sequences contain a
ThaI restriction site. Genomic DNAs of high- and
low-copy-number strains were treated with the ThaI
restriction endonuclease and hybridized to the LTR (A) or E6 (B) probe.
(Lower panel) ZAM ThaI restriction map. Probes used for
Southern analysis (LTR and E6 probes) and expected digested fragments
are depicted below the restriction map. The ThaI restriction
sites in shaded boxes are located in ZAM target duplication
sequences.
|
|
Several conclusions can be drawn from these experiments: (i) most ZAM
retroelements contain both an HhaI and a ThaI
restriction site at each end; (ii) sequence-specific integration is
found in all strains tested, independently of ZAM copy number; and
(iii) internal HhaI and ThaI restriction sites
are well conserved in all members of the ZAM family.
In order to confirm these data, we performed systematic sequencing of
each host-ZAM junction in the wIR6Rev genome. This strain
was chosen because its genome has been subjected to a recent and
massive invasion by ZAM elements (5). To collect as many
independent integration sites as possible, we used an inverse PCR
approach that allowed cloning of integrated copies of ZAM together with
their flanking cellular sequences. Total wIR6Rev genomic
DNA of one fly was extracted according to the protocol of Gloor and
Engels (8, 9). Inverse PCR experiments were performed on
ligated TaqI-cut DNA by using pairs of backward-oriented primers within the 3' LTR (ZAMLTR [8340 to 8361], 5' AAT TCT CCC AAG ACG ACC GTG 3'; ZAMLTRic [8302 to 8323],
5' ACG TCT ACA AGT GTT TGC TGC 3'). PCR amplification was as
follows: 2 min at 95°C for one cycle; 45 s at 94°C, 45 s
at 60°C, and 45 s at 72°C for 35 cycles; and 10 min at 72°C for one cycle. Fragments of variable sizes (200 to 500 bp) were amplified, cloned into the pGEMT vector (Promega), and sequenced on an
ABI 377 sequencer (Perkin-Elmer). Sequences of 60 clones were
determined, and 16 of these appear to be unique. This sample size of 16 clones fits well with the predicted ZAM copy number present in the
wIR6Rev genome and previously estimated through Southern
blots and in situ experiments (1, 3, 11). Sequences of
independent ZAM integration sites are shown in Table
1. Alignment allowed us to derive a
consensus sequence, 5'-CGCGCg-3', bordering ZAM insertions
(Table 2). As expected, this consensus
contained the two restriction sites recognized by the HhaI
and ThaI restriction endonucleases. It is noteworthy that
the strict consensus sequence CGCGC is represented in as many as 6 of
the 16 ZAM insertions sequenced. If the stringency is relaxed by
allowing a single mismatch to this consensus, then more than 93% (15 of 16) of the insertions conform (the last one displaying only one
additional mismatch). These small sequence variations in the targets of
ZAM insertions may explain the presence of the few additional bands
visualized on Southern blots (Fig. 1 and 2). However, some of these
extra bands may have also been generated by enzymatic restriction site polymorphisms within the sequences of numerous ZAM copies.
Polymorphisms were indeed detected when blots were probed with an
internal ZAM sequence (probe E6 [Fig. 1B and 2B]).
In order to determine the genomic regions where ZAM copies were
inserted, the 16 flanking sequences in Table 1 were compared with
sequences present in data banks. We found that seven of these sites
display 100% homology to cosmids sequenced in the course of the
D. melanogaster sequencing project, two identify different ZAM insertions within the retroelement R1Dm, and another corresponds to
the ZAM element integrated upstream of the transcriptional start site
of the white gene. Identification of the latter insertion was a good indirect control of our screen because it allowed us to
isolate an insertion previously identified in this strain by a
different method (17). Alignment of all of the ZAM empty
target sites confirmed that the sequence recognized upon ZAM insertion spans the 6-bp target site identified through the inverse PCR approach.
An additional G present in all of the empty sites and located at the 5'
end of the CGCGCg sequence was also detected (Table
3). These data suggest that the target
site recognized upon ZAM insertions is the consensus sequence
GCGCGCg, which is longer than the duplicated sequence
generated upon its insertion and previously reported for its insertion
at the white locus. It is interesting to note that this
consensus is palindromic, as are other consensus sequences described
for Ty1 (12) and Tn10 transposition
(11). In addition, identification of these genomic loci
confirmed previous results indicating that ZAM copies are dispersed on
chromosomal arms of the strains and are not clustered in
heterochromatic or telomeric regions of the genome (Table 3) (1,
5).
The capacity of the ZAM integration machinery to select specific
sequences for integration makes ZAM unique among the retroelements described so far. Indeed, available studies, including a systematic sequencing of retroviral integration targets, are often limited and do
not reveal consensus sequences with such a strict arrangement of bases
(3, 4, 12, 20). The specificity of the ZAM integration
machinery for the sequence 5'-GCGCGCg-3' is more reminiscent of the well-characterized property of the sequence-specific restriction endonucleases that trigger DNA cleavage in bacteria. Like ZAM, these
enzymes recognize and cleave DNA at specific sites and, under certain
conditions, may extend their cleavage to noncanonical ("star") sites.
The development of molecular and genetic approaches for understanding
ZAM integrase activity is a worthwhile goal. Analysis of integrase
activity will identify functional domains and their roles in sequence
recognition. Furthermore, it is essential to identify ZAM partners
involved in preintegration complexes as well as cellular factors
present on the host chromosome at the insertion sites. Indeed, the
alternating pyrimidine and purine residues in the GCGCGCg
target sequence of ZAM form a DNA sequence which is able to adopt
a particular conformation potentially facilitating recognition by
specific host factors. Cellular factors such as Ini1 and HMGI(Y) have
been shown to interact with HIV and Moloney murine leukemia virus
proteins and enhance integration (13, 18, 19). The genetic
potential offered by the Drosophila genome will provide an
invaluable tool for beginning to answer these questions. Because of the
high degree of similarity between ZAM integrase and integrases encoded
by retroviruses, such a development will be of general interest and
extend our understanding of retroviral insertion. It is tempting to
predict that it would then be possible to engineer retroelements and
drive their insertion into preexisting genome targets.
 |
ACKNOWLEDGMENTS |
We thank J. M. Heard, S. Ronsseray, and J. Miller for critical
reading of the manuscript and Valérie Calco for technical assistance.
This work was supported by INSERM grants and by a grant from
Ministère de l'Enseignement Supérieur et de la Recherche
(MESR). P.L. received a graduate grant from MESR and a grant from the Conseil Régional d'Auvergne (Bourse Régionale
d'Excellence 1998).
 |
FOOTNOTES |
*
Corresponding author. Mailing address: Unité
INSERM U384, 28 Place Henri Dunant, 63001 Clermont-Ferrand, France.
Phone: 33 4 73 60 80 24. Fax: 33 4 73 27 61 32. E-mail:
chantal.vaury{at}inserm.u-clermont1.fr.
 |
REFERENCES |
| 1.
|
Baldrich, E.,
P. Dimitri,
S. Desset,
P. Leblanc,
D. Codipietro, and C. Vaury.
1997.
Genomic distribution of the retrovirus-like element ZAM in Drosophila melanogaster.
Genetica
100:131-140[Medline].
|
| 2.
|
Bukrinsky, M. I.,
N. Sharova,
T. L. McDonald,
T. Pushkarskaya,
G. W. Tarpley, and M. Stevenson.
1993.
Association of integrase, matrix, and reverse transcriptase antigens of human immunodeficiency virus type 1 with viral nucleic acids following acute infection.
Proc. Natl. Acad. Sci. USA
90:6125-6129[Abstract/Free Full Text].
|
| 3.
|
Carteau, S.,
C. Hoffmann, and F. Bushman.
1998.
Chromosome structure and human immunodeficiency virus type 1 cDNA integration: centromeric alphoid repeats are a disfavored target.
J. Virol.
72:4005-4014[Abstract/Free Full Text].
|
| 4.
|
Dej, K. J.,
T. Gerasimova,
V. G. Corces, and J. D. Boeke.
1998.
A hotspot for the Drosophila gypsy retroelement in the ovo locus.
Nucleic Acids Res.
26:4019-4024[Abstract/Free Full Text].
|
| 5.
|
Desset, S.,
C. Conte,
P. Dimitri,
V. Calco,
B. Dastugue, and C. Vaury.
1999.
Mobilization of two retroelements, ZAM and Idefix, in a novel unstable line of Drosophila melanogaster.
Mol. Biol. Evol.
16:54-66[Abstract].
|
| 6.
|
Devine, S. E., and J. D. Boeke.
1996.
Regionally-specific, targeted integration of the yeast retrotransposon upstream of genes transcribed by RNA polymerase III.
Genes Dev.
10:620-633[Abstract/Free Full Text].
|
| 7.
|
Finnegan, D. J.
1994.
Retroviruses and transposons. Wandering retroviruses.
Curr. Biol.
4:641-643[Medline].
|
| 8.
|
Gloor, G. B.,
C. R. Preston,
D. M. Johnson-Schlitz,
N. A. Nassif,
R. W. Phillis,
W. K. Benz,
H. M. Robertson, and W. R. Engels.
1983.
Type I repressors of P element mobility.
Genetics
135:81-95[Abstract].
|
| 9.
|
Gloor, G. B., and W. R. Engels.
1991.
Single fly DNA preps for PCR.
Dros. Inf. Ser.
71:148-149.
|
| 10.
|
Goff, S. P.
1992.
Genetics of retroviral integration.
Annu. Rev. Genet.
26:527-544[Medline].
|
| 11.
|
Halling, S. M., and N. Kleckner.
1982.
A symmetrical six-base-pair target site sequence determines Tn10 insertion specificity.
Cell
28:155-163[Medline].
|
| 12.
|
Ji, H.,
D. P. Moore,
M. A. Blomberg,
L. T. Braiterman,
D. F. Voytas,
G. Natsoulis, and J. D. Boeke.
1993.
Hotspots for unselected Ty1 transposition events on yeast chromosome III are near tRNA genes and LTR sequences.
Cell
73:1007-1018[Medline].
|
| 13.
|
Kalpana, G. V.,
S. Marmon,
W. Wang,
G. R. Crabtree, and S. P. Goff.
1994.
Binding and stimulation of HIV-1 integrase by a human homolog of yeast transcription factor SNF5.
Science
266:2002-2006[Abstract/Free Full Text].
|
| 14.
|
Ke, N.,
P. A. Irwin, and D. F. Voytas.
1997.
The pheromone response pathway activates transcription of Ty5 retrotransposons located within silent chromatin of Saccharomyces cerevisiae.
EMBO J.
16:6272-6280[Medline].
|
| 15.
|
Kim, J. M.,
S. Vanguri,
J. D. Boeke,
A. Gabriel, and D. F. Voytas.
1998.
Transposable elements and genome organization: a comprehensive survey of retrotransposons revealed by the complete Saccharomyces cerevisiae genome sequence.
Genome Res.
8:464-478[Abstract/Free Full Text].
|
| 16.
|
Kirchner, J.,
C. M. Connolly, and S. B. Sandmeyer.
1995.
Requirement of RNA polymerase III transcription factors for in vitro position-specific integration of a retrovirus-like element.
Science
267:1488-1491[Abstract/Free Full Text].
|
| 17.
|
Leblanc, P.,
S. Desset,
B. Dastugue, and C. Vaury.
1997.
Invertebrate retroviruses: ZAM, a new candidate in D. melanogaster.
EMBO J.
16:7521-7531[Medline].
|
| 18.
|
Li, L.,
C. M. Farnet,
W. French Anderson, and F. D. Bushman.
1998.
Modulation of activity of Moloney murine leukemia virus preintegration complexes by host factors in vitro.
J. Virol.
72:2125-2131[Abstract/Free Full Text].
|
| 19.
|
Miller, M. D.,
C. M. Farnet, and F. D. Bushman.
1997.
Human immunodeficiency virus type 1 preintegration complexes: studies of organization and composition.
J. Virol.
71:5382-5390[Abstract].
|
| 20.
|
Shih, C.-C.,
J. P. Stoye, and J. M. Coffin.
1988.
Highly preferred targets for retrovirus integration.
Cell
53:531-537[Medline].
|
Journal of Virology, August 1999, p. 7061-7064, Vol. 73, No. 8
0022-538X/99/$04.00+0
Copyright © 1999, American Society for Microbiology. All rights reserved.
This article has been cited by other articles:
-
Labrador, M., Sha, K., Li, A., Corces, V. G.
(2008). Insulator and Ovo Proteins Determine the Frequency and Specificity of Insertion of the gypsy Retrotransposon in Drosophila melanogaster. Genetics
180: 1367-1378
[Abstract]
[Full Text]
-
Conte, C., Dastugue, B., Vaury, C.
(2002). Coupling of Enhancer and Insulator Properties Identified in Two Retrotransposons Modulates Their Mutagenic Impact on Nearby Genes. Mol. Cell. Biol.
22: 1767-1777
[Abstract]
[Full Text]