J Virol, July 1998, p. 5502-5509, Vol. 72, No. 7
0022-538X/98/$04.00+0
Copyright © 1998, American Society for Microbiology. All rights reserved.
Department of Genetics1 and Howard Hughes Medical Institute,2 Duke University Medical Center, Durham, North Carolina 27710
Received 20 January 1998/Accepted 26 March 1998
| |
ABSTRACT |
|---|
|
|
|---|
Although DNA binding sites specific for the Bel-1 and Tas transcriptional activators, encoded, respectively, by the human and simian foamy viruses, have been mutationally defined, they show little evident sequence identity. As a result, the sequence determinants for DNA binding by both Bel-1 and Tas have remained unclear. Here, we report the use of a novel in vivo randomization and selection strategy to identify a Tas DNA binding site consensus. This approach takes advantage of the fact that Tas can effectively activate gene expression in yeast cells via a Tas DNA binding site derived from the simian foamy virus type 1 (SFV-1) internal promoter. The defined Tas DNA binding site consensus extends over approximately 25 bp and contains a critical core sequence of ~5 bp. Positions adjacent to this core sequence, while clearly also subject to selection, show a significantly higher level of sequence variation. Surprisingly, the wild-type SFV-1 internal promoter Tas DNA binding site fails to conform to the consensus at several positions. Further analysis demonstrated that the consensus sequence bound Tas more effectively than did the wild-type sequence in vitro and could mediate an enhanced Tas response in vivo when substituted into the SFV-1 internal promoter context. These findings explain the limited sequence identity observed for mutationally defined Tas or Bel-1 response elements and should facilitate the identification of Tas DNA target sites located elsewhere in the SFV-1 genome.
| |
INTRODUCTION |
|---|
|
|
|---|
Foamy viruses, including human foamy virus (HFV) and the simian foamy viruses (SFVs), are unusual among retroviruses in that they contain two distinct promoter elements (25). As in all retroviruses, a promoter element located in the viral long terminal repeat (LTR) directs the synthesis of a genome-length transcript that gives rise to the viral structural proteins Gag, Pol, and Env. The second foamy virus promoter, termed the internal promoter (IP), is located at the 3' end of the envelope gene (4, 16, 19). The IP is primarily responsible for the expression of two open reading frames located between env and the 3' LTR, one of which encodes a transcriptional transactivator termed Bel-1 in HFV and Tas in the SFVs (4, 15). Importantly, both of these promoter elements are strongly activated upon expression of the cognate Bel-1 or Tas regulatory protein (4, 14, 15, 21, 26, 28). Mutational analysis has demonstrated that IP function is critical for foamy virus replication in culture, most probably because loss of IP function results in a marked drop in the expression of Bel-1/Tas, which is known to be essential for viral replication (17, 18).
Analysis of the mechanism of action of Bel-1 in HFV and of Tas in SFV-1 has demonstrated that these virally encoded transcription factors are DNA binding proteins, a property which distinguishes Bel-1/Tas from the functionally equivalent human immunodeficiency virus type 1 Tat and human T-cell leukemia virus type 1 Tax proteins (12, 30). Binding sites for Bel-1 have been defined in both the IP and LTR promoter of HFV (12, 13), while binding sites for Tas have been identified in the SFV-1 IP and in the gag gene (3, 30), although the role and importance of this latter Tas-dependent enhancer element remains unclear. For HFV, it has been demonstrated that the Bel-1 DNA binding site located in the IP displays a far higher affinity for Bel-1 than the major Bel-1 DNA binding site located in the LTR promoter (13). It has been hypothesized that this difference in affinity may, at least in part, explain the observation that the HFV IP is activated significantly earlier than the LTR promoter during the foamy virus life cycle (15, 19). This difference in affinity, if functionally important, could also explain the limited homology noted between the HFV LTR and IP Bel-1 binding sites (13). Similarly, the SFV-1 IP and Gag gene Tas binding sites also display only limited homology to each other and to Tas-responsive DNA sequences present in the SFV-1 LTR (3, 20, 30). These findings, combined with the observation that Bel-1 and Tas are specific only for target sequences present in their cognate viral genome (4, 13), have meant that no consensus DNA binding site for either Bel-1 or Tas has been proposed, even though minimal DNA binding sites for both proteins have been defined (3, 12, 13, 30).
In this study, we have used a novel in vivo randomization and selection procedure to define a consensus DNA binding site for the SFV-1 Tas protein. We demonstrate that this consensus DNA sequence binds Tas with a higher affinity than does the wild-type IP Tas binding site both in vivo and in vitro. In addition, we have constructed a highly variant Tas DNA binding site that, while identical to the minimal IP Tas binding site at only 8 of 25 positions, can nevertheless respond effectively to Tas in vivo. The flexibility of the DNA binding specificity of Tas appears to explain the marked variability in the sequence of Tas responsive sequences previously identified in the SFV-1 genome.
| |
MATERIALS AND METHODS |
|---|
|
|
|---|
Construction of molecular clones.
The yeast reporter
plasmids pLGSD5 and pJLB have been described previously (8,
10). Both contain the lacZ indicator gene under the
control of a basal yeast cyc promoter. However, pJLB differs
from pLGSD5 in that the former lacks the yeast cyc promoter regulatory element UASG, which contains negative regulatory
elements (8). As a result, the pJLB vector displays a higher
basal activity but also a significantly stronger response to weak
transactivators. A pJLB-based yeast indicator plasmid containing the
minimal Tas DNA binding site (
69 to
45) from the SFV-1 IP has been
described previously (13). This wild-type Tas binding
sequence, the minimal Bel-1 binding site (
163 to
139) defined in
the HFV IP, and various mutant Tas DNA binding sites (UP, VA, HS1, and
HS2 [see Fig. 3]), were synthesized as oligonucleotides with flanking
XhoI sites, annealed to make them double stranded, and then
cloned into the XhoI site 5' proximal to the cyc
promoter present in the indicator plasmids pLGSD5 and pJLB. In
addition, wild-type and defective (knockout [KO]) forms of a larger
SFV-1 IP sequence, extending from
150 to
29 relative to the IP
transcription start site, were PCR amplified with primers that
introduced flanking XhoI restriction sites, using the
corresponding mammalian expression constructs (described below) as
templates, and cloned into the XhoI site of pJLB.
Orientation of inserts was screened by PCR and confirmed by DNA
sequencing.
357 to +95) introduced between the Asp718 and
BamHI sites of plasmid pCAT-22A2s (27), has been
described previously (4). Shorter fragments of the SFV-1 IP,
extending from
159 to +13 or from
86 to +13, were PCR amplified and
cloned into the same sites in pCAT-22A2s. A variant Tas target sequence
predicted by the consensus Tas binding site (VA [see Fig. 3]), a
putative up-mutant of the Tas DNA binding sequence (UP), and a
defective Tas DNA binding sequence (KO) were each introduced into the
SFV-1 IP in place of the wild-type Tas binding site by combinatorial
PCR, and the resultant chimeric IP fragments were cloned into the
Asp718-BamHI digested pCAT-22A2s plasmid. The
mammalian expression plasmids pcTas and pBC12/CMV have been described
previously (5, 13).
In vivo randomization selection of Tas response sequences. The randomized Tas binding site libraries R1 to R5, as well as R3A and R3B, were generated by a single-step PCR procedure. The 5' primers used were as follows: 5'-GACCGCTCGAG TT GCA ATCAC TGGAA ATAGA AGTTA CAGATCCGCCAGGCGTGT-3' R1 R2 R3 R4 R5
The primer bears an XhoI site near the 5' end, followed by the minimal Tas binding sequence (
69 to
45)
derived from the SFV-1 IP, and finally a short sequence homologous to
pLGSD5 vector sequences located immediately 3' to the unique
XhoI site located adjacent to the cyc promoter.
The 25-bp Tas binding sequence was divided into five regions (R1 to R5,
as shown in the underlined sequence of the primer) and each region was
replaced with five random nucleotides in each respective primer (e.g.,
NNNNN in place of TTGCA in the R1 primer, where N refers to an equal
mixture of all four bases). The R3 region was further subdivided into two subregions, R3A and R3B, with randomization of either the two 5'
nucleotides (TG) or the three 3' nucleotides (GAA). In each case, the
3' primer used was 5'-TATGCTACAAAGGACCTAATG-3', corresponding to a coding region in the lacZ gene in
pLGSD5. PCRs were carried out with the parental pLGSD5 plasmid as the
template and resulted in products with a XhoI site at the 5'
end of the randomized Tas binding sequence and a unique SphI
site in the amplified pLGSD5 vector sequence. The PCR products were
then digested with XhoI and SphI and cloned
between these two sites in the indicator plasmid pLGSD5. The resultant
plasmids contained randomized Tas binding sequences inserted 3' to the
XhoI site present in the cyc promoter 5'-flanking
region of the pLGSD5 indicator plasmid.
The ligated randomized libraries were phenol-chloroform extracted,
ethanol precipitated, and then electroporated into Escherichia coli DH5
. After selection for ampicillin-resistant
transformants, each library was found to consist of more than 4,000 independent clones. DNA derived from each pooled library was then
introduced into Saccharomyces cerevisiae PSY 316 (11) along with the Tas expression plasmid pYCplacIII-Tas
(9, 13). The transformed yeast cells were plated on uracil-
and leucine-deficient (Ura
Leu
) plate
covered with a Hybond-N nylon membrane (Amersham). After 3 days of
growth, the nylon membrane with yeast colonies was lifted from the
plate, frozen at
140°C for 10 min, and then thawed at room
temperature. An in situ
-galactosidase (
-gal) assay was carried
out by placing the nylon membrane on filter papers soaked in 0.5× Z
buffer (1) with 0.3 mg of
chlorophenolred-
-D-galactopyranoside (CPRG; Boehringer
Mannheim) per ml and 0.1% (vol/vol) 2-mercaptoethanol. After 1 h
of incubation at room temperature, colonies that turned dark red
(positives) or those that remained white (negatives) were picked and
recovered on Leu
Ura
plates, and the yeast
indicator plasmids harboring candidate Tas target sequences were
rescued after overnight culture. A second yeast transformation and
-gal assay were then performed with the selected indicator plasmids
together with either pYCplacIII-Tas or the parental plasmid pYCplacIII
(9) to quantify the level of transactivation by Tas and to
identify any false (i.e., Tas-independent) positive clones. The Tas
response sequences in the true-positive or true-negative clones were
then obtained by ABI automatic cycle sequencing (PE Applied
Biosystems).
Yeast transformation and analysis.
The lacZ
indicator plasmids containing various mutant Tas DNA target sequences
and the single-copy, pYCplacIII-based Bel-1 or Tas yeast expression
plasmid (9, 13) were cotransformed into the yeast strain
PSY316. After 3 days of growth selection on Ura
Leu
plates, overnight cultures were prepared and cell
extracts were assayed for
-gal activity as described previously
(1).
Mammalian cell culture and transfection. COS cell cultures (35-mm plates) were transfected by the DEAE-dextran procedure (6) with 1 µg of a CAT-based indicator construct containing wild-type or mutant forms of the SFV-1 IP, and up to 200 ng of the pcTas expression plasmid. To maintain a level of 1.2 µg of DNA per transfection in all experiments, transfection cocktails were also supplemented with the parental, negative control plasmid pBC12/CMV where necessary. Induced CAT activities were assayed at ~50 h posttransfection as previously described (22).
Gel retardation analysis.
The fusion protein glutathione
S-transferase (GST)-Tas was expressed in the
protease-deficient E. coli XA90 and purified as previously
described (13, 30). The SFV-1 IP DNA sequences used for gel
retardation and competition analysis were generated by PCR and extended
from
159 to +13 in the SFV-1 IP. The wild-type DNA probe was labeled
with [
-32P]ATP with T4 polynucleotide kinase, and the
total isotope incorporation was determined by scintillation counting
after column purification. The binding reaction was carried out with
~104 cpm (~0.2 ng) of the probe and 50 ng of GST-Tas
fusion protein in 40 µl of binding buffer (10 mM Tris-Cl [pH 7.5],
65 mM KCl, 2.5 mM MgCl2 1 mM dithiothreitol, 5% glycerol)
containing 200 ng of poly(dI-dC) and 40 µg of bovine serum albumin.
Binding was allowed to proceed for 30 min at 4°C, and the reaction
products were resolved on a 5% native polyacrylamide gel and
visualized by autoradiography. Competitor DNA fragments were PCR
amplified from the respective cat-based mammalian indicator
plasmids described above. A 154-bp DNA fragment containing the
Mason-Pfizer monkey virus constitutive transport element (7)
was amplified by PCR and served as a nonspecific competitor DNA. For
competition experiments, competitor DNAs were added at an 8- or 80-fold
molar excess over the labeled probe fragment and were incubated with
GST-Tas for 10 min before addition of the probe. The results of
competition experiments were quantitated with a PhosphorImager and
Image QuaNT software (Molecular Dynamics).
| |
RESULTS |
|---|
|
|
|---|
Previously, Zou and Luciw (30) defined a 25-bp Tas DNA
binding site in the SFV-1 IP by using DNase I protection analysis and
also showed by gel shift analysis that this DNA sequence was sufficient
for specific Tas binding in vitro. Subsequently, we demonstrated that a
single copy of this 25-bp sequence, which extends from
69 to
45 in
the SFV-1 IP relative to the transcription start site, is sufficient to
render a yeast cyc promoter element highly responsive to Tas
when introduced 5' to this minimal promoter and analyzed in yeast cells
(13).
To define the consensus sequence for Tas DNA binding in yeast cells in
vivo, we subdivided the minimal 25-bp SFV-1 IP Tas DNA binding site
into five regions of 5 bp each, termed R1 through R5, as shown in Fig.
1. Each 5-bp segment was then
individually randomized in the context of the lacZ-based
indicator construct pLGSD5 to give five libraries, each containing
1,024 different candidate Tas target sequences (see Materials and
Methods for details). Each library was cloned into E. coli
and analyzed for integrity before being introduced into the yeast
strain PSY316 together with a yeast Tas expression plasmid. After 3 days of selection, yeast colonies containing DNA sequences that
retained Tas responsiveness were identified by staining with the
-gal substrate CPRG. Approximately 4,000 to 5,000 colonies were
screened per library. Individual responsive colonies were then picked, and the responsive plasmids were recovered. The Tas responsiveness of
each indicator plasmid was validated by retransformation into yeast in
the presence and absence of a Tas expression plasmid, and the precise
level of
-gal activity induced by Tas, for each responsive indicator
plasmid, was quantified as previously described. As shown in Fig. 1, a
global analysis of the five libraries obtained during this research
effort demonstrated that the central R3 library contained very few
strongly positive clones (~0.5%). A higher but still modest
percentage of positive colonies was obtained with the closely flanking
libraries R2 and R4 (~5 and ~2%, respectively). In contrast, the
R1 and R5 libraries, representing the 5' and 3' extremes of the SFV-1
IP Tas binding site, displayed high levels of positivity, with ~70
and ~20%, respectively, showing readily detectable
-gal activity.
|
The small number of strongly positive clones observed with the R3 library, all of which displayed the wild-type sequence, prompted us to generate two additional libraries, termed R3A and R3B, in which only the 5' 2 bp or the 3' 3 bp of the R3 sequence were randomized. This allowed us to identify additional, non-wild-type sequences covering this region that retained at least partial Tas responsiveness. Such clones were difficult to identify in the original R3 library since this particular library generated a significant background of transcriptionally active but Tas-nonresponsive plasmids (data not shown).
Consensus Tas DNA binding sequence. An analysis of the Tas-responsive sequences obtained in the above screen is shown in Fig. 1, and the data are summarized in Fig. 2, which also presents the derived consensus Tas binding sequence. Based on these data, it is apparent that the core 5'-TGGAA-3' sequence covered by the R3 library is critical for Tas binding. Three of these positions were found to be invariant in this analysis, while variation at the other two positions was infrequent and, when observed, resulted in a marked drop in Tas responsiveness.
|
Biological activity of Tas binding site variants in yeast. The data presented in Fig. 1 and 2 suggest that it should be possible to derive highly variant forms of the Tas IP DNA binding site that retain good, or even enhanced, Tas binding ability. To test whether this was indeed the case, we designed two 25-bp oligonucleotides termed UP and VA (for variant). The UP sequence differs at seven positions from the wild-type SFV-1 IP Tas binding site (Fig. 3A) and contains the optimal nucleotide sequence for Tas binding at every position, as predicted from the data shown in Fig. 1. In contrast, the 25-bp VA sequence was designed to be maximally different from the wild-type SFV-1 IP sequence yet to still largely conform to the Tas DNA binding site consensus given in Fig. 2. It was therefore anticipated that the VA sequence, which differs from the wild-type sequence at 17 of 25 positions (Fig. 3A), would retain detectable but not maximal Tas binding ability.
|
|
150 to
29 relative to the IP transcription
start site, and further suggested that this sequence contained only a
single Tas DNA binding site. To confirm that this was indeed the case,
we compared the biological activity in yeast cells of the extended
150 to
29 IP sequence with the minimal
69 to
45 IP Tas binding
sequence after insertion into the pJLB yeast indicator construct (Table
1). An additional construct, containing the KO mutation predicted to
block Tas binding (Fig. 3C) in the
150 to
29 sequence context, was
also tested. If the
150 to
29 IP sequence contains a second Tas DNA
binding site, we would expect the
150 to
29 sequence to retain
significant Tas responsiveness after introduction of the KO mutation
and would also expect it to give a stronger response to Tas than the
minimal
69 to
45 sequence, given the known synergistic activity of
Tas binding sites in yeast cells (13). In fact, the extended
150 to
29 indicator construct was not more responsive to Tas than the minimal
69 to
45 construct, and this activity was entirely abrogated by the KO mutation. These data therefore serve to confirm the
proposal, derived from an earlier in vitro analysis (30), that the SFV-1 IP contains only a single Tas binding site.
Biological activity of Tas binding site variants in mammalian
cells.
All the in vivo data on Tas DNA binding presented thus far
were obtained with yeast cells, and we therefore wished to confirm these findings in the more physiologically relevant context of cultured
simian cells. Previous mutational analysis of the SFV-1 IP (4,
30) has suggested that this promoter contains only three sequence
elements required for maximal Tas response (Fig. 4A). These are a TATA box, seen in the
majority of RNA polymerase II-specific promoters; the Tas binding site,
located between residues
45 and
69 relative to the transcription
start site; and a third DNA sequence, termed the distal element,
located between nucleotides
108 and
140 (30). While the
distal element remains only poorly characterized, it does not bind Tas
(30) (Table 1) and is therefore thought to serve as a
cellular factor binding site. Surprisingly, previous work (4,
30) has demonstrated that mutation of either the distal element
or the Tas binding site sequence alone results in only a two- to
fourfold drop in the Tas responsiveness of the IP; however, mutation of
both sites results in the loss of Tas response (30). The
observation that the single DNA target site for Tas present in the
SFV-1 IP can be mutated with only a partial loss of Tas responsiveness
is surprising and prompted us to reexamine the biological activity of
the SFV-1 IP in mammalian cells and, in particular, to test the ability
of wild-type and mutant SFV-1 IP derivatives to respond to a range of
different Tas expression levels.
|
357 to +95 promoter sequence
previously reported by Campbell et al. (4) to contain all
the sequences required for full Tas responsiveness (Fig. 4A). In
addition, we constructed two IP deletion mutants. The first of these
retains sequences extending from
159 to +13 in the SFV-1 IP and
should represent a minimal, fully Tas-responsive IP. A second
construct, extending from
86 to +13, has lost the distal-element
sequence yet retains the complete Tas binding site (Fig. 4A). This
promoter is predicted to retain partial Tas responsiveness.
As shown in Fig. 4B, both the
357 to +95 and the
159 to +13
construct were highly Tas responsive when transfected into the simian
COS cell line. Importantly, both promoters displayed an equivalent
level of activity over the entire range of Tas expression tested. In
contrast, the
86 to +13 IP sequence displayed a two- to
threefold-lower level of Tas response than these more complete SFV-1 IP
constructs over the entire range of Tas levels tested in this
experiment. These data therefore confirm the earlier work of Luciw and
coworkers (4, 30), obtained with only saturating levels of
Tas expression, showing that the
159 to +13 IP sequence contains all
the elements required for full Tas responsiveness and that a sequence
located between
159 and
86 in the SFV-1 IP modestly enhances the
Tas response of the IP.
Based on the data presented in Fig. 4, we next constructed derivatives
of the
159 to +13 and
86 to +13 forms of the SFV-1 IP containing
the UP, VA, and KO mutations of the TBS described in Fig. 3. These
mutant SFV-1 IP promoters were then tested for their ability to respond
to Tas in transfected COS cells after transfection of 200 ng of the
pcTas expression plasmid, which is predicted to give a maximal Tas
response (Fig. 4B), or 8 ng of the pcTas plasmid, which is predicted to
give a half-maximal response that is therefore in the linear range of
this assay (Fig. 4B).
As shown in Fig. 5, introduction of these
mutations into the complete
159 to +13 SFV-1 IP sequence gave rise to
only modest phenotypic changes. In particular, the KO mutation that
completely abrogates Tas binding in yeast cells (Table 1) reduced Tas
responsiveness by only two- to threefold, as indeed predicted by the
earlier data of Zou and Luciw (30). Introduction of the UP
mutation, which is predicted to enhance Tas binding (Table 1), resulted in a modest, ~twofold increase in Tas responsiveness that was significant only at low levels of Tas expression (Fig. 5). Finally, the
VA sequence, which differs at 17 of 25 positions from the wild-type Tas
binding site, permitted the same level of activation by Tas as the
wild-type IP sequence when introduced into the
159 to +13 IP sequence
context and tested in mammalian cells.
|
159 to +13 IP sequence context and also by
the work of others (30), the distal-element sequence appears
to be able to mediate activation by Tas even in the absence of a
functional Tas binding site. However, in the context of the
86 to +13
IP sequence lacking the distal element (Fig. 4A), the Tas response
should be entirely dependent on the integrity of the Tas binding site.
As shown in Fig. 5, this is indeed the case. Thus, the KO mutation that
had only a modest (ca. two- to threefold) effect in the
159 to +13
sequence context dramatically inhibited the Tas response in the
86 to
+13 IP sequence context. Further, the UP mutation of the Tas binding
site had a clear positive phenotype in the
86 to +13 sequence
context, resulting in a level of Tas response that was equivalent to
that seen with the wild-type form of the extended
159 to +13 promoter
element. Finally, the VA mutant again showed a level of Tas response in
the
86 to +13 sequence context that was equivalent to that seen with
the wild-type
86 to +13 sequence. Overall, these data obtained with
mammalian cells therefore demonstrate that the DNA sequence
requirements for Tas binding defined above in the yeast cell context
are also valid in mammalian cells.
Tas DNA binding specificity in vitro. As a final confirmation of the validity of the Tas binding consensus given in Fig. 2, we examined the ability of the UP, VA, and KO mutants of this SFV-1 sequence to compete with the wild-type SFV-1 IP for binding to recombinant Tas protein by using a competitive electrophoretic mobility shift assay. As shown in Fig. 6, addition of recombinant GST-Tas to a labeled SFV-1 IP DNA probe resulted in the detection of two specific DNA-protein complexes labeled C1 and C2. The origin of the C2 complex is unclear, but this could reflect dimerization of either the Tas protein or GST, which is known to form dimers. Formation of both these complexes is effectively competed by an excess of the unlabeled wild-type SFV-1 IP DNA sequence (lanes 3 and 4) but not by an equivalent level of a nonspecific competitor DNA (lanes 11 and 12). Importantly, the addition of an SFV-1 IP-derived DNA sequence bearing the UP mutation resulted in a significantly enhanced level of competition (lanes 5 and 6), demonstrating that the UP mutation indeed enhances Tas binding in vitro. In contrast, an SFV-1 IP DNA competitor bearing the KO mutation did not compete for binding any more effectively than the nonspecific DNA competitor did (compare lanes 9 and 10 with lanes 11 and 12). Finally, a sequence containing the VA mutation of the SFV-1 IP also competed for Tas DNA binding in vitro, although perhaps slightly less efficiently than the wild-type IP sequence did (lanes 7 and 8). This result appears to mirror the in vivo biological activity of the VA sequence observed in yeast cells (Table 1), which was also slightly lower than that seen with the wild-type IP sequence. In conclusion, these in vitro Tas binding data are therefore in agreement with the in vivo data derived from the yeast (Table 1) and mammalian (Fig. 5) systems and serve to further validate the Tas DNA binding consensus given in Fig. 2.
|
| |
DISCUSSION |
|---|
|
|
|---|
The replication of primate foamy viruses is dependent on the biological activity of not only the viral LTR promoter but also a second promoter, the IP, located toward the 3' end of the env gene (4, 16, 19). Both promoter elements are strongly activated by the virally encoded Bel-1/Tas transcriptional transactivator (4, 14, 15, 21, 26, 28), and it is therefore the interplay of Bel-1/Tas with these two promoter elements that largely controls the level of foamy virus gene expression.
Analysis of the biological activity of HFV Bel-1 and of SFV-1 Tas has
demonstrated that these viral transcription factors are DNA binding
proteins and has identified minimal,
25-bp binding sites for Bel-1 in
the HFV LTR and IP and for Tas in the SFV-1 IP and gag gene
(3, 12, 13, 30). However, comparison of these sequences with
each other and with those of other phenotypically defined Bel-1 or Tas
response elements has failed to identify a clear DNA binding consensus.
The lack of such a consensus reflects two findings. First, Bel-1 and
Tas interact only with sites present in their cognate virus (4,
13), thus making any comparison of the DNA binding sites of these
two related proteins uninformative at present. Second, it appears that
the foamy virus IP has evolved to contain a significantly
higher-affinity target sequence for Bel-1/Tas than has the LTR promoter
(13), thus suggesting that Bel-1/Tas binding sites in the
LTR are likely to diverge significantly from the optimal Bel-1/Tas DNA
binding consensus.
In this paper, we report the use of a novel randomization and selection strategy with yeast cells to define the DNA binding consensus for SFV-1 Tas. Yeast cells were chosen because these lower eukaryotes were previously shown to be permissive for transcriptional activation by both Bel-1 and Tas via their respective, mutationally defined DNA binding sites (1, 12, 13). We chose the SFV-1 IP Tas DNA binding site for this analysis because Tas gives a significantly higher level of transcriptional activation than Bel-1 in yeast cells (13), thus facilitating the identification of positive clones, and because the IP, as noted above, contains a particularly high-affinity DNA binding site for Tas (13). An alternative strategy, used for several DNA binding proteins (2, 23, 24, 29), would have been to identify a consensus Tas DNA binding site in vitro by using several cycles of PCR amplification and selection for Tas binding. An advantage of this latter, in vitro technique is that it would have permitted the simultaneous randomization of essentially the entire Tas binding site. In contrast, the in vivo randomization/selection strategy described here is not able to analyze more than five or six nucleotide positions simultaneously because (i) more extensive randomization would require screening an unwieldy number of yeast transformants and (ii) we have found that the percentage of false (i.e., Tas-independent)-positive clones, which largely represent yeast transcription factor binding sites, increases when more nucleotides are randomized. The fact that in any given random library, much of the DNA binding site must therefore remain fixed could affect the range of positive sequences obtained by this approach. However, this concern is balanced by the anticipation that this in vivo approach may be more likely to identify physiologically relevant DNA-protein interactions.
A series of Tas-responsive and nonresponsive sequences identified using by in vivo randomization and selection approach are listed in Fig. 1 and summarized in Fig. 2, which also gives the derived consensus sequence for Tas DNA binding. Based on these data, we suggest that Tas DNA binding sites consist of a core recognition sequence that approximately coincides with the R3 sequence 5'-TGGAA-3'. Flanking this, on either side, are sequences that are defined largely by the R2 and R4 libraries that, while critical for Tas binding, are nevertheless more tolerant of sequence variation. Finally, while the R5 and, particularly, R1 regions contribute relatively little critical sequence information to the Tas DNA binding site, DNA sequences present in these regions can significantly modulate the affinity of Tas for its target site (Fig. 1). Overall, these data indicate that the consensus DNA target site for Tas is somewhat unusual in terms of both its large size and the relative plasticity of this sequence outside of the core R3 region. While it is interesting to speculate that the large size of this target DNA sequence could reflect binding by a Tas dimer or multimer, we note that the Tas binding consensus (Fig. 2) does not give any evidence of being a direct or inverted repeat sequence. Perhaps the most surprising aspect of the Tas DNA binding consensus given in Fig. 2 is the implication that the wild-type SFV-1 IP Tas DNA binding site is suboptimal at several locations, suggesting that this site has not been selected for maximal Tas binding efficiency. Indeed, several of the recovered mutant sequences shown in Fig. 1 proved to be more effective target sites for Tas when tested in yeast cells.
To validate the relevance of the consensus Tas DNA binding sequence given in Fig. 2, we used this consensus to design a sequence, termed UP, that should be fully optimal for Tas binding and a second sequence, termed VA, that should be able to bind Tas effectively despite differing at 17 of 25 positions from the wild-type SFV-1 IP sequence (Fig. 3A). Analysis of the UP and VA sequences in yeast cells (Table 1) and in vitro (Fig. 6) showed that the UP sequence indeed bound Tas more effectively than did the wild-type IP sequence while the VA sequence also bound Tas but somewhat less well than did the wild type. In contrast, an SFV-1 IP sequence mutated in the Tas binding-site core, termed KO, failed to detectably bind Tas in either assay system.
Demonstration of the relevance of the Tas DNA binding consensus in simian cells was complicated by the fact that the SFV-1 IP contains two partially redundant Tas response elements (Fig. 4A). While the first of these coincides with the Tas DNA binding site, the second sequence, referred to as the distal element, fails to bind Tas both in vitro (30) and in vivo (Table 1). This finding, combined with the fact that the distal-element sequence can mediate Tas function in mammalian cells (30) (Fig. 4B) but not yeast cells (Table 1), strongly suggests that the distal element represents a DNA binding site for a cellular cofactor(s). Despite this complication, it nevertheless proved possible to also validate the Tas binding consensus in mammalian cells. In particular, in the context of a minimal SFV-1 IP promoter lacking the distal-element sequence, the KO mutation was found to block activation by Tas in mammalian cells whereas the UP mutation was found to increase the Tas response by two- to threefold. The maximally variant VA mutation was also observed to retain Tas responsiveness (Fig. 5). Together, these data demonstrate that Tas DNA binding in mammalian cells (Fig. 5), as in yeast cells (Table 1) and in vitro (Fig. 6), is effectively predicted by the Tas DNA binding consensus given in Fig. 2.
The data presented in Fig. 5 may also shed some light on the role of the putative cellular distal-element DNA binding factor. The distal-element sequence is not a Tas binding site (30) (Table 1) and has no phenotype in the absence of Tas (Fig. 5), but it can mediate significant activation of the SFV-1 IP by Tas in the absence of an intact Tas DNA binding site (30) (Fig. 4B). This finding suggests that the cellular distal-element binding protein acts by facilitating the recruitment of Tas to the SFV-1 IP. Consistent with this hypothesis, while deletion of the distal element significantly reduces the activation of the SFV-1 IP by Tas, this reduction can be overcome by substitution of the high-affinity UP Tas DNA binding site in place of the wild- type sequence (Fig. 5). In contrast, the UP mutant has only a minimal phenotype in mammalian cells in the context of an SFV-1 IP containing an intact distal element (Fig. 5), suggesting that Tas DNA binding is already highly efficient in this environment. The hypothesis that a cellular cofactor specific for the distal-element sequence significantly enhances Tas binding to the SFV-1 IP in mammalian cells thus may explain why the IP Tas DNA binding site is not more closely homologous to the optimal Tas DNA binding site (Fig. 2). However, it is also possible that the sequence of the IP Tas DNA binding site is simply constrained by the underlying envelope coding sequence.
A final point is that the proposed Tas binding site consensus should
permit the identification of Tas binding sites located elsewhere in the
SFV-1 genome. Indeed, a comparison of the SFV-1 gag gene Tas
DNA binding site (3) with the consensus sequence shows
extensive sequence homology (Fig. 7),
particularly around the core region. An analysis of regions within the
SFV-1 LTR that have previously been implicated in mediating Tas
transactivation of the SFV-1 LTR promoter (20) identified
several regions with homology to the consensus, including particularly
between
1052 and
1028, between
375 and
351, and (in the
antisense orientation) between
956 and
980 and between
163 to
187. The question whether these sequences are indeed important for
Tas-mediated activation of the SFV-1 LTR promoter can now be
experimentally addressed.
|
| |
ACKNOWLEDGMENTS |
|---|
We thank Paul Luciw for several SFV-1 clones used in this work and Hal Bogerd and Wade Blair for helpful discussions.
This research was funded by the Howard Hughes Medical Institute.
| |
FOOTNOTES |
|---|
* Corresponding author. Mailing address: Howard Hughes Medical Institute, Duke University Medical Center, Box 3025, Durham, NC 27710. Phone: (919) 684-3369. Fax: (919) 681-8979. E-mail: Culle002{at}mc.duke.edu.
| |
REFERENCES |
|---|
|
|
|---|
| 1. |
Blair, W. S.,
H. Bogerd, and B. R. Cullen.
1994.
Genetic analysis indicates that the human foamy virus Bel-1 protein contains a transcription activation domain of the acidic class.
J. Virol.
68:3803-3808 |
| 2. |
Blackwell, T. K., and H. Weintraub.
1990.
Differences and similarities in DNA-binding preferences of MyoD and E2A protein complexes revealed by binding site selection.
Science
250:1104-1110 |
| 3. |
Campbell, M.,
C. Eng, and P. A. Luciw.
1996.
The simian foamy virus type 1 transcriptional transactivator (Tas) binds and activates an enhancer element in the gag gene.
J. Virol.
70:6847-6855 |
| 4. |
Campbell, M.,
L. Renshaw-Gegg,
R. Renne, and P. A. Luciw.
1994.
Characterization of the internal promoter of simian foamy viruses.
J. Virol.
68:4811-4820 |
| 5. | Cullen, B. R. 1986. trans-activation of human immunodeficiency virus occurs via a bimodal mechanism. Cell 46:973-982[Medline]. |
| 6. | Cullen, B. R. 1987. Use of eukaryotic expression technology in the functional analysis of cloned genes. Methods Enzymol. 152:684-704[Medline]. |
| 7. | Ernst, R. K., M. Bray, D. Rekosh, and M. Hammarskjöld. 1997. A structural retroviral RNA element that mediates nucleocytoplasmic export of intron-containing RNA. Mol. Cell. Biol. 17:135-144[Abstract]. |
| 8. |
Finley, R. L., Jr.,
S. Chen,
J. Ma,
P. Byrne, and R. W. West, Jr.
1990.
Opposing regulatory functions of positive and negative elements in UASG control transcription of the yeast GAL genes.
Mol. Cell. Biol.
10:5663-5670 |
| 9. | Gietz, R. D., and A. Sugino. 1988. New yeast-Escherichia coli shuttle vectors constructed with in vitro mutagenized yeast genes lacking six-base pair restriction sites. Gene 74:527-534[Medline]. |
| 10. |
Guarente, L.,
R. Yocum, and P. Gifford.
1982.
A GAL10-CYC1 hybrid yeast promoter identifies the GAL4 regulatory region as an upstream site.
Proc. Natl. Acad. Sci. USA
79:7410-7414 |
| 11. | Harper, J. W., G. R. Adami, N. Wei, K. Keyomarsi, and S. J. Elledge. 1993. The p21 Cdk-interacting protein Cip1 is a potent inhibitor of G1 cyclin-dependent kinases. Cell 75:805-816[Medline]. |
| 12. | He, F., W. S. Blair, J. Fukushima, and B. R. Cullen. 1996. The human foamy virus Bel-1 transcription factor is a sequence-specific DNA binding protein. J. Virol. 70:3902-3908[Abstract]. |
| 13. |
Kang, Y.,
W. S. Blair, and B. R. Cullen.
1998.
Identification and functional characterization of a high-affinity Bel-1 DNA binding site located in the human foamy virus internal promoter.
J. Virol.
72:504-511 |
| 14. |
Keller, A.,
K. M. Partin,
M. Löchelt,
H. Bannert,
R. M. Flügel, and B. R. Cullen.
1991.
Characterization of the transcriptional trans activator of human foamy retrovirus.
J. Virol.
65:2589-2594 |
| 15. |
Löchelt, M.,
R. M. Flügel, and M. Aboud.
1994.
The human foamy virus internal promoter directs the expression of the functional Bel-1 transactivator and Bet protein early after infection.
J. Virol.
68:638-645 |
| 16. |
Löchelt, M.,
W. Muranyi, and R. M. Flügel.
1993.
Human foamy virus genome possesses an internal, Bel-1-dependent and functional promoter.
Proc. Natl. Acad. Sci. USA
90:7317-7321 |
| 17. | Löchelt, M., S. F. Yu, M. L. Linial, and R. M. Flügel. 1995. The human foamy virus internal promoter is required for efficient gene expression and infectivity. Virology 206:601-610[Medline]. |
| 18. | Löchelt, M., H. Zentgraf, and R. M. Flügel. 1991. Construction of an infectious DNA clone of the full-length human spumaretrovirus genome and mutagenesis of the bel 1 gene. Virology 184:43-54[Medline]. |
| 19. | Mergia, A. 1994. Simian foamy virus type 1 contains a second promoter located at the 3' end of the env gene. Virology 199:219-222[Medline]. |
| 20. |
Mergia, A.,
E. Pratt-Lowe,
K. E. S. Shaw,
L. W. Renshaw-Gegg, and P. A. Luciw.
1992.
cis-acting regulatory regions in the long terminal repeat of simian foamy virus type 1.
J. Virol.
66:251-257 |
| 21. |
Mergia, A.,
K. E. S. Shaw,
E. Pratt-Lowe,
P. A. Barry, and P. A. Luciw.
1991.
Identification of the simian foamy virus transcriptional transactivator gene (taf).
J. Virol.
65:2903-2909 |
| 22. | Neumann, J. R., C. A. Morency, and K. O. Russian. 1987. A novel rapid assay for chloramphenicol acetyltransferase gene expression. BioTechniques 5:444-447. |
| 23. |
Oliphant, A. R.,
C. J. Brandl, and K. Struhl.
1989.
Defining the sequence specificity of DNA-binding proteins by selecting binding sites from random-sequence oligonucleotides: analysis of yeast GCN4 protein.
Mol. Cell. Biol.
9:2944-2949 |
| 24. |
Rauscher, F. J., III,
J. F. Morris,
O. E. Tournay,
D. M. Cook, and T. Curran.
1990.
Binding of the Wilms' tumor locus zinc finger protein to the EGR-1 consensus sequence.
Science
250:1259-1262 |
| 25. | Rethwilm, A. 1995. Regulation of foamy virus gene expression. Curr. Top. Microbiol. Immunol. 193:1-24[Medline]. |
| 26. |
Rethwilm, A.,
O. Erlwein,
G. Baunach,
B. Maurer, and V. Ter Meulen.
1991.
The transcriptional transactivator of human foamy virus maps to the bel 1 genomic region.
Proc. Natl. Acad. Sci. USA
88:941-945 |
| 27. | Sparger, E. E., B. L. Shacklett, L. Renshaw-Gegg, P. A. Barry, N. C. Pedersen, J. H. Elder, and P. A. Luciw. 1992. Regulation of gene expression directed by the long terminal repeat of the feline immunodeficiency virus. Virology 187:165-177[Medline]. |
| 28. |
Venkatesh, L. K.,
P. A. Theodorakis, and G. Chinnadurai.
1991.
Distinct cis-acting regions in U3 regulate trans-activation of the human spumaretrovirus long terminal repeat by the viral bel1 gene product.
Nucleic Acids Res.
19:3661-3666 |
| 29. |
Wright, W. E.,
M. Binder, and W. Funk.
1991.
Cyclic amplification and selection of targets (CASTing) for the myogenin consensus binding site.
Mol. Cell. Biol.
11:4104-4110 |
| 30. |
Zou, J. X., and P. A. Luciw.
1996.
The transcriptional transactivator of simian foamy virus 1 binds to a DNA target element in the viral internal promoter.
Proc. Natl. Acad. Sci. USA
93:326-330 |
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| J. Bacteriol. | Mol. Cell. Biol. | Microbiol. Mol. Biol. Rev. |
|---|