Previous Article | Next Article ![]()
Journal of Virology, April 2005, p. 4918-4926, Vol. 79, No. 8
0022-538X/05/$08.00+0 doi:10.1128/JVI.79.8.4918-4926.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Microbiology-Immunology, Feinberg School of Medicine, Northwestern University, Chicago, Illinois
Received 26 July 2004/ Accepted 1 December 2004
|
|
|---|
|
|
|---|
The HPV life cycle is linked to the differentiation state of the host cell (30). Viruses infect cells in the basal epithelia and translocate to the nucleus where viral genomes are established as episomes. In infected basal cells, viral genomes are replicated coordinately with cellular DNA, early viral proteins are synthesized at low levels, whereas late genes are not expressed. After cell division, one daughter cell migrates away from the basal layer and begins to differentiate. Upon differentiation in suprabasal cells, viral DNA amplification is activated, along with late transcription, leading to virion production (23). A link between productive viral replication and late gene expression has been well documented in other DNA tumor viruses such as simian virus 40 (SV40) (53, 54), polyomavirus (8, 9, 27, 31), and adenoviruses (51), but it is not clear if this extends to papillomaviruses.
Elements that regulate early gene expression are located in the upstream regulatory region (URR) of papillomaviruses (57). Many of the elements in the URR that are required for the induction of the early promoter (P97 for HPV31 and HPV16) have been characterized, and our understanding of the transcriptional control of the early promoter is extensive. The elements that regulate early transcription include keratinocyte-specific enhancers, as well as repressor sequences (3, 10, 12, 18, 55). Although there exist variations in the positioning of transcription factor binding sites among the different types of HPVs, most URRs contain binding sites for the same subset of factors, including AP1, C/EBPß, NF1, Oct1, Sp1, TEF-2, and YY1 (1, 2, 6, 7, 11, 19-21, 28, 38, 50, 55, 56). It is believed that these factors act in concert with each other to activate early transcription (4, 29, 32, 35), whereas a subset directly represses transcription or quench the activity of other factors (3, 22, 34, 36, 37, 41, 52). High-risk HPVs contain four binding sites in the URR for the E2 protein, which in bovine papillomavirus acts as the primary regulator of viral transcription (23). Binding of HPV E2 protein to sites distal to the start site of early transcription stimulates low levels of expression from p97 (47). In contrast, binding to promoter-proximal E2 sites inhibits early transcription (47). This inhibition of transcription appears to act by displacement of Sp1, along with TBP from sequences that overlap the promoter-proximal E2 binding site (14, 49). The regulation of early transcription by E2 may contribute to copy number control in undifferentiated basal cells by modulating expression of the viral replication proteins (48).
During the productive viral life cycle, HPV gene products are expressed from polycistronic transcripts (see Fig. 1). In HPV31, the early transcripts initiate upstream of the E6 open reading frame (ORF) at sequences around nucleotide (nt) 97 (P97). Upon differentiation, transcripts initiating at P97 appear to be unchanged or are modestly repressed, whereas a novel, late promoter is activated (26). The HPV31 late promoter initiates transcription at heterogeneous sites around nt 742 (P742) in the E7 ORF (25, 39). Coordinate with the activation of late expression, a change in DNase hypersensitivity around P742 has been observed, suggesting that a change in chromatin configuration occurs upon differentiation (13). Although many studies have investigated the role of various cis-acting elements in the activation of the early promoter, few have examined the factors or sequences responsible for activation of the late promoter. One potential regulator of late transcription has been suggested to be CDP, a factor that represses late transcription in undifferentiated cells but not in differentiated cells (36, 41). Additional potential transcription factor binding sites for Oct-1, SOX5, and SRY were identified around P742 (13), but it is not clear if these actually function to activate late expression.
![]() View larger version (13K): [in a new window] |
FIG. 1. HPV31 genome organization and transcript representation. (A) The HPV31 genome is depicted. Early and late ORFs are shown, as well as the early and late promoters and polyadenylation sites. The URR is represented by the solid black bar elements. (B) Some of the major early and late polycistronic messages are shown.
|
|
|
|---|
DNA and RNA analysis. Total genomic DNA was isolated by using previously described techniques (15). For Southern analysis total genomic DNA (5 µg) was digested with BamHI, separated on a 0.8% agarose gel, treated with alkaline, and transferred onto DuPont GeneScreen nylon membrane (NEN Research Products, Boston, Mass.) as described by the manufacturer. Hybridization was carried out by using the entire HPV31 genome as a probe as previously described (15). Total RNA was isolated with TRIzol reagent (Gibco-BRL/Invitrogen) according to manufacturer's recommendations. For Northern analysis, total RNA (10 µg) was separated on a 1.0% agarose gel and transferred onto a Zeta-Probe membrane (Bio-Rad, Hercules, Calif.). Membranes were then hybridized with probes for the entire HPV31 genome, GAPDH (glyceraldehyde-3-phosphate dehydrogenase), and involucrin by previously described methods (15). RNase protection analyses used a probe encompassing the E7 ORF which identifies both early and late transcripts as described by Pena et al. (13).
Plasmids.
The H31Luc plasmid has previously been described (24). Briefly, this plasmid contains the entire URR of HPV31 and sequences through the N terminus of E1 (nt 892) upstream of the firefly luciferase gene. The luciferase coding sequences are fused in frame with the E1 coding sequences at the tenth amino acid. Plasmid constructs p
7045-7557, p
7045-7816, p
7045-125, p
7045-609, p
7045-683, p
7557-7816, p
7816-125, p
124-611, and p
613-683 were made by digestion with convenient restriction endonuclease sites for KpnI (nt 7045) in the multiple cloning sites of H31Luc and SpeI (nt 7557), NsiI (nt 7816), PstI (nt 125), Bsu36I (nt 609), and Pvu II (nt 683) located in the URR and sequences upstream of P742 (see Fig. 2). After digestion, a fill-in reaction was performed, followed by blunt end ligation leading to the generation of the deletion mutant. Smaller 10-bp deletion constructs p
611-620, p
621-630, p
631-640, p
641-650, p
651-660, p
661-670, p
671-680, p
681-690, p
691-700, p
701-711, p
711-720, p
721-730, and p
731-740 were made by using the Gene Editor kit by Promega (Madison, Wis.) according to the manufacturer's instructions. Point mutations TAG*ATAA, TATAG*AA, and TAG*AG*AA in the early promoter, P97, were also created by using the Gene Editor kit. The Renilla vector with a thymidine kinase promoter, pRLTK, was obtained form Promega. All constructs were sequenced to confirm that they contained the desired mutation.
![]() View larger version (75K): [in a new window] |
FIG. 2. Southern analysis of viral genomes. LKP31 cells were suspended in methylcellulose without or with 40 µg of AraC/ml for 24 and 48 h. Total DNA was isolated, digested with BamHI (which does not cut the genome), electrophoresed on a 0.9% agarose gel, blotted, and hybridized with a probe for the HPV31 genome. Multimeric, circular/nicked, and supercoiled forms of the genome are indicated.
|
Electrophoretic mobility shift assays (EMSAs). Nuclear extracts were prepared as previously described (45). Briefly 2 x 106 cells, either LKP31 cells from monolayer cultures with feeders removed or LKP31 cells harvested after methylcellulose treatment, were collected and washed with TBS (Tris-buffered saline). The cell pellets were resuspended in cold buffer A (10 mM HEPES -pH 7.9], 10 mM KCl, 0.1 mM EDTA, 0.1 mM EGTA, 1 mM dithiothreitol [DTT], 0.5 mM phenylmethylsulfonyl fluoride [PMSF]) and allowed to swell on ice for 15 min. Nonidet NP-40 was added, and pelleted cells were resuspended in cold buffer C (20 mM HEPES [pH 7.9], 0.4 M NaCl, 1 mM EDTA, 1 mM EGTA, 1 mM DTT, 1 mM PMSF) and rocked for 15 min. The cells were pelleted, and the nuclear extract supernatant was divided into aliquots and stored at 70°C. Duplex DNA oligonucleotides for the following HPV sequences were purchased from IDT and reconstituted according to manufacturer's instructions: nt 625 to 645, 635 to 655, 655 to 675, 675 to 695, 685 to 705, 695 to 715, 705 to 725, 715 to 735, and 725 to 745. EMSA reactions were carried out as previously described (29). Briefly, 15 µg of nuclear extract was incubated with 2 µg of poly(dI-dC) in the presence or absence of a 100-fold molar excess of unlabeled competitor DNA oligonucleotides on ice for 20 min in a 25-µl reaction containing 10% glycerol, 25 mM HEPES (pH 7.9), 50 mM KCl, 4 mM MgCl2, 0.5 mM EDTA, 0.5 mM DTT, and 0.5 mM PMSF. After incubation, 10,000 cpm of 32P-end-labeled duplex DNA was added and incubated for 30 min at room temperature. The DNA-protein complexes were separated from free probe on a 4% polyacrylamide gel. The gel was dried and subjected to autoradiography.
|
|
|---|
It was next important to examine whether activation of late transcripts occurred in the absence of viral genome amplification. RNA was isolated from both AraC-treated and untreated differentiated cultures and examined for the presence of late transcripts by Northern analysis (Fig. 3). Upon differentiation, the induction of transcripts initiated at p742 was observed, including messages encoding E1/E4/E5, E1/E2/E5, as well as E1^E4/E5. The level of late transcripts produced in the presence of AraC was found to be markedly reduced but still significant upon differentiation. HPV31 late transcripts increased approximately 10- to 15-fold upon differentiation of untreated LKP31 cells, but less than a threefold increase, at both time points, was observed in the presence of the inhibitor. Longer exposure revealed a low level of late transcripts in the untreated sample; however, this is consistent with previous results showing a 1 to 2% spontaneous activation of late transcription in monolayer cultures of HPV-positive cells (42). Analysis of transcripts encoding GAPDH synthesis was also carried out to investigate whether AraC induced a general reduction in transcript levels. Previous studies have shown that GAPDH transcription decreases modestly during differentiation (46), and we observed similar changes with or without AraC (Fig. 3). This suggests that AraC does not alter the transcription patterns in treated cells. This observation is in agreement with previous published reports that demonstrate that AraC does not affect transcription (33). Furthermore, addition of AraC did not alter cellular differentiation as evidenced by a lack of change in the expression of the differentiation marker involucrin (Fig. 3). Involucrin expression has previously been shown to closely parallel induction of DNA amplification (43). We also performed RNase protection assays, and these demonstrated that the late transcripts seen in AraC-treated cells were in fact initiated from the late promoter (Fig. 4). These experiments were repeated three times with LKP31 cells, and similar results were seen in another HPV31-positive cell line, CIN612-9E, which was derived from a cervical cancer biopsy (data not shown). Our results indicate that the induction of late transcription can occur independent of viral amplification but that replication acts to increase the template number, leading to elevated levels of late expression.
![]() View larger version (61K): [in a new window] |
FIG. 3. Northern analysis of viral transcripts. LKP31 cells were suspended in methylcellulose without or with 40 µg of AraC/ml for 24 and 48 h. Total RNA was isolated, separated by electrophoresis, blotted, and hybridized with probes for the HPV31 genome, GAPDH, and involucrin. Various early and late HPV31 transcripts are indicated.
|
![]() View larger version (57K): [in a new window] |
FIG. 4. RNase protection analysis. CIN612-9E cells were suspended in methylcellulose without or with 40 µg of AraC/ml for 24 and 48 h. Total RNA was isolated and subjected to RNase protection analysis with P742 as a probe. Early and late spliced and unspliced transcripts are indicated.
|
![]() View larger version (13K): [in a new window] |
FIG. 5. H31Luc plasmid constructs. The H31Luc plasmid contains HPV31 sequences from the URR through the N terminus of E1 fused to the firefly luciferase gene (luc). Positioning of the early (P97) and the late (P742) promoters, as well as transcription factor binding sites within the URR, are indicated. Deletion mutants created by restriction endonuclease digestion using sites within the URR and other sequences upstream of P742 are shown.
|
![]() View larger version (20K): [in a new window] |
FIG. 6. Luciferase activity of early promoter mutants. (A) pBAS698 contains the entire URR and HPV31 sequences through nt 698 fused to the firefly luciferase gene. (B) The activity of pBAS698 early promoter mutants in monolayer cultures is represented. LKP31 cells were transfected with 2 µg of the luciferase plasmid, along with 50 ng of pRLTK. Cells were harvested after 48 h, and the ratio of firefly to Renilla luciferase activity was calculated. (C) The H31Luc plasmid and H31Luc/P97M plasmid were transfected into LKP31 cells, along with an internal Renilla control as described in Materials and Methods. After transfection, cells were induced to differentiate by suspension in methylcellulose. The ratio of firefly to Renilla luciferase activity was calculated for both monolayer and methylcellulose-treated cultures, and the fold induction upon suspension in methylcellulose is shown. The Renilla activity was slightly decreased upon differentiation; however, this decrease was consistent and still allowed from comparison between firefly constructs.
|
We next constructed a series of mutant reporter plasmids that included successive deletions from the 5' end of the URR proceeding through sequences immediately upstream of nt 742. In addition, a set of reporters containing internal deletions within the URR, as well as E6/E7 coding sequences upstream of p742, were constructed (Fig. 5). After transfection and suspension in methylcellulose, the firefly/Renilla ratios were determined and compared to the levels seen in monolayer cultures. The fold induction was calculated, and the results are shown in Fig. 7. Experiments were repeated at least three times, and most reporters were examined up to 10 times. The values presented in Fig. 7 represent averages of these experiments.
![]() View larger version (23K): [in a new window] |
FIG. 7. Induction of H31Luc and mutant plasmids upon differentiation. LKP31 cells were transfected with various deletion plasmids in both the wild type (A) and the P97 TATA box mutation, P97M (B). After transfection, cells were induced to differentiate by suspension in methylcellulose. Firefly luciferase activity was measured, along with the Renilla activity of a cotransfected control. The ratio of firefly to Renilla luciferase activity was calculated for monolayer and methylcellulose cultures, and the fold induction upon differentiation in methylcellulose was determined.
|
It was then important to more closely define regulatory sequences through the use of reporter constructs containing internal deletions in the E6/E7 region. For these analyses, we first focused in the region between nt 613 and 683 since this appeared to be a major regulatory region. This region overlaps sequences mapped by del Mar Pena and Laimins (13) found to contain initiation sites of late transcription, as well as regions that underwent chromatin rearrangement upon differentiation, as evidenced by the appearance of DNase-hypersensitive sites. We therefore constructed a series of plasmids that contained sequential 10-bp deletions of the region from positions 613 to 683 in the context of all upstream sequences. These constructs were tested in our transient assays at least three times, and many were tested up to five times, with the results shown in Fig. 8A. Of the eight reporters tested, we found the most significant effects upon deletion of sequences 661 to 670 and 681 to 690. Deletion of two other regions, positions 631 to 640 and 641 to 650, also appeared to have an effect on the induction of the late promoter, although it was not as pronounced as for the sequences from nt 661 to 670 and nt 681 to 690.
![]() View larger version (31K): [in a new window] |
FIG. 8. Induction of the H31Luc and mutant plasmids from nt 613 to 683 (A) and nt 691 to 740 (B). LKP31 cells were transfected with various deletion plasmids. After transfection, cells were induced to differentiate by suspension in methylcellulose. Firefly luciferase activity was measured, along with the Renilla activity of a cotransfected control. The ratio of firefly to Renilla luciferase activity was calculated for monolayer and methylcellulose cultures, and the fold induction upon differentiation in methylcellulose was determined.
|
EMSAs demonstrate factor binding to regions in the E6/E7 coding region. We next sought to investigate the factors that bound to the sequences identified in the E6/E7 coding region to be important for the activation of the late promoter through the use of EMSAs. Oligonucleotides were designed such that the 10-bp sequence examined in the previous luciferase assays was centered in a 20-bp duplex oligonucleotide. The strongest binding activities were detected for duplex sequences from nt 635 to 655 and 655 to 675, whereas oligonucleotides spanning sequences from nt 675 to 695 demonstrated moderate binding. A representative series of EMSAs for regions 625 to 645, 635 to 655, and 655 to 675 are shown in Fig. 9. Finally, duplex sequences from nt 625 to 645, 685 to 705, 695 to 715, 705 to 725, 715 to 735, and 725 to 745 all exhibited weak binding. In these initial studies, no significant differences were observed between binding of nuclear extracts from monolayer cells and cells treated with methylcellulose. However, we cannot exclude the possibility that there are moderate quantitative differences in binding activities.
![]() View larger version (68K): [in a new window] |
FIG. 9. Representative EMSA. Oligonucleotide (Oligo.) probes for sequences from nt 625 to 645, 635 to 655, and 655 to 675 were end labeled and incubated with nuclear extracts (N.E.) from monolayer (Mon.) and methylcellulose-treated (Diff.) cultures of LKP31 cells in the presence (+) or absence () of unlabeled oligonucleotide competitor (Comp.) DNA. A "B" indicates the specific retarded band, while an "F" indicates the unbound free probe.
|
|
|
|---|
Since our studies indicated that productive replication is not strictly required for activation of late transcription, it seemed appropriate to use reporter plasmids in transient reporter assays to study the cis sequences required for activation of the late promoter. Using a protocol in which cells were transfected with reporter plasmids and then induced to differentiate in semisolid media, we observed a low level of late expression in undifferentiated cells that was further enhanced by approximately three- to fourfold after suspension in methylcellulose. The low-level transcriptional activity in undifferentiated cells is consistent with the low-level activation we have observed in some monolayer cultures of cells that maintain viral episomes (42). In the present study, we observed that the major determinants of late transcription were found within the E6/E7 coding sequences. When we analyzed expression of a series of reporters containing deletion from the L1 side of the URR, we found that the most significant reduction in activity occurred upon loss of the E6/E7 sequences. In contrast, when the E6/E7 region was deleted in the presence of URR sequences, we observed a less significant reduction. This suggests there may be redundant elements in the URR that can substitute in activating late expression when the primary regulatory sequences in E6/E7 are deleted. We also identified a moderate negative regulatory element at the 5' end of the URR. Whether the negative element in the URR has physiologic significance is unclear at this time. However, a role for a negative regulator has been suggested in HPV6 where the binding of CAAT displacement protein to this region has been implicated as a regulator of late gene expression, and CEBP binding sites are located in this region (41).
We identified nine short regions within the E6/E7 region that seem to play important roles in the activation of late expression. These regions were also found to bind proteins in nuclear extracts, suggesting potential transcription factor binding. While some of these elements exhibited strong binding activities in our EMSAs, other sequences that scored in transient assays failed to exhibit significant binding activities. This may indicate either that we have not used the complete binding regions in our gel shifts or that spatial separation of sequences plays an important role in mediating transcriptional activation. A more detailed point mutational analysis will allow us to distinguish between these two possibilities.
We also failed to observe significant differentiation-specific changes in the binding activities in our EMSA analysis. This may suggest that either small quantitative differences in factor level may be responsible for differentiation-dependent activation or that constitutively expressed transcription factors act in the context of differentiation-dependent chromatin changes to mediate late expression. A detailed analysis of the factors involved will help to elucidate these possibilities.
A search of potential transcription factor binding sites by using the TRANSFAC database suggested several potential regulating factors (Table 1); however, most of these sites are nonconical, with only 70% homology to their consensus sequences. These factors include CEBP, the Oct-1 family of transcription factors, Ap-4, SOX, SRY, and CREB proteins. It is also possible that since start sites of late transcription are very heterogeneous and utilize initiator elements to direct initiation, some of these sequences may act as initiator elements. The identification of the binding factors involved in the regulation of HPV late expression will help to understand the mechanism by which differentiation-specific activation occurs.
|
View this table: [in a new window] |
TABLE 1. Potential transcription factor binding sites as determined by a search using the TRANSFAC database
|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»