Previous Article | Next Article ![]()
Journal of Virology, October 2003, p. 10213-10226, Vol. 77, No. 19
0022-538X/03/$08.00+0 DOI: 10.1128/JVI.77.19.10213-10226.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Department of Biochemistry and Molecular Genetics, University of Alabama at Birmingham, Birmingham, Alabama 35294-0005
Received 7 March 2003/ Accepted 30 June 2003
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The HPV and bovine papillomavirus (BPV) replication origin recognition protein and transcription factor E2 bind as a dimer with high affinity and specificity to multiple copies of a consensus E2 binding site (E2BS), ACCGN4CGGT, in the viral origin of replication (ori) (4, 9, 24, 36, 42, 46, 51, 60, 62, 75, 78a). The importance of E2 to replication in transfected cells is underscored by the requirement for one or more intact E2BSs and an E2 protein capable of binding to them (8, 41, 55, 71, 74). In vitro, the BPV type 1 (BPV-1) E2 protein prevents nucleosome formation around the ori, and this is critical for efficient assembly of a preinitiation complex (34). The other essential papillomavirus replication protein is the DNA helicase E1, which binds to its cognate site in the ori with low affinity and low specificity (27, 38, 39, 63, 78, 78a, 79). Efficient targeting of E1 proteins to the ori requires their direct interaction with E2 proteins bound to the E2BS that flank the E1BS. In vitro, E1 assembles on the ori as a hexamer or dihexamer (21, 38, 40, 61), a process greatly stimulated by the heat shock/chaperone proteins Hsp70 and Hsp40 (38, 40). The E1 dihexamer functions as a replicative helicase and is required throughout initiation and elongation of DNA replication (21, 38, 39). To establish the preinitiation complex, E1 recruits DNA polymerase
/primase and the single-stranded DNA binding protein RPA (3, 14, 23, 35, 45, 50). Efficient activation of initiation requires phosphorylation of E1 by the gatekeeper of S-phase entry, cyclin E/cdk2, which is recruited to the preinitiation complex by direct interaction with a motif in the amino-terminal domain of E1 (15, 37, 43). E2 protein is absent from the active replication complexes (39). In fact, the presence of E2 inhibits the E1 helicase, but the heat shock proteins are able to promote E2 dissociation from the DNA, relieve this repression, and enable de novo DNA synthesis (38).
It is thought that E1 and E2 are expressed in the basal and parabasal cells of the squamous epithelia to support extrachromosomal viral DNA maintenance replication as these cells enter S phase periodically. In productive infections, their expression is up-regulated in differentiated spinous strata to enable viral DNA amplification. While E1 functions mainly in replication, E2 also helps regulate promoters that control viral E6-E7 oncogene expression (12, 25). Moreover, the BPV-1 E2 protein tethers the viral DNA plasmids to mitotic chromosomes, enabling viral genome segregation in dividing cells (29, 33, 67, 77). HPV E2 proteins also play a role in viral DNA partitioning, but they mediate an association with the mitotic spindles rather than with mitotic chromosomes (B. A. Van Tine, L. Dao, T. M. Sonbuchner, B. Y. Lin, T. R. Broker, and L. T. Chow, unpublished data). Thus, the role of E2 protein changes during different stages of the virus life cycle. The exact E1/E2 ratio might then be expected to change correspondingly. Indeed, differentiation induces high levels of HPV type 31 (HPV-31) E1 and E2 transcription. The E1 mRNA reaches a greater abundance than E2 mRNAs (30, 49). Another study showed that elevated levels of an HPV-1 E1 expression vector increased transient ori DNA replication, while increasing levels of an E2 expression vector reduced replication (76). A slight repression of BPV-1 DNA replication by elevated E2 expression was also reported (44).
The primary transcripts for the HPV-11 E1 and E2 replication proteins extend from alternative 5' ends to the early region polyadenylation addition sites (nucleotide [nt] 4390 or 4392) (Fig. 1A) (47). HPV-11 E1 protein is encoded by sequences spanning nt 832 to 2778 (termination codon, nt 2779 to 2781), while the E2 protein is encoded in a different reading frame from a dedicated initiation codon at nt 2723 to 2725 through nt 3823 (termination codon, nt 3824 to 3826). Thus, the E2 coding region partially overlaps the E1 gene. Embedded within the E2 gene in a different reading frame is a short E4 open reading frame (ORF) (nt 3231 to 3578; termination codon, nt 3579 to 3581). The E4 ORF is accessed through the dominant splice nt 847^3325. Translation of this message begins with the amino terminus of E1 (nt 832 to 847) and continues into E4 after the splice (Fig. 1A). In productively infected, hyperproliferative lesions, an unspliced mRNA capable of encoding the E1 protein is only rarely detected. The majority of this primary transcript is processed to generate the predominant E1^E4 mRNA species, whereas a much less abundant species containing the nt 847^2622 splice encodes the E2 protein (5, 6, 13, 47, 58, 59, 66, 69) (Fig. 1A). Spliced species that may encode other forms of E1 or E2 proteins have also been reported (7, 12, 58, 59) (Fig. 1A). Complex mRNA splicing patterns have also been observed in other mucosotropic HPVs (28, 57, 64, 65). Thus, alternative mRNA splicing might regulate the production of both the E1 and E2 proteins, thereby modulating viral DNA replication. Until now, there have been very limited investigations of this issue. One study showed that splicing within the HPV-31 upstream E6 ORF modulated E1 and E2 expression (26). Two other studies mutated the predominant splice acceptor site in the corresponding E2/E4 region or a minor donor site within E1 in genomic HPV-31. Neither mutant was able to maintain extrachromosomal viral DNA (31, 70). However, this phenotype could have been a result of indirect effects of the mutations on the production of proteins other than E1 and E2.
|
| MATERIALS AND METHODS |
|---|
|
|
|---|
Cell cultures and transfections. The monkey kidney epithelial cell line COS7, which expresses the simian virus 40 large T antigen, and human kidney epithelial cell line 293 transformed by adenovirus E1A and E1B regions were maintained and transfected by electroporation as described previously (9, 80). 293 cells were used for transient replication assays and Western blotting, whereas COS7 cells were transfected for direct fluorescence microscopy, Western blot analysis, and mRNA isolation.
Transient replication assays. Transient replication assays were performed in 293 cells as described previously (9, 80). In assays using individual E1 and E2 expression plasmids, 0.5 µg of the HPV-11 origin-containing plasmid p7730-99 was cotransfected together with the indicated amounts of E1 and E2 plasmids. In assays using replicons, only the indicated amounts of replicons were transfected. In some experiments, an E1 or E2 expression plasmid was cotransfected with the replicon as indicated. Low-molecular-weight DNA was harvested 48 h posttransfection. One-half of the recovered DNA was digested with HindIII, which linearized the ori plasmid, while the other half was digested together with DpnI, which cut unreplicated input plasmids into small fragments. The digested DNAs were resolved in 0.8% agarose gels, Southern-blotted, and detected after hybridization with [32P]dCTP-labeled probes using the Megaprime DNA Labeling system (Amersham Biosciences Corp., Piscataway, N.J.). The probes were generated from either the ori plasmid p7730-99 or the pEGFP-C1 plasmid, the backbone of the replicons. The results were analyzed by PhosphorImager (Molecular Dynamics Inc., Sunnyvale, Calif.).
Direct fluorescence microscopy. After electroporation, 105 COS7 cells were grown on two-well chamber slides (Nalge Nunc International, Naperville, Ill.) for 24 h, fixed at room temperature with 4% paraformaldehyde in phosphate-buffered saline (PBS), washed five times with PBS, and overlaid with coverslips by using mounting medium with DAPI (4',6-diamidino-2-phenylindole) (Vector Laboratories, Inc., Burlingame, Calif.). Images were captured and processed with an Olympus AX70 fluorescence microscope with fluorescein isothiocyanate (FITC)/DAPI filters using an AxioCam digital camera (Carl Zeiss, Inc., Thornwood, N.Y.).
Antibodies and Western blotting. Rabbit polyclonal antibodies against HPV-11 E1, E2, and E1^E4 were raised in our lab (7, 9). Mouse monoclonal antibody JL-8 against GFP was purchased from Clontech, Inc. For Western blots, COS7 cells were lysed in radioimmunoprecipitation assay (RIPA) buffer (50 mM Tris [2-amino-2-hydroxymethyl-1,3-propanediol] [pH 7.5], 150 mM NaCl, 1% Nonidet P-40, 0.5% sodium deoxycholate, 0.1% sodium dodecyl sulfate [SDS], 1 mM phenylmethylsulfonyl fluoride, and 1% protease inhibitor cocktail [Sigma-Aldrich Co., St. Louis, Mo.]) 24 h posttransfection. 293 cells were lysed in the same way 48 h posttransfection. Protein concentrations were determined by a Bradford assay using Protein Assay Dye Reagent (Bio-Rad Laboratories, Inc., Hercules, Calif.). Proteins from each sample (in the amounts given in figure legends) were resolved by electrophoresis through SDS-12% polyacrylamide gels, followed by immunoblotting with individual antibodies, and were detected by enhanced chemiluminescence (ECL) reagents (Amersham Biosciences Corp.).
mRNA isolation, Northern blotting, and RT-PCR.
COS-7 cells were harvested 24 h posttransfection. Total and cytoplasmic poly(A) RNAs were isolated using an Oligotex Direct mRNA Mini Kit (Qiagen Inc., Valencia, Calif.). Five hundred nanograms of mRNA from each sample was resolved in formaldehyde gels under conditions recommended by the supplier and transferred to a Hybond N+ nylon membrane (Amersham Biosciences Corp.) in 20x SSC (1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate [pH 7.2]). The membrane was washed with 5x SSC, cross-linked using a Bio-Rad GS Gene Linker, incubated at 42°C overnight in prehybridization buffer containing 50% formamide, 5x Denhardt's solution, 0.5% SDS, 200 µg of single-stranded DNA/ml, and 5x SSPE (1x SSPE is 0.18 M NaCl, 10 mM NaH2PO4, and 1 mM EDTA [pH 7.7]), and hybridized with [
-32P]dCTP-labeled probes. The 32P-labeled probes specific for GFP or for E1 (nt 1072 to 2556) were prepared by nick translation. mRNA bands were detected by PhosphorImager.
To perform RT-PCR, mRNA samples were reverse transcribed with the Invitrogen SuperScript First-Strand Synthesis system. The cDNAs then served as templates for PCR amplification using Taq DNA polymerase (Invitrogen Co.) with different primer pairs, as indicated in each figure. Briefly, sense-strand primers were K (nt 5' 613 to 634 of pEGFP-C1), M (nt 5' 1333 to 1355 of pEGFP-C1), and P (nt 5' 848 to 870 of HPV-11 E1), and antisense primers were N (nt 3' 2757 to 2781 of HPV-11 E1), O (nt 3' 1467 to 1488 of pEGFP-C1), Q (nt 3' 3826 to 3803 of HPV-11 E2), and R (nt 3' 3383 to 3400 of HPV-11 E2).
| RESULTS |
|---|
|
|
|---|
|
|
To substantiate our hypothesis, Northern blot hybridization was conducted using total or cytoplasmic poly(A) RNAs isolated from COS7 cells transfected with pGFP-11E1 or the pEGFP-C1 vector. When a GFP-specific probe was used, a single, highly abundant GFP mRNA of 1.0 kb (Fig. 3A, band 1) was observed in pEGFP-C1-transfected cells. In cells transfected with pGFP-11E1, we detected a rare full-length 3.0-kb GFP-11E1 transcript (Fig. 3A, band 2) and several more-abundant but shorter species: a minor 1.6- to 1.8-kb doublet (band 3) and a major 1.2-kb species (band 4), containing E1 sequences of about 600 to 800 and 200 bases, respectively. This would explain why the probe to a portion of the E1 region detected only the full-length transcript (Fig. 3A, right panel). These data suggest that alternative E1 intragenic mRNA splicing events occurred. For comparison, the full-length E1 mRNA generated from the pMT2-E1 plasmid was also of very low abundance, presumably due to the same intragenic splices (Fig. 3A).
|
Mutation at the dominant splicing donor site at nt 847 abolishes the major spliced RNA and greatly increases E1 mRNA and protein expression To construct an efficient expression vector for the GFP-11E1 protein, we mutated the splicing donor site at nt 847 without changing the coding sequence. pGFP-11E1dm was then transfected into COS7 cells. Northern and Western blotting, fluorescent microscopy, and RT-PCR were performed, and the results were compared with expression from the wild type pGFP-11E1 vector.
Northern blotting with a GFP probe showed that the 3-kb unspliced transcript was the predominant RNA species (Fig. 3A, band 2). There were also some minor shorter species of approximately 1.6 to 1.8 kb (band 3) and 1.25 kb (band 5). In particular, band 5 appeared to migrate slightly more slowly than band 4, which was generated by the wild type pGFP-11 clone. Hybridization with the E1-specific probe (Fig. 3A, right panel) confirmed a significant increase in levels of the full-length GFP-E1 mRNA from the vector containing the mutation.
RT-PCR was then performed using primers (K-O and K-N) (Fig. 3B, left panel) to span the whole GFP-E1 sequence. The result showed that the nt 847^2622 splicing was abolished in the pGFP-11E1dm mRNA preparation, because band 4, which was abundant in pGFP-11E1 mRNA, was no longer detected. With primers P and N, we obtained an unspliced, nearly full length E1 product (nt 848 to 2781) (band 2) as well as several shorter products (Fig. 3B, right panel). By DNA sequencing, three alternative intragenic splices were identified. The splices of nt 1459^2622 (band 3a) and of nt 1272^2622 (band 3b) were identical to those recovered from the wild-type vector, but the band intensities were stronger. In addition, band 5, which was the shortest and had the highest intensity, contained a splice of nt 916^2622. This splice was not detected in mRNA derived from the wild-type vector. Collectively, these results showed that the loss of the major spliced species of nt 847^2622 increased the unspliced transcript level and also potentiated the formation of minor or new alternatively spliced species relative to the RNA species generated from the pGFP-11E1 plasmid containing the wild-type E1 sequence.
Consistent with the above analyses on RNAs, Western blotting (Fig. 2B) using an anti-E1 antibody clearly showed that pGFP-11E1dm expressed a much higher level of GFP-E1 fusion protein than pGFP-11E1. When blots were probed with the anti-GFP antibody, full-length GFP-E1 was the predominant protein expressed from pGFP-11E1dm. In contrast, the 33-kDa variant GFP observed with pGFP-11E1 was no longer detected. These results indicate that intragenic splicing plays a significant role in regulating the level of E1 protein expression.
The GFP distribution as revealed by fluorescent microscopy was consistent with the Western blotting results. When cells were transfected with pGFP-11E1dm, GFP was localized exclusively to the nucleus in the majority of the cells (Fig. 2A). However, in about 20 to 30% of GFP-positive cells, there was still some cytoplasmic GFP, with or without nuclear GFP as well. This might be a result of GFP-E1 overproduction. Alternatively, the subcellular localization of E1 could be cell cycle regulated, but the cytoplasmic E1 could have escaped detection in the past because of the low level of native E1 protein expression. When cells were cotransfected with an HPV-11 E2 expression vector, bright green dots were observed in the nuclei (data not shown), consistent with a previous report showing colocalization of E1 and E2 in distinct nuclear foci (72).
HPV ori replication is differentially regulated by the relative amounts of E1 and E2 in transient replication. We proceeded to examine the effects of the absolute and relative amounts of E1 and E2 proteins on the level of HPV ori replication in 293 cells by cotransfecting different amounts of E1 and E2 expression vectors. With small amounts of pGFP-11E1 and a fixed 5 µg of pMT2-E2, replication increased with increasing amounts of the E1 expression plasmid (Fig. 4A and B). In parallel, the pGFP-11E1dm plasmid was much more efficient than the wild-type pGFP-11E1 plasmid. Only 5 to 10 ng of pGFP-11E1dm was able to promote a detectable level of replication. In comparison, about 100 ng of the wild-type GFP-E1 plasmid was required to achieve a similar level of replication. Unexpectedly, with further increases in the amount of the pGFP-11E1dm plasmid to 0.5 to 1.0 µg or more, the extent of replication progressively decreased (Fig. 4A and B). These results were reproducible in several experiments (data not shown). To investigate this phenomenon, we repeated the experiments by using as much as 20 µg of the E1 plasmids in the presence of 5.0 µg of the E2 vector. Replication reached a plateau at approximately 1.0 µg of pGFP-11E1dm. Much beyond this input, replication decreased. In contrast, this decline was not observed with pGFP-11E1. The results of one of several such experiments are shown in Fig. 4C and D. To correlate the extent of ori replication with the level of E1 protein expression, Western blotting was performed with equal amounts of lysates of 293 cells transfected with a range of either E1 plasmid. Figure 4E shows that GFP-E1 protein was detected only when 10 µg of pGFP-E1 was transfected, but not with smaller amounts. In agreement with the results for COS cells, pGFP-E1dm generated higher levels of E1 protein, and 1 µg of plasmid expressed detectable GFP-E1 protein.
We then performed replication assays using increasing amounts of the E2 expression vector in the presence of 100 ng input of either pGFP-11E1 or pGFP-11E1dm, a level at which no negative effect of the pGFP-11E1dm was observed (Fig. 5A and B). Replication consistently increased with increasing levels of the E2 plasmid. Even at 50 µg of the E2 plasmid, we observed no repression. In addition, GFP-E1dm was always more efficient than GFP-E1 because of the higher levels of E1 protein produced by the mutated vector. These results were reproducibly observed. We also examined the effects of increasing the amount of the E2 expression plasmid in the presence of 1 µg of either of the E1 plasmids, a level at which replication by pGFP-E1dm peaked in the previous experiments (Fig. 4). The results showed that replication increased with E2 input (Fig. 5C and D). Although pGPF-E1dm was still more effective than pGFP-E1, the difference was not as dramatic as that in experiments in which 0.1 µg of the E1 expression vector was used. At the highest input level of E2 plasmid tested, the two E1 expression vectors supported comparable replication (Fig. 5C and D). However, as the amount of the E2 plasmid increased, a significant amount of slow-migrating, DpnI-resistant ori DNA was detected in cells transfected with pGFP-E1dm, but not in cells transfected with pGFP-E1. This material cannot be attributed to incomplete digestion of restriction enzymes, because an input of expression plasmids as high as 20 µg was completely digested.
|
Multiple spliced RNA species are generated from a replicon containing the genomic E1 and E2 sequences. It is possible that, with the separate expression vectors, the replication experiments described above did not reflect the true relative E1/E2 ratio normally achieved by the virus. To simulate the viral genome, we constructed a replicon, p11Rc (Fig. 1B), which contained the overlapping genomic E1 and E2 coding sequences (nt 832 to 3826). GFP was fused in frame with E1 as in pGFP-11E1. Into the plasmid, the 1-kb HPV-11 URR region (nt 7070 to 7933/1 to 99) spanning the ori was inserted upstream of its CMV promoter, which controls the expression of the viral proteins. In productively infected HPV-11 lesions, the primary transcript ending at the E region poly(A) site gives rise to E1, E2, E1^E4, E2C (i.e., E8^E2C), and possibly other proteins by alternative splicing. The p11Rc replicon contained all known splicing donor sites (nt 847, 1272, and 1459) and acceptor sites (nt 2622, 3325, and 3377) in the E region. We anticipated that the replicon would encode a GFP-E1 protein from an unspliced mRNA and a native E2 protein from a transcript with a splice of nt 847^2622.
The p11Rc replicon, pEGFP-C1, and pGFP-11E1 were separately transfected into COS7 cells. Lysates were analyzed by Western blotting in parallel. No GFP-E1 protein was detected from the replicon with the anti-GFP antibody except after prolonged exposure (Fig. 6A, lane 3; also data not shown). Rather, the antibody revealed three lower-molecular-mass bands. The fastest-migrating band had a mobility corresponding to that of GFP (Fig. 6A; compare lanes 1 and 3); it was much weaker than the other two at a shorter exposure and could be a result of cleavage in the linker between the GPF and fusion protein moieties (data not shown; but see Fig. 2B, lower panel). The middle band had a mobility similar to that of the variant GFP detected in pGFP-11E1-transfected cells (Fig. 6A; compare lanes 2 and 3), but it was not this protein (see below). The slowest and by far the most abundant of the three bands had the mobility of a GFP-E1^E4 protein (40 kDa) predicted from an mRNA with a splice between the dominant splice donor and acceptor sites, nt 847^3325.
|
Consistent with these interpretations, RT-PCR with primer pair K-Q or M-Q detected only one product, the spliced species of nt 847^3325 (Fig. 6C, bands 7 and 8). However, the predominance of this splice in the shortest of all the messages prevented the detection of any longer but much less abundant spliced mRNAs, such as E2 mRNA with the splice of nt 847^2622. To detect minor spliced species, we used a forward primer (primer P) immediately downstream of the splice donor site at nt 847. When forward primer P was used in conjunction with reverse primer R, five faint products were detected. The identities of these species are discussed in the following section.
A replicon with a mutation at the dominant splice donor site at nt 847 abolishes the E1^E4 mRNA and increases E1 mRNA production as well as that of alternatively spliced species. To test how a donor mutation would affect the production of both the E1 and E2 proteins and hence DNA replication, we introduced the same donor mutation at nt 847 as that present in pGFP-11E1dm into the replicon (p11Rc-E1dm). RT-PCR of transcripts from p11Rc-E1dm-transfected COS7 cells revealed the successful elimination of the dominant splice of nt 847^3325 (Fig. 6C, left panel). Western blotting showed that the major GFP fusion protein was full-length GFP-E1, which was much more abundant when generated by p11Rc-E1dm than when generated by p11Rc (Fig. 6A; compare lanes 3 and 6 with 4 and 7). The trio of low-molecular-weight GFP fusion proteins was below detection (Fig. 6A, lanes 4 and 10). In agreement with these findings, fluorescence microscopy revealed that GFP was primarily nuclear, as expected of the nuclear GFP-E1 protein (Fig. 6B).
We then performed RT-PCR to determine the spliced species generated by the p11Rc-E1dm replicon which could encode E2, in parallel with those generated by p11Rc. The sense-strand primer P (nt 5' 848 to 870) (Fig. 3B and 6C) and the reverse primer R (nt 3' 3383 to 3400) were used. This reverse primer is located downstream of all known splice acceptors, but it reduces the lengths of PCR products, facilitating their recovery. A number of bands from p11Rc-E1dm transcripts were detected (Fig. 6C). The sequences of six of the more prominent bands were determined. Each was a spliced species: 1459^2622, 1272^2622, 916^2622, 1459^3325, 1272^3325, and 916^3325. The first five spliced species were also observed in RNA from p11Rc, but the bands were much weaker (Fig. 6C). This is because the majority of the transcripts from p11Rc were the E1^E4 species (nt 847^3325), which was not amplified by the primer pair used. Sequence analysis showed that translation from all three spliced species using the nt 2622 acceptor site terminates upstream of the E2 initiation codon at nt 2723. Ribosome reinitiation could then give rise to a native E2 protein, as it does in the known E2 mRNA containing the nt 847^2622 splice. At present, we do not know which of the three alternative species encodes E2 in vivo. The nt 916^3325 splice fuses the amino-terminal 28 amino acids of E1 to the E4 frame; we call this product E1N^E4 (Fig. 1A). We attribute the high relative abundance of this RNA species in RT-PCR to preferential amplification of this shortest template. In actuality, GFP-E1N^E4 must be present in very small amounts, since the E4 antibody did not detect any band of the expected size (Fig. 6A, lane 10).
Interestingly, we could not detect E2 protein from either p11Rc-transfected or p11Rc-E1dm-transfected COS7 cells. For comparison, E2 from lysates of cells transfected with pMT2-11E2 was readily detected (Fig. 6A; compare lanes 11 and 12 with lane 13). Nevertheless, we infer that E2 was expressed from the wild-type and mutated vectors, because both replicated in 293 cells in the absence of a separate E2 expression vector, as described below. Collectively, these results suggest that the E2 messages generated from the genomic context are of reduced abundance or are translated less efficiently than a dedicated E2 mRNA from pMT2-E2. But part of the distinction might also be attributable to a difference in promoter strength between these vectors.
p11Rc replicates more efficiently than p11Rc-E1dm. Transient replication assays with both p11Rc plasmids were performed in 293 cells (Fig. 7). With a wide range of transfected DNA amounts, the full-length, linearized product generated from p11Rc was always more abundant than that from p11Rc-E1dm. These results suggest that E2 might be expressed at a reduced level from the mutated replicon relative to expression from the wild-type vector. To test this hypothesis, we complemented the replicon with either an E1 or an E2 expression vector. When 5 µg of either replicon was cotransfected with 0.2 µg of pGFP-11E1dm, replication increased only by a moderate amount, indicating that the E1 protein was not limiting (Fig. 7). However, when p11Rc-E1dm was cotransfected with 2.0 µg of the GFP-E1dm plasmid, a depression in replication was observed, consistent with the negative effects of elevated E1 protein levels or of an elevated E1/E2 ratio observed with individual expression vectors (Fig. 4). Cotransfection of 2.0 µg of the GFP-E1dm plasmid had very little effect on p11Rc (Fig. 7). We speculate that replication by p11Rc may have declined after peaking between the addition of 0.2 and 2 µg of p11Rc-E1dm, as described for pGFP-11E1dm (Fig. 4). In contrast, when 5 µg of either replicon was cotransfected with the pMT2-11E2 plasmid, replication increased greatly, up to about threefold with 10.0 µg of the E2 plasmid (Fig. 7), indicating that E2 was limiting for both replicons in the genomic context. However, the wild-type replicon still yielded more of the linearized full-length replication product than the mutated replicon. Interestingly, at a moderate to high input of p11Rc-E1dm, there was slower-migrating, newly replicated DNA (Fig. 7). Levels of this material increased significantly when the E2 plasmid was cotransfected, a finding similar to those observed for transfections with large amounts of pGFP-E1dm and E2 expression plasmids (Fig. 5C).
|
| DISCUSSION |
|---|
|
|
|---|
In the context of the E1 expression plasmid, the majority of the transcripts were a spliced species of nt 847^2622, and very little unspliced mRNA remained to encode the E1 protein (Fig. 2 and 3). Viral mRNA with this splice encodes the E2 protein (58). The small peptide of 29 residues resulting from the splice has not been detected previously from E1 expression vectors due to its small size. It was revealed in this study only because of the fusion to GFP (Fig. 2). We believe that this splice allows ribosome reinitiation to access the E2 ORF, after termination of the short upstream peptide, as is also hypothesized for the effect of E6 intragenic splices on the translation of E7 in the oncogenic HPVs (68). In contrast, in the context of the replicon, the predominant splice was nt 847^3325, generating the highly abundant GFP-E1^E4 mRNA and protein (Fig. 6). Consequently, only very low levels of E1 and E2 mRNA and protein are generated from the replicon (Fig. 6). It is interesting that, despite the rarity of the unspliced E1 mRNA, the E2 protein, but not the E1 protein, is limiting, as replication of the replicon is greatly enhanced when the E2 expression vector is cotransfected. In comparison, the effect of cotransfection of an E1 expression vector was less significant (Fig. 7). We speculate that E1 is efficiently translated, since it is the first protein encoded in the message, whereas E2 synthesis relies on translation reinitiation after the termination of an upstream peptide regardless of which of the splice donors is used (see below).
Since the splicing donor and acceptor sites and their various combinations in multiple spliced messages are highly conserved (13), our results concerning the regulation of E1 and E2 expression by alternative mRNA splicing are most likely applicable to other HPVs. Interestingly, although many comparable splice sites are also conserved in BPV-1, the corresponding E1 intragenic splice (nt 864^2558) has not been reported. Rather, the BPV-1 E2 mRNA is initiated from a separate promoter immediately upstream of the E2 ORF (2). Consequently, Rangasamy and colleagues (53, 54) were able to express a nuclear BPV-1 E1 protein tagged at the amino terminus with GFP and observed no abnormal distribution of GFP. Thus, this splice, if it indeed exists in BPV-1 mRNA, must occur at a very low frequency.
We have shown that, when the dominant splice donor at nt 847 was mutated, most of the E1 transcript was unspliced, accounting for the highly elevated production of E1 protein either from the E1 expression vector or from the replicon. Lysate dilutions showed that the level of E1 protein expressed in COS7 cells was 50- to 100-fold that detected from the wild-type expression vector (Fig. 2B and data not shown). In 293 cells, in which the replication assays were conducted and the expression plasmids did not replicate, a significant difference was also observed (Fig. 4E). These results validate our interpretation that the differences in the E1 protein levels and the ratios of E1/E2 proteins account for the differences in their replication activities (Fig. 4A through D). In addition, RT-PCR showed that the usage of previously known alternative minor splice donors at nt 1272 or nt 1459 increased relative to that observed in wild-type plasmids (Fig. 3B). A cryptic donor site at nt 916 was uncovered. The splice of nt 916^2622 was more frequently used by p11Rc-E1dm than by p11Rc (Fig. 6), and the nt 916^3325 splice was observed only with the mutated replicon. This novel donor site at nt 916 has not been reported previously in RNA preparations from lesions. We speculate that it is scarcely used because of its proximity to the dominant donor site at nt 847. Only when the dominant donor site is mutated is this cryptic site potentiated. Even then, the predicted E1N^E4 protein was not detected by Western blotting (Fig. 6).
As is observed in natural infections, there is also a hierarchy in the usage of the splice acceptor sites. The predominant spliced species from the replicon was nt 847^3325. E1^E4 is also the most abundant viral mRNA and protein in HPV-infected cells (5, 6, 13, 18, 52, 56, 66). Notably, the splice nt 847^2622 was recovered by RT-PCR only from the E1 expression vector, not from the wild-type replicon (data not shown). In fact, because this nt 847^3325 species was highly abundant and also because it was the shortest product, we recovered the minor alternative splices only when the forward primer was placed downstream of this donor site (Fig. 3 and 6). We also note that the splices of nt 916^2622, 1272^2622, and 1459^2622 have not been described previously. Rather, nt 1272 or 1459 is spliced to nt 3325 or 3377, generating either E2C or fusion peptides between portions of the E1 and E2 coding regions (7, 59) (Fig. 1A). We believe that at least one of these new species encodes the E2 protein, as p11Rc-E1dm supported its own replication. Indeed, a group of alternatively spliced HPV-16 mRNAs containing the E2 ORF has also been identified (64), and all of these can be translated to E2 in vivo and in vitro (1).
Our data show that E1/E2 ratios can significantly influence the level of replication. This was evident in three-plasmid cotransfection experiments (Fig. 4), where a high input of pGFP-11E1dm depressed rather than increased replication at a fixed input of the E2 expression vector. We believe that, for the same reason, the mutated replicon, which produced high levels of GFP-E1 protein, had a reduced efficiency of replication relative to the wild-type replicon (Fig. 6 and 7). It is possible that the negative effect of a high level of E1 protein is further exacerbated by less efficient alternative mRNA splicing or translation reinitiation of these transcripts from the mutated replicon relative to that of the primary E2 mRNA with the splice of nt 847^2622 produced by the wild-type replicon. This repression of replication by a high HPV E1/E2 ratio has not been observed previously because the wild-type E1 expression vector produces mostly spliced transcripts, so that it does not express an amount of E1 protein comparable to that generated by the E1dm mutant examined in this study.
What may have caused replication depression in the presence of high E1 concentrations? It is not likely due to a titration of critical interacting host proteins, as these proteins should be highly abundant and sufficient to support host DNA replication. A more plausible possibility is that multiple initiations from the template DNA could have occurred, resulting in disruption of orderly replication. DNA breakage and recombination might then lead to the formation of slow-migrating products (Fig. 5 and 7). Männik et al. (44) reported a smear of products, consisting primarily of small fragments shorter than the BPV-1 replicons, when the replicons produced large amounts of BPV-1 E1 protein. These abnormal products were attributed to an onion-skin mode of replication reinitiation from the ori. However, we observed only significantly longer products. The precise origin and nature of these products remain to be investigated.
In contrast to the negative effect of a high E1/E2 ratio, increasing the amount of the E2 expression vector invariably increases replication, indicating that E2 is limiting (Fig. 5). This result agrees with the previous observation in cell-free replication (14) but is in contrast to the report on HPV-1 (76). The reason for the discrepancy is unclear, but we note that, unlike the situation with the mucosotropic HPVs, HPV-1 E1 alone can initiate replication in the absence of E2 in transfected cells (22). What is unexpected is that E2 is limiting even for the wild-type replicon (Fig. 7). One might ask why the virus would want to limit E2 protein so as not to be able to achieve the highest level of replication possible for the amounts of E1 protein available. The answer may lie in the facts that the viral origin and E6 promoter overlap and that there are different modes of viral DNA replication as the keratinocytes differentiate. HPVs need E7 and E6 proteins to induce an S-phase environment in the differentiated, stratified epithelial cells in order to support productive viral DNA amplification. However, large amounts of E2 would inhibit transcription from the E6 promoter (10, 16, 17, 19, 20, 48, 73). Thus, a delicate balance must be struck between transcription and replication regulation.
The question of why so many spliced species are observed in transfected cells and in natural lesions is intriguing. There are several possible explanations. First, some of the alternatively spliced minor mRNAs indeed code for functional proteins. However, once the splice sites exist, the virus many not have a way to, or may not care to, preclude other, unintended splice combinations. Since most of the alternative spliced species tend to be rather minor or rare, the presence of the minor proteins encoded may not interfere with the more abundant functional proteins. Second, the less dominant splice sites might be beneficial for the virus because the same protein could be generated from a variety of spliced transcripts, as demonstrated in our study of p11Rc-E1dm. The third possibility is that the use of different splice sites could be conditional and might in fact be employed as a mechanism to regulate gene expression posttranscriptionally. Our data demonstrate that, by alternative mRNA splicing, the virus can control the expression of multiple proteins, their absolute levels, and their relative ratios without resorting to regulating separate promoters. This is a very flexible and efficient way to regulate its gene expression. In the case of the replication proteins, little or no E1 and E2 protein expression is needed in the quiescent basal cells until the cells enter S phase. More is needed in the rapidly cycling parabasal keratinocytes to ensure proper partitioning of viral DNA into the daughter cells. Still higher levels of E1 and E2 are needed to promote vegetative DNA amplification for progeny virion assembly in the differentiating spinous cells. Additional experiments will be needed to address this last possibility in vivo. Finally, our evidence that an E1dm mutant has high protein expression and authentic protein localization will facilitate future research on the E1 protein.
| ACKNOWLEDGMENTS |
|---|
DNA sequencing was performed by the UAB CFAR DNA sequencing core facility.
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
but not replication protein A. J. Virol. 69:2341-2350.[Abstract]
E2C fusion protein encoded by human papillomavirus type 11 is a sequence-specific transcription repressor. J. Virol. 65:3317-3329.
/primase. J. Biol. Chem. 274:2696-2705.
-primase p68 subunit. J. Virol. 72:7407-7419.
E4 protein. Virology 159:433-439.[CrossRef][Medline]