Previous Article | Next Article ![]()
Journal of Virology, February 2005, p. 2356-2365, Vol. 79, No. 4
0022-538X/05/$08.00+0 doi:10.1128/JVI.79.4.2356-2365.2005
Daniel Michalowski,1,
Andrei S. Zolotukhin,1
Ralf Schneider,2
Jenifer Bear,1
Patricia Roth,3
George N. Pavlakis,3 and
Barbara K. Felber1*
Human Retrovirus Pathogenesis Section,1 Human Retrovirus Section, National Cancer InstituteFrederick, Frederick, Maryland,3 Institut für Experimentelle Genetik/AG BIODV, GSF-Forschungszentrum für Umwelt und Gesundheit GmbH, Oberschleissheim, Germany2
Received 16 July 2004/ Accepted 21 September 2004
|
|
|---|
|
|
|---|
In contrast, simple retroviruses, such as simian type D retrovirus (4, 9, 10, 33), and the avian sarcoma viruses, like Rons sarcoma virus (23-26), depend on cellular trans-acting factors for the transport of their unspliced RNA. Type D virus expression depends on the viral cis-acting constitutive transport element (CTE) (4, 9, 10, 32, 33, 38). Structure-function analysis of the CTE revealed an extended stem-loop structure with two essential symmetrical internal loop regions (9, 33), which are conserved among all type D retroviral CTEs (9, 32, 33). In addition, we had also identified a CTE-related element in an intracisternal A particle retroelement (IAP) (32) which is located within an intron of the murine osteocalcin-related gene. This element, CTEIAP, preserves only the overall RNA structure and the sequence of the internal loop regions of the type D retroviral CTEs. The sequences of the double-stranded regions of CTEIAP are divergent, but the changes are of compensatory nature, demonstrating that the stem structure and not the sequence of this region is important for function. The conserved internal loops contain the direct binding sites for the cellular mRNA export factor NXF1 (previously named TAP) (14). NXF1 not only promotes the nucleocytoplasmic export of the CTE-containing RNA but, importantly, is also a key factor in the export of cellular mRNAs (3, 8, 14), a function which is conserved among eukaryotes (16, 29, 34). We previously identified an evolutionary precursor, the TAP-binding element, of type D retroviral CTEs and CTEIAP, which consists of a minisatellite sequence encoding the essential motifs found in the internal loops of the CTE. This sequence also folds into a structure that is similar to that of type D retroviral CTEs and CTEIAP, presenting the essential sequence motifs within a CTE-like context, and the TAP-binding element is functionally analogous to the CTE (36).
The discovery of the RNA transport element (RTE) identified another potent RNA export element which uses a cellular factor(s) for export. The RTE was identified in complementation experiments using a rev- and RRE-defective HIV-1 molecular clone. This screen was designed to identify RNA export elements in the mammalian genome by selecting for rescue of virus replication. Viruses rescued by this method contained the RTE, which belongs to a mouse IAP. Using the virus rescue assay, it was found that the RTE is as potent as the CTE in complementing a Rev- and RRE-mutated HIV (22).
Interestingly, on the basis of the presence of the RTEs (22) or CTE-related element, CTEIAP (32), the murine IAPs can be grouped into two subclasses. The RTE functions in many cell types of different species, indicating that its export factor(s) is widely expressed and evolutionarily conserved. Microinjection experiments have shown that the RTE also functions in Xenopus laevis oocytes (22). Here we present a detailed structure and function analysis of the originally identified RTE (22) by using assays of both microinjected Xenopus oocyte nuclei and transfected human cells. We further identified an extensive family of RTEs and RTE-related elements in the mouse genome. Comparison of the sequences and the predicted structures of these RTE-related elements further supported the experimentally identified RTE structure and the role of its key features in RNA export.
|
|
|---|
RNA export from Xenopus oocyte nuclei.
RTE mutants were generated by PCR, sequenced, and cloned into the unique SacII site in plasmid pBSAd1 (18) linearized with EcoRV (22). The CTE-containing adenovirus precursor RNA, U1
Sm RNA, and U6
ss RNA were described previously (18, 27). Capped, radiolabeled RNA was prepared from 1 µg of the linearized plasmids by using a MaxiScript kit (Ambion) as described previously (22). One injection contained approximately 2 x 104 cpm of pre-mRNA. Cold RNAs were obtained using an mMessageMachine kit (Ambion). Nuclear and cytoplasmic fractions were obtained by manual enucleation. RNA was extracted from a pool of five oocytes after proteinase K digestion, and equivalents of one-half oocyte were loaded onto 10% polyacrylamide gels containing 7 M urea.
HIV-1 Gag expression in mammalian cells. The RTE mutants were cloned into the unique SacII site of the pNLgag plasmid (30) or the BamHI-KpnI sites of pB37gag located between gag and the 3' long terminal repeat. pNLgag contains the major splice donor of HIV-1 located 5' of gag and a cryptic splice acceptor site located 3' of gag and the RTE, whereas pB37gag lacks the major HIV-1 splice donor and does not undergo splicing (28, 33). One microgram of plasmid was used to transfect HeLa-derived HLtat cells, which constitutively produce HIV-1 tat (11), with FuGene reagent (Roche). In addition to pNLgag, each transfection mixture contained 0.5 µg of the green fluorescent protein expression vector pFRED25 (31) as an internal control. Transfection efficiency was determined by fluorimetry of green fluorescent proteins. Expression of p24gag was determined with an HIV-1 p24 antigen enzyme-linked immunosorbent assay kit from Zeptometrix.
Nucleotide sequence accession numbers. The GenBank accession numbers for the RTE-related elements described in Figure 2 are AY876078 (group B) for the element found in AL663101, AY876079 (group C) for the element found in AL671215, and AY876077 (group D) for the element found in AC003061.
![]() View larger version (36K): [in a new window] |
FIG.2. Family of RTE-related elements in the mouse genome. (A) Alignment of the consensus sequences of RTE (group A) and of the RTE-related groups B, C, and D. The 226-nt RTE was used to identify related elements in the mouse genome. The shaded areas indicate the loops defined for RTE as shown in Fig. 1A. The letters k, m, r, s, w, and y are the degenerate DNA alphabet. (B) Alignment of RTE and a representative member of each group of RTE-related elements, found under GenBank accession numbers AL663101 (group B), AL671215 (group C), and AC003061 (group D). The shaded areas indicate the loops defined for RTE as shown in Fig. 1A. (C) Phylogenetic tree and comparison of the RTE structure (from Fig. 1A) and the predicted secondary structures for the RTE-related elements shown in panel B.
|
|
|
|---|
G of 84 kcal/mol.
![]() ![]() View larger version (98K): [in a new window] |
FIG. 1. Secondary structure of RTE. (A) The secondary structure of RTE is modeled using the mfold program and modified by experimental data. Arrows show the locations of specific sites of digestion induced by enzymatic probing and lead ion-induced cleavages. Thin and thick arrows indicate strong and very strong cuts, respectively. (B) Enzymatic and chemical probing of the 5'-end-labeled RTE molecule. Lanes: C, control (RTE RNA incubated without treatment); MB (mung bean nuclease I), P1, S1, T2, and T1, digestions using the indicated enzymes; Pb, cleavages induced by lead ions; LE, RNase T1 hydrolysis ladder; LF, formamide hydrolysis ladder. The positions of selected guanine residues and SL I to SL IV are indicated on the right.
|
RTE belongs to a family of related elements present in the mouse genome. We previously identified the presence of additional RTE-related elements in the rodent genomes (22). For a more complete analysis of these elements, the sequence of RTE (nt 1 to 226) was used as a query in a database search which was based on a rigorous Smith-Waterman algorithm (SSearch, implemented in the Genetics Computer Group package with the default parameters) by using the nonredundant rodent database, as available in 2001. The raw SSearch output was used with Pileup (Genetics Computer Group package) to generate multiple sequence alignments, which were manually refined to represent only full-length matches. This process resulted in a library of 326 elements sharing at least 70% identity with RTE. Upon visual examination of the alignment, the elements were subdivided into four groups, named A (which contains the previously identified RTE) through D. Figure 2A shows the comparison of the consensus sequences of RTE in group A (83 sequences) and the RTE-related elements in groups B (27 sequences), C (72 sequences), and D (144 sequences). Figure 2B shows the alignment of each group's representative, namely, RTE (A) and the RTE-like elements identified under GenBank accession numbers AL663101 (B), AL671215 (C), and AC003061 (D). Compared to RTE, there are nucleotide insertions in the region separating loops I and II in groups C and D and within a loop in SL II of groups B, C, and D. Interestingly, the sequences of loops in SL I and SL III are conserved, whereas the loop in SL IV in groups B and D has a 7-nt insertion (5'-AAGCCTG-3'). Figure 2C shows the predicted structures of the RTE-related elements identified in groups B, C, and D in comparison to the RTE structure. These data reveal that all the RTE-related elements are highly similar to the previously identified RTE (22) and share the key structural features of SL I to SL IV. Of note, all elements share the long stem of SL IV. This structural conservation is maintained despite the differences in sequence (Fig. 2A and B) because the sequence changes are found to be mostly compensatory to maintain the predicted stem structures. This finding is reminiscent of the mouse CTEIAP element, which has the same overall secondary structure as the type D retroviral CTEs (32) but shares only the sequences of the two internal loops while the sequences of the stem structures are different. There are also marked differences among the predicted structures of the RTE-related elements and the experimentally defined structure of RTE. The elements in groups A and C share a small loop in SL IV, while the elements in groups B and D have an insertion of 7 nt, resulting in a bigger loop. On the other hand, the elements of groups A and D have a big loop in SL III, while this loop is smaller in the elements of groups C and B. Most of the changes are concentrated in the SL I and SL II regions of the elements, affecting the predicted structures in these regions more severely. However, functional analysis (see below) showed that these regions are not essential for nuclear export of the RTE RNA, which may explain the tolerance of bigger variation in this part of the RTE. On the other hand, most of the changes in the remaining regions are of a compensatory nature and are predicted not to disturb the overall secondary structure.
Due to the subsequent availability of the complete mouse genome sequence, we performed new searches using SSearch with RTE as the query sequence, which identified more than 3,000 RTE-related elements, of which 105 elements are identical to the 226-nt RTE. The elements are distributed to all the chromosomes, and no clusters of RTEs were found. For comparison, the previously identified functional CTE-related element (CTEIAP) (32) is found in 275 highly related copies in the mouse genome (of which 1 element has 100% identity to the primary sequence, 272 elements have 82 to 97%, and 2 elements have 66 to 67%). Therefore, these searches show a high abundance of RTE and RTE-related elements in the genome, suggesting an association of these elements with more active IAPs. Whereas some of the RTEs are found to be located within IAPs like the originally identified RTE (22), the vast majority of IAP-related sequences in rodents represents molecular fossils of once intact ancestral retroelements, and it is therefore likely that the majority of the identified RTEs and RTE-related elements may not be found associated with IAP sequences.
Minimal RTE.
To identify the minimal functional RTE, we generated a series of 5'- and 3'-end deletion mutants from RTE (nt 1 to 226), such as M31 (nt 28 to 226), M2 (nt 57 to 217), M12 (nt 65 to 206), M13 (nt 104 to 199), and M14 (nt 116 to 189), as shown in Fig. 3A. The mutant RTEs were inserted into the intron of an adenovirus pre-mRNA and were tested for their ability to export intron lariats from microinjected Xenopus oocyte nuclei. As shown in Fig. 3B and previously (22), the intron lariat containing the intact RTE is exported to the cytoplasm. For a negative control, we tested the export of the empty adenovirus lariat, which remains in the nuclear compartment and shows only about 10 to 13% export. U1
Sm RNA and U6
ss RNA were coinjected to serve as quality controls and demonstrated the functionality of the nuclear export machinery and the intactness of the nuclei, respectively. Mutant RTEs M2 and M12 mediated export of the intron lariat, while mutants M13 and M14 were inactive. Quantitation of the export efficiency showed that although M12 is active, its function is impaired (20 to 24% export) compared to that of RTE (45 to 54% export) or M2 (30 to 40% export).
![]() View larger version (17K): [in a new window] |
FIG. 3. Analysis of RTE deletion mutants in oocytes. (A) Schematic representation of RTE deletion mutants. Circled nucleotides indicate the beginnings and ends of the corresponding RTE mutants. The boxed nucleotide indicates the start of the 5'-end deletion in RTE M31. M15 lacks SL III, as indicated by the boxed area. (B) RNA export activity of RTE mutants in microinjected X. laevis oocyte nuclei. Radiolabeled adenovirus pre-mRNAs containing the indicated mutant RTE were injected into Xenopus oocyte nuclei. Total (T), cytoplasmic (C), and nuclear (N) fractions were prepared after a 3-h incubation. The injected RNA mixture also contained U1 Sm RNA, which is exported into the cytoplasm (an indicator of RNA export), and U6 ss RNA, which remains in the nucleus (an indicator of nuclear integrity). The positions of the pre-mRNAs and intron lariats are indicated. The presence of the RTE is indicated by an open triangle in the lariat loop.
|
10-fold-higher activation. We also tested M2 in a simpler reporter plasmid, pB37, which lacks functional splice sites, and similar data were obtained (data not shown). These data indicated that the bottom portion of the RTE structure spanning SL I and SL II (Fig. 1A) is essential for function in mammalian cells. Since M2 was able to transport the intron lariat from the Xenopus oocyte nucleus but failed to mediate Gag production in the mammalian assay, we conclude that mutant M2 still retains the key interaction site(s) for the RNA export factor(s). As shown in Fig. 2, the region spanning SL I and SL II is most variable among the different groups in both sequence and structure. Our data indicate that this region may also play a role in the overall structure of the RTE. We cannot exclude the possibility that this region also contributes to the posttransport steps by promoting translation of the mRNA, which is the readout of the transfection assay. Alternatively, the different nature of the test RNAs containing the RTE mutant M2 or the test systems may explain the discrepancy between the two assay systems. For these reasons, the complete 226-nt RTE was chosen for further mutational analysis.
![]() View larger version (12K): [in a new window] |
FIG. 4. Analysis of RTE mutants by gag expression in human cells. The HIV-1 gag expression vector pNLgag, containing either no insert, the RTE, or the RTE mutant indicated, was transfected into HeLa-derived HLtat cells. Two days later, the cells were harvested and the level of Gag production was determined using an antigen capture assay. The increase in Gag production in the absence (pNLgag) or presence of RTE and RTE mutants was determined. The mean n-fold induction ± standard deviations of the results for the tested RTE and mutant RTEs are shown. Two independent clones for each construct containing RTEs were tested in duplicate.
|
![]() View larger version (14K): [in a new window] |
FIG. 5. The two variants of the SL IV loop are functional. (A) Cartoon of RTE and M20. Mutant M20 was engineered to have a larger SL IV as present in RTE-related elements in groups B and D. The nucleotide sequence of the RTE loop is shown in grey; the extra nucleotides in the longer variant loop are shown in black. (B) M20 promotes RNA export from the nucleus of X. laevis oocyte nuclei. Adenovirus pre-mRNAs containing either no insert (Ad), RTE, or M20 were generated. Upon injection of X. laevis oocyte nuclei with the radiolabeled precursor RNA, cytoplasmic (C) and nuclear (N) fractions were analyzed as described in the Fig. 3 legend.
|
![]() View larger version (26K): [in a new window] |
FIG. 6. Analysis of the stem structure of SL IV. (A) Mutant M7 contains a deletion of the distal stem (boxed). (B) RNA export of M7 from microinjected X. laevis oocyte nuclei. Adenovirus pre-mRNAs containing RTE or M7 were injected into nuclei, and the total (T), nuclear (N), and cytoplasmic (C) fractions were analyzed as described in the Fig. 3 legend. (C) Compensatory changes in SL IV were designed to change the primary sequence but to preserve the base-paired structure. Changed regions (boxed) for M21, M22, and M24 are shown. Mutated nucleotides are shown in bold. (D) RNA export of M21, M22, and M24 from microinjected X. laevis oocyte nuclei. Adenovirus pre-mRNAs lacking (Ad) or containing RTE or RTE mutants were injected into nuclei, and nuclear and cytoplasmic fractions were analyzed as described in the Fig. 3 legend.
|
![]() View larger version (16K): [in a new window] |
FIG. 7. Mutants containing changes in the middle region of RTE. (A) Boxed areas indicate the mutated regions, and bold letters indicate the nucleotide changes in the indicated RTE mutants. M30 contains an insertion of 10 G-C base pairs. (B) RNA export of the indicated RTE mutants from X. laevis oocyte nuclei. Adenovirus pre-mRNAs lacking (Ad) or containing RTE or RTE mutants were injected into nuclei, and the nuclear (N) and cytoplasmic (C) fractions were analyzed as described in the Fig. 3 legend.
|
|
|
|---|
Comparison of the RTE secondary structure to the computer-predicted structures of the RTE-related elements showed major differences in SL I and SL II and the loop of SL IV, whereas the stems of SL III and SL IV are highly conserved. We noted that several of the RTE-related elements have compensatory nucleotide changes preserving the structure of double-stranded stem regions. Although we did not individually test the RTE-related elements experimentally, we used these comparisons to design several of our RTE mutants. We addressed some of these changes experimentally and found that the regions of SL I and SL II are dispensable for export from X. laevis oocyte nuclei. In addition, both variants of the loops of SL IV are functional (wild-type RTE and mutant M20). Interestingly, all the RTEs share the long stem in SL IV, and experimental data showed that its role is of functional importance. Deletions in this region inactivate the element, but changes in its primary structure while preserving the base pairing are tolerated (compare M7 to M21, M22, and M24). This finding indicates that the structure of SL IV is important for function. Both the stem and the loop of SL III show a high degree of secondary structure conservation. SL III is essential for RTE function (M15 and M27). Notably, some mutants disrupting the potential tertiary interactions of SL III with SL IV (extending the lower stem of SL IV [M30] and changing the sequence between SL III and SL IV [M28]) were inactive. Together, these data indicate not only that SL III and SL IV are necessary but that they may be involved in higher-order interactions which are required for function. Therefore, the putative cellular binding factor(s) may interact with more than a single region, or, alternatively, more than one factor interacting with distinct regions is necessary to mediate RTE function. The interaction(s) appears to require proper spatial arrangement within RTE. Interestingly, we found one mutant (M26) able to further improve RNA export function. Notably, the nucleotide changes introduced into M26 involve four unpaired nucleotides which are conserved among all the elements.
The studies of retroviruses and retroelements have revealed the existence of potent RNA export elements which serve as important tools to understand retrovirus expression as well as mechanisms mediating cellular mRNA transport. These studies led further to the discoveries of the major pathways mediating mRNA and protein trafficking through the nucleus. Two distinct export pathways have been revealed: CRM1, utilized by the retroviral Rev, Rex, cORF, and an array of nuclear export signal-containing cellular proteins; and NXF1, utilized to export CTE-containing mRNA as well as cellular mRNAs. Similar to CTE, RTE utilizes a conserved cellular mRNA export pathway and its function is independent of CRM1 (22). Although previous work showed that the NXF1 protein itself is not a high-affinity binder to RTE (22), competition experiments have indicated NXF1 involvement (S. Smulevitch and B. K. Felber, unpublished data). One proposed model is that RTE export involves a factor(s) leading to NXF1 recruitment and thus linking RTE to a major export pathway.
Although the respective export factors interact with few nucleotides within RRE, RXRE, CTE, and RcRE/K, these elements contain several regions that are essential for their secondary structure, and the protein interaction site is part of a complex structure. Our findings for RTE are reminiscent of observations of the other studied retroviral cis-acting RNA elements, implying the existence of a complex RNA structure that is essential for function, which places the protein-binding motif into a context essential for the specific interaction. Since retroviruses and retroelements have evolved to take advantage of cellular pathways, the use of these systems has provided important tools for the identification of the cellular factors that mediate key steps in controlling eukaryotic gene expression. Recently, retrotransposition-competent IAPs have been identified in the mouse genome (7). Our analysis indicates that these retroelements contain intact RTEs belonging to the D group (our unpublished data). It will be of interest to study the contribution of these RTEs to IAP expression. The identification of active IAPs will allow us to further study the role of these elements in retrotransposition.
S.S. and D.M. contributed equally to the present study. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»