Previous Article | Next Article ![]()
Journal of Virology, September 2006, p. 8566-8581, Vol. 80, No. 17
0022-538X/06/$08.00+0 doi:10.1128/JVI.00837-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Kathleen Boris-Lawrie,3,4,5 and
Biao Ding1,4,5*
Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center, Ohio State University, Columbus, Ohio 43210,1 Department of Chemistry and Center for Biomolecular Sciences, Bowling Green State University, Bowling Green, Ohio 43404,2 Department of Veterinary Biosciences, Ohio State University, Columbus, Ohio 43210,3 Molecular, Cellular and Developmental Biology Graduate Program, Ohio State University, Columbus, Ohio 43210,4 OSU RNA Consortium, Ohio State University, Columbus, Ohio 432105
Received 23 April 2006/ Accepted 19 June 2006
|
|
|---|
|
|
|---|
Viroids are the smallest known nucleic acid-based infectious agents and self-replicating genetic units. Their "genomes" consist of single-stranded, circular RNAs ranging in size from 250 to 400 nucleotides (23). Viroids can replicate and spread throughout an infected plant, although they do not encode proteins, do not have encapsidation mechanisms, and do not require helper viruses. Furthermore, they cause devastating diseases by altering host gene expression and developmental processes (11, 59). Evidently, viroid RNA genomes contain all of the sequence and structural information needed to mediate or trigger the various functions associated with infection.
Potato spindle tuber viroid (PSTVd) is the type member of the family Pospiviroidae (8, 12). The PSTVd genome consists of 359 nucleotides and assumes a rod-shaped secondary structure in the native state (48) with five structural domains, as shown in Fig. 1A (26). This secondary structure, which is typical of viroids in the family Pospiviroidae, comprises many loops and bulges flanked by short Watson-Crick helices. Formation of this secondary structure is necessary for infection (62). During asymmetric rolling-circle replication of PSTVd (5), the plus circular strands serve as templates for the synthesis of concatemeric, linear minus strands, which then function as the replication intermediates for the synthesis of concatemeric, linear plus strands. These are subsequently cleaved into monomers and ligated into circular molecules (Fig. 1B). Without encoding proteins, PSTVd replicates in the nucleus of a host cell and therefore presents an ideal model to address the question of how the plant nuclear transcription machinery replicates an RNA template.
![]() View larger version (8K): [in a new window] |
FIG. 1. Secondary structure (A) and replication model (B) of PSTVd. (A) The rod-like secondary structure of PSTVd and the numbering of nucleotides are based on a report by Gross et al. (16). The five structural domains (26) include (i) the left terminal domain (TL, 1 to 46 and 315 to 359), (ii) the pathogenicity domain (47 to 73 and 286 to 314), (iii) the central domain (74 to 120 and 240 to 285), (iv) the variable domain (121 to 148 and 212 to 239), and (v) the right terminal domain (TR, 149 to 211). The loop E motif is in the central conserved region. (B) Asymmetric rolling-circle replication of PSTVd (5). The incoming circular monomeric plus-strand PSTVd serves as the initial template to synthesize linear, concatemeric minus-strand PSTVd. The latter functions as the replication intermediate to direct the synthesis of concatemeric plus-strand PSTVd, which is cleaved into unit length monomers and further ligated into circular forms.
|
The loop E motif, located in the central conserved region (CCR) of the central domain (Fig. 1A) of PSTVd, has been implicated in replication and other functions. This motif is similar to the loop E motifs of archaeal and eucaryal 5S rRNAs, which are also found in the conserved sarcin/ricin loops of 23S rRNA (29, 32). In vitro studies with longer-than-unit length PSTVd transcripts showed that cleavage and ligation occur between G95 and G96, which appear to involve the formation of a metastable tetraloop motif, followed by a conformational change into a stable loop E motif (3). A single nucleotide substitution, C259U, in loop E enhances replication by 5- to10-fold in cultured cells of tobacco (Nicotiana tabacum) (43). The enhanced replication in tobacco cells may account for the adaptation of PSTVd to this host (63, 66). The U257A substitution in the loop E motif also enables PSTVd to infect tobacco (43, 66) and confers lethal symptoms in tomato (42). These in vitro and gain-of-function mutational results suggest that loop E plays one or more crucial roles in viroid-plant interactions.
Several critical issues must be addressed in order to assign a specific function of viroid replication in vivo to a particular viroid structural element. First, whether a motif exists in vivo needs to be demonstrated. Second, reversions of mutated nucleotides to the wild type or gain-of-function mutations do not provide direct evidence for the role of the affected RNA structural element in a biological process. Loss-of-function mutations, on the other hand, will provide definitive proof linking the affected structural element to a function. Third, infection assays on whole plants do not reveal whether failure of infection of a mutant results from defects in a specific step of the replication cycle or in intra- or intercellular trafficking. To address these issues, it is imperative to integrate structural, genetic, cellular, biochemical, and molecular approaches.
Genetic alteration of an RNA structural element for functional studies requires precise understanding of its tertiary structure and the effects of mutations on that structure. However, while the secondary structures of viroids have been well established through thermodynamic, chemical mapping, mutational, and nuclear magnetic resonance (NMR) studies (9, 55, 62), much less is known about their tertiary structures and specific functions. Minimum-free-energy calculations (e.g., mfold; 67), coupled with conventional comparative sequence analysis, reveal RNA secondary structures comprising double-stranded helices formed by Watson-Crick base pairs, punctuated by so-called loops and bulges (hairpin, internal, and junction or multihelix loops) defined by lack of Watson-Crick base pairs. However, most RNA loops in structured, biologically active RNA molecules have been shown by X-ray crystallography and NMR spectroscopy to form specific, well-structured three-dimensional (3D) motifs in which many of the bases form non-Watson-Crick base pairs and/or stacking interactions (31-33). Thus, current thermodynamic calculations generally cannot reveal the nature of non-Watson-Crick base pairing within loops and offer little help in designing mutations that disrupt or restore such base pairing.
As shown in Fig. 2, RNA bases pair by hydrogen bonding at their Watson-Crick, Hoogsteen, or sugar edge and with their glycosidic bonds oriented cis or trans relative to each other (34). Non-Watson-Crick base pairs result when either or both bases interact at their Hoogsteen or sugar edge. All base pairs, including Watson-Crick and non-Watson-Crick base pairs, can be classified geometrically into 12 families by noting (i) the base edges used to form hydrogen bonds and (ii) the relative orientations (cis or trans) of the glycosidic bonds of the interacting nucleotides (32, 34). Two examples of these families, the cis and trans Hoogsteen/sugar edge base pairs, are shown in the lower part of Fig. 2. In non-Watson-Crick base pairs, the Watson-Crick edges may be available for further interactionswith other RNAs, proteins, or small molecules. In fact, one base can potentially pair with up to three other bases at the same time, allowing the formation of complex motifs involving base triples, quadruples, etc.
![]() View larger version (30K): [in a new window] |
FIG. 2. Geometric classification of RNA base pairing. The upper panel shows that each nucleotide base has three edges (Watson-Crick, Hoogsteen, and sugar) that can potentially hydrogen bond with one of the three edges of another base. Thus, each base can be represented schematically by a triangle and can potentially pair with up to three other bases. The interacting bases can pair with a cis or trans relative orientation of their glycosidic bonds. This is illustrated in the lower panels for the cis and trans orientations of nucleotides pairing at the Hoogsteen edge of one base and the sugar edge of the second base. In these base pairs, the Watson-Crick edges of the interacting bases are available for further interactionswith other RNAs, proteins, or small molecules. The cross and circle in the triangle where the Hoogsteen and sugar edges meet indicate 5' 3' and 3' 5' orientations, respectively, of the sugar-phosphate backbones relative to the plane of the page. W-C, Watson-Crick edge; H, Hoogsteen edge; SE, sugar edge. (Adapted from reference 34 with permission from the RNA Society.)
|
In this paper, we present our studies of the PSTVd loop E tertiary structure and its function in PSTVd replication by combining isostericity matrix analysis with genetic and functional assays. Our results provide the first loss-of-function genetic evidence for the essential role of a viroid 3D motif in replication. They also demonstrate that the integration of rationalized mutagenesis, based on isostericity matrix analysis of non-Watson-Crick base pairing, with in vitro and in vivo functional assays provides a novel, comprehensive approach to investigate the precise tertiary-structure-function relationships for a wide range of RNA motifs in diverse biological processes.
|
|
|---|
Additional examples of isosteric base pairs published since the original compilation of Leontis et al. (30) were obtained with the geometrical RNA motif search program FR3D (Find RNA 3D; M. Sarver, J. Stombaugh, A. Mokdad, C. L. Zirbel, and N. Leontis, submitted for publication).
Plasmid construction. Plasmid pRZ6-2 carries cDNA of PSTVdInt (16) that is flanked by ribozymes at both ends. It was constructed by Hu et al. (22) and kindly provided by Robert Owens. All mutants were generated by site-directed mutagenesis with a QuikChange site-directed mutagenesis kit (Stratagene, La Jolla, CA) according to the manufacturer's instructions. The introduced mutations were confirmed by sequencing.
Plasmids pInter(+) and pInter(), which were used as templates for in vitro synthesis of riboprobes specific for minus- and plus-strand PSTVd RNAs, respectively, were described by Qi and Ding (43).
The full-length cDNA of enhanced green fluorescent protein (GFP) was amplified by PCR with specific primers and with pEGFP-1 as the template (Clontech, Mountain View, CA) and cloned into pGEM4Z (Promega, Madison, WI) at EcoRI and BamHI sites to give rise to pGEM4Z:EGFP to be used for in vitro transcription.
In vitro transcription. To prepare a PSTVd inoculum, plasmids containing the PSTVd cDNAs were linearized with HindIII. The linearized plasmids were used as templates for in vitro transcription with a T7 MEGAscript kit (Ambion, Austin, TX) according to the manufacturer's instructions. Because of the nature of the restriction site used for DNA cloning, the PSTVd transcripts after ribozyme self-cleavage have G88 at the 5' end and G87 at the 3' end. These unit length PSTVd transcripts were used as the inoculum.
To prepare PSTVd RNA substrates for UV cross-linking or ligation analyses, in vitro transcription was performed with the T7 MAXIscript kit (Ambion) in the presence of [
-32P]UTP. To make GFP transcripts that served as control substrates in UV cross-linking experiments, plasmid pGEM4Z:EGFP was linearized by EcoRI to serve as the template for in vitro transcription. The 32P-labeled PSTVd in vitro transcripts were separated on 5% polyacrylamide-8 M urea gel, and unit length monomeric bands were eluted in 0.3 M NaCl and precipitated with ethanol.
For microinjection, PSTVd transcripts were synthesized with a T7 MAXIscript kit (Ambion) in the presence of Alexa Fluor 488-5-UTP (Molecular Probes, Eugene, OR). Fluorescently labeled 434-nucleotide vector RNA transcripts were synthesized from NaeI-digested pGEMT (Promega) template DNA with an SP6 MAXIscript kit (Ambion).
After all transcription reactions, DNA templates were removed by DNase I treatment. RNA transcripts were purified with a MEGAClear kit (Ambion) and further quantified by UV spectrometry or with a scintillation counter (for radioactive riboprobes).
Protoplast inoculation. Nicotiana benthamiana suspension cells were cultured as described by Sunter and Bisaro (57). Protoplasts were prepared and inoculated with PSTVd transcripts by electroporation as described by Qi and Ding (43). Detailed protocols for the maintenance of N. benthamiana suspension cells, preparation of protoplasts, and electroporation were described by Zhong et al. (65). At 3 days postinoculation, transfected protoplasts were harvested for RNA extraction and Northern blot analysis.
RNA extraction and Northern blotting.
Total RNAs were isolated from infected plants and protoplasts with an RNeasy Plant Mini Kit (QIAGEN, Valencia, CA). The procedures were essentially the same as those described by Qi et al. (44). Ten-microgram RNA samples were run on 5% polyacrylamide-8 M urea gel, transferred to a Hybond-XL nylon membrane (Amersham Biosciences, Piscataway, NJ) with a vacuum blotting system (Amersham), and immobilized by UV-cross-linking. Hybridization with [
-32P]UTP-labeled riboprobes was carried out at 65°C with ULTRAhyb reagent (Ambion). After overnight hybridization, the membranes were washed twice at 65°C in 2x SSC (1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate)-0.1% sodium dodecyl sulfate (SDS) and twice in 0.2x SSC-0.1% SDS and exposed to a storage phosphor screen (Kodak, Rochester, NY). Hybridization signals were quantified with a Molecular Imager FX with Quantity One-4.1.1 software (Bio-Rad, Hercules, CA).
Sequencing of RNA progeny. The protocols for preparing cDNAs of the PSTVd progeny isolated from protoplasts were performed essentially as described by Qi and Ding (43). Briefly, cDNAs of PSTVd RNA were reverse transcription (RT)-PCR amplified and sequenced in both directions with an ABI 377 DNA sequencer (Perkin-Elmer, Boston, MA) at the DNA Sequencing Facility at Ohio State University.
Preparation of N. benthamiana nuclear extract (NbNE). NbNE was prepared as described by Baumstark and Riesner (2) and Roberts and Okita (47), with slight modifications. All steps were performed on ice. Protoplasts prepared from cultured cells of N. benthamiana (see above) were first resuspended in 15 ml of buffer A (20 mM morpholineethanesulfonic acid-KOH [pH 5.8], 20 mM K-acetate, 15% Ficoll 400, 0.15 mM spermine, 0.5 mM spermidine, 10 mM ß-mercaptoethanol, 0.5 mM phenylmethylsulfonyl fluoride, 0.15 µM pepstatin, 0.6 µM leupeptin) and then lysed in a Dounce homogenizer with 10 to 15 strokes. 4',6'-Diamidino-2-phenylindole (DAPI) staining was used to monitor the release of nuclei. The nuclei were purified by loading the homogenates onto a two-step gradient consisting of 8 ml of buffer B (87.6% [vol/vol] Percoll, 0.62x buffer C [buffer A minus Ficoll]) and 12 ml of buffer D (buffer A with 18% instead of 15% Ficoll 400). The mixtures were centrifuged at 4,000 x g for 60 min at 4°C. The interphase containing the nuclei was collected with a glass pipette, and 1.5 volumes of buffer C was added. The mixtures were centrifuged at 1,400 x g for 30 min at 4°C, and the pelleted nuclei were resuspended in buffer E (60 mM HEPES-KOH [pH 7.9], 0.12 mM EDTA, 0.84 mM Mg-acetate, 0.72 µM leupeptin, 0.18 µM pepstatin, 12 mM ß-mercaptoethanol, 0.6 M KCl). The sample was stirred gently for 1 h at 4°C and centrifuged at 36,000 x g for 60 min at 4°C with an L8-80 M Ultracentrifuge with an SW41 rotor (Beckman, Fullerton, CA). Saturated (NH4)2SO4 was slowly added to the resulting supernatant to a final concentration of 75% and centrifuged at 36,000 x g for 30 min at 4°C to precipitate proteins. The pellet was resuspended in buffer F (20 mM HEPES-KOH [pH 7.9], 10 mM Mg-acetate, 50 mM K-acetate, 5 mM EDTA, 12 mM ß-mercaptoethanol, 25% glycerol) and dialyzed overnight at 4°C in buffer F. Aliquots were quickly frozen in liquid nitrogen and stored at 80°C.
RNA circularization assay. The RNA circularization assay was conducted by incubating 32P-labeled and gel-purified unit length PSTVd in vitro transcripts and control RNAs in NbNE. In a reaction mixture with a final volume of 20 µl, 1 x 105 cpm of linear unit length plus-strand PSTVd transcripts was mixed with 4 µl of 5x ligation buffer (100 mM Tris-HCl [pH 8.0], 30 mM Mg-acetate, 1 mM spermidine, 2 mM EDTA) (40), 1 µl of RNaseOut (Invitrogen), and 10 µl of NbNE and incubated at 37°C for 2 to 3 h. The reaction was terminated by addition of 260 µl of 2x proteinase K buffer (200 mM Tris-HCl [pH 8.0], 25 mM EDTA [pH 8.], 1% SDS) and 10 µl of 20-mg/ml proteinase K and incubation at 65°C for 30 min, followed by phenol extractions (phenol was saturated with 10 mM Tris-HCl buffer [pH 4.0]). The reaction products were ethanol precipitated and analyzed on 5% polyacrylamide-8 M urea gel.
UV-cross-linking to analyze PSTVd loop E.
UV cross-linking was performed as described by Schrader et al. (49) under conditions that favor the formation of a stable PSTVd secondary-structure (equivalent to the ExL conformation of their substrate) containing loop E. Briefly, 32P-labeded in vitro transcripts of gel-purified unit length PSTVd transcripts (
105 cpm) were preincubated in high-ionic-strength buffer (500 mM NaCl, 4 M urea, 1 mM Na-cacodylate, 0.1 mM EDTA [pH 7.9]) at 40°C for 45 min, followed by slow cooling to room temperature overnight. The reaction mixtures in Eppendorf tubes were placed on ice and irradiated with 258-nm UV light at different time intervals with a Stratalinker (model 1800; Stratagene) in a cold room. The samples were ethanol precipitated and analyzed on 5% polyacrylamide-8 M urea gel.
Microinjection and microscopy. A small patch of epidermis was removed from the lower surface of an N. benthamiana leaf and covered with distilled water immediately. The leaf was mounted onto a glass plate to fit the stage of a Nikon E600 epifluorescence microscope (Nikon, Tokyo, Japan). Fluorescently labeled RNA transcripts were loaded into glass pipettes made from thin-walled glass tubes (World Precision Instruments, Sarasota, FL) on a pipette puller (model PB-7; Narishige, Tokyo, Japan). Injection was performed with an MMO-203 micromanipulator (Narishige) and a 1 M-5B injector (Narishige). Subcellular localization of the injected RNA transcripts was visualized with a filter set consisting of a 450- to 490-nm excitation filter, a 510-nm dichroic mirror, and a 520- to 560-nm barrier filter. Images were recorded and processed with a SPOT 2 Slider charge-coupled device camera and the associated software (Diagnostics Instruments Inc., Sterling Heights, MI). DAPI was injected into the same cells to visualize the nuclei after imaging of RNA localization. DAPI fluorescence was visualized with a filter set consisting of a 330- to 380-nm excitation filter, a 400-nm dichroic mirror, and a 435- to 485-nm barrier filter and recorded as described above.
Quantitative real-time RT-PCR. Total RNA isolated from N. benthamiana protoplasts was treated with a TURBO DNA-free kit (Ambion) according to the manufacturer's instructions. One microgram of total RNA was reverse transcribed with the ThermoScript RT-PCR system (Invitrogen) with primer (+)-97-125 (5'-CCTTTTTTGCCAGTTCGCTCCAGGTTTCC-3') for plus-strand PSTVd, primer ()-13-39 (5'-CGTGGTTCCTGTGGTTCACACCTGACC-3') for minus-strand PSTVd, and primer 1093-1114R (5'-CCCGGAACCCAAAAACTTTG-3') for 18S rRNA. The ThermoScript reverse transcriptase was inactivated by incubating the mixture at 85°C for 5 min. The template RNA was removed by RNase H treatment at 37°C for 1 h. Real-time PCR was performed with a QuantiTect SYBR green PCR kit (QIAGEN) in a PCR mixture containing a 2-µl cDNA sample (the RT products diluted fourfold), 0.5 µM forward and reverse primers, and 1x QuantiTect SYBR green PCR Master Mix. The primer combinations used for real-time PCR were as follows: primer (+)-97-125 and primer (+)-13-39 for amplification of either plus- or minus-strand PSTVd cDNA templates and primer 1093-1114 R and primer 1000-1020 F (5'-GATCAGATACCGTCCTAGTC-3') for amplification of the 18S rRNA cDNA template. The real-time cycler conditions for the LightCycler system (Roche, Indianapolis, IN) were 10 min at 95°C for activation of HotStar Taq DNA polymerase, followed by 40 cycles of reactions (95°C for 30 s, 55°C for 30 s, and 72°C for 30 s). The melting curve analysis was performed to determine the specificity of the amplified products. Positive (standard-curve) and negative (no-template) controls were measured in each of the PCR runs. The standard curve was generated by serial dilutions ranging from 100 to 1010 copies of pRZ:PSTVdInt. 18S rRNA was used for normalization of PSTVd RNA accumulation levels. PSTVd copy numbers were calculated with the standard curve. The PCR results were analyzed with LightCycler software version 3.5 (Roche, Indianapolis, IN). For wild-type PSTVd and each of the mutant forms, results from six replicate RT-PCR experiments (three from samples of each of the two biological replicates) were used to calculate the mean and standard error. To facilitate comparison, the level of PSTVdInt accumulation has been arbitrarily set to a value of 1 and the levels of the mutants are presented as relative values.
|
|
|---|
![]() View larger version (24K): [in a new window] |
FIG. 3. Tertiary-structural model of PSTVd loop E. (A) Paradigmatic sarcin/ricin motif based on X-ray crystal structures (adapted from reference 29 with permission from Elsevier). (B) Inferred PSTVd loop E structural model. The dashed arrows indicate local changes in the strand orientation. All symbols that denote non-Watson-Crick base pairs and strand orientations are based on a report by Leontis and Westhof (34). Circles, squares, and triangles indicate the participation of Watson-Crick, Hoogsteen, and sugar edges, respectively. Open symbols indicate base pairs with a trans orientation of the glycosidic bonds, and closed symbols indicate base pairs with a cis orientation.
|
![]() View larger version (21K): [in a new window] |
FIG. 4. The five non-Watson-Crick base pairs that form the core of the PSTVd loop E motif. Examples of natural variants or experimentally designed mutations that can form isosteric base pairs are also illustrated. Y, pyrimidine (U or C); W, water molecule. Ribose is represented by a closed circle, except when it is involved in hydrogen bonding.
|
![]() View larger version (37K): [in a new window] |
FIG. 5. The functional status of viroid mutants and natural variants mapped onto the respective isostericity matrices for each base pair, according to the geometric family to which it belongs (30). I1 to I5 denote subfamilies within each of which the base pairs are isosteric (see Materials and Methods for the definition of isostericity and reference 30 for details of the classification of subfamilies). Each blank and red box indicates that non-Watson-Crick base pairing between the two nucleotides in the family has not been reported in X-ray crystal structures or cannot be formed on the basis of isosteric predictions (30). Chrysanthemum stunt viroid (CSVd) has sequences identical to those of PSTVd in loop E. For sources of data and abbreviations of all other viroids, see Table 1.
|
The U/A pair can form by weak hydrogen bonding between A (C2; carbon at position 2 of adenine) and U (O4) or between A (O2'; the 2' OH of adenosine) and U (C5). One example of this type of base pair occurs in association with a protein and indicates that U/A has a C1'-C1' distance of 6.8 Å (PDB file 1EC6, U12/A11 in C chain) and thus can also substitute isosterically for U/U, U/C, and U/G. The RNA structural search programs used at the time of the original compilation of base pairs (30) did not detect the U/A pair.
U260/A99 trans Watson-Crick/Hoogsteen edge pair. The Watson-Crick edge of U260 hydrogen bonds with the Hoogsteen edge of A99 in the trans orientation. The U at this position is very conserved in most loop E motifs because it also uses its Hoogsteen edge to form a second base pair with G (i.e., a cis Hoogsteen/sugar edge pair). In other contexts, however, C can substitute for U in trans Watson-Crick/Hoogsteen base pairs to form isosteric C/C, or nearly isosteric C/A or C/G, base pairs of this type (30).
A258/A100 trans Hoogsteen/Hoogsteen edge pair. The Hoogsteen edges of A258 and A100 interact to form a symmetric trans Hoogsteen/Hoogsteen base pair. Mutation of A100 to G still allows for hydrogen bonding with A258 by the formation of a hydrogen bond between A (N6; nitrogen at position 6 of adenine) and G (O6; oxygen at position 6 of guanine). This substitution has been observed in a composite loop E motif in the 23S rRNA of Haloarcula marismortui (PDB file 1S72, A913/G1071) with a C1'-C1' distance of 12.2 Å. Thus, the A/G base pair is nearly isosteric with A/A.
A101/U257 trans Hoogsteen/sugar edge pair. The Hoogsteen edge of A101 hydrogen bonds with the sugar edge of U257. Note that this base pair is oriented oppositely to trans Hoogsteen/sugar edge pair A261/G98, as shown by the symbols annotating these base pairs in Fig. 4. The structures of three isosteric variants, A/G, A/C, and A/A, are shown in Fig. 4. Very often, the isosteric C/U base pair variant occurs at this position, as shown for the generic sarcin/ricin loop in Fig. 3A. The structure of the C/U base pair is shown in the upper inset of Fig. 4.
C256/C102 cis Watson-Crick/Watson-Crick bifurcated pair. The C256/C102 cis Watson-Crick/Watson-Crick bifurcated pair is outside the conserved core structure of loop E and sarcin/ricin motifs. For C/C to form an edge-to-edge Watson-Crick base pair, one cytosine must be protonated at the N3 position. While this has been observed in some structures, more commonly the cytosines adopt the Watson-Crick bifurcated geometry in which the N4 amino group of one cytosine hydrogen bonds with both N3 and O2 of the other cytosine. This bifurcated geometry results in a longer C1'-C1' distance of about 10.4 Å, which is nearly isosteric to cis Watson-Crick A/U and G/C base pairs or with wobble Watson-Crick base pair U/G or C/A (30).
The tertiary-structural model of PSTVd loop E accounts for natural sequence variations and viable mutations. To test the validity of the proposed tertiary-structural model of PSTVd loop E, we first examined whether this model could account for the functional state of the reported natural sequence variations or viable mutations in loop E motifs of PSTVd and other viroid species in the family Pospiviroidae. The natural sequence variants and viable mutants are summarized in Table 1. Figure 4 illustrates the molecular structures of selected sequence variants, mapped onto the annotated non-Watson-Crick base pairing structure that forms the core of the PSTVd loop E motif. Figure 5 shows the functional status of mutants and natural variants mapped onto the respective isostericity matrices for each base pair, according to the geometric family to which it belongs (30).
|
View this table: [in a new window] |
TABLE 1. Effects of nucleotide sequence variations on the predicted structure of loop E motifs and the viability of viroidsa
|
U substitution at this position is also detected in the loop E-like motif of a natural variant of Citrus exocortis viroid (CEVd), a member of the family Pospiviroidae (51). These are operationally classified as natural variations in Table 1. Furthermore, the C259U substitution was detected in the PSTVd progeny in transgenic tobacco plants expressing the wild-type PSTVd sequences (63, 66). These are classified as spontaneous mutations in Table 1. The experimentally constructed C259G and C259A substitutions, designated engineered mutations in Table 1, retained PSTVd viability as well as the C259U spontaneous mutation in tobacco and N. benthamiana protoplasts (43). These natural variations and viable mutations can all be accounted for by the fact that U260 will form isosteric cis Hoogsteen/sugar edge pairs with A, C, G, or U at position 259 of PSTVd or at the equivalent position within loop E of CEVd. cis Hoogsteen/sugar edge base pairs can occur between immediately successive nucleotides, as in the PSTVd loop E motif, or between nucleotides separated by one or more other nucleotides. The first type comprises isosteric subgroup I1, and the second type comprises subgroup I2. At the time of the 2002 compilation of base pairs (30), the only example of a cis Hoogsteen/sugar edge U/A pair detected by the computation search programs was of type I2. The recently developed FR3D search program (Sarver et al., submitted), however, identified a type I1 U/A cis Hoogsteen/sugar edge base pair (in this case, in complex with a protein) (PDB file 1EC6, U12/A11 in C chain). The C1'-C1' distance of 6.8 Å makes this base pair isosteric with U/C (7 Å), U/G (6.7 Å), and U/U (7 Å) pairs (30). Therefore, all of these mutations could maintain the correct tertiary structure of loop E, if the corresponding bases are paired (in some cases, the pairing may occur only in the presence of another factor, such as a protein, that stabilizes this interaction). Because loop E motifs in cellular RNAs are well known to serve as important binding sites for RNAs and proteins (29), it will be of great interest to determine in future studies whether binding of a cellular protein to PSTVd loop E contributes to the stabilization of U/C and its viable variant U/U, U/G, and U/A base pairs. It should be noted that the C259G mutation in loop E was shown to inhibit in vitro 3' cleavage and ligation of a minicircle RNA derived from the CCR of PSTVd (49). Given that (i) the U/G cis Hoogsteen/sugar edge base pair is common at the equivalent position in other loop E motifs (Fig. 3 shows the generic sarcin/ricin motif) (30), (ii) the C259G mutation is not expected to alter the tertiary structure of PSTVd, and (iii) this mutation does not inhibit PSTVd replication in tobacco and N. benthamiana protoplasts (43) and in potato plants (data not shown), the reason for its effect on the in vitro processing of the minicircle RNA remains to be understood.
Zhu et al. (66) detected a U257A spontaneous mutation in transgenic tobacco expressing the wild-type PSTVd sequence. Experimentally constructed U257G and U257C substitutions retained PSTVd viability in tobacco and N. benthamiana protoplasts (43). Interestingly, a U
G substitution occurs at the equivalent position in loop E motifs of several CEVd natural variants (51, 54, 61), as well as Mexican papita viroid, Tomato planta macho viroid, and Tomato apical stunt viroid (53). Furthermore, a U
C substitution occurs in CEVd-C at this position (17). All of these observations can be accounted for by the fact that A101 can form isosteric trans Hoogsteen/sugar edge pairs with A, C, G, or U at position 257 of PSTVd and the equivalent position in the loop E motifs of all of the other viroids (Table 1; Fig. 4 and 5). All of these base pairs belong to the same isosteric subgroup, I1, of the trans Hoogsteen/sugar edge geometric family.
The A100G substitution does not affect in vitro cleavage-ligation of a minicircle RNA containing repeated sequences of the CCR of PSTVd (49). This observation was at first difficult to explain because the A/G trans Hoogsteen/Hoogsteen base pair has a C1'-C1' distance of 12.9 Å according to Leontis et al. (30), which places it in a different isosteric subfamily than an A/A trans Hoogsteen/Hoogsteen base pair having a C1'-C1' distance of 11 Å (30). However, since the publication of the original compilation of base pairs (30), we have identified other examples in higher-resolution structures for some of the base pairs reported. Thus, we have a higher-resolution example (2.4 Å versus 3.1 Å) of an A/G trans Hoogsteen/Hoogsteen base pairA913/G1071 in the 50S subunit of H. marismortui (PDB file 1S72). In this base pair, the C1'-C1' distance is 12.2 Å, which is closer to the 11 Å seen for an A/A base pair of this type. More significantly, this base pair is actually part of a loop E motif (30) and thus shows directly that A/G can substitute for A/A at positions 258/100. Whether this substitution affects the biological activities of PSTVd in vivo remains to be determined.
Outside the non-Watson-Crick base pair core of loop E, the C256/C102 cis Watson-Crick/Watson-Crick bifurcated pair also shows viable sequence variations. Owens et al. (38) reported a C256A substitution in a natural variant of PSTVd (together with a C259U substitution). Nucleotide A is also found at the equivalent position in Mexican papita viroid and Tomato apical stunt viroid (53), and A or C is found at the equivalent position in CEVd (51, 61). These variants can be accounted for by the fact that an A at this position in the motif can form a wobble base pair with C102 of PSTVd and with the C at the equivalent position in the other viroids. The C1'-C1' distance in the C/A wobble Watson-Crick pair is
10.4 Å, almost identical to the C1'-C1' distance in the bifurcated Watson-Crick C/C pair, so the A/C and C/C base pairs are nearly isosteric (30).
In conclusion, we find that all reported biologically viable sequence variations or mutations in viroid loop E motifs can be accounted for by the isosteric base pairing rules as summarized in the isostericity matrices, within the scope of the literature of which we are aware. This supports the notion that an intact loop E motif is required for PSTVd viability. Furthermore, this argues for the functional significance of intact loop E motif in other viroids in which this motif occurs. In particular, the loop E motifs of Chrysanthemum stunt viroid and PSTVd have identical sequence signatures (26).
Application of isostericity matrix analysis to design disruptive mutations in PSTVd loop E motif. To investigate the function of the PSTVd loop E motif in replication by loss-of-function genetics, we applied isostericity matrix analysis to design mutations that disrupt this motif, based on the tertiary-structural model in Fig. 3B. Changing A99 to C (mutation A99C) prevents the trans Watson-Crick/Hoogsteen interaction between positions 260 and 99 because the Watson-Crick edge of U260 cannot hydrogen bond with the Hoogsteen edge of C99 (30). This is indicated in the isostericity matrices for trans Watson-Crick/Hoogsteen base pairs by the red box for the matrix element corresponding to the U row and the C column (Fig. 5) (30). Likewise, changing A261 to C prevents the trans Hoogsteen/sugar edge base pair at positions 261/98 from forming because the Hoogsteen edge of C261 cannot hydrogen bond with the sugar edge of G98. This is indicated by the red box at the corresponding positions (row C, column G) of the isostericity matrices for trans Hoogsteen/sugar edge base pairs (Fig. 5) (30). Computer programs that calculate minimum free energy secondary structures, such as mfold, do not predict the formation of non-Watson-Crick base pairs. Therefore, mutations affecting the structures of motifs consisting of ordered arrays of non-Watson-Crick base pairs, such as loop E, are also not correctly accounted for by mfold and similar programs.
To test physically whether the A99C and A261C mutations each indeed disrupt the tertiary structure of loop E, we subjected the corresponding mutants to UV cross-linking experiments. These mutants are referred to as PSTVdIntA99C and PSTVdIntA261C, respectively. The names indicate that they are derived from the wild-type PSTVd intermediate strain (PSTVdInt) (16). Previous studies have demonstrated that UV treatment specifically cross-links bases G98 and U260 in loop E of PSTVd (3, 4) and of a minicircle RNA derived from the PSTVd CCR (49). It also cross-links the corresponding G and U bases in loop E of the human 5S rRNA from HeLa cells (4). This cross-linking occurs because of a unique partial cross-strand stack of G on U in loop E motifs that gives rise to photoreactivity. This cross-linking causes RNA mobility retardation on denaturing gels (Fig. 6). We used unit length linear in vitro transcripts as substrates. These transcripts are predicted to fold into the same rod-shaped secondary structure as the circular RNA, except for the nick between the two ends (G88 and G87; see Materials and Methods for details). As shown in Fig. 6, the A99C and A261C mutations greatly reduced the accumulation of cross-linked products. For instance, after 1 min of UV irradiation, the cross-linked product was already evident for PSTVdInt RNA but barely visible for the two mutants. After 5 min of UV irradiation, 32% of the wild-type RNA was cross-linked whereas only 6 to 7% of the mutant RNAs were cross-linked. The similar sizes of the cross-linked products in the wild-type and mutant PSTVd RNAs suggest that UV cross-linking occurred in the same region of these RNAs. Furthermore, the specificity of the G98-U260 cross-linking was demonstrated with two mutants that lack specific G98-U260 base stacking in loop E (Fig. 6). The absence of cross-linked products from GFP RNAs further demonstrates the specificity of UV cross-linking (Fig. 6). The quantitative difference indicates that a substantial portion of the PSTVdIntA99C and PSTVdIntA261C mutant RNA molecules failed to form the correct loop E structure, so as to place G98 and U260 into the necessary proximity to allow UV cross-linking, in contrast to the wild-type transcripts under the same incubation conditions. The band observed between the cross-linked product and the substrate RNA was also observed in previous experiments, and its nature remains to be determined (49).
![]() View larger version (22K): [in a new window] |
FIG. 6. Analysis of loop E formation by UV-cross-linking. Gel-purified substrate RNAs were preincubated under conditions that favor loop E formation and subjected to UV-cross-linking as described in Materials and Methods. (A) Diagram showing structural conformations of the substrate and cross-linked RNAs in their native state and in denaturing gels. G88 and G87 are the 5' and 3' ends, respectively, of the folded linear substrate RNA. The established cross-linking site (U260/G98) is marked in red. (B) Autoradiography of gel blots showing the presence or absence of UV-cross-linked products from the substrate RNAs as indicated. Efficiency of cross-linking is presented as the percentage of substrate RNAs that is converted into the cross-linked products. Note that UV treatment of the two compensatory mutants PSTVdIntA261C/G98U and PSTVdIntA99C/U260C, which are predicted by isostericity matrix analysis to restore loop E tertiary structure and showed restored replication, resulted in little cross-linked product formation. This indicates that G98/U260 base stacking, in addition to proper loop E tertiary structure, is required for specific UV cross-linking. The data from these two mutants thus provide evidence that UV cross-linking occurs specifically in loop E.
|
![]() View larger version (52K): [in a new window] |
FIG. 7. Northern blot assays showing accumulation of PSTVd RNAs in protoplasts of N. benthamiana. The rRNA served as a loading control. c-PSTVd and l-PSTVd indicate monomeric circular and monomeric linear PSTVd RNAs, respectively.
|
A99C and A261C mutations do not impair nuclear import function. Following delivery into the cytoplasm of protoplasts and leaf cells, the viroid transcripts must first enter the nucleus to be replicated. To determine whether the diminished accumulation of mutants PSTVdIntA99C and PSTVdIntA261C in the inoculated protoplasts is due to defects in nuclear import, we injected fluorescently labeled plus-strand in vitro transcripts into the cytoplasm of leaf cells. Both single mutants, like the wild type, appeared in the nucleus to visible levels under a fluorescence microscope 20 to 30 min after injection (Fig. 8; Table 2). Fluorescently labeled in vitro transcripts derived from vector sequences did not show nuclear accumulation (Fig. 8; Table 2), indicating specificity of nuclear import of the PSTVd RNAs. Therefore, the diminished accumulation of the mutants in the inoculated protoplasts can be attributed to their impaired capacity to replicate within the nucleus, rather than to defects in nuclear import. These results demonstrate that the intact tertiary structure of loop E is required for PSTVd replication in the nucleus. Furthermore, the fact that the fluorescently labeled transcripts of PSTVdIntA99C, PSTVdIntA261C, and PSTVdInt are imported into the nuclei and accumulate there, unlike the cytoplasmic accumulation of vector transcripts, indicates that A99C and A261C mutations do not make PSTVd RNA more vulnerable to degradation.
![]() View larger version (11K): [in a new window] |
FIG. 8. Nuclear import of fluorescently labeled in vitro transcripts of PSTVd. The transcripts of the wild type (Int) and the two mutants (A99C and A261C) appear in the nucleus within 30 min of injection into the cytoplasm (upper row). The vector transcripts do not show nuclear accumulation. DAPI staining (lower row) shows the position of the nucleus in each injected cell.
|
|
View this table: [in a new window] |
TABLE 2. Nuclear import of PSTVd RNAs
|
![]() View larger version (77K): [in a new window] |
FIG. 9. Circularization of loop E mutants in NbNE. Gel-purified and radiolabeled unit length linear plus-strand PSTVd transcripts were incubated in NbNE for 2 h and analyzed by urea-polyacrylamide gel electrophoresis. c-PSTVd and l-PSTVd indicate monomeric circular and monomeric linear PSTVd RNAs, respectively.
|
![]() View larger version (30K): [in a new window] |
FIG. 10. Determination of minus-strand PSTVd (A) and plus-strand PSTVd (B) RNA levels in infected N. benthamiana protoplasts by quantitative real-time RT-PCR. For details, see Materials and Methods. M, mock inoculation. To facilitate comparison, the accumulation level of the wild type (Int) was arbitrarily set to a value of 1 and the levels of the mutants are presented as relative values on a log scale.
|
|
|
|---|
We have developed a novel approach that integrates comparative sequence analysis of non-Watson-Crick base pairs with isostericity matrices, mutagenesis, and in vitro, as well as in vivo, functional assays to dissect the role of one conserved RNA motif in the rolling-circle replication of PSTVd. This systematic approach is necessary to pinpoint the process(es) affected by mutations in a noncoding and multifunctional RNA so as to determine the specific biological role of an RNA structural motif.
A tertiary-structural model for viroid loop E that can guide functional studies. Branch et al. (4) first recognized that PSTVd contains an RNA motif very similar or identical to loop E of eucaryal (or archaeal) 5S rRNA on the basis of specific UV cross-linking between the conserved U260 and G98 bases. On the basis of nuclease and chemical mapping data and comparisons with the NMR structures of Xenopus laevis (eucaryal) 5S rRNA loop E (64) and of the sarcin/ricin loop in 28S rRNA (58), Gast et al. (13) suggested a scheme of non-Watson-Crick base-pairing interactions in the PSTVd loop that is similar to these rRNA motifs. The Gast model depicted the A261/G98, A99/U260, and A100/A258 base pairs essentially as the same as in our model. However, it depicted A101/U257 as a canonical Watson-Crick base pair and did not specify the nature of the C102/C256 base pair or the possible existence of the cis Hoogsteen/sugar edge U260/C259 interactions. Our isostericity matrix analysis is based on high-resolution X-ray crystal structures of other loop E motifs and the principles of isosteric non-Watson-Crick base pairing (30). The tertiary-structural model we present here specifies the precise nature of the edge-to-edge interactions for all of the non-Watson-Crick base pairs. The model is currently well supported by several lines of biological and physical data. First, this model can explain mechanistically why certain mutations in loop E do not affect PSTVd viability (43, 49, 63, 66). Second, it can account for the naturally occurring nucleotide sequence variations in the loop E motifs of PSTVd and other viroids in the family Pospiviroidae (17, 38, 51, 53, 54, 61). Third, UV cross-linking data provide further evidence that the A99C and A261C substitutions disrupt the loop E structure by preventing the formation of isosteric non-Watson-Crick base pairs. Fourth, as discussed below, the functional analyses pinpointed the deleterious effects of these mutations on PSTVd transcription. Finally, the two compensatory mutations that are predicted to restore the loop E tertiary structure on the basis of isosteric base-pairing principles and on the basis of the X-ray crystal structures of similar base pairs also restored PSTVd replication. Further studies on the effects of other nucleotide mutations within loop E should provide additional tests of the validity of this structural model. All of these data also suggest strongly that the PSTVd loop E motif exists and functions in vivo.
PSTVd loop E plays a crucial role in RNA-templated RNA transcription. Our functional analyses of the PSTVd loop E mutations provide new insights into the role of this motif in PSTVd replication at the cellular level. Our experiments showed that the A99C and A261C mutations, which disrupt the loop E structure, inhibit accumulation of PSTVd progeny in inoculated protoplasts. Further analyses indicated that these mutations do not impair nuclear import function or circularization of the linear inoculum transcripts. Therefore, the failure of the PSTVd mutants to replicate and accumulate in inoculated protoplasts likely results from a defect(s) in one or more of the steps of rolling-circle replication. The requirement for intact loop E in replication is further supported by the observation that compensatory mutations predicted to restore non-Watson-Crick base pairs within the motif were observed to restore the replication function.
The A99C and A261C loss-of-function mutations could inhibit the transcription, cleavage, or ligation step of rolling-circle replication. Although our present data cannot rule out completely the possibility that these mutations affect cleavage-ligation, several observations suggest strongly that they mainly inhibit transcription. First, the A99C and A261C mutations severely inhibit the synthesis of minus-strand PSTVd RNA. Second, if these mutations in fact inhibited cleavage rather than transcription, one would expect to see accumulation of high levels of the multimeric plus-strand PSTVd RNAs. This was not observed in Northern blot assays. Third, if these mutations inhibited ligation, rather than transcription or cleavage, one would expect to see accumulation of high levels of the linear monomeric plus-strand PSTVd RNAs. This was not observed either in Northern blot assays. On the basis of these considerations, we propose that the loop E motif plays a crucial role in transcription, in addition to its well-documented role in cleavage-ligation (3), during PSTVd replication. The multifunctionality of loop E is discussed below.
How might loop E be involved in transcription? Loop E motifs in cellular RNAs are well known to serve as important binding sites for RNAs and proteins (29). The loop E motif of PSTVd may serve as a binding site for the RNA polymerase or for a cellular factor(s) that recruits the RNA polymerase for transcription. Alternatively, loop E may interact with a cellular factor to localize the PSTVd RNA to a particular subnuclear site to facilitate access to the nuclear transcription machinery. It is also possible that this motif is required for interacting with a cellular factor that leads to a conformational change in the viroid RNA, allowing it to be recognized by the transcription machinery. These possibilities can be tested by further experimental studies.
It should be noted that a HPI is predicted to form as a thermodynamically metastable structure through pairing of nucleotides 79CGCUUCAGG87 and nucleotides 110GCGAGGUCC102 with nucleotides 88 to 101, forming a 14-nucleotide loop as a result of HPI formation (46). Whether the mutations we generated would affect the structure and function of HPI requires further analyses. In fact, it remains to be determined whether HPI exists in vivo and has any biological functions. A putative tetraloop containing A99 was postulated by Baumstark et al. (3) to be important for in vitro processing. Whether such a tetraloop structure exists and functions in vivo and whether A99C would affect its structure and function remain to be investigated.
PSTVd loop E is a useful model to study RNA structure-function relationships of broad significance. Loop E is one of the most extensively studied RNA structural motifs (29, 31). Besides serving as a model to dissect PSTVd replication and the general mechanisms of RNA-templated replication by the nuclear transcription machinery, the PSTVd loop E motif offers an attractive system to study RNA structure-function relationships of general significance. First, the multifunctionality of this motif is biologically significant. This motif is involved in transcription (this study), processing (3), pathogenicity (42), and host adaptation (43, 63, 66). To accomplish such diverse functions, submotifs of loop E may interact with distinct cellular factors. Elucidating the underlying mechanisms will greatly expand our understanding of the capacity of a single RNA motif to regulate multiple biological processes. Such multifunctionality of a single RNA motif may be important for a wide range of RNA-based pathogens to expand their genome functions during evolution without increases in genome size.
Second, loop E is a recurrent motif found in many RNAs, including 5S, 16S, and 23S rRNAs; group I and group II introns; bacterial RNase P; ribozyme of Tobacco ringspot virus satellite RNA (reviewed in reference 32); and lysine riboswitches (18, 56), where it plays critical roles in RNA-RNA and RNA-protein interactions. While mutational analyses of cellular RNA motifs may be limited by the potential lethality of motif-disruptive mutations to an organism or by the high structural complexity of large cellular RNAs, such limitations do not apply to PSTVd, a host parasite. Therefore, further mutagenesis can be performed on PSTVd loop E and other motifs to determine their sequence signatures and gain detailed insights into RNA structure-function relationships that may not be feasible with cellular RNA. Furthermore, the recurrent nature of this motif allows broad application of the knowledge obtained.
An integrative approach to address structure-function relationships of RNA motifs. Our findings from PSTVd loop E structure and function studies demonstrate that integration of isostericity matrix analysis, rationalized mutagenesis, and systematic functional studies can provide important insights into the structure-function relationship of an RNA motif and further into the elaborate cellular control over this relationship. Isostericity matrix analysis can be used to predict base substitutions that will prevent the formation of isosteric (or nearly isosteric) base pairs, thereby disrupting the 3D structure of an RNA motif. If the structural integrity of the motif is critical for function, these mutations are likely to result in loss of function. This is well demonstrated by the replication defects of PSTVd carrying either the A99C or the A261C mutation that disrupts loop E. Isostericity matrix analysis can also predict which base substitutions can retain or restore the tertiary structure of a motif, providing a useful means to infer the tertiary structure of RNA motifs. This analysis alone, however, may not always predict a priori which isosteric or nearly isosteric base substitutions in a motif will retain or restore the function of a given RNA in the cellular environment, because the functioning of the motif requires interactions with other factors that can only be elucidated through biological assays. This is illustrated well by the results from the two compensatory mutants, PSTVdIntA261C/G98U and PSTVdIntU260C/A99C. Both mutants are predicted by isostericity matrix analysis to have restored the loop E tertiary structure. However, while the replication level of PSTVdIntA261C/G98U is close to that of the wild type, the replication level of PSTVdIntU260C/A99C is less than that of the wild type. Presumably, the loop E motif with the C259/C260/C99 base triple in the compensatory mutant PSTVdIntU260C/A99C is sufficient, but not as optimal as the motif with the C259/U260/A99 base triple as in the wild type, to be recognized by the cellular machinery for replication. These results therefore demonstrate that a combination of isostericity matrix analysis and genetic and biological experiments can provide a foundation to elucidate the elaborate RNA motif-cellular factor interactions that regulate diverse biological processes and to address the fundamental question of how sophisticated features of RNA structures have evolved to achieve optimal functions.
This work was supported by grants from the National Science Foundation (IBN-0238412 and IOB-0515745) and from the U.S. Department of Agriculture National Research Initiative Competitive Grants Program (2004-35304-15005) to B.D., by National Institutes of Health grant 3R15-GM55898 to N.L., and by National Institutes of Health grant CA97024 to K.B.L.
Present address: Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»