Previous Article | Next Article ![]()
Journal of Virology, December 2008, p. 11824-11836, Vol. 82, No. 23
0022-538X/08/$08.00+0 doi:10.1128/JVI.01078-08
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Centre for Infectious Diseases, University of Edinburgh, Summerhall, Edinburgh EH9 1QH, United Kingdom,1 Department of Biochemistry, Microbiology & Immunology, University of Ottawa, Ottawa, Ontario K1H 8M5, Canada, and Steacie Institute for Molecular Sciences, National Research Council of Canada, Ottawa, Ontario K1A 0R6, Canada,2 Department of Biological Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom3
Received 22 May 2008/ Accepted 10 September 2008
|
|
|---|
|
|
|---|
Of the many methods devised to predict RNA secondary structures, the algorithm used in MFOLD and RNAFold (48) is probably the most widely used, and it is based on an energy minimization algorithm using empirically determined values for the various base pairs that form simple stem loops (29). In a recent investigation of folding free energies of RNA sequences using MFOLD (40), we obtained evidence for extensive RNA structure formation in many families and genera of both positive-strand animal and plant RNA viruses. There was remarkable variability in the occurrence of what we termed genome-scale ordered RNA structure (GORS) in different virus genera; for example, hepatitis C virus (HCV) in the family Flaviviridae showed thermodynamic evidence for an extensive RNA structure within the polyprotein-coding region that was absent in both the related Pestivirus and Flavivirus genera. Similar genus-associated variability was observed in the Picornaviridae, the Caliciviridae, and many plant virus families. Since replication strategies usually are conserved within a family, we considered that this genus-specific characteristic was unlikely to have a role in a fundamentally conserved aspect of replication or genome encapsidation. However, the presence of GORS was invariably associated with the ability of the virus to persist in their natural hosts, raising the intriguing possibility of a role for GORS in the subversion or avoidance of innate intracellular defense mechanisms. We speculated that this may take the form of modulating or blocking defense pathways triggered by double-stranded RNA, analogous in function to the expression of structured RNA transcripts by large DNA viruses (11, 30, 37). GORS is not restricted to viruses that infect animals and is indeed widespread in many plant virus groups/genera. Again, we considered that this may be involved in shielding functions, for example, from Dicer-mediated defense pathways (6, 27).
In the work we have carried out to date, the detection of GORS was based on quantifying differences in the minimum free energy (MFE) of native sequences from the same sequences scrambled in sequence order by a variety of algorithms that preserve different organization features of the nucleotide sequences, e.g., dinucleotide biases, codon structure, protein coding, or all three (40). The aim of the current study is to apply a wider range of bioinformatic RNA structure prediction programs to substantiate our previous evidence for large-scale structural differences between different RNA viruses. A number of methods have been developed recently for the large-scale screening of human and other large genome sequences for secondary structure elements, as they often locate to functional regions (e.g., controlling the transcription and processing of coding and noncoding mRNA sequences) and therefore can contribute to their annotation. In the current study, we have used two algorithmically independent methods. RNAz uses thermodynamic predictions that are weighted by their phylogenetic conservation and the occurrence of covariant sites (13). A second method, Pfold, applies pairing rules to a reconstructed evolutionary history and stochastic context-free grammar to give a probability distribution of structures (22).
We coupled these bioinformatic approaches with an experimental analysis using an oligonucleotide probe accessibility assay and atomic force microscopy (AFM) to investigate the link between the predicted RNA structure and physical solution-phase structures of full-length genomic strands of viruses with and without GORS. Our results strongly support our previous bioinformatic analysis and provide biophysical evidence and further insights into the existence of fundamental structural differences between the RNA genomes of persistent and nonpersistent mammalian RNA viruses.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Virus and rRNA sequence alignments
|
Each sequence data set was analyzed by Pfold (22) and RNAz (10) using default settings. The automated submission of thousands of sequences to the Pfold server (http://www.daimi.au.dk/
compbio/rnafold/) was achieved using Perl scripts, and the results were similarly retrieved and analyzed. Results from Pfold were expressed as the mean frequency of bases in alignments for each virus or control that were predicted to be paired with a confidence level of 95% or greater. RNAz was locally compiled, and Perl scripts again were used to automate the submission, retrieval, and analysis of data. Due to the limitations of RNAz, which can handle a maximum of six sequences of 400 nt simultaneously, input sequences were randomly sampled in batches of six before being windowed (using the associated script rnaz-window.pl provided by the authors of RNAz) and then analyzed by RNAz. Results were expressed as the mean number of predicted stem loop structures per 1,000 bases for each set of alignments.
Hybridization accessibility assay. DNA oligonucleotide probes were designed to be complementary to viral genomic RNA sequences. A series of 21-nt oligonucleotide sequences complementary to each transcript were specified throughout the viral genomes at approximately 300-nt spacing and with G+C contents between 48 and 52% (see Table S1 in the supplemental material). Additionally, two oligonucleotide probes (PVcre-A and PVcre-B) were specifically designed to be complementary to single-stranded (loop) and double-stranded (stem) regions of the poliovirus (PV) cis-acting replication element (CRE) RNA structure, respectively.
Genome-length virus RNA was produced by in vitro transcription from full-length clones of hepatitis G virus/GB virus-C (HGV/GBV-C) (45), HCV (21), PV (4), rubella virus (RV) (42), murine norovirus type 3 (MNV3), and the L segment of bunyavirus (BV) (7). The transcription template for HCV/BV chimeric RNA was cloned by inserting the 3.5-kb NsiI fragment of pT7RiboBUNL(+) into the unique NsiI restriction site in pJFH-1_GND in the sense orientation, creating the clone pJFH-1_GND
3kbBV. Biotinylated RNA was generated from plasmid DNA templates linearized with the appropriate restriction enzyme (Table 2). The tick-borne encephalitis virus (TBEV) transcription template was generated by PCR using the Expand Long Range dNTPack kit (Roche) from pIC Hypr 3157-11167 (containing TBEV nonstructural genes and the 3'UTR) using the primers TAATACGACTCACTATAGGGATCGATAATGCTGACGTGGTGG (the T7 promoter is in boldface) and CGAGTCACACATCACCTCCTTG. RNA transcriptions (except HCV/BV chimeric RNA) were performed using the MegaScript T7/SP6 kit (Ambion) with the inclusion of biotin-11-UTP at a molar ratio of 1:4 with unlabeled UTP and incubated at 37°C for 6 h. HCV/BV chimeric RNA was synthesized using the T7 RiboMAX express kit (Promega) with the inclusion of 50 nmol biotin-11-UTP and was incubated at 37°C for 1 h. DNA template was removed by DNase I digestion, and biotinylated RNA was purified using the RNeasy mini kit (Qiagen) with elution in nuclease-free water. RNA integrity was confirmed by electrophoresis through a 1% agarose gel. The RNA concentration was determined by measuring the A260, followed by aliquoting and storage at –80°C.
|
View this table: [in a new window] |
TABLE 2. Full-length template sequences for RNA transcripts
|
AFM.
RNA transcripts were diluted in AFM imaging buffer (20 mM HEPES, 10 mM MgCl2, 3 mM NiCl2, pH 7) to 0.5 to 2 ng/µl. Prior to RNA deposition, freshly cleaved mica (grade V2; Ted Pella Incorporated, Redding, CA) was treated with 4 mM NiCl2 for 1 min at room temperature and washed twice by being soaked in 10 ml nuclease-free water for 10 and 1 min, respectively. The NiCl2-treated mica then was dried under a stream of nitrogen. Ten microliters of diluted RNA was dropped on the surface of the NiCl2-treated mica and allowed to adsorb for 5 min at room temperature. Nonadsorbed RNA was removed by two washes with 10 ml of nuclease-free water as described above, after which the sample was dried under a stream of nitrogen and imaged. Imaging was performed at room temperature (22 ± 1°C) on a PicoSPM atomic force microscope (Molecular Imaging) in AC mode using aluminum-coated silicon tips with spring constants of
40 N/m and resonance frequencies between 250 and 325 kHz. Fields of 0.5 to 3.0 µm were scanned at 1 to 1.5 Hz. Images were flattened, and approximate cluster diameters were determined using PicoScan 5.3.3 software (Molecular Imaging). Two or three independently prepared samples were imaged for each RNA transcript, and several areas were scanned for each sample.
|
|
|---|
![]() View larger version (20K): [in a new window] |
FIG. 1. Specificity of different RNA structure prediction methods. RNA structure prediction methods using thermodynamic (MFOLD; upper panels), stochastic context-free grammar (Pfold), and phylogenetic methods (RNAz; lower panels) methods were analyzed. Each data set was initiated with coding sequences of previously predicted unstructured RNA virus genomes of different G+C contents and two independently sequence-order-randomized sequences using a codon-based method that also retains dinucleotide frequencies (CDLR). For each of the three viruses, native and scrambled sequences were subjected to artificial sequence drift (x axis) to generate increasingly diverse aligned sequences that were assayed for RNA structure. For MFOLD, both MFEDs and Z scores were recorded. HAV, hepatitis A virus.
|
Other methods for RNA determination were assessed. Multiple random subsets of six sequences in each alignment were selected and analyzed by RNAz. Multiple sampling of the alignments ensured that all sequences were analyzed. For alignments containing sequences with 2.5% or less divergence, a number of false-positive structure predictions were made. However, for datasets with divergence values in the range of the native RNA virus alignments (7.7 to 38.5%) (Table 1), specificity was high, with little if any RNA structure predicted, irrespective of G+C content (apart from a low frequency of structure predicted for hepatitis A virus). In contrast, mean frequencies of base pairing predicted by Pfold showed consistent nonspecificity that increased with sequence divergence irrespective of G+C content (Fig. 1).
Detection of large-scale RNA structure in RNA virus genomes. The same methods (MFED calculation, RNAz, and Pfold) were used to screen a total of 36 virus and 14 rRNA alignments for large-scale RNA secondary structures (Fig. 2). The virus groups selected represent the principal families, genera, and groups of mammalian positive-stranded icosahedral symmetry RNA viruses for which sufficient comparative sequence data are available.
![]() View larger version (44K): [in a new window] |
FIG. 2. Prediction of RNA structure in virus sequence alignments. Shown are the RNA structure predictions in alignments of native sequences for the 36 virus and 14 rRNA alignments listed in Table 1. In both sequence sets, alignments were ordered from left to right by ascending MFEDs. With the exception of Pfold values in panel D, all axes were plotted to the same y axis scale as that used for Fig. 1. DHV, duck hepatitis virus; JEV, Japanese encephalitis virus; HAV, hepatitis A virus; BVDV, bovine viral diarrhea virus; YFV, yellow fever virus; CSFV, classical swine fiver virus; OW, Old World alphaviruses; HEV, hepatitis E virus; FCV, feline calicivirus; NW, New World alphaviruses; OG: ocean group; and FMDV, foot-and-mouth disease virus.
|
0 to <–8 (HGV/GBV-C); values were tightly correlated with MFEDs (R2 = 0.933). Although absolute values of Z scores are fragment length dependent (longer sequences show tighter distributions of MFEs) (38), the same 14 alignments with high MFEDs all showed mean Z scores below –2. rRNA sequences showed MFEDs ranging from 4.8 to 21% and mean Z scores almost invariably below –2 (Fig. 2C), reflecting the detectable order component in known highly structured RNA molecules. Each alignment also was analyzed using RNAz and Pfold (Fig. 2B). rRNA sequences contained a variable frequency of strongly predicted stem loop structures using RNAz (expressed as the number of predicted structures/1,000 bases), although it was almost invariably greater than that for control unstructured sequences (Fig. 1), and showed an increasing frequency with greater MFEDs (Fig. 2D). There was a similar good correlation between the RNAz score and MFEDs in virus alignments (R2 = 0.802; P = 2 x 10–12) (Fig. 3A), with few if any structures predicted for viruses showing MFEDs of less than 6%. Pfold found high frequencies of paired bases in each of the rRNA alignments, with values frequently much higher than those found in virus alignments (note the different y axis scales). This may reflect the use of evolutionary and structural parameters by the KH-99 model underlying Pfold originally developed using alignments of tRNA and LSU sequences (22). The correlation between MFEDs and Pfold predictions in virus alignments subsequently was less good, with what are likely to be false-positive results in the HRV-B and enterovirus species B and C alignments (both predicted to be unstructured by RNAz and UNAfold) and failures to detect RNA structures in ocean-group caliciviruses and kobuviruses, both of which are clearly highly structured by other methods. Nevertheless, there was an overall correlation between base-pairing frequencies and MFEDs (Fig. 3B) that achieved statistical significance (R2 = 0.267; P = 0.02).
![]() View larger version (14K): [in a new window] |
FIG. 3. Correlation between MFEDs for RNA genomes with structure predictions by (A) RNAz and (B) Pfold. (C) A subset of viral RNA sequences was expressed as labeled transcripts, and filter hybridization to complementary probes was recorded at two temperatures ( , 37°C; , 65°C) (see the legend to Fig. 5). The proportion of transcripts showing signal intensities of greater than 0.2 was recorded on the y axis. For the hybridization experiment carried out at 37°C, individual points represent results from the three duplicate reactions.
|
Although large-scale RNA structures were predicted for genera or groups from each of the main virus families analyzed (picornaviruses, flaviviruses, and caliciviruses) (Fig. 2A), detection was highly variable and frequently a genus-associated property (e.g., present in hepaciviruses but absent in pestiviruses and flaviviruses in the Flaviviridae). In the specific case of the Norovirus genus (Caliciviridae), however, genogroup 2 (GG2) (along with GG1, GG3, and GG4; data not shown) showed MFEDs close to zero, while MNV (GG5) showed a mean MFED of 7.6%.
Hybridization accessibility assay. The bioinformatic predictions of differences in large-scale RNA structures between viral RNAs should be detectable by physical structure determination methods. In the current study, we developed a method to quantify the binding of complementary filter-immobilized probes to transcripts in solutions of viruses (reverse hybridization) with predicted structured and unstructured genomes.
To investigate the ability of the method to differentiate between base-paired and unpaired RNA sequences, we designed probes complementary to the stem and terminal loop of the PV CRE RNA secondary structure element (9) and compared their levels of hybridization to that of a full-length labeled PV transcription in solution (Table 2, Fig. 4A). During the incubation of the probe and target at 37°C (i.e., at an approximately physiological temperature), the probe complementary to predominantly unpaired RNA of the terminal loop and flanking regions (the probe A loop) effectively captured the PV transcript and produced an intense hybridization signal (Fig. 4B), while there was barely detectable binding to the probe (B stem) hybridizing to the predominantly base-paired CRE stem.
![]() View larger version (33K): [in a new window] |
FIG. 4. Analysis of the hybridization of capture probes complementary to predominantly unpaired (PVcreAA) or paired (PVcreB) regions of the PV CRE in the reverse hybridization assay. Filters were spotted with two quantities of capture probe (50 and 5 pmol) to increase the quantitation range.
|
![]() View larger version (82K): [in a new window] |
FIG. 5. Reverse hybridization of filter-immobilized probes to biotin-labeled RNA transcripts of predicted structured (upper) and unstructured (lower) virus genomes. Representative filters are shown with 50 pmol (left) and 5 pmol (right) of each probe spotted onto the membrane. The upper left corner of each filter is the 5' end of the genome, and the lower right corner is the 3' end. Entero C, human enterovirus species C.
|
Probes for hybridization were selected from target RNA every 200 to 300 bases down the genome. Duplex regions were specifically selected to have similar G+C contents (48 to 52%) to ensure equal binding strengths irrespective of the overall base composition of the transcript. Filter hybridization intensities of different transcripts were normalized between experiments relative to standard amounts of serially diluted RNA transcripts, where a hybridization intensity of 1.0 corresponds to the spot intensity of 2.5 pg biotinylated RNA transcript following chemiluminescent detection. Where the hybridizations to 50 pmol oligonucleotide probe gave a signal above the linear range of detection, 10 times the signal from hybridization to 5 pmol oligonucleotide probe was used. All filter hybridizations were carried out in triplicate; hybridization intensities, although variable between probes, were highly reproducible between replicates (the mean standard errors for the seven transcripts were within 35% of the mean value; data not shown).
Hybridization data of different transcripts was compared by recording the frequency of probe hybridizations showing signal intensities above 0.2. These ranged from 23% for HGV/GBV-C (i.e., less than a quarter of the probes bound strongly to the target transcript) to 100% (BV L segment) (Fig. 5). For the predicted structured viruses HGV/GBV-C, HCV, and MNV, there was great variability between probes in hybridization intensity, with most failing to hybridize to target sequences in the transcript, while a minority showed very high signal intensities. In the predicted unstructured transcripts, however, hybridization intensities were much more uniform, with the majority of probes hybridizing efficiently to their targets.
Low or absent binding of transcripts to probes indicates that their target sequences are inaccessible to hybridization. To confirm that internal base pairing in the transcript prevented hybridization, transcripts of HGV/GBV-C, HCV, and MNV (all structured) and PV and BV (unstructured) were hybridized to their complementary probes on filters at 65°C. This intermediate temperature was selected to disrupt the transcript secondary structure but to retain the hybridization of target RNA to the oligonucleotide probes (expected melting temperature,
70°C). Hybridization at the elevated temperature had no effect on probe binding to PV and BV transcripts, whereas the frequency of probes hybridized to structured transcripts actually was substantially increased (e.g., the MNV transcript at 65°C compared to that at 37°C) (Fig. 5). Similar results were obtained for the other transcripts (Fig. 3C); for example, 70% of probes hybridized to the predicted structured transcript (HGV/GBV-C) at 65°C, whereas <25% hybridized at 37°C.
The mean probe hybridization frequencies for the seven transcripts showed a strong correlation with MFEDs but not with MFEs or other composition variables (R2 = 0.964; P = 8 x 10–5) (Fig. 3C). Probe accessibility was similarly associated with results from other RNA structure prediction methods (RNAz and Pfold) (Table 3), although R2 and P values were lower. Remarkably, there was no correlation between hybridization accessibility and either the G+C content or MFE, although the latter two variables were strongly correlated with each other (R2 = 0.858; P = 0.028). Indeed, the higher values of MFE predicted by MFOLD and other energy-minimizing algorithms that originate from the greater frequencies of base pairing and thermodynamic stability of RNA pairing predicted for G+C-rich genomes were not factors in determining hybridization accessibility. In our analysis, this was almost entirely dependent on the sequence order; the R2 value of 0.964 with the MFED suggests that almost all the observed variability in hybridization is accounted for by sequence order-dependent RNA secondary structure.
|
View this table: [in a new window] |
TABLE 3. Correlation between probe hybridization, RNA structure prediction, and composition variablesa
|
![]() View larger version (57K): [in a new window] |
FIG. 6. (A) Strategy for the construction of the HCV (structured RNA)/BV (unstructured RNA) chimera using NsiI restriction sites. (B) Hybridization of BV (upper) and HCV (lower) probes to native BV and HCV transcripts (hybridization controls are at the left) and to transcripts of the chimera (right). The excised parts of the BV probe/chimera filters contain probes that are absent from the chimera sequence.
|
![]() View larger version (63K): [in a new window] |
FIG. 7. AFM analysis of RNA transcripts of (A) predicted structured viral RNA genomes (HGV/GBV-C and HCV) and (B) predicted unstructured viral RNA genomes (RV and PV). All scale bars are 200 nm, and the Z scale ranges from 0 to 3 nm.
|
/3 x RG3, where RG is the radius of gyration) (17). In contrast, both PV and RV RNA transcripts appeared as pleomorphic globular clusters with frequently observed protrusions of what appeared to be single-stranded RNA often several hundred nanometers in length (Fig. 7B). Transcripts of both viruses were substantially more spread out on the mica solid phase than the relatively compact HCV and HGV/GBV-C transcripts, and they showed a lower mean height (z axis measurements of 2.5 ± 0.9 nm [n = 10] and 2.6 ± 1.1 nm [n = 10], respectively). |
|
|---|
RNAz determines RNA secondary-structure conservation based on computing a consensus secondary structure and a measure for thermodynamic stability normalized with respect to sequence length and base composition, enabling the identification of RNA secondary structures with high sensitivity and specificity (10). The specificity of the method was demonstrated by an analysis of our control sequences with differing degrees of divergence (Fig. 1), producing results consistent with those of MFED determination.
Pfold is a nonthermodynamic prediction method in which pairing rules and the assessment of the probability of specific pairings were determined using a reconstructed evolutionary history and stochastic context-free grammar to give a prior probability distribution of structures (22). Pfold, however, showed a degree of nonspecificity, with relatively high frequencies of predicted conserved pairings (at >95% probabilities) in all three control datasets, with increasing frequencies with greater sequence divergence. Even taking this nonspecificity into account, Pfold structure prediction using viral datasets (Fig. 2) showed frequent discrepancies (likely false-positive and -negative results in a variety of virus alignments, e.g., those for HRV-B, enterovirus B, and kobuviruses). However, there was still a broad overall agreement between all three methods with which RNA viruses were structured and unstructured.
RNA structure prediction and hybridization. The hybridization accessibility assay provides a convenient method to probe RNA structure formation in RNA transcripts in solution. The effectiveness of the method in identifying structured and unstructured RNA was demonstrated by the marked difference in hybridization signal between probes complementary to naturally paired and unpaired regions in the PV CRE (Fig. 4).
Intriguingly, it was only the sequence order-dependent component of RNA secondary structure (identified by MFE differences between native and scrambled sequences and by RNAz and Pfold predictions) that correlated with hybridization accessibility results (Fig. 3, Table 3). Particularly informative was the hybridization data for the RV transcript, whose high G+C content (71%) was associated with a very high MFE (–112 kcal/mol for the 300-base fragments) but which appeared virtually unstructured in the hybridization accessibility assay, which correlates with its low MFED. In contrast, the three predicted structured virus transcripts (HGV/GBV-C, HCV, and MNV; MFEDs of 7.5 to 12.5%) with consistently lower MFEs (–95 to –112 kcal/mol) were inaccessible to hybridization with the majority of probes.
The difference between MFEs and MFEDs in their ability to predict the RNA secondary structure in solution may arise through qualitative differences in the nature of the predicted pairings. For example, the type of pairings predicted in unstructured genomes or randomized sequences may be one (metastable) configuration out of many similarly energetically favored structures; dynamic transitions between structures may enable greater probe accessibility than is possible with sequence order-dependent structures identified by high MFEDs and other bioinformatic methods. In contrast, the evolutionary process that created functional structures such as the PV CRE and presumably the large-scale RNA structure in persistent viruses may have created much more stable pairings that shield viral RNA from external hybridization. A numerical analysis of the predicted pairings in structured and unstructured RNA, such as the frequencies of different duplex lengths and pairing distances, would be of value in the future in resolving this issue.
AFM. The extremely high resolution possible with this method provides the means to directly visualize RNA at the molecular level (1, 12, 23, 31, 35). Although previously used to investigate specific interactions between RNA molecules (such as virus genome circularization [1] or kissing loop interactions [12]), the method also has been used for the visualization of the larger-scale shape of RNA molecules (23) and may thus allow the visualization of the effect of the large-scale secondary structure on the configuration of RNA virus genomes (Fig. 7). In the current study, we indeed found that HCV and HGV/GBV-C RNA transcripts adopted a tightly packed condensed state that was largely maintained during the deposition process (Fig. 7A and insets), in contrast to the irregular, pleomorphic appearance of the PV and RV RNA transcripts and protruding strands of single-stranded RNA (Fig. 7B). The latter appearance of predicted unstructured virus RNAs closely matched that of virion RNA released from PV nucleocapsids after proteolytic digestion in a previous study (23), where the initially compacted genomic RNA rapidly unraveled to leave long strands of unpaired single-stranded RNA visible by AFM. The inclusion of RV was particularly informative; RNA secondary structure formation in its genomic transcript was the most energetically favored of those analyzed (a consequence of its high G+C content), yet both AFM and hybridization accessibility assays predicted an overall open, likely metastable (see the previous section) physical configuration resembling those of other viruses we analyzed with low MFEDs.
Although not the purpose of the investigation, the observed and unexplained differences between PV and several plant viruses in their appearance after capsid dissolution did indeed correlate with their degree of predicted RNA secondary structure. Thus, the predicted unstructured genome of turnip yellow mosaic virus (MFED, 3.4%) (40) resembled PV in rapidly unraveling to single-stranded RNA, while the predicted structured brome mosaic virus, satellite tobacco mosaic virus, and tobacco mosaic virus (all with MFEDs of >9%) (40 and unpublished observations) showed various degrees of compaction and retention of globular shape despite prolonged incubation times in solution, which is comparable to the appearance of HCV and HGV/GBV-C in the current study.
Despite the high resolution of RNA structures possible by AFM, the method does introduce artifacts associated with the deposition process, most notably the flattened appearance of the transcripts (e.g., the measured x and y diameters of HCV and HGV/GBV-C RNA transcripts were approximately 15 times greater than their height) (12, 35). This is caused by the partial dissolution of the condensed state due to interactions with the surface divalent metal ions that become energetically favorable as the buffer is removed. Nonetheless, AFM imaging reveals differences in the overall structures between the RNA transcripts with predicted structured and unstructured genomes that substantiate the bioinformatic and probe hybridization data. Importantly, the estimated volumes of the HCV and HGV/GBV-C transcripts we visualized agree closely with the theoretical size of a compacted RNA calculated by Flory's Law, despite the likely shape distortions arising from the AFM method.
Virus persistence. The analysis of the expanded data set of mammalian RNA sequences confirmed and extended the previously described association with host persistence (40). All viruses with MFEDs below 6% are naturally nonpersistent in the natural, immunocompetent host, while all of those above this threshold establish persistent infections (where known). This distinction is particularly clear in the families Flaviviridae, Caliciviridae, and Picornaviridae, which comprise clearly separate structured and unstructured categories in the various genera and groups. Major differences in predicted structures were observed between similar viruses; for example, MNV and (human) norovirus GG2 sequences show substantial homology, and coding regions can be readily aligned (20, 41). In this specific example, it is perhaps revealing that human norovirus infections are acute and rapidly cleared (usually in less than 24 h), whereas infections with the predicted structured MNV are persistent and nonpathogenic in immunocompetent mice. (15).
At present, it is difficult to conceptualize what mechanism underlies the association between large-scale RNA structure and host persistence. The RNA secondary structure, calculated as the probability of the internal base pairing of individual bases in an RNA molecule, has been shown to be an important predictive factor for short interfering RNA (siRNA) targeting (24, 25). The formation of GORS and the consequent reduction in the access of genomic RNA to hybridization by guide RNAs or cleavage by Dicer may represent an evasion strategy for viruses infecting organisms that use siRNA for antiviral defense. Intriguingly, the majority of positive-stranded plant viruses show evidence of large-scale RNA structures equivalent in extent to those detected in mammalian RNA viruses (40). It is tempting to imagine that similarly extensive RNA structures and the inaccessibility to hybridization in mammalian RNA viruses plays a similar role in host defense evasion, particularly as the local RNA secondary structure has been shown to similarly prevent access to guide RNAs on human RNA-induced silencing complex and to protect RNA from degradation (2). However, although antiviral resistance mediated by RNA-induced silencing complexes can be induced experimentally in mammalian cells, it is still unclear whether RNA interference (RNAi) actually is used as an antiviral defense pathway at the whole-organism level. Although many DNA viruses were demonstrated to encode and express viral microRNAs (miRNAs) that usurp the host cell miRNA pathway to promote viral replication, none were predicted or experimentally detected in cells infected with HCV or yellow fever virus (32). In further contrast to the established plant and insect siRNA/virus paradigm, the miRNA miR-122 that is expressed abundantly in human liver actually is required for efficient HCV replication, an interaction mediated through the binding of the miRNA to a single-stranded region of the virus 5'UTR (19). In vertebrates, the principal role of RNAi seems to be that of a posttranscriptional regulator of gene expression mediated by endogenously encoded miRNAs.
For mammalian cells, the interferon pathway is the principal antiviral defense mechanism driving several antiviral effector pathways such as protein kinase R and 2'-5'oligoadenylate synthetase, as well as activating the acquired immune system (8, 34). As demonstrated for RNAi interactions, it is possible that the physical inaccessibility of structured RNA we found in the current study hinders these effector functions or, alternatively, prevents the recognition of incoming or replicating RNA by retinoic acid-inducible gene I (RIG-I), melanoma differentiation-associated gene 5 (MDA5), and toll-like receptor 3 (TLR3) (3, 47). We have very recently obtained preliminary evidence that large-scale RNA structure sequesters the 5' triphosphate moiety of genomic RNA (R. Blundell, D. J. Evans, and P. Simmonds, unpublished data). Since this is one of the major targets for recognition by RIG-1 (14, 33), GORS may prevent or delay interferon induction during initial infection or replication. While much remains to be done in documenting this phenomenon and other possible consequences of GORS, such as RNA stability within cells, the finding in the current study that a large-scale structure makes RNA genomic transcripts almost entirely inaccessible to external hybridization is likely to be an important, evolutionarily developed attribute of RNA viruses that profoundly influences their interactions within the mammalian cell.
The work was supported by a project grant from the Wellcome Trust.
Published ahead of print on 17 September 2008. ![]()
Supplemental material for this article may be found at http://jvi.asm.org/. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»