Advances in sequencing technology have made abundant RNA sequence information available, but the challenge of how to interpret these data remains. The RNA sequence contains many layers of information. RNA sequences code for proteins and small RNAs, such as microRNAs or transacting small interfering RNAs (siRNAs). RNA encodes information about both structure and function. Viral RNA structures, such as riboswitches, internal ribosome entry sites (75), and panhandles (71), regulate the stages of the viral life cycle, including replication (100), transcription (99), splicing (40, 48), aminoacylation (31, 55), translation (12, 75, 98), and encapsidation (27, 60, 71, 84). Because viral RNAs are structurally dynamic, current prediction methods focusing on a single minimum free-energy structure may not always identify functionally relevant structures without additional experimental restraints. Because RNA structure determination is often experimentally difficult despite tremendous advances in RNA crystallography, nuclear magnetic resonance spectroscopy, and chemical modification, RNA structure prediction is an important tool for generating hypotheses about structure-function relationships in RNA. RNA structure prediction can be useful for interpreting or designing mutagenesis experiments, identifying conserved structural features, and designing siRNA strategies. This review will briefly outline the basic ideas and assumptions underlying RNA structure prediction, compare different approaches to RNA structure prediction from a user's perspective, and discuss some applications of RNA structure prediction to viral RNA interference (RNAi) research.
The RNA sequence, or primary structure, determines the secondary structure, or pattern of canonical Watson-Crick pairs forming duplexes and irregular regions, such as loops and single-stranded regions. The secondary structure then determines the tertiary structure or overall three-dimensional shape of the molecule. The tertiary structure contains the interaction sites for proteins, other RNA molecules, carbohydrates, or other small molecules and thus determines the quaternary or higher-order structure. The greater thermodynamic stability of RNA helices than that of tertiary interactions makes the RNA folding process hierarchical (35). For example, the favorable free energy involved in forming two stacked base pairs (0.4 to 0.9 kcal/mol per nucleotide) (105) is larger than the favorable free energy to form tertiary interactions, such as ribose zippers or tetraloop-receptor interactions (0.1 to 0.5 kcal/mol per nucleotide) (25, 97). Note that 1.4 kcal/mol is equivalent to 1 order of magnitude in a binding constant at 37°C. The hierarchical nature of RNA folding makes RNA structure prediction a tractable challenge (92).
WHAT IS RNA STRUCTURE PREDICTION?
Phylogenetic analysis of RNA sequences remains the gold standard of RNA structure prediction. Phylogenetic analysis involves generating an alignment of many RNA sequences and looking for patterns of covariation between two nucleotides. Two nucleotides would covary if, for example, nucleotide 1 was usually a G but sometimes a U, if nucleotide 2 was a C whenever nucleotide 1 was a G, and if nucleotide 2 was an A whenever nucleotide 1 was a U. This pattern of variation maintains the formation of an isosteric Watson-Crick base pair. Phylogenetic analysis requires a good alignment of many RNA sequences over a diverse range of species with enough variation to observe covariation patterns but also enough conservation to establish a good alignment. When regions of 100% conserved nucleotides are shown as large loops in secondary structure diagrams generated by phylogenetic analysis, no information is known about the secondary structure. This does not mean that the nucleotides must necessarily be single stranded. Because RNA-protein interactions and the RNA tertiary structure are also often conserved in evolution, phylogenetic analysis indirectly accounts for such tertiary and quaternary structures. However, sometimes only one or a few sequences are available or the sequences show too little or too much variation for a good alignment. In these cases, computational approaches can generate possible RNA secondary structures.
Computational approaches typically use a recursive algorithm to evaluate possible base pairs for a given RNA sequence. There are 1.8N possible structures for any given sequence where N is the number of nucleotides; thus, there are 3.37 × 1025 possible structures for a 100-mer RNA. This is far more structures than even the fastest computer today could ever calculate in anyone's lifetime. All computational approaches make some approximations or simplifications in order to reduce the complexity of the RNA folding problem. In a stochastic context-free grammar (SCFG), the most common approximation in RNA structure prediction programs, nucleotide pairings are evaluated as one of four possibilities: paired with each other, two cases where one nucleotide is paired but the other nucleotide is unpaired, or both nucleotides paired but not to each other (Fig. 1A to D) (29, 72). A recursive algorithm systematically evaluates each possible pair of nucleotides. This approach makes computation of RNA secondary structures practical and reasonable but excludes the possibility of nonnested pairing interactions, such as pseudoknots, kissing hairpins, and base triples (Fig. 1E). All prediction program listed in Tables 1 and 2 are capable of predicting the two hairpins (Fig. 1D) which are compatible with the formation of nonnested pairing interactions and tertiary structures. In addition, several programs specifically address the computational challenges of nonnested pairing and the limitations of SCFG grammars.
Four possible relationships between nucleotides i and j. Dots represent nucleotides i < k < j. Lines represent sequential covalent connections. Dashed lines indicate hydrogen-bonded base pairing. (A) i and j are paired. (B) i is unpaired and j is paired. (C) i is paired and j is unpaired. (D) Both i and j are paired but not to each other (29). (E) A nested base pair, such as a base pair between nucleotides i and k in panel E, forms a helical stem. Nonnested interactions, such as a base pair between j+1 and k (a base triple), a base pair between k+1 and i+6 (a pseudoknot), or a base pair between i+5 and k+7 (a kissing hairpin interaction), are not allowed in an SCFG and require a higher-level grammar, more memory, and more run time. A base pair between nucleotides j+2 and i−1 would create a multibranch loop. Consideration of possible multibranch loops requires an additional array. Vienna Websuite and RNAstructure calculate the stabilities of multibranch loops differently. Three-dimensional motifs, such as ribose zippers and tetraloop-receptor interactions, are built from many pairing interactions.
Characteristics of programsa
Websites and key references for programs
The next step is to rank possible pairs and structures and store this information in what is often called a “fill step.” Free energy is a common ranking system. The free energies of RNA base pair stacks can be measured experimentally. The free energy of a stack of two base pairs accounts for both the hydrogen bonds in Watson-Crick pairs and the interactions between stacked bases, which include electrostatic interactions, dipole-dipole interactions, van der Wals forces, and hydrophobic effects. This smallest unit of an RNA double helix is referred to as a nearest neighbor (7, 32, 93, 105). Expanded nearest neighbors describe irregular or single-stranded regions; for example, the nucleotides in a hairpin loop and the nucleotides in the closing base pair and all the effects of changes in the phosphodiester backbone as the RNA strand makes this hairpin turn would be grouped together as the smallest unit of which to measure the loop free energy. The nearest-neighbor approximation assumes that the free energy of each smallest unit or motif, for example, a stack of base pairs or a hairpin loop, is the same in any secondary structure context. For example, a GNRA tetraloop hairpin would have the same energetic stability in a group I intron, 23S rRNA, or the human immunodeficiency virus (HIV) genome. This approximation is most often reasonable, although non-nearest-neighbor effects have been observed for some noncanonical pairs, especially GU pairs, and single nucleotide bulges (43, 52, 85, 86). This approximation allows the free energy of a secondary structure to be calculated by adding together the energetic stabilities of each smallest unit or motif.
The Turner laboratory members and associates have made extensive free-energy measurements of RNA motifs (32, 49, 66, 67), and these thermodynamic parameters form the core of many RNA structure prediction programs. In order to experimentally measure the free energies of RNA motifs, the UV absorbance of small synthetic oligonucleotide duplexes (typically 8 to 12 nucleotides) is recorded as a function of temperature in an optical melting experiment. One assumption of these experiments is that the RNA exists in only two states, duplex or single stranded; thus, intermediates occur very quickly and do not contribute significantly to the UV absorbance. This assumption often does not hold true for complex motifs, such as large loops, multibranch loops, or pseudoknots, and complicates the thermodynamic analysis of these motifs (17, 33, 65). Differential scanning calorimetry (8) or single-molecule force-extension experiments (57) can measure RNA free energies without using a two-state assumption. The free-energy measurements of hairpin, internal, and bulge loops enable predictions for unmeasured loop sequences, but not every possible sequence of every motif has been measured yet. Loop sequences with surprisingly stable energies are still being discovered. Thus, improvements in the collection of RNA free-energy parameters continue as an active area of research.
Free-energy minimization approaches to RNA structure prediction assume that the lowest free-energy structure will form. Often, this assumption works well; for example, the base pairing in tRNA and group II introns are correctly predicted 83% and 88% of the time, respectively, by using free-energy minimization and the Zuker algorithm (67). However, the lowest free-energy structure may not always be the functional structure. The conformation of the RNA may be determined by folding kinetics rather than thermodynamic free energy (11, 106). Also, no RNA structure prediction program yet incorporates RNA-protein interactions, which can dramatically alter the folding landscape. Many prediction programs also do not consider pseudoknots or other stabilizing RNA tertiary interactions. In addition, a very large number of structures may exist within a small range defined by the error of the free-energy calculation, and sometimes those structures can be very different from one another. This becomes more problematic as the lengths of the RNAs increase. For example, when tRNA (76 nucleotides) and satellite tobacco mosaic virus RNA (1,058 nucleotides) are folded with the Wuchty algorithm in the Vienna suite, which calculates all possible structures within a small energy range, tRNA can form 13 different structures within a 1-kcal/mol range, and the structure most different from the minimum energy structure has 23 nucleotides paired differently. In contrast, satellite tobacco mosaic virus can form 42,768 structures within 1 kcal/mol; the structure most different from the minimum free-energy structure has 560 nucleotides paired differently but is only 0.2 kcal/mol greater in free energy, a free-energy difference well within error of the calculation (see the supplemental material). Additional experimental data that further defines possible RNA conformations can address this problem in RNA structure prediction and becomes even more important for longer RNA sequences.
Free-energy minimization is only one approach to analyzing the array of ranked possible pairs and structures for a given RNA sequence. Many different prediction programs use different algorithms to analyze or “trace back” through this array to generate not only a minimum free-energy structure but also a set of low-energy structures. In addition, experimental data such as chemical modification or phylogenetic covariation can be used in addition to free-energy minimization. Alternatively, possible pairs can be ranked using criteria other than free energy, such as the maximum number of base pairs, or the base pairing can be described using concepts other than SCFGs, such as nucleotide cyclic motifs.
Different prediction programs have developed different strategies for addressing the limitations of RNA structure prediction by free-energy minimization alone. Because even slight differences in calculation methods or the set of thermodynamic parameters can alter the minimum free-energy prediction, comparison of the results from more than one program is a good approach to generating an informed hypothesis about RNA structure and function. Table 1 lists some of these programs and their characteristics, such as access, options for algorithms, and options for including experimental data. Refer to each prediction program's website or primary citation for a complete description of user options and an evaluation of performance. The evaluation criteria and databases of known secondary structures used to evaluate prediction accuracy vary substantially between different research groups and make direct comparisons complex. Results of comparative tests of different RNA structure prediction programs have been reviewed elsewhere (26, 42, 78). Prediction programs aim to predict structures based on a set of rules and assumptions rather than optimize for particular types of structures, although several programs are designed to address the computational difficulties associated with predicting pseudoknotted, nonnested pairing (Fig. 1E). This list is not exhaustive, and the following discussion only highlights some practical considerations for selecting an RNA structure prediction program. The “best” program is the user's decision that depends on the RNA studied, the questions asked, the available experimental data and resources, and the intended applications of the structure prediction.
APPROACHES TO RNA STRUCTURE PREDICTION
mfold and UNAFold.The popular mfold program has been expanded, updated, and renamed UNAFold (64). The Quikfold option in DINAmelt, the online version of the program, contains the traditional format where a user submits a sequence and chooses RNA or DNA folding, temperature, salt conditions, and a free-energy range for suboptimal structures. UNAFold and mfold use the Zuker algorithms (108, 109) to compute the minimum free-energy structure for a given sequence and systematically sample structures within a percentage of free-energy range to create a set of diverse suboptimal structures. Users can specify nucleotides to be single stranded or in a particular base pair if information from chemical modification, enzymatic probing, compensatory mutations, or phylogenetic covariation is available. The options for hybridization analysis can be applied to the design of siRNA or antisense RNA.
RNAstructure.RNAstructure 4.6 is a Windows implementation of the Zuker algorithm and includes additional options for other folding algorithms and incorporation of experimental data. The authors of RNAstructure collaborate very closely with the Turner laboratory and keep the most up-to-date thermodynamic parameters (66). The OligoWalk program can be used for siRNA design (61). Two unique ways of incorporating experimental data in the RNA folding is done with Dynalign (42, 68) and chemical modification (15, 66). The Dynalign program computes the lowest free-energy sequence alignment and secondary structure common to two RNA sequences. The two sequences need not be aligned, but the maximum number of sequences to be folded at one time is two. RNAstructure also uses a more-advanced definition of chemical modification restraints for traditional chemical modification reagents and new selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) data (70). Chemical modification reagents, such as dimethyl sulfate, methylate accessible adenine nucleotides and cause stops in reverse transcription primer extension reactions. This type of chemical modification occurs not only at single-stranded adenines but also at adenines in AU pairs that are at the end of a helix, next to a GU pair, or next to bulged nucleotides; and RNAstructure allows for all these possibilities when a nucleotide is marked as chemically modified (66). SHAPE detects flexibility in the RNA backbone that correlates with helical or irregular motifs in secondary structure formation, and these data can be included as a restraint or as a pseudofree energy in ranking possible base pairs (15).
Vienna RNA Websuite.The Vienna RNA Websuite introduced the Wuchty algorithm, developed applications of the McCaskill algorithm, and also offers a wide variety of algorithms and functions. The Wuchty algorithm computes all possible secondary structures within a narrow free-energy range (104). The Wuchty algorithm generates a small but complete set of suboptimal structures that may include some very different secondary structures but also very many highly similar structures. However, structures containing more than one suboptimal region may occur in the Wuchty set of structures but would be absent if the Zuker method for sampling suboptimal structures were used. The McCaskill algorithm evaluates the statistical probabilities of base pairs and secondary structures by using a partition function calculation (69). “Statistical” refers to the thermodynamic and statistical mechanics definition based on a Boltzmann distribution. This information can be overlaid in different colors on the lowest-energy structure and thus provides an estimate of confidence in the predicted structure (Fig. 2). For example, one region of a secondary structure (red) may be exceptionally thermodynamically stable, occur frequently in all suboptimal structures, have a high statistical probability, and thus be more likely to be a correct prediction. In contrast, another region of the secondary structure (purple or dark blue at the left end of the scale near zero probability) may have multiple thermodynamically equivalent possible conformations in the suboptimal structures and a low statistical probability and thus be less clearly a correct prediction.
The minimum free-energy secondary structure of tRNAala from Shigella sonnei with the base pair probabilities shown in color. The red pairs have a high probability of forming, green pairs have a medium probability of forming, and blue pairs (none shown) have a low probability of forming. The normalized scale showing zero (dark blue) to one (red) probability is shown on the lower right. Figure produced using Vienna Websuite with default parameters.
The Vienna Websuite includes RNAalifold and RNAz, programs that combine phylogenetic covariation information with thermodynamic stability and base pair statistical probabilities when ranking possible base pairs. RNAalifold accepts any number of aligned sequences in the standard ClustalW and FASTA formats and generates a lowest-energy consensus structure by weighting the ranking of base pairs with covariation data (6). RNAz searches for noncoding RNAs (101) or conserved secondary structure elements within long RNA sequences, such as viral genomes (45, 46), by weighting base pair statistical probabilities and thermodynamic stabilities with covariation data in a combined Z score for possible base pairs. The RNAup and RNAxs functions are used to design siRNA strategies.
Sfold.The Sfold algorithm uses a unique algorithm to aid in the design of siRNA. The algorithm combines thermodynamic stabilities (67), calculations of target accessibility (59, 88), and empirical rules for efficient siRNA developed by the Zamore, Amgen, and Dharmacon groups (3, 51, 79, 87, 94). The website offers specialized programs for the design of siRNA, antisense RNA, trans-cleaving RNA, and mRNA-microRNA interactions as well as a general program for statistically sampling suboptimal RNA structures. The algorithm uses a partition function calculation and then groups suboptimal structures by similarity (9, 19, 20). The centroid structure is the most-representative structure that is closest in similarity to all the other structures. If the centroid structure is different from the minimum free-energy structure, the centroid structure is often closer to the phylogenetic prediction and contains fewer base pairs, or fewer false-positive base pair predictions, than the minimum free-energy prediction. The point is to show a structure that represents a group of structures rather than a single predicted structure. Many long RNA sequences, such as viral genomes or mRNA, may not have a single structure but instead have a dynamic structure that has some conserved features but also varies and changes, and these many conformations may all exist simultaneously in the cell. Thus, the centroid structure may better describe the overall average prediction for all these conformations and may better show whether a target site is accessible. Sfold also outputs a probability profile of nucleotide accessibility, loop profiles, and an estimate of siRNA potency.
PROGRAMS FOR PREDICTING SECONDARY STRUCTURES WITH PSEUDOKNOTS
RNA STAR.The problem of predicting pseudoknots (Fig. 1) has inspired a wide variety of solutions and adaptations of existing algorithms. RNA STAR offers three algorithms that simulate the folding of RNA and include pseudoknot structures. The greedy algorithm, based on the idea that maximizing each step of the calculation will produce the optimal result, adds the most stable stem helix onto a growing structure (1). A Monte Carlo algorithm generates a set of random structures and then selects a probability-weighted stem helix to add onto a growing structure (39). The genetic algorithm simulates an evolution of RNA folding by starting with unfolded structures and increasing the fitness criterion, low free energy, in each generation (41, 96). The genetic algorithm allows stem helices to be added or removed and allows “crossovers,” structures containing the best pieces of previous generations. Because these programs are simulations rather than calculations, the output should also be evaluated on how well the program converges to a single structure and the reproducibility of the final structure. The user chooses the number of rounds in the simulation.
PKNOTS and pknotsRG.PKNOTS uses free-energy minimization and a higher-level grammar that removes the “context-free” restrictions shown in Fig. 1 (80, 81). The computational cost of allowing pseudoknots [O(N6) in time and O(N4) in space], however, makes the program computationally unreasonable for sequences greater than approximately 100 nucleotides. The program pknotsRG restricts the types of possible pseudoknots to three common simple types, with at most two helices, and does not allow complex interactions, such as kissing hairpins or triple helices. This approximation improves the run time [O(N4) in time and O(N2) in space] so that sequences of up to 800 nucleotides are computationally reasonable (77). The O(Nx) function describes the computational complexity and program run time, depending on N, the number of nucleotides in the sequence. The actual time necessary for the computation will also depend on the type of computer used and the processor speed.
NUPACK.The NUPACK program provides probabilities of pseudoknotted base pair formation (22). These base pair probabilities can predict free energies of pseudoknot stabilities that match experimental values (21). Base pair probabilities are useful for analyses of how pseudoknots may be in equilibrium between two conformations and how mutations can shift that equilibrium. Optical melting experiments and single-molecule pulling experiments can provide measurements of pseudoknot interaction energies (for examples, see references 10, 34, and 76).
ILM.The ILM program uses an iterative loop matching algorithm to maximize base pairs and allows pseudoknots to form by allowing base pairs to be added or removed in successive rounds (83). The Nussinov algorithm, or maximum loop matching algorithm, is the basic framework for generating a structure with the most possible base pairs (73). The base pairs are ranked using both thermodynamic parameters and covariation data for aligned sequences. ILM requires the RnaViz program (16) to visualize the RNA secondary structure with pseudoknots.
HotKnots.The HotKnots program takes advantage of the hierarchical nature of RNA secondary and tertiary structure folding (47, 78). A hot spot is a particularly stable branch point in a tree of possible structures. The program generates a tree of structures using the Zuker algorithm and restrains pseudoknot “hot spots” to be single stranded and then allows structures with pseudoknots to form. The HotKnots program is a heuristic that does not guarantee the lowest energy structure but quickly predicts many reasonably good structures.
ConStruct.The program ConStruct simultaneously aligns multiple RNA sequences and finds a consensus RNA structure, thus combining information on thermodynamic stability and phylogeny (103). Because sequence alignment often requires manual editing, ConStruct offers an interactive user interface that allows manual editing of the alignment. Thus, the alignment can be improved with preliminary information about secondary structure. A new additional step then maximizes base pairing and thus allows for pseudoknot, base triple, and other tertiary interactions.
NONTHERMODYNAMIC APPROACHES
Pfold and CONTRAfold.Not all folding programs use lowest free energy as the criteria for forming base pairs in an RNA secondary structure. Pfold uses a sequence alignment and a set of base pair probabilities determined from a set of known tRNA and 23S rRNA secondary structures to predict RNA secondary structure (53). CONTRAfold (24) uses a conditional training method and a database of 151 consensus structures from known secondary structures in the Rfam database (36, 37) to generate parameters for RNA folding. The parameters for base stacking interactions are surprisingly proportional to the experimentally measured values. RNA alignment and folding (RAF), an option in CONTRAfold, accepts multiple unaligned sequences and simultaneously aligns the sequences and computes a consensus secondary structure (23).
PREDICTING THREE-DIMENSIONAL STRUCTURE
Rosetta.FARNA, fragment assembly of RNA using Rosetta software, samples different possible conformations of RNA to predict three-dimensional structure (14). Computational analysis of the crystal structure of the 50S ribosomal subunit of Haloarcula marismortui (4) produced a library of RNA fragments, which is a collection of torsion angles for all possible conformations of three nucleotide pieces, and an energy function, which includes terms for hydrogen bonding, base stacking, compactness, steric clashes, and planar base pairs. The only additional input is the sequence of the RNA to be predicted. The Monte Carlo sampling method limits the length of the RNA to approximately 40 nucleotides. This method accurately predicts the noncanonical regions for 13 out of 20 test cases, including conformations not in the ribosome, such as the base triple in the SL2 stem-loop of the HIV psi packaging signal (2) and the pseudoknot in beet western yellow mosaic virus (30).
MC-Sym.MC-Sym, macromolecular conformation by symbolic generation, uses a hierarchical approach to three-dimensional structure prediction (74). MC-Fold predicts a secondary structure; MC-Sym predicts a three dimensional structure from a given secondary structure. The program is based on nucleotide cyclic motifs, the smallest nondivisible unit in a graph grammar description of RNA structure which includes backbone torsion angles, base pairing, and base stacking interactions (56, 89). The program also incorporates experimental data from chemical modification, SHAPE, hydroxyl radical footprinting, phylogenetic alignments, and distance restraints from nuclear magnetic resonance spectroscopy. The program is capable of predicting structures of 150 nucleotides and correctly predicts 11 of 13 test sequences, which contain a maximum of 47 nucleotides and include the pseudoknot in yellow leaf virus (13).
APPLICATIONS OF RNA STRUCTURE PREDICTION TO RNAi
One of the most exciting applications of RNA structure prediction is toward understanding and manipulating the ability of small RNAs to regulate gene expression. RNA structure prediction of the secondary structure of the mRNA target, the pre-microRNA, or short hairpin RNA can improve microRNA identification and siRNA design strategies (44, 50, 59, 88, 90). The accessibility of the mRNA target site for base pairing and the energetic cost of rearranging the RNA secondary structure can be important for estimating the effectiveness of the small RNA for regulating gene expression. RNA structure prediction can improve identification of viral sequences that are Dicer substrates to be processed as viral siRNA and also identification of viral siRNA target sites (28, 44, 50, 88, 90, 95). Sequence complementarity alone does not provide sufficient specificity for effective siRNA strategies. Higher-order RNA structure can provide the specificity in molecular recognition to reduce off-target effects (44, 50, 59). RNA structure prediction can also facilitate the interpretation of mutations. For example, in studies of RNAi targeting HIV type 1 (HIV-1) in human cells (102), some resistance mutations in the HIV-1 sequence emerged that were not at the siRNA target site. When the secondary structure prediction of the target site region was recalculated with the resistance mutation sequence, an alternate fold was revealed. Mutations that cause changes in secondary structure in HIV-1 RNA and MS2 bacteriophage RNA occur in forced-evolution experiments in which a deleterious mutation is introduced and the progeny are sequenced to identify compensating mutations (5, 58). Accurate structure prediction methods can thus facilitate future RNAi technologies and viral RNA research.
ACKNOWLEDGMENTS
The author is supported by grants from the Oklahoma Center for the Advancement of Science and Technology and the Pharmaceutical Research and Manufacturers of America Foundation and by an institutional research grant from the American Cancer Society to the University of Oklahoma Health Sciences Center.
The author thanks Douglas H. Turner and Kay Sheez for critical reading of the manuscript.
- Copyright © 2009 American Society for Microbiology
REFERENCES
- 1.↵
- 2.↵
- 3.↵
- 4.↵
- 5.↵
- 6.↵
- 7.↵
- 8.↵
- 9.↵
- 10.↵
- 11.↵
- 12.↵
- 13.↵
- 14.↵
- 15.↵
- 16.↵
- 17.↵
- 18.
- 19.↵
- 20.↵
- 21.↵
- 22.↵
- 23.↵
- 24.↵
- 25.↵
- 26.↵
- 27.↵
- 28.↵
- 29.↵
- 30.↵
- 31.↵
- 32.↵
- 33.↵
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- 38.
- 39.↵
- 40.↵
- 41.↵
- 42.↵
- 43.↵
- 44.↵
- 45.↵
- 46.↵
- 47.↵
- 48.↵
- 49.↵
- 50.↵
- 51.↵
- 52.↵
- 53.↵
- 54.
- 55.↵
- 56.↵
- 57.↵
- 58.↵
- 59.↵
- 60.↵
- 61.↵
- 62.
- 63.
- 64.↵
- 65.↵
- 66.↵
- 67.↵
- 68.↵
- 69.↵
- 70.↵
- 71.↵
- 72.↵
- 73.↵
- 74.↵
- 75.↵
- 76.↵
- 77.↵
- 78.↵
- 79.↵
- 80.↵
- 81.↵
- 82.
- 83.↵
- 84.↵
- 85.↵
- 86.↵
- 87.↵
- 88.↵
- 89.↵
- 90.↵
- 91.
- 92.↵
- 93.↵
- 94.↵
- 95.↵
- 96.↵
- 97.↵
- 98.↵
- 99.↵
- 100.↵
- 101.↵
- 102.↵
- 103.↵
- 104.↵
- 105.↵
- 106.↵
- 107.
- 108.↵
- 109.↵
Author Bios
Susan J. Schroeder began exploring the RNA world as an undergraduate in Douglas H. Turner's laboratory at the University of Rochester in New York. She completed her doctoral thesis in the Turner laboratory and studied thermodynamic stabilities of RNA asymmetric internal loops, RNA structure prediction, and nuclear magnetic resonance spectroscopy of a loop in a group I intron active site. As an NIH Ruth L. Kirchstein postdoctoral fellow in Peter B. Moore's laboratory at Yale University, she studied ribosome crystallography, antibiotic resistance, and rRNA mutagenesis. Now an assistant professor in the Chemistry and Biochemistry Department at the University of Oklahoma, Dr. Schroeder focuses on viral RNA structure, function, and energetics. Dr. Schroeder and members of her laboratory study satellite tobacco mosaic virus RNA and prohead RNA from bacteriophage packaging motors in order to improve RNA structure prediction from sequence and discover the fundamental knowledge necessary to solve the RNA folding problem.