Previous Article | Next Article ![]()
Journal of Virology, March 2006, p. 2483-2494, Vol. 80, No. 5
0022-538X/06/$08.00+0 doi:10.1128/JVI.80.5.2483-2494.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Department of Microbiology, University of Washington School of Medicine, Seattle, Washington 98195
Received 15 August 2005/ Accepted 14 December 2005
|
|
|---|
|
|
|---|
Recombination of two different retrovirus strains relies on coinfection of a host cell by two genetically distinct viruses and copackaging of one genomic RNA from each integrated provirus into a single virion (26, 27, 55). The heterozygous virion then infects a new target cell where a chimeric recombinant provirus can be generated during reverse transcription. Recently, copackaging of heterozygous virions was demonstrated to be strongly affected by the identity of the dimerization initiation sequence. Genomes with different dimerization initiation sequences were packaged together at a lower frequency than those with identical palindromes, resulting in an apparent decrease in the rate of recombination in the former case (12). Homozygous genomes can also recombine when polymerization switches from one genome to the other during reverse transcription, but this does not necessarily change the genotype of the progeny virus.
Nearly all retroviral recombination occurs during minus-sense DNA synthesis, when reverse transcriptase (RT) utilizes the genome RNA as a template for polymerization (3, 63). The mechanism by which two RNA genomes recombine was first described as "copy choice" by Coffin (13). Variations of this model have been formulated to incorporate factors that influence various stages of the process, namely, RT processivity and RNase H activity (8-10, 17-19, 23, 28, 45, 48, 49, 52, 53, 57, 60), RNA secondary structure (1, 2, 4, 5, 20, 38, 40-42, 66), and the viral nucleocapsid protein (NC) (16, 30, 44, 52, 54, 66). Overall, the recombination process is very similar to the obligate strand transfers mediated by the repeat and primer binding site (PBS) sequences during reverse transcription (14). As RT synthesizes DNA on one RNA template (the "donor" template), the RNase H activity hydrolyzes the template RNA behind the polymerase, allowing the nascent DNA to base pair with a second RNA template (the "acceptor" template). Branch migration then "zippers" together the DNA and acceptor template RNA. The position on the acceptor template where the DNA 3' terminus ultimately anneals defines the crossover point, and the annealing appears to be facilitated by single-stranded regions of the acceptor RNA, such as loops and bulges, that are potentially more accessible for base pairing (20, 22, 29).
One version of this recombination mechanism was coined the "dynamic" copy choice model. A mutational analysis of the MLV RT demonstrated that a decrease in polymerase activity increased the rate of homologous recombination while a decrease in RNase H activity decreased recombination in vivo (28, 57), suggesting that a balance exists between RNase H activity and DNA synthesis, which determines the overall rate of recombination. Once the nascent DNA starts to anneal to the acceptor RNA template, extension of the nascent DNA continues until branch migration forces complete DNA 3' terminus transfer.
A major factor affecting the rate of continued RT-mediated DNA synthesis is the presence of pause sites. Pausing during synthesis on RNA templates has been attributed to stretches of homopolymers G and C and to duplex RNA structures (30, 31, 34, 47, 56, 60). In vitro, such pause sites are associated with an overall increase in strand transfer (52, 53, 67), but more recent work has demonstrated that pausing has a greater influence on the point of crossover. When RT encounters a pause site, the location of the 3' terminus transfer to an acceptor template clusters near the pause position (15, 33). If pausing is minimal, however, transfer still occurs but with a more random crossover point distribution (15). Similarly, pausing on RNA templates in vitro was correlated with a preferred region of viral recombination in vivo; however, the effects of removing the pause sites were not determined (21).
Pausing within the duplex region of stem-loop structures has been shown to diminish when base pairing contiguity was disrupted by the introduction of single unpaired nucleotides or mismatches without changing the template sequence (33, 34). This observation provides a unique opportunity to determine if pausing or lack of pausing affects strand transfer in vitro and retroviral recombination in vivo. By employing a variation of a previously characterized Moloney MLV (M-MLV)-based recombination assay (66), recombination in the presence or absence of predicted strong pause sites was monitored with the reconstitution of green fluorescent protein (GFP) expression. Comparing in vitro primer extension assays on RNA segments derived from the proviral constructs with the proportion of GFP-expressing, MLV-infected target cells provides the first direct evidence that the rate of homologous recombination directly corresponds to the extent of pausing during reverse transcription.
|
|
|---|
-32P]ATP was from PerkinElmer Life Sciences. Reagents for RNA solutions were from Ambion. The neomycin analogue G418 was from Sigma or Mediatech Inc. Dulbecco's modified Eagle's medium (DMEM), trypsin, and fetal bovine serum (FBS) were purchased from Gibco. PGEM9Zf() and the T7 RiboMax transcription kit were from Promega. The pMP-1 plasmid was a very generous gift from Vinay Pathak (NIH HIV Drug Resistance Program). Cloning disks, methotrexate, aminopterin, 50x hypoxanthine-aminopterin-thymidine, and 50x hypoxanthine-thymidine medium supplements were from Sigma. Plasmids and retroviral vectors. The parent vector for all retroviral constructs is the M-MLV-based pMP-1 (11). New proviral constructs were made as follows. Using site-directed mutagenesis, an EcoRI site and SalI site were introduced into pMP-1 between the GFP initiating ATG and the rest of the GFP coding sequence, producing pMPes, which has the initiating ATG, followed by 5' GAATTCTTCGGTCGAC 3' and the remaining GFP sequence. pMPes was digested with EcoRI and SalI and ligated to PLf (5' AATTCTTCCTCGAGGCGATCGATCTGG3') and PLr (5' TCGACCAGATCGATCGCCTCGAGGAAG 3') that had been previously annealed to each other. This generated pMPmcs (for multi-cloning site), which contains the restriction sites EcoRI, XhoI, PvuI, ClaI, and SalI between the GFP initiation codon and the remaining GFP coding sequence (5' to 3'). The oligonucleotides HAf and HAr (5' CCGGTTACCCTTACGACGTGCCCGACTACGCGC 3' and 5' TCGAGCGCGTAGTCGGGCACGTCGTAAGGGTAA 3', respectively) and PSf and PSr (5' AATTCGGTCTAACTAGGGAGACCCACA 3' and 5' [Phosph]CCGGTGTGGGTCTCCCTAGTTAGACCG 3', respectively) were ligated to each other, generating a duplex fragment with EcoRI and XhoI complementary overhangs. This PS/HA duplex was then ligated to EcoRI-XhoI-cut pMPmcs to generate pMPph. pMPph was then digested with XhoI and SalI and ligated to create an in-frame N-terminal fusion to the GFP coding sequence. Separately, pMPph was transformed into Escherichia coli strain GM48, a most generous gift from Joyce Karlinsey of K. Hughes' lab (University of Washington). This was necessary, since the ClaI site is known to be resistant to cleavage when grown in strains capable of methylation. pMPph purified from GM48 was digested with ClaI and XhoI. The final product was then ligated with previously annealed oligonucleotides STOPf and STOPr (5' TCGAGTAGATAACTGATTAAGACTGACATAACGTGACAT 3' and 5' CGATGTCACGTTATGTCAGTCTTAATCAGTTATCTAC 3', respectively) creating pNoDR (no direct repeat). E. coli strain GM48-derived pNoDR was digested with SalI and ClaI and gel purified. The heat-annealed oligonucleotides TARhpF and TARhpR (5' CGATGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCAGATCTGGTCTAAC TAGGGAGACCCACA 3' and 5' CCGGTGTGGGTCTCCCTAGTTAGACCAGATCTGAGCTCCCAGGCTCAGATCTGGTCTAACCAGAGAGACCCA T 3', respectively) were ligated to previously annealed HAp (5' [Phosph]CCGGTTACCCTTACGACGTGCCCGACTACGCGC 3') and HAr. The resulting ligation product was subsequently ligated to ClaI-SalI-cut pNoDR, generating pHPnb (hairpin, no-bulge). pHPnb was used as template for site-directed mutagenesis to create pHP-1b (one bulge) and pHP-2b (two bulges) (primer sequences available on request). All plasmids were sequenced in the region of the inserted sequences for final verification.
In vitro RNA expression vectors and RNA synthesis. Primers that flank the inserted sequences described above were used for PCR amplification of proviral vector templates. The PCR products were digested with EcoRI and HindIII and ligated into EcoRI-HindIII-cut pGEM9Zf() (Promega), creating pIVNOSTOP, pIVNoDR, pIVHPnb, pIVHP-1b, and pIVHP-2b. The ligated fragment was oriented such that positive-sense RNA could be produced using the T7 promoter. All constructs were verified by sequencing. The pGEM9Zf()-derived vectors were digested with HindIII, and RNA was synthesized using the RiboMax transcription kit (Promega) with T7 RNA polymerase. Full-length RNA product was purified on a 5% denaturing polyacrylamide gel.
Primer extension assays.
Ten picomoles of the DNA primer 40extMP (5' TCACATCGCCATCCAGTTCCACGAGAATTGGGACCACGCC 3') was 5' end labeled with polynucleotide kinase in the presence of [
-32P]ATP at 37°C in a total reaction volume of 20 µl. Approximately 0.5 pmol labeled primer was annealed to 1 pmol of the RNA templates described above by heating to 85°C for 2 min and cooling to 37°C at a rate of 0.02°C/s using a Hybaid Thermocycler (Savant) in the presence of 71 mM Tris (pH 8.0) and 71 mM KCl. Dithiothreitol, MgCl2, and 10 pmol of M-MLV RT (U.S. Biochemicals) were added, and the mixture was further incubated at 37°C for 5 min. Primer extension was initiated by the addition of deoxynucleoside triphosphates. In a total volume of 20 µl, the final concentrations were as follows: 50 mM Tris (pH 8.0), 50 mM KCl, 10 mM dithiothreitol, 5 mM MgCl2, 100 µM (each) deoxynucleoside triphosphates, 25 nM primer, 50 nM template, and 500 nM M-MLV RT. At each time point, 3-µl aliquots were removed and transferred to 12 µl of 96% formamide-20 mM EDTA. Denaturing polyacrylamide gel electrophoresis was used to separate the reaction products. Gels were dried and exposed to a PhosphorImager screen, which in turn was scanned using a STORM PhosphorImager. To verify the identity of in vitro strand transfer product (see Fig. 4), the putative transfer products for HPnb and HP-1b and the full-length NOSTOP product were gel purified and PCR amplified using the primers 40extMP (see above) and SText [TAACCTAATCTGAAAGTAGGGAGCTCGTCGACGAATTC] and the PCR products were sequenced. It should be noted that a PCR product could be generated only if strand transfer had occurred, since the EcoRI site (in boldface) to which SText anneals is not present if the band in the gel resulted from pausing during sequential extension. The sequences of the PCR products clearly identified the bands in the HPnb and HP-1b experiments as the strand transfer product (data not shown).
![]() View larger version (45K): [in a new window] |
FIG. 4. (A) An 8% denaturing polyacrylamide gel of an in vitro primer extension time course using 5' end-labeled 40extMP (see Materials and Methods) DNA primer. Labeled primer was annealed to in vitro-synthesized RNA templates NOSTOP (lanes 1 to 6; 160 full-length nucleotides), HPnb (lanes 7 to 12; 301 full-length nucleotides), HP-1b (lanes 13 to 18; 300 full-length nucleotides), and HP-2b (lanes 19 to 24; 299 full-length nucleotides). Above each set of lanes are time points and a schematic of the RNA template structure. Pause sites within the stem-loop structures are labeled on the right and numbered in relation to the first base pair at the base of the stem, which is denoted +1. (B) Analysis of the proportion of strand transfer product generated during primer extension of HPnb, HP-1b, and HP-2b relative the amount of full-length product synthesized on the NOSTOP template. For a given experiment, the proportion of signal corresponding to the full-length extension on the positive control NOSTOP template at the 20-min time point was determined and defined as 100% strand transfer. The amount of strand transfer product at 20-min time points for HPnb, HP-1b, and HP-2b was then calculated and normalized to the NOSTOP full-length template. Values are results from three independent experiments.
|
Expression of proviral vectors in D17 cells. D17 cells are a dog osteosarcoma-derived cell line susceptible to gibbon ape leukemia virus (GALV)-mediated MLV infection. D17 cells were transfected with 5 µg of each plasmid by the calcium phosphate method. G418-resistant cells were selected by growing the cells in DMEM supplemented with 6% FBS and 400 µg/ml G418 (active) for 10 days. Fluorescence microscopy and imaging were performed using a Nikon TS-100 inverted fluorescence microscope with Q-capture software generously provided by M. Lagunoff (University of Washington).
Infection of D17 with MLV produced by PG13 clones for flow cytometry analysis. For each virus strain, duplicate 60-mm dishes were seeded with 5 x 105 D17 cells and grown overnight. The next day, the medium was changed to DMEM supplemented with 6% FBS and 4 µg/ml polybrene and virus was added from each clone to two separate 60-mm plates at an approximate multiplicity of infection [MOI] of 2 x 103 CFU/cell per plate. As a negative control, 100 µl of supernatant collected from mock-transfected PG13 cells grown in selective medium for 2 days (before die-off) was added to a separate plate. After 24 h, each 60-mm plate was split into two 100-mm dishes and selected for 9 to 12 days. On the final day of selection, colonies from duplicate 100-mm plates were pooled and the numbers of GFP+ and GFP cells were determined using the Influx flow cytometer from Cytopeia. Flow data were analyzed with Summit 3.1 software from DakoCytomation.
|
|
|---|
![]() View larger version (17K): [in a new window] |
FIG. 1. Schematic diagram of proviral constructs. (A) Constructs used for generating virus from PG13 packaging lines. At the top is the general provirus structure of the unmodified MP-1 vector. LTR, long terminal repeat; , packaging sequence; Neo, neomycin resistance gene; ppt, polypurine tract. Below the structure are expanded schematics of the region containing the GFP translation start site. The letters abcde represent the direct repeat sequence described in Materials and Methods. STOP corresponds to a sequence with stop codons in all three reading frames. Unpaired nucleotides in HP-1b and HP-2b are represented by sideways carats between "a" and "b" and "b" and "c" along the stem-loop. (B) Schematic diagram of the stem-loops introduced into constructs HPnb, HP-1b, and HP-2b. The sequence is derived from the HIV-1 TAR sequence. Gray boxes indicate modified regions, and the specific nucleotides are in boldface. Note that the sequence involved in the recombination process does not change.
|
Figure 2 schematically outlines the recombination process predicted to occur during reverse transcription of the packaged MLV genome derived from the provirus vectors described in Fig. 1. Previous work has shown that during synthesis through stem-loop structures by RT, the polymerase pauses at specific positions on the proximal strand. This pausing is relieved if unpaired nucleotides interrupt hairpin-stem base pair contiguity (33). As RT synthesizes minus-strand DNA, it replicates the proximal direct repeat first. If RT is able to replicate through the first direct repeat, the hairpin, the STOP sequence, and the second direct repeat, then the resulting integrated provirus will produce a transcript in which the GFP would not be translatable because of the presence of the stop codons and, as mentioned above, in some cases, a frame shift (Fig. 2A). If pausing occurs, however, polymerization through the various sequences may be delayed long enough for the process of strand transfer (recombination) to occur in which the 3' terminus of the nascent DNA switches from the original RNA template (donor template) to the second packaged template (acceptor template) (Fig. 2B). This crossover could take place at the proximal or distal repeat of the acceptor strand during intermolecular recombination. Alternatively, although not depicted in Fig. 2B, the 3' terminus could transfer to the distal repeat of the same donor template (intramoleculer recombination). Previous work indicated that the overall rate of recombination would not be affected by which of the repeats is utilized for the transfer (29). If complete DNA 3' terminus transfer occurs intermolecularly at the proximal repeat of the acceptor template, then RT would still replicate the hairpin and STOP sequence and the infected cell would not express GFP. If complete transfer occurs at the distal repeat on either template, however, the region containing the hairpin and STOP sequence would loop out and be deleted. This process would restore the proper GFP reading frame and remove all stop codons between the initiating ATG and the GFP coding region. The expressed protein would be GFP with a 22-amino-acid N-terminal fusion as in the NOSTOP construct, and infected cells in which this process occurred would be expected to exhibit GFP fluorescence.
![]() View larger version (18K): [in a new window] |
FIG. 2. Schematic diagram depicting two possible outcomes of synthesis through stem-loop structures during reverse transcription. Gray indicates RNA, and black indicates DNA; short vertical lines indicate annealing. (A) Synthesis through a stem-loop without pausing. RT polymerizes through the structure and replicates the STOP and distal direct repeat sequences. The resulting provirus (bottom) will not express GFP due to the presence of the stop codons. (B) Synthesis through a stem-loop with pausing. RT dissociates at the pause site ("c"), and RNase H degrades the donor template RNA (dashed line) behind the DNA 3' terminus. Base pairing and branch migration transfer the nascent DNA to an acceptor template such that RT resumes synthesis at the distal direct repeat. The stem-loop and STOP sequences are deleted, and the resulting provirus translates GFP with a 22-amino acid N-terminal fusion (abcde).
|
![]() View larger version (71K): [in a new window] |
FIG. 3. Comparison of GFP expression in D17 cells transfected with the proviral vectors shown in Fig. 1 and selected for G418 resistance for 10 days. Light microscopy (above) and fluorescence microscopy (below) show that only the NOSTOP positive control construct produces cell associated green fluorescence. Above each set of images is a schematic diagram of the construct used for transfection (Fig. 1).
|
50% strand transfer product in relation to NOSTOP full-length, whereas HP-2b generated only 10% strand transfer product. Together, these data set a baseline from which we can predict the outcome of synthesis through these structures during viral infection. According to our hypothesis, since reverse transcription should be difficult through HPnb and HP-1b hairpins compared to synthesis through the HP-2b hairpin, there should be a higher rate of recombination and therefore a larger proportion of GFP-positive colonies after infection with HPnb and HP-1b virus compared to the proportion observed after infection with HP-2b virus. Pausing in vitro directly correlates to an increase in recombination during M-MLV reverse transcription in vivo. The NIH 3T3-derived clone PG13 produces MLV pseudotyped with the GALV envelope (39). Since the GALV envelope cannot mediate infection of NIH 3T3 cells and its derivatives, the PG13 line is an ideal packaging cell for preventing reinfection and the potential confounding effects of an integrated provirus that has undergone reverse transcription-mediated recombination. Stably transfected PG13 clones producing vNOSTOP, vNoDR, vHPnb, vHP-1b, and vHP-2b (the "v" denotes "virus") were characterized for their pattern of GFP expression, virus production, and the nature of the integrated proviral sequences. Flow cytometric analysis of the virus-producing cell lines showed that the NOSTOP clones expressed GFP as expected, while the NoDR, HPnb, HP-1b, and HP-2b clones were negative and similar to the untransfected PG13 control cell line (data not shown). Genomic DNA was harvested from the various clones, and the region encompassing the inserted hairpin sequences was amplified by PCR using flanking primers. Only cell clones that had the correct GFP expression pattern, an unmodified PCR product sequence, and a sufficient titer were used for infection of D17 cells for flow cytometry analysis.
As outlined in Fig. 5, virus derived from these packaging clones was used to singly infect D17 cells, a dog osteosarcoma-derived line susceptible to GALV envelope-mediated MLV infection. Figure 6 shows a representative analysis of G418-resistant D17 cells infected with virus derived from each construct. The data in Fig. 6A and 6B define the lower (NoDR, 0.12% GFP+) and upper (NOSTOP, 76.8% GFP+) limits of detection of GFP+ D17 cells. When virus containing a hairpin structure was used to infect D17 cells, there was a mix of GFP+ and GFP cells (Fig. 6C, 6D, and 6E). In this particular experiment, approximately 31% of D17 cells infected with HPnb were GFP+, whereas only 2.4% of cells infected with HP-2b were GFP+. For each construct, in a total of four separate experiments, virus from at least three separate packaging lines was used in at least nine independent infections. Normalizing the number of GFP+ cells in each experiment to the corresponding NOSTOP positive control revealed that recombination during reverse transcription of the HPnb genome occurred nearly 50% of the time, whereas for HP-1b, it was 27% and for HP-2b, it was only 4.5% (Fig. 7). Thus, the flow cytometry data show a direct relationship between the amount of pausing within the stem-loop structure observed with in vitro primer extension assays and the number of GFP+ cells detected by flow cytometry.
![]() View larger version (25K): [in a new window] |
FIG. 5. Overview of infection assay to determine the proportion of GFP+ D17 cells by flow cytometry. Above is the general proviral structure of plasmids used to create stably transfected PG13 packaging lines. LTR, long terminal repeat.
|
![]() View larger version (26K): [in a new window] |
FIG. 6. Representative flow cytometry analysis of D17 cells infected with virus derived from virus packaging lines producing NoDR (A) , NOSTOP (B), HPnb (C), HP-1b (D), or HP-2b (E). Horizontal axis shows GFP fluorescence (GFP); vertical axis shows propidium iodide (PI) signal. Percentages of live cells expressing GFP (GFP+ in the text) are indicated in the lower right corner of each panel.
|
![]() View larger version (12K): [in a new window] |
FIG. 7. Graph showing the mean percentage of GFP+ cells relative to the NOSTOP positive control ± 1 standard deviation. In four separate experiments, virus from at least three independent PG13 packaging lines was used for each construct. Total numbers of live cells analyzed in all four experiments were 389,738 for NOSTOP, 425,736 for NoDR, 429,402 for HPnb, 369,837 for HP-1b, and 511,857 for HP-2b.
|
![]() View larger version (88K): [in a new window] |
FIG. 8. Comparison of genomic PCR products and GFP phenotypes of cloned, infected D17 cell lines. At the top is an ethidium bromide-stained 1% agarose gel comparing PCR products amplified from genomic DNA from the indicated PG13 and D17 clones. Clone names are indicated above each lane. Lane C shows noninfected D17 genomic DNA, and lane L shows the 100-bp ladder (Invitrogen). Lanes 1 to 4 are PG13 packaging lines for NOSTOP, NoDR, HPnb, and HP-2b, respectively. The right hand portion of gel shows PCR products from D17 clones infected by NOSTOP virus (lane 5), NoDR virus (lane 6), HPnb virus (lanes 7 to 10), and HP-2b virus (lanes 11 to 14). Below the gels are light (top rows) and fluorescent (bottom rows) microscopy comparisons for each of the D17 clones from which genomic DNA was derived for PCR in lanes 5 to 14 of the agarose gel. Below each image set is the corresponding lane on the agarose gel.
|
|
|
|---|
The dynamic copy choice model describing retroviral recombination proposes that an equilibrium between DNA polymerization and RNase H degradation determines the rate of crossover during reverse transcription (57, 67). Factors such as pausing (52, 53), nucleotide misincorporation (19, 45), nontemplated base addition (24), the presence of a functional NC (44, 52, 66), and nucleotide availability (17, 48, 57) all determine the rate of polymerization and whether branch migration will "catch up" to the DNA 3' terminus. Our in vitro primer extension results demonstrate that pausing increases the proportion of strand transfer product. Extension on the HPnb, HP-1b, and HP-2b RNA templates showed a relative transfer frequency of 46%, 46%, and 10%, respectively. In vivo, the recombination rates during viral reverse transcription generated from these same constructs were 47%, 27%, and 4.5%, respectively. Based on the in vivo results, disrupting RNA base pair contiguity with two unpaired nucleotides near pause sites (Fig. 1B) resulted in a 10-fold decrease in recombination (47% versus 4.5%) while the construct with one unpaired nucleotide (HP-1b) gave an intermediate result. HP-1b generates a greater proportion of strand transfer in vitro than in vivo (46% versus 27%), suggesting that the in vivo assay may be more sensitive at detecting differences in pausing and recombination within a direct repeat. Overall, our data clearly demonstrate a trend between decreased pausing within the stem of a stem-loop structure in vitro and decreased direct repeat recombination in vivo. These results provide additional support for the dynamic copy choice model for retroviral recombination.
It should be noted that strand transfer could have been initiated regardless of pausing, with complete crossover occurring beyond the distal direct repeat. Since this would not restore GFP translation, this type of event would not be detected with our system and, therefore, the overall recombination frequency is likely underestimated. The observation that increased pausing increases the proportion of GFP+-infected D17 cells confirms that pausing can direct 3'-terminal transfer to a location on the acceptor template in close proximity to the pause site in vivo (15, 33).
The results from this work provide further support for the hypothesis that the evolution of secondary structures in the retroviral genome was strongly influenced by the presence of unpaired nucleotides that disrupt base pair contiguity in the stems of stem-loop structures (33, 34). The completely duplex stem used in this study was derived from the HIV-1 TAR element, and both HIV-1 RT and MLV RT-mediated RNA displacement syntheses are impeded at identical positions within the stem that normally have single-nucleotide bulges in close proximity (Fig. 4). We also found that the frequency of homologous recombination is substantially increased when unpaired nucleotides are absent due to increased pausing within the RNA duplex stem (Fig. 6 and 7). Together, these data suggest that the spectrum of possible secondary structures that can form within a retroviral genome is limited by how well the RT can replicate through the structure. If there were strong pausing within the duplex stem, then there would be a greater probability that the secondary structure would be lost due to either abortive synthesis or, more likely, recombination. Stem-loop structures possessing unpaired bulges in the duplex portion, however, would be readily replicated, and the secondary structure would be maintained in the genomes of subsequent generations of virus. In turn, the "replicability" of secondary structures would delineate the pool of RNA conformations available for the development of RNA-protein interactions, as is seen with the HIV-1 TAR sequence and poly(A) signal (6, 32).
The data from this and previous work suggest that retroviral genomes are optimized for replication during reverse transcription (33, 34). Genomes possessing stem-loop structures with strong pause sites would likely exhibit a tendency to modify the stem-loop either by deletion due to recombination or perhaps by the formation of bulges due to RT-mediated mutagenesis. Both processes would facilitate replication of the affected structures in the genome by decreasing the pausing associated with the structure. Since retrovirus-based vectors are used for gene therapy, it may therefore be necessary to evaluate whether a nonretroviral sequence, such as a human gene, should be optimized for reverse transcription. Assuming that human RNA structures do not undergo the same selective processes as retroviral secondary structures, there may be stem-loop or other RNA secondary structures within human sequences that induce significant RT pausing in the context of reverse transcription. For retroviruses used in gene therapy, this may cause abortive reverse transcription or unwanted modification of the exogenous human sequence through recombination, which would potentially decrease transduction efficiency. Our work suggests that by destabilizing the duplex portions of RNA secondary structures, the negative effects of pausing can be avoided. Importantly, this could be accomplished without modifying the coding sequence because a nucleotide involved in base pairing can be changed without altering the encoded amino acid. By generating a mismatch that disrupts base pair contiguity, pausing can be reduced and replication of the exogenous human sequence by RT enhanced.
We thank V. K. Pathak, M. Lagunoff, M. Linial, T. Pham, P. Carroll, R. Life, and J. Karlinsey for assistance, reagents, and equipment use. Thanks to M. Zhang for excellent RNA structural probing and to David Coil for extremely helpful discussions. Also, we thank M. Linial, J. B. Leppard, J. Carey, B. T. Paulson, S. Schultz, and H. Interthal for discussion of data and for review of the manuscript prior to submission.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»