Previous Article | Next Article ![]()
Journal of Virology, November 2007, p. 12156-12168, Vol. 81, No. 22
0022-538X/07/$08.00+0 doi:10.1128/JVI.01591-07
Copyright © 2007, American Society for Microbiology. All Rights Reserved.

Department of Microbiology and Immunology, University of Michigan Medical School, Ann Arbor, Michigan
Received 20 July 2007/ Accepted 29 August 2007
|
|
|---|
|
|
|---|
How elongating reverse transcription complexes recognize and recruit secondary templates and how a growing DNA strand is transferred to an acceptor template are areas of active investigation. Models for recombination during both plus- and minus-strand DNA synthesis have been proposed, with experimental evidence supporting a dominant role for minus-strand recombination (9, 22, 67). Forced copy choice models for minus-strand recombination suggest that RT/primer-template complex dissociation at a broken template end precedes recombinogenic template switching (9). However, evidence that factors other than template integrity can modulate recombination frequency demonstrates that template breakage is not a prerequisite for recombinogenic template switching (42, 44). An alternate model for minus-strand homologous recombination—the acceptor invasion model—suggests that minus-strand DNA sequences behind the growing point for DNA synthesis, which are unmasked by the removal of RNA template segments by RNase H, recruit acceptor templates by base pairing with them and that template switching itself involves a subsequent repositioning of the primer strand's 3' end from the donor to the acceptor template (5, 8, 50).
Most retroviral recombination occurs in regions of donor/acceptor template identity, with nonhomologous recombination occurring roughly 100- to 1,000-fold less frequently than homologous recombination (69, 70). The frequencies of retroviral recombination are roughly proportional to the lengths of sequence identity (70). Recombination between similar, or "homologous," sequences occurs less frequently than recombination between identical sequences, with recombination becoming less frequent the more dissimilar templates are to one another (1). When nonhomologous recombination is monitored, short regions of sequence microidentity are observed at the crossover junctions in some instances, due to the retroviral replication machinery's remarkable capability of recognizing short patches of sequence identity in the context of otherwise nonhomologous sequence (19, 60, 65). However, many nonhomologous crossovers display no microidentity or even have sequence insertions at the crossover junctions (41). Based on the similarity between certain sequence insertions in human immunodeficiency virus type 1 clinical isolates and segments of the human genome, we have previously proposed that many of the insertions observed in retroviral genomes may result from the use of short sections of discontiguous nucleic acids as tertiary templates to bridge nonhomologous crossover junctions (3, 57).
The current work employs a novel Moloney murine leukemia virus (MMLV) forced template switching assay to examine how systematic variation in lengths of donor/acceptor identity affects recombinogenic template switching. Acceptor template selection in the absence of appreciable donor/acceptor identity and the mechanisms employed by the reverse transcription machinery to resolve nonhomologous recombination junctions were addressed by sequencing and analyzing a large number of individual forced nonhomologous recombination products.
|
|
|---|
![]() View larger version (29K): [in a new window] |
FIG. 1. Forced recombination assay. (A) Two-vector recombination assay as described in reference 43. Newly synthesized DNA undergoes a recombinogenic strand transfer from the 5' end of an RNA donor (D) template to the 3' end of a distinct RNA acceptor (A) template. PRSV, Rous sarcoma virus promoter; pbs, primer binding site; ppt, polypurine track. (B) RIO template. DNA is shown as a complete circle. The RIO template contains a single, modified, central LTR ( U U5). An SV40 promoter-driven puromycin resistance gene containing an artificial intron is separated into 5' (puro-, in-) and 3' (-tron, -mycin) portions. Homologous D and A regions are designated within the intron. (C) Recombination during reverse transcription using RIO vector. Upon reaching the 5' end of the donor template, the newly synthesizing DNA is forced to undergo a recombinogenic strand transfer to a region of sequence identity within the acceptor template. After completing plus-strand synthesis and transfer, the final provirus resembles a retroviral provirus. The puromycin resistance gene and intron are reassembled, and two LTRs flank the provirus. is the packaging signal, and PSV40 indicates the SV40 late promoter. Primers shown in the bottom row are those used to analyze products as for Fig. 2D.
|
Cells and virus. 293T cells (human embryonic kidney cells expressing simian virus 40 [SV40] T antigen) were grown in Dulbecco's modified Eagle's medium supplemented with 10% fetal calf serum and 1% penicillin-streptomycin. RIO vector plasmids were cotransfected with MMLV gag-pol and MMLV ecotropic env helper plasmids (pNGVL3 gag-pol and pAM 178-2, respectively [44, 64]) into 293T cells by calcium phosphate precipitation as previously described (44). Medium containing virions was harvested beginning 24 h posttransfection and filtered through 0.2-µm filters. D17/pJET cells (canine osteosarcoma cells expressing murine ecotropic receptor) were infected with serial dilutions of vector-containing virions in the presence of 0.8 µg/ml Polybrene (hexadymethrine bromide; Sigma) for 2 h and selected 48 h postinfection in Dulbecco's modified Eagle's medium with 10% calf serum, 1% penicillin-streptomycin (GIBCO/Invitrogen), and 1 µg/ml puromycin (25, 39). Titers were determined by the number of CFU per ml of virion-containing medium after 12 days of puromycin selection. Titers were averaged from four independent infections using virus from at least two independent transfections for each vector. CFU/ml values were then normalized to RT protein levels, as determined by RT activity (58), relative to a reference sample. Error bars were calculated as standard deviation of the mean.
Analysis of strand transfer products. For homologous recombination products, after 12 days of selection in puromycin, colonies were pooled, and provirus-containing genomic DNA was isolated using a quick lysis plus proteinase K digestion protocol modified from the work of Woffendin et al. (63). The artificial intron within the puromycin-coding sequence was PCR amplified using primers SF304 (5'-GGCCTAGGCTTTTGCAAAAAGCTTAC-3') and MK204 (5'-TCGGCGGTGACGGTGAA-3') for 35 cycles. For product mobility analyses, SF304 was 5' end labeled with [32P]ATP and T4 polynucleotide kinase, and radioactive PCR products were digested with AscI, separated on a 5% acrylamide gel, and quantified by phosphorimager. Alternately, product bands were resolved on 1% agarose gels and purified using a QIAquick gel extraction kit (QIAGEN) for sequencing by the University of Michigan DNA Sequencing Core Facility.
For nonhomologous recombination product analysis, individual puromycin-resistant colonies containing integrated humanTM vector products were expanded for DNA extraction, and genomic DNA was isolated using a DNeasy tissue kit (QIAGEN). Intron-containing puromycin sequences were PCR amplified and sequenced as described above.
Sequences were analyzed by aligning parental vector and product sequences with DNAStar SeqBuilder (Lasergene). When a likely template could not be identified in the vector, sequences were compared to GenBank using NCBI BLASTn. Sequences including and flanking the insertions were queried, with the longest identified putative template segment designated the best candidate template. Sequences were considered to be likely templates if sequence identities shared by inserts and putative templates were 8 bases or greater in the case of vector-derived sequences and 19 bases or greater in the case of host sequences. Matches of these lengths would be predicted to arise roughly 10-fold less frequently than by random chance.
|
|
|---|
Here, we revisited forced copy choice recombination by use of a single-vector RIO approach, which circumvents the need for the copackaging of two coexpressed vectors. RIO vectors contain primary (donor) and secondary (acceptor) template sequences like those used in two-vector forced recombination assays, but these two recombination templates are contained on single RNAs. RIO vectors have been called "prejumped" because their first DNA intermediate resembles a reverse transcription product that results after minus-strand strong-stop strand transfer (33). The recombination donor was engineered to reside at the RNA's 5' end, as in two-vector forced homologous recombination assays. However, the acceptor was engineered into the 3' portion of a single RIO vector RNA. Here, the two halves of the RIO vector each contained a portion of the puromycin resistance gene separated by an artificial intron that served as the recombination target sequence (Fig. 1B). Generation of an intact puromycin resistance gene, which requires recombination between 5' donor and 3' acceptor sequences, proceeds readily, and thus RIO vectors provide a highly tractable one-vector approach for studying retroviral recombination (Fig. 1C).
Previous two-vector forced homologous recombination experiments suggested that 30-nucleotide secondary templates are recognized as well as longer acceptors, and it has been determined that the reverse transcription machinery can extend a single mismatched nucleotide fairly efficiently (43). Thus, to address how frequently RT proceeded to the primary (donor) template's 5' end before switching to a secondary template, the extent of single-base mismatch extension was determined using a forced homologous recombination assay. Donor and acceptor sequences were designed so that recombination occurring precisely from the donor template's 5' end—which would require mismatch extension—would yield a diagnostic restriction site in the product DNA, while premature template switching would not generate the diagnostic site. An overview of the experimental approach is presented in Fig. 2A.
![]() View larger version (29K): [in a new window] |
FIG. 2. One-base mismatch extension by RIO vector. (A) Experimental scheme. (B) 30TM acceptor. Newly synthesized DNA transfers from a donor template to an acceptor template with 30 bases of identity. If the DNA transfers at any position within the 30 bases of identity on the donor template, an AscI site will be generated in the provirus. (C) 1TMM acceptor. The 1TMM acceptor contains 29 bases of identity and 1 mismatched base. If DNA transfers from the 5' end of the donor and the mismatch extends, an AscI site will be generated. If the DNA transfer occurs prior to the template end and no mismatch extension occurs, the template base will be incorporated and no AscI site will be formed. Acceptor region abbreviations are given in the legend to Fig. 1. (D) PCR amplification of puromycin resistance gene from pooled proviral DNA by use of primers indicated in Fig. 1C and analysis with AscI digestion. pRSVpuro, a vector containing an intronless puromycin resistance gene amplified with primers SF304 and MK204. RIO vectors, with their artificial introns, yield a longer amplification product. PCR products from 30TM and 1TMM, undigested and digested with AscI, are shown. Alternate targets that appear in the 1TMM product pool are indicated. puro, puromycin.
|
Figure 2D compares products of the terminally mismatched and terminally matched RIO vectors. Digestion with the diagnostic restriction site for products of the matched template vector was >95% complete. For the single-base mismatch vector, 80% of the products were digestible with the diagnostic enzyme. This suggests that roughly 80% of the products of this vector resulted from RT proceeding to the donor template's 5' end, transferring to the mismatched acceptor template, and extending the 1-base mismatch that would result upon template switching. About one-half of the products that did not cut with the diagnostic enzyme migrated slightly faster than the correctly targeted product (Fig. 2D, rightmost lane). Low levels of this alternate target product were detectable among terminally matched vector products as well. Sequencing revealed that this product resulted from the use of 5 bases of fortuitous microidentity between the donor template 5' terminus and an internal portion of the vector, which thereby served as an alternate acceptor template. These findings suggest that this vector system successfully generated forced recombination products, as a large majority of puromycin-resistant proviruses resulted from RT proceeding to the donor template's 5' end.
Donor/acceptor identity required for homologous recombination. A series of RIO vectors with incremental differences in donor/acceptor identity length was next used to assess acceptor template recognition. As shown in Fig. 3A, vectors with stepwise reductions in donor terminal identity were generated in series, and their titers were compared to those of the parental vector with 30 bases of acceptor/donor identity, as described in Materials and Methods.
![]() View larger version (36K): [in a new window] |
FIG. 3. Minimal sequence identity required for efficient and accurate recombinogenic strand transfer. (A) Acceptor sequences. The series of stepwise shortened acceptors from 27 bases to 0 bases of identity at the engineered acceptor position. (B) Titer/RT for 30 to 0 bases of identity vectors, determined as described in Materials and Methods. Note that the titer in the 5TM column, indicated with an open circle and connected to dashed lines, is the titer for a 5TM vector variant in which the major alternate target sites were eliminated by mutation. This alternate 5TM vector was generated to assess titers in the absence of the major alternate targets but is not used elsewhere in this report. (C) PCR products of pooled proviral products of the limited identity vectors, with or without AscI digestion. Product mobilities are indicated at left; some alternate target products are indicated at right. puro, puromycin.
|
Provirus-containing genomic DNA from pooled puromycin-resistant colonies produced by each vector was PCR amplified and analyzed on acrylamide gels as described above (Fig. 3C). The results revealed that nearly all of the puromycin-resistant products of acceptors of
10 bases in length were correctly targeted to the engineered acceptor. In contrast, limited use of the fortuitous 5-base alternate acceptor (Fig. 3C, bottom right arrow) was observed for the 8-base acceptor. This and other alternate acceptors predominated for the shorter acceptors in this vector series.
Forced nonhomologous recombination. For the following experiments, RIO vectors were modified to assess forced nonhomologous recombination. A RIO vector (humanTM) was constructed in which the major and ectopic acceptor sites used during reverse transcription of the above-described vectors were mutationally ablated. In their place, a human genome fragment that provided a relatively large number of candidate alternate acceptors with very limited identity to the donor was introduced. The nonhomologous acceptor template region selected for use here was a 590-base shotgun-cloned nonrepetitive noncoding fragment of human DNA (Fig. 4A). This human fragment acceptor included one region with 3 bases of identity to the donor template's 5' end and many candidate 2-base acceptors but lacked identity of 4 bases or longer. The puromycin resistance-conferring titer of humanTM, when produced by transient transfection with helper function plasmids, was 500-fold lower than that of 30TM (1 x 101 CFU/ml versus 6 x 103 CFU/ml; Fig. 4B).
![]() View larger version (26K): [in a new window] |
FIG. 4. Forced nonhomologous recombination. (A) RIO template containing a human sequence nonhomologous acceptor. The acceptor region included a 590-base fragment of human chromosome 14 that contains no region of identity greater than 3 bases with the 5' end of the donor. Note that polyadenylation signal readthrough resulted in the appendage of RNA copies of plasmid backbone sequences to a subset of the vector RNAs' 3' ends. Acceptor region abbreviations are given in the legend to Fig. 1. (B) Vector titers determined as described in Materials and Methods. 30 represents titers for the 30TM vector; human indicates humanTM values; 0 indicates the value for 0TM vector. (C) PCR amplification of cell DNA from individual puromycin resistance cell clones. The amplification product of a correctly targeted clonal 5-base acceptor product is shown, as are PCR products from representative individual humanTM product cell clones. The leftmost two lanes contain radiolabeled size standards. puro, puromycin.
|
|
View this table: [in a new window] |
TABLE 1. Nonhomologous recombination products
|
80-base) introns that lacked strong cryptic splice acceptors or polyadenylation signals in the intronic region would survive the imposed selection (62). Nonetheless, the sequences of the 83 products revealed that the reverse transcription machinery employed a remarkable variety of strategies to resolve forced nonhomologous recombination. These products can be divided into classes based on the nature of the crossover junctions and on the number of template segments employed. Forced copy choice nonhomologous recombinants. Based on the very low frequency of premature jumps observed above during RIO vector forced homologous recombination, as well as on previous observations of donor/acceptor microidentity at many nonhomologous crossover junctions, we had anticipated that most of this vector's products would result from reverse transcription to the vector's 5' end and then use of donor terminus microidentity in the acceptor as the site of transfer. Remarkably, only 1 of the 83 reverse transcription products analyzed here had that structure. Even though the acceptor region included 1 trinucleotide match to the vector's 5' end and 34 matching dinucleotides, this single product of transfer from the vector's 5' end to the acceptor template region (clone 98 in Table 1; Fig. 5A) utilized only a single base of donor terminal/acceptor microidentity. An additional five products (clones 18, 31, 65, 123, and 129 in Table 1), described below in the discussion of splinting tertiary template segments, resulted from forced template switching at the donor 5' end. However, all of these latter were complex products for which the initial template switch was to a discontiguous tertiary template sequence, rather than a direct transfer to the engineered human chromosome 14 acceptor.
![]() View larger version (29K): [in a new window] |
FIG. 5. Categories of nonhomologous recombination products. (A) Forced copy choice nonhomologous recombination product. DNA synthesized to the 5' end of the donor template transfers to an acceptor template position with primer terminal microidentity. (B) Premature jump. DNA synthesis transfers from the donor to the acceptor template before reaching the 5' end of the donor template. (C) Incorporation of tertiary template sequences. Nascent DNA transfers from the donor template to a tertiary template before subsequent transfer to the acceptor template. Illustrated here, the tertiary template used is a segment of polyadenylation signal readthrough RNA. Acceptor region abbreviations are given in the legend to Fig. 1.
|
![]() View larger version (20K): [in a new window] |
FIG. 6. Spectra of donor departure and acceptor entry sites observed in analyzed clonal products. (A) Schematic representation of humanTM vector, including polyadenylation readthrough region. Numbers at top indicate map positions on the RNA and indicate that the schematic is not to scale. Note that polyadenylation signal readthrough resulted in the appendage of RNA copies of plasmid backbone sequences to a subset of the vector RNAs' 3' ends. (B) Compendium of donor departure and acceptor entry sites for all 83 nonhomologous recombination products. Diagrammed as in Fig. 5C. Each open circle indicates a simple recombination donor departure or acceptor entry site; each black bar indicates a corresponding site for a complex recombinant. Acceptor sites for 20-base intervals are binned; stacked circles and/or bars indicate multiple entries into a single acceptor site within the interval. The RNA strand drawn in the center of Fig. 6B represents splinting templates. Crossover site locations for those recombinants that used sequences on the same or copackaged vector RNA are shown. Horizontal lines above the splinting template indicate the locations of splinting RNA template segments; lines below indicate splinting DNA templates. Each horizontal line represents a separate tertiary template segment. Each encompasses the template segments used, but these are not precisely to scale. polyA site, polyadenylation site. Acceptor region abbreviations are given in the legend to Fig. 1.
|
Identification of splinting tertiary template segments. Sequence analysis programs and database searches were used to identify candidate noncontiguous segments of RNA or DNA that served as tertiary templates for the insertions (see Materials and Methods). The sequence analyses suggested that a total of 69 segments of nucleic acid contributed as tertiary templates in the synthesis of the 56 insert-containing clones, with likely templates identified for 90% of the insert segments. Most inserts with unidentified templates were too short (median length of 5 bases) to implicate a specific template with confidence (see Materials and Methods).
Nonhomologous junctions bridged by single segments of vector-internal or polyadenylation site readthrough RNA. In the first and largest class of splinted recombinants, the gaps between donor and acceptor regions were bridged by single discontiguous portions of RIO vector RNA sequence or of sequences that were appended to the vectors' 3' ends due to vector polyadenylation signal readthrough. For five of these (Table 1, clones 10, 19, 39, 50, and 118), the bridging template was derived from discontiguous portions of vector-internal sequences (Fig. 6B). This resulted in a duplication of sequences from one location between the products' LTRs to a second remote location also between the viral LTRs: rare instances of such duplications of viral genome segments have been reported previously (31, 36, 46, 56).
For most of the splinted recombinants analyzed here, inserts were templated by sequences downstream of the vector polyadenylation site (Fig. 6B). The mechanism likely to have been used to generate these insertions (a nonhomologous template switch to a discontiguous template, limited reverse transcription on that tertiary template, and then a subsequent nonhomologous template switch) was identical to that used for the patching in of the vector-internal sequences described above. However, the outcome here was to relocate vector-external sequences into internal portions of the vector rather than to duplicate vector-internal sequences at an ectopic site. The polyadenylation cassette used in humanTM is subject to up to 20% readthrough (3), thus suggesting that as many as one-third of all humanTM RIO vector-containing virions may have contained readthrough sequences. Sequences housed on readthrough RNAs are known to be packageable and to serve readily as "patch repair" recombination templates (3, 20, 37, 53, 56, 69).
Insertions templated by nascent vector DNA, host sequences, or undetermined templates. The inserts observed in a second class of splinted recombinants were derived from the RIO vector but appeared in the product DNAs in reverse orientation, suggesting that they were templated by nascent minus-strand DNA rather than by vector RNA (clones 5, 81, 7, 11, 47, 72, and 49). "Backwards" insertions have previously been described for nonhomologous recombination products and can be explained by models for plus-strand recombination, a key feature of which is template switching between nascent DNA templates (22, 56, 66). Fairly extensive (7-base) fortuitous microidentity between the most commonly utilized minus-strand template and sequences in the acceptor template region likely contributed to the high frequency of minus-strand templated insertions observed here.
The third class of inserts was host-derived sequences. Among these were human (producer cell) leucine and lysine tRNAs (clones 18, 65, and 129), a human U1 spliceosomal RNA cDNA (clone 130), a portion of human 28S rRNA (clone 89), and canine sequences derived from the recipient cell (clone 78).
Likely templates were not identified by sequence analyses for the final class of putative template segments (7 of the 69 total insert segments). These ranged in length from 1 to 16 bases. The possibility that these segments were encoded by tertiary templates cannot be ruled out, and in each case, tertiary templates too short to fulfill the significance criteria described in Materials and Methods could be identified and may have been the source of the inserts (not shown). On the other hand, especially for the shortest of these, the insertion may have resulted from nontemplated base addition prior to nonhomologous recombination. Single-base frameshifting is a common class of reverse transcription error, and under some circumstances RT can add and extend single nontemplated bases (27, 41, 45). However, it is interesting that retrospective analysis of previous insertion-in-deletion mutations suggests that even single-base insertions may in some instances be tertiary template directed (40) (Fig. 7).
![]() View larger version (31K): [in a new window] |
FIG. 7. Retrospective alternate model for generation of single-base insertion plus flanking hypermutations. Based on nonhomologous recombination product 72.P8 in reference 40. (A) Model proposed in reference 40. A single-base insertion occurs during nonhomologous template switching, followed by junction-associated hypermutation by RT as it copies the acceptor template. (B) Splinted recombination model for single-base insertion. RT uses 2 bases of microidentity to switch to the tertiary template and then 6 bases of microidentity to switch back to the acceptor template. This model calls for one base substitution between the candidate splinting template, identified by BLASTn as GenBank accession no. V01204.1, which mapped to the U3 region of a spleen necrosis virus. Note that TK gene expression in the vector used (40) was driven by a spleen necrosis virus promoter, and thus it seems likely that the apparent 1-base insertion in a nonhomologous crossover junction here arose via splinting recombination with a distal portion of the vector RNA.
|
Crossover site distribution and hot spots for nonhomologous recombination. Figure 6B displays the donor departure and acceptor entry sites for each of the 83 analyzed nonhomologous recombination products, whether or not tertiary templates were used prior to acceptor template association. In this figure, the donor RNA's 5' end was assigned position 1.
Analysis of crossover junctions indicated that whereas a broad range of donor departure sites was used (28 different departure sites were used among the 83 clones), two departure sites (positions 48 and 50) were used significantly more frequently than others (14% and 22% of the total clones, respectively), and fully one-half of all products resulted from template departure in the interval from donor template position 48 through position 52 (Fig. 6). Unlike the simple recombinant junctions, which were fairly widely distributed, the initial template switch for all but two of these donor departure hot spot products was to a tertiary template segment (see Table 1).
Many of the observed hot spot donor sites directed template switching to acceptor positions that were themselves hot spots. For example, 10 out of 18 template switches from position 50 used 2 bases of microidentity to arrive at splinting template hot spot acceptor position 4894. Similarly, 10 of 12 jumps from hot spot position 48 were directed to the hot spot acceptor at position 6263, using 3 bases of identity. Note however that although donor departure sites and splinting template entry sites displayed hot spots, there was significant variation in the length of splinting tertiary template that was reverse transcribed after hot spot transfer and before subsequent template switching to the engineered human acceptor. For example, clones 2 and 71 share a donor/tertiary template crossover junction, but 156 bases of tertiary template were copied in clone 2 and only 63 bases of the tertiary template were reverse transcribed in clone 71.
In other cases, a single donor hot spot was joined to many different acceptors (for example, donor position 60 was joined alternately to acceptors 2494, 3032, and 3073), and in some cases, a single acceptor hot spot was joined to many different donors (e.g., a minus DNA splint, donor sites 51 and 59, and an undetermined splinting template were all observed joined to acceptor position 2358 in different clones).
Among hot spot products, some pairs and triples possessed identical sequences. Confirming that these were not sibling integrants but instead independent reverse transcription products, the colonies containing the independent identical sequences were isolated in each instance from separate plates of infected cells; in most cases, they were products of virions produced by separate transfections.
General properties of crossover junctions and template segments. Of the 83 nonhomologous recombination products analyzed here, about one-third were simple recombinants and two-thirds were complex splinted recombinants. In addition to the initial tertiary template, at least one additional template contributed to the synthesis of 20% of the splinted complex nonhomologous recombinants. One product (clone 51 in Table 1) appears to have been synthesized using a total of five template segments. The distribution of crossover-associated donor/acceptor microidentity lengths is presented in Fig. 8A. As shown in Fig. 8B, splinting template segments varied in length from 1 to 280 bases (median length, 63 bases).
![]() View larger version (22K): [in a new window] |
FIG. 8. Properties of crossover junctions and splinting templates. (A) Distribution of junctional microidentity lengths. Black bars represent identities observed at simple recombinants' crossover junctions; hatched bars are complex recombinants' junctional microidentities; und represents junctions for which identity lengths were not determined because a template segment was not identified. (B) Splinting (3°) templates' lengths. Hatched bars indicate minimal template lengths for inserts with unidentified templates (e.g., the length of the insert, whereas for identified templates most included flanking microidentity residues); black bars represent assigned template segments. (Bottom) An expanded view of template length distribution for shorter insert segments shown at top.
|
|
|
|---|
An interesting finding here was the sharp cutoff in acceptor recognition observed with incremental changes in identity lengths. Specifically, while 16 bases served as efficiently as 30 bases, a 15-base acceptor template displayed a sharp 1-log decrease in titer. Titers for acceptors of 10 to 15 bases in length were similar to one another, but a further sharp 1-log drop was observed for an 8-base acceptor. These template lengths are suggestive of the way RT is known to interact with its primer/templates. When RT reaches the end of a template, template degradation by its RNase H activity produces a residual 15- to 19-base oligoribonucleotide, and subsequent slower secondary cleavages can produce 8- to 10-base RNA products (4, 11, 17).
These findings regarding acceptor length can be interpreted in the context of the acceptor invasion model for minus-strand recombination, in which RNase H cleavage of the template strand is a required step (5, 8). Template degradation unmasks nascent minus-strand DNA to serve as a docking site for the invading acceptor template, with the region between 10 and 19 bases behind the RT growing point being especially critical to acceptor template engagement in reconstituted reactions in vitro (7, 50). Thus, speculatively, the results here may suggest that acceptor template engagement and physical interactions within RT's footprint are critical intermediates in template switching. As postulated by the model presented in Fig. 9, these interactions are most effective at engaging a productive acceptor template when RT's polymerase active site is still engaged at the primer terminus (Fig. 9A), but interactions retained after the enzyme's repositioning to generate RNase H secondary cleavages also can serve, to a lesser degree, in acceptor template recruitment (Fig. 9B).
![]() View larger version (25K): [in a new window] |
FIG. 9. Model for RT interactions during acceptor invasion-mediated recombination. (A) RT (represented as a peanut-shaped object) after reaching the 5' end of the donor template. The figure represents the distance between RNase H and DNA polymerase active sites as 15 bases; donor template cleavage is represented with a dashed line; the invading acceptor is shown as a gray line. (B) RT translocated after secondary RNase H cleavages. The acceptor template is shown base pairing with nucleotides 9 to 13 from the 3' end of the nascent pretransfer DNA. Models for acceptor invasion call for subsequent branch migration to subsequently realign the primer strand 3' end onto the acceptor template (5).
|
Several previous studies have addressed nonhomologous recombination (12, 19, 40, 56, 60, 69). This one differs both in attempting to force recombination at a discrete position and in the extensive collection of products analyzed. It was surprising that only a single product herein resulted from synthesis to the end of the donor template followed by microidentity-guided switch to the human sequence acceptor. The copy choice model for genetic recombination suggests that when RT reaches a broken template end, the enzyme:primer-template complex dissociates and acceptor templates are located by primer terminal complementarity (8). However, the decreased titers and the low levels of transfer observed using limiting identity templates suggest that predominantly dead-end products form if RT reaches a donor template's 5' end in the absence of an acceptor template with subterminal identity.
The use of tRNAs as bridging templates that span nonhomologous recombination junctions has been reported previously (6, 19, 26). tRNAs and sequences from other host RNAs known to be packaged by retroviruses were observed for some insertions here (18, 38, 61). Low levels of host messenger RNAs are also packaged by retroviruses (30, 51), but the absence of producer cell mRNA sequences from any of the 83 clones is suggestive of the possibility that within individual virions, copackaging of genomic and host mRNAs is a rare occurrence. In contrast, patching in of sequences encoded downstream of the vector's polyadenylation signal—a process analogous to prevailing models for oncogene transduction—was observed for over one-third of all products analyzed (54).
One product's insertion was derived from intronic sequences from the recipient cell. Although using a recipient cell intronic sequence as a recombination substrate does not readily reconcile with our understanding of how and where reverse transcription takes place, recombination with infected recipient cell sequences has previously been described (36, 60), and recombination with integration-site unlinked noncoding sequences also has been reported (55). Thus, there is precedence for the low-frequency incorporation of recipient cell and noncoding host genomic sequences into defective retroviruses.
Previous work has identified hot spots for retroviral recombination in homologous and homologous recombination templates (14, 71). This study's nonhomologous crossovers occurred at many positions, but some template departure and entry sites were used more than others, with hot spots in both donor and acceptor sequences. Microidentity was involved in
90% of this study's nonhomologous crossovers, which is a greater fraction than previously implicated (68), and some frequently used crossover sites displayed junctional microidentity. However, other crossover sites that were used multiple times were identity independent. Some "warm spots" appeared half site autonomous, in that a single donor departure site was used with any of several entry sites, or one acceptor was joined to any of several donor departure sites. RNA structures have been implicated for some but not all hot spots for template departure, presumably due to pausing during DNA synthesis, and other work has addressed homopolymeric regions and other features as promoting homologous template switching (28).
Parameters governing nonhomologous transfer hot spots here were not rigorously explored. However, observations such as the strong use of the alternative 5-base acceptor in the presence of acceptors of less than 10 bases in length provided clear evidence of positional advantages. The nature of these is unclear. Previous studies have reported RNA secondary structures as biased positions for nonhomologous recombination (12), and RNA structures have also been implicated as contributing to acceptor template hot spots (35). Here, there were some instances where such factors appeared operational but others where they were not. This suggests that many different parameters can lead to recombination hot spots and/or that some other unifying property of these positions—for example, their induction of sequence context-dependent biases in the RT translocation state (32)—led to the observed recombination hot spots.
Interestingly, whereas the average distance that RT copies a viral genomic RNA segment before switching to the copackaged RNA is roughly 1 to 3 kilobases (21, 37, 49), the median splinting template length in the current report was only 63 bases. Pathak and Temin postulated that oligoribonucleotide remnants of genomic RNA generated the short insertions they observed in nonhomologous crossover junctions (41). Here, however, most implicated template segments were in readthrough RNA regions unlikely to be subject to RNase H digestion, suggesting that some factor(s) other than template length contributed to the apparent low processivity of DNA synthesis on these tertiary template segments.
We previously proposed that short sequence insertion may be the typical signature of transductive recombination and that this process may generate many of the insertions observed in retroviral genomes (3, 57). In the present study, this hypothesis was addressed by examining products generated during forced nonhomologous recombination. Some instances of incorporation of distal genome segments at nonhomologous crossover junctions have been reported previously, and "patch repair" is often observed under selective conditions (31, 34, 36, 56). The assay used here revealed that using short portions of tertiary templates is a common way for the reverse transcription machinery to resolve forced nonhomologous recombination. The frequent use of tertiary template segments that was observed here provides experimental support for the possibility that some of the length variation observed for human immunodeficiency virus type 1 clinical isolates may reflect the patching in of short segments from discontiguous templates (16, 24, 57).
This work was supported by NIH grant GM063479 to A.T.
Published ahead of print on 5 September 2007. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»