Previous Article | Next Article ![]()
Journal of Virology, February 2008, p. 1407-1413, Vol. 82, No. 3
0022-538X/08/$08.00+0 doi:10.1128/JVI.01983-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.

Department of Botany and Plant Pathology, Center for Gene Research and Biotechnology, Oregon State University, Corvallis, Oregon,1 Fred Hutchinson Cancer Research Center, University of Washington, Seattle, Washington,2 Department of Microbiology, Pukyong National University, Busan 608-737, South Korea3
Received 10 September 2007/ Accepted 12 November 2007
|
|
|---|
|
|
|---|
160 to 320 kbp) (7, 14), and single-celled green algae, an internal symbiont of Paramecium (Paramecium bursaria chlorella virus type 1;
330 kbp) (18), to the amoeba Acanthameoba, which may be infected by the recently discovered 1.2-Mbp mimivirus (3, 16). This virus has the largest virus genome described to date and contains as many as 911 open reading frames (ORFs).
Previously, we reported that cultured isolates of a taxonomically undesignated brown algal species in the genus Feldmannia produce a large dsDNA virus, FsV, only within reproductive cells (sporangia) (7). Algae in the genus Feldmannia typically grow in marine habitats in long filaments (
10 cm). They reproduce using a complex "alternation of generations" life cycle consisting of vegetative and/or a sexually reproductive diploid sporophytes and a vegetative haploid gametophyte stage (Fig. 1). Although it has been reported that both diploid sporophyte and haploid gametophyte are isomorphic stages, the gametophytes contain only plurilocular (multicelled) sporangia as reproductive structures, and the sporophytes contain two forms, similar plurilocular sporangia and unique, unilocular (single-celled) sporangia wherein meiosis occurs, producing haploid gametes.
![]() View larger version (16K): [in a new window] |
FIG. 1. Life cycle cartoon of Feldmannia sp. The alga has both a haploid and diploid form. Vegetative growth and germination of spores allow for complete survival in the absence of sexual reproduction. In this isolate, the sporophyte is not produced because viruses are produced in the unilocular sporangia rather than gametes. n, number of genome copies.
|
There is no evidence of virus replication in the vegetative cells or plurilocular reproductive cells of either gametophyte or sporophyte plants. However, Southern blot analysis of the gametophyte DNA probed with a virus-specific probe indicated that virus DNA was present in the host (12). In this form the viruses are stably transmitted during meiosis and vegetative growth and expressed only in cells normally programmed for meiotic, gamete production.
The icosahedral capsid of FsV is 150 nm in diameter and encloses a dsDNA genome (7). In laboratory-grown algal isolates, viruses with two genome size-classes of 158 and 178 kbp are produced (designated FsV-158 and FsV-178), whose ratio is dependent on the temperature used to grow the host (8). The two viral genomes with circular restriction maps are very similar but differ in the number of small,
170-bp repeats (8, 11). Both virus genomes are stable in the host after 7 years of monthly serial passage (8). Genes encoding DNA polymerase, protein kinases, integrases, transposases, and other enzymes have been identified in the FsV genome (9, 12, 13, 15). A comparative phylogenetic analysis of the FsV DNA polymerase with other viral DNA polymerases indicates that FsV belongs to a family of algal viruses named Phycodnaviridae (13). Villarreal and DeFilippis (20) have suggested that the FsV DNA polymerase may be the closest relative to the progenitor of all DNA polymerase delta molecules.
In the current article we identify the FsV genomic integration sites in the host algal DNA and clone the host-virus genome integration sites.
|
|
|---|
Southern hybridization.
Electrophoresed DNA was depurinated in 0.25 N HCl, denatured in 1.5 M NaCl-0.5 N NaOH, transferred by capillary action onto a nylon membrane (Hybond-NTM; Amersham) overnight, and baked for 2 h at 80°C. Probes were prepared by radiolabeling 25 to 50 ng of gel-purified DNA with 50 µCi of [
-32P]dCTP (3,000 Ci/mM) using a random primer extension kit (United States Biochemicals, Cleveland, OH). Southern blots of FsV DNA were prehybridized for 1 h in hybridization buffer (50% formamide, 7% sodium dodecyl sulfate, 0.12 M Na-phosphate buffer, pH 8.2) and then hybridized at 42°C for 6 h in 15 ml of the same buffer containing 1.5 x 107 to 3 x 107 dpm of labeled probe. Hybridized membranes were washed twice for 20 min in 0.1x SSC (1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate) and 0.1% sodium dodecyl sulfate and autoradiographed at –80°C with intensifying screens.
Cosmid library construction. Total genomic DNA was isolated from Feldmannia gametophyte culture FsP-G4A. Genomic DNA was partially digested with Sau3AI, treated with calf intestinal alkaline phosphatase, and ligated into the BamHI site of Supercos 1 (Stratagene, La Jolla, CA). The ligated DNA was packaged using Gigapack II packaging extract (Stratagene) and transfected into Escherichia coli XL1-Blue MR cells. The library was screened by colony hybridization (17) with radiolabeled DNA from previously mapped virus fragments. Cosmid DNA was isolated by alkaline lysis DNA minipreps (17). Restriction fragment order was determined by analysis of overlapping clones and by comparison to previously mapped virus clones (8).
Sequencing.
The subfragments of the DNA inserts in cosmids were cloned into the pGEM-3Zf(+) vector (Promega) in the E. coli host, DH5
(17). Template DNA for automated sequencing was prepared using QIAprep miniprep kits (Qiagen, Santa Clarita, CA). Automated DNA sequencing was performed by the Central Services Laboratory, Center for Gene Research and Biotechnology, Oregon State University, Corvallis, OR. Assembly and analysis of DNA sequence data were performed by using the GCG program package (Program manual for the Wisconsin package, version 8, Genetics Computer Group, Madison, WI).
PCR amplification of the unoccupied host integration site. PCR primers P35HL (5'-ACTGGTACCAGTATTACAATATTCGTATCATAGGTC-3') and X35HR (5'-ACTCTGCAGGGTGCCATGATGAGTGTTGTGCATACA-3') were designed to target each of the host DNA regions flanking the integration sites based on the sequence of clones P3.5-7A and X3.5-4, respectively (Fig. 2g and h), and PCR was conducted using total sporophyte DNA as a template. Template DNA (1, 10, and 100 ng of total sporophyte DNA) was mixed individually with 20 pM of the oligonucleotide primers in 1x Taq polymerase buffer with 0.2 mM concentrations of each deoxynucleoside triphosphate and 1.5 mM MgCl2. The reaction mixture was heated at 94°C for 4 min to denature the template, and 2.5 units of Taq polymerase (Gibco/BRL, Gaithersburg, MD) was added. For the first PCR, a 4-min incubation at 94°C was followed by 30 cycles of 94°C for 1 min, 65°C for 1 min, and 72°C for 1 min. After the reaction was completed, 15 µl of the PCR samples was analyzed by electrophoresis. The appropriate PCR products were gel purified and cloned.
![]() View larger version (9K): [in a new window] |
FIG. 2. Map of the integration site (IS-V and IS-A) containing clones. Location of cosmid clone C1-09 and regions used as probes for G4A library screening are depicted below the linear format of the full-length restriction map of FsV-158 (8). The dashed vertical line represents the junction regions for integration. Restriction maps of the various subclones containing integration sites are below the full-length restriction map, with their names to the left of each map. Thick lines represent host DNA. Restriction sites are as follows: B, BamHI; E, EcoRI; H, HindIII; K, KpnI; P, PstI; Sa, SalI; Sm, SmaI; Ss, SstI; X, XbaI. Refer to the text for the details of the clones.
|
|
|
|---|
400-bp cloned fragments used as probes (b); cosmid C1-09, one of five cosmids previously prepared that cover FsV-158 (c); K6.0, a 6-kbp KpnI subclone of C1-09 (d); SX1.5, an SstI-XhoI subclone of K6.0 (e); and KX2.2, a KpnI-XhoI subclone of K6.0 (f). The sequence in the virus genome (see also Fig. 4A and B) which opens (Fig. 2, solid arrows) to permit integration was designated the integration site-virus (IS-V). For nomenclature symmetry the algal chromosome site into which virus insertion occurs was designated the integration site-alga (IS-A). The left and right borders of the opened algal site were also determined. The junctions formed upon viral integration (see also Fig. 4B) are included in X3.5-4, an XhoI subclone containing algal-viral junction right (A-VJR) from FsV-158 integration (Fig. 2g); P3.5-7A, a PstI subclone containing algal-viral junction left (A-VJL) of FsV-158 (h); B4.9-T, a BamHI subclone from viral cosmid C1-41 covering the IS-V region of FsV-178 (i); EK2.4-K1, an EcoRI-KpnI subclone containing A-VJR of the FsV-178 integration (j); and P4.0-G3, the PstI subclone containing A-VJL of the FsV-178 integration (k). The bold lines in Fig. 2g, h, j, and k represent areas of the algal chromosome while the thin lines are those of the virus. The 20-kbp marker refers to the map in Fig. 2a only; all other maps should reference the 2-kpb marker.
![]() View larger version (40K): [in a new window] |
FIG. 4. (A) Sequence of the host and FsV-158 and FsV-178 around the site of integration. The sequences are numbered from –300 to +300, with zero representing the IS-A. The regions containing virus DNA sequence are represented in uppercase letters while host DNA sequence is in lowercase. The repeated sequences, described in the text, are underlined. The right arrow marks the virus sequence CCGGTT while the left arrow marks the reverse complement, AACCGG. An asterisk marks bases which vary between virus sequences. The virus integration sites and unoccupied host sites are aligned, and the FsV-158 is represented by the upper line of the couplet, and FsV-178 is represented by the lower. The conserved GC dinucleotide present in the viral genome and host integration sites is shown resulting in junctions A-VJL and A-VJR. (B). Detailed sequence representation in the close vicinity of IS-V and IS-A and the junctions created upon integration. Uppercase and lowercase letters are as described for panel A.
|
This library should contain three insert types: (i) those demonstrating solely algal DNA, (ii) those with internal DNA from the integrated virus, and (iii) those with both algal and virus DNA. The last group should contain the algal chromosome-viral junctions A-VJL or A-VJR. Over 50 cosmids that hybridized with these viral probes were selected. Restriction fragment patterns for each clone were compared with virus DNA patterns on gels, and each viral insert was mapped against previously published maps prepared for FsV-158 and FsV-178 (8). Most of the inserts produced maps identical to those already known for purified virus. These inserts were presumed to arise from viral DNA internal to the integration junction sites.
There were, in addition, distinctly aberrant insert types that hybridized with one specific probe, the PstI fragment DD isolated from C1-09 (Fig. 2b). These inserts gave different patterns compared to gels of purified virus DNA. They also failed to reproduce established FsV restriction fragment maps.
Southern blots from these aberrant cosmids were positive when probed with the entire viral probe, C1-09 (data not shown). This result suggested that C1-09 contains the viral sequence IS-V, which must open to produce right and left ends of a linearized form for integration into the algal chromosome. Furthermore, it suggested that these aberrant cosmids contained in addition the junctions A-VJL and A-VJR created upon viral integration.
Isolation of IS-V, A-VJL, and A-VJR. Based on the viral integration hypothesis, we proposed that small subfragments containing IS-V from C1-09 should disappear from gel patterns of the G4A library and be replaced by newly created fragments containing A-VJL and A-VJR. This linkage of viral and algal sequences resulted in size shifts of two restriction fragments in the G4A library. The fragments P3.5-7A and P4.0-G3 (Fig. 2h and k) were gel isolated and determined to fulfill this criterion. Through mapping and sequence analysis, they were shown to contain A-VJLs of distinct FsV-158 and FsV-178 DNA integrations.
Two subclones, a 6.0-kbp KpnI-fragment (Fig. 2d, K6.0) from C1-09 and a 4.9-kbp BamHI-fragment (Fig. 2i, B4.9-T) from C1-41 covering the same region of FsV-158 and -178, respectively, were isolated from viral cosmid libraries (8). SX1.5, a subclone from K6.0, was identified by positive hybridization to P3.5-7A and P4.0-G3 and shown to possibly contain IS-V (Fig. 2e). A second subclone, KX2.2 (Fig. 2f) from K6.0, known to reside outside of the sequence of IS-V, was isolated. Aberrant cosmids from the G4A library which hybridized with KX2.2 should contain the other junctions of both FsV-158 and -178 integrations. From these cosmids two independent A-VJR-containing subclones, X3.5-4 and EK2.4-K1, were obtained (Fig. 2g and j).
Genomic DNA of FsV, prepared from host cells cultured at 9°C and 18°C, and total G4A genomic DNA were digested with EcoRI and PstI, and Southern blots were probed with SX1.5 (Fig. 2e). Viral genomic DNA prepared from host cells cultured at 9°C, in which the large-size FsV-178 is predominant (8), showed a strong band of the expected size of 2.5 kbp and a faint band of about 10 kb from the FsV-158 (Fig. 3, lane 1). The band pattern was reversed in the viral genomic DNA prepared from 18°C in which the FsV-158 is dominant (Fig. 3, lane 2). The SX1.5 probe hybridized to five DNA fragments (Fig. 3, lane 3). The sizes of four such DNA fragments, 9.0, 4.0, 3.1, and 1.5 kbp, corresponded to the sizes of each DNA fragment containing the algal-virus junctions in the G4A library (data not shown). Unexpectedly, a 10-kbp DNA fragment containing the IS-V of the intact virus also hybridized to SX1.5 (Fig. 3, lane 3).
![]() View larger version (14K): [in a new window] |
FIG. 3. Autoradiograph of gametophyte DNA (G4A) and FsV DNA (from either 9°C or 18°C cultures) digested with both EcoRI and PstI and probed with SX1.5-K6, a subclone of K6.0 (–1064 to +334), containing the IS-V site. Positions of the size markers (in kbp) are indicated on the left. The sizes of the individual hybridization fragments are labeled on the right. The 10.0-kbp fragment corresponds to intact FsV-158, 9.0 corresponds to X3.5-4, 4.0 corresponds to P4.0-G3, 3.1 corresponds to P3.5-7A, and 1.5 corresponds to EK2.4-K1. The 2.5 kbp FsV-178 fragment is absent in the G4A lane.
|
However, the two sites in the algal chromosome where the two virus genomes integrated were entirely unique (Fig. 4A). A CG dinucleotide pair defines the integration cut site, which occurs in the IS-Vs of both FsV-158 and FsV-178. A complementary GC dinucleotide is found in the IS-A where either FsV-158 or FsV-178 integrates. Two GC sequences are reformed upon insertion at both the A-VJL and A-VJR of both the FsV-158 and FsV-178 integrations (Fig. 4B).
Based on restriction maps and sequence data, the virus portions of P3.5-7A and P4.0-G3 (Fig. 2, thin lines on maps) are similar to K6.0 from the 158-kbp genome and B4.9-T from the 178-kbp genome, respectively. These were designated A-VJL for both virus forms. Furthermore, in the virus DNA-containing regions of the A-VJR clones, X3.5-4 is similar to K6.0 (158-kbp virus genome) while EK2.4-K1 is more similar to B4.9-T (178-kbp virus genome). However, there was some variation at the nucleotide sequence level between sequences from purified viruses and their integrated counterparts. The region of homology between K6.0 and P3.5-7A differs by only a single base pair between the integration site and the KpnI site 1,020 bp distant. Greater sequence variation exists between B4.9-T and P4.0-G3; there are two restriction site polymorphisms and 13 base differences between the two sequences. The sequence between the SstI site and the integration site (1,064 bp) is identical between the K6.0 and X3.5-4 clones while the sequence of EK2.4-K1 differs at 29 positions relative to B4.9-T.
Fine structure at the integration site. The IS-V region contains short repeated sequences on both sides of the cut site (Fig. 4A). The sequence TTCGGAA occurs three times in the right border region and once to the left within 200 bp of the IS-V (Fig. 4A, underlined). The sequence CCGGTT occurs five times to the left border of IS-V and once to the right (Fig. 4A, rightward arrows). There, an inverse complement AACCGG is found once to the right in close proximity and three times to the left within 300 bp of IS-V (leftward arrows). At AACCGG sites in the left border, the repeats are contained within the larger repeat, TCAACCGGG (dotted leftward arrows). The 5-bp sequence CGCGG occurs in different regions of both algal genome border regions within 300 bp of where FsV-158 and FsV-178 integrate and is included in the longer GCCGCG sequence in the left of the integration site and the reverse CGCGGC to the right (dotted underlines). The significance or utility of these small repeats has not been established.
The IS-Vs of both viral genomes contain short inverted repeats that are capable of forming stem-loop structures often associated with the recombination sites (Fig. 5). The IS-V sequence from FsV-158 is capable of forming a 6-bp stem with a 3-bp loop while the sequence from the IS-V of the 178-kbp genome can form an 8-bp stem with a 7-bp loop (Fig. 5). In both cases the CG dinucleotide occurs in the loop structure as a part of a common TTCGATATTTGCT sequence. There are no regions of homology between the virus and algal sequences on either side of the junction sites beyond these small sequences. Stem-loops cannot be drawn based on sequences from the algal IS-A borders.
![]() View larger version (10K): [in a new window] |
FIG. 5. Stem-loop structures are found at the integration site of the small (158 kbp) and large (178 kbp) genomes of FsV. The boxes indicate the conserved CG dinucleotides. The conserved CG dinucleotide is present in both viral genomes, the host integration sites, and the resulting junctions A-VJL and A-VJR.
|
Isolation of unoccupied integration sites. Primers based on algal DNA regions flanking the A-VJL and A-VJR were used for PCR analysis of total G4A DNA to determine if an unoccupied site might exist. A DNA fragment of 587 bp was amplified (data not shown). This fragment is the expected size based on sequence data that assumes the virus is not integrated. The alignment of the PCR product sequence and algal regions of the A-VJL and A-VJR showed that the unoccupied IS-A would form the A-VJL and A-VJR upon the integration of the FsV genome (Fig. 4A).
|
|
|---|
In the present study, two virus integration sites for a large dsDNA FsV genome were identified in the gametophyte G4A line. DNA sequence alignment clearly demonstrates that integration occurred at two sequence-specific host locations, likely on the same linkage group as explained in Fig. 6. In contrast, the viral site (IS-V) that opens for preintegration linearization of the normally circular viral genome is identical for the two viral forms. All four host-viral junctions were identified and sequenced. A host site that lacked virus DNA was also identified.
![]() View larger version (7K): [in a new window] |
FIG. 6. A model for the generation of the unoccupied host integration sites. Two different virus genomes are integrated into the same host chromosome (middle). The host-virus junctions and virus genomes are labeled. The virus genome excises at sites P3.5 and X3.5 to generate a functional virus capable of replication and packaging while restoring the host sites P3.5/X3.5 (top). An alternative excision generates two FsV genomes spliced together and unable to replicate. This would bring junctions P4.0 and X3.5 into juxtaposition.
|
Based on restriction site analysis and sequence data, we suggest that clones P3.5-7A and X3.5-4 contain the borders of a single IS-A site for FsV-158. Similarly, P4.0-G3 and EK2.4-K1 define a second host location for FsV-178 (Fig. 6). Using PCR primers designed from P3.5-7A and X3.5-4, we were able to demonstrate an unoccupied IS-A from the host. We did not identify an unoccupied host site for FsV-178 using P4.0-G3 and EK2.4-K1 primers.
Unexpectedly, we amplified a fragment using primers derived from P4.0-G3 (FsV-158) and X3.5-4 (FsV-178). A possible explanation for the P4.0-G3/X3.5-4 PCR product is depicted in Fig. 6. It suggests that the two IS-A sites occur in tandem, possibly with a region of intervening host DNA. The most parsimonious explanation is that while a single excision of FsV-158 led to a proper host site, FsV-178 does not excise alone, hence, our inability to amplify such a site. However, if both viruses excise simultaneously, then the intervening algal DNA will be lost, and a hybrid site described above would be created.
The two independent IS-A sites from P3.5-7A/X3.5-4 and P4.0-G3/EK2.4-K1 show little sequence identity other than a GC motif at the site of integration and GCCGCG to the left of the integration site and its reverse complement, CGCGGC, to the right.
Two characteristics of phaeoviruses, found in FsV as well as EsV, are their long latency period and apparent lysogeny (2, 4, 18, 19). The nucleotide sequence analysis of the EsV type 1 genome (5) identified a gene with significant similarities to the carboxy-terminal, catalytic domain of an integrase. There was no report on the integration mechanism of this virus. Based on recently completed sequencing and annotation of the FsV-158 genome, an ORF with 29% amino acid sequence identity to the putative integrase gene of EsV was identified; the expression and role of this ORF in virus replication need to be elucidated.
Sequence analysis of the IS-V and IS-A, the unoccupied site, and the resulting alga-virus junctions identified a common GC dinucleotide motif (Fig. 4B). The well-studied bacteriophage
integration into the genome of its host, E. coli, shows substantial sequence homology between the integrating virus and target host DNA. The recombination sites of the phage and the bacterial host, called attP and attB, respectively, contain a common "core" 15-bp sequence (10). The core sequence is cleaved in a staggered fashion, with seven overhanging nucleotides at the beginning of recombination, and recovered at each end of the prophage genome by recombination. This process is different from retrovirus integration, where staggered cleavage of the target DNA results in duplication of host target DNA at both ends of the provirus genome while two nucleotides at each end of the long terminal repeats are lost (6). Although the GC dinucleotides in the FsV and its host genome are recovered at the A-VJLs and A-VJRs after integration, no duplication of host target sequence occurs (Fig. 3). It is not clear whether the GC dinucleotides are cleaved in a staggered fashion or in the middle, but recovery of this sequence at both junctions without any loss or duplication of the virus and host genome is uncommon.
It has been suggested that a secondary structure formed by a palindromic sequence may influence integration efficiency by affecting the A-philicity, DNA bendability, and protein-induced deformability of the DNA (21). Also, AT-rich sequence can generate DNA curvature, which makes this region accessible for recombination. No palindromic or AT-rich sequence was found in the Feldmannia genome near the integration site of the two FsV genomes. AT-rich stem-loop structures can be produced around the IS-V by positioning the GC dinucleotides in the loop area (Fig. 5). Further analyses on the function of these stem-loop structures and repeated sequences around the IS-V, including possible involvement in the interaction with viral or host factors, will help explain the mechanism of this unique integration process. The present report is the first description of the integration of an algal virus genome into its host genome, but details of the integration and excision processes and viral and host factors involved in these processes need to be further characterized.
Published ahead of print on 21 November 2007. ![]()
|
|
|---|
ller. 1995. Inheritance and meiotic elimination of a virus genome in the host Ectocarpus siliculosus (Phaeophyceae) J. Phycol. 31:823-827.[CrossRef]
ller. 1999. Persistent virus integration into the genome of its algal host, Ectocarpus siliculosus (Phaeophyceae). J. Gen. Virol. 80:1367-1370.[Abstract]
site-specific recombination. Annu. Rev. Biochem. 58:913-949.[Medline]
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»