Previous Article | Next Article ![]()
Journal of Virology, July 2008, p. 7111-7119, Vol. 82, No. 14
0022-538X/08/$08.00+0 doi:10.1128/JVI.00623-08
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Brian Stevenson,2
Alexandra Quazzola,1
Félix Naef,1
Maciej Wiznerowicz,1
Frédéric Schütz,2
Victor Jongeneel,2
Denis Duboule,1,3
François Spitz,3,
and
Didier Trono1*
School of Life Sciences and Frontiers in Genetics National Center for Competence in Research, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland,1 Ludwig Institute for Cancer Research and Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland,2 Department of Zoology and Animal Biology, University of Geneva, Geneva, Switzerland3
Received 20 March 2008/ Accepted 30 April 2008
|
|
|---|
|
|
|---|
The kinetics of lentivirus integration into the target genome defines the degree of genotypic mosaicism found in the resulting animal. Integration events occurring subsequent to the first cell division will, indeed, limit the presence of individual proviruses to only a subset of cells and, hence, condition their rate of transmission to the first generation (G1) progeny. Even though the phenotypic consequences of this mosaicism can usually be minimized through the use of vector doses high enough to induce multiple proviral copies per embryo, ensuring that all cells harbor at least one integrant, individual proviruses are subjected to different influences conditioned by their site of integration into the host genome (8). Furthermore, the integration site of individual proviruses will influence their potential to exert cis-acting effects on the host genome, a process known as insertional mutagenesis. Large-scale analyses of retroviral integration sites in somatic cells have revealed that, while vectors derived from both murine leukemia virus (MLV) and HIV favor active genes, the former tend to integrate in and around promoters, whereas the latter rather target transcribed regions (14, 19, 30). Noteworthy, when MLV vectors are used to infect early mouse embryos, they are rapidly silenced during development, in contrast to their lentiviral counterparts (9). It is not known whether this partly reflects differences in integration site selection or results solely from the sequence-specific recruitment of epigenetic repressors, such as recently demonstrated for the primer binding site-dependent KAP1/TRIM28-mediated silencing of MLV vectors in embryonic stem cells (28, 29).
The present work presents a genotypic characterization of transgenic mice obtained from oocytes injected with lentiviral vectors. It demonstrates that integration occurs rapidly following this procedure, so that the degree of mosaicism is low in the resulting animals, and that proviruses are located preferentially in genes likely to be active at the time of infection.
|
|
|---|
LAM-PCR. DNA of transgenic mice was purified by standard phenol-chloroform extraction and ethanol precipitation, and DNA of NIH 3T3 fibroblasts grown in Dulbecco's modified Eagle's medium-10% fetal calf serum (Invitrogen, Paisley, United Kingdom) was purified with a DNeasy extraction kit according to the manufacturer's recommendation (Qiagen, Hilden, Germany). Linear amplification-mediated (LAM)-PCR was carried out as described previously (18) with minor modifications. Five nanograms of genomic DNA was mixed in a 50-µl reaction volume with 1x Taq DNA polymerase buffer (Qiagen), a 200 µM concentration of the deoxynucleoside triphosphates (dNTPs) (Sigma, Buchs, Switzerland), a 2.5 nM concentration of oligonucleotides AD30 and AD31 (for all primers, see Table S3 in the supplemental material), and 2.5 U of Taq polymerase (Qiagen). Primers were extended after initial denaturation for 5 min at 94°C (hot start), followed by 50 cycles of 94°C for 1 min, 60°C for 45 s, and 72°C for 1.5 min, with a final extension at 72°C for 10 min. Dynal kilobaseBINDER beads (Invitrogen) were washed two times with 0.1% bovine serum albumin in phosphate-buffered saline, resuspended in 50 µl of binding buffer provided with the kit, and added to the PCR at a final concentration of 20 µl of beads per PCR. Extension products were captured for 1 h at room temperature under agitation and washed with water. Second strands were synthesized in 20-µl reaction mixtures containing 1x hexanucleotide buffer (Roche, Rotkreuz, Switzerland), 30 µM dNTPs, and 2 U of Klenow (Roche) for 1 h at 37°C. Beads were washed with water and digested in a 20-µl mixture containing 4 U of Tsp509I (New England Biolabs, Allschwil, Switzerland) for 1 h at 65°C. Beads were washed with water and ligated in a 10-µl reaction mixture containing 1x Fast-Link buffer (Epicenter, Dottikon, Switzerland), 1 mM ATP, 2 µl of AD25/AD26 linker cassette, and 2 U of Fast-Link ligase for 5 min at room temperature. Beads were washed with water and denatured for 10 min at room temperature with 5 µl of 0.1 M NaOH. Two microliters of this LAM product was exponentially amplified in 50-µl reaction mixtures containing 1x Taq DNA polymerase buffer (Qiagen), 200 µM dNTPs (Sigma), 500 nM concentrations of oligonucleotides AD27 and AD32, and 5 U of Taq polymerase (Qiagen). Primers were extended after initial denaturation for 3 min at 94°C (hot start) followed by 35 cycles of 94°C for 45 min, 60°C for 45 min, and 72°C for 45 s, with a final extension at 72°C for 10 min. One microliter of this PCR product was amplified in a 50-µl reaction mixture with 1x Taq DNA polymerase buffer (Qiagen), 200 µM dNTPs (Sigma), 500 nM AD33 (oligonucleotide), 400 nM AD28, 100 nM AD62, and 5 U of Taq polymerase (Qiagen). One microliter of the final PCR product was mixed with 0.5 µl of ROX GeneScan standard (Applied Biosystems, Switzerland) and 8.5 µl of distilled formamide (Applied Biosystems) and denatured for 2 min at 95°C. PCRs were separated on a 3100 sequencer (Applied Biosystems) and analyzed with the GeneScan software.
Cloning, sequencing, and annotation of LAM-PCR products.
PCRs were purified with a Nucleobond II kit (Macherey-Nagel, Switzerland) and eluted in 30 µl of 5 mM Tris-HCl, pH 8.0. Fifteen microliters of this eluate was digested overnight with 5 U of NarI (NEB) in a 20-µl reaction volume at 37°C. Of this digest, 0.6 µl was mixed in 96-well plates with 0.5 µl of pCR4-TOPO (Invitrogen) and 0.4 µl of salt solution in a final volume of 2.3 µl. After incubation for 5 min at room temperature, reaction mixtures were heat shocked into DH5
cells provided with the kit (Invitrogen). Colonies were picked into 96-well plates and sequenced at GATC Biotech, Konstanz, Germany. Sequences were extracted from chromatograms using phred (3). Vector and HIV-contaminating sequences plus U5 and LC fragments were identified with BLAT (12) and removed prior to annotation. Mouse repeats in the cleaned sequences were identified and masked with RepeatMasker (A. F. A. Smit, R. Hubley, and P. Green, unpublished data). The location of each cleaned, masked sequence in the mouse genome (NCBIm34 assembly) was determined with the Megablast program (32) using a word length of 24. The annotation procedure was automated using a series of ad hoc Perl scripts. Gene-relative mappings were determined using the ENSEMBL database (http://oct2006.archive.ensembl.org/). We defined genes as transcription units annotated in the ENSEMBL database; data tables were assembled in OpenOffice, version 2.0, and statistical analysis was performed with R or PRISM (version 4.0) software.
ISS-PCR. Integration site-specific PCRs (ISS-PCRs) were exponentially amplified in 25-µl reaction mixtures containing 100 ng of DNA, 1.5x Taq Polymerase buffer (Qiagen), 200 µM dNTPs (Promega), 0.8 µg of bovine serum albumin (NEB), 25 mmol MgCl2 (Qiagen), 5 U of HotStart Taq polymerase (Qiagen), and 200 nM concentrations of each primer. The PCR started with an initial activation step of 95°C for 15 min (hot start), followed by 10 cycles of 94°C for 45 s, 60°C (with a change of –1°C per cycle) for 45 s, and 72°C for 1 min (touchdown); this program was followed by 30 cycles of 94°C for 45 s, 50°C for 45 s, and 72°C for 1 min, with a final extension at 72°C for 10 min. Five milliliters of loading buffer (6x) was added to each PCR, and 15 µl was run on a 0.8% agarose gel for about 30 min at 80V before analysis under UV light.
RNA expression measurements. NIH 3T3 fibroblasts were cultured in Dulbecco's modified Eagle's medium-10% fetal calf serum (Invitrogen) at 37°C in 5% CO2. Oct4-GiP (kind gift A. Smith, Edinburgh, United Kingdom) embryonic stem (ES) cells were cultured as described previously (31) in the presence of 500 ng of puromycin (Sigma)/ml at 37°C in 10% CO2. RNA was isolated with Trizol (Invitrogen) and ethanol precipitation according to standard procedures; cDNA was synthesized with hexanucleotides according to the manufacturer's recommendation using a SuperScript III first-strand cDNA synthesis system (Invitrogen). Fifty unfertilized eggs or two-cell embryos were resuspended in 1 µl of RNase Out and 10 µl of resuspension buffer (SuperScript III CellsDirect cDNA synthesis system; Invitrogen) and denatured for 10 min at 75°C. DNase digestion was carried out according to the manufacturer's instructions, and 2 µl of 50 ng/µl hexamer was added with 1 µl of dNTPs. Reaction mixtures were denatured at 70°C for 5 min, and the remaining reagents were added as indicated in the protocol. Reaction mixtures were left for 10 min at 25°C before reverse transcription. RNase H-digested cDNAs were used for quantitative PCR. The expression of each gene was assayed in triplicate in a total volume of 5 µl containing 1x Power Sybr (Applied Biosystems), a 200 nM concentration of each gene-specific primer pair (see Table S3 in the supplemental material), and diluted cDNA (cultured cells, 1:17; embryonic cDNA, 1:2.6). To verify specificity, each PCR was followed by a melting curve analysis, and samples lacking reverse transcriptase were run in parallel. The increase in fluorescence was analyzed with SDS software, version 2.2.2, (Applied Biosystems). For all amplification plots, the baseline data were set with the automatic cycle threshold function available with SDS, version 2.2.2, calculating the optimal baseline range and threshold values by using the AutoCt algorithm (SDS version 2.2 user's manual, Applied Biosystems, Foster City, CA). A mean quantity was calculated from triplicate PCRs for each sample, and this quantity was normalized to two similarly measured quantities of normalization genes as described previously (24). Normalized quantities were averaged for three replicates for each data point and are represented as the means ± standard deviations (SD). The highest normalized relative quantity was arbitrarily given a value of 1.0. Relative changes were calculated from the quotient of the means of these normalized quantities and are reported as means ± SD.
|
|
|---|
(elongation factor 1
) promoter contained in this vector, it did not differentiate the rapid integration of one or more proviruses into the zygote from the delayed integration of multiple proviruses in later cells or a combination thereof. Measuring integration kinetics directly was difficult due to the small number of cells concerned. We therefore used an indirect yet highly quantitative approach to define this parameter. For this, we proceeded to a large-scale determination of the rates of G0 to G1 transmission of individual integrants, using a combination of Southern blotting and ISS-PCR. An example of Southern blotting-based analysis is depicted in Fig. 2. Tail DNA from the G0 mouse 1426 generated with the pLV-tTRKRAB-red vector (Fig. 1A) revealed that this animal harbored two proviruses. Crossing this mouse with a wild-type animal produced a G1 progeny of 20 members, out of which half were transgenic, as determined by PCR amplification of tail DNA with vector-specific primers (data not shown). Southern blot analysis of this transgenic offspring revealed that the two proviruses had overall rates of germ line transmission of 45% and 30%, respectively.
![]() View larger version (49K): [in a new window] |
FIG. 1. Lentivirus transgene expression in blastocysts. (A) Schematic representation of lentiviral vectors used in this study. Abbreviations: SIN, self-inactivating LTR; WPRE, woodchuck hepatitis posttranscriptional regulatory element; IRES, internal ribosomal entry site; cPPT, central polypurine tract; hPGK, human phosphoglycerate promoter; CAG, chicken actin-globin chimeric promoter; KRAB, tetracycline repressor-KRAB fusion protein. (B) Transduction of fertilized oocytes after perivitelline injection of pWPTS vector. eGFP expression is detected in the trophectoderm and the inner cell mass of the blastocyst (top); a blastocyst derived from an uninfected oocyte is shown as a control (bottom).
|
![]() View larger version (44K): [in a new window] |
FIG. 2. Lentiviral vector transmission detected by Southern blotting. A Southern blot of tail DNA of G1 offspring. The founder animal (G0, animal 1426) carried two copies of the pWPTS provirus. Crossing the G0 animal with a wild-type (wt) animal gave a total of 20 pups with 10 (50%) transgenic animals. Circles and squares represent males and females, respectively.
|
![]() View larger version (49K): [in a new window] |
FIG. 3. Lentivirus transmission detected by ISS-PCR. (A) Scheme of primer sets used for ISS-PCR on tail DNA targeting the integrated provirus or the integration site in G1. Primer set A plus C surrounds the lentivirus integration site (locus), and primer set B plus C targets the lentiviral vector and the downstream genome sequence (junction). (B) PCR analysis of the offspring from mouse 2267 crossed with a wild-type (wt) animal. LV, lentivirus.
|
|
View this table: [in a new window] |
TABLE 1. Integration sites of G0 mouse 2267 outside annotated genes
|
|
View this table: [in a new window] |
TABLE 2. Integration sites of G0 mouse 2267 inside annotated genes
|
|
View this table: [in a new window] |
TABLE 3. Integration sites of G0 mouse 2267 in repeats
|
![]() View larger version (19K): [in a new window] |
FIG. 4. Frequency of G0-to-G1 transmission of lentiviral proviruses. (A) All integrants were pooled to obtain a global rate of transmission of lentiviral vector. Data are given for pooled integrants containing either pLV-tTRKRAB-red or pRRL-GFP from transgenic mice. (B) Point estimates represent the actual proportion of transmission for each integrant. Error bars indicate the 95% confidence interval obtained. The observations corresponding to different integrants in the same founder were pooled, assuming an equal probability of transmission for all integrants within the founder; the pooled estimates and their associated confidence intervals are shown for each family. The confidence intervals are narrower than those obtained for single integrants since they are based on larger sets of data. Finally, all the integrants of founders carrying the same promoter were pooled to obtain an average rate of transmission of individual integrants from this vector, with confidence interval.
|
![]() View larger version (72K): [in a new window] |
FIG. 5. Transmission of lentiviral vectors to G2 mice. (A) G2 animals obtained from crossing G0 animal 2267 with three G1 animals. G2 animals were analyzed by the same PCR as described in the legend of Fig. 3. The asterisks mark gene-specific PCRs that failed to amplify the wild-type locus (mice carrying lentiviral vectors in both alleles). Every PCR was run with DNA from the founder animal amplifying all integration sites and a negative control from a nontransgenic animal (not shown). (B) The frequency (Freq.) of homozygous transmission of lentivirus integration sites 2, 4, 5, 8, and 11 in G2 animals correlates with the copy number of lentiviral vectors detected in the G1 animals. LV, lentivirus.
|
![]() View larger version (22K): [in a new window] |
FIG. 6. Distribution of lentiviral integration sites in the mouse genome. (A) Chromosomal distribution of in vivo lentiviral integration sites in transgenic mice. (B) The fraction of lentiviral vectors inside a gene was determined in transgenic mice and 3T3 fibroblasts with ENSEMBL, version 42, as a reference. To calculate P values, we applied a binomial distribution where the expected probability was taken from the whole-genome fractions. The P values were 4.85–12 for integration sites in transgenic mice (***) and 3.75–8 for 3T3 fibroblasts (**). (C) Orientation of lentiviral vectors inside a gene was compared to the transcriptional orientation of the target gene in transgenic mice and 3T3 fibroblasts. (D) The absolute distance of the lentiviral vector to the transcriptional start site (TSS) was determined in transgenic mice and 3T3 fibroblasts. (E) The fraction of integration sites in LINE/L1, endogenous retroviruses (LTRs), and short interspersed nuclear elements (SINE) in 3T3 fibroblasts and transgenic mice was divided by the fraction of these repeats in the mouse genome.
|
Integration appears to favor genes expressed during early embryogenesis. In order to test whether integration favored genes transcriptionally active during the preimplantation stages of the embryo, as previously observed in adult cells, we used quantitative PCR to compare the expression patterns of 10 randomly selected lentivirus target genes in two-cell embryos, ES cells, and 3T3 fibroblasts (Fig. 7A). Expression of these genes in two-cell embryos was from 2 to 42 times higher than that of housekeeping genes taken as controls (left panel). In ES cells and 3T3 fibroblasts, their expression was generally lower than in two-cell embryos (Fig. 7A), but this in part reflected higher average levels of RNA transcripts in the latter cells (9.5-fold difference; one-way analysis of variance, P < 0.0001) (Fig. 7B). Additionally, we tested the impact of maternal RNA expression levels by comparing the expression of eight lentivirus target genes in unfertilized oocytes with expression in two-cell embryos. Two of the genes (5830435K17Rik and Bbs4) had similar expression levels in both settings, suggesting that their transcripts might be predominantly of maternal origin (Fig. 7C). In contrast, expression levels of the remaining six target genes were higher in two-cell embryos than in unfertilized oocytes.
![]() View larger version (15K): [in a new window] |
FIG. 7. Expression of lentivirus target genes in different mouse tissues. Transcript levels of lentivirus target genes were determined by quantitative PCR cDNAs synthesized on two-cell embryos, ES cells, and 3T3 fibroblasts RNA. PCR on cDNA synthesized without reverse transcriptase were negative (not shown). Expression levels were calculated relative to the function of three normalization genes included in each PCR. All PCRs were performed in triplicate. (A) Expression levels of 10 arbitrarily chosen lentivirus target genes in the indicated tissues. Shown are mean values of three independent experiments. (B) Box plot of the expression levels determined in panel A plus a control in which RNA isolated from two-cell embryos mixed with 10 ng of 3T3 fibroblast RNA served as templates for the cDNA synthesis. Expression levels determined in two-cell embryos and ES cells were compared by one-way analysis of variance (***, P < 0.0001). (C) Expression levels of eight lentivirus target genes in unfertilized oocytes and two-cell embryos. Shown are mean values ± standard errors of two independent experiments.
|
|
|
|---|
We also determined the genomic distribution of the proviral integrants generated by lentivector-mediated transgenesis. Following LAM-PCR-mediated amplification and sequencing of the host genome-provirus junctions, we could successfully locate more than 85% of the integration sites using a Perl script, which automatically removes cloning vector sequences, masks repetitive elements, and aligns the remaining polynucleotidic stretches with the mouse genome. This stepwise procedure enabled us to assign LAM-PCR-generated sequences to the mouse genome even when part of the amplicon was constituted by repetitive sequences, a point verified by ISS-PCR. This latter method, which dually targets the provirus-genome junction and the targeted locus, was previously used to trace retrovirally marked repopulating hematopoietic stem cells. Here, it allowed us to identify unequivocally transgenic G2 mice that were homozygous for specific lentiviral integrants following G0-G1 crosses. The observed frequency of homozygosity was below the 25% expected from a purely Mendelian mode of transmission. Considering the rapid kinetics of integration, it is unlikely that these non-Mendelian transmission rates reflect mosaicism in the germ line. Rather, some of the homozygous G2 mice must be nonviable due to the target gene inactivation potential of retrovirus integrants. However, the normal rates of G1 representation of proviruses that did not achieve homozygosity in G2 suggest an absence of toxicity in the heterozygous state. It remains that our series is very small and that a much larger study would certainly be needed to probe this issue and to confirm that homozygous lentiviral integrants are significantly counter-selected due to gene inactivation.
It was previously noted that the lentiviruses HIV, simian immunodeficiency virus, equine infectious anemia virus, and feline immunodeficiency virus and the vectors derived thereof exhibit similar integration patterns in human, monkey, and murine cells (1, 2, 6, 7, 10). Here, we extend the analysis to transgenic mice generated by infection of fertilized oocytes. We found that, in this case, HIV-derived lentiviral integrants favor genes, where they integrate with a tendency for the middle of the transcribed region, a trend already noted in human cells. One explanation could be that genes form a loop, with the transcriptional start and termination sites serving as bridging points and the tip of the loop protruding to the outside, which makes it more accessible for integration. Alternatively, occupancy of the transcriptional start and termination regions by regulatory proteins might interfere with this process. However, the accumulation of MLV integrants within or near promoters makes either one of these simple models unlikely. Instead, it is tempting to postulate that retroviral preintegration complexes interact with the RNA polymerase II holoenzyme and associated proteins and/or recognize specific histone modifications. Histone 3 dimethylation at lysine 4 correlates with active transcription and accumulates in the middle of genes, as do lentiviral integrants (17). In contrast, histone 3 monomethylation at lysine 4 and diacetylation at lysine 14 cluster at the transcriptional start sites, reminiscent of the MLV integration pattern. Thus, chromatin modifications or factors catalyzing these modifications might act as key players in the viral DNA-host genome interface, as recently suggested (25).
We observed a subtle difference in the integration patterns of lentiviral vectors in 3T3 fibroblasts and transgenic mice, with a slightly higher frequency of integration into repeats in the latter setting. Although this difference was statistically not significant, it might reflect the elevated transcriptional activity of repetitive elements in preimplantation embryos (15, 16, 21). In support of this model, we found that genes targeted by the lentiviral integrants in transgenic mice generally had higher levels of expression in two-cell embryos than in adult tissues. However, the constantly evolving picture of expressed regions of the genome, now known to encompass far more than conventional genes, as well as evidence indicating that genes that are poised for activation may share chromatin marks with actively transcribed genes, call for caution in establishing overly strict correlations, based on monomethylation at lysine 4, between "gene expression" and retroviral integration site selection.
This work was supported by the European Union as part of the 6th Framework Program CONSERT project and by the Swiss National Science Foundation.
The authors declare that they have no conflict of interests.
Published ahead of print on 7 May 2008. ![]()
Supplemental material for this article may be found at http://jvi.asm.org/. ![]()
Present address: Merck Serono International S.A., 9 Chemin des Mines, 1202 Geneva, Switzerland. ![]()
Present address: EMBL, Meyerhofstr. 1, 69117 Heidelberg, Germany. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»