Previous Article | Next Article ![]()
Journal of Virology, January 2003, p. 1584-1588, Vol. 77, No. 2
0022-538X/03/$08.00+0 DOI: 10.1128/JVI.77.2.1584-1588.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Takeshi Suzuki,1 David J. Munroe,2 Claudia Stewart,2 Lynn Rasmussen,2 Debra J. Gilbert,1 Nancy A. Jenkins,1 and Neal G. Copeland1*
Mouse Cancer Genetics Program, National Cancer Institute-Frederick,1 Laboratory of Molecular Technology, SAIC-Frederick, Frederick, Maryland 217022
Received 9 August 2002/ Accepted 19 September 2002
|
|
|---|
|
|
|---|
Recently, an inverse PCR (IPCR) method was developed for cloning retroviral integration sites from mouse leukemias and lymphomas that dramatically increases the throughput of retroviral tagging for cancer gene discovery (12). This IPCR method was used to clone 419 retroviral integration sites from BXH2 myeloid leukemias and a few AKXD B- and T-cell tumors. Approximately 1.2 to 1.4 kb of mouse genomic DNA was then sequenced from each IPCR product, and the sequences, termed proviral tagged sequences (PTSs), were compared with one another to look for sequence overlaps. Sequence overlaps for PTSs cloned from different tumors indicate a common viral integration site (CIS). The probability that two or more PTSs among the 419 PTSs scored would be located in the same small region of the genome by chance alone is very low and suggests that this region contains a disease gene that is mutated by viral integration. The PTSs were also BLAST searched against the nonredundant and expressed sequence tag databases to look for coding regions of known genes or for homology with already published CISs. In this way, 13% of the PTSs could be localized to known CISs, while 11% could be localized to novel CISs. Another 10% of sequences identified known genes, but these genes were hit only once in the tumor panel, while 8% hit expressed sequence tags of unknown function. Fifty-eight percent of the sequences were noninformative.
Since retroviral integrations can cause disease by affecting gene expression over long distances (tens or even hundreds of kilobases) (11), it is possible that some of the noninformative PTSs in these experiments were also located at CISs but were missed because only a limited amount of mouse genomic sequence information was available for each IPCR product. To determine whether any of the noninformative PTSs are also located at CISs and to better estimate the percentages of PTSs that are located at CISs, we determined the chromosomal location of 131 of these noninformative PTSs with the Frederick Interspecific Backcross Mapping Panel (IBMP) (4). Cosegregation of two or more noninformative PTSs in the Frederick IBMP provides suggestive evidence of a CIS. Physical mapping with mouse bacterial artificial chromosomes (BACs) was then used to further localize these sequences in the mouse genome.
Due to the large number of noninformative PTSs that needed to be mapped, we employed a high-throughput, single nucleotide polymorphism (SNP)-based mapping method. PCR primer pairs were generated for 240 of the noninformative PTSs. The primer pairs, designed from nonrepetitive sequences, were then used to amplify 250- to 300-bp fragments of genomic DNA from C57BL/6J and Mus spretus mice, the two parents of the Frederick IBMP (Fig. 1). Of the 240 primer pairs used in these experiments, 185 (77%) amplified products from DNAs of both species (Table 1). For the other 55 primer pairs, either no PCR products were generated or they were obtained from only one DNA (Table 1). Sequence analysis of the amplification products identified sequence polymorphisms for 148 of the 185 informative primer pairs (80%). SNPs were the most common polymorphism, but insertion-deletion polymorphisms were also identified at lower frequencies (Table 1). In total, about 1 bp was polymorphic for every 101 bp sequenced. This is consistent with the long evolutionary distance between these two species (3). Each polymorphic primer pair was then used to amply 93 DNAs from the Frederick IBMP along with C57BL/6J and M. spretus control DNAs. The amplification products were then sequenced with an ABI 3700 capillary electrophoresis sequence machine. By monitoring the inheritance of the SNPs, or insertion-deletion polymorphisms, in the backcross DNAs, it was then possible to accurately position these PTSs on the Frederick IBMP map.
![]() View larger version (39K): [in a new window] |
FIG. 1. High-throughput, SNP-based mapping of PTSs cloned from mouse hematopoietic tumors. PCR primer pairs were generated for 240 noninformative PTSs (12) and used to amplify 250- to 350-bp fragments from C57BL/6J and M. spretus genomic DNA (10). Proviral DNA is shown in red, and the locations of amplification primers in the flanking cellular DNA are shown by half-arrows. Amplification reaction mixtures contained 4 ng of mouse DNA, 1 µg of forward and reverse primers, 2 U of Platinum Taq polymerase (Gibco BRL), and the buffer supplied by the manufacturer. To improve the specificity of the PCR, we used a touchdown PCR program for the first 12 cycles. The annealing temperature was decreased by 1°C from 65 to 60°C after every two cycles. This was followed by 25 cycles of 94°C for 20 s, 56°C for 20 s, and 68°C for 1 min and a final extension of 68°C for 5 min on a GeneAmp PCR System 9700 (PE Applied Biosystems). The amplification products were then sequenced to identify SNPs or insertion-deletion polymorphisms. PCR products were pretreated to remove excess primers and deoxynucleoside triphosphates before they were sequenced. Each sample was incubated with 1 U of shrimp alkaline phosphatase (Amersham)/µl and 10 U of exonuclease (Amersham)/µl at 37°C for 5 min, followed by 72°C for 15 min to denature the enzyme. The amplification products were sequenced with the ABI PRISM BigDye Terminator Cycle Sequencing kit (PE Applied Biosystems) with the forward primer. Sequence files were analyzed with PolyPhred (Washington University) for simple SNPs or with MultiALN for polymorphisms involving insertions and deletions. Polymorphic primer pairs were then used to amplify 93 DNAs from the Frederick IBMP along with C57BL/6J and M. spretus control DNA. The 205 Frederick IBMP progeny were generated by mating (C57BL/6J x M. spretus)F1 females and C57BL/6J males as previously described (4). By monitoring the inheritance of these polymorphisms in the backcross DNAs, it was possible to accurately position each PTS on the Frederick IBMP.
|
|
View this table: [in a new window] |
TABLE 1. SNP frequency in C57BL/6J (BL/6) and M. spretus DNAs
|
|
View this table: [in a new window] |
TABLE 2. CISs defined by genetic and physical mapping
|
7 Mb of mouse genome DNA, well beyond what would normally be considered to represent a CIS. To further localize these PTSs in the mouse genome, one of the PTSs that defined each CIS was used as a probe to isolate mouse BAC(s). Mouse BAC clones were isolated from two BAC libraries: the RPCI-23 C57BL/6J BAC library (Roswell Park Cancer Institute) and the CITB 129/Sv BAC library (Research Genetics). The PCR products used for the SNP analysis were also used as probes for BAC screening. We then asked whether the other PTSs that defined the CIS were also located on the same BAC. The average insert size of these BAC libraries is
190 kb. Therefore, on average, this type of analysis localizes the PTSs to an
190-kb genomic region. Since proviral integrations can affect gene expression over distances approaching 300 kb (11), this is well within the range of what would normally be considered to constitute a CIS. By use of BAC hybridization, 22 of the 40 PTSs could be confirmed to be located at CISs (Table 2). When extrapolated across the 58% of PTSs that were noninformative in previous experiments and combined with the 24% of PTSs that were located at CISs in previous experiments, these results suggest that at least 35% of the PTSs in this tumor panel are located at CISs. Among these 22 PTSs, 9 were located at five previously published CISs (Evi2, Evi8, Pim1, Evi29, and Evi31) (12), while 13 mapped to eight novel CISs (designated Cis1 through Cis8) (Fig. 2).
![]() View larger version (17K): [in a new window] |
FIG. 2. Chromosomal location of eight novel CISs identified by SNP-based mapping. CIS loci were mapped by monitoring the inheritance of SNPs, or insertion-deletion polymorphisms, in the Frederick IBMP mice (Fig. 1). The number of recombinant N2 animals over the total number of N2 animals typed for each polymorphism is shown to the left of the chromosome maps between each pair of loci. The recombination frequencies, expressed as genetic distance in centimorgans (±1 standard deviation), are also shown. The upper 95% confidence limit of the recombination distance is given in parentheses when no recombinants were found between loci. When recombination between genes was observed, gene order was determined by minimizing the number of recombination events required to explain the allele distribution pattern. The positions of loci on human chromosomes, where known, are shown to the right of the chromosome maps. References for the map positions of human loci can be obtained from LocusLink (http://www.ncbi.nlm.gov/LocusLink).
|
In the studies described here we were able to map 55% of the noninformative PTSs tested by using one set of PCR primers for each noninformative PTS. However, some primer pairs failed to amplify C57BL/6J or M. spretus DNA or, alternatively, preferentially amplified the C57BL/6J allele in interspecific backcross DNA. Both of these problems can largely be eliminated by designing a second set of PCR primers from regions of the genome that are identical in C57BL/6J and M. spretus mice. In addition, 20% of the amplification products were not polymorphic in C57BL/6J and M. spretus DNA. Again, amplifying other regions that flank these proviral integration sites can eliminate this problem.
Nearly 98% of the mouse genome sequence has recently been deposited in GenBank. This sequence information has been assembled into scaffolds averaging 16.5 Mbp in size, and the scaffolds have been mapped to chromosomes by using genetic and radiation hybrid markers. Many of the retroviral integration sites cloned from the mouse can therefore now be mapped on the mouse genome by using this sequence information. For other species, however, where retroviral insertional mutagenesis has been used to identify disease genes, it will be some time before CISs can be identified by sequence alone. The mapping method described here, combined with BAC hybridization, provides an alternative method besides Southern analysis or radiation hybrids for identifying CISs in these species. The primary requirement is that SNPs can be identified in the DNA flanking the proviral integration sites and that individuals segregating for these SNPs are available. SNP-based mapping, combined with BAC hybridization, can detect CISs that would be difficult to detect by conventional Southern analysis, because, for example, the proviral integrations at the CIS are spread out over a large genomic region. Radiation hybrid analysis also has the potential to generate false positives as well as false negatives due to PCR artifacts. These abnormal typings could obscure the identification of novel CISs. SNP-based mapping is especially applicable to species like chickens and cats, where interspecific crosses have already been produced (9, 17). Finally, SNP-based mapping can be used to determine the chromosomal location of endogenous proviruses, provided again that SNPs can be identified in the DNA flanking the endogenous provirus.
Present address: Lexicon Genetics, The Woodlands, TX 77381. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»