Previous Article | Next Article ![]()
Journal of Virology, October 2003, p. 10327-10338, Vol. 77, No. 19
0022-538X/03/$08.00+0 DOI: 10.1128/JVI.77.19.10327-10338.2003
Marc Taylor, and A. S. M. Alamgir
Laboratory of Persistent Viral Diseases, Rocky Mountain Laboratories, National Institute of Allergy and Infectious Diseases, Hamilton, Montana 59840
Received 28 January 2003/ Accepted 9 July 2003
|
|
|---|
|
|
|---|
Previous studies from this laboratory have described two antigenic subclasses of polytropic virus isolates based on their reactivities with two different monoclonal antibodies termed Hy 7 and MAb 516 (28). Virtually all of the polytropic isolates were reactive with either Hy 7 or MAb 516; however, none of the viruses were reactive with both antibodies. We mapped a major determinant of the epitopes for both antibodies to a single amino acid residue in the receptor-binding region of the polytropic SU protein. Hy 7-reactive viruses contained a lysine at that position, and MAb 516-reactive viruses contained a glutamine. The reactivities to the antibodies could be changed by substitution of a single nucleotide in the coding sequence of the SU protein. Furthermore, it was found that different inoculated ecotropic MLVs gave rise to distinctly different populations of the antigenic subclasses (28). Mice inoculated with Moloney MLV (MMLV), on average, exhibited equal titers of MAb 516- and Hy 7-reactive recombinants, whereas mice inoculated with Friend MLV (FMLV) exhibited predominantly Hy 7-reactive recombinants, suggesting recombination of these MLVs with distinct groups of endogenous proviruses. Subsequent studies using chimeras between MMLV and FMLV indicated that this specificity was determined by a short sequence encoding the nucleocapsid protein and a small sequence of the protease (27). It was apparent that an understanding of the mechanism of this specificity would be facilitated by a more complete description of the endogenous proviruses involved in the generation of polytropic MLVs. A major objective of this study was to determine how the antigenic subclasses of polytropic MLVs are reflected in the population of endogenous proviruses from which they are derived.
Genomes of inbred mouse strains contain numerous proviral sequences which bear very close homology to the genes encoding the amino-terminal SU sequences found in recombinant polytropic MLVs (16, 17, 24, 46, 47). Stoye and Coffin (46) have described two major groups of endogenous polytropic MLV-related sequences, termed polytropic (PT) and modified polytropic (mPT) proviruses, that include all nonecotropic and nonxenotropic proviruses thus far identified in inbred mouse strains. A major feature distinguishing mPT from PT proviruses is a 27-bp deletion in the sequences encoding the SU proteins of mPT proviruses. It was clear from previous analyses of virus isolates that both MAb 516- and Hy 7-reactive viruses include viruses derived from PT proviruses (14, 28). Thus, the antigenic subclasses of polytropic virus isolates do not strictly correspond to mPT and PT types of recombinants. It was unclear, however, if the antigenic subclasses of recombinant virus isolates reflected major groups of endogenous proviruses or, alternatively, if one or both of the subclasses corresponded to recombination with minor populations that were the principal participants in recombination. Furthermore, if the antigenic subclasses of polytropic virus isolates reflect recombination with two major groups of endogenous viruses, it was of interest to determine how these groups relate to the PT and mPT proviruses.
We have approached these issues by the characterization of numerous clones homologous to the receptor-binding domain of polytropic MLVs from an NFS/N mouse genomic lambda library. Our findings indicate that all endogenous polytropic MLV-related proviruses identified encode either a lysine or a glutamine at the position in the SU protein corresponding to the major determinant defined for Hy 7 or MAb 516 reactivity. The Hy 7 group corresponds to the PT proviruses previously described, while the MAb 516 group comprises mPT proviruses as well as a third distinct group of polytropic proviruses that provide insight into the evolutionary relationships of polytropic MLV-related proviruses in mice.
|
|
|---|
Analysis of genomic DNA by blot hybridization. The procedure for detection of endogenous polytropic proviruses by blot hybridization has been described in detail previously (26). Briefly, DNA from late-gestation NFS/N embryos was purified as described below and digested with BamHI for 4 h with 10 U of enzyme per µg of DNA. The digested DNA and a molecular weight marker labeled with digoxigenin (DIG) were electrophoresed on a 22-cm-long 0.8% agarose gel in Tris-borate-EDTA buffer at 85 V for 17 h. DNA was transferred from the gel to positively charged nylon membranes with a vacuum blotting apparatus (VacuGene XL; Pharmacia), cross-linked to the membrane by UV irradiation, and baked at 80°C for 1 h. The membrane was blocked and hybridized in blocking solution containing a DIG-labeled 600-bp BamHI-to-EcoRI fragment cloned from the spleen focus-forming virus. After hybridization, the membranes were washed and treated according to the manufacturer's instructions. The membranes were subsequently developed by addition of anti-DIG Fab fragments and incubated with 0.25 mM CDP-Star (Tropix). Exposure of the film for a 6-µg sample lasted approximately 12 h.
Generation of an NFS/N genomic DNA lambda library and isolation of clones containing polytropic MLV-related proviral DNA. Late-gestation NFS/N embryos obtained from pregnant females were frozen and blended in liquid nitrogen to produce a frozen tissue powder. The frozen tissue was suspended in 8 volumes of lysing buffer from a DNA isolation kit (D5000A; Gentra Systems, Inc.), and the DNA was subsequently purified according to the manufacturer's instructions. Purified DNA was partially digested with MboI and sedimented on a 10 to 30% linear glycerol gradient in 100 mM NaCl-10 mM Tris HCl (pH 7.4)-1 mM EDTA. The gradient was fractionated, and a sample of each fraction was analyzed on a 0.4% agarose gel (Seakem GTG). Fractions containing fragments of approximately 15 to 25 kb were pooled, precipitated with ethanol, resuspended in 10 mM Tris-1 mM EDTA, and cloned into the Lambda Fix vector (Stratagene) according to the manufacturer's instructions. The library was screened by using a polytropic env probe excised from a plasmid containing a 620-bp BamHI-to-EcoRI fragment cloned from the Friend spleen focus-forming virus (32). The BamHI-to-EcoRI fragment is homologous to the receptor-binding domain of the SU-encoding sequences in polytropic genomes. Labeling of the probe for subsequent detection by chemiluminescence was performed by random-primed synthesis with DIG-labeled UTP by use of a DIG DNA labeling kit (Genius 2 kit; Roche). Development of the filters to identify plaques homologous to the probe was carried out as described for blot hybridization of DNAs (26).
Preparation of lambda DNA. Lambda DNA was prepared by utilizing various features of reported procedures (29, 49) along with a lambda DNA preparation kit (catalog no. 12543; Qiagen). A phage stock containing 20 x 103 to 50 x 103 PFU was plated with the MRF' bacterial strain (Stratagene) on a 100-mm-diameter NZY plate to obtain complete lysis after overnight incubation at 37°C. Phage were eluted from the plate in 5.0 ml of suspension medium (0.1 M NaCl, 0.008 M MgSO4, 0.05 M Tris-HCl, pH 7.5, and 2% gelatin) plus 10 mM MgSO4 by rocking for 2 h at room temperature. A 50-ml culture of bacteria in Luria-Bertani medium containing 10 mM MgSO4 was initiated by inoculation of 0.3 ml of an overnight culture of bacteria grown in the same medium. The culture was grown at 37°C to an A600 of 0.5 to 0.9, at which time the entire phage lysate was added to the culture and incubated at 39°C until lysis had occurred. If lysis was not apparent after 2 to 3 h, 1 ml of chloroform was added and the culture was incubated an additional 10 min. The lysate was cleared of bacterial debris by centrifugation at 5,000 x g for 10 min, and the DNA was prepared by using the lambda DNA preparation kit with the following modifications. After treatment of the lysate with DNase and RNase (solution L1), the lysate was diluted with an equal volume of distilled water and warmed at 37°C for 10 min. The lysate was then adjusted to 0.04 M ZnCl2 by addition of a freshly prepared filtered 2.0 M solution, incubated at 37°C for 5 min, and then centrifuged at 5,000 x g for 5 min at room temperature. The precipitate was resuspended in 1.5 ml of solution L3 from the kit and further processed according to the manufacturer's instructions. This procedure typically yielded 50 to 100 µg of lambda DNA.
PCR analyses and DNA sequencing. Lambda DNA was used as the template for PCRs. PCR amplification of a region of the polytropic provirus encoding the receptor-binding region of the SU protein was accomplished with a primer set consisting of oligonucleotides termed FORPOLY (GCAGTACAACGAGAGGTCTGG) and REVPOLY (GGGTCAAAGAGAACCGGGTCAC). FORPOLY is a sequence in the polymerase gene of the recombinant polytropic MLV MCF1233 (GenBank accession no. U13766) (45) from nucleotide 5519 to 5539. REVPOLY is the reverse complement of sequences in the published envelope genes of the endogenous PT provirus MX27 (GenBank accession no. M17327) and the mPT provirus MX33 (GenBank accession no. M17326) from nucleotide 710 to 731 (46) and in MCF1233 from nucleotide 6443 to 6454. PCRs in preparation for sequencing were conducted by using the puReTaq Ready-To-Go PCR system (catalog no. 27-9558-01; Amersham Pharmacia Biotech Inc.) with Pfu Turbo DNA polymerase (catalog no. 600153; Stratagene) or Platinum Pfx DNA polymerase (catalog no. 11708-021; Invitrogen) according to the manufacturer's instructions.
PCRs to distinguish PT from mPT envelope sequences utilized oligonucleotides 510MX33 (CCCTTAAGCGAGGAAACACCCCTCGG) and 897MX33RC (GCTTGGTAGGCTCCATCTACCAGGT). 510MX33 corresponds to a sequence present in MX33 and MX27 from position 510 to 535. 897MX33RC corresponds to the reverse complements of a sequence in MX33 from position 897 to 921 and a sequence in MX27 from position 924 to 948. Additional oligonucleotides used for the generation of PCR products and as sequencing primers included MIDREVPOLY (ACATTGAAGACCTGATGAGGG), MIDFORPOLY (AGCTAATGCTACCTCCCTCCTG), 820MX33 (CATGCTCCCCAGGCCTCCTC), 1125MX33 (CCCATCAGGCCCTGTGTAATACC), 1203MX33RC (AGCCCGGTGTTGCAAGCCC), 1555MX33 (GGTGGTCCTACAGAACCGGA), 1645MX33RC (CTRACGCCAGTGTGGTCCGCG), 1882MX33 (GGTGGTGCAGGCCCTGGTTC), 2016MX33RC (CTGCAGCTAGCTTGCTAAGCC), 2024MX33 (AAGCTAGCTGCAGTAACGTCCATTTTGC), 2224MX33RC (CTTCTGTGTCTGTTGCTGGTTCCGC), 2328MX33RC (CCAGAGCACTGTGCACCTC), 2359MX33RC (CAGAACATTGGCAGACACACAGG), and 2580MX33RC (GCGCGCCGAGTGTGGGAGTT). All oligonucleotides utilized in this study were synthesized by Sigma Genosys, and sequence analyses of the PCR products were carried out on an ABI model 377 DNA sequencer at the DNA Sequencing and Synthesis Facility at Iowa State University.
Alignment and phylogeny of DNA sequences. Alignment and phylogenetic analyses were performed by using the ClustalX sequence alignment program (51) in conjunction with the PAUP phylogenetic analysis program (version 4; Sinauer Associates, Sunderland, Mass.) for neighbor joining (NJ), maximum parsimony, and maximum likelihood analyses. Bayesian analysis of phylogeny was performed with the MrBayes program.
Nucleotide sequence accession numbers. The sequences in this report have been deposited in GenBank under accession no. AY219536 through AY219567.
|
|
|---|
![]() View larger version (22K): [in a new window] |
FIG. 1. Endogenous PT and mPT sequences of NFS/N mice. DNA from a prenatal NFS/N mouse (6 µg) was digested with BamHI and analyzed by electrophoresis on a 0.8% agarose gel. The gel was then blotted and hybridized to the DIG-labeled polytropic MLV env probe to detect homologous sequences. Nearly all polytropic MLVs have a BamHI site derived from the endogenous sequence near the 3' end of the pol gene and no additional BamHI sites in the env gene or LTR. Thus, the bands correspond to fragments between the BamHI site in the endogenous pol gene and the nearest BamHI site after the 3' end of the provirus.
|
![]() View larger version (10K): [in a new window] |
FIG. 2. Amplified region of endogenous proviruses encompassing the receptor-binding region of the SU protein. The diagram at the top represents an intact endogenous polytropic provirus (open rectangles) and the flanking sequences (solid rectangles). Approximate locations of the LTR, the gag, pol, and env genes, and sequences encoding the integrase (IN), the surface glycoprotein (SU), and the transmembrane protein (TM) are indicated. The expanded region at the bottom represents the PCR product of approximately 900 bp that was used for initial sequence comparisons among the endogenous proviruses. The product contains pol gene sequences encoding a portion of the IN protein, the env gene leader sequence (L), and the receptor-binding region (RBR) of the SU protein. The positions of the BamHI and EcoRI sites that define the polytropic MLV probe are also indicated.
|
900-bp PCR product contained the remaining proviral env gene sequences and the 3' long terminal repeat (LTR). Our initial sequence analyses focused on the
900-bp PCR products as well as the U3 region of the LTRs. The results of these analyses prompted the determination of the remaining intervening env sequences between the receptor-binding region and the LTR to assess the possibility that some clones represented endogenous recombinant proviruses. The NFS/N mouse genome contains numerous sequences encoding intact polytropic MLV receptor-binding regions. The PCR analyses identified clones that may contain intact receptor-binding sequences, based on the absence of large deletions in this region. However, the analyses would not distinguish sequences that were defective as a result of changes such as point mutations or small deletions that introduced a frameshift or a premature termination. We found that 27 of the 32 clones contained intact coding sequences devoid of frameshift or termination mutations within the receptor-binding region. Twenty-four of these were colinear with published sequences of recombinant virus isolates (Fig. 3), while three others contained additional small sequences of 9 bp (NJ1 and NJ2) or 12 bp (NO1) compared to the consensus sequence (Fig. 3). Five clones containing mutations that would preclude the synthesis of an intact SU protein were identified. The nature and location of these mutations are indicated in Fig. 3. It is perhaps noteworthy that one clone (ND1) had a tRNAIle rather than a tRNAMet codon as the initiator of the env gene. Endogenous proviruses in HRS/J and RFM/Un mice containing an identical mutation in their initiation codon have been described previously (35, 46), suggesting that they may represent allelic forms of the same proviral locus in different mouse strains. We also identified two distinct proviruses that had identical mutations resulting in a termination codon in the env gene, each of which was identified twice in different lambda clones (Fig. 3, NF1/NF2 and NH1/NH2).
![]() View larger version (25K): [in a new window] |
FIG. 3. Sequence properties of the 900-bp PCR products of lambda clones containing nondeleted receptor-binding regions. Diagrams depict the amplified region shown in Fig. 2. Vertical arrows indicate the positions at which additional sequences (given below) were located within the SU-encoding sequences. The approximate locations and natures of mutations rendering five of the proviruses defective are indicated. These include a mutation at nucleotide 251 in ND1 resulting in a tRNAIle as the initiator codon of env and a termination codon in pol, a frameshift mutation at nucleotide 785 in ND1, a premature termination codon in pol at nucleotide 252 in NF1 and NF2, a premature termination codon at nucleotide 385 in env in NF1 and NF2, and an identical mutation at nucleotide 385 in NH1 and NH2.
|
Sequence heterogeneity among endogenous polytropic MLV-related proviruses.
Endogenous polytropic MLV-related proviruses are very closely homologous. Although the PT and mPT subclasses of polytropic MLV-related proviruses differ by several bases within the receptor-binding region, proviruses within each of these groups are nearly identical in sequence, even when they are obtained from different mouse strains. We wished to determine if the sequence heterogeneity in this region was sufficient to uniquely identify a substantial number of the polytropic MLV-related proviruses. If not, our eventual goal of unambiguously identifying proviruses that participate in recombination would not be feasible. Among the 32 clones we have characterized, we identified 17 different proviruses distinguishable by base differences within the receptor-binding region. Most of the sequences were found only once (nine clones) or twice (six clones), while other sequences had as many as six identical counterparts in different clones. We have designated the clones in an alphanumeric fashion indicating the mouse strain (N for NFS/N), sequence identity within the
900-bp region (A to Q), and a number to identify the individual clone (clones 1 to 6) (Table 1). Sequence analyses of the remainder of the env gene and the LTR (see below) indicated that some
clones that exhibited identical
900-bp sequences were, in fact, derived from different endogenous proviruses. In this regard, the NA group is composed of two proviruses (NA2 differs from the remaining members of the NA group) and the NC group is composed of three different proviruses (NC1 and NC5 are identical, NC3 and NC4 are identical, and NC2 is unique). Thus, our sequence analyses have identified 20 distinct proviruses. Seventeen of these are nondefective within the minimal region found in polytropic MLVs and could potentially generate replication-competent recombinant viruses.
|
View this table: [in a new window] |
TABLE 1. Properties of 900-bp PCR products from lambda clones
|
Identification of PT and mPT envelope sequences.
Two groups of endogenous polytropic MLV-related proviral sequences have been described for inbred mouse strains; these groups are termed PT and mPT proviruses (46). A major distinction between PT and mPT env gene sequences is the presence of a 27-bp deletion in a region of the env gene encoding the SU protein of mPT proviruses. This deletion is located approximately 190 bases in the 3' direction from the nucleotide distinguishing Hy 7 from MAb 516 proviruses. We found that 23 of the clones corresponded to PT proviruses, while 8 corresponded to mPT proviruses (Table 1). One clone (NA6) that yielded a
900-bp sequence appears to have been truncated during the generation of the lambda library. The other clones in this group (NA1 to NA5) yielded PCR products indicative of mPT proviruses. Interestingly, the Hy 7 group was entirely composed of PT proviruses whereas the 516 group included all of the mPT proviruses as well as some PT proviruses.
The proportion of mPT proviruses among the clones was somewhat smaller than that indicated by previous reports of the relative frequencies of these two types of sequences in different mouse strains (46, 47). As noted above, sequence analyses of the remaining env gene sequences and the LTR revealed two distinguishable proviruses within the NA set of clones as well as three distinguishable proviruses within the NC set. Thus, of the 20 distinct proviruses identified, 15 correspond to PT proviruses while 5 correspond to mPT proviruses.
Phylogenetic relationships of endogenous proviral 3' pol and 5' env sequences. The Hy 7 proviruses described above were composed entirely of PT proviruses, whereas the 516 proviruses comprised both PT and mPT proviruses. Furthermore, PT proviruses of the Hy 7 group were all nearly identical in sequence to the prototypic PT provirus MX27, while the mPT proviruses, all of which belonged to the 516 group, were nearly identical to the prototypic mPT provirus MX33 (46). In contrast, the remaining proviruses of the 516 group, which were PT in that they did not exhibit the deletion characteristic of mPT proviruses, exhibited numerous sequence differences distinguishing them from previously described PT or mPT proviruses. Sequence comparisons of the proviruses with the xenotropic MLV NZB-9-1 (37) indicated that several of the differences that distinguished the 516 PT proviruses from the Hy 7 PT proviruses and the mPT proviruses were shared with the xenotropic MLV. These included several specific nucleotides as well as the presence of a 12-bp xenotropic sequence beginning at nucleotide 262 of the env gene of one of the proviruses (NO1) that was uniformly deleted in the remaining proviruses. These observations were further assessed by phylogenetic analyses of the proviral sequences.
Phylogenetic analyses were performed on alignments of the xenotropic MLV (20, 37) and 19 polytropic MLV-related proviruses that could be distinguished by sequence heterogeneity within the region we have analyzed. The region compared in the analyses included the
900-bp PCR product (Fig. 2 and 3) extended to include the region of the env gene immediately past the 27-bp deletion present in mPT proviruses for a total of
1,100 bp. We found that the proviruses segregated into three distinct phylogenetic sets (clades) (Fig. 4). All members of the Hy 7 group segregated into a single closely related clade of PT proviruses, while proviruses of the 516 group segregated into two distinct clades. Several of the 516 proviruses segregated into a clade whose members were very closely related and comprised all mPT proviruses identified. The remaining 516 proviruses, corresponding to the PT members of this group, segregated into a clade that exhibited a much higher degree of divergence among its members than the Hy 7 PT or 516 mPT clades. Also included as the most divergent member of this clade was the xenotropic MLV. It is noteworthy that the xenotropic MLV, like the PT members of the clade, encodes the major determinant for MAb 516. The confidence limits of the phylogenetic segregation into the three clades were maximal by bootstrapping analysis. Moreover, essentially identical phylogenetic trees were obtained by using NJ (Fig. 4), maximum parsimony, maximum likelihood, and Bayesian analyses (with the MrBayes and PAUP programs). It is apparent from these analyses that the 516 PT proviruses reflect sequence characteristics of phylogenetic intermediates between the xenotropic MLV and the Hy 7 PT and mPT proviruses. The provirus present in clone NB1 appears to be somewhat more closely related to the Hy 7 PT and mPT proviruses than those in NJ1 and NO1.
![]() View larger version (18K): [in a new window] |
FIG. 4. Phylogenetic relationships of NFS/N endogenous proviruses. An unrooted phylogram generated by NJ phylogenetic analysis of a sequence encompassing a 3' pol gene sequence and approximately 850 bases of the env gene is presented. This sequence corresponds to the 900-bp PCR product initially sequenced (Fig. 3) plus an additional 200 bp obtained by sequence analyses of the PCR products flanking the 27-bp deletion (Fig. 4). The analyses included the 19 proviruses distinguishable by sequence in this region as well as the analogous sequence from the xenotropic MLV NZB-9-1 (XENO). Branch lengths are drawn to scale. The identity of each clone is indicated at the branch termini. The very closely related clades containing the Hy 7 PT proviruses and the mPT proviruses are expanded (within dashed circles) in order to present the branch termini of these clades clearly. Group 516 PT viruses (NB1, NO1, and NJ1) are indicated by a brace. The robustness of the map was assessed by bootstrap analysis repeated 1,000 times. Bootstrap values are shown as circled numbers on the major branches.
|
![]() View larger version (23K): [in a new window] |
FIG. 5. U3 regions of endogenous polytropic MLV-related proviruses. Schematic diagrams are shown depicting distinguishing structural features of the U3 regions of polytropic MLV-related proviruses described by Tomanaga and Coffin (52, 53). The diagrams are adapted from similar diagrams presented in their reports. U3 types P-I and P-II correspond to the U3 regions found in PT and mPT proviruses, respectively, identified in inbred mouse strains (46). Types P-III, P-IV, and P-V correspond to U3 regions of proviruses identified in wild mouse subspecies and may reflect features that are ancestral to the P-I and P-II U3 regions. The regions exhibiting the distinguishing features, numbered 1 through 6, are indicated by boxes and include direct repeats (indicated by a repeated number with an asterisk), unique sequences, and deletions exhibited by the proviruses. In addition, the 190-bp insertion is indicated. V's extending down from the horizontal axes of the diagrams indicate regions that are absent from the various U3 types and include deleted sequences identified in P-I and P-II. The diagrams do not indicate the precise locations or sizes of the various structural features.
|
![]() View larger version (20K): [in a new window] |
FIG. 6. Structural features of U3 regions of endogenous polytropic MLV-related proviruses identified in NFS/N mice. Structural features of the U3 regions of Hy 7 PT, mPT, and 516 PT proviruses identified in this report are presented. The features are indicated as described in the legend to Fig. 5. The structures of the U3 regions of two distinguishable xenotropic MLVs are also included in order to facilitate comparison to the structure of the 516 PT provirus NO1, which lacks the 190-bp insertion characteristic of all other polytropic MLV-related proviruses.
|
NO1, the remaining 516 PT provirus identified in this study, exhibited an unexpected U3 structure. The U3 of NO1 lacked the 212-bp insertion and, as such, would be considered to possess a xenotropic U3 region (Fig. 6). In addition, the U3 of NO1 did not exhibit a small duplication near its 3' end (region 6*). This duplication is common to PT, mPT and most xenotropic MLVs but is not present in the U3 region of the xenotropic MLV Bxv1, which is the origin of the LTR acquired by MCF recombinant MLVs in AKR/J and HRS/J mice (33, 48, 52). It is possible that NO1 represents an endogenous recombinant provirus containing a polytropic env gene and a xenotropic LTR. However, other features of the NO1 U3 region distinguish it from all previously described xenotropic LTRs. In contrast to replication-competent xenotropic MLV isolates from laboratory strains, NO1 exhibits the 14-bp repeated sequence designated 1*, as do the PT, mPT, and 516 PT proviruses (Fig. 6). Furthermore, the NO1 U3 exhibits the 5-bp deletion of region 2 that is characteristic of PT proviruses but not of the U3 regions of mPT or xenotropic MLVs. Thus, the LTR of this provirus, like its associated env gene, exhibits structural features that are intermediate between those of xenotropic MLVs and those of PT and mPT proviruses.
Phylogenetic analyses of the proviral LTRs and assessment of possible endogenous recombinant proviruses. Phylogenetic analyses of alignments of the U3 regions of the proviruses (Fig. 7) yielded results similar to those of the analyses of the receptor-binding regions (Fig. 4), with some notable differences. With the exception of clone NQ1, the Hy 7 proviruses segregated into a single closely related clade of PT proviruses, while the mPT proviruses segregated into a separate distinct clade. As noted above (Fig. 6), NQ1, which possesses a PT env sequence, exhibits a U3 structure characteristic of an mPT provirus and, as expected, segregated with the mPT clade. The 516 PT proviral U3 regions of NB1 and NJ1 occupy intermediate phylogenetic positions, in agreement with the analyses of the receptor-binding regions of the proviruses (Fig. 4) as well as their structural organization, presented above (Fig. 6). The U3 region of NO1 also occupied an intermediate phylogenetic position; however, in contrast to the analysis of the env gene region (Fig. 4), the U3 region of NO1 segregated into a separate clade with the xenotropic U3 region (Fig. 7).
![]() View larger version (11K): [in a new window] |
FIG. 7. Phylogenetic relationships of the U3 regions of NFS/N endogenous proviruses. An unrooted phylogram generated by NJ phylogenetic analysis of alignments of the U3 regions of the proviruses is presented. U3 sequences extending from the 5' end of the U3 up to, but not including, the TATA box were analyzed. The analyses included all 20 distinct proviruses identified in our analyses as well as the analogous sequence from the xenotropic MLV NZB-9-1. Branch lengths are drawn to scale.
|
To assess the possibility of recombinant endogenous proviruses, the intervening env sequences between the
900 bp sequence and the U3 region were determined for 30 of the 32 clones included in this study. Determination of the NF2 and NA6 sequence was precluded by a truncation of the env gene during the cloning procedure. The sequences were aligned, and short (100- to 200-bp) segments of the alignment were subjected to phylogenetic analysis. This analysis yielded a likely recombination area for NQ1 within the 5' portion of the U3 region. Throughout analysis of the entire envelope gene, the clones segregated into the three major clades consisting of Hy7 PT, mPT, or 516 PT proviruses quite similarly to their segregation in the analysis of the
900-bp region (Fig. 4 and data not shown). Notably, NQ1 segregated with the remaining Hy 7 PT proviruses, and NO1 segregated closely with NB1 and NJ1, the remaining 516 PT proviruses. These phylogenetic positions were also maintained in the first 100 bases of the U3 region (Fig. 8A); however, analysis of the adjoining 3' sequences up to the point of the large insertion revealed a shift of NQ1 from the PT clade to the mPT clade (Fig. 8B).
![]() View larger version (13K): [in a new window] |
FIG. 8. Phylogenetic analysis of 5' and 3' segments of the U3 regions of NFS/N endogenous proviruses. Phylogenetic analyses were performed on alignments of two 5' segments and a 3' segment of the U3 region of the proviruses to assess shifts of putative recombinant provirus from one clade to another. The sequences included in the alignment are indicated by the braces in the diagram at the top of the figure, which is drawn to scale. Unrooted phylograms generated by NJ phylogenetic analysis of alignment of the segments of the U3 regions of the proviruses are presented. The analyses included all 20 distinct proviruses identified in our analyses as well as the analogous sequence from the xenotropic MLV NZB-9-1. The clones that are putative recombinant proviruses, NQ1 and NO1, are underlined and boldfaced for ease of identification. Branch lengths are drawn to scale.
|
![]() View larger version (7K): [in a new window] |
FIG. 9. Recombination region in a PT env-mPT U3 endogenous recombinant provirus. An alignment of the consensus sequence of PT proviruses (top), the corresponding sequence of the putative recombinant provirus NQ1 (center), and the consensus sequence of mPT proviruses (bottom) is presented. A 17-bp sequence in the alignment is flanked by a PT provirus-specific base on the 5' side and an mPT provirus-specific base on the 3' side. Positions in the NQ1 sequence that are identical in both the PT and mPT consensus sequences are indicated by dots, and positions that are specific to the PT or mPT consensus sequence and shared by NQ1 are indicated by a vertical line connecting the base of NQ1 to the base of the matching consensus sequence. NQ1 sequences 5' of the 17 bp exhibit nearly complete agreement with the PT consensus sequence, whereas sequences 3' of the 17 bp exhibit nearly complete agreement with the mPT consensus sequence. Arrows directed toward the 5' (PT) or 3' (mPT) sequences are intended to convey this observation.
|
|
|
|---|
Possible implications of identical mutations in different proviruses. Among the proviruses that were apparently defective, it was noted that the same, presumably lethal mutations were identified in different proviruses. In one instance (ND1,) the provirus exhibited a tRNAIle rather than a tRNAMet codon as the initiator of the env gene, a mutation identical to those found in two other mouse strains (35, 46). In another instance, an identical termination codon in the env gene was identified in different lambda clones (Fig. 3, NF1/NF2 and NH1/NH2). These observations suggest the possibility that some viruses were introduced into the germ line as replication-defective viruses, probably through viral pseudotyping. This would, in turn, suggest that there were periods during their evolution in which many of the proviruses were replicating simultaneously in the host. In this regard, analyses of endogenous proviruses of wild mice suggest that polytropic MLV-related proviruses may have entered the germ line of mice prior to the speciation of Mus musculus and undergone extensive periods of replication (52, 53). Alternatively, identical mutations may have been introduced into different proviruses by gene conversion. Convincing evidence has recently been reported indicating gene conversion between 5' and 3' LTRs of human endogenous retroviruses (22). In view of the large number of retroviral elements in mammalian genomes, the possibility of recombination at the DNA level must be considered.
Phylogenetic relationships of polytropic MLV-related proviruses in NFS/N mice.
An unanticipated result of these analyses was the finding that the 516 PT proviruses differed extensively from the Hy 7 PT proviruses, particularly considering that the basis for grouping the proviruses relied on the identity of only one nucleotide (28). Analysis of the
900-bp sequence alignments that encompassed the receptor-binding region of the proviral env gene indicated that the 516 PT proviruses occupied a phylogenetic position intermediate between xenotropic proviruses, on the one hand, and the Hy7 PT and 516 mPT proviruses, on the other. This conclusion was strongly supported by analyses of the U3 regions of the proviruses. Two of the proviruses, NB1 and NJ1, exhibited large inserts of 212 bp and additional structural features similar to those of proviral U3 regions identified in wild mice that have been suggested as possible ancestors of the PT and mPT proviruses (53). The third 516 PT provirus exhibited env sequences that segregated as a phylogenetic intermediate but did not exhibit a large insertion in its U3 region and, as such, corresponded to a xenotropic U3 region. NFS mice reportedly have a single endogenous provirus containing a xenotropic LTR termed Nfxv-1 (20), and a single xenotropic MLV has been isolated from these mice (NFS-Thy-1). Restriction enzyme analyses of the envelope gene linked to the endogenous xenotropic LTR were not in concordance with analyses of the envelope contained in NFS-Thy-1, leading to the suggestion that NFS-Thy-1 was the result of recombination between the LTR of the endogenous virus and other sequences in the mouse. A portion of the env gene sequence and the LTR sequence of NFS-Thy-1 have been reported (23, 40) and are nearly identical to the sequence of the xenotropic MLV NZB. It is clear that the NFS-Thy-1 LTR does not correspond to the LTR identified in NO1, although it is quite likely that NO1 corresponds to Nfxv-1. Barring the existence of a second xenotropic LTR in NFS mice, the origin of NFS-Thy-1 is unclear. In this regard, the probe used to identify lambda clones containing polytropic MLV-related proviruses in this study would also hybridize to xenotropic env gene sequences.
Our analyses suggest that NO1 may be a recombinant between a 516 PT provirus and a xenotropic MLV, resulting in a loss of the large insertion. This is based on a shift toward the xenotropic MLV clade in phylogenetic comparisons of 5' and 3' segments of the U3 region. However, the 3' U3 region of NB1 also appears to be more closely related to the xenotropic MLV, although to a lesser extent than NO1 (Fig. 8), suggesting that this may be a feature of the 516 PT proviruses. Rather than being a simple recombinant, NO1 could reflect features of an intermediate in a number of evolutionary schemes. It has been suggested that endogenous proviruses similar to NB1 or NJ1 are likely ancestral to the PT and mPT proviruses in a process that involved, among other changes, deletions in the U3 region of the LTR (53). Similarly, NO1 could represent an intermediate in an evolutionary process leading from a 516 PT provirus to xenotropic MLVs via a large (212-bp) deletion within the U3 region. Alternatively, NO1 could represent an earlier 516 PT provirus, possibly derived from a xenotropic MLV, that has not acquired the large inserted sequence characteristic of other polytropic MLV-like proviruses. In this regard it should be emphasized that the phylogenetic analyses do not establish an evolutionary direction progressing from xenotropic to polytropic MLV-related proviruses, as might be inferred from the phylograms (Fig. 4, 7, and 8). The common ancestor of the polytropic MLV-related proviruses and the xenotropic MLVs is not necessarily more closely related to xenotropic MLVs. The observation that the 516 PT proviruses reflect properties of phylogenetic intermediates between the xenotropic MLVs and the remaining polytropic MLV-related proviruses implies that the intermediates reflect properties of proviruses that were ancestral either to the Hy 7 PT and mPT proviruses or to the xenotropic MLVs, or conceivably to both groups (53).
The provirus isolated in clone NQ1 is a clear example of an endogenous recombinant provirus. We were able to unambiguously identify a 17-base sequence in the 5' portion of the U3 as the region of recombination. The existence of endogenous recombinant proviruses likely reflects the simultaneous infection and replication of viruses of different groups during their evolution. However, one must again consider the possibility of gene conversion as an alternative to active replication (52).
Phylograms nearly identical to that shown in Fig. 6 were obtained from analyses of short segments of the sequences, including the analyses of sequences limited to pol gene or U3 sequences. Thus, it is apparent that the term "516 PT" has little descriptive value. Accordingly, we propose that the 516 PT proviruses be termed intermediate polytropic (iPT) proviruses. This nomenclature indicates their relationship to phylogenetic intermediates and distinguishes them from the PT and mPT proviruses. Furthermore, it serves to eliminate reference to the epitope, which is simply one of multiple structural features of this group.
It seems likely that at least some iPT proviruses participate in recombination to generate polytropic MLVs. The distribution of MAb 516- and Hy 7-reactive polytropic MLV isolates is about equal (28), yet few mPT isolates have been identified. Considering that the mPT proviruses uniformly encode the major determinant for the MAb 516 epitope, many of the MAb 516-reactive polytropic MLVs may be generated by recombination with iPT proviruses. In this regard, the polytropic FMLV FrNx (1) differs from the iPT provirus NB1 at only two positions in the
1,100-bp region we have analyzed. In contrast, FrNx differs from each of the PT and mPT proviruses at more than 30 positions in this sequence. As noted above, the provirus cloned in NB1 and NB2 carries an inserted A within the SU-encoding region, resulting in a frameshift mutation which introduces a termination codon. The PT sequences of FrNx are limited to sequences 5' of this mutation, suggesting that the endogenous parents of polytropic MLVs do not require intact coding sequences outside of the region of recombination.
Identification of distinct proviruses by sequence heterogeneity.
One of the original goals of this study was to determine the extent of heterogeneity of the endogenous polytropic MLV-related provirus sequences in the minimal region of recombination and to identify single-nucleotide polymorphisms within this region that differentiate the proviruses. It is possible that the specificity of recombination observed with FMLV and MMLV (27, 28) simply reflects preferential replication of the ecotropic viruses in different cell types in which different endogenous proviruses are expressed (15, 42). However, we found that the specificity was dependent on sequences encoding the ecotropic nucleocapsid protein rather than on sequences one might expect to influence tissue tropism (27). This suggested that the specificity of recombination might reflect structural features of the endogenous proviruses rather than the tissue tropism of the ecotropic MLVs. A necessary prelude to the identification of features of the proviruses that might influence this specificity is the precise identification of the proviruses that actually participate in recombination. This determination remains elusive, due largely to the close homology of the endogenous sequences themselves and to the rate of evolution of retroviral sequences during viral replication. Furthermore, the region of comparison to assess the precise origin of the MLV sequences is limited to those sequences acquired by recombination and in many instances is restricted to 3' pol and 5' env gene sequences. A completely unambiguous identification of endogenous sequences participating in recombination requires the characterization of all of the endogenous proviruses in a mouse strain. Our analyses are not yet exhaustive, and there likely remain several endogenous proviruses not represented in our clones. Based on an estimate of 30 endogenous proviruses reactive with our probe and the 57 positive clones we have thus far isolated, we expect that approximately 85% of the endogenous viruses of NFS/N mice are represented in our clones. To achieve a 99% certainty that any individual provirus is represented will require the isolation of
135 clones. We are continuing to characterize the clones that did not yield 900-bp PCR products and are isolating additional clones to attain this level of confidence.
We found that almost all of the endogenous proviruses we have identified thus far can be distinguished by sequence heterogeneity within the minimal region of recombination found in polytropic MLVs. This includes 17 proviruses distinguishable within this region and 3 additional proviruses (NA2, NC2, and NC3) distinguishable by sequences immediately downstream of this region. Alignments of the proviruses reveal numerous single-nucleotide polymorphisms that are unique to individual proviruses as well as several that are diagnostic of each group of proviruses (Fig. 10). These markers can be detected in samples by a variety of techniques (2, 3, 34) and should greatly facilitate the identification of proviruses that participate in recombination. Furthermore, endogenous proviral genes are expressed in a tightly controlled fashion at various times and in various tissues of mice (5, 30, 31, 36, 54). The expression of these sequences may have a physiological role and certainly has physiological consequences in some mouse strains, such as those that exhibit a high incidence of leukemia. Little is known about the control of their expression or, in the case of polytropic proviruses, the precise identity of the proviruses involved. The polymorphisms defined in these analyses provide valuable markers for the detailed examination of provirus expression.
![]() View larger version (43K): [in a new window] |
FIG. 10. Single-nucleotide polymorphisms characteristic of PT, mPT, and iPT proviruses and of specific proviruses. Two windows in the alignment of 19 distinct polytropic MLV-related proviruses with their consensus sequence (Con.) are shown and illustrate examples of nucleotide polymorphisms. Numbers on the left of each window indicate the nucleotide position starting from the beginning of the initiation codon for env. Nucleotides differing from the consensus sequence are boxed. A single-nucleotide polymorphism specific for PT proviruses is located at position 60. Single-nucleotide polymorphisms characteristic of mPT proviruses are located at positions 53 and 637, and a single-nucleotide polymorphism characteristic of iPT proviruses is located at position 638 in NB1. Single-nucleotide polymorphisms specific for individual proviruses are found at positions 70 (NN1) and 646 (NJ1).
|
Present address: Bio-Inova Life Sciences International, 78372 Plaisir Cedex, France. ![]()
|
|
|---|
Myb5 mutant. J. Virol. 75:522-526.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»