Previous Article | Next Article ![]()
Journal of Virology, January 2008, p. 582-587, Vol. 82, No. 1
0022-538X/08/$08.00+0 doi:10.1128/JVI.01451-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Laboratory of Mammalian Genomics, Department of Animal Biotechnology, Konkuk University, Seoul 143-701, Republic of Korea
Received 3 July 2007/ Accepted 15 October 2007
|
|
|---|
4, and
9 families. PCR with OERV family-specific primers revealed an additional ERV that was classified into the bovine endogenous retrovirus (BERV)
7 family. In conclusion, here we report the results of a genome scale study of the BERV. Our study shows that the ERV family expansion in cattle may be somewhat limited, while more diverse family members of ERVs have been reported from other artiodactyls, such as pigs and sheep. |
|
|---|
(C-type) genera (3, 20). To date, analyses of the ERVs in the genomes of several mammals, including humans, pigs, and sheep, have revealed the presence of multiple different families (1, 2, 4, 5, 7, 10, 11, 13, 15, 16, 17, 19). For example, in the sheep genome, 12 different ovine ERV (OERV) families were detected. These were classified into three β families (β1 to β3) and nine
families (
1 to
9) (11). However, except for one pro/pol sequence from murine leukemia virus-related retrovirus (MLVRT1-BoEV) (19), no other sequence information regarding ERVs residing in the cattle genome is available. It is known that human ERVs (HERVs) comprise approximately 8% of the human genome (6). Thus, the identification of the ERVs in the bovine genome will be helpful for the annotation of the bovine genome; it will also improve our understanding of ERVs. We searched the bovine genome for the conserved pro/pol nucleotide sequences of ERVs by PCR using degenerate primers, which contain the active site motifs DTGA of protease (PR) protein and YMDD or YVDD of reverse transcriptase (RT) protein (7, 10, 11, 18). Since we found that all ERV clones obtained by degenerate PCR were closely related to three specific OERV families, we also subjected the bovine genome to PCR using OERV family-specific primers.
Degenerate PCR amplification of BERV sequences. We isolated the genomic DNA from one Korean Yellow native female cow (Bos taurus) by a simple lysis method (14). The template DNA was subjected to PCR using Taq polymerase and six pairs of degenerate primers previously used for successful amplification of ERVs in pigs, sheep, and other vertebrates (7, 10, 11, 19). The two 5' and three 3' oligonucleotides, which consist of six pairs of primers, are 5' primers against PR protein active site motifs, 5'-GT(T/G) TTI (G/T)TI GA(T/C) ACI GGI (G/T)C-3' and 5'-(C/T)TI (T/G)TI GA(T/C) ACI GGI GCI (G/C)I-3'; and 3' primers against RT protein active site motifs, 5'-AGI AGG TC(A/G) TCI AC(A/G) TA(C/G) TG-3', 5'-ATI AGI A(G/T)(A/G) TC(A/G) TCI AC(A/G) TA-3', and 5'-ATI AGI A(G/T)(A/G) TC(A/G) TCC AT(A/G) TA-3' (I stands for inosine). PCRs consisted of 2 min at 80°C, followed by 35 cycles of a 45°C annealing step for 30 s, extension at 74°C for 60 s, denaturation at 94°C for 30 s, and finally, one cycle at 45°C for 3 min and 74°C for 10 min. Reaction conditions were as follows: 40 pmol of each primer, 200 µM deoxynucleoside triphosphates, PCR buffer (10 mM Tris [pH 8.3], 50 mM KCl, 1.5 mM MgCl2), 100 ng of genomic DNA, and 2 U of Taq polymerase in a 25-µl reaction volume (7, 18, 19). The amplified fragments were separated on 1.3% agarose gels, extracted, and cloned into the pCRII vector (Invitrogen, Carlsbad, CA). Cloned inserts were sequenced in both directions with an ABI Prism BigDye Terminator cycle sequencing ready reaction kit (Applied Biosystems) on ABI 3700 automated sequencers (Applied Biosystems). Sequencing results were analyzed by BLAST searches against the nr database at NCBI (http://www.ncbi.nlm.nih.gov/BLAST/) and the bovine genome database at Baylor College of Medicine (http://www.hgsc.bcm.tmc.edu/blast.hgsc). Full sequences of clone inserts from the PR-RT region were assembled by overlapping forward and reverse sequencing products. Ambiguous sequences or unique polymorphisms were confirmed by resequencing the region in question using the same clone. Of the 105 unique sequences, 64 (61%) of a length of 0.7 to 1.0 kb contained the retroviral PR-RT characteristic motifs and showed significant matches (expected [E] value, <10–10) with other ERVs from GenBank BLAST analysis, indicating that these sequences were of retroviral origin. Among the 68 clones from retroviral origin, 8 clones of four different pairs showed redundancy, thereby indicating a low rate of PCR errors in the amplification process, as reported previously (17).
We then compared the BERV sequences to the pro/pol regions of the OERV families. The neighbor-joining tree constructed with the BERV and OERV nucleotide sequences using the MEGA (version 3.1) package (12) revealed that 52 of the 64 BERV clones from degenerate PCR could be classified into the BERV
family, while the remaining 12 clones could be classified into the BERV β family (Fig. 1A). More specifically, since the 12 BERV clones clustered most closely with the OERV β3 family (Fig. 1C), we classified the 12 clones as belonging to the BERV β3 family. The 52 BERV
family clones were classified into the BERV
4 and
9 families. We identified 370 bovine bacterial artificial chromosome clones containing the cloned pro/pol sequences by BLAST search against the bovine genome database. When the flanking regions of the matched pro/pol sequences within the available bacterial artificial chromosome sequences were analyzed, long terminal repeat, gag, and env genes, which are the characteristic of ERV genomic structure (3, 5), were identified (21). The pro/pol clones, having the fewest nonsense mutations and being most closely related to potentially infectious ERV (17), were selected as representatives of each family and deposited into GenBank. Nomenclature of the families was carried out according to the results of studies of sheep (11).
![]() View larger version (23K): [in a new window] |
FIG. 1. Analysis of the pro/pol nucleotide sequences of the 12 BERV β3 clones. (A) Classification of the BERV β3 and families relative to the pro/pol nucleotide sequences of the OERV β and families. A neighbor-joining tree was constructed using 65 BERV and 14 OERV sequences. The BERV 4, 7, and 9 families cluster closely with the corresponding OERV families, while the BERV β3 family is closely related to the OERV β2 and β3 families. The nucleotide sequences of OERV families used here have the following GenBank accession numbers: AY193896 (OERV 1A), AY193898 (OERV 1B), AY193899 (OERV 1C), AY193900 (OERV 2), AY193903 (OERV 3), AY193905 (OERV 5), AY193906 (OERV 6), and AY193908 (OERV 8). (B) Comparison of the 22 nucleotides containing variable bases among 12 BERV β3 790-bp pro/pol sequences. Individual nucleotides that are polymorphic are shown in shades, and the nucleotide positions containing variable bases in the sequence are indicated at the top. The names of the amplified BERV β3 clones are marked on the left. (C) Relationship of the 12 BERV β3 sequences to the three OERV β families. A neighbor-joining tree was constructed using the pro/pol nucleotide sequences of the 12 BERV β clones and the three OERV β families (β1, accession no. AY266332; β2, accession no. AY193894; and β3, accession no. AY193895). Bootstrap values of above 50 from 500 replicates are indicated at the branch nodes.
|
4 pro/pol sequences with 933 bp examined, 508 nucleotides positions (54.4%) were variable (data not shown), which indicates that the β3 family is less variable than the
4 family, although the sequence lengths and identified numbers of β3 clones are less than those of the
4 family. The BERV β3 clones clustered separately from the OERV β families and converged into a single family in the neighbor-joining tree (Fig. 1C). Phylogenetic analysis using the Pro/Pol amino acid sequences from multiple retroviruses showed that the BERV β3 family clustered with the betaretroviruses (Fig. 2). In contrast to BERV
4, the lower intersequence variation of BERV β3 is consistent with the identification of a single β family member in the bovine genome from our analysis.
![]() View larger version (12K): [in a new window] |
FIG. 2. Phylogenetic analysis of previously identified retroviruses and BERVs based on the amino acid sequences of the pro/pol region using the neighbor joining method. Numbers at the branch nodes denote the bootstrap values (>50) from 500 replicates. The BERV β and families clustered with betaretroviruses (jaagsiekte sheep retrovirus [JSRV], mouse mammary tumor virus [MMTV], simian sarcoma virus [SMRV], Trichosurus vulpecula retrovirus [TvERV], and HERV-K) and gammaretroviruses (PERV-A, PERV-B, PERV-C, gibbon ape leukemia virus [GALV], feline leukemia virus [FeLV], baboon endogenous virus [BaEV], MLVRT1-BoEV, and MLVRT5-MiEVI), respectively. The viruses and GenBank accession numbers of the sequences used in the phylogenetic tree are as follows: JSRV, AAA89182; MMTV, AAA46542; SMRV, AAA66453; TvERV, AAF36395; HERV-K, CAB56603; and murine ERV-L (MuERV-L), CAA73251.
|
4 clones.
Of the 57 BERV
family clones, 51 were classified as members of the BERV
4 family as they showed more than 90% homology to the OERV
4 pro/pol sequence (GenBank accession no. AY193904). Phylogenetic analysis indicated that these clones could be assigned to four distinct subfamilies, namely,
4-A,
4-B,
4-C, and
4-D (11, 11, 11, and 18 clones, respectively) (Fig. 3).
![]() View larger version (24K): [in a new window] |
FIG. 3. Phylogenetic analysis of the BERV 4 pro/pol nucleotide sequences. The tree was created by using the neighbor-joining method with 600 bootstrap replicates. The genetic distance (0.01) is defined. BERV 4 sequences were further classified into the 4-A, 4-B, 4-C, and 4-D subfamilies.
|
4 family was grouped with porcine ERV (PERV)-A (GenBank accession no. AAL87853), PERV-B (CAB65341), PERV-C (AAC16764), gibbon ape leukemia virus (accession no. AAA46810), feline leukemia virus (accession no. AAA93092), baboon endogenous virus strain M7 (accession no. BAA89659), and a murine leukemia-related virus strain (MLVRT5-MiEVI; accession no. X99928) (Fig. 2). The result of the phylogenetic analysis on the respective pro/pol nucleotide sequences was also consistent with that of amino acids (data not shown).
Characterization of the BERV
9 clone.
The remaining clone belonged to the BERV
9 family because it showed 74% and 62% homology with the OERV
9 pro/pol nucleotide and amino acid sequences, respectively (GenBank accession no. AY193909). Upon comparison with the amino acid sequences of the pro/pol region from multiple retroviruses, the BERV
9 clone matched that of murine leukemia virus-related retrovirus MLVRT1-BoEV (GenBank accession no. X99924), with 99% nucleotide identity. Endogenous MLVRT1-BoEV has also been shown to have a close relationship with OERV
OvEVII (GenBank accession no. X99932), indicating the presence of strong sequence similarity between ERVs in cattle and sheep (19).
Identification of an additional BERV family by PCR using OERV family-specific primers.
To confirm the results of degenerate PCR analysis, family-specific primers of ERVs were designed and genomic DNAs from pigs, cattle, and sheep, which belong to the mammal order Artiodactyla, were subjected to PCR using the primers. As shown in Fig. 4A, PCR with degenerate primers revealed that the BERV families were highly homologous to OERVs. However, either very weak or no amplification resulted from PCR using pig DNA. Consequently, we speculated that we might be able to identify more BERV families by amplifying the bovine genome with primers that are specific for the pro/pol sequences of the OERV β1 and β2,
1A to
3, and
5 to
8 families. Thus, nine pairs of primers with slight degeneracy in their 3' ends were designed. Each PCR mixture contained 50 ng genomic DNA, 4.5 pmol each primer, and 25 µM deoxynucleoside triphosphate. The quality of genomic DNA was confirmed by efficient PCR amplification with a control primer. Only the OERV
7 (GenBank accession no. AY193907) primers generated a specific PCR product (Fig. 4B). Five clones from this product were sequenced. Sequence analysis revealed that they had 94% nucleotide identity to OERV
7 and also clustered closely with OERV
7 in phylogenetic analysis (Fig. 1A). Thus, the bovine genome also contains a BERV
7 family ERV.
![]() View larger version (32K): [in a new window] |
FIG. 4. Detection of a new ERV family in the bovine genome by OERV family-specific PCR amplification. (A) ERVs detected in porcine, bovine, and ovine genomes by degenerate PCR amplifying the pro/pol region. The β3 primers were designed according to sequence homology between pig and sheep ERVs, while the 4 and 9 primers were designed according to sequence homology between cattle and sheep. Pig genomic DNA was not amplified and weakly amplified by the 4-specific ( 4-sb) and 9-specific ( 9-sb) primers, respectively, due to nucleotide mismatches within the primer regions. The nucleotide sequences for primers were 5'-GTAGCCACTGCTCAAATTCC-3' and 5'-GYAASACTGTCCATTGATAA-3' for β-p5s3 (β3-specific), 5'-CTCCTCCCAAACCTGTACCA-3' and 5'-AATACTGTCCAAGTCATCTG-3' for 4-sb, and 5'-AACCTGTGGCATCACTCTCN-3' and 5'-GGAGTCCAGATGAGCTGTTN-3' for 9-sb. (B) PCR with OERV family-specific primers revealed a new BERV family, namely, BERV 7. The nucleotide sequences of the family-specific primers were 5'-TTCCGTGGAGTGCTTGATAN-3' and 5'-CTCTCCATTGATAGCGTTGN-3' for OERV-β1, 5'-TGTTTCGGTTATTTCTCTCC-3' and 5'-TGTCTCACAGTAATAAGAGC-3' for OERV-β2, 5'-GATATTACTGGACTACTA-3' and 5'-GGTACTCACAGAGATCTT-3' for OERV- 1A, 5'-GGCATCCTCTTGAGGTCTTN-3' and 5'-CGAGTCCATGTCAGCTGCAN-3' for OERV- 2, 5'-TATAGACTGTACCAGCCGAN-3' and 5'-GAGGTGAGTCCAGGTTAGTN-3' for OERV- 3, 5'-GGTATAGATGGCACGTCTTN-3' and 5'-AGGACTGTCCACGTTAGCTN-3' for OERV- 5, 5'-GGCCTTCCTAATGCTATTGN-3' and 5'-CCTTGAGGTCAATAACGGTN-3' for OERV- 6, 5'-TGACTTCTCTGTTCTTCCTT-3' and 5'-TGTTCCCAGGTCCCACCACT-3' for OERV- 7, and 5'-CCGAAGATCAGTGGACTTGN-3' and 5'-TCCTCATCGAAGATGGTTGN-3' for OERV- 8.
|
4,
7, and
9 families, which were all identified by our PCR analyses, other ERV sequences were not identified. This shows that the degenerate PCR approach can successfully identify diverse ERV families in vertebrates, which confirms what has been observed previously (7, 10, 11). Thus, it appears that the bovine genome may contain less diverse ERVs than those in the porcine or ovine genome.
Estimation of the copy number of BERVs.
To estimate the copy numbers of BERVs, we directly analyzed the insertion sites of BERVs in the bovine genome using the Btau_3.1 bovine genome assembly by BLAST search using the BERV
4,
7,
9, and β3 pro/pol sequences and counted insertion sites for each BERV family in each chromosome. An E value of 0.0 was used as the criterion for BLAST analysis. Since the nucleotide sequence identity of the pro/pol region between different BERV families is less than 90%, insertion sites of each BERV family were clearly distinguished. With 163 insertions, the
4 family was the most abundant (see Table S1 in the supplemental material). The BERV
7,
9, and β3 families had 7, 3, and 57 insertions, respectively. None of the BLAST matches were shared between different BERV families, indicating that the results are specific for each family. BERVs are almost evenly distributed through the genome, depending on the length of the chromosome (see Table S1 in the supplemental material). Considering that the current bovine genome assembly covers 95% of the cow genome, a total of 242 BERV insertion sites (230/0.95) could be present in the genome of the reference animal.
In pigs, the residing ERV families and copy numbers in the genome were consistent across breeds (10). Similarly, our experimental analysis of a Korean native cow and our bioinformatics analysis of a Hereford cow genome showed consistent patterns in residing ERV families in the genome. In fact, the genetic distances based on mitochondrial DNA D-loop sequences among European, Japanese, and Korean native cattle were not much different from their intrapopulation distances (9). However, the pigs indigenous to Asia, including China, Korea, and Japan, are similar in their mitochondrial sequences, but different from European-type pigs (8). Interestingly, unlike ERVs in other artiodactyls, such as pigs and sheep, the ERV family expansion in cattle may be somewhat limited since only four different BERV families were identifiable in our study.
Nucleotide sequence accession numbers.
The GenBank accession numbers of representatives of the BERV families are DQ889607 for BERV β3, DQ889608 for
4-A, DQ889609 for
4-B, DQ889610 for
4-C, DQ889611 for
4-D, DQ889612 for
7, and DQ889613 for
9.
Published ahead of print on 24 October 2007. ![]()
Supplemental material for this article may be found at http://jvi.asm.org/. ![]()
|
|
|---|
pro-pol nucleotide sequences. J. Virol. 76:11738-11743.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»