Previous Article | Next Article ![]()
Journal of Virology, August 2000, p. 7666-7670, Vol. 74, No. 16
Department of Zoology, University of Oxford,
Oxford, United Kingdom
Received 29 December 1999/Accepted 24 May 2000
Analyses of a collection of full-length TT virus genomes showed
nearly half of them to be recombinant. The results were highly significant and revealed homologous recombination both within and among
genotypes, often involving extremely divergent lineages. Recombination
breakpoints were significantly more common in the noncoding region of
the TT virus genome than in the coding region.
TT virus (TTV) is a newly discovered
nonenveloped DNA virus that was initially considered to be a possible
agent of viral hepatitis, since it was first recovered from a patient
with posttransfusion hepatitis of unknown etiology (12, 15).
Subsequent studies, however, have shown it to be very widespread and to
occur at an extremely high prevalence even in healthy populations
(1, 9, 18, 20), casting doubt on its causal role in human
disease. Its circular, single-stranded, negative-sense DNA genome,
approximately 3,850 nucleotides in length (10, 11), bears
little or no identifiable similarity to other known viruses, and TTV
appears to represent a new virus family, tentatively designated
Circinoviridae (11).
For a DNA virus, TTV exhibits an astonishingly large amount
of genetic diversity. To date, more than 16 genotypes separated by more
than 30% divergence at the nucleotide level have been described
(6, 7, 14, 16, 18, 22), incorporating three hypervariable
regions (13). Understanding the origins of such diversity is
a fundamental problem in virology. While the role played by mutation
has long been considered, it is becoming increasingly apparent that
recombination also plays a key role in the evolution of many virus
groups (19, 24). The recent availability of several
full-length TTV sequences (3, 6, 10, 11, 16), along with
evidence for mixed infection by multiple genetic types (2, 4, 17,
21), prompted this investigation into whether recombination might
also play a role in the evolution of TTV.
A total of 15 full-length or near-full-length TTV genomes
with the following isolate names (accession numbers) were
collected from GenBank: TA278 (AB017610) (16); GH1
(AF122913) (11); TUS01 (AB017613) (16); SANBAN
(AB025946) (6); JA20 (AF122914), JA9 (AF122915), JA10
(AF122919), JA4 (AF122917), JA1 (AF122916), JA2B (AF122918), US32
(AF122921), and US35 (AF122920) (3); and BDH1 (AF116842),
TTVCHN1 (AF079173), and TTVCHN2 (AF129887). The sequences were
aligned using CLUSTAL W (23) and adjusted by hand. The
resulting 3,853-nucleotide full-length TTV alignment is available
from the author on request.
The aligned data set was analyzed by various methods to identify
possible recombinant isolates, to characterize their putative recombination breakpoints, and to test results suggestive of
recombination for statistical significance. First, an exploratory
tree analysis (5) was performed by sliding a
400nucleotide window down the sequence alignment in
200-nucleotide increments, generating a series of trees for the
different regions. All trees were reconstructed with the PAUP* program
(version 4; Sinauer Associates, Sunderland, Mass.) using the
neighbor-joining algorithm with distances estimated under the HKY85
model of DNA substitution. Seven isolates clearly changed topological
position over different regions of their genomes, an indication of
possible mosaic structure, and were earmarked as putative recombinants
(data not shown).
Each putative recombinant isolate was subsequently examined using
sliding window diversity plots to determine which of the other
sequences in the data set it most closely resembled over the
conflicting regions of its genome (5, 25). A
neighbor-joining tree of the full-length data set, with bootstrap
percentages based on 1,000 replicates, was also reconstructed and is
shown along with a diagram of the TTV genome in Fig.
1. Surprisingly, although almost half of
the isolates appeared to contain sequence regions from more than one
genotype, analysis of the full-length alignment gave no indication of
this, with all putative recombinants supported by high bootstrap scores
within distinct clades (Fig. 1b).
0022-538X/00/$04.00+0
Copyright © 2000, American Society for Microbiology. All rights reserved.
Extensive Homologous Recombination among Widely
Divergent TT Viruses
![]()
ABSTRACT
Top
Abstract
Text
References
![]()
TEXT
Top
Abstract
Text
References

View larger version (14K):
[in a new window]
FIG. 1.
TTV circular genome and phylogenetic tree of the
full-length alignment. (a) Schematic diagram of the TTV circular genome
(isolate TA278) with open reading frames 2, 1, and 3 indicated. The
remaining 1,048 nucleotides of the noncoding region account for
approximately 27% of the TTV genome. (b) Unrooted neighbor-joining
tree with associated bootstrap percentages from analysis of the
full-length alignment. Branch lengths are drawn to scale, and genotypes
1 to 3 are indicated. The seven putative recombinants identified by the
initial exploratory analyses are boxed.
The putative recombinants were next subjected to a maximum likelihood (ML) method for estimating recombination breakpoints and testing their significance (8). Briefly, this approach finds breakpoints by dividing an alignment of a putative recombinant and its two "parents" (those sequences it most resembles in different genomic regions) into two regions (using either one or two breakpoints) and finding a separate ML tree for each. All possible partitions (breakpoints) are tried, and the scores for the two trees in each case are combined to give a "recombination model" likelihood for that particular partitioning of the alignment. If recombination has occurred, the highest combined score is expected when the alignment is broken at the actual recombination breakpoint(s), since the two trees reconstructed in this case best reflect the true phylogenetic history of the separate recombinant regions (8, 25).
Statistical significance for the inferred breakpoints is determined by comparing the recombination model likelihood to the likelihood obtained from the unbroken alignment (i.e., the "no recombination model") by using a likelihood ratio test and a Monte Carlo approach based on sequences simulated without recombination (8, 25). Significance is established if the observed improvement in likelihood under the recombination model (i.e., when more than one tree can fit the data) is greater than that expected by chance, as assessed by comparison with the null distribution generated from simulated data. These analyses were performed on four overlapping, 1,200-nucleotide regions of the full-length alignment, and all significance tests were based on 200 simulated data sets for each set of breakpoints evaluated.
Finally, confirmation that significant results reflected mosaic genomes containing regions with different evolutionary histories was provided by constructing bootstrap phylogenetic trees (1,000 replicates) for the different recombinant regions.
Table 1 shows the results of the
breakpoint analysis of putative recombinants and their "parents,"
those isolates most similar to them in different genomic
regions. The breakpoints identified within every putative
recombinant genome closely matched those expected from the exploratory
tree and diversity analyses and proved to be highly statistically
significant, with likelihood ratios in each case much higher than any
generated by 200 simulated data sets (P < 0.005). The table also lists the percent similarity between
putative recombinants and their parents, both over the full-length
alignment and in specific recombinant fragments, showing that otherwise
divergent isolates shared marked similarity in some genomic regions.
|
Strongly supported bootstrap trees (Fig.
2) confirmed the presence within single
genomes of sequence from different genotypes or lineages: JA10 and JA2B
(Fig. 2a) contained nearly identical stretches of genotype 1 sequence,
most similar to that of JA20, and appeared to be descendants of a
single, recombinant common ancestor. TTVCHN2 (Fig. 2b to d) grouped
alternately with genotypes 1 and 3 in various parts of its genome. US35
(Fig. 2e) moved from a position basal to US32 and JA1 to group with
US32. TUS01 (Fig. 2f), otherwise extremely divergent (see Fig. 1b),
contained some genotype 2 sequence. Likewise, SANBAN (Fig. 2g)
contained some genotype 1 sequence. Finally, over a similar region
(Fig. 2h; see Table 1 for the precise breakpoints), JA20 exhibited
genotype 3 sequence, TUS01 contained genotype 1 sequence, and JA10
grouped with the extremely divergent isolate SANBAN. For clarity, three of the isolates shown in Fig. 1b (BDH1, JA9, and JA4) were not included
here because they were very similar to other isolates across the
full-length alignment. Also for clarity, the two very divergent
isolates TUS01 and SANBAN were only included for regions where they
were recombinants or parents of other recombinants. Neither the
alignment nor the subsequent analysis was significantly affected by the
presence or absence of these two isolates, and bootstrap trees produced
using all 15 isolates produced virtually identical findings in all
cases. These results indicate that TTV undergoes relatively frequent
recombination.
|
Of the seven recombinants, five (TUS01, SANBAN, JA20, JA10, and JA2B) were identified as intergenotype homologous recombinants; one (US35) was an intragenotype homologous recombinant; and one (TTVCHN2) showed evidence of both inter- and intragenotype homologous recombination. Three isolates (TTVCHN2, TUS01, and JA10) contained gene sequences from at least three different sources, indicating a history of multiple recombination events. This observation was not surprising, given the extremely high percentage of recombinants among these natural TTV isolates (>46%). One of these, TTVCHN2, contained two distinct regions of genotype 3 sequence (Fig. 2c and d).
A comparison of all the recombinant regions identified in this study
(Table 1) with the structure of the TTV genome (Fig. 1a) revealed a
surprising preponderance of breakpoints in the relatively short
noncoding region. After correcting for nonindependent comparisons, 13 out of 19 breakpoints fell within the contiguous 1,048-nucleotide
noncoding region, while only 6 out of the 19 were found within the
2,805 nucleotides spanning open reading frames 2, 1, and 3. The
difference in the numbers of breakpoints falling in noncoding versus
coding sequence was highly significant based on a chi-square test
(
2 = 16.3; df = 1; P < 0.0001). It is not clear whether this reflects a higher rate of
recombination events in the noncoding region or the enhanced fitness of
noncoding compared to coding region recombinants. While neither the
coding nor the noncoding region breakpoints appeared to map to
especially similar sequence regions, such as direct or inverted
repeats, they did not generally correspond to insertions or deletions
either. The evident pattern of homologous crossing over may thus
reflect a copy choice mechanism for recombination in these novel DNA
viruses. Because information about different levels of recombination in
different regions of viral genomes is extremely rare, these results are
particularly interesting.
It is important to note that detailed information on the strategy of sequencing was obtained for all seven recombinants (3, 6, 11, 16; C.-H. Huang, personal communication) and provided convincing evidence that the mosaic genomes identified here are natural recombinants and not laboratory artifacts. First, the successful amplification of long, overlapping sequence regions depended upon the presence of intact, circular TTV genomes, not fragmented ones. Crucially, for each recombinant, a comparison of the endpoints of the amplified PCR products with the inferred breakpoints showed that they did not correspond: the breakpoints listed in Table 1 could not be explained by the regions amplified and sequenced for each isolate. Furthermore, laboratory artifacts cannot readily explain the common recombinant region shared by JA10 and JA2B (Fig. 2a): it seems much more plausible that they diverged from a recombinant ancestor than that they arose by independent but identical errors (8, 25).
It is worth considering in this context that Taq polymerase has been shown to produce recombinant molecules reminiscent of the products of rolling circle DNA synthesis (26). However, since the recombination events described in that study were nonhomologous, it is not clear that such a process could underlie the homologous recombination detected here. Significantly, the majority of the genomes in the present data set were isolated from carefully collected and stored patient serum, most of which were not observed to be multiply infected, were the result of extremely clean, single, intense PCR products of the expected size, and were confirmed by evaluation of multiple PCR products and found to represent single sequences (J. C. Erker, personal communication). Taken together, these observations strongly suggest that these chimeric genomes were generated by natural recombination, not laboratory error.
The full-length tree with its associated bootstrap values (Fig. 1b) is quite instructive in light of the detailed evidence for recombination among the isolates depicted. For instance, not only does each recombinant fall within a distinct clade of this tree, but every one is supported by a deceptive 100% bootstrap score even though it contains extensive regions of alternative ancestry. At the same time, the highly structured tree clearly indicates that a good deal of phylogenetic signal remains among the isolates analyzed, despite the effects of recombination. Except, for instance, in the case of JA10 and JA2B (Fig. 2a), much of the evidence for recombination described here could reflect fairly recent events between different TTV lineages. The often high degree of similarity between recombinants and their parents (Table 1) suggests that in some cases little independent evolution has occurred since recombination.
In conclusion, the findings reported here imply that short sequence regions, particularly from the noncoding regions of TTV genomes, may be inadequate markers for identifying and typing isolates and for reconstructing the evolutionary history of this group. Furthermore, although long sequence regions or full-length genomes will be preferable for studies of TTV, even these must be scrutinized with care to reveal the fingerprint of recombination. The detection of recombination in TTV underscores the notion that for any group of viruses the assumption of clonality should be validated by explicit tests whenever phylogenies are used for virological inference. The evidence for recombination revealed by these analyses of full-length TTV genomes is remarkable because it shows for the first time that recombination not only occurs but is widespread in this newly discovered group and so is probably an important force in its evolution. Perhaps most intriguingly, the results demonstrate that extremely divergent variants of this novel DNA virus are linked, by recombination, into a single gene pool.
| |
ACKNOWLEDGMENTS |
|---|
This work was supported by The Rhodes Trust and by the Natural Sciences and Engineering Research Council of Canada.
I thank James Erker and Cheng-hui Huang for helpful correspondence and Eddie Holmes and two anonymous reviewers for comments on the manuscript.
| |
FOOTNOTES |
|---|
* Mailing address: Department of Zoology, University of Oxford, South Parks Rd., Oxford OX1 3PS, United Kingdom. Phone: 44 (0)1865 271273. Fax: 44 (0)1865 310447. E-mail: michael.worobey{at}zoo.ox.ac.uk.
| |
REFERENCES |
|---|
|
|
|---|
| 1. |
Abe, K.,
T. Inami,
K. Asano,
C. Miyoshi,
N. Masaki,
S. Hayashi,
K. I. Ishikawa,
Y. Takebe,
K. M. Win,
A. R. El-Zayadi,
K. H. Han, and D. Y. Zhang.
1999.
TT virus infection is widespread in the general populations from different geographic regions.
J. Clin. Microbiol.
37:2703-2705 |
| 2. | Ball, J. K., R. Curran, S. Berridge, A. M. Grabowska, C. L. Jameson, B. J. Thomson, W. L. Irving, and P. M. Sharp. 1999. TT virus sequence heterogeneity in vivo: evidence for co-infection with multiple genetic types. J. Gen. Virol. 80:1759-1768[Abstract]. |
| 3. | Erker, J. C., T. P. Leary, S. M. Desai, M. L. Chalmers, and I. K. Mushahwar. 1999. Analyses of TT virus full-length genomic sequences. J. Gen. Virol. 80:1743-1750[Abstract]. |
| 4. | Forns, X., P. Hegerich, A. Darnell, S. U. Emerson, R. H. Purcell, and J. Bukh. 1999. High prevalence of TT virus (TTV) infection in patients on maintenance hemodialysis: frequent mixed infections with different genotypes and lack of evidence of associated liver disease. J. Med. Virol. 59:313-317[CrossRef][Medline]. |
| 5. |
Gao, F.,
D. L. Robertson,
C. D. Carruthers,
S. G. Morrison,
B. X. Jian,
Y. L. Chen,
F. Barré-Sinoussi,
M. Girard,
A. Srinivasan,
A. G. Abimiku,
G. M. Shaw,
P. M. Sharp, and B. H. Hahn.
1998.
A comprehensive panel of near-full-length clones and reference sequences for non-subtype B isolates of human immunodeficiency virus type 1.
J. Virol.
72:5680-5698 |
| 6. | Hijikata, M., K. Takahashi, and S. Mishiro. 1999. Complete circular DNA genome of a TT virus variant (isolate name SANBAN) and 44 partial ORF2 sequences implicating a great degree of diversity beyond genotypes. Virology 260:17-22[CrossRef][Medline]. |
| 7. | Hohne, M., T. Berg, A. R. Muller, and E. Schreier. 1998. Detection of sequences of TT virus, a novel DNA virus, in German patients. J. Gen. Virol. 79:2761-2764[Abstract]. |
| 8. | Holmes, E. C., M. Worobey, and A. Rambaut. 1999. Phylogenetic evidence for recombination in dengue virus. Mol. Biol. Evol. 16:405-409[Abstract]. |
| 9. |
Leary, T. P.,
J. C. Erker,
M. L. Chalmers,
S. M. Desai, and I. K. Mushahwar.
1999.
Improved detection systems for TT virus reveal high prevalence in humans, nonhuman primates and farm animals.
J. Gen. Virol.
80:2115-2120 |
| 10. |
Miyata, H.,
H. Tsunoda,
A. Kazi,
A. Yamada,
M. A. Khan,
J. Murakami,
T. Kamahora,
K. Shiraki, and S. Hino.
1999.
Identification of a novel GC-rich 113-nucleotide region to complete the circular, single-stranded DNA genome of TT virus, the first human circovirus.
J. Virol.
73:3582-3586 |
| 11. |
Mushahwar, I. K.,
J. C. Erker,
A. S. Muerhoff,
T. P. Leary,
J. N. Simons,
L. G. Birkenmeyer,
M. L. Chalmers,
T. J. Pilot-Matias, and S. M. Dexai.
1999.
Molecular and biophysical characterization of TT virus: evidence for a new virus family infecting humans.
Proc. Natl. Acad. Sci. USA
96:3177-3182 |
| 12. | Nishizawa, T., H. Okamoto, K. Konishi, H. Yoshizawa, Y. Miyakawa, and M. Mayumi. 1997. A novel DNA virus (TTV) associated with elevated transaminase levels in posttransfusion hepatitis of unknown etiology. Biochem. Biophys. Res. Commun. 241:92-97[CrossRef][Medline]. |
| 13. |
Nishizawa, T.,
H. Okamoto,
F. Tsuda,
T. Aikawa,
Y. Sugai,
K. Konishi,
Y. Akahane,
M. Ukita,
T. Tanaka,
Y. Miyakawa, and M. Mayumi.
1999.
Quasispecies of TT virus (TTV) with sequence divergence in hypervariable regions of the capsid protein in chronic TTV infection.
J. Virol.
73:9604-9608 |
| 14. | Okamoto, H., N. Kato, H. Iizuka, F. Tsuda, Y. Miyakawa, and M. Mayumi. 1999. Distinct genotypes of a nonenveloped DNA virus associated with posttransfusion non-A to G hepatitis (TT virus) in plasma and peripheral blood mononuclear cells. J. Med. Virol. 57:252-258[CrossRef][Medline]. |
| 15. | Okamoto, H., T. Nishizawa, N. Kato, M. Ukita, H. Ikeda, H. Iizuka, Y. Miyakawa, and M. Mayumi. 1998. Molecular cloning and characterization of a novel DNA virus (TTV) associated with posttransfusion hepatitis of unknown etiology. Hepatol. Res. 10:1-16. |
| 16. | Okamoto, H., T. Nishizawa, M. Ukita, M. Takahashi, M. Fukuda, H. Iizuka, Y. Miyakawa, and M. Mayumi. 1999. The entire nucleotide sequence of a TT virus isolate from the United States (TUSO1): comparison with reported isolates and phylogenetic analysis. Virology 259:437-448[CrossRef][Medline]. |
| 17. | Okamoto, H., M. Takahashi, T. Nishizawa, M. Ukita, M. Fukuda, F. Tsuda, Y. Miyakawa, and M. Mayumi. 1999. Marked genomic heterogeneity and frequent mixed infection of TT virus demonstrated by PCR with primers from coding and noncoding regions. Virology 259:428-436[CrossRef][Medline]. |
| 18. | Prescott, L. E., D. M. MacDonald, F. Davidson, J. Mokili, D. I. Pritchard, D. E. Arnot, E. M. Riley, B. M. Greenwood, S. Hamid, A. A. Saeed, M. O. McClure, D. B. Smith, and P. Simmonds. 1999. Sequence diversity of TT virus in geographically dispersed human populations. J. Gen. Virol. 80:1751-1758[Abstract]. |
| 19. | Sharp, P. M., D. L. Robertson, and B. H. Hahn. 1995. Cross-species transmission and recombination of "AIDS" viruses. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 349:41-47[Medline]. |
| 20. | Takahashi, K., H. Hoshino, Y. Ohta, N. Yoshida, and S. Mishiro. 1998. Very high prevalence of TT virus (TTV) infection in general population of Japan revealed by a new set of PCR primers. Hepatol. Res. 12:233-239. |
| 21. | Takayama, S., S. Yamazaki, S. Matsuo, and S. Sugii. 1999. Multiple infection of TT virus (TTV) with different genotypes in Japanese hemophiliacs. Biochem. Biophys. Res. Commun. 256:208-211[CrossRef][Medline]. |
| 22. | Tanaka, Y., M. Mizokami, E. Orito, T. Nakano, T. Kato, X. Ding, T. Ohno, R. Ueda, S. Sonoda, K. Tajima, T. Miura, and M. Hayami. 1999. A new genotype of TT virus (TTV) infection among Colombian native Indians. J. Med. Virol. 57:264-268[CrossRef][Medline]. |
| 23. |
Thompson, J. D.,
D. G. Higgins, and T. J. Gibson.
1994.
CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.
Nucleic Acids Res.
22:4673-4680 |
| 24. |
Worobey, M., and E. C. Holmes.
1999.
Evolutionary aspects of recombination in RNA viruses.
J. Gen. Virol.
80:2535-2543 |
| 25. |
Worobey, M.,
A. Rambaut, and E. C. Holmes.
1999.
Widespread intra-serotype recombination in natural populations of dengue virus.
Proc. Natl. Acad. Sci. USA
96:7352-7357 |
| 26. |
Zaphiropoulos, P. G.
1998.
Non-homologous recombination mediated by Thermus aquaticus DNA polymerase I. Evidence supporting a copy choice mechanism.
Nucleic Acids Res.
26:2843-2848 |
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»