Previous Article | Next Article ![]()
Journal of Virology, June 2007, p. 5902-5907, Vol. 81, No. 11
0022-538X/07/$08.00+0 doi:10.1128/JVI.02431-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.

Jiangtao Zhang,
Xueping Zhou,* and
Hongye Li*
Institute of Biotechnology, Zhejiang University, Hangzhou, Zhejiang 310029, China
Received 5 November 2006/ Accepted 11 March 2007
|
|
|---|
|
|
|---|
Previous research on plant viral population structures and evolution has mainly focused on plant RNA viruses (reviewed in references 9, 12, and 27-29). The populations of plant RNA viruses are intrinsically genetically heterogeneous, and such populations are designated as quasispecies. The quasispecies concept suggests that the population of a virus, even in a single replicating population, is essentially a collection of variants varying around the consensus sequence (6, 7).
Members of the genus Begomovirus of the family Geminiviridae, which have circular single-stranded DNA genomes, are transmitted by the whitefly Bemisia tabaci Genn. in a semipersistent circulating manner (26). Due to the worldwide increases in the population and distribution of the insect vector and global movement of plant materials, begomovirus-induced diseases have become a major constraint on crop production in tropical and subtropical regions (26, 35). Cassava mosaic disease and cotton leaf curl disease, which have caused severe crop losses in African countries and Pakistan, are typical recently emerging begomovirus diseases (reviewed in references 20, 26, and 35). Although the impacts of recombination and pseudorecombination on the evolution of begomoviruses and epidemics of begomovirus-induced diseases have been extensively documented (10, 25, 30-32, 43-44), the effects of mutation on begomovirus evolution have thus far not been documented extensively. Unlike RNA viruses, begomoviruses replicate their genomes inside the nucleus by using the host replication machinery (reviewed in references 14 and 16). Thus, these viruses were assumed to have higher replication fidelity and lower rates of mutation accumulation than RNA viruses (6, 26). However, the large number of species (>100) (8) and the continued reports of new species (3), as well as the high degree of genetic diversity within the species (13, 23, 24, 31, 38), suggest that begomoviruses have a high mutation rate and that they generate highly diverse populations in a short time. Isnard et al. (18) analyzed the genetic diversity of a single isolate population of Maize streak virus (MSV), a virus species in the genus Mastrevirus of the family Geminiviridae, and demonstrated that the population of MSV had a quasispecies structure (18). However, in contrast to plant RNA viruses, the available information about genetic variability is scant, and essentially no quantitative studies have analyzed the genetic structures and variability of begomovirus populations under controlled conditions.
In this study, we investigated the genetic structure and population variability of the Tomato yellow leaf curl China virus (TYLCCNV) populations in a naturally infected Solanum lycopersicom plant and in Nicotiana benthamiana and S. lycopersicom plants inoculated with a cloned TYLCCNV derivative. Our results show that the structure of the TYLCCNV population is quasispecies-like and that rapid accumulation of variation during infection of both plants generated diversity levels comparable to those reported for plant RNA viruses.
|
|
|---|
Plant, virus, and viral inoculation. S. lycopersicom cv. Hongbaoshi and N. benthamiana plants were grown in an insect-free room with a constant temperature of 25°C and supplementary lighting for 16 h per day. Infectious clones of TYLCCNV isolate Y10 (pBinPLUS-Y10-1.7A) and its associated DNAß (pBinPLUS-2ß) were constructed previously (4). Plants at the six- or seven-leaf stage were inoculated by injecting the stems with a mixture of Agrobacterium tumefaciens strain EHA105 cultures carrying pBinPLUS-1.7A and pBinPLUS-2ß as described previously (4). One newly emerging leaf with typical symptoms was taken from each plant at 60 days postinoculation (p.i.) and 120 days p.i. for population analysis.
Total DNA extraction and PCR amplification. Total DNA was extracted from systemically infected leaves as described previously (42). The primers Y10F (5'-GCATCCACAAGGTAGGTC-3') and Y10R (5'-CGATGAGCAGAGGATCCC-3'), corresponding to nucleotides (nt) 1491 to 1508 and nt 149 to 132 of TYLCCNV (AJ319675), respectively, were used to amplify a 1,396-bp fragment covering the intergenic region (IR) and the AC1 coding region (Fig. 1). The PCR mixture (50 µl) contained 1.5 to 3.0 µg of the total DNA extract, 0.5 µl Pfu DNA polymerase (5 U/µl; Biobasic Inc., Ontario, Canada), 0.6 µl 1.5 mM MgCl2, 5 µl 2 mM deoxynucleoside triphosphate, and 10 µl of a 5 mM primer mixture of Y10F and Y10R. PCRs were performed on a Minicycler (MJ Research, Inc., MA). After being preheated at 94°C for 3 min, the reaction mixtures were cycled 20 times at 94°C for 45 s, 50°C for 45 s, and 72°C for 1.5 min, followed by an extension at 72°C for 10 min.
![]() View larger version (31K): [in a new window] |
FIG. 1. Genome organization of TYLCCNV DNA-A. The region between the arrows of the outer line was analyzed in this study. The primers Y10F and Y10R were used to amplify the region, and the primers IF and IR were used in sequencing. The numbers represent the nucleotide positions.
|
according to the manufacturer's instructions. About 20 clones were randomly picked from each viral population. The plasmid DNA was extracted from each clone, quantified, and sequenced using the Taq DyeDeoxy terminator cycle-sequencing kit (Applied Biosystems, CA). Sequencing primers included the M13 forward and reverse primers and a pair of primers, IF (5'-GATTGCCTCGGCATATG-3', identical to nt 2179 to 2195) and IR (5'-CATATGCCGAGGCAATC-3', complementary to nt 2179 to 2195), located in the middle of the amplified fragment (Fig. 1). About 20 nt at both the 5' and 3' ends of the fragment were excluded from sequence analysis. To minimize the potential variation resulting from PCR errors, we took the following measures: high-fidelity polymerase with proofreading capability was used in amplifications of viral DNA of progeny populations; a maximum amount of template DNA from which the target fragment could still be effectively amplified was used to avoid the bias that might result from a small subset of viruses; the PCR parameters were the same, and all components of PCR were from the same lot; and only 20 cycles for each PCR were carried out. In addition, the level of PCR error was determined by sequencing 15 clones generated from a PCR product amplified using the DNA of the infectious clone as a template. Only one of theses clones was shown to bear a single mutation. Thus, the PCR-associated mutation frequency was estimated to be 4.9 x 105 mutations per nt (Table 1). |
View this table: [in a new window] |
TABLE 1. Variation in progeny populations of TYLCCNV derived from the infection of identical sequence
|
|
|
|---|
Analysis of population variation in N. benthamiana at 120 days p.i. indicated that the heterogeneous character of the TYLCCNV population did not change over the time course of infection and that the level of diversity increased at 120 days p.i., although the significance of the increase was not supported statistically (Table 1) (Kruskal-Wallis test; P > 0.05).
To determine that the mutation rates were not unusual features of TYLCCNV infection in N. benthamiana, we analyzed the heterogeneity of the TYLCCNV progeny population in an infected tomato plant at 60 days p.i. The results clearly showed that the TYLCCNV progeny population was genetically heterogeneous in the infected tomato plant, and the level of diversity, as indicated by the proportion of mutated clones (35%) and the mutation frequency (4.1 x 104), was not significantly different from that observed in N. benthamiana at 60 days p.i. (Table 1) (Kruskal-Wallis test; P > 0.05).
Genetic structure and variability of the TYLCCNV population in a naturally infected plant. The genetic structure and variability of a TYLCCNV population in a naturally infected tomato (YN-33) was also analyzed. Based on the alignment analysis of 16 sequences of randomly amplified viral clones, we found that the TYLCCNV population contained two distinct clusters, designated as subpopulation I and subpopulation II. Subpopulation I consisted of 2 clones with identical sequences, whereas subpopulation II was composed of 14 clones. Ten of these clones had identical sequences (designated the consensus sequence), and the other four clones harbored five point mutations compared to the consensus sequence. The sequence identity between the consensus sequences of the two subpopulations was 92%, suggesting that they belonged to the same species based on current International Committee on Taxonomy of Viruses guidelines, which set a demarcation value of 89% nucleotide sequence identity between begomovirus DNA-A components (8). Because of the obvious divergence present between subpopulation I and subpopulation II, we assumed that the two subpopulations coexisting in the same plant tissue was the consequence of a mixed infection by two genetically distinct parental viruses. If we assume that the variations within subpopulation II resulted from the accumulation of mutations during the infection, the percentage of mutated clones and the mutation frequency in subpopulation II were 29% and 2.6 x 104, respectively. This value is not significantly different from that presented in the TYLCCNV progeny population of experimentally inoculated tomato plants at 60 days p.i. (Table 1) (Kruskal-Wallis test; P > 0.05). Interestingly, the TYLCCNV isolate Y10 (TYLCCNV-[Y10]) and subpopulation I had identical sequences, except for 1 nt difference (62A for TYLCCNV-[Y10] and 62G for subpopulation I), whereas subpopulation II was more divergent from TYLCCNV-[Y10].
The consensus sequence is stable, but the population composition is varied. Comparison of the mutant spectra for TYLCCNV progeny populations in three N. benthamiana plants and one tomato plant indicated that the parental sequence from the infectious clone dominated in the populations at 60 days p.i. Thus, the population still represented the consensus sequence, and this status did not change during the time course of infection (Table 2). However, the mutant spectra for the populations at 60 days p.i. varied in each host plant, except for a few mutants that recurred in all populations. Most mutants present at 60 days p.i. were not detected at 120 days p.i. in the same plant; however, some new mutants appeared at 120 days p.i. (Table 2). This result indicates that the consensus sequence of TYLCCNV is stable but that the quasispecies composition varies in a manner similar to that of plant RNA virus populations (33-34).
|
View this table: [in a new window] |
TABLE 2. Compositions of TYLCCNV progeny populations in TYLCCNV-infected N. benthamiana
|
![]() View larger version (8K): [in a new window] |
FIG. 2. Distribution of mutations detected in progeny populations derived from a TYLCCNV clone. The region sequenced extends from base 1507, the final nucleotide in the AC1 reading frame, to base 127, the final nucleotide of the IR. Sites of mutations are indicated by vertical lines below the map. A line with a number indicates a position where more than one mutation occurred. The actual nucleotide positions of the mutations are shown in Table 2.
|
|
|
|---|
The high level of population diversity and the uniform distribution of mutations over the region of the genome analyzed in this study suggest that TYLCCNV has high mutation rates during viral-genome replication in vivo. Indeed, similar observations have also been reported for MSV, a geminivirus in the genus Mastrevirus (18). In that case, the observed mutation frequencies ranged from 3.8 x 104 to 10.5 x 104, depending on the host species, the homogeneity of inocula, and the periods of time the populations were maintained in the hosts (18). The population variation of TYLCCNV estimated in this study is also comparable to that of MSV.
The phenomenon of high population genetic diversity was also observed in a field population of Beet curly top virus, genus Curtovirus, family Geminiviridae (38). In addition, a high level of genetic diversity was reported in TT virus, a small nonenveloped, single-stranded, circular DNA virus of humans (39, 41), and in Canine parvovirus, a small single-stranded DNA virus of animals (36-37). The mutation frequency for a virus is determined by a combination of the intrinsic frequency of misincorporation and the capability for mismatch repair, as well as the extent of specific selection or stochastic drift resulting from genetic bottlenecks imposed on the virus population. The high mutation frequency associated with RNA viruses is presumed to be due in large part to their lack of proofreading capability during replication (6-7, 27). In comparison, geminiviruses replicate using their host DNA replication machinery (14, 16). Theoretically, these viruses should have less population variation. However, there is no information about the nature of plant DNA polymerase or the polymerase factors involved in the replication of geminiviruses. In particular, it is not known whether only a subset of cellular DNA replication and/or mismatch repair machinery is activated for geminivirus replication or whether the cellular environment affects the fidelity of those polymerases. Brough et al. (2) reported that DNA methylation inhibits the replication of tomato gold mosaic virus in tobacco protoplasts (2), implying that geminivirus DNA may not be methylated and that the normal mechanisms for mismatch repair probably do not operate during the tomato gold mosaic virus replication cycle (17). Thus, it is possible that the mechanisms of mismatch repair may function differently during geminivirus DNA and cellular DNA replication and that the lack of postreplication repair may be responsible for higher misincorporation in the geminivirus progeny DNA (27, 31, 35).
Analyses of the composition of the TYLCCNV progeny populations in both N. benthamiana and tomato indicate that the progenitor sequence is still dominant. This implies that the rule of genetic stability is obeyed by the TYLCCNV quasispecies, as is the case for quasispecies of plant RNA viruses (11-12, 33-34). Moreover, the minor sequences (excluding a few presumed mutational hot spots) were inconsistent in the TYLCCNV populations analyzed and varied as infection progressed (Table 2). In addition, the mutations must have occurred during TYLCCNV replication, and the mutation frequency in TYLCCNV should theoretically have increased in the quasispecies population as the infections progressed. However, no statistically significant diversity increase was observed for the TYLCCNV population in N. benthamiana at 120 days p.i. compared to 60 days p.i. (Tables 1 and 2), suggesting that some mechanisms must operate on the TYLCCNV population to reduce particular variants and to maintain the diversity at a certain level. For the plant RNA viruses, predominantly negative selection, bottlenecks arising during movement and transmission, and population differentiation during plant growth and development have been demonstrated to be responsible for maintaining population diversity (5, 9, 15, 19, 21). It is clear that selection must play an important role in TYLCCNV populations, since mutations based on the bias of coding capacity are not evenly distributed throughout the viral genome (Fig. 2). The presence and impact of bottleneck and population allocation on quasispecies diversity of plant DNA virus remains to be explored.
The mutants C127T/G2312A and G2718T were detected in all three N. benthamiana plants analyzed at 60 and 120 days p.i. These nucleotides are most likely the hot spots for misincorporation, based on the fact that the parental sequence still dominated in the various populations analyzed and that the ratio of these mutants in the 120-day-p.i. population did not increase obviously compared to the 60-day-p.i. population. The C-to-T change at 127 is 1 bp upstream of the ATG for AV1, which may impact promoter activity or translation of the AV1 mRNA. The G-to-A change at 2312 results in an atypical residue at position 94 in AC1. The G-to-T change at 2718 alters the sequence of one side the stem of the conserved hairpin. In future studies, we plan to reintroduce these mutations into the TYLCCNV genome to elucidate their functional consequences. Sequence variability in different genomic regions of TYLCCNV could reflect selective constraints or hot spots for misincorporation of those regions. The mutation frequency in the IR is higher than that in the AC1 coding region, suggesting that the IR is subjected to less-selective constraints. This result agrees with previous reports that the IR is a variable region for geminiviruses (32). ORF AC1 overlaps ORF AC4 completely. Theoretically, the overlapping region should be subjected to stricter evolutionary constraints than the nonoverlapping region and should be less variable. However, as indicated by the mutation frequency, the current results show that the AC1-AC4 overlapping region is more variable than the AC1 nonoverlapping region (4.8 x 104 versus 2.6 x 104). Higher variations in the AC1-AC4 overlapping region were also found in cotton leaf curl virus, and these were thought to be related to the mechanism by which the AC4 gene was generated (by overprinting) and the lack of its universal presence in geminiviruses (31).
With the introduction and spread of the begomovirus whitefly vector (B. tabaci B biotype), begomovirus diseases have posed serious threats to crop production in tropical and subtropical regions. These threats will continue to expand with the increase of global temperatures and the spread of the whitefly. The quasispecies-like nature of geminivirus populations clearly provides a mechanism for their rapid evolution and adaptation to ever-changing environments. Thus, a study of the genetic structure of a geminivirus and the determinants that affect its population evolution will require durable strategies to manipulate geminivirus-induced disease epidemics and to limit the emergence of these viruses.
This work was supported by the National Key Basic Research and Development Program (2006CB101903) and the National Natural Foundation of Science of China (30470078 and 30530520).
Published ahead of print on 21 March 2007. ![]()
Present address: Institute of Food Processing, Zhejiang Academy of Agricultural Science, Hangzhou, 310021, People's Republic of China. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»