Previous Article | Next Article ![]()
Journal of Virology, December 2006, p. 11827-11832, Vol. 80, No. 23
0022-538X/06/$08.00+0 doi:10.1128/JVI.01100-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Department of Molecular and Cell Biology, Faculty of Science, University of Cape Town, Rondebosch, 7701, South Africa,1 Electron Microscopy Unit, University of Cape Town, Rondebosch, 7701, South Africa,2 Institute of Infectious Disease and Molecular Medicine, Faculty of Health Sciences, University of Cape Town, Observatory 7925, South Africa3
Received 30 May 2006/ Accepted 4 September 2006
|
|
|---|
|
|
|---|
The extensive genetic and antigenic diversity observed in RNA viruses such as FMDV is generally attributed to the error-prone nature of their replication machinery (4). FMDV is thought to be evolving largely through genetic drift but with positive selection contributing substantially to the fixation of mutations, particularly in the capsid coding regions (8, 20, 21). However, incongruence between the inferred phylogenies of individual subgenomic regions suggests that recombination may also play a significant role in FMDV evolution (7, 32, 34). A number of specific FMDV recombination events have been described thus far. Whereas a very few events have involved exchanges of genome sequences encoding parts of the capsid-coding region (P1), all of the rest seem to have involved genome regions encoding the nonstructural proteins (11, 12, 32). In this study, we sought first to identify the set of unambiguously unique recombination events detectable in publicly available FMDV full genome sequences and secondly to determine the distribution of these events across the FMDV genome. We provide evidence that the P1 regions of both FMDV and picornaviruses in general may be functionally interchangeable modules that facilitate their promiscuous recombinational exchange within different picornavirus species.
|
|
|---|
Phylogenetic-compatibility analysis was performed using the program TreeOrderScan in the Simmonic 2005 ver. 1.4 package (28, 29). This software generates "optimally ordered" rooted neighbor-joining trees (100 bootstrap replicates; all branches with <70% support are collapsed) for successive fragments along an alignment (300 nucleotides [nt] in length at 100-nt intervals). The taxon orders within each tree are then compared with those generated for all the other 300-nt sequence fragments examined along the alignment. A compatibility matrix is constructed by counting the minimum number of tree topology alterations required to convert the taxon ordering in each tree to that of every other tree. Sequences were assigned to predefined groups (based on their serotypes) in order to compute the numbers of phylogenetic violations between groups of sequences represented in different trees.
To investigate the extent of recombination within the data set, the aligned sequences were examined using the Recombination Detection Program (RDP) (16), GENECONV (25), BOOTSCAN (17), MAXIMUM CHI SQUARE (22), CHIMAERA (19), and SISTER SCAN (6) recombination detection methods as implemented in RDP3 (19), available from http://darwin.uvigo.es/rdp/rdp.html (for full details of program settings, see http://darwin.uvigo.es/rdp/heath2006.zip). The breakpoint positions and recombinant sequence(s) inferred for every detected potential recombination event (PRE) were manually checked and adjusted where necessary using the extensive phylogenetic and recombination signal analysis features available in RDP3.
Once a set of unique PREs was identified (http://darwin.uvigo.es/rdp/heath2006.zip), a breakpoint map containing the positions of all clearly identifiable breakpoints was compiled. A breakpoint density plot was then constructed from this map by moving a 200-nt window 1 nucleotide at a time along the length of the map. At each window position, all identified breakpoints falling within the window were counted, and the number was plotted at the central window position. Significant clustering of breakpoint positions within each window was tested by permutation. Globally significant breakpoint clusters were identified as those windows within the breakpoint density plot that contained more breakpoint positions than the maximum found in more than 95% of 1,000 permuted breakpoint density plots. Locally significant breakpoint clusters were identified as those windows at a particular location within the breakpoint density plot that contained more breakpoint positions than more than 99% of the windows at the same location in the 1,000 permuted breakpoint density plots (for a detailed description of the permutation test, see http://darwin.uvigo.es/rdp/heath2006.zip).
|
|
|---|
![]() View larger version (49K): [in a new window] |
FIG. 1. (a) Phylogenetic compatibility matrix of 125 FMDVs. To generate this plot, a 300-nucleotide window was moved across the alignment with 100-nucleotide steps, and at each position, a neighbor-joining tree was constructed. The compatibility of two trees is measured by the number of phylogenetic violations that are required to match the ordering of taxa within the tree. Obtained values are color coded, with phylogenetically compatible regions shown in blue. The FMDV genome diagram superimposed on the diagonal axis was drawn to scale relative to FMDV A10/Holland/62 (AY539751). (b) Potential recombination breakpoint hot spots within the FMDV alignment. Detectable breakpoint positions are indicated by small vertical lines at the top of the graph. A 200-nucleotide window was moved along the alignment 1 nucleotide at a time, and the breakpoints detected within the window region were counted and plotted (solid line). The upper and lower broken lines, respectively, indicate 99% and 95% confidence thresholds for globally significant breakpoint clusters. The light- and dark-gray areas, respectively, indicate local 99% and 95% breakpoint-clustering thresholds, taking into account local regional differences in sequence diversity that influence the abilities of different methods to detect recombination breakpoints (for a detailed description of the permutation test, see http://darwin.uvigo.es/rdp/heath2006.zip).
|
The results of these FMDV breakpoint distribution and phylogenetic-compatibility analyses mirror those recently reported for the enteroviruses, another picornavirus genus. Phylogenies of the structural and nonstructural genes of enteroviruses are apparently also largely incompatible with one another (14, 24, 29). This suggested to us that a common mechanism may be influencing intertype recombination occurring among both aphthoviruses and enteroviruses and that evidence of the same mechanism might also be detectable in other picornavirus genera. To test this hypothesis, complete genome sequences of species A, B, and C enteroviruses (29) (n = 28, 61, and 51, respectively) and teschoviruses (27) (n = 29) were subjected to the same set of analyses as that used on the FMDV alignment. There are insufficient full genome sequences available for members of the other picornavirus genera for analysis of these to have yielded any meaningful results. We detected evidence of 32, 119, 58, and 46 independent PREs in the enterovirus species A, B, and C and teschovirus alignments, respectively (http://darwin.uvigo.es/rdp/heath2006.zip).
Consistent with previous reports, significant recombination hot spots were detected at the boundaries of the P1 regions of all three enterovirus species (24, 29). These recombination hot spots are most clearly defined in the poliovirus and nonpoliovirus species C enteroviruses (Fig. 2c), for which two globally significant breakpoint hot spots are evident at or close to the boundaries of the P1 region. For the species A and B enteroviruses, globally significant hot spots are located close to the 5' boundary of the P1 region (Fig. 2a and b). Athough there are breakpoint clusters close to the 3' P1 boundary in both species A and B enterovirus genomes, the significance of these clusters is not statistically supported. A locally significant breakpoint hot spot is, however, apparent in both the enterovirus A and B genomes at the boundary between the 2A and 2B ORFs (Fig. 2a and b). It is perhaps also noteworthy that a nonsignificant breakpoint cluster between the 2A and 2B ORFs is also evident in the enterovirus C genomes (Fig. 2c). The globally significant (P = 0.01) breakpoint hot spots at or close to the P1 boundaries in the teschoviruses closely resemble those observed in the FMDV and enterovirus C genomes (Fig. 2d). As with FMDV, the teschovirus 5' untranslated region also contains a locally significant breakpoint cluster. The other locally significant teschovirus breakpoint cluster at the 2C-3AB ORF interface also seems to correspond with breakpoint clusters at or near the 2C-3AB ORF interface in the A, B, and C enteroviruses, with the clusters in enterovirus Bs being locally significant. Importantly, all of the recombination hot spots identified by the breakpoint-clustering analyses coincide very closely with the boundaries of genomic regions with maximum phylogenetic conflict indicated by the phylogenetic-compatibility analyses.
![]() View larger version (62K): [in a new window] |
FIG. 2. Phylogenetic-compatibility matrices and breakpoint density plots of enteroviruses and teschoviruses constructed as outlined for Fig. 1. (a) Enterovirus species A. (b) Enterovirus species B. (c) Enterovirus species C. (d) Teschoviruses. The superimposed genome diagrams were drawn to scale relative to sequences NC001430, NC001472, NC002058, and AJ011380. The upper and lower broken lines in each breakpoint density graph, respectively, indicate 99% and 95% confidence thresholds for globally significant breakpoint clusters. The light- and dark-gray areas, respectively, indicate local 99% and 95% breakpoint-clustering thresholds, taking into account regional differences in sequence diversity that influence the abilities of different methods to detect recombination breakpoints (http://darwin.uvigo.es/rdp/heath2006.zip). It should be noted that apart from a different color scale, the phylogenetic-compatibility analyses presented in panels a, b, and c are a repeat of those, using the same datasets, previously described by Simmonds and Welch (29) and are included here for comparative purposes.
|
Although it is likely that there is a common mechanism influencing the recombination patterns we have observed in FMDV and other picornaviruses, it is not immediately obvious what that mechanism might be. The recombination hot spots described here could be facilitated by conserved biochemical and/or folded nucleotide structural features at, for example, the interfaces between the structural- and nonstructural-protein coding sequences. There does not, however, appear to be any correlation between the positions of enterovirus genomic-RNA secondary structures and favored recombination breakpoint positions detected in these viruses (29).
It is also possible that recombination occurs throughout these genomes at roughly the same rate but that recombinants containing breakpoints outside certain well-defined genome regions (such as at the interfaces between the structural and nonstructural genes) are generally less fit than parental viruses. Evaluation of the replicative abilities of recombinant FMDV and poliovirus genomes provides some experimental support for the notion that many newly formed recombinant picornaviruses, if not the vast majority, are substantially less viable than either of their parents (23, 30, 33). We and others have previously demonstrated that selection seems to favor the survival of recombinants in which the regions of sequence inherited from different sources either work well together or do not encode proteins that interact extensively with one another (10, 18). The viability of a newly formed recombinant genome is apparently inversely correlated with both the complexity of interactions between genome regions inherited from its different parents and the relatedness of those parents to one another (10, 18). It is conceivable that picornavirus recombinants with complexly interacting ORFs inherited from divergent parents will be removed by purifying selection. If this is true, what should remain is evidence of a set of recombination events that accurately anticipate protein-protein, protein-RNA, and RNA-RNA interactions between viral components. The apparent paucity of recombination breakpoints within P1 observed in all five picornavirus data sets that we examined may, for example, indicate that breakpoints occurring in this region have a high probability of disrupting complex interactions between the various structural proteins that play crucial roles in the stability and maturation of picornavirus virions (9).
The results presented here should be of considerable use in future analyses of picornavirus evolution. Partitioning picornavirus genomes at the recombination hot spots we have identified should both increase the power of and decrease the rate of false-positive inferences in population-genetic and selection studies. Our RDP project files (http://darwin.uvigo.es/rdp/heath2006.zip) can also be directly used with the program RDP3 to obtain mostly recombination-free picornavirus data sets that should be of great use in such studies. Besides this, the files, when loaded into RDP3, are essentially a highly interactive database of probable aphthovirus, enterovirus, and teschovirus recombination. These project files will enable detailed analyses of how, when, and where any of the many individual PREs detected in this study may have occurred.
We thank the Harry Oppenheimer Trust, South African National Bioinformatics Network, and National Research Foundation for funding this work. D.P.M. is supported by the Sydney Brenner Fellowship.
Published ahead of print on 13 September 2006. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»