Previous Article | Next Article ![]()
Journal of Virology, November 2006, p. 10307-10314, Vol. 80, No. 21
0022-538X/06/$08.00+0 doi:10.1128/JVI.00996-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada,1 Department of Clinical Molecular Informative Medicine, Nagoya City University Graduate School of Medical Sciences, Kawasumi, Mizuho, Nagoya, Japan,2 Section of Hepatology, Department of Medicine, University of Manitoba, Winnipeg, Manitoba, Canada3
Received 15 May 2006/ Accepted 3 August 2006
|
|
|---|
|
|
|---|
Since HBV replication involves an error-prone reverse transcription step, the rate of nucleotide change during replication is higher than that found for other DNA viruses and is more similar to the rate observed for the slower-evolving RNA viruses (21, 24). The rate of HBV evolution in hepatitis B virus e antigen (HBeAg)-positive individuals has been estimated to be 1.5 x 105 to 5 x 105 nucleotide substitutions per site per year (1, 13, 24, 29). However, the mutation rate or total accumulated number of mutations appears to be higher in HBeAg-negative patients (11, 33), suggesting that the host immune response plays an important role in HBV evolution.
There are very few reports of long-term longitudinal studies involving the full-length HBV genome, likely due to the difficulty in collecting paired samples from individual chronic HBV patients over a long period (11). The present study analyzed HBV genome sequence changes occurring over time (25 years) in eight HBeAg-negative patients. The molecular evolutionary rate and distribution of mutations occurring throughout the entire genome and in the pre-S gene region of serial sample quasispecies were also investigated.
|
|
|---|
DNA extraction. DNA was extracted from 150 µl of serum by the proteinase K-sodium dodecyl sulfate lysis and phenol-chloroform extraction methods, as described previously (25), and was resuspended (final volume, 30 µl) in sterile, nuclease-free water. The extracted DNA was stored at 20°C.
Viral load determination. HBV DNA quantitation was performed by real-time PCR analysis using a RealArt HBV PCR kit (Artus Biotech, QIAGEN, Mississauga, Ontario, Canada) with an ABI Prism 7500 sequence detection system. Briefly, 2.5 µl of DNA extract was added to 7.5 µl sterile water and 15 µl kit master mix consisting of buffers, enzyme, primers, and probe for the specific amplification of a 134-bp region of the HBV genome. One microliter of kit internal control was also added per reaction to identify possible PCR inhibition. The DNA quantity (international units [IU]/ml) was determined by comparison to external quantitation standards (range, 10 IU to 1 x 105 IU). Real-time PCR cycling parameters and result interpretation were carried out according to the manufacturer's protocol.
PCR amplification. Full-length genome sequencing of HBV DNA was performed by nested PCR with a full-length amplicon obtained using the primers and thermocycling conditions described by Günther et al. (10). Thereafter, several nested PCR steps were performed in order to increase the sensitivity of detection and produce a sufficient amount of amplicon for sequence analysis. The sequences and annealing temperatures of the nested PCR primer sets used are shown in Table 1. PCR was performed using an ultra-high-fidelity polymerase (AccuPrime Pfx DNA polymerase; Invitrogen Life Technologies, Burlington, Ontario, Canada) to ensure low to nil error rates during amplification (5). Reaction tubes for PCR contained 5 µl DNA extract or 2 µl of the first-stage PCR product, AccuPrime Pfx reaction mix (Invitrogen Life Technologies), a 0.5 µM concentration of each primer, and 1 U AccuPrime Pfx DNA polymerase. Thermal cycling parameters for each set of primers were those suggested by the manufacturer (Invitrogen Life Technologies) for three-step cycling using the annealing temperatures listed in Table 1.
|
View this table: [in a new window] |
TABLE 1. Sequencing primers used for this study
|
Quasispecies analysis. Viral quasispecies were investigated by clonal analysis of the pre-S gene regions from four sample pairs. A 479-bp amplicon from the pre-S1/pre-S2 gene region was obtained, using primers P1 and P2 as described previously (17). Amplicons were gel purified and cloned into a pCR2.1-TOPO plasmid vector (Invitrogen Life Technologies) according to the manufacturer's instructions. Ligated products were transformed into Escherichia coli TOP10F cells (Invitrogen Life Technologies), at least 10 individual colonies were picked, and the plasmid DNA inserts were sequenced.
Mean genetic distances of all synonymous and nonsynonymous positions were calculated using the Pamilo-Bianchi-Li (P-B-L) model (MEGA v.2.1). Linear regression analyses based on Tamura-Nei model evolutionary distances were performed to calculate a mean evolutionary rate of quasispecies sequences over the 25-year period.
Nucleotide sequence accession numbers. The full-length genome sequences obtained from the patients at each time point were submitted to the National Center for Biotechnology Information GenBank database under accession numbers DQ463787 to DQ463802.
|
|
|---|
|
View this table: [in a new window] |
TABLE 2. Characteristics of the patient populationa
|
Phylogenetic analysis. The 16 full-length HBV genome sequences were aligned, and a histogram was prepared to visually demonstrate regions of hypervariability and relative conservation along the length of the genome (Fig. 1). Nucleotide changes occurred throughout the entire genome and in each coding region. Regions of hypervariability were observed within the core gene, the 3' end region of the S gene, and other regions of overlap between the polymerase and pre-S/S genes. Conversely, the X gene and the overlap region encompassing the S major hydrophilic region and reverse transcriptase domains B and C within the polymerase gene were observed to have fewer nucleotide substitutions among the 16 sequences.
![]() View larger version (42K): [in a new window] |
FIG. 1. Alignment histogram of 16 full-length HBV genome sequences. The different colored bars above the histogram denote the coding regions for the indicated proteins. The ruler below the histogram denotes the numbers of nucleotides along the length of the genome. Nucleotide substitutions compared to the consensus are shown as colored slashes within the red consensus bar, with increasing diversity shown as follows: orange < green < blue.
|
![]() View larger version (20K): [in a new window] |
FIG. 2. Phylogenetic analysis of 16 full-length HBV genome sequences. Arrows indicate the two sample pairs showing independent evolution (234-6 [red] and 739-11 [blue]). Sequence alignment was performed using ClustalX v.1.8. Evolutionary distances were calculated using the Tamura-Nei model, and the phylogenetic tree topology was evaluated by bootstrap analysis (1,000 replicates) using the neighbor-joining method (confidence values of 50% or greater are shown). The ruler shows the branch length for a pairwise distance equal to 0.02.
|
![]() View larger version (9K): [in a new window] |
FIG. 3. Molecular evolutionary rate estimation based on long-term serial HBV samples from HBeAg-negative asymptomatic carriers of HBV genotype B. The results of regression analysis of evolutionary distances among the six sample pair sequences demonstrating significant phylogenetic clustering are shown. The mean evolutionary rate (solid regression line) from the Tamura-Nei model is indicated above the graph. The 95% confidence intervals of the regression line are indicated by dashed lines.
|
The mean overall genetic distances (including synonymous and nonsynonymous sites) between quasispecies sequences from 1979 and 2004 were determined (Table 3). A significant reduction in the overall genetic distances in 2004 compared to those in 1979 was observed (except for samples from patient 650-16), suggesting that selective pressure occurred within the pre-S region. Following nonsynonymous versus synonymous substitution analysis of quasispecies, only sample pair 234-6 quasispecies showed greater nonsynonymous than synonymous changes over time (ratio, 1.247), indicating positive selection. The other three sample pairs showed no positive selection over the 25-year period (ratios of nonsynonymous to synonymous substitutions, 0.438 [462-4], 0.492 [539-16], and 0.232 [650-16]). Sample pair 234-6 quasispecies sequences also showed a shift within the pre-S2 region, with a three-codon deletion in 6 of 11 clones from 2004 that was not observed in any clones from 1979 (n = 11).
|
View this table: [in a new window] |
TABLE 3. Quasispecies genetic distance analysis of four sample pairs over 25 years
|
![]() View larger version (13K): [in a new window] |
FIG. 4. Comparison of evolutionary rate calculations based on quasispecies versus dominant strain sequences. Evolutionary rates were calculated by regression analyses (Tamura-Nei model) using either (A) quasispecies (cloned) pre-S sequences (n = 114) from four sample pairs or (B) dominant strain (directly amplified) pre-S sequences (n = 14) from seven sample pairs. The differences between the evolutionary rates in panels A and B are not significant (P > 0.05).
|
|
View this table: [in a new window] |
TABLE 4. Ratio of synonymous to nonsynonymous substitutions for coding regions among full-length genomes
|
|
|
|---|
An alignment of all genomic sequences demonstrated the presence of nucleotide substitutions throughout the entire genome. Their distribution was not entirely even along the length of the genome, with regions of apparent clustering or absence of substitutions, suggesting that certain proteins may have greater importance as immune targets. This contrasts with the findings of Hannoun et al. (11), who found the entire HBV genome to be extremely stable, with mutations distributed fairly evenly in all coding regions, particularly in HBeAg-positive patients. Some of the observed substitutions may also be intragenotypic variations within genotype B viruses and not necessarily associated with natural selection. Sequence similarity among all matched pairs was >98%, which is well within the range expected for isolates of the same genotype (
92%).
In this study, low dS/dN values, indicative of positive selection, were observed in certain regions for samples from several patients. Low dS/dN values were observed upon analysis of the core gene, where no overlap exists with other HBV genes. The core protein is an important immune target for both antibody and T-cell responses (16), and therefore positive selective changes within this region are likely an immune evasion strategy of the virus. In general, fewer nonsynonymous than synonymous changes were observed in most regions of the viral genome for samples from all patients. In particular, the reverse transcriptase domains B and C were completely conserved within each matched pair and among all patients, further emphasizing the critical function performed by this region and its requirement for HBV survival (18). Similarly, the X coding region demonstrated relatively negative selection as well as a general lack of nucleotide variation among the matched pairs. The polymerase and X serve essential functions during HBV replication (16), and thus limited nucleotide substitution within these regions would be expected.
The rate of molecular evolution determined by comparing paired samples provides information on the rate of change occurring within a single individual. In particular, the data demonstrate the more rapid or extensive rate of change that occurs in HBeAg-negative patients than in HBeAg-positive patients (11). However, to obtain a more correct mean molecular evolutionary rate for HBV, linear regression analysis of evolutionary distances was performed. This method provides a more accurate value based on a putative ancestral sequence (30), thus avoiding the overestimation resulting from direct comparison of serial sequences (12). Phylogenetic analysis demonstrated that almost all sample pairs clustered uniquely within the tree, indicating an evolutionarily dependent relationship. Serial samples from two study participants (234-6 and 739-11) did not cluster together on the tree, suggesting that the 2004 strain was distinct from the 1979 strain for each pair.
The observation of independent evolution in these two study subjects led us to investigate the contribution of quasispecies to the HBV molecular evolutionary rate. Immune selective pressure coupled with the lack of proofreading activity by the HBV polymerase likely contributes to the development of quasispecies complexity and diversity during infection (27). Viral quasispecies have very closely related genomes but exist in an environment of mutation, selection, and competition, thus creating a dynamic and changing population over time (7). Based on phylogenetic and linear regression analysis of pre-S1/S2 sequence quasispecies from four sample pairs, it was determined that the mean molecular evolutionary rate did not diverge significantly from the rate calculated using pre-S sequences derived from the directly sequenced or dominant strain. This result suggests that the quasispecies populations from HBeAg-negative, asymptomatic chronic HBV carriers during long-term follow-up over 25 years did not contribute significantly to the putative overall evolutionary rate and, therefore, that the dominant sequence is sufficient for rate estimation.
Although the sample pair quasispecies evolutionary rate was not significantly different, the observation that two sample pairs showed independent evolution may be related to quasispecies competition. Replacement of the dominant strain observed in 1979 with a minority quasispecies strain may have occurred due to a selective advantage of the minority variant during the quiescent phase of chronic infection in the study participants (9). For example, the three-codon deletion observed in the pre-S2 region from the majority of 234-6 quasispecies clones from 2004 may contribute a selective advantage to the virus to allow it to become the dominant strain sometime in the future. Such mutations within the pre-S2 region (deletions and start codon mutations) are characteristic of genomes from the HBeAg-negative phase of infection (9). Another explanation for the observed independent evolution may be reinfection with a different HBV genotype B strain during the follow-up period.
The mean nucleotide substitution rate observed in this study was slightly higher than previously estimated rates based on HBeAg-positive carriers, as transmission is assumed to occur predominantly through HBeAg-positive donors (1, 13, 24, 29). However; transmission from carriers negative for HBeAg has been documented (4, 6, 31). The observed evolutionary rate validates previous statements that viruses lacking HBeAg evolve more rapidly, possibly as a function of increased immune pressure during the immune clearance phase of infection (1, 3, 11, 19, 28). The chronic infection phase for all patients investigated in this study was typically quiescent or asymptomatic and HBeAg negative, suggesting a reduction in host immune activity following seroconversion. Therefore, the slightly higher evolutionary rate observed in this study, despite less selection pressure, may be related to the seroconversion event driving quasispecies complexity and diversification. The more diversified quasispecies pool would then undergo competition during the follow-up period to obtain the most "fit," and thus dominant, genome. Furthermore, the reduced selection pressure during the HBeAg-negative chronic phase may allow the accumulation of mutations due to error-prone reverse transcription during replication. Indeed, since overall more synonymous mutations were observed in the coding regions of the study sequences, it is likely that selective outgrowth of sequences having a structure/function advantage for the virus occurred throughout the follow-up period (27).
In conclusion, further analysis of HBV evolutionary patterns should include both HBeAg-positive and -negative symptomatic and asymptomatic patients representing different HBV genotypes to truly characterize HBV sequence divergence over time. In this manner, estimating the molecular clock and origins of HBV may be done more accurately.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»