Journal of Virology, February 2009, p. 1718-1726, Vol. 83, No. 4
0022-538X/09/$08.00+0 doi:10.1128/JVI.02011-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
National Institute for Infectious Diseases Lazzaro Spallanzani, Rome, Italy
Received 24 September 2008/ Accepted 27 November 2008
20% can be detected by sequencing, and only known mutations are detected by LiPA. Massively parallel ultradeep pyrosequencing (UDPS; GS FLX platform) was used to analyze HBV quasispecies in reverse transcriptase (RT) and hepatitis B S antigen (HBsAg) from five drug-naive patients and eight drug-resistant patients. Eight primer pairs were used to obtain partially overlapping amplicons, covering the RT gene from codons 1 to 288 and the complete overlapping HBsAg sequence. A 1% mutation frequency was selected as the cutoff based on an error rate estimated on plasmid DNA. This technology enabled simultaneous analysis of between 2,852 and 18,016 clonally amplified fragments from each patient. The results indicate that UDPS has a relative sensitivity much higher than that of both direct sequencing and LiPA. In addition, the UDPS results are quantitative, allowing establishment of the relative frequency of both known mutations and novel substitutions. Some of the detected RT substitutions led to changes also in HBsAg. On the whole, genotype D presented a higher heterogeneity than genotype A. Considering the high quantity of information that can be provided by a single test from one patient, the short turnaround time, the information on substitution frequency, and the detection of rare variants, there are strong advantages conferred by UDPS, and the new method could play a relevant role in the clinical management of HBV infection and therapy.
Analogues rarely produce control of hepatitis B S antigen (HBsAg) antigenemia, and require prolonged use to control hepatitis B virus (HBV) replication in most patients (7), although recent advances suggest that combined treatment with multiple antiviral compounds may lead to a more rapid and durable HBV replication control (29, 15). In fact, prolonged therapy is associated with the emergence of resistant strains that are responsible for therapeutic failure and disease progression (5, 42). Drug resistance mutations mainly involve the reverse transcriptase (RT) region of the polymerase gene, which contains seven functional domains (A to G). The mutations conferring nucleoside/nucleotide resistance are located essentially in domains A through E (12, 21, 43).
Due to the high variability of the HBV genome, a collection of distinct, albeit closely related viral genetic variants are usually present at any given time in an infected host and are known as viral quasispecies. Antiviral treatment can result in the selection and progressive fixation of viral variants with reduced drug susceptibility, leading to virological resistance and therapy failure. Many published data have shown that the appearance of resistance mutations (genotypic resistance) may precede the increase in viral load (virological resistance) by several months (9, 18) and could be a prognostic marker for the occurrence of viral breakthrough.
Drug resistance tests have been developed in recent years to help clinicians choose therapeutic options suited to the virological status of each patient.
To date, the most commonly used method for detecting drug resistance mutations is direct sequencing after PCR amplification. A major limitation of direct PCR sequencing, however, is its inability to detect variants that are poorly represented in the heterogeneous virus population circulating in a patient. Long experience with human immunodeficiency virus (HIV) therapy has shown that minor drug-resistant variants, although not detected by direct sequencing, may be clinically relevant: under the selective pressure exerted by the drug, they can quickly grow out and become the predominant form, leading to treatment failure (16, 31). To overcome this problem, reverse hybridization line-probe assays (LiPAs) have been developed, which can detect mutations present in as few as 5% of the circulating viral population (13, 17, 34). These assays employ a series of short membrane-bound oligonucleotide probes to detect single mismatches in PCR-amplified HBV DNA. Their major limitation is that only a limited repertoire of well-established mutations can be detected, principally those associated with lamivudine (LAM) and adefovir (ADV) resistance; novel or recently described mutations cannot be identified (23, 44). In addition, this technique does not provide an estimate of mutation frequency, so it is not suitable for dynamic studies. Other sequence-specific genotypic resistance tests are available or are being developed, such as restriction fragment length/mass polymorphism (11, 33, 41), oligonucleotide microarray and gene chip technology (19, 36), mutation-specific real-time PCR (39), matrix-assisted laser desorption ionization-time of flight mass spectrometry (17), and pyrosequencing (14, 20). On the whole, all of the sequence-specific methods share the drawbacks of reverse hybridization.
A very powerful tool, based on the new 454 Life Sciences platform (GS FLX, distributed by Roche), has recently become available. This technology performs massively parallel picoliter-scale amplification and pyrosequencing of individual DNA molecules (26), allowing the simultaneous analysis of thousands of clonally amplified regions of about 200 nucleotides, which increases the probability of detecting minority variants. (A description of the method can be found at http://www.454.com/index.asp.) Recently, the detection and characterization of rare drug-resistant variants in HIV-1 have been reported with the 454 Life Sciences GS20 technology (10, 28, 38).
In the present study, the ultradeep pyrosequencing (UDPS) approach, applied with the GS FLX platform, was used to detect and quantify minor variants in the RT gene of HBV present in the serum of patients. Direct sequencing and INNO-LiPA HBV DR v2 were used as standard reference tests. The corresponding mutations in the S gene open reading frame (ORF) were also determined.
(This study has been accepted for presentation at the 59th Annual Meeting of the American Association for the Study of Liver Diseases.)
PCR amplification and UDPS. UDPS was used to sequence overlapping segments of HBV RT after PCR amplification (ultradeep approach). Since this technique cannot sequence target fragments longer than 250 bp, primers were designed to amplify eight partially overlapping segments covering amino acids (aa) 1 to 288 of the RT region (including all functional domains) and the complete HBsAg region. The primers were designed to anneal to sequences conserved among genotypes, and some were degenerate. All primers used, including the 5' extensions that provide binding sites for pyrosequencing on the genome sequencer, are detailed in Table 1.
TABLE 1. Sequences and positions of primers used for UDPS of HBV
The PCR amplicons were purified with the QIAquick PCR purification kit (Qiagen) and quantified with an Agilent 2100 bioanalyzer (Agilent Life Sciences and Chemical Analysis).
All amplification reactions yielded DNA chains of the expected length and in amounts sufficient to support sequencing of the target. The PCR amplicons of each patient were pooled and subjected to UDPS, carried out with the GS FLX platform according to the manufacturer's instructions.
The pyrosequencing method is more error prone than the Sanger method, and therefore, a plasmid DNA was included as an error rate control, to measure the accuracy of the UDPS. The plasmid clone was obtained from a patient's sample by inserting a PCR amplicon spanning nucleotide positions 180 to 930 into a pCR4-TOPO vector (Invitrogen Corp.). The plasmid clone containing the region of interest was sequenced in parallel by UDPS and by the Sanger method.
Sanger sequencing of the clone was performed on an ABI Prism 3100, by using the BigDye terminator cycle sequencing kit, following the manufacturer's instructions (Applied Biosystems, Warrington, United Kingdom).
Any differences between the two methods were considered to be UDPS sequencing errors. Because the pyrosequencing error rate is higher in homopolymeric regions, the error rates were determined separately for homopolymeric (nucleotide repeats of three or more identical bases) and nonhomopolymeric regions. The error rate was 0.0027 (standard error, 7.4 × 10⁻⁴) within homopolymeric regions and 0.0012 (standard error, 7.48 × 10⁻⁵) in nonhomopolymeric regions. Based on these results, as well as on a theoretical calculation using a statistical approach (6), a conservative cutoff of 1% was adopted to estimate mutation frequency.
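As a rough plausibility check on the 1% cutoff, the following sketch asks how likely it is that sequencing errors alone would push a position above the cutoff. It assumes that errors on different reads are independent and binomially distributed, which is a simplification and not the statistical approach of reference 6. The depth of 2,852 reads is the lowest per-patient coverage reported in this study and is used here only as an illustrative per-position depth; the function name `binom_tail` is hypothetical.

```python
import math

def binom_tail(n, p, k):
    """P(X >= k) for X ~ Binomial(n, p), with 0 < p < 1.

    Each term is computed in log space to avoid overflow for large n.
    """
    if k <= 0:
        return 1.0
    log_p, log_q = math.log(p), math.log1p(-p)
    total = 0.0
    for i in range(k, n + 1):
        log_pmf = (math.lgamma(n + 1) - math.lgamma(i + 1)
                   - math.lgamma(n - i + 1)
                   + i * log_p + (n - i) * log_q)
        total += math.exp(log_pmf)
    return total

# Assumed per-position depth (illustrative) and the measured error rates.
depth = 2852
cutoff = 0.01
reads_at_cutoff = math.ceil(cutoff * depth)  # reads needed to reach 1%

for label, rate in [("homopolymeric", 0.0027), ("nonhomopolymeric", 0.0012)]:
    prob = binom_tail(depth, rate, reads_at_cutoff)
    print(f"{label}: P(errors alone reach the cutoff) = {prob:.2e}")
```

Under these assumptions the probability is very small for both error rates, which is consistent with describing the 1% cutoff as conservative. Real pyrosequencing errors are not fully independent, so the figures should be read as an order-of-magnitude guide only.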
Analysis of HBV resistance by direct sequencing and INNO-LiPA HBV DR v2. The presence of HBV RT mutations was also determined by direct sequencing. Two sets of primers (1 FW to 4 RW and 5 FW to 8 RW, without the adaptors) were used, covering aa 1 to 288. The PCR conditions were the same as reported above. The amplified PCR products were purified with the QIAquick PCR purification kit (Qiagen). After quantification by gel electrophoresis with a low-weight molecular standard, sequencing was performed on an ABI Prism 3100, as described above.
Consensus sequences were built for RT and HBsAg of both genotypes A and D, using reference sequences derived from GenBank (genotype A, E00010 and V00866; genotype D, Y07587, AF043594, Y796031, AY721610, and AY796030). The sample sequences were compared with the genotype-matched consensus sequences by BLAST search analysis, to identify the amino acid changes in both ORFs. In the samples from the drug-experienced patients, the genotypic resistance pattern was also determined by INNO-LiPA HBV DR v2 (Innogenetics NV, Ghent, Belgium).
Treatment of sequence data and statistical analysis. The local pairwise Smith-Waterman-Gotoh (SWG) algorithm (8, 35) was applied to sequences obtained by UDPS in order to determine changes with respect to the genotype D and A consensus sequences. A modified version of this method had already been employed by Wang et al. (38) to analyze mutations of HIV-1 sequences obtained by UDPS: they proposed a variant of the Smith-Waterman algorithm, called asymmetric Smith-Waterman, in which alignment scores are recalculated by weighting bases by the phred-equivalent quality value of each UDPS base call. Wang et al. benchmarked asymmetric Smith-Waterman against standard Smith-Waterman and BLAST (2) and found no significant differences among the three algorithms, except for highly distant isolates (such as different subtypes). Thus, we applied the standard SWG algorithm, adding an optimization of the parameter space (gap-open and gap-extension penalties) by grid search over [5:25] and [0.25:5], using step sizes of 5 and 0.5, respectively, maximizing the nucleotide local similarity and minimizing the number of 1-base or 2-base frameshifts. The procedure for executing the SWG algorithm and extracting mutations was implemented using the JAligner package (http://jaligner.sourceforge.net/) for the Java programming language (http://java.sun.com). Minor refinements to the original algorithm were designed to correct for 3-base insertions/deletions not in frame (which were placed back in frame when possible), to translate ambiguities, and to report and correct 1-base or 2-base frameshifts, accounting for the higher pyrosequencing error rate (6).
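The grid search over gap penalties described above can be sketched as follows. This is an illustration in Python, not the JAligner-based Java implementation used in the study. The `align` callable is a hypothetical stand-in for any affine-gap local aligner that returns the two gapped strings, and the way the two objectives are combined (highest identity first, then fewest frameshift gaps) is an assumption, since the text does not state how they were weighted.

```python
import re
from itertools import product

def penalty_grid():
    """Gap-open values 5..25 (step 5) and gap-extension values 0.25..4.75 (step 0.5)."""
    opens = [5 + 5 * k for k in range(5)]
    exts = [0.25 + 0.5 * k for k in range(10)]
    return list(product(opens, exts))

def identity(aligned_query, aligned_ref):
    """Fraction of alignment columns in which both sequences carry the same base."""
    pairs = list(zip(aligned_query, aligned_ref))
    matches = sum(1 for q, r in pairs if q == r and q != "-")
    return matches / len(pairs) if pairs else 0.0

def frameshift_gaps(aligned_query, aligned_ref):
    """Count gap runs whose length is not a multiple of 3 (1- or 2-base shifts)."""
    runs = re.findall(r"-+", aligned_query) + re.findall(r"-+", aligned_ref)
    return sum(1 for run in runs if len(run) % 3 != 0)

def best_penalties(query, ref, align):
    """Return the (gap_open, gap_ext) pair with the best alignment on the grid.

    `align(query, ref, gap_open, gap_ext)` is a hypothetical callable that
    returns (aligned_query, aligned_ref) with '-' marking gaps.
    """
    def quality(params):
        aligned_query, aligned_ref = align(query, ref, *params)
        return (identity(aligned_query, aligned_ref),
                -frameshift_gaps(aligned_query, aligned_ref))
    return max(penalty_grid(), key=quality)
```

The grid holds 50 penalty pairs, so the aligner is called 50 times per read; ties are resolved in favor of the first pair in grid order.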
Variations from the reference consensus sequences, identified by the UDPS method, were considered as true mutations if previously described to be associated with drug resistance; otherwise, they were regarded as possible novel mutations or polymorphisms.
The prevalence of substitutions was calculated as relative frequencies in the sequenced subregions, with 95% confidence intervals computed via the Agresti-Coull approximation (1).
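For readers unfamiliar with the Agresti-Coull approximation, a minimal sketch is given below. It uses the standard textbook form of the interval; the function name and the example counts are illustrative and are not taken from the study data.

```python
import math

def agresti_coull(variant_reads, total_reads, z=1.959964):
    """Approximate confidence interval for a proportion (Agresti-Coull).

    z = 1.959964 corresponds to a two-sided 95% interval.
    """
    n_adj = total_reads + z * z
    p_adj = (variant_reads + z * z / 2) / n_adj
    half_width = z * math.sqrt(p_adj * (1 - p_adj) / n_adj)
    return max(0.0, p_adj - half_width), min(1.0, p_adj + half_width)

# Illustrative counts: a substitution seen in 29 of 2,852 reads (about 1%).
low, high = agresti_coull(29, 2852)
print(f"frequency = {29 / 2852:.4f}, 95% CI = ({low:.4f}, {high:.4f})")
```

The adjustment adds z²/2 pseudo-observations to the variant count and z² to the total, which keeps the interval well behaved for the low frequencies typical of minority variants.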
In the drug-experienced patients, resistance mutations were identified by UDPS and compared with those identified by DR v2 and by direct sequencing (Table 2). None of the mutations detected at frequencies of <24% by UDPS were detected by direct sequencing, and some of these (four mutations for patient 1, four for patient 5, and one for patient 6, whose frequency ranged between 2% and 8%) were missed also by DR v2. Only in one case was a mutation (M204V in patient 4) detected by DR v2 but missed by both UDPS and direct sequencing; these latter results were confirmed by repeated testing in both systems, suggesting that the DR v2 result could be a system artifact.
TABLE 2. Comparison of UDPS, DR v2, and direct sequencing for HBV from eight patients under treatment
In addition, a number of RT gene substitutions different from the major drug resistance-associated mutations (hence not included in DR v2) were detected at variable frequencies in both drug-treated and drug-naive patients. The complete list of the variants found in all patients is reported in Table S1 in the supplemental material. It is noteworthy that some mutations known to be associated with drug resistance (V214A and/or M204I) were detected by UDPS as minority components of viral quasispecies in two drug-naive patients (patients 11 and 12). In particular, the primary LAM-associated mutation M204I, present at a frequency of 1.5% in patient 11, corresponded to a stop codon in HBsAg.
From Table S1 in the supplemental material, it is also evident that four substitutions leading to the introduction of a stop codon in RT were detected by UDPS with frequencies ranging from 1.0% to 2.8% in five patients.
The overall distribution of all substituted codons detected by UDPS within the RT gene from each patient, according to their position in the functional and the interdomain regions, is shown in Fig. 1A. On the whole, the pattern of changes versus the genotype consensus sequence varied according to patient source and treatment schedule, including some substitutions with a frequency of >20% and several additional changes spread along the gene occurring with low frequency. Interestingly, both functional and interfunctional domains presented variant positions in both treated and drug-naive patients. The total number of changes in treated patients was always higher than in drug-naive patients, both in functional domains (48 versus 19, ratio of 2.5) and in interdomains (96 versus 63, ratio of 1.5). Generally, HBV genotype D presented a higher number of substitutions compared to genotype A in both treated and drug-naive patients. However, due to the small number of patients, it is not possible to perform statistical evaluation of the differences in the mutation patterns in relation to genotype and treatment.
FIG. 1. Frequency of substitutions within RT (A) and HBsAg (B) detected by UDPS in 13 HBV-positive patients. Patients (pts) are reported according to genotype and treatment status. On the top of each column, the relevant protein domains are indicated with a thick line, including functional domains for RT and the "a" determinant (*) for HBsAg. The frequency scale axis has been cut at 20%, to allow better visualization of infrequent substitutions. For details, see Table S1 in the supplemental material.
TABLE 3. Frequency of substitutions in RT detected by UDPS in the study patients not previously described in literature
FIG. 2. Distribution of changed codons in the overlapping ORFs of RT and HBsAg detected by UDPS in one patient. The overall distribution of changed positions and their relative frequency are shown for patient 2, reporting the two overlapping ORFs in the same panel. The mutated codons are shown according to their localization in RT and HBsAg. The "a" determinant (*) of HBsAg is highlighted at the top, while functional and interdomain regions of RT are underlined at the bottom of the panel. The frequency scale axis has been cut at 20%, to allow better visualization of infrequent substitutions. For details, see Table S1 in the supplemental material.
In this study, we describe for the first time the use of the massively parallel UDPS method, based on the next generation sequencing platform GS FLX, to analyze viral quasispecies in the RT/S region of the HBV genome.
To date, the most commonly used method for detecting drug resistance mutations is direct sequencing after PCR amplification by the Sanger method (population-based sequencing approach). This approach allows identification of all substitutions included in the amplified fragment, including primary, compensatory, and novel mutations; the simultaneous determination of viral genotype; and the identification of amino acid changes in the overlapping S gene. However, direct sequence-based methods are only capable of detecting substitutions present in viral quasispecies with a prevalence of ≥20% of the total HBV population (30). In contrast, sequence-specific methods (such as DR v2), also widely used in clinical practice, are rather sensitive and can detect minority variants represented at a frequency down to about 5%, but allow detection of only a limited set of known mutations.
The UDPS approach enables the simultaneous analysis of thousands of clonally amplified fragments from each patient, increasing the probability of highlighting minority variants. In fact, the results indicate that UDPS has a much higher relative sensitivity not only compared to direct sequencing, as expected, but also compared to DR v2. Furthermore, a large set of substitutions were detected by UDPS and the results were quantitative, allowing us to establish the relative frequency of both known and novel changes in both the RT and the overlapping S gene ORFs, in drug-experienced as well as in drug-naive patients.
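The link between read depth and the chance of observing a minority variant can be illustrated with a simple sampling model. The sketch below assumes that reads are independent draws from the viral population and ignores sequencing error and PCR bias, so it is an idealized upper bound rather than a description of the assay; the function names are hypothetical.

```python
import math

def detection_probability(frequency, reads):
    """Probability that a variant at the given frequency appears in at least one read."""
    return 1.0 - (1.0 - frequency) ** reads

def reads_needed(frequency, confidence=0.95):
    """Smallest number of reads giving at least `confidence` chance of one observation."""
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - frequency))

# Depths reported in this study ranged from 2,852 to 18,016 reads per patient.
for freq in (0.01, 0.001):
    print(freq, detection_probability(freq, 2852), reads_needed(freq))
```

In this idealized model a few hundred reads suffice to observe a 1% variant at least once, so the practical limit of detection is set by the error-based cutoff rather than by sampling depth.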
Pyrosequencing (i.e., sequencing by synthesis) has been used previously to detect resistance mutations in HBV (14, 20). Compared to the classical pyrosequencing, UDPS presents a major advantage. In fact, a limitation inherent in classical pyrosequencing is that the read length is approximately 40 bp. This aspect, while allowing a large-scale screening for known point mutations, such as those at codons 180 and 204 (14, 20), renders impractical the use of pyrosequencing for whole-gene sequence analysis. On the contrary, the UDPS approach produces amplicons of about 200 bp: hence, a proper combination of primers (in this study, eight primer pairs were used) permits one to easily cover the whole RT gene, allowing identification of the entire set of mutations spread along this gene.
On the whole, UDPS results confirm the higher heterogeneity of HBV genotype D compared to genotype A (4). In addition, the patients on treatment showed a higher variability in both functional and nonfunctional domains of RT as compared to drug-naive patients, but the difference was higher for functional domains, supporting the concept that the selective pressure of the drug is stronger for domains involved in enzymatic activity. However, the older age of infection in treated patients may also play a role in the accumulation of substitutions, irrespective of selective pressure exerted by treatment. An alternative explanation is that treatment with analogues actually could be mutagenic to the virus, resulting in a higher mutation frequency in treated patients.
Interestingly, a number of changes, both novel and already described as associated with drug resistance, were found at variable frequencies in drug-naive patients. Specifically, rtM204I, corresponding to a stop codon in HBsAg, was detected at low frequency in patient 11, and V214A, which is associated with reduced sensitivity to ADV and tenofovir (TDF), was detected in two drug-naive patients (patients 11 and 12) at frequencies of 7.0% and 1.9%, respectively. The presence of mutations in patients never exposed to antiviral drugs is not unexpected: it is well known that the viral replicative enzyme is error prone, so random mutations arise continuously during the infection and may be fixed by random drift or by positive selection. In fact, several studies have demonstrated that mutations associated with LAM resistance do occur naturally and can be found at low percentages in HBV carriers who have never received LAM or other antiviral therapies (24). Recent data suggest that preexisting LAM mutations at frequencies as low as 0.1% in the viral population of patients who have not received LAM treatment may increase the risk of entecavir (ETV)-associated mutations (3).
In addition to the possibility to detect rare mutations before the start of therapy, because UDPS provides quantitative results, it may represent a suitable tool for the precise monitoring of viral quasispecies dynamics along treatment history, allowing the early identification of increasing frequencies of HBV mutants before the appearance of viral breakthrough. In fact, the appearance of resistance mutations (genotypic resistance) may precede the increase in viral load (virological resistance) by several months (9, 18) and could be a prognostic marker for the occurrence of viral breakthrough.
It is well known that the RT and S genes of HBV have overlapping ORFs, and thus drug resistance mutations in RT can also lead to changes in the HBsAg (27, 37). The new technology may be useful also to detect HBsAg substitutions. From our results, the frequency of changed amino acids in RT and HBsAg was not superimposable at all positions, implying that some nucleotide changes were synonymous in one ORF and nonsynonymous in the overlapping ORF; the driving force for selecting such variations is presumably acting differently on RT and HBsAg. Further studies are necessary to clarify this point, and it is foreseen that the new technology will be extremely helpful to this aim.
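The effect of overlapping reading frames can be made concrete with a short translation sketch. The 12-nucleotide fragment below is an illustrative sequence chosen to encode the YMDD motif around RT codon 204; it is not taken from the study data, and the one-nucleotide offset used for the S frame is an assumption consistent with the rtM204I/stop codon correspondence reported above.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons ordered T, C, A, G at each position.
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

def translate(seq, offset=0):
    """Translate complete codons of `seq`, starting `offset` bases in ('*' = stop)."""
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)

wild_type = "TATATGGATGAT"  # illustrative fragment: RT frame reads Y M D D
mutant = "TATATAGATGAT"     # single G-to-A change in the third base of the Met codon

for name, seq in [("wild type", wild_type), ("mutant", mutant)]:
    print(name, "RT frame:", translate(seq, 0), "S frame:", translate(seq, 1))
```

In this toy fragment the single nucleotide change turns methionine into isoleucine in the RT frame and tryptophan into a stop codon in the S frame, showing how one substitution can have different consequences in the two ORFs.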
One aspect that deserves further investigation is the high number of mutations leading to stop codons in RT and in HBsAg. Their frequency was generally low (in the range of 1 to 3%), but they might represent a relevant portion of the viral population in some patients (see, for example, sW36stop, detected at 17.1% in patient 11). It is likely that situations similar to that of rtA181T (corresponding to sW172stop), recently described to affect the viral replication rate as a dominant-negative mutant (40), are more widespread than expected. The new method is particularly suitable for identifying mutations whose presence may have been overlooked so far because of their low frequency. In fact, low frequencies are expected for mutated virions that cannot encode a correct RT or HBsAg, as their function is expected to be strongly impaired and their maintenance in viral quasispecies should depend on the concomitant presence of wild-type virions providing the lost function(s). Similar situations may be more common than expected and may provide, for instance, a possible explanation for primary nonresponse to antiviral treatment in the absence of detectable mutations in some patients (22).
As this study included a small number of patients, HBV substitutions detected by UDPS and the different patterns observed in drug-treated versus drug-naive patients require further investigation to clarify their clinical significance. In addition, it remains to be established whether all of the variations with respect to consensus sequences represent true mutations or simple polymorphisms. Despite these limitations, our results clearly indicate that genotyping analysis based on the next generation sequencing platform is suitable for characterization of genetic diversity and detection of minor variants that may have clinical relevance for adapting patient management and adjusting treatment strategies.
We thank Carla Nisii for revising the English in the manuscript.
Published ahead of print on 10 December 2008.
Supplemental material for this article may be found at http://jvi.asm.org/.