Previous Article | Next Article ![]()
Journal of Virology, November 2002, p. 11079-11090, Vol. 76, No. 21
0022-538X/02/$04.00+0 DOI: 10.1128/JVI.76.21.11079-11090.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Eva Herrmann,1,2,
Katharina Bruch,1 and Stefan Zeuzem1*
Medizinische Klinik II, Johann Wolfgang Goethe-Universität, 60590 Frankfurt am Main,1 Fachbereich Mathematik, Technische Universität, 64289 Darmstadt, Germany2
Received 1 March 2002/ Accepted 2 July 2002
|
|
|---|
|
|
|---|
). IFN-
alone or in combination with ribavirin leads to sustained virologic response in 20 to 56% of patients with chronic hepatitis C (24, 26, 37, 53). The strongest predictive parameter for a sustained virologic response to treatment is the HCV genotype, with HCV genotype 1 (HCV-1) being the least sensitive to IFN-
-based therapy (24, 37).
For HCV as for other eukaryotic viruses, including the influenza, vaccinia, and human immunodeficiency viruses (6, 22, 27), the ability to evade the antiviral effects of IFN-
has been reported. Both the HCV envelope (E) 2 protein and the nonstructural (NS)-5A protein of HCV-1a/b isolates were shown to inhibit IFN-
-induced double-stranded RNA-activated protein kinase (PKR) in vitro (14, 51). Moreover, cellular expression of the NS5A protein from HCV-1a/b isolates confirmed inhibitory effects on IFN-
activity (13, 33, 34, 36).
In 1995 and 1996 Enomoto et al. demonstrated clinically that an increasing number of mutations within a restricted region of the HCV NS5A protein named the IFN sensitivity determining region (ISDR; amino acids [aa] 2209 to 2248 according to HCV-J [18]) was correlated with treatment response in isolates of HCV-1b-infected patients (10, 11). The importance of mutations within the NS5A ISDR of HCV-1b-infected patients was confirmed by other investigations from Japan (7, 20). In several studies from Western countries, however, the majority of HCV-1b-infected patients with sustained virologic response showed no or only a small number of mutations within the ISDR (19, 35, 44, 46, 49, 55). Furthermore, mutations within other regions of the NS5A protein (the PKR-binding domain and the variable region V3) were shown to correlate with virologic response in HCV-1-infected patients (31, 43). The methodological approach of counting the number of mutations within multiple regions of the NS5A protein for correlation with treatment response may be insufficient because it does not consider the change of functional properties by amino acid mutations and the location of the amino acids with respect to the secondary structure of the resulting protein. Furthermore, multiple tests are inappropriate when proving an overall significance for the existence of local accumulations of mutations because the probability of observing a significant correlation increases with the number of regions tested.
In the present study, we sequenced full-length NS5A genes from 45 HCV-1b-infected patients for mutational analyses in correlation with treatment outcome on the basis of viral kinetics, functional properties of amino acid mutations, and the predicted secondary structure of the NS5A protein. A new statistical model was developed for testing the significant local accumulation of mutations with specific functional data analysis procedures (47). Moreover, statistical methods were evaluated by analogous investigations on randomly generated sequences.
|
|
|---|
three times per week given subcutaneously for 24 to 48 weeks or 6 MU IFN-
for the initial 24 weeks, followed by 3 MU IFN-
given thrice weekly for a total of 48 weeks (cumulative doses of 216 to 648 MU). Thirty-one patients were treated in combination with ribavirin given perorally (1,000 mg/day for patients of
75 kg [body weight] or 1,200 mg/day for patients of >75 kg). Quantitative measurement of HCV RNA and HCV genotyping. Serum samples were prepared in a laminar flow bench and frozen at -80°C. Quantitative detection of serum HCV RNA from pretreatment sera and during therapy for viral kinetics (three measurements between days 2 and 18, 5 and 32, and 13 and 166, respectively) was performed by branched DNA assay (Versant Quantitative HCV RNA 3.0; Bayer Diagnostics, Emeryville, Calif.). Genotyping of HCV(48) was performed by reverse hybridization assay (INNO LiPA HCV-II; Innogenetics, Ghent, Belgium).
Amplification of HCV NS5A RNA by reverse transcription-PCR and sequence analysis. After extraction of HCV RNA from 100 µl of pretreatment serum, cDNA was generated by using appropriate specific antisense primers. The HCV NS5A gene (codons 1973 to 2419 according to HCV-J [18]) was amplified by nested PCR in four overlapping parts with different sets of primers (Table 1). After an initial denaturation step of 95°C for 2 min, 35 cycles of 95°C for 30 s, 55°C for 50 s, and 72°C for 2 min were performed for the first- and second-round PCRs in a PE9700 thermocycler (Perkin-Elmer Cetus, Norwalk, Conn.). The resulting amplification product was analyzed on a 1.5% agarose gel stained with ethidium bromide.
|
View this table: [in a new window] |
TABLE 1. Primers for NS5A sequencing
|
Phylogenetic and conformational analysis. Multiple alignment of amino acid sequences, calculation of the consensus sequence (cons), and phylogenetic analyses were performed as previously described (43). Residue accessibility (buried, exposed, and unknown) within the secondary structure of the NS5A protein on the basis of the 45 full-length NS5A sequences from the present study and 25 published full-length NS5A sequences (9, 31) was predicted by the PHD and PROFacc programs of B. Rost et al. (http://maple.bioc.columbia.edu/predictprotein/) (39). The methods and validity of the prediction programs were recently reviewed by Rost and Sander (41).
Analyses of amino acid mutations according to their functional properties (conserved versus nonconserved mutations) were based on six different classes of amino acids with predicted similar properties: positive (R and K), negative (D and E), aromatic (H, F, W, and Y), polar (N, Q, S, and T), and aliphatic (A, I, L, M, V, and C) amino acids and amino acids with special conformational properties (G and P) (28).
In addition to the 45 sequences of the present study, published NS5A HCV-1b sequences were analyzed. Duverlie et al. reported full-length NS5A sequences from 11 nonresponder patients and from 8 patients who achieved a sustained virologic response (9). These sequences were retrieved from the EMBL nucleotide sequence database (accession numbers AF033358 to AF033376). NS5A isolates AF033364 and AF033365 were identical and were regarded as one isolate for sequence analyses. Nousbaum et al. published HCV-1b full-length NS5A sequences from four patients with sustained virologic responses (patients 11 and 12 were as published previously and patients 10 and 19 were from S. Polyak [unpublished data]; AF265148, AF265151, AF265145, and AF265157) and from two virologic nonresponders (patients 1 and 13; AF265142 and AF265154) (31). For analysis of the NS5A sequence from patient 19, the insertion of one nucleotide (adenine at position 6441 according to the prototype sequence HCV-J) was removed.
Kinetic modeling.
A biphasic model for the initial viral decay was used (29, 54). Estimates for the second-phase slope
2 were obtained by a least-squares linear fit of the log viral load. Quantifications below the detection limit were included by using the Tobit approach. The resulting nonlinear minimization was performed numerically by using Matlab (MathWorks, Inc., Natick, Mass.). The quotient of pretreatment viral load and the estimated viral load 24 h after starting therapy was used as an approximation of the efficiency factor
on viral production (21, 29).
Functional data analysis and statistics.
Local accumulations of mutations were analyzed on the basis of a nonparametric binomial regression model, assuming independent amino acid exchanges. This assumption was justified as a first-order approximation in the related context of sequence segmentation (5). A specific functional data analyses procedure (47) was defined to test whether differences between the mutational frequencies for responding and nonresponding patients can be explained by a constant shift only. The test was based on
![]() | (1) |
R and
NR are estimates of the mutational probability function for sequences from responding and nonresponding patients, respectively, and &
circ; is a frequency estimator of the global shift.
R and
NR are estimated nonparametrically by smoothing the local mutational frequencies with Gaussian kernel weights, and bandwidth b = 1.5. The asymptotic normal distribution of T was derived analytically as:
![]() | (2) |
![]() | (3) |
![]() | (4) |
Standard pointwise tests for smoothed mutational frequencies (bandwidth 3) of the genuine sequences and of sets of random sequences were applied. Continuous variables were compared by using the Mann-Whitney U test, and two continuous variables were analyzed by Spearman rank correlation. Unless indicated otherwise, all tests were two-tailed and P values of <0.05 were considered significant.
The clinical and biochemical characteristics of patients are expressed as the mean ± the standard deviation or median and range as appropriate.
|
|
|---|
treatment. Mutations were defined always with respect to the consensus sequence of all investigated NS5A sequences. The different potentially functional regions of the NS5A protein are shown in Fig. 1.
![]() View larger version (34K): [in a new window] |
FIG. 1. Location of different regions of interest within HCV NS5A protein. The NS5A protein (aa 1973 to 2419), the potential region for cytoplasmatic retention (aa 1973 to 1999), potential cleavage sites (aa 2126 to 2127, 2167 to 2173, and 2361 to 2362), binding to NS4A (aa 2135 to 2139), a region with potential hyperphosphorylation sites and amino acids important for replication initiation (aa 2197 to 2204), the ISDR (aa 2209 to 2248), the PKR-binding domain (aa 2209 to 2274), the HCV-1a basal phosphorylation site (aa 2321), the NLS (aa 2326 to 2334), and the V3 region (aa 2356 to 2379) are indicated according to the numbering system of HCV-1b prototype HCV-J (18).
|
|
View this table: [in a new window] |
TABLE 2. Mutations in different parts of the NS5A region in 45 HCV-1b-infected patients
|
), and the second-phase slope (
2) representing the degradation rate of infected hepatocytes were estimated. A higher number of mutations within the complete NS5A protein correlated with a more rapid HCV RNA decay in the second phase (one tailed P = 0.034). Furthermore, a trend toward a stronger clearance of HCV RNA in correlation with a higher number of mutations was observed (one-tailed P = 0.125). Phylogenetic analyses showed no differences between full-length NS5A isolates from virologic responders compared to nonresponders (data not shown).
Functional data analyses. The number of mutations did not correlate with treatment response in any of the known putative functional NS5A regions. In contrast, several studies previously reported a correlation between mutational frequencies within different NS5A regions and treatment response. The probability of observing significant correlations between the number of mutations within NS5A regions and treatment response increases with the number of regions tested. The overall significance of these tests derived from pointwise significance levels within arbitrary regions is difficult to determine.
In the present study, mutational analyses from full-length HCV-1b NS5A sequences available from patients with known treatment outcomes (n = 70; 45 patients from the present study and 25 patients from earlier studies [9, 31]) were compared with analyses of mutational frequencies for simulated sequences. The 70 NS5A sequences were divided into two groups: virologic responders (SR and ETR, n = 34) and virologic nonresponders (NR, n = 36). The simulated sequences were assigned randomly to be from virologic responders (n = 34) or from nonresponders (n = 36).
Figure 2 shows the smoothed mutational frequencies of genuine (Fig. 2A) and simulated sequences (Fig. 2B), respectively. For both genuine and simulated sequences within different NS5A regions, the number of mutations was higher for virologic responders than for nonresponders. Therefore, in a next step significance levels of the number of mutations in correlation with treatment response throughout the complete NS5A protein were calculated (Fig. 3). Different regions with a significant correlation between the number of mutations and the treatment response were observed for both genuine and simulated sequences (Fig. 3). Furthermore, regions with significant correlation between the early viral decline and treatment response were observed by screening the complete NS5A of genuine and simulated sequences (Fig. 3). Indeed, such regions could be identified in the majority (94%) of 500 simulated samples of random sequences.
![]() View larger version (39K): [in a new window] |
FIG. 2. Smoothed mutational frequencies of all full-length NS5A sequences from HCV-1b-infected patients (A) and of a typical sample of simulated sequences randomly assigned to derive from virologic responders and nonresponders (B). Virologic responders (SR and ETR) are indicated in green, and virologic nonresponders are indicated in blue. The brown-shaded areas indicate the potential functional regions within the NS5A protein according to the data shown in Fig. 1.
|
![]() View larger version (51K): [in a new window] |
FIG. 3. Pointwise significance levels of mutations in correlation with early viral kinetics (efficiency factor and second-phase decay 2) and of mutational frequencies in correlation with treatment response of full-length NS5A sequences from HCV-1b-infected patients (A) and of simulated sequences from Fig. 2B randomly assigned to belong to the genuine viral kinetic data (B). The black line indicates the significance levels from the correlation of mutational frequencies with treatment response. The blue and red lines indicate the significance levels from the correlation of the number of mutations with the efficiency factor and second-phase decay, respectively. Dotted horizontal lines indicate the 0.05 and 0.01 significance levels. The brown-shaded areas show the potential functional regions within the NS5A protein according to Fig. 1.
|
![]() View larger version (59K): [in a new window] |
FIG. 4. Shifted squared differences (shown with the original sign [±]) of smoothed mutational frequencies from virologic responders and virologic nonresponders based on the full-length NS5A sequences from HCV-1b-infected patients (A) and on the simulated sequences from Fig. 2B (B). Significant local differences are present if the gray-shaded areas under the curve are larger than the area within the black horizontal lines. The gray-shaded vertical bars show the potential functional regions within the NS5A protein determined according to the data presented in Fig. 1.
|
![]() View larger version (50K): [in a new window] |
FIG.5. Smoothed mutational frequencies of full-length NS5A sequences from HCV-1b-infected patients with respect to the localization of amino acid residues within the secondary structure of the NS5A protein. Red-shaded areas indicate exposed amino acid residues, and white areas indicate buried amino acid residues or unknown localization. (A and B) Smoothed mutational frequencies based on all amino acid mutations (A) and based on nonconserved amino acid mutations only (B). Virologic responders (SR and ETR) are indicated in green, and virologic nonresponders are indicated in blue. The brown-shaded areas at the bottom indicate the potential functional regions within the NS5A protein according to the data presented in Fig. 1.
|
![]() View larger version (24K): [in a new window] |
FIG. 6. Box plots of mean mutational frequencies of full-length NS5A sequences from HCV-1b-infected patients based on all mutations (A) and on all nonconserved mutations (B). Horizontal joint and dotted lines indicate median and mean, respectively. The term "buried" indicates analysis for amino acids that are predicted to be located within the buried core of the NS5A protein or with unknown localization; the term "exposed" indicates analysis for amino acids that are predicted to be located on the surface of the NS5A protein. Abbreviations: resp., virologic responders to antiviral therapy (SR and ETR); non-resp., virologic nonresponders to antiviral therapy (NR).
|
|
|
|---|
-based treatment (7, 11, 44, 46, 49, 52). However, in the majority of patients from western countries with a virologic response, fewer than four mutations within the ISDR are present, and sensitivity or resistance to antiviral therapy cannot be explained by the ISDR hypothesis (44, 46, 49, 55). Expression in cell culture systems generally confirmed IFN-
resistance mechanisms of the NS5A protein (13, 33, 34, 36). However, resistance to IFN-
was independent of the presence or absence of mutations within the ISDR. Therefore, other regions within the NS5A protein outside the ISDR may be involved in IFN resistance mechanisms. Mutations in other regions of the NS5A protein (PKR-binding domain, V3 region, carboxy-terminal part) have been described as associated with virologic response (9, 15, 31, 43).
In the present study, full-length NS5A sequences of 45 patients infected with HCV-1b were investigated for a possible correlation of mutational frequencies with treatment response. Patients were treated with IFN-
with or without ribavirin, and the response to antiviral therapy was carefully characterized. Furthermore, the initial virologic response to antiviral therapy was assessed by first- and second-phase kinetic analysis (29, 54). Since putative NS5A treatment resistance mechanisms have been characterized as directed against IFN-dependent pathways, patients with a virologic treatment response (SR and ETR patients) were compared to virologic nonresponders (NR patients).
The number of mutations throughout the complete NS5A protein correlated positively with the virologic response to treatment (P = 0.008). Furthermore, independent of different treatment regimens (3 to 6 MU of IFN-
with or without the addition of ribavirin), a more rapid second-phase viral decline correlated significantly with multiple mutations within the NS5A protein (one-tailed P = 0.034). However, in none of the known presumably functional regions within the NS5A protein (transmembrane domain, cleavage sites, NS4A association, ISDR, PKR-binding domain, NLS, and V3 region) was any significant correlation between mutations and treatment response observed.
Therefore, in a next step, analyses in other regions of the NS5A protein were performed based on 70 full-length NS5A sequences (31). Screening multiple regions of the complete NS5A by pointwise significance levels showed several regions with significant correlations between the mutational frequencies and treatment response and significant correlations between the mutational frequencies and the early viral decline for genuine sequences (Fig. 3A). Because the significance levels of multiple and correlated tests are difficult to interpret, random sequences were generated on the basis of a newly developed nonparametric binomial model and randomly assigned to be from virologic responders or nonresponders (Fig. 3B). Indeed, regions with pointwise significant correlations between mutational frequencies and virologic response could be identified in 94% of 500 simulated samples of random sequences. These findings are neither restricted to specific model assumptions nor to the assumed mutational probability function. Hence, comparing results from genuine sequences with those from randomly generated sequences can help to avoid misinterpretations of statistical results in general and especially showed that the identification of local accumulations of mutations from pointwise significance levels as in previous studies is problematic.
For further analyses of NS5A isolates, the functional consequences of amino acid exchanges, as well as predictions of the localization of amino acid residues in the core or on the surface of the resulting NS5A protein were studied. Amino acids were grouped according to functional properties to differentiate between conserved and nonconserved amino acid mutations (28). For conformational analysis, the amino acid residue accessibility within the secondary structure of the NS5A protein was predicted (39, 40). Exchange of amino acids with identical functional and conformational properties (conserved), as well as exchange of amino acids within the core of the resulting NS5A protein, should be of minor relevance, whereas nonconserved amino acid mutations and especially mutations for amino acids located on the predicted surface of the NS5A protein may be of functional importance. Analysis of the genuine NS5A isolates of the 70 HCV-1b-infected patients for predicted buried amino acid residues showed no differences between the virologic responders and nonresponders for the number of all (conserved and nonconserved) mutations, as well as all nonconserved mutations, in correlation with the treatment response. However, the number of all mutations and all nonconserved mutations was significantly higher for virologic responders compared to that for nonresponders for amino acids predicted to be located at the surface of the resulting NS5A protein (P = 0.0001 and P = 0.0029, respectively).
The existence of local accumulations of mutations truly correlating with the treatment response was analyzed by a new specific functional data analysis procedure. Significant local differences of mutational frequencies were detected for the complete NS5A protein (P = 0.018), as well as on the basis of predicted buried and exposed amino acid residues (P = 0.038), between virologic responders and nonresponders with a main focus upstream of the V3 region (aa 2350 to 2370) of the NS5A protein. At present, the functional relevance of this region is unknown. In a recent study, mutations in two variable regions within the carboxy-terminal part of the NS5A protein (a region downstream the PKR-binding domain and the V3 region) were characterized as correlated with the treatment outcome (31). Therefore, together with the results of the present study, the V3 region and flanking sequences may be involved in IFN-
resistance. The potential function of this NS5A region should be investigated in further studies, e.g., in the new HCV replicon model.
In conclusion, we have shown with the help of randomly generated sequences that simple analysis of counts of mutations in different NS5A regions is an insufficient approach for identifying local accumulations of mutations correlated with treatment response. A new nonparametric binomial model was developed to overcome this problem. In addition to the present study of HCV isolates, this model and the proposed analytical method have potential implications for similar investigations in other viruses (i.e., human immunodeficiency virus, hepatitis B virus) (12, 38).
The detection of significant mutational differences in regions within the NS5A protein of virologic responders and nonresponders is improved by consideration of functional properties of amino acid exchanges and the predicted accessibility of amino acid residues within the secondary structure of the resulting protein. Based on the investigations of nonconserved and predicted exposed amino acid mutations, one region (aa 2350 to 2370) within NS5A in particular was identified that showed significant differences between virologic responders and nonresponders to antiviral therapy.
Present address: Klinik für Innere Medizin II, Universitätskliniken des Saarlandes, D-66421 Homburg/Saar, Germany. ![]()
|
|
|---|
2b plus ribavirin for 48 weeks or for 24 weeks versus interferon
2b plus placebo for 48 weeks for treatment of chronic infection with hepatitis C virus. Lancet 352:1426-1432.[CrossRef][Medline]
-2a in patients with chronic hepatitis C. N. Engl. J. Med. 343:1666-1672.
2a. Gastroenterology 120:1438-1447.[CrossRef][Medline]
. Hepatology 25:740-744.[CrossRef][Medline]
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»