Previous Article | Next Article ![]()
Journal of Virology, August 2007, p. 8211-8224, Vol. 81, No. 15
0022-538X/07/$08.00+0 doi:10.1128/JVI.00487-07
Copyright © 2007, American Society for Microbiology. All Rights Reserved.

Jia Li,4
Abdus Wahed,4
Milton W. Taylor,5
Steven H. Belle,4
Adrian M. Di Bisceglie,2,3
Rajeev Aurora,1
John E. Tavis,1,3* for the Virahep-C Study Group
Department of Molecular Microbiology and Immunology,1 Department of Internal Medicine,2 Saint Louis University Liver Center, Saint Louis University School of Medicine, 1402 S. Grand Blvd., Saint Louis, Missouri 63104,3 Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania,4 Department of Biology, Indiana University, Bloomington, Indiana5
Received 7 March 2007/ Accepted 11 May 2007
|
|
|---|
|
|
|---|
HCV is a hepatotropic Flavivirus (reviewed in reference 40). The virion contains a lipid envelope with two envelope proteins surrounding a capsid. Within the capsid is a positive-polarity RNA genome about 9,600 nucleotides long that contains an open reading frame (ORF) that encodes a polyprotein of
3,000 amino acids (Fig. 1). The structural proteins include the core protein that forms the capsid and the E1 and E2 surface glycoproteins. The nonstructural proteins include P7 (ion channel), NS2 (protease), NS3 (protease and helicase), NS4A (cofactor for NS3), NS4B (putative organizer of the viral replicase complex), NS5A (implicated in viral replication and pathogenesis), and NS5B (RNA polymerase). An 11th viral protein, the alternate reading frame (ARF) protein, is encoded in the +1 frame within the core region and is of unknown function (7).
|
View larger version (3K): [in a new window] |
FIG. 1. The HCV genome. The HCV genome contains a single major ORF flanked by untranslated regions (UTR). The 10 proteins encoded within the main ORF are indicated by alternate shading. The ARF protein is encoded in the +1 reading frame within the core region.
|
The currently recommended therapy for chronic HCV infection is a combination of pegylated alpha interferon (peginterferon) and ribavirin for 24 to 48 weeks. Interferon (or peginterferon) provides the primary antiviral effect and can eradicate HCV even when used alone (25, 38, 42, 50, 75). Ribavirin is ineffective by itself in eliminating viremia (6, 11, 13), but in combination with peginterferon it increases the clearance rate and decreases the risk of relapse (42, 50). Sustained viral response (SVR; undetectable HCV RNA for at least 24 weeks posttherapy) is the primary goal of therapy. SVR rates with peginterferon and ribavirin therapy differ between racial groups; most notably, African Americans (AA) respond to therapy only about half as well as Caucasian Americans (CA) and Asians (9, 27, 29, 44, 53).
The reasons that peginterferon and ribavirin therapy is effective in only about half of cases are unknown, as are the reasons for the exceptionally poor SVR rate for AA patients. Host factors such as differences in immune responses to the virus and environmental factors such as alcohol use and drug tolerance can contribute to failure of therapy (9). HCV genetic variation can have a major effect on efficacy of therapy, because patients infected with HCV genotype 1 have a much lower response rate (40% to 50%) than patients with genotype 2 or 3 (75% to 80%) (18, 21, 41, 63). However, the role of genetic variation within a genotype in response to therapy is less clear, because there is little evidence that response to therapy differs between subtypes of a given genotype (e.g., patients infected with genotype 1a and 1b both respond to therapy in 40% to 50% of cases, despite the subtypes differing by 10% to 12% at the amino acid level) (5, 9). This observation is equally consistent with the possibility that there is no contribution of the variation between 1a and 1b to response to therapy or that "sensitive" and "resistant" HCV variants are found with equal frequencies in genotypes 1a and 1b.
Therefore, to determine the role of HCV genotype 1 genetic variation between infected individuals and between CA and AA patients in responses to therapy, we conducted a comprehensive analysis of the association of HCV genetic variability in the viral ORF with outcome of therapy. This analysis was part of the Study of Viral Resistance to Antiviral Therapy of Chronic Hepatitis C (Virahep-C), a multicenter clinical study designed to assess rates of response of CA and AA patients to peginterferon and ribavirin therapy and to assess reasons for nonresponse (9).
|
|
|---|
75 kg; administered orally); treatment duration was 24 to 48 weeks for nonresponders and 48 weeks for responders. Serum HCV RNA levels were quantified using a COBAS AMPLICOR hepatitis C virus monitor test, version 2.0 (Roche Molecular Diagnostics). The primary outcome was the SVR. The SVR rate was higher among CA (52%) than AA (28%) patients, and this racial difference could not be explained by clinical factors associated with response such as age, gender, body weight, obesity, severity of the underlying hepatitis, pretreatment viral levels, or amount of drug taken. A full description of Virahep-C is provided in reference 9. Sequencing. Consensus sequences for the full HCV ORF were obtained by directly sequencing overlapping nested reverse transcriptase PCR amplicons as described previously (72). Briefly, HCV RNA was isolated from plasma and nested reverse transcriptase PCR was performed. Both strands of the amplified DNAs were sequenced using ABI dye-terminator technology; sequence depth averaged over four. Sequences were assembled using Vector NTI software, and base-calling errors were corrected following inspection of the chromatograms. Mixed-base positions due to the HCV quasispecies were resolved by identifying the predominant base at each position. The extreme 3' end of the ORF could not be amplified in 13 of the 94 samples; hence, the C-terminal 56 amino acids of NS5B were excluded from the analyses to ensure that all sequences were represented equally.
Sequence analyses. The genotype 1a and 1b samples were analyzed separately, because the sequence variation between the genotypes was anticipated to be larger than differences associated with the response class or race of the patient. Amino acid sequences were deduced from the nucleotide sequences, and all analyses were performed at the amino acid level.
Sequence alignments were done with ClustalW software. Positions that differed relative to the genotype 1a or 1b population consensus sequence were identified with Mutation Master software (70). The genotype 1a consensus sequence (data not shown) was derived from all 12 full-length ORFs in the Los Alamos National Laboratory and the European HCV databases (33) in April 2005 that were from different patients and from 5 genotype 1a ORFs that we sequenced using samples from members of non-Virahep-C cohorts. The genotype 1b population consensus sequence (data not shown) was generated from all 126 full-length ORFs from different patients in the Los Alamos and European HCV databases in January 2006. The known and predicted CD4+ and CD8+ T-cell epitope sequences were obtained from the HCV Immunology Database at the Los Alamos National Laboratory in March 2006 (73). The Shannon's entropy value (64) was calculated with Bioedit software (23). Phylogenetic trees were generated using the neighbor-joining algorithm in PHYLIP software (16). The mean genetic distance value for each group was calculated using the p-distance algorithm in the MEGA DNA analysis package (34).
Statistical analyses.
Race and sex distributions were compared across response groups by use of Pearson's chi-square test for association. Normally distributed characteristics were compared across response groups by use of analysis of variance, whereas the Kruskal-Wallis nonparametric test was used when the distributions were not normally distributed. Fisher's exact test was used to compare the proportions of unique variations between marked- and poor-response groups and between CA and AA patients. Unique variations across all three response groups were compared using the Freeman-Halton exact test. To determine whether there were racial or response differences in the number of variations (unique or total) or whether any response relationship to the number of variations differed by race, we used Poisson regression models with the number of variations as outcome and race, response groups, and their interactions as independent variables. The likelihood ratio test was used to assess statistical significance of the race and response interaction. Shannon's entropy values were compared between races and response groups separately using the Mann-Whitney rank sum test, and the average genetic distances between the groups were compared using an independent-sample t test. The level of significance (
) for statistical significance was 0.01. Statistical analyses were carried out using SAS software (SAS Institute, Inc.) or SPSS version 13.0 (SPSS, Inc.).
Nucleotide sequence accession numbers. The sequences determined in this work have been deposited in GenBank (EF407411 to EF407504).
|
|
|---|
The baseline characteristics of these 94 patients are shown in Table 1. There were no significant (P
0.05) differences among the three response groups except in baseline HCV RNA levels, platelet counts, and alpha-fetoprotein (AFP) levels. The lower platelet counts and higher AFP levels reflect slightly more severe liver disease, which is known to affect response to therapy, as do lower serum HCV RNA levels at baseline (17, 63). Lower pretreatment HCV titers in responders were also seen in the whole Virahep-C cohort (9), and although viral titer differences were associated with response to therapy, they did not account for the lower response rate in the AA patients in the full Virahep-C cohort.
|
View this table: [in a new window] |
TABLE 1. Baseline characteristics of patients in the Virahep-C viral genetics study
|
Directly comparing the consensus sequence determined by the method employed here with a near full-length quasispecies analysis of variants found in the same patient sample explicitly demonstrated that the consensus sequence was representative of the common quasispecies variants (76). The average amino acid difference between the consensus sequence and the individual quasispecies variants was 1.8%, which was smaller than the average distance between the quasispecies variants themselves (2.3%). Phylogenetic analyses revealed that the consensus sequence was either in the center or near the base of the phylogenetic tree for each HCV gene.
Lack of clustering by response or race class. We first tested the hypothesis that distinct viral strains were found in marked compared to poor responders or in AA compared to CA patients. We generated phylogeny trees from alignments of all 32 1a or 31 1b marked- and poor-responder amino acid sequences (intermediate responders were excluded to focus on the biological extremes of response), and similar trees were generated for race categories (AA versus CA) for all samples. No clustering of sequences by day 28 response or by race was found for any viral gene for either genotype 1a or 1b (data not shown). Therefore, the response and racial groups were not infected with distinctly different HCV strains.
Few amino acid positions differ consistently by response or race group.
To determine whether any amino acid positions differed consistently between the response and race groups, we aligned the sequences by response or race class and identified positions that were conserved in at least 60% of the sequences in a given group. Eight positions at which the respective biological classes had different conserved residues were found (Table 2). In all cases, the prevalence of the dominant amino acid in the alignment was
60% to
70%, and in all but one case, the predominant amino acid in the alignment of the other biological group was the second most prevalent amino acid. These variations were largely conservative, with the biggest differences occurring at S or P at position 401 for genotype 1a NS5A by race. The conservative nature of these variations and frequent presence of the most prevalent residue from the other biological class indicates that the variations were unlikely to play a dominant role in causing the limited reduction in viral titers in the poor responders or the low rate of response to therapy in the AA patients.
|
View this table: [in a new window] |
TABLE 2. Positions at which the race or response groups are conserved at 60% but at which the marked-responder and poor-responder or CA and AA conserved residues are different
|
Each Virahep-C sequence was compared to the 1a or 1b reference sequence, positions of nonidentity were identified, and the numbers of variations were compared by response group. The total number of variations was generally higher in the marked than in the poor responders for both genotype 1a and genotype 1b, but only sequence variations in genotype 1a NS3 and genotype 1b NS2 and NS3 were significant at the P
0.01 level by a Poisson analysis. Next, to focus on differences that may be more strongly associated with response to therapy, we eliminated variations common to both the marked- and poor-responder groups because they were likely to be neutral. The number of unique variations in the sequences from the marked responder patients was generally higher than in those from the poor responders; these differences were statistically significant for genotype 1a E2, NS3, and NS5A (Fig. 2A) and genotype 1b core, E2, NS2, and NS3 (Fig. 2B).
![]() View larger version (26K): [in a new window] |
FIG. 2. Number of unique variations per sample by response. The numbers of variations relative to a population consensus that were unique to either the marked or poor responders are shown for the polyprotein (PP) and for each gene within the polyprotein. Statistical significance is shown for genes with P 0.01. (A) Genotype 1a. (B) Genotype 1b.
|
![]() View larger version (23K): [in a new window] |
FIG. 3. Sum of Shannon's entropy values by response. The marked- and poor-responder sequences were aligned separately, Shannon's entropy was determined for each position in the two alignments, and the entropy values were summed for each gene. Statistical significance between values for the marked- and poor-responder samples is indicated for genes where P 0.01. PP, polyprotein. (A) Genotype 1a. (B) Genotype 1b.
|
|
View this table: [in a new window] |
TABLE 3. Number of focal entropy differences between marked and poor responders compared by response and in all samples compared by race
|
![]() View larger version (22K): [in a new window] |
FIG. 4. Average genetic distances by response. The marked- and poor-responder sequences were aligned separately, the genetic distance between each pair of sequences in the alignments was calculated using the p-distance algorithm, and the average genetic distance for each protein in the alignment was plotted. The statistical significance of the difference between the groups is indicated for those genes where P 0.01. (A) Genotype 1. (B) Genotype 1b.
|
![]() View larger version (15K): [in a new window] |
FIG. 5. Numbers of unique variations per sample for all three response classes. The numbers of variations relative to a population consensus that were unique to the marked, intermediate, or poor-responder sequences are shown for the full genotype 1a and 1b polyproteins.
|
0.01. When variations unique to either the AA or CA sequences were considered to minimize neutral genetic variation, no significant differences were observed for genotype 1a (Fig. 6A). For genotype 1b the number of variations was significantly higher for CA patients in E1, E2, NS2, and NS5B (Fig. 6B).
![]() View larger version (23K): [in a new window] |
FIG. 6. Numbers of unique variations per sample by race. The numbers of variations relative to a population consensus that were unique to either the AA or CA sequences are shown for the polyprotein (PP) and for each gene within the polyprotein. Statistical significance is shown for genes with P 0.01. (A) Genotype 1a. (B) Genotype 1b.
|
![]() View larger version (24K): [in a new window] |
FIG. 7. Sum of Shannon's entropy values by race. The AA and CA sequences were aligned separately, Shannon's entropy was determined for each position in the two alignments, and the entropy values were summed for each gene. Statistical significance between values for the AA and CA samples is indicated for genes where P 0.01. (A) Genotype 1a. (B) Genotype 1b.
|
![]() View larger version (21K): [in a new window] |
FIG. 8. Average genetic distances by race. The AA and CA sequences were aligned separately, the genetic distance between each pair of sequences in each of the two alignments was calculated using the p-distance algorithm, and the average genetic distance for each protein in the alignment was plotted. The significance of the difference between the groups was determined using an independent sample t test and is indicated for those genes where P 0.01. (A) Genotype 1. (B) Genotype 1b.
|
0.01, but those for NS5B approached significance (P = 0.014), with the marked-responder CA samples being more diverse than the marked-responder AA samples. For genotype 1b, only NS4A results achieved significance, with the marked-responder CA samples having fewer variations than the marked-responder AA samples. When this analysis was limited to variations unique to either the marked- or poor-responder samples to minimize neutral genetic variation, genotype 1a NS5B results approached significance (CA marked responders > AA marked responders), and for genotype 1b E1 the CA poor-responder samples were more diverse than the AA poor-responder samples (Fig. 9). Finally, we stratified the marked-responder and poor-responder samples by both response and race and identified variations that were unique to each of the four race/response groups. Using a log-likelihood ratio test to determine whether the variations were distributed differently among the groups, we found no significant differences for either genotype 1a or 1b in any of the genes (data not shown). Therefore, the effect of race on variations associated with response to therapy was relatively small. However, we may have underestimated the effect of race on variation associated with response because the power of this analysis was lower than when race and response were assessed separately.
![]() View larger version (28K): [in a new window] |
FIG. 9. Effect of race on genetic variation associated with response to therapy. Variations relative to a population consensus that were unique to the marked- and poor-responder samples were divided by race, and a log-likelihood test was performed to determine whether race affected the diversity associated with response to therapy. Statistical significance is shown for domains with P 0.01; the upper P value is for the log-likelihood interaction of race and response, and the lower P value is for the Poisson analysis of the indicated response groups. M, marked; P, poor; PP, polyprotein. (A) Genotype 1a. (B) Genotype 1b.
|
![]() View larger version (23K): [in a new window] |
FIG. 10. Numbers of unique variations in the nonepitope regions by response. Variations relative to the population consensus sequence that were not in any known or predicted T-cell epitope were compared in the marked- and poor-responder samples. Statistical significance is indicated where P 0.01. PP, polyprotein. (A) Genotype 1a. (B) Genotype 1b.
|
![]() View larger version (21K): [in a new window] |
FIG. 11. Numbers of variations in NS3 protease and helicase domains by response. Variations relative to a population consensus sequence in the NS3 protease and helicase domains were compared in the marked- and poor-responder samples. Statistical significance is shown for domains with P 0.01. (A) All variations in genotype 1a NS3 domains. (B) Variations unique to the marked- or poor-responder sequences for genotype 1a NS3 domains. (C) All variations in genotype 1b NS3 domains. (D) Variations unique to the marked- or poor-responder sequences for genotype 1b NS3 domains.
|
0.01 (data not shown). However, genotype 1a marked-responder sequences had a significantly higher proportion of unique variations relative to total variations than the poor-responder sequences for domain I, domain III, and V3 by Fisher's exact test (data not shown). Finally, the ISDR has been widely studied as a potential genetic modulator of interferon sensitivity (47, 71). In this study, higher variation in the ISDR/PKR binding site correlated with response to therapy only for genotype 1a CA patients (Fig. 12).
![]() View larger version (20K): [in a new window] |
FIG. 12. Variations in the genotype 1a NS5A ISDR/PKR binding domain. Sequences encompassing the ISDR/PKR binding site in NS5A from CA and AA marked (M) and poor (P) responders were aligned relative to the genotype 1a population consensus sequence (Ref). Positions where the Virahep-C sequences differ from the reference sequence are indicated. Numbers at the top indicate amino acid positions in NS5A.
|
|
|
|---|
10% to
12% at the amino acid level; hence, isolates from different genotypes or subtypes may not be affected the same way at the molecular level by the drugs even when overall response rates to therapy are similar. Finally, all HCV isolates from treated individuals are wild-type strains that established chronic infections; hence, stark differences between isolates are not expected in parameters essential for viral replication or immune evasion. Rather, differences in susceptibility to therapy between isolates from different individuals are likely to be due to quantitative differences in viral functions, with some isolates being somewhat more sensitive than others. Therefore, variations at simple genetic motifs are unlikely to correlate highly with response. Consistent with this prediction, we found only three positions at which variation showed
60% correlation with response to therapy but found many diversity differences that were highly correlated with day 28 virological response. The diversity data summarized in Table 4 led to three conclusions. First, there were strong associations between response to therapy and genetic diversity of the consensus sequences from infected individuals for a few viral genes but little to no association for the other genes. Higher variation was found by multiple analytical methods in sequences from the marked compared to poor responders in genotype 1a NS3 and NS5A and in genotype 1b core and NS3. Associations with response to therapy were also seen with E1, E2, and NS2, depending on the analytical method used. Second, the patterns of association between genetic diversity and response to therapy were not the same for genotypes 1a and 1b. This implies that the two genotypes may be affected by therapy through somewhat different molecular mechanisms. Finally, diversity differences correlating with the race of the patient were seen only for genotype 1b.
|
View this table: [in a new window] |
TABLE 4. Summary of HCV genetic diversity differences by response and race
|
A possible confounding issue concerning the relevance of pretherapy HCV sequences to the outcome of therapy is that HCV replicates as a quasispecies and hence can evolve rapidly in response to the selective pressures induced by antiviral therapy. Viral evolution during the first month of therapy could skew our results by misclassifying sensitive HCV sequences into the poor-responder group or resistant sequences into the marked-responder group. Misclassifying resistant sequences into the marked-responder group was not a major problem in this study because 28 of the 31 marked responders achieved SVR; hence, the large majority of HCV quasispecies variants in the marked responders were sensitive to therapy. It is possible that some of the pretreatment sequences in the poor-responder group were sensitive to the drugs and were rapidly supplanted by resistant strains during therapy. To evaluate the degree of viral evolution during therapy, we are sequencing the full ORFs at 24 weeks posttherapy from the 1a patients who did not achieve SVR. Posttherapy sequence data have been obtained for the full NS3 gene and for NS5A amino acids 1 to 328 (of 448) from all 24 1a non-SVR patients for whom a follow-up week 24 blood sample is available. In NS3 the median numbers of variations relative to the reference sequence were 18 prior to therapy and 19 after therapy. When the pre- and posttherapy NS3 sequences from the same patient were compared, the median number of amino acid changes per patient was only 1. In the first 328 amino acids of NS5A, the median number of total variations relative to the reference sequence in both the pre- and posttherapy sequences was 14, and the median number of amino acid changes between the pre- and posttherapy sequences from the same patient was again just 1. Therefore, the pre- and posttherapy sequences were very similar in NS3 and NS5A, the two genotype 1a genes in which high pretherapy diversity correlated with suppression of viremia. This indicates that viral evolution during therapy does not alter the observation that higher sequence diversity correlates with successful suppression of HCV during therapy.
Alpha interferon causes the large majority of the decline in viral titers by day 28 during combination therapy (12); hence, our marked- and poor-response categories were defined primarily by the response to peginterferon. Alpha interferon is a key player in the innate immune system, and HCV has evolved strategies to inhibit its effects (reviewed in references 22, 26, 48, 51, 54, and 60). Therefore, the variable sensitivity to peginterferon of HCV isolates from different individuals is likely due to variations in the sequences and functions of viral proteins that dampen host antiviral responses. The strongest correlations between response to therapy and viral variation were in NS3 and NS5A for genotype 1a and in core and NS3 for 1b. Importantly, all of these genes can actively block the action of alpha interferon in vitro (reviewed in reference 19). Therefore, the genetic data reported here provide strong support for a role for core, NS3, and NS5A (and, to a lesser extent, E2) in antagonizing the type 1 interferon response induced pharmacologically during treatment of HCV infections in humans.
Core residues R70 and M91 have been associated with response to therapy (2), but few other studies have examined the role of diversity in the core in the outcome of therapy. We observed a similar association of R70 with marked response for genotype 1b but not 1a; however, M91 was highly dominant in both the marked- and poor-responder sequences. E2 has been more thoroughly studied, because it has a PKR-phosphorylation homology domain (PePHD) which inhibits PKR in vitro (65). Increased diversity in the PePHD has been correlated with response to therapy in studies of patients infected with genotype 3 (58) but not in other studies that included genotypes 1 to 3 (1, 4, 49, 56, 57). We found that by some measures, E2 from marked responders was more diverse than E2 from poor responders for genotype 1a, and this difference approached significance for genotype 1b. However, these differences clustered in the N-terminal third of E2, and there were almost no variations among the isolates in the PePHD for either genotype 1a or 1b.
We observed a strong correlation between HCV diversity in NS3 between infected individuals and response to therapy, and to our knowledge variation in NS3 has not previously been associated with response to alpha interferon-based therapy in humans. The NS3 protease can block induction of type 1 interferon in response in vitro by cleaving signal transduction molecules downstream of the double-stranded RNA sensors Rig-I and TLR3 (reviewed in reference 30). In our sequences, diversity differences in NS3 correlating with response to therapy mapped exclusively to the helicase domain rather than the protease domain (Fig. 11). This implies that variation in NS3 correlating with response to therapy may affect the helicase activity rather than the protease activity or that the helicase domain may modulate the activity of the protease domain allosterically; interdomain communication within NS3 has been reported previously (32). These results also raise the possibility that variability in the viral replication rate modulated by variation in helicase activity may correlate with sensitivity to antiviral therapy and/or that the helicase may limit accumulation of double-stranded RNA that can trigger innate immune responses.
We found significant differences between viral sequences from marked and poor responders in NS5A, but only in genotype 1a. The functions of NS5A during HCV infection are unknown, but NS5A can attenuate the type 1 interferon response in vitro (19, 24, 39). Variation within a short region of NS5A termed the ISDR (15) has been associated with interferon sensitivity during antiviral therapy in some populations (47, 71). We observed higher variability in the ISDR of marked responders relative to the poor responders, but only in genotype 1a CA patients (Fig. 12). The role of genetic variation in NS5A outside the ISDR or the overlapping PKR binding site in response to therapy has been investigated in relatively few studies (14, 36, 46, 69). These studies found greater diversity in responders to alpha interferon-based therapy, especially in the carboxy-terminal half of the protein which includes the V3 hypervariable region (28). The Virahep-C samples were more diverse in the marked responders primarily in the carboxy-terminal half of the protein, including the V3 region (data not shown), so our results agree with previous studies of the full NS5A gene. Because most of the variability associated with response in NS5A in our samples was outside the ISDR/PKR binding site, variation in the ISDR may have modulated sensitivity to therapy in some patients, but this is not the only contribution that diversity in NS5A could have made to response of HCV to therapy.
Two recent studies examined diversity differences in E2 and NS5A between CA and AA patients (31, 36). In E2, sequences from CA patients were more diverse than those from AA patients for both genotype 1a and genotype 1b. This diversity was very significant in genotype 1b, with the majority of the difference clustering in the middle third of the protein. In NS5A, higher viral V3 variability was found for CA than for AA patients, and this increased variability correlated with response to therapy (36). We found a trend to higher diversity in the CA compared to the AA sequences throughout the viral ORF. For genotype 1a, these differences did not achieve statistical significance for any gene, but for 1b significant differences in E1, NS2, and NS5B were found by at least three analytical methods. The association between diversity and race for NS2 is novel, but the implications of this association are not clear because the role(s) of NS2 in HCV biology is not well known.
This study is the most comprehensive analysis to date of HCV genetic variation in the AA population. Our data do not support the idea that unusual genotype 1a HCV strains infecting AA patients exist, but there may be minor strain differences for genotype 1b between the AA and CA populations that could contribute to the poor response of AA patients to therapy. However, this equivocal conclusion does not preclude a role for viral diversity contributing to the unusually poor response of AA patients to therapy. We selected equal numbers of marked, intermediate, and poor responders in the AA and CA populations for sequence analysis, despite significantly lower response rates to therapy for the Virahep-C AA patients (28% versus 52% SVR rates) (9). The genetic similarity of HCV strains in the AA and CA patients across the three response classes and the intermingling of the AA and CA sequences in phylogenetic analyses (data not shown) indicate that very similar HCV strains circulate in the two racial groups; hence, the higher proportion of poor responders in the AA population implies that a higher proportion of relatively resistant HCV variants may circulate in the AA population, even for genotype 1a. Therefore, viral sequence variation may contribute to the unusually low response of AA patients to therapy, but the mechanism would be circulation in the AA population of a higher proportion of the same resistant strains found in the CA population rather than circulation of novel resistant strains that are absent from the CA population. Genotype 1b is more common in the AA population than in the CA population (53), so there is a precedent for differential circulation of HCV strains in the two racial groups.
The goal of antiviral therapy for HCV is SVR, the clearance of detectable viral RNA from serum for at least 6 months posttherapy. Large declines in viral titers by day 28 (such as occurred in the marked responders) correlate well with SVR (17, 43), with a major confounding factor being the failure or inability of the patients to take sufficient drug amounts during the demanding 48-week treatment regimen. A complete analysis of how viral genetic diversity is related to SVR and other clinical variables will be presented elsewhere, but we have performed an initial characterization of how pretreatment HCV genetic diversity correlates with SVR. The pretherapy sequences were stratified as SVR or non-SVR, and then the total and unique numbers of variations relative to the population consensus reference sequence, the total Shannon's entropy values, and the mean genetic distances in the two groups were compared. For both genotype 1a and genotype 1b, the polyproteins of the SVR sequences were significantly more diverse than those of non-SVR samples at P
0.01. For genotype 1a, NS3 and NS5A were significantly more diverse in the SVR samples by at least three of the four analytical methods. For genotype 1b, core was significantly more diverse in SVR patients by all four measures, and E2, NS2, and NS3 were more diverse by three of the four measures. Therefore, our basic observation that pretherapy sequence diversity was higher in the day 28 marked responders than in the poor responders for a few select viral genes is also true when the sequences are analyzed with respect to the ultimate goal of therapy, SVR.
Taking the data together, the focal distribution of viral genetic diversity at the patient level correlating with response to therapy and the inability of immune selection to account for all of this diversity lend strong support to the hypothesis that HCV genetic variation modulates efficacy of antiviral therapy in human patients. Because most of the reduction in viral levels at day 28 was due to peginterferon therapy, these data imply that the genes for which diversity differences correlate with early response to therapy function at least in part to blunt the type 1 interferon response in humans. This is consistent with the in vitro immune evasion activities reported for core, NS3, and NS5A (19). The consistently higher diversity in isolates from the marked responders (for whom the virus failed to counteract the drugs) compared to the poor responders (for whom the drugs were ineffective) implies that the functions of the proteins which counteract the effects of therapy can be impaired by amino acid differences at multiple locations. In essence, diversity would correlate with clearance because there are only a few ways to optimize activity of the viral proteins but many ways to interfere with their function.
Virahep-C was funded by the National Institute of Diabetes and Digestive and Kidney Diseases and the National Cancer Institute with further support under a Cooperative Research and Development Agreement with Roche Laboratories, Inc. Funding for this viral genetics project was through U01 DK60345; other grants supporting Virahep-C were U01 DK60329, U01 DK60340, U01 DK60324, U01 DK60344, U01 DK60327, U01 DK60335, U01 DK60352, U01 DK60342, U01 DK60309, U01 DK60346, U01 DK60349, and U01 DK60341. Support through the National Center for Research Resources General Clinical Research Centers Program was through grants M01 RR00645, M02 RR000079, M01 RR16500, M01 RR000042, and M01 RR00046.
Published ahead of print on 23 May 2007. ![]()
Present address: College of Medicine, University of Iowa, Iowa City, IA. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»