Previous Article | Next Article ![]()
Journal of Virology, March 2009, p. 2645-2654, Vol. 83, No. 6
0022-538X/09/$08.00+0 doi:10.1128/JVI.01556-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
,
UCL/MRC Centre for Medical Molecular Virology, University College London, Windeyer Building, 46 Cleveland Street, London W1T 4JF, United Kingdom,1 Division of Infection Diseases, Stanford University, Stanford, California 94305,2 Medical Research Centre Clinical Trials Unit, London NW1 2DA, United Kingdom,3 Centre for Infections, Health Protection Agency, 61 Colindale Avenue, London NW9 5EQ, United Kingdom4
Received 23 July 2008/ Accepted 24 December 2008
|
|
|---|
|
|
|---|
Between 50 and 70% of treated patients with virological rebound harbor some form of drug-resistant virus (17). This has two consequences. First, some resistance-associated mutations cause cross-resistance to other drugs within the same class and future drug options become limited (19). Second, since high levels of plasma viremia are associated with infectivity (32, 35) these mutants can be transmitted to other patients (4, 33, 44). Transmitted drug resistance has reached between 5 and 10% in areas of the world with access to treatment (41, 45, 51), compromising response to first-line therapy (25). It is assumed that such transmission reflects direct infection from drug-experienced individuals. It follows that recent improvements in the treatment of HIV infections, with higher rates of viral suppression (31), will lead to reductions in transmitted resistance. Indeed, such reductions have been recently reported (45).
The extent to which acquired resistance persists in the infected population is unclear. It is generally accepted that most DRMs are associated with a fitness cost (11, 16). Thus, wild-type viruses commonly reemerge from archived reservoirs in treated patients who stop therapy following the emergence of drug resistance (7, 21). By contrast, transmitted resistance appears to be more long-lived in plasma virus even in the absence of treatment (24). Since the probability for a mutation to be transmitted is positively correlated with its persistence in a viral population, resistant polymorphisms fixed during the early stage of the disease have an increased chance to spread within the community. This can lead to the establishment of reservoirs of resistance among new HIV infections, independent of treatment. As up to 20 to 40% of infected individuals remain undiagnosed in the United Kingdom (18) and transmission events often occur around the time of primary infection, long before antiretroviral therapy is instigated (32), the potential for such reservoirs is a major concern.
In addition to traditional epidemiological approaches, phylogenetic analyses of viral gene sequences have substantially broadened our understanding of the role played by newly infected individuals in HIV epidemics (1, 15, 23, 30). Importantly, the contribution of untreated patients to the spread of antiretroviral resistance has not been thoroughly addressed. Moreover, the extent to which reservoirs of resistant HIV mutants may persist in a drug-naïve population in the absence of treatment has yet to be defined.
Here, we sought to identify potential lineages of resistant viruses circulating in drug-naïve patients, as evidence of treatment-independent reservoirs of resistance. Through the phylogenetic analysis of the largest United Kingdom database of HIV-1 sequences, we have identified drug-naïve resistant HIV-1 strains and characterized the population dynamics of these lineages. The demonstration of such a phenomenon raises concern for long-term effectiveness of antiretroviral therapies, both in resource-rich countries and in those areas of the world now rolling out therapy to those in need.
|
|
|---|
The sequences were generated in clinical virology and commercial laboratories serving United Kingdom clinical centers. The sequences used included the entire protease region (297 bp) and a partial reverse transcriptase region (1,023 bp) of the HIV genome. Subtypes and resistance profiles were established by electronic submission to the Rega subtyping algorithm (6) and the Stanford HIV Drug Resistance Database (http://cpr.stanford.edu/). The cohort included sequences of subtypes A (n = 788), B (n = 8,850), C (n = 2,720), D (n = 233), F (n = 42), G (n = 189), CRF01_AE (n = 157), and CRF02_AG (n = 355), as well as complex recombinant forms (n = 727).
Phylogenetic reconstruction and resistance mapping.
The sequences were divided per subtype and manually aligned using the sequence editor Se-Al v2.0 (36). Because of the size of the data set, the phylogenies of clade A, B, and C sequences were reconstructed by neighbor-joining inference, under the General Time Reversible model of nucleotide substitution with proportion of invariable sites and gamma-distributed rate heterogeneity (GTR + I +
). For smaller data sets (i.e., subtypes D, F, G, CRF01_AE, and CRF02_AG), maximum likelihood trees were inferred under a similar model. All phylogenies were reconstructed using the software Paup* 4.0 beta 10 (43).
In order to identify drug resistance mutations sharing common ancestry in the data set, the ancestral states of resistance-associated codon positions were reconstructed at the internal nodes of the trees. A total of 37 codons (30, 32, 33, 46, 47, 48, 50, 54, 76, 82, 84, 88, and 90 in PR, and 41, 62, 65, 67, 70, 74, 75, 77, 100, 103, 106, 108, 115, 116, 151, 181, 184, 188, 190, 210, 215, 219, 225, and 236 in RT) were investigated, according to the International AIDS Society USA Drug Resistance Mutations Group guidelines (http://www.iasusa.org). These were positions known to be associated with high-level or primary resistance to protease inhibitors (PIs), nucleoside reverse transcriptase inhibitors (NRTIs), and nonnucleoside reverse transcriptase inhibitors (NNRTIs). An initial ancestral reconstruction was performed using a maximum parsimony approach (12) with the software McClade version 4.07 (26). Parsimony reconstruction methods determine the ancestral state that minimizes the number of mutational changes along a phylogenetic tree, given an observed distribution of characters (here, the sequence alignment).
To characterize virus lineages harboring transmitted drug resistance mutations in epidemiologically linked drug-naïve individuals, we looked for resistance-associated polymorphisms present at internal nodes that linked patients with no known treatment history. For each tree, the phylogenetic clustering of three or more sequences from drug-naïve patients sharing at least one resistance-associated polymorphism was considered evidence for the existence of a drug-resistant viral lineage.
Confirmation of resistant viral lineages.
When potential drug-naïve resistant lineages were identified, the reliability of the clusters was confirmed on smaller data sets by using more robust phylogenetic methods. First, the sequences of interest were compared to a set of 100 random sequences from either treated or untreated patients using Bayesian Markov chain Monte Carlo (MCMC) inference. Bayesian posterior trees were generated with the software BEAST version 1.4.6 (8), under the GTR +
model of evolution and assuming a relaxed molecular clock. The MCMC search was set to 100,000,000 iterations and the trees were sampled every 1,000th generation after a 10% burn-in. The maximum clade credibility tree was selected from the sampled posterior trees with the program TreeAnnotator (http://beast.bio.ed.ac.uk/).
Second, the effect of convergent evolution on our phylogenetic reconstructions was assessed by repeating the analyses (i) after removing from the alignment 59 codon positions known to be associated with drug resistance (http://www.iasusa.org), leaving 1,143 bp, and (ii) taking into account third-codon positions only (440 nucleotides). The MCMC search was set to 100,000,000 iterations, and trees were sampled every 1,000th generation after a 10% burn-in. Each cluster was also independently compared to the 100 sequences most closely related to those of interest in the original neighbor-joining tree, and their respective robustness was assessed by bootstrap analyses under the two conditions listed above. Bootstrap scores were obtained on the basis of 1,000 neighbor-joining replicates.
Finally, the reconstruction of the clusters' ancestral states was confirmed by maximum likelihood reconstruction, using the marginal reconstruction approach implemented by the program codeml of the PaML package version 3.14a (49). This approach compares the probabilities of different character assignments to an internal node at a site and selects the character that has the highest posterior probability (50). Maximum likelihood reconstructions were performed under the Whelan and Goldman empirical model of protein evolution (48).
Timing of the emergence of drug-resistant lineages. The time of the most recent common ancestor of each drug-resistant lineage, corresponding to the date at which resistance mutations occurred in the clusters, was estimated using the Bayesian MCMC approach implemented in BEAST. For computational reasons, the data were downsized to 100 sequences isolated between 1997 and 2006 from both drug-naïve and -experienced patients. Estimates were sampled every 1,000th generation from an MCMC search of 10,000,000 iterations under two different molecular clock models: a strict molecular clock model and an uncorrelated lognormal relaxed clock model. Bayesian skyline plots were used as coalescent tree priors. The model with the best fit to the data was used for interpretation, as determined by the calculation of the models' Bayes factor (BF). The Bayes factor corresponds to the ratio of the marginal likelihood of the models tested, and a difference in the BF above 20 strongly supports the favored model. The BF of the models was calculated using a script available at http://code.google.com/p/beast-mcmc/ (42).
Nucleotide sequence accession numbers. The sequences used in the study were registered with GenBank under the accession numbers EU817047 to EU817091.
|
|
|---|
![]() View larger version (14K): [in a new window] |
FIG. 1. Schematic distribution of DRMs found in the reconstructed HIV-1 pol gene phylogenies. Circles and squares at the tips of the branches represent drug-naïve and -experienced patients, respectively. Sequences with drug resistance mutations are indicated in red. Three distribution patterns were distinguished: sporadic acquisition of DRMs (a), treatment-dependent transmission of DRMs (b), and treatment-independent transmission of DRMs (c). Lineages of drug-resistant viruses circulating among newly infected individuals were defined as clusters of resistant viruses that included three or more drug-naïve individuals (c).
|
![]() View larger version (17K): [in a new window] |
FIG. 2. Phylogenies of the five resistant HIV-1 clusters circulating among drug-naïve individuals. The clusters were extracted from a Bayesian MCMC maximum clade credibility tree generated under the GTR + model of evolution and a relaxed molecular clock. Branch lengths are expressed in nucleotide substitutions per site, and Bayesian posterior probabilities are shown when above 0.5. Terminal branches are labeled with the resistance mutations found in the respective sequences. Circles and squares symbolize drug-naïve and -experienced patients, respectively. Labels at internal nodes or tips of the trees correspond to the branches listed in Table 2.
|
|
View this table: [in a new window] |
TABLE 1. Details of the five treatment-independent drug-resistant lineages circulating in the United Kingdom
|
![]() View larger version (18K): [in a new window] |
FIG. 3. Bayesian MCMC phylogenetic trees of the resistant lineages compared to 100 HIV-1 pol sequences after exclusion of 59 codon positions associated with drug resistance (a) or based on third-codon positions only (b). Branch lengths are expressed as the number of nucleotide substitutions per site. Boxes indicate the position of the five resistant clusters in the trees. Nodes with a posterior probability of 1.00 are labeled with a star.
|
|
View this table: [in a new window] |
TABLE 2. Maximum likelihood reconstruction of the five lineages' ancestral states at drug-resistant codon positions
|
Cluster A included viruses sampled from 22 drug-naïve patients between 1998 and 2005, 19 of whom harbored viruses with the K219Q resistance genotype in RT. One virus of the cluster exhibited a drug-sensitive genotype at position 219 (i.e., s3), while the replacement of a glycine with an arginine (CAA to CGA) at the same position led to the independent loss of the resistance genotype in two patients (s7 and s10) (Table 2). Cluster B included 13 viruses sampled between 2000 and 2005. Two of the patients (s23 and s24) had experienced antiretroviral treatment. This lineage also showed evidence of sporadic reversions to wild type. The L90M resistance genotype was lost along an internal branch of the cluster by the effect of a single nucleotide substitution (ATC to CTG) (Table 2) before transmission to three individuals (Fig. 2). Three of the four DRMs found in cluster C (i.e., G48V, I54T, and V82A in PR) (Table 1) were absent in the earliest sequence of the cluster. The branch separating this subcluster from the oldest sample of the lineage (i.e., s36) is suggestive of a longer interval between transmission events or of transmission through individuals not present in the data set. Maximum likelihood ancestral reconstructions indicated that the G48V, I54T, and V82A mutations reached fixation during that time period (Table 2), either following the treatment of patient s36 before transmission to the other patients or in treated individuals missing from the cohort. In contrast, the DRMs conferring resistance to NRTIs (D67N and K219Q in RT) were common to all four patients, indicating a more ancient fixation in the lineage. Clusters D and E represented the smallest clusters identified, each involving three patients sampled during the year 2005. Cluster E also showed evidence of reversion from drug resistance to wild type. The I54V mutation was absent in the viruses from patient s43, while it was found in the two adjacent isolates (Fig. 2).
All five clusters also harbored secondary or accessory mutations associated with resistance to either protease or reverse transcriptase inhibitors (see Table S1 in the supplemental material). Secondary mutations are protease polymorphisms of varying frequency that, by themselves, may not have a significant impact on phenotype but rather improve the replicative fitness of resistant viruses (19). Such polymorphisms were found in the three clusters showing genotypic resistance to protease inhibitors, i.e., clusters B, C, and E. Likewise, accessory mutations associated with high-level resistance to reverse transcriptase inhibitors (3) were found in the four clusters with resistance to the latter class of drugs (i.e., clusters A, C, D, and E). It is possible that these mutations allow key drug resistance changes to persist with limited fitness cost.
Timing of the emergence of drug-resistant lineages. The time of origin of the resistant lineages was determined using a Bayesian MCMC approach under strict and relaxed molecular clock models of evolution. The Bayes factors for the two models strongly supported the relaxed clock over the strict model (log BF, 102.59), indicating a significantly better fit of the estimates obtained under the former model. Assuming a relaxed molecular clock, the viral gene under study has evolved at a rate of 2.6 x 10–3 substitutions/site/year (95% confidence interval, 1.9 x 10–3 to 3.3 x 10–3), with a coefficient of variation in substitution rate of 0.31 (95% confidence interval, 0.25 to 0.37). Estimates of the most recent dates at which all DRMs found in the clusters were fixed in the lineages are shown in Fig. 4a. According to these estimates, the five lineages originated between 1997 (95% highest posterior density, 1995 to 1998 [cluster A]) and 2003 (95% highest probability distribution, 2001 to 2004 [cluster E]). Clusters B, C, and D have persisted since 1998 (95% highest posterior density, 1997 to 1999), 1999 (95% highest posterior density, 1997 to 2001), and 2001 (95% highest posterior density, 1998 to 2003), respectively. The dated phylogenies of the lineages obtained under the relaxed clock model of evolution are shown in Fig. 4b. These indicate that the resistant genotypes found in the lineages have individually persisted in the United Kingdom population for an average of 5 years. The resistant lineage represented by cluster A was the oldest and has been circulating in the United Kingdom for about 8 years. Since the viral gene sequences used for the analysis represent a single patient, the internal nodes of the phylogenies reflect at least one transmission event. The internal structure of the clusters scaled in calendar years is therefore informative of the transmission histories of the clusters (23). Clusters A and B exhibit an even distribution of transmission events (i.e., of internal nodes) over 8 and 7 years, respectively. By contrast, cluster C shows a contemporary expansion of the transmission network it represents, with three out of four individuals being infected in 2004. Likewise, the three patients of cluster E seem to have been infected within a short period of time during the year 2003.
![]() View larger version (12K): [in a new window] |
FIG. 4. (a) Estimated time of origin of the five resistant clusters circulating among drug-naïve individuals. The time of the most recent common ancestor of each cluster was estimated under a relaxed molecular clock model. (b) Relaxed clock dated phylogenies of the five resistant clusters circulating among drug-naïve individuals. Branch lengths are expressed in calendar years.
|
|
|
|---|
The resistant lineages involved HIV subtype B viruses exclusively. The absence of non-B resistant clusters is likely due to the low number of non-B sequences available in our cohort (37%). More importantly, a majority of non-B HIV infections diagnosed in the United Kingdom were acquired abroad and subsequently introduced to the country (18), reducing the likelihood of United Kingdom-based transmission chains. However, changes in the epidemiological structure of HIV epidemics can be expected in the future, as illustrated by the recent identification of HIV transmission networks of subtypes other than B among men who have sex with men (15). The increasing availability of molecular data from imported HIV strains is likely to further establish the role of the emergent HIV diversity, especially with regard to transmitted resistance.
The survival of resistant lineages within untreated populations for up to 8 years raises questions about the persistence and transmissibility of drug-resistant variants. The fitness costs of some drug resistance mutations have been well-established (11, 16), and the presence of common polymorphic changes and/or acquisition of compensatory mutations known to increase the replicative fitness of resistant viruses is likely to have contributed to the persistence of the clusters. However, the relationship between in vitro measurement and in vivo replication is unclear, and some transmitted drug-resistant species can persist for months or years (24). Since HIV populations experience a substantial loss of genetic diversity during transmission (20), the emergence of drug-sensitive viruses in patients infected with drug-resistant strains is likely to be due to mutational reversion rather than outgrowth of archived wild-type viruses. In this regard, it is noteworthy that two of the five clusters harbor multiple resistance mutations, which may make reversion more difficult. T215Y mutations, for instance, are known to confer severe fitness cost (16) and were found in cluster E in association with other resistant polymorphisms. Yet this cluster persisted for over a year in a drug-free environment. This is suggestive of a maintained fitness in the context of other resistance mutations, similar to the compensatory mechanism identified in viruses with multiple PI resistance (47). Of the other clusters, K219Q and K103N are not thought to confer major fitness loss in isolation (5, 47). The persistence of such drug-resistant viruses in drug-naïve patients could also result from short transmission intervals, such that resistance polymorphisms are transmitted to the next individual before reversion occurs.
The dated phylogenies reconstructed in the present study provide new insights into the temporal structure of resistance transmission among untreated HIV patients. According to our estimates, three of the five resistant lineages originated between 1999 and 2003, which corresponds to the time when the transmission of DRMs was at its highest in the United Kingdom (45). The distribution of transmission patterns in the large clusters, such as clusters A and B, clearly indicates that infections among drug-naïve individuals have taken place over several years, reinforcing the idea of transmission networks involving more than one individual. However, the hypothesis that a single transmitter infected more than one individual cannot be totally excluded for cluster E, where the dates of transmission events fell within a narrow time frame.
The existence of resistance reservoirs among untreated HIV-infected persons has multiple implications. Antiretroviral therapy is increasingly successful at long-term suppression of viremia (31), with a consequent reduction in rates of transmitted resistance from these individuals (45). The persistence of resistant viruses in drug-naïve individuals suggests that there is a limit to the reduction in rates of transmitted HIV drug resistance. Current treatment guidelines recommend initiation of therapy when CD4 counts are below 350 cells per µl (14). This allows up to 30% of the diagnosed population (and all undiagnosed individuals) to maintain high viremia and infectivity. It is such individuals in whom circulation of resistance will continue. We argue that current discussions on an earlier start of therapy, such as the Strategic Timing of AntiRetroviral Treatment trial, should also consider the public health benefits of reducing transmission rates of drug-resistant viruses.
The cohort studied in the present work is, to our knowledge, the largest population of drug-naïve and -experienced HIV-1 patients from a single country that has been used for a molecular epidemiological analysis to date. Out of the 60,000 individuals diagnosed with HIV in the United Kingdom, about 55,000 HIV pol gene sequences were sequenced, a small fraction of which were longitudinally generated from the same patient. We therefore estimate that about 70% of known HIV-positive individuals in the United Kingdom are represented in our data set. Yet only 45 out of the 4,870 (
0.9%) drug-naïve subtype B viruses included were involved in transmission of drug resistance. This percentage is remarkably low considering the current prevalence of drug resistance among untreated patients (5 to 10%) and estimated levels of ongoing transmissions within the population (23). However, the database does not represent all infected individuals in the United Kingdom, let alone those remaining undiagnosed, and our results are likely to underestimate the burden of drug-resistant reservoirs. This may also explain why previous studies of transmission networks based on smaller data sets failed to characterize resistant lineages (23, 30). Since all treated patients carrying resistant viruses in the United Kingdom are included in our database, it is highly unlikely that viruses from drug-experienced patients not represented within our data set are present as intermediaries in the lineages shown. Furthermore, the short internode intervals characterizing the five clusters narrow down the likelihood of missing lineages in these transmission networks. In any case, we believe that the possible involvement of treatment-experienced individuals in transmission clusters does not challenge our findings, since the acquisition of drug resistance would predate their connection to the network. These individuals may also have transmitted the virus prior to therapy. It is nonetheless possible that the presence of treated individuals in the clusters (such as in cluster B) could "boost" the persistence of drug resistance in the untreated population. With about 80% of new HIV diagnosis patients undergoing resistance testing since 2007 in the United Kingdom, our sequence database will increasingly represent all diagnosed individuals in the country, and future work on the transmission of drug resistance among untreated individuals will yield more accurate estimations.
In summary, given the current decrease in resistance transmitted from treated individuals, an increasing proportion of such transmissions is likely to derive from drug-naïve lineages. The extent to which reservoirs of resistance can persist in a drug-naïve population has yet to be estimated, but the very existence of these sustained reservoirs suggests there is a limit to the decline of transmitted drug resistance. These findings provide new insights for the planning and management of treatment programs in developing countries.
We also thank the Steering Group of the UK HIV Drug Resistance Database: Jane Anderson, Homerton University Hospital, London; David Asboe and Anton Pozniak, Chelsea & Westminster Hospital, London; Sheila Burns, Royal Infirmary of Edinburgh; Sheila Cameron, Gartnavel General Hospital, Glasgow; Patricia Cane, Health Protection Agency, Porton Down; Ian Chrystie, Guy's and St. Thomas' NHS Foundation Trust, London; Duncan Churchill, Brighton and Sussex University Hospitals NHS Trust; Duncan Clark, St. Bartholomew's and The London NHS Trust; Valerie Delpech, Health Protection Agency Centre for Infections, London; David Dunn, Esther Fearnhill, Hannah Green, and Kholoud Porter, MRC Clinical Trials Unit, London; Philippa Easterbrook and Mark Zuckerman, King's College Hospital, London; Anna Maria Geretti, Royal Free NHS Trust, London; Paul Kellam, Deenan Pillay, Andrew Phillips, and Caroline Sabin, Royal Free and University College Medical School, London; David Goldberg, Health Protection Scotland, Glasgow; Mark Gompels, Southmead Hospital, Bristol; Antony Hale, Leeds Teaching Hospitals NHS Trust; Steve Kaye, St. Mary's Hospital, London; Svilen Konov, Community Advisory Board; Linda Lazarus, Department of Health; Andrew Leigh-Brown, University of Edinburgh; Nicola Mackie, St. Mary's Hospital, London; Chloe Orkin, St. Bartholomew's Hospital, London; Caroline Sabin, University College London; Erasmus Smit, Health Protection Agency, Birmingham Heartlands Hospital; Peter Tilston, Manchester Royal Infirmary; Ian Williams, Mortimer Market Centre, London; Hongyi Zhang, Addenbrooke's Hospital, Cambridge.
We are grateful to the following participating laboratories: Addenbooke's Hospital, Cambridge (Hongyi Zhang); Department of Virology at St. Bartholemew's and The London NHS Trust (Duncan Clark, Ines Ushiro-Lumb, and Tony Oliver); Belfast Health and Social Care Trust (Suzanne Mitchell); HPA Birmingham Public Health Laboratory (Erasmus Smit); Chelsea and Westminster Hospital, London (Adrian Wildfire); Dulwich Hospital, London (Melvyn Smith); Royal Infirmary of Edinburgh (Jill Shepherd); West of Scotland Specialist Virology Lab Gartnavel, Glasgow (Alasdair MacLean); Guy's and St. Thomas' NHS Foundation Trust, London (Ian Chrystie); Leeds Teaching Hospitals NHS Trust (Diane Bennett); Specialist Virology Centre, Liverpool (Mark Hopkins) and Manchester (Peter Tilston); Department of Virology at Royal Free Hospital, London (Clare Booth and Ana Garcia-Diaz); St. Mary's Hospital, London (Steve Kaye); University College London Hospitals (Stuart Kirk); Medical Research Council Clinical Trials Unit (MRC CTU), London (David Dunn, Esther Fearnhill, Hannah Green, and Kholoud Porter).
This work was funded by the European Union, UK Department of Health, and UCLH/UCL Comprehensive Biomedical Research Centre. The UK HIV Drug Resistance Database is partly funded by the Department of Health. Additional support was provided by Boehringer Ingelheim, Bristol-Myers Squibb, Gilead, Tibotec (a division of Janssen-Cilag Ltd.), and Roche.
The views expressed in this publication are those of the authors and not necessarily those of the Department of Health.
The founding bodies had no role in the study design, collection, analysis, and interpretation of data, writing of the paper, or decision for publication.
Published ahead of print on 21 January 2009. ![]()
Supplemental material for this article may be found at http://jvi.asm.org/. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»