Previous Article | Next Article ![]()
Journal of Virology, August 2005, p. 10718-10729, Vol. 79, No. 16
0022-538X/05/$08.00+0 doi:10.1128/JVI.79.16.10718-10729.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Experimental Medicine, University of Rome "Tor Vergata," Rome,1 National Institute for Infectious Diseases, "L. Spallanzani," Rome,3 Istituto Superiore di Sanità, Rome,4 Institute of Infectious and Tropical Diseases, University of Milan, Milan, Italy,5 Department of Pharmacology, University of Alcalá, Alcala de Henares, Spain,2 Rega Institute for Medical Research, K. U. Leuven, Leuven, Belgium6
Received 22 March 2005/ Accepted 19 May 2005
|
|
|---|
|
|
|---|
HIV-1 RT is a multifunctional enzyme that possesses RNA-dependent and DNA-dependent DNA polymerase activities as well as an RNase H activity that specifically degrades the RNA strand of RNA/DNA hybrids (25). Its structure is a heterodimer consisting of two subunits, termed p66 and p51. Although both subunits share the first 440 amino acids (aa) of the RT gene, their relative arrangements are significantly different, and the substrate-binding site is located only in the p66 subunit. p51 does not directly contribute to catalytic enzyme activities (25, 38) but is essential for loading the p66 subunit on the template/primer (27). Indeed, the single p66 monomer is devoid of any catalytic activity.
As an intrinsic property, and in contrast to other DNA polymerases, HIV-1 RT lacks a proofreading function. This error-prone nature of RT, together with the high rate of virus production sustained by HIV-1 infection in vivo, highly contributes to the continuous generation of new viral variants (14, 48, 63). In addition, retroviral particles contain two copies of single-stranded RNA, and template switches occur frequently during reverse transcription, thus generating mutations and recombinations by intramolecular and intermolecular jumps. This intrinsic variability is dramatically enhanced under suboptimal drug pressure, resulting in mutations particularly in the pol region encoding both RT and protease (PR) enzymes (13, 24, 35, 65).
The emergence of resistance is the inevitable consequence of incomplete suppression of HIV replication by the antiretroviral drugs and represents a frequent and major limitation of antiviral therapy (13, 55). As a consequence, sequencing of HIV-1 RT and PR has been recommended to guide the choice of antiretroviral therapy in clinical practice (17, 29). Most of the known RT inhibitor resistance mutations are in the 5' polymerase coding regions, particularly in the fingers and palm subdomains. Therefore, on a routine basis, only the first part of the genetic information of RT (up to 240 to 300 aa) is generally analyzed (57).
To date, mutations at 61 residues in the RT gene have been related to treatment with experimentally tested RTIs (13, 24, 35, 55, 57). Of these, 18 sites are involved in resistance against the eight currently approved nucleoside and nucleotide RT inhibitors (NRTIs) and 16 sites are involved in resistance against the three currently approved nonnucleoside RT inhibitors (NNRTIs) (35) (Stanford HIV sequence database 2005 [http://hivdb.Stanford.edu]). This variability has been extensively studied (1, 6, 15, 19, 23, 24, 26, 30, 39, 46, 51, 52, 61, 67), yet limited information is available regarding the conserved regions of HIV RT (3, 34), even under strong antiviral pressure.
The identification of conserved regions within the HIV-1 genome allows a better characterization of essential regions in the viral proteins and can help in the design of new therapeutic strategies aimed to drive the virus to mutate at key amino acids that are crucial for the maintenance of sufficient viral fitness (54). The resulting virus may be replicating at such a limited rate that its ability to damage the immune system, and to further mutate to compensate for such decreased fitness, is minimal. The characterization of the conserved areas of HIV enzymes and proteins is therefore of crucial importance to investigate new and innovative attempts to control and prevent efficient virus spread in the body.
This research approach led us to investigate and define the conserved areas of HIV-1 PR (10). We then concentrated our attention on HIV-1 RT, in an attempt to define the minimal conserved structure of the first 320 aa of 2,236 HIV-1 RT sequences obtained from a well-defined cohort of 1,704 HIV-1-infected individuals (drug-naïve and drug-treated patients). Conserved amino acids have been analyzed and mapped onto the structure of both p66 and p51 subunits using available X-ray information. The identification of novel HIV-1 RT invariant regions can now be used in the rational design of new HIV-1 RT inhibitors with a much more favorable resistance profile.
|
|
|---|
At the time of genotypic analysis, 99% of drug-treated patients were under treatment with NRTIs: 62.3% of patients were treated with lamivudine (median time, 582 days), 54.4% with stavudine (median time, 553 days), 35.8% with didanosine (median time, 432 days), 30.5% with zidovudine (median time, 577 days), 14.4% with abacavir (median time, 351 days), 2.6% with dideoxycytosine (median time, 457 days), and 1.7% with tenofovir (median time, 171 days). Among these RTI-treated patients, 35% were also under NNRTIs: 17.9% with nevirapine (median time, 470 days) and 17.1% with efavirenz (median time, 448 days). PR inhibitors were included in the last regimen in 1,103 patients (62%).
HIV sequencing. HIV genotyping was performed on plasma samples by means of a commercially available kit (the ViroSeq HIV-1 genotyping system; Celera Diagnostics/Abbott Molecular Diagnostics, Rome, Italy) based on RNA extraction, retrotranscription, DNA amplification, and sequencing by an automated sequencer (ABI 3100; Applied Biosystems, Foster City, CA) (10, 47). The amplified product contained the entire PR (99-aa) open reading frame and the first 320 aa of the RT open reading frame.
Sequence data were analyzed by dedicated HIV genotyping system software that automatically assembles the seven sequence segments into a consensus sequence, which is then compared to an NL4.3 reference strain. For each individual, the quality endpoint was ensured by a coverage of the RT sequence by at least three sequence segments. Sequences having a mixture of wild-type and mutant residues at single positions were considered to have a mutation at that position. When the mixture contained two different mutations, both were considered and reported (see Fig. 2). HIV subtype analysis showed that 18 out of 457 (3.9%) drug-naïve patients carried non-clade B pol subtypes (F = 4, A = 2, C = 1, G = 1, and recombinant subtypes = 10). Similarly, 60 out of 1,247 RTI-treated patients (4.8%) carried non-clade B pol subtypes (F = 27, C = 11, G = 4, A = 4, D = 2, and recombinant subtypes = 12). Overall, the large majority of the sequences included in this analysis (2,102/2,236, 94%) carried a pol B subtype virus.
![]() ![]() View larger version (60K): [in a new window] |
FIG. 2. Polymorphisms and mutations in HIV-1 RT from drug-naïve and RTI-treated patients. Amino acid mutations in HIV-1 RT from 457 plasma samples from drug-naïve patients and 1,779 samples from RTI-treated patients are reported. The clade B consensus sequence is shown as a reference. Residues associated with RTI treatment by previous in vitro or in vivo studies (24, 35, 55, 56) are underlined. Residues whose wild-type prevalence differs significantly (P < 0.05) between naïve and treated patients are shown in boldface. The predominant mutation for each position is given closest to the reference. Mutations are represented by one-letter amino acid symbols and are numbered.
|
Published pol consensus sequences of pure HIV-1 subtypes (A, B, C, D, F1, F2, G, H, J, and K) (Los Alamos DB, http://hiv-web.lanl.gov/content/hiv-db/mainpage.html), together with GenBank (http://www.ncbi.nlm.nih.gov)-published pol lentiviral sequences (of HIV-2 P18042, visna/maedi virus AAM51650 feline immunodeficiency virus [FIV] AAM13444 equine infectious anemia virus [EIAV] AAK21112 caprine arthritis encephalitis virus [CAEV] AAG48629 ovine lentivirus AAO33139 puma lentivirus AAA67168 simian immunodeficiency virus [SIV] SIVS4 P12502 and SIVCZ P17283, and bovine leukemia virus [BLV] P19561), were examined in this study and aligned using the Clustal X sequence alignment program.
Mutations.
Consensus B was used as a reference strain for the definition of mutations. Mutations associated (by in vitro and in vivo studies) with resistance to NRTIs and NNRTIs currently discovered (24, 35, 55), as well as mutations not yet associated with drug resistance, were analyzed. In particular, the frequency of mutations in the first 320 aa of RT was calculated and statistically compared using the
2 test (based on a 2-by-2 contingency table containing the numbers of isolates from untreated and treated persons and the number of isolates with and without mutations).
Amino acids not mutated or mutated with
1% prevalence over each cohort of patients (drug naïve or treated) were defined as conserved. HIV-1 RT depictions were colored according to mutation frequency rate, so that in specific figures blue corresponds to highly conserved residues (variability in
1% of patients), cyan to amino acids mutated only in 1 to 5% of patients, green to amino acids mutated in 5 to 10% of patients, yellow to amino acids mutated in 10 to 25% of patients, and red to residues that appear mutated in >25% of patients (for further details, see figure legends).
Conservative (strongly or weakly) and nonconservative amino acid substitutions were recognized according to the ClustalW formalism (http://www.ebi.ac.uk). Residue substitutions were considered strongly conservative if amino acid alterations were from/to one of the amino acids within the following groups having similar biochemical characteristics: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, and FYW; they were weakly conservative if alterations were from/to one of the amino acids within the following groups having different biochemical characteristics: CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK, NDEQHK, NEHQRK, MILVF, and HFY.
Structural analysis. Several X-ray crystallographic structures of HIV-1 RT available from http://www.rcsb.org/pdb were used to map mutations and conserved residues in three dimensions, namely, HIV-1 RT in complex with efavirenz and solved at 2.5-Å resolution (PDB code 1FK9 [49]), the apoenzyme solved at 2.7-Å resolution (PDB code 1DLO [31]), and DNA-bound HIV-1 RT (PDB code 1RTD [32]). The public domain program ViewerLite 5.0 (Accelrys) was employed for visualization and display.
Nucleotide sequence accession numbers. Nucleotide sequences of isolates from patients treated with at least a PI within the highly active antiretroviral therapy regimens (10, 58) were recently submitted to GenBank with these accession numbers: AY855351 to AY855439, AY855441 to AY855458, AY855460 to AY855502, AY855504 to AY855556, AY855558 to AY855773, AY855775 to AY855795, AY855797 to AY855818, AY855820 to AY855836, and AY995408 to AY995555; the rest of the sequences are in the process of being submitted.
|
|
|---|
(i) Drug-naïve patients.
The analysis of sequences from drug-naïve patients showed conservation (
1% variability) in 233 out of 320 aa (73% overall conservation) (Fig. 1). A few invariant residues were scattered throughout the sequence either individually (K103, S105, N175, K201, E203, L282, G285, K287, and L295) or either pairs or triplets, whereas the majority (159/233, 68%) clustered into 14 regions comprising 5 to 29 consecutive invariant residues (Fig. 1). The nine longest invariant regions, containing some functionally important residues, were I (T7 to P19), II (K70 to K82), III (Q91 to P97), IV (T107 to S117, including the catalytic D110), V (F124 to S134), VI (N147 to P157), VII (Y181 to E194, including the catalytic D185 and D186), VIII (T216 to I244), and IX (W252 to Y271).
![]() View larger version (48K): [in a new window] |
FIG. 1. Conserved regions of HIV-1 RT in drug-naïve and drug-treated HIV-1-infected patients. The sequence of the first 320 aa of HIV-1 RT of clade B consensus (shown as a reference) is colored according to the frequency rate of mutations observed in 457 plasma samples from drug-naïve patients and 1,779 plasma samples from RTI-treated patients. The long invariant regions are boxed (I to IX) in both drug-naïve and RTI-treated patients.
|
Compared to the nine large conserved areas found in drug-naïve patients, the degree of conservation in drug-treated patients was almost identical at six regions (I, III, IV, V, VI, and IX). Minor exceptions were found at T7, K11, and F116 (with a variation frequency somewhat increased to
3% of patients) and V108 and Q151 (with a variation frequency increased up to 6% and 4% of patients, respectively). Interestingly, the decrease of variation of aa 146 (from 2 to 0.8%) that was found in the group of RTI-treated patients resulted in an upstream extension of conserved area VI up to position R143.
By contrast, a different picture, characterized by a substantial decrease of amino acid conservation, was observed in the other three large conserved areas (II, VII, and VIII). The conserved areas II and VII markedly shrank to smaller stretches, whereas the largest conserved region, VIII, was split into several isolated conserved residues and two smaller conserved areas (229 to 237 and 239 to 244) (Fig. 1).
These results are consistent with a previous analysis conducted on 1,210 different RT sequences from drug-naïve and NRTI-treated (but NNRTI-naïve) individuals (24). However, this study addressed only the mutation frequency at RT positions 1 to 240; in addition, the focus was on characterizing RT mutations, while distinction between invariant residues (variability of <1%) and residues with low variability (
3%) was not made. Taking into account these considerations and despite a general agreement, differences with that analysis were found in the rate of conservation at a small number of amino acid positions (e.g., 36, 88, 172, and 175 and obviously NNRTI-associated residues [100, 181, 189, 190, 194, 225, and 227]) for which we found variability of >1% (and >5% at residues 181 and 190), not reported in the aforementioned study (24), and at residues 111, 115, 224, and 239, for which variability was reported to be >3% in NRTI-treated patients (<1% in our study). It is assumable that the differences in RT conservation/mutation patterns highlighted by these studies may reflect the circulation of distinct viruses in different geographic areas as well as variation in patients' characteristics and RTI regimens.
Pattern of HIV-1 RT mutations.
We found several RT polymorphisms in drug-naïve patients. Among 87 variable residues, 42 were mutated in >5% of drug-naïve patients; of them, 14 were highly variable (replaced in >25%) (Fig. 1). The frequency of the major drug resistance mutations in drug-naïve patients was consistently
1%, with only the exception of V118I (2.6%), while minor drug resistance mutations (or unknown mutations in drug resistance-associated residues) were present at low variability in drug-naïve patients: A98S (6.8%), K101R (2.2%), V106I (1.3%), E138A (3.5%), V179I (3%), and T215D (1.8%) (Fig. 2).
As expected, the development of drug resistance during therapy failure resulted in a general increase of mutations in the RT from RTI-treated patients, with 70 out of 320 residues (22%) found mutated in >5% of patients (compared to 42 in drug-naïve patients [13%], P = 0.005). Of these, 20 were highly variable (replaced in >25%) (Fig. 1).
Statistically significant differences (P < 0.05 by
2 test) in the variability frequency between HIV-1 RT sequences from drug-naïve and RTI-treated patients were found at 53 out of 320 residues (17%) (shown in boldface type in Fig. 2). Among them, 17 known NRTI resistance sites (M41, E44, A62, K65, D67, T69, K70, L74, V75, F77, F116, V118, Q151, M184, L210, T215, and K219) and 16 known NNRTI resistance sites, not only for efavirenz and nevirapine (A98, L100, K101, K103, V106, V108, I135, E138, V179, Y181, Y188, V189, G190, P225, F227, and K238), were identified (35, 55, 57; http://hivdb.Stanford.edu). Besides, seven new sites (E28, K122, G196, Q278, R284, T286, and A288) were positively associated with RTI treatment, together with nine sites (K20, T39, K43, E203, H208, D218, H221, K223, and L228) recently associated with NRTI therapy (24). Interestingly, at four new sites (T27, I50, R83, and V276) the variability frequency was statistically decreased in the group of RTI-treated patients relative to the group of drug-naïve patients (P < 0.004; Fig. 2).
Regarding the type of mutations, some unusual changes at known NRTI resistance sites (less common than others at the same positions) (E44A, D67G, L74I, V75M, V75T, T215I, K219R, and K219N), as well as novel mutations not yet associated with resistance to any known RTI (K20R, E28A, E28K, V35M, T39A, K43E, K43Q, K43N, K122E, G196E, E203K, E203D, H208Y, D218E, H221Y, K223Q, K223E, L228H, L228R, Q278H, R284K, T286A, and A288T), were significantly increased in treated patients (P <0.05 to <0.0001). Surprisingly, a few positions were characterized by a decreased rate of mutations in treated versus drug-naïve patients: T27P (from 1.3 to 0.22%, P = 0.007), T27A (from 1.3 to 0.11%, P = 0.001), V35I (from 20.8 to 11.3%, P < 0.0001), I50V (from 5 to 2.3%, P = 0.004), R83K (from 33 to 18.6%, P < 0.0001), and V276I (from 5 to 1.7%, P < 0.0001).
Interestingly, among mutations not yet associated with resistance to any known RTI, some of them (T39A, K122E, and T286A) were also included as candidate mutations for drug resistance and positive reproductive fitness, by having a ratio of nonsynonymous mutations to synonymous mutations (Ka/Ks) of >1, with values similar to those observed for the already-known RTI resistance mutations (12).
Structural interpretation of the minimal conserved areas of HIV-1 RT.
It is known that the HIV-1 RT catalytic activities are localized in the p66 subunit that is composed of all 560 aa encoded in the RT gene, whereas the p51 subunit, made up of 440 shared aa, displays no enzymatic activity although it provides essential structural support to the enzymatically active p66 monomer in the RT dimer (25). It is important to recall that the shared 440 aa are arranged in structurally different ways in p66 and p51 and also that mutations occurring in vivo in the RT genome influence both subunits (50). Therefore, in an attempt to define an improved minimal conserved structure of HIV-1 RT, we rationalize our findings also in structural terms by mapping the first 320 aa of RT from both naïve and treated patients onto a three-dimensional representation of both p66 and p51 subunits (Fig. 3). Among the large conserved areas observed in drug-naïve patients, regions I (7 to 19), II (70 to 82), and V (124 to 134) are located in the "fingers" subdomain (1 to 88 and 121 to 146). Regions III (91 to 97), IV (107 to 117), VI (147 to 157), VII (181 to 194), and VIII (216 to 244) are located in the "palm" subdomain (89 to 120 and 147 to 242), while region IX (252 to 271), containing a long
-helix, usually termed
H, is in the "thumb" subdomain (243 to 311) (Fig. 3). The IV and the VII areas contain, respectively, the three aspartate residues involved in catalysis (D110, D185, and D186). The largest conserved area in drug-naïve patients, VIII (216 to 244), contains residues (224 to 235) that are involved in positioning the primer terminus ("primer grip" [40]).
![]() View larger version (48K): [in a new window] |
FIG. 3. HIV-1 RT conserved area locations. Left: Ribbon representation of the C trace of HIV-1 RT (as found in the X-ray crystal structure with PDB code 1DLO) with p66 and p51 subunits each enveloped by a solvent-accessible surface colored in blue and pink, respectively. The boxed area is enlarged in Fig. 4. Right: Ribbon representation of the X-ray crystal structure of HIV-1 RT showing the location in both p66 and p51 subunits of the large conserved areas (I to IX) characterized in both drug-naïve and RTI-treated patients.
|
![]() View larger version (85K): [in a new window] |
FIG. 4. Detail of the NNRTI binding and catalytic sites in HIV-1 RT. Enlarged view of the boxed area shown in Fig. 3. The C trace of HIV-1 RT, as found in its crystallographic complex (PDB code 1FK9) with efavirenz (shown as sticks), is displayed as a ribbon, colored in gray and pink for the p66 and p51 subunits, respectively, except for all residues having a mutation frequency of <1%, which are colored in green. Side chains of nonmutable residues in the neighborhood of the NNRTI binding site, together with the active-site aspartates (D185, D186, and D110), are shown as sticks, with carbon atoms colored in green.
|
B 78 to 82, 86 and 87, 89, 91 to 93, 147 to 150, and 152 to 154 remain conserved. It is interesting that, from mutational analyses of the 67 to 81 region, residues W71, R72, D76, R78, and N81 (residues also evolutionarily conserved) were intolerant of substitution compared to the other nine, thus suggesting an important role for these residues in polymerase activity (8, 37).
Region IX (252 to 271) is the largest of all the fully conserved stretches in both naïve and drug-treated patients (Fig. 1 and 3). Residues 259 to 284 exhibit sequence homology with other nucleic acid polymerases and have been termed the "helix clamp" (4, 28). In p66 these residues belong to the polymerase domain as part of the thumb, and it has been shown by alanine-scanning mutagenesis that specific amino acids (Q258, G262, and W266) are critical for template-primer affinity (5). Another recent paper has demonstrated that mutation of N255 or N265, each adjoining the minor groove-binding track, to aspartate leads, respectively, to a drastic reduction of processive synthesis on both templates or to a loss of processive polymerization on DNA but not on RNA (18). The invariant region IX in the p66 subunit is essential for DNA polymerization and can have an important role also in p51. In fact,
H (V254 to S268) in the p51 subunit is in contact with the RNase H domain of p66, suggesting a substantial contribution of this region to dimer stabilization (38, 41, 62).
In this regard, the segment from I135 to P140 in p51, which is part of the ß7-ß8 loop and essential for the catalytic activity of the p66 subunit, has been identified as a "hot spot" of binding energy (53) and is invariant in HIV-1 RT in three residues, N136, N137, and P140. In particular, N136 represents a highly conserved amino acid among all known lentivirus RTs, whose role in HIV-1 RT has been recently shown to be severely compromised upon mutation to other amino acids (2a).
The NNRTI binds at a hydrophobic pocket approximately 10 Å from the polymerase active site (38). Analyzing the conservation within the NNRTI binding pocket, in drug-naïve patients, 14 of 19 residues that make up the hydrophobic NNRTI binding pocket (P95, L100, K103, V108, Y181, Y183, Y188, G190, P225, F227, W229, E233, L234, and P236) are fully conserved (<1%), others (K101, V106, and Y318) are mutated at low frequency (<3%), and two (A98 and V179) are mutated in 5 to 8% of patients (Fig. 4). As expected, and in contrast to what is observed in naïve patients, the NNRTI binding cavity of HIV-1 RT from drug-experienced individuals displays a high and variable occurrence (from 5 to 29%) of mutations at positions A98, K101, K103, V106, V108, V179, Y181, and G190. Nevertheless, seven residues (P95, Y183, W229, E233, L234, P236, and Y318) remain fully conserved, and others (L100, Y188, P225, and F227) are mutated only rarely (<4% frequency) (Fig. 4). The invariant amino acids in the HIV-1 NNRTI binding pocket, as already shown for W229 and Y318 (44, 45), should be regarded as prime candidates to be targeted by modified or novel NNRTIs (43) in order to rationally design new drugs with a more favorable resistance profile.
RT evolutionary conservation. Being aware of the potential limitation of working solely with HIV-1-derived data (mainly B subtype), we undertook an analysis of RT conservation also in a variety of HIV-1 clades and other published retroviral sequences. Consistent with the pattern of sequence conservation observed in HIV-1 RT from isolates of drug-naïve patients, the largest invariant regions are also highly conserved in RTs from other HIV-1 subtypes (A, C, D, F1, F2, G, H, J, and K), with the single exception of RT from clade H in which, at position 107, Thr (present in all subtypes) has been replaced by Ser (a conservative mutation) (data not shown). This indicates that, in the absence of any drug pressure and independent of both virus origin and circulation, HIV-1 RT amino acid variations (i.e., mutations) are indeed allowed (more than 30% of the sequence), but only in restricted selected areas.
In contrast, when we compared the sequence of the first 320 aa of HIV-1 RT consensus clade B with the corresponding RT sequences from other lentiviruses (HIV-1 subtypes, HIV-2, EIAV, visna/maedi virus, CAEV, BLV, FIV, and SIV) a completely different picture (Fig. 5), characterized by a substantial decrease of amino acid conservation, became visible. Indeed, among all viruses only 73/320 residues were fully conserved, 47 residues mutated toward strongly conservative amino acid substitutions, and 17 residues mutated toward weakly conservative substitutions, with an overall evolutionary conservation of 137 residues out of 320 (43%). As previously proposed (3), the N terminus of the HIV-1 RT lies on a surface distantly located from the active site and is accessible to the HIV-1 PR. This area is subject to a considerable variation. Instead, the most evolutionarily conserved RT regions (in the frame of aa 20 to 190 of HIV-1 RT) are within the inner surface of the enzyme and responsible for polymerization function. Overall, several evolutionarily conserved stretches in lentiviral RTs were also found invariant in HIV-1 RT under antiviral pressure (G18-P19, Q23 to L26, E29-K30, G45 to I47, N57 to P59, W71-R72, R78 to K82, G93 to P97, L109 to V111, D113 to Y115, P119-L120, Y127 to I132, N136-N137, R143-Y144, L149-P150, G152-W153, S156-P157, I159 to Q161, I167-L168, D185 to L187, S191-D192, W229 to Y232, and T253 to L260) (Fig. 5, see boxes), whereas a lower number of RT residues are overall conserved in more distantly related retroviral sequences (P52, N57, P59, K65, R72, D76, R78, N81, D110, D113, F116, A129-F130, L149 to G152, S156-P157, L168, Y183, D185-D186, L209, G213, K220, G231, and G262) (34).
![]() View larger version (48K): [in a new window] |
FIG. 5. Conserved regions of RT in lentiviruses. The sequence of the first 320 aa of HIV-1 RT of clade B consensus (shown as a reference) is colored according to the degree of conservation observed in different lentiviruses. Published pol consensus sequences of pure HIV-1 subtypes (A, B, C, D, F1, F2, G, H, J, and K) together with published pol sequences of HIV-2 (P18042 from residue 198), visna virus (AAM51650from residue 160), FIV (AAM13444from residue 163), EIAV (AAK21112from residue 175), CAEV (AAG48629from residue 147), ovine lentivirus (AAO33139from residue 14), puma lentivirus (AAA67168from residue 64), SIVS4 (P12502 from residue 168), SIVCZ (P17283 from residue 180), and BLV (P19561 from residue 157) were aligned using the Clustal X program. Mutated residues with conservative amino acid substitutions were considered either strongly conserved, if amino acid alterations were from/to one of the amino acids within the groups STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, and FYW, or weakly conserved, if alterations were from/to one of the amino acids within the groups CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK, NDEQHK, NEHQRK, MILVF, and HFY. RT conserved regions in lentiviruses and HIV-1 (from our cohort of drug-naïve and RTI-treated patients) are boxed.
|
At least 140 HIV-1 RT conserved residues have been mutagenized in vitro, and the effect of one or more of these mutations has been analyzed by measuring RNA- and/or DNA-dependent DNA polymerase activity, RNase H activity, strand transfer, structural stability, and/or p66-p51 heterodimerization (2, 2a, 4, 5, 7, 8, 11, 18, 20-22, 37, 40-42, 44, 45, 59, 64). Of these 140 mutagenized residues, 79 (56%) have been shown to be extremely important to keep the functional and/or structural integrity of RT, thus confirming their key role in RT and the necessity for their conservation. In contrast, the other 61 invariant residues (44%) did not show any drastic effects on the efficiency of in vitro RT/RNase H activity (
30% of that of wild-type HIV RT). This does not necessarily prove an unsupportive role in HIV-1 RT structure and/or function but suggests that these amino acids, in principle, may be mutated without a deleterious loss of catalytic RT efficiency. The fact, for instance, that the majority of these residues (48/61) are subject to variation in the larger world of lentiviruses (3) (Fig. 5) may confirm that substitutions on these sites are tolerable in vivo, without causing a major deleterious effect on the RT enzyme. Therefore, we cannot exclude that mutations at some of these residues may occur under the pressure of novel RT inhibitors with a different mechanism of action. Future studies may provide answers to these considerations. On the other hand, the other residues (13/61) that are invariant also among other lentiviruses may play a role or perform a function that cannot be easily detected by the usual in vitro assays, which do not always cover the multifunctional activities of HIV-1 RT, but are possibly amenable to targeting by new drugs. This further emphasizes the important value of both in vivo and in vitro observations to achieve results that are both noticeable and reliable and demonstrates that the need of evolutionary conservation is not sufficient to avoid the problem of RT becoming resistant to nucleoside analogues, even if they interact with the well-conserved polymerase active site.
Finally, we should consider that the conservation or mutation at the amino acid level observed in vivo is also dependent, besides the functional or structural constraint, on the nature of the amino acid itself, and thus the codon usage may be relevant in the development of immune escape and drug resistance (36). To detect the overall selection pressure at the amino acid level of HIV-1 RT, nucleic acid sequences of 179 randomly selected patients with clade B pol subtype were analyzed. The ratio between nonsynonymous and synonymous substitutions (dn/ds) was calculated as a measure of selection pressure (66). Both RT and PR genes were under strong purifying (negative) selection pressure (dn/ds < 1) in both drug-naïve and drug-treated patients, as previously reported (12, 56). Interestingly, RT nucleotide sequences showed a remarkably lower dn/ds ratio than PR, suggesting a greater conservation of RT even under drug pressure (data not shown). Recently, analyzing 40,000 samples of sequences of the entire PR and the first 381 RT codons, Chen et al. detected 69 individual PR mutations and 142 RT mutations under positive selection pressure (12). Among the 107/381 RT codons reported with a Ka/Ks value of >1, very few were invariant residues in our study (I31, S105, T107, P176, P243, I257, A304, I309, and Y319).
In conclusion, HIV-1 can become resistant to the currently available drugs because of its high replication and mutation rate (54), yet several amino acid sequences are highly conserved even under strong drug pressure. The data presented here may allow the design of new drugs that force the virus toward a mandatory escape to residues characterized by changes in the replicative capacity extremely detrimental for its replication. The resulting mutant virus may be replicating at a limited rate such that its ability to damage the immune system, as well as to further mutate to compensate for the loss of replication capacity, would be remarkably decreased. Therefore, the characterization of the highly conserved residues (involved in protein stability, dimerization ability, template-primer binding, or catalytic activity, and some still poorly understood functions) could help in the rational design of new HIV-1 inhibitors with alternative mechanisms of action and more favorable resistance profiles.
We thank Sara Giannella, Federica Forbici, Maria Concetta Bellocchi, Alessandra Cenci, Fabio Continenza, Andrea Biddittu, and Sandro Bonfigli for sequencing and data management and all the Italian Cohort of Antiretroviral Naive patient study group participants and members.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»