Analysis of Human APOBEC3H Haplotypes and Anti-Human Immunodeficiency Virus Type 1 Activity

ABSTRACT Human APOBEC3H (A3H) has one cytidine deaminase domain (CDD) and inhibits the replication of retrotransposons and human immunodeficiency virus type 1 (HIV-1) in a Vif-resistant manner. Human A3H has five single amino acid polymorphisms (N15Δ, R18L, G105R, K121D, and E178D), and four haplotypes (I to IV) have previously been identified in various human populations. Haplotype II was primarily found in African-derived populations, and it was the only one that could be stably expressed. Here, we identified three new haplotypes from six human population samples, which we have named V, VI, and VII. Haplotypes V and VII are stably expressed and inhibit HIV-1 replication. Notably, haplotype V was identified in samples from all African-, Asian-, and Caucasian-derived populations studied. Using haplotype VII, we investigated the A3H anti-HIV-1 mechanism. We found that A3H virion packaging is independent of its CDD but dependent on a 112YYXW115 motif. This motif binds HIV-1 nucleocapsid in an RNA-dependent manner, and a single Y112A mutation completely disrupts A3H virion incorporation. We further studied the mechanism of A3H resistance to Vif. Although the previously identified APOBEC3G Vif-responsive motif 128DPDY131 is not conserved in A3H, placement of this motif into A3H does not make it become less resistant to HIV-1 Vif. We conclude that stably expressed A3H haplotypes may be more broadly distributed in humans than previously realized, and A3H protein is resistant to Vif. These results have important implications for the role of A3H in retrotransposon and HIV-1 inhibition.

Both human A3B and A3H are poorly expressed in vivo. The A3B mRNAs are barely detectable in white blood cells (8), and this gene is also frequently deleted in certain human populations (16). A3H mRNAs are more easily detectable (28), but no expressed protein has been observed (20). We found that the human A3H gene contains a premature termination codon (PTC) in its fifth (last) exon, compared to the A3H gene of Old World monkeys, and has significantly reduced levels of mRNA. After repairing this PTC, we detected a higher level of A3H protein in cell culture and a strong Vif-resistant anti-HIV activity (6). Later, others reported five single amino acid polymorphisms (N15⌬, R18L, G105R, K121D, and E178D) in the human A3H gene, and based on the five single nucleotide polymorphisms (SNPs) that underlie these protein variants, four haplotypes designated with Roman numerals (I, II, III, and IV) were reported to be present in different human populations (27). Both the N15⌬ and G105R mutations can independently cause a dramatic decrease in A3H expression, and only the haplotype II (hap II) that does not have these two mutations is stably expressed (27). hap II was most common in the African-American population compared to Asian and European populations and potently inhibits HIV-1 replication (27). It is still unclear if there are any other haplotypes in humans, particularly those that can be stably expressed. So far, several groups have observed the anti-HIV-1 activity of human A3H hap I and II (6,12,18,27,39), and two groups found that A3H also inhibits the replication of non-long terminal repeat (LTR) retrotransposons, including long interspersed nuclear element 1 (LINE-1) and the short interspersed nuclear element (SINE) Alu (17,39). However, it is still unclear how A3H is packaged into virions and whether human A3H can be neutralized by HIV-1 Vif. The A3H gene used in our previous studies was hap I, which contains the NRGKE polymorphisms (6). Although it is less stable than hap II, we were still able to express this gene by using the VR vector. We and others (6,12,18) reported that HIV-1 Vif could not neutralize human A3H, although we found that Vif from simian immunodeficiency virus (SIV) that infects rhesus macaques (SIVmac) or African green monkeys (AGM) could (6). Recently, Li et al. (20) confirmed that A3H hap I is completely resistant to HIV-1 Vif, but they reported that the A3H hap II (NRRDD) protein was partially sensitive to HIV-1 Vif. It was concluded that the increased Vif sensitivity was due to a single K121D mutation in the A3H protein (20,46). However, in the investigation carried out by Harari et al. (12), A3H hap II was completely resistant to HIV-1 Vif.
Here, we performed genotype and haplotype analysis of A3H in samples from six major human populations and identified additional haplotypes that can be stably expressed. In addition, we identified the critical domain for A3H virion packaging and found that the K121D mutation does not reduce A3H resistance to HIV-1 Vif.
A3H genotype and haplotype analysis. DNA samples from human variation panels (8 to 10 individuals each) were obtained from the Coriell Institute for Medical Research. The panel designations, originating populations, and sample identification (ID) numbers are listed in Table 1. The Northern European and Caucasian European panels are derived from slightly different geograph-ical regions of Europe (see http://www.coriell.org/ for specific details). Previously identified SNPs that cause the amino acid polymorphisms R18L, G105R, K121D, and E178D, as well as the E140K SNP, were genotyped by means of restriction enzyme digestion assays. Assays were based upon naturally occurring restriction sites that could distinguish between alleles for some of the polymorphisms or, for those SNPs with no naturally occurring sites, production of restriction sites by primer mutagenesis. Control digestion sites common to both alleles were included for each assay to verify that complete digestion had occurred. The amplification conditions for the reactions were 5 min at 95°C, followed by 45 cycles of 1 min at 95°C, 2 min at 60°C, and 3 min at 72°C and 1 cycle of 10 min at 72°C. Betaine (1 M) was included for three primer sets because of the high GC content of the amplicon. Digestions were performed in the original amplification reaction with 5 U of restriction enzyme and 50 mM MgCl 2 added to bring the final concentration to ϳ10 mM. Based upon time course experiments, digestions were complete within 1 h, except for the R18L digestion with Fnu4HI, which requires an overnight digestion to ensure completion (see Fig. S2 in the supplemental material). Sequence analysis of exon 2 for 16 samples showed complete concordance with the results of the diagnostic assay for R18L. Primer sequences, restriction enzymes, and the diagnostic band sizes are given in Fig. S2, S3, S4, S5, and S6 in the supplemental material. For the N15⌬ polymorphism, primers were designed to produce a small PCR product that could be genotyped on 15% polyacrylamide gels (see Fig. S1 in the supplemental material). Haplotypes were directly inferred for individuals that were homozygous for all SNPs or had only one heterozygous SNP. The haplotypes for individuals that were heterozygous for 2 or more SNPs were inferred by the method of Clark (4). The haplotypes for 28 of the individuals studied could be directly inferred because they were homozygous for each SNP or all but one SNP.
Vif activity assay. Vif activity was measured by its ability to rescue ⌬Vif HIV-1 virus infectivity in the presence of A3G or A3H. Viruses were produced from 293T cells by a standard calcium phosphate transfection. Typically, 21 g of plasmid DNAs containing 5 g pNL-Luc⌬env⌬vif, 5 g Vif expression vector, 1 g vesicular stomatitis virus glycoprotein (VSV-G) expression vector, and 10 g A3 expression vector was transfected into 293T cells in a 100-mm culture dish with 20% confluence. The production of HIV-1 was quantified by p24 Gag capture enzyme-linked immunosorbent assay (ELISA). Equal amount of viruses were used to infect GHOST-R3/X4/R5 cells. Thirty-six hours later, cells were lysed in a buffer containing 50 mM Tris-HCl (pH 7.4), 150 mM NaCl, 3 mM EDTA, and 1% Triton X-100. After the nuclei were removed, the cytosolic fraction was used to determine luciferase activity with a luciferase assay kit (Promega).
Virion incorporation assay. 293T cells cultured in 6-well plates were transfected with 3 g HIV-1 proviral construct pNL4-3⌬vif and 3 g A3 expression vector by a standard calcium phosphate transfection. Supernatants were harvested 48 h after transfection and clarified by low-speed centrifugation at 5,000 ϫ g for 10 min at 4°C. Virions were further purified by spinning the clarified supernatants through a 20% sucrose cushion at 296,000 ϫ g for 30 min at 4°C with the S100AT6 rotor (Sorvall). Pellets were dissolved in phosphate-buffered saline (PBS) and analyzed by Western blotting.
GST pulldown. The interaction between Gag and A3H was determined by GST pulldown. Briefly, human A3H and various Gag-GST fusion proteins were expressed in 293T cells. Cells were then lysed with radioimmunoprecipitation assay (RIPA) buffer (50 mM Tris-HCl, pH 7.4, 150 mM NaCl, 1% Triton X-100, 1% sodium deoxycholate, 0.1% SDS, 1 mM phenylmethylsulfonyl fluoride [PMSF], 1 mM EDTA, 5 g/ml aprotinin, 5 g/ml leupeptin). The cytosolic fraction was precleared with Sepharose 4B beads. Samples were either treated or untreated with RNase A at 100 g/ml for 1 h at 4°C and then rocked with prewashed glutathione (GSH)-Sepharose beads for 4 h at 4°C. After being extensively washed with PBS, bead-associated proteins were analyzed by Western blotting.
Western blotting. Horseradish peroxidase (HRP)-conjugated anti-HA antibody (Roche) or HRP-conjugated anti-V5 antibody (Invitrogen) was used to directly detect the expression of A3G, A3H, and Vif proteins. The anti-green fluorescent protein (GFP) monoclonal antibody and anti-actin polyclonal antibody (C-11) were from Santa Cruz Biotechnology. The anti-human A3H monoclonal antibody was obtained from M. Emerman (20). The anti-HIV-1 Gag monoclonal antibody (no. 3537) was from the NIH AIDS Research and Reference Reagent Program. HRP-conjugated anti-human, anti-rabbit, or anti-mouse immunoglobulin G secondary antibodies were from Pierce. HRP-conjugated antibody was detected with an enhanced chemiluminescence detection kit (Amersham Bioscience).

Genotype and haplotype analysis of the human A3H gene.
To further understand how SNPs affect A3H expression, we genotyped each of the five N15⌬, R18L, G105R, K121D, and E178D polymorphic sites in six major human populations by PCR-based assays. In addition, there is another site, E140K, reported to be polymorphic in the Database of Single Nucleotide Polymorphisms (dbSNP; http://www.ncbi.nim.nih.gov /projects/SNP/) (42), which prompted us to also genotype this reported polymorphism. Human genomic DNA samples from Northern European, African American, Japanese, Caribbean, Chinese, and European Caucasian variation panels were purchased from the Coriell Institute for Medical Research. The genomic region containing each polymorphic site was amplified by PCR with specific primers. After that, the N15⌬ deletion was detected by size difference on a 15% polyacrylamide gel, and the alleles for R18L, G105R, K121D, E178D, and E140K were detected by restriction enzyme digestion with Fnu4HI, HhaI, XmnI, SmlI, and BstNI, respectively (see Fig.  S1 to S6 in the supplemental material). The frequencies of the genotypes for all SNPs studied were not significantly different from Hardy-Weinberg equilibrium expectations within and across all populations.
The assay for K121D(E) is ambiguous with the respect to the possible D or E alleles. However, a search of GenBank for independently derived and naturally occurring genomic and cDNA alleles suggests that the D allele is much more common than the potential E allele, if it exists at all (data not shown), which is in agreement with the results of a previous study (27). Genotypes for all six sites were called for 57 of the 58 individuals studied, but for one individual (Table 1; GM06990) the K121D(E) assay did not produce an interpretable pattern, and so the SNP for this individual was not included in the allele frequency calculation for this SNP, and the individual was not used in the determination of haplotype frequencies reported in the following section. Individual genotypes and inferred haplotypes are shown in Table 1, and allele frequencies for each SNP are shown in Table 2. Although the potential E140K variable site is reported in the dbSNP to have a K allele frequency of 0.92, we were surprised to find none of the 50 samples from five populations (Northern European, African-American, Japanese, Caribbean, and Chinese) that we genotyped had this allele. Because the K140 allele in dbSNP was reportedly found in the European Caucasian samples from the Coriell Institute, we tested 8 samples from this population but we still did not find a K140 allele.
Of the 32 potential haplotypes (2 5 ) for five biallelic SNPs, we identified seven haplotypes among the 57 individuals from six populations included in this part of the study ( Table 3). The first four haplotypes were previously described in detail in other studies (27), and the last three are new ones studied here ( Table 3). The new haplotypes hap VI and hap VII appear to be rarer than the previously reported four haplotypes, whereas the new haplotype hap V is widely present among the 57 samples. Like hap II, both hap V and VII were stably expressed (see below), but hap V was found in all six population samples at an overall frequency of 0.202, which is greater than those of hap II and VII, which had frequencies of only 0.061 and 0.009, respectively (Table 3). We detected hap II mainly in the African-American population (Table 3), which has previously been shown to retain significantly stable expression and anti-HIV-1 activity (27). Interestingly, hap V is more frequently detected in not only the African-American population, but also the Caribbean and Chinese populations. In contrast, hap VII was found only in the European Caucasian sample. Based upon the stability data given in the following sections, among the African-derived populations, African-American and Caribbean, 9 of 10 and 6 of 10 individuals, respectively, had at least one To understand how these polymorphisms affect A3H protein expression, we first introduced five mutations, including N15⌬, R18L, R105G, D121K, and D178E, into the stably expressed hap II gene and determined how its expression was altered. It was found that the wild-type hap II as well as its R18L, D121K, and D178E mutants were all stably expressed, but the expression of both N15⌬ and R105G mutants was undetectable (Fig. 1A).
This result confirms that either N15⌬ or R105G mutation can completely disrupt A3H expression (12,27,46). We next compared levels of A3H protein expression among different haplotypes. Each of the seven different haplotype expression vectors was transfected with a GFP expression vector into 293T cells, and A3H and GFP expression was detected by Western blotting. GFP expression was detected from all transfected cells at the same levels, indicating that cells were transfected at a similar efficiency (Fig. 1B). However, A3H expression was only detected from cells transfected with hap II, V, and VII expression vectors, but not the others. Thus, only hap II, V,  1. Expression of human A3H genes in 293T cells. A3H genes containing various mutations at the six polymorphic sites N15⌬, R18L, G105R, K121D, E178D, and E140K were inserted into the pcDNA3.1 vector with a V5 tag and were transiently expressed in 293T cells. Protein expression was detected by Western blotting with the indicated antibodies. Expression of A3H hap II and its mutants is shown in panel A, expression of the seven A3H haplotypes is shown in panel B, expression of A3H hap VII and AGM A3H bearing an E140K mutation is shown in panel C, and expression of A3H-L and its mutants is shown in panel D. In addition, the insert of the seven A3H gene fragments was sliced out from the pcDNA3.1 vector by HindIII/XhoI digestion and analyzed by agrose gel electrophoresis (B).

3146
WANG ET AL. J. VIROL. and VII could be stably expressed. This result is not surprising, because all of these stably expressed haplotypes do not have either of the N15⌬ and R105G mutations (Table 3). We also determined whether the E140K mutation affects A3H expression. Although our genotyping data indicate this residue is not polymorphic despite the polymorphic frequency of the K allele being reported in dbSNP, it is possible that the K allele exists as a rare variant. When this mutation was introduced into the hap VII or AGM A3H gene, the expression of both proteins became undetectable by Western blotting (Fig. 1C). Thus, we found that like N15⌬ and R105G mutations, the E140K mutation also disrupts A3H protein expression.
To understand whether these three mutations have different levels of activity in disruption of A3H expression, we introduced an N15⌬, R105G, or E140K mutation into a long version of the human A3H gene. Although simian A3H genes encode 210 amino acids, the human A3H gene encodes only 183 amino acids due to the presence of a PTC on the fifth exon. Previously, we repaired this PTC in the hap I gene to produce a full-length 210-amino-acid protein, A3H-L, and found that the hap I gene was then stably expressed (6). We confirmed that the A3H-L protein was expressed at similar levels to the A3H hap II protein (Fig. 1D, lanes 1 and 2). Since A3H-L already contains an R105G mutation, this result indicates that this mutation does not affect A3H-L protein expression. We further introduced N15⌬ and E140K mutations and found that a single N15⌬ or E140K mutation could completely disrupt A3H-L expression (Fig. 1D, lanes 3 to 6). Taken together, we found that among the three new haplotypes, two of these, hap V and hap VII, are stably expressed. In addition, although N15⌬, R105G, and E140K mutations all disrupt A3H expression, the N15⌬ and E140K mutations may have a more dramatic effect than the R105G mutation to disrupt A3H expression.
Anti-HIV-1 activity of A3H haplotypes. Having compared levels of protein expression among seven A3H haplotypes, we determined their anti-HIV-1 activities. Vif-defective (⌬Vif) HIV-1 virions were produced from 293T cells in the presence of different A3H haplotypes or A3G expression and collected for protein and infectivity analysis. Initially, we determined levels of A3H virion packaging, since virion incorporation is an essential step for viral inhibition. It was found that the stably expressed A3H hap II, V, and VII were packaged into virions as efficiently as the A3G protein, whereas the levels of poorly expressed A3H hap I, III, IV, and VI were barely detectable in virions (Fig. 2). Next, we measured the infectivity of virions containing these different A3 proteins in GHOST cells. It was found that compared to the control, A3G reduced ⌬Vif HIV-1 infectivity approximately 30-fold, and A3H hap II, V, and VII reduced viral infectivity approximately 10-fold (Fig. 2, top  panel). Taken together, we conclude that levels of A3H virion packaging are dependent on its levels of expression in viral producer cells, and all stably expressed A3H haplotypes can be efficiently packaged into virions and inhibit HIV-1 replication.
Contribution of A3H CDD to its anti-HIV-1 activity. A3G has two CDDs with separate activities: CDD1 mediates A3G virion packaging, and CDD2 mediates cytidine deamination. We wanted to know whether the single CDD in A3H has both activities. Accordingly, we mutated the 54 HXEX 27 PCXXC 88 motif in the A3H hap VII gene, creating four single-point mutants: H54R, E56Q, C85S, and C88S. When these mutants were expressed from pcDNA3.1 mammalian expression vector, we found that all of these mutations impaired A3H protein expression (Fig. 3A), which made it impossible for further study. We then introduced these mutations into the VR-A3H-L expression vector and found that all of these mutants were stably expressed (Fig. 3B, upper panels). After that, we determined whether these mutations could affect A3H virion packaging. It was found that all these A3H mutants could be packaged into HIV-1 virions as efficiently as the wild-type protein (Fig. 3B, lower panels). However, we found that these mutants showed a reduced anti-HIV-1 activity compared to the wild-type protein (Fig. 3C). Thus, we conclude that A3H CDD does not determine A3H virion packaging, but it is required for A3H antiviral activity. These observations agree with a previous study (12). A3H incorporation into virions. As mentioned in the introduction, A3G has two motifs that determine its virion packaging: CDD1 and 124 YYFW 127 . In the 124 YYFW 127 motif, both Y124 and W127 residues were reported to be critical for A3G virion packaging (15), and they also mediate A3G oligomerization (14). In addition, like the A3G protein, A3B and A3F also interact with HIV-1 NC for virion incorporation (8,40). To understand how A3H is packaged into virions, we first determined whether A3H interacts with HIV-1 Gag and where it binds. Previously, we created a panel of HIV-1 Gag-GST fusion expression vectors containing a C-terminal V5 tag, including Gag-GST, MA-CA-GST, CA-GST, NCp6-GST, and NC-GST (41). These Gag-GST fusion proteins were expressed with A3H hap VII protein in 293T cells, and their interactions were determined by GST pulldown. It was found that Gag-GST, NCp6-GST, and NC-GST could pull A3H down, whereas the others could not (Fig. 4A). These interactions required RNAs since the binding was lost in the presence of RNase A treatment (Fig. 4B). Thus, we conclude that A3H binds HIV-1 NC in an RNA-dependent manner, which is consistent with a recent study (29). Next, we determined whether A3H contains a similar YYFW motif that serves as the packaging signal. Indeed, a quite similar 112 YYHW 115 motif is present in A3H (Fig. 5A). To understand its function, we mutated residues Y112, Y113, and W115 in the A3H hap VII gene, creating four singleresidue mutations: Y112A, Y113A, W115A, and W115L. In addition, we created another double-residue mutation, YY/ AA, with both Y112 and Y113 changed to alanines. Among these, the Y112A, Y113A, and YY/AA mutants were well expressed, but both W115A and W115L mutants were not expressed (Fig. 4C, upper panels) (data not shown). We then compared the levels of Y112A, Y113A, and YY/AA virion incorporation with that of the wild-type A3H protein. It was found that all of these mutants showed much reduced levels of virion packaging, and this reduction was particularly apparent for the Y112A and YY/AA mutants (Fig. 4C, lower panel). Moreover, we compared their interactions with HIV-1 NC by GST pulldown. It was found that the Y112A and YY/AA mutants bound NC poorly, and levels of Y113A mutant binding to NC were also reduced (Fig. 4D). Furthermore, we compared the anti-HIV-1 activities of these mutants. It was found that the Y112A and YY/AA mutants almost completely lost anti-HIV-1 activity, and the Y113A mutant partially lost this activity (Fig. 4E). Taken together, these results indicate that the 112 YYHW 115 motif determines A3H virion packaging via an interaction with HIV-1 NC, and among these residues, Y112 is particularly critical for NC interaction and virion packaging.
The sensitivity of human A3H to HIV-1 Vif. We reported that the A3H antiviral activity is resistant to HIV-1 Vif neutralization, which was confirmed by Harari et al. (12). When we compared A3G and A3H amino acid sequences, we found that the previously identified A3G Vif-responsive motif 128 DPDY 131 is not conserved in A3H (15). We then created an A3H hap VII mutant that contains this motif, which was called Mut I (Fig. 5A). Recently, two groups reported that A3H could become partially sensitive to HIV-1 Vif if the K121 residue was changed to an aspartate (K121D) (20,46). We therefore introduced a K121D or K121E mutation into Mut I, creating another two mutants, Mut II and III. Since A3H hap II and VII have a respective D121 or K121 residue, we also introduced a D121K mutation into hap II and a K121E or K121D mutation into hap VII, creating the Hap II D121K, Hap VII K121E, and Hap VII K121D mutants. We then determined the Vif sensitivity of these A3H mutants by a single-round viral replication assay. It was found that both HIV-1 and SIVmac Vif proteins could completely neutralize the activity of A3G (Fig. 5B), confirming that A3G is very sensitive to both Vif proteins. The SIVmac Vif could also completely neutralize the activity of all of these A3H proteins (Fig. 5B). However, the HIV-1 Vif showed barely detectable activity to neutralize any of these A3H proteins (Fig. 5B). We further verified these results by determining A3H protein expressions in viral producer cells by Western blotting. It was found that unlike A3G, whose expression was decreased by both HIV-1 Vif and SIVmac Vif, the expression of all of these A3H proteins was selectively decreased by SIVmac Vif, but not HIV-1 Vif (Fig. 5C). This result is consistent with that from the viral infectivity assay. Since all of these different A3H mutants contained a V5 epitope tag at the C terminus, we wondered whether the V5 tag could contribute to A3H resistance to HIV-1 Vif protein.
We then created expression vectors to express untagged A3H hap II [Hap II (⌬V5)] and hap VII [Hap VII (⌬V5)] proteins by site-directed mutagenesis. When these proteins were expressed in HIV-1 producer cells (Fig. 6A), they were still resistant to HIV-1 Vif (Fig. 6B). Thus, the V5 tag does not contribute to A3H Vif resistance. Taken together, we were unable to make human A3H become sensitive to HIV-1 Vif by introducing a K121D mutation and/or implanting an A3G Vifresponsive motif into A3H protein.

DISCUSSION
We have examined human A3H genotypes and haplotypes among six major human populations. Our estimated allele frequencies for these six nonsynonymous SNPs are similar to those reported by dbSNP, with two notable exceptions. First, our frequency for the L18 allele is considerably greater than that contained in dbSNP (allele frequency of 0.13 versus 0.01), but is less than that reported by OhAinle et al., which was 0.20 (27). Second, we found no K140 allele among the 58 individuals we genotyped, although dbSNP reports this allele to be present at a frequency of 0.92 (Table 2). One possible explanation for the first difference is that the N15⌬ allele is very close to the L18 allele, and it is possible that this deletion interfered with the correct genotyping calls for L18 used for the high-throughput method in dbSNP, as opposed to the sequencing method of OhAinle et al. (27) and the restriction enzyme method used here. The second difference may be due to the pooled genotyping method used to detect the K140 allele that may have overestimated the frequency of this allele in the European Caucasian population that was examined (1). It seems unlikely that the restriction enzyme-based assays that we used here could result in our missing the K140 allele, because this allele is not digested by the restriction enzyme used and all samples in our study were completely digested (i.e., were all homozygous for the E140 allele). Furthermore, we examined the SNP position among 13 informative expressed sequence tags contained in GenBank, as well as the six currently available genomes contained in the 1000 Genome Project, in addition to the Watson and Venter genomes. All showed only the E140 allele. The only other sequence that shows the presence of the K140 allele, besides the data in dbSNP, is the single reference genome itself. Because quality scores are not easily obtainable for this sequence, the reference sequence allele could either be a sequencing error or might indicate that this allele does exist, but at an apparently low frequency. As shown in the results and discussed later, the K140 allele is damaging to the protein.
It is also interesting to note that at amino acid position 121, there appear to be two alleles (K121 and D121), but each of these was independently derived from the ancestral E121 allele. The E121 allele was not found in the available GenBank sequences, and it appears that no other group has detected this allele among the samples they have studied (12,27). This allele is present in the single Neanderthal sequence that has been studied, which is observable in the UCSC Genome Browser. Neanderthals are estimated to have diverged from humans about 500,000 years ago and became extinct about 30,000 years ago (9,10). Thus, it appears that the E121 allele was replaced by K121 and D121 in the evolutionarily short period of time since the divergence of our species from the Neanderthals. Out of 32 theoretically possible haplotypes for the 5 biallelic nonsynonymous sites studied here, we found seven haplotypes, including the previously reported four haplotypes (I to IV) that have been studied in some detail (27). Six of these haplotypes were observed in individuals that were either homozygous for all five SNPs or were heterozygous at only a single SNP, and this provides very high confidence for the existence of these haplotypes among humans ( Table 3). One of them, hap V (NRRDE), is relatively frequent, particularly in the Caribbean population, and this haplotype is stably expressed and exhibits anti-HIV activity. The two rarer haplotypes, hap VI and hap VII, were found only within one or two of the populations examined ( Table 3). The relatively high number of haplotypes within the short chromosomal span between amino acids 15 and 178 (1,714 bp) compared to the more commonly observed 2 to 4 haplotypes observed among humans at other sites in the genome is in keeping with the active evolutionary dynamics of this region thought to be due to strong host-pathogen interactions (5,27,31). This relatively large number of haplotypes also indicates that it will be necessary to examine the activity of the haplotypes found in different populations, as some of these haplotypes provide good A3H activity, whereas others do not. Additional unnumbered haplotypes have been inferred by using the PHASE 2.1.1 computer program and reported in the supplemental materials of the study carried out by OhAinle et al. (27). However, at least some of these haplotypes seem unlikely to exist. For example, haplotypes with N15-R18, ⌬15-R18, N15-L18, and ⌬15-L18 are all present among the unnumbered haplotypes. However, based upon the suggested phylogeny of OhAinle et al. (27), the presence of the ⌬15-R18 combination would require either a recombination or gene conversion event within the 9 bp separating the two polymorphic sites or a recurrent mutation event, all of which are very unlikely.
We have compared the expression and anti-HIV-1 activity levels of these seven A3H haplotypes. We confirmed that two N15⌬ and R105G polymorphisms seriously disrupt A3H expression (12,27). In addition, we found that an E140K mutation could also disrupt A3H expression. Among these three N15⌬, R105G, and E140K mutations, the destructive effect of the R105G mutation was suppressed in the context of the full-length 210-amino-acid protein. This result indicates that the N15⌬ and E140K mutations are more deleterious for A3H expression. Interestingly, we also observed that four CDD mutants H54R, E56Q, C85S, and C88S as well as the two W115A and W115L mutants were poorly expressed in cell culture, indicating that these mutations also disrupt A3H expression. It is still unclear why the human A3H gene is so easily damaged by point mutations. Previously, we found that although a PTC disrupts A3H expression, this disruption is not caused by protein degradation (6). When cells were treated with proteasomal and lysosomal inhibitors, the expression of A3H was increased only marginally. Others also reported a marginal increase of A3H expression by proteasomal inhibitors (27). Notably, when we measured A3H mRNA, we found that its levels were significantly decreased when a PTC was present (6). PTCs in the last exon of a gene generally do not cause nonsense-mediated mRNA decay, and so we speculated that human A3H mRNA could have a very unique secondary structure, which can be easily disrupted by mutations, resulting in decrease in A3H mRNA stability and loss of protein expression.
Unlike the A3B, A3DE, A3F, and A3G proteins, A3H has only one CDD, but it still inhibits HIV-1 replication. We have studied the function of A3H CDD and found that it is required for A3H anti-HIV-1 activity, but it is not required for virion packaging. Since the deaminase activity of A3H has been demonstrated in Escherichia coli (28,29), we believe it is most likely that the deaminase activity is required for A3H anti-HIV-1 activity. Although its virion packaging is independent of its CDD, it is dependent on an YYXW motif, which was identified as a packaging signal for A3G. Indeed, this motif is conserved in all anti-HIV-1 A3 proteins, indicating that these anti-HIV A3 proteins may use a universal mechanism for virion packaging. Our results showed that this motif specifically interacts with HIV-1 NC, which further supports its important role in virion incorporation. Since the YYXW motif is enough for A3H virion packaging, it is not clear why A3G virion packaging requires its CDD1. Recently, Ooms et al. reported that although A3H hap I protein was poorly expressed in cells, it could be still packaged efficiently into HIV-1 virions. They also presented evidence that A3H hap I specifically interacted with MA-CA other than NC for virion packaging (29). We found that A3H hap I was poorly packaged into virions because it was not expressed in the cell, and, therefore, we cannot confirm this observation.
We have studied the mechanism of A3H resistance to Vif. Initially, we thought that the resistance of human A3H to HIV-1 Vif could be due to the lack of a similar A3G Vifresponsive domain, DPDY. However, even when this domain was placed in the corresponding region, A3H was still completely resistant to HIV-1 Vif (Fig. 5). Next, we tried to replicate work showing the K121D mutation could reduce A3H resistance to HIV-1 Vif, as reported by others (20,46). However, we found that both hap II and VII, which have a D121 and K121 residue, respectively, were equally resistant to HIV-1 Vif (Fig. 5). Even when these two residues were exchanged between hap II and VII and when a K121D or K121E mutation was introduced into Mut I, we still could not change the resistance of human A3H to HIV-1 Vif (Fig. 5). From these results, we conclude that human A3H is completely resistant to HIV-1 Vif. Our results further highlight the potential role of human A3H in inhibition of HIV-1 infection.