Previous Article | Next Article ![]()
Journal of Virology, September 2008, p. 8762-8770, Vol. 82, No. 17
0022-538X/08/$08.00+0 doi:10.1128/JVI.00751-08
Copyright © 2008, American Society for Microbiology. All Rights Reserved.

Aaron Diamond AIDS Research Center and Laboratory of Retrovirology, The Rockefeller University, New York, New York,1 Department of Infectious Diseases, King's College London School of Medicine, London, United Kingdom2
Received 4 April 2008/ Accepted 11 June 2008
|
|
|---|
|
|
|---|
Human endogenous retroviruses (HERVs) comprise approximately 8% of the human genome, but all are remnants of ancient retroviral infections and harbor inactivating mutations that render them replication defective (reviewed in reference 2). Nevertheless, as viral "fossils," HERVs may provide insights into the ancient retroviral-host interactions and their evolution. For example, several antiretroviral genes have been identified in recent years (24, 30, 34), including the APOBEC3 family of cytidine deaminase-encoding genes, which have proliferated during mammalian speciation and many members of which exhibit signs of positive (diversifying) selection in the primate lineage (28). This evolutionary legacy likely affects the course of viral infections in modern organisms, including humans. Indeed, human immunodeficiency virus type 1 (HIV-1) and hepatitis B virus (HBV) sequences in infected humans exhibit patterns of G-to-A hypermutations which suggest that APOBEC proteins have a current physiological role as an antiviral defense (6, 8, 22, 31, 35, 38). The source of the ancient evolutionary pressure on APOBEC proteins is unknown, but the ERVs that are fossilized in the genomes of modern humans and other primates are reasonable candidates, particularly given the recent observation that endogenous murine leukemia viruses bear signatures of mutation by the single murine APOBEC3 protein (18).
Among HERVs, the HERV-K(HML-2) family is thought to be the youngest, with multiple human-specific and polymorphic insertions that indicate replication in human ancestors within the past few million years (3, 4, 15, 37). None of the known proviruses in modern humans are capable of replication and, therefore, many aspects of HERV-K biology could not previously be studied. Recently, however, we and others have derived a consensus HERV-K(HML-2) sequence by aligning human specific proviruses and synthesized a pseudo-ancestral proviral construct (11, 21). We have shown that this virus, named HERV-KCON, encodes and expresses all viral proteins necessary for infection, and we have derived an experimental system in which HERV-KCON can be shown to infect numerous cell lines in a single-cycle assay. Using this approach, we also have shown that human APOBEC3F (hA3F) protein can inhibit HERV-KCON infection in single-cycle, transient-overexpression assays (21).
In this study, we expand our study of ancient retrovirus-host interactions to other APOBEC3 family members and find that several APOBEC3 proteins are intrinsically capable of mutating and inhibiting infection by HERV-K. We also present striking evidence that two HERV-K(HML-2) proviruses that are present in the human genome have been subjected to hypermutation by APOBEC3G (hA3G), in a manner that can be accurately recapitulated during HERV-K replication in vitro. Along with evidence of hA3G hypermutation in HIV-1- and HBV-infected patient samples (17, 35, 38), this is a rare example of the antiretroviral effects of hA3G in a natural infection setting and the first example where inactivation of ancient retroviruses in humans by hA3G can be demonstrated.
|
|
|---|
Expression plasmids. The HERV-KCON vector genome plasmid CCGBX is a derivative of the previously described CHKCG (21) and was constructed by inserting a CMVP-EGFP cassette into XbaI and SwaI sites of CHKCG, resulting in a vector genome that is slightly smaller than CHKCG and gives higher infectious titers. CCGBX-P was derived from CCGBX by inserting a 53-bp HIV-1-derived sequence (GATCTGAGCCTGGGAGCTCTCTGGCTTGTGACTCTGGTAACTAGAGATCCCTC) into the 5' end of the 3' long terminal repeat (LTR) to allow specific amplification of newly introduced HERV-KCON sequences upon infection of human cells. The insertion was made using overlap extension PCR and SwaI and NheI restriction sites in the HERV-KCON vector sequence. HERV-K protein expression vectors (pCRVI/Con Gag-PR-Pol and pCR3.1/K-Rev) have been previously described (21). CSGW, a packageable HIV-1 vector plasmid expressing enhanced green fluorescent protein (EGFP), and HIV-1NL4-3 GagPol expression plasmids have been described elsewhere (1, 13). Plasmids expressing various human APOBEC3 proteins, namely hA3A, hA3B, hA3C, hA3F, and hA3G, have been described previously (6). Plasmids expressing additional human APOBEC3 proteins (hA3DE and hA3H) were cloned using the same pCMV4-HA vector and HindIII and XbaI restriction sites.
Infection assays. To generate infectious HERV-KCON-derived virions, 293T cells in six-well plates were transfected with 1.8 µg CCGBX, 0.5 µg pCRVI/Con Gag-PR-Pol, 0.5 µg pCR3.1/K-Rev, 0.2 µg G protein of vesicular stomatitis virus (VSV-G), and an APOBEC3 expression plasmid or empty vector control. To generate infectious HIV-1 virions, 293T cells were transfected in six-well plates with 0.75 µg of CSGW, 0.75 µg HIV-1 Gag-Pol, 0.2 µg VSV-G, and an APOBEC3 expression plasmid or empty vector control. No Vif protein was expressed during the generation of virions. Media were changed 5 h after transfection with fresh media containing 5 µM sodium butyrate (sodium butyrate for HERV-K only), and virus-containing supernatants were collected after an additional 40 h. For HERV-K infection, filtered (0.2-µm pore size) supernatant was layered onto 5 x 104 293T cells in 24-well plates with fresh medium supplemented with 5 µg of polybrene/ml and spinoculated at 2,000 rpm for 2 hours at room temperature. For HIV-1 infection, filtered supernatant was layered onto 2.5 x 104 CEM cells in 96-well plates with fresh medium and 5 µg of polybrene/ml. Two days after infection, GFP+ cells were quantified by fluorescence-activated cell sorter analysis.
HERV-K sequence analyses. Full-length HERV-K(HML-2) sequences in the human genome were identified using a TBLASTX search (www.ensembl.org/Multi/blastview) of the human genome with the HERV-KCON Gag sequence as the query sequence. Sixteen unique (by chromosomal location) human-specific full-length HERV-K(HML-2) proviruses were identified by cross-referencing with insertions identified by Belshaw et al. and Romano et al. (4, 27). All identified insertions were included in subgroup N, as defined by Romano et al. (27). Specifically, the proviruses included K101, K102, K104, K106, K107, K108, K109, K113, K115, 11q22, 12q14, 19q12, 1p31 (K4), 3q27 (K50b), 3q21 (KI), and 21q21 (K60). All GenBank accession numbers for these sequences are found in the reports by Barbulescu et al. and Romano et al. In addition, the K60 sequence was deduced from GenBank entry AL109763 (3, 27). The proviruses were aligned to HERV-KCON using AlignX (Invitrogen) for comparison. Figures 2B and 4A, below, depict G-to-A differences between the provirus and HERV-KCON and were derived using the HYPERMUT program (http://www.hiv.lanl.gov/content/sequence/HYPERMUT/hypermut.html).
![]() View larger version (38K): [in a new window] |
FIG. 2. Nucleotide changes in human-specific HERV-K proviruses relative to HERV-KCON. (A) The numbers of changes of each type from the HERV-KCON to sequences in endogenous proviruses (listed in Materials and Methods) are plotted. For comparison of sequences flanking HERV-K60 and HERV-KI, 4,000 nucleotides immediately proximal to the 5' and 3' ends of the proviruses were compared for changes from chimpanzee to the orthologous human sequence. For each sequence comparison, the numbers of changes were normalized to enable direct comparison with the numbers of changes in the individual HERV-K60 and HERV-KI proviruses. (B) Graphical representation of nucleotide changes relative to HERV-KCON in the 16 human specific proviruses. Red, GG to AG; cyan, GA to AA; green, GC to AC; magenta, GT to AT; black, non-G-to-A transitions; yellow, gaps in the sequence.
|
![]() View larger version (38K): [in a new window] |
FIG. 4. Hypermutation of HERV-K during in vitro infection. All changes in HERV-KCON reverse transcripts relative to preinfection sequence were analyzed for +1 site preference using HYPERMUT, and the sequences of 12 HERV-K clones generated during infection in the presence of each APOBEC protein are represented as horizontal lines. Mutations are indicated as vertical lines. Red, GG to AG; cyan, GA to AA; green, GC to AC; magenta, GT to AT; black, non-G-to-A transitions. (B) Flanking nucleotide sequence context surrounding mutations generated by APOBEC3 proteins during HERV-K infection in the presence of hA3A, hA3B, hA3F, and hA3G. The numbers of times that each nucleotide occurred at five positions 5' and 3' to each G-to-A change were plotted.
|
HERV-K hypermutation assay. Infectious HERV-KCON particles were generated as described above using CCGBX-P in place of CCGBX. Prior to infection, supernatant was supplemented with 10 mM MgCl2 and treated with DNase I (0.1 U per µl; Roche) for 1 h at 37°C to eliminate residual transfected DNA. Fresh 293T cells were infected as described above. Ten hours postinfection, total DNA was collected using the DNeasy blood and tissue kit (Qiagen). Partial EGFP (Clontech) and HERV-K sequence of 762 nucleotides were amplified using oligos designed to target EGFP (CGC ACC ATC TTC TTC AAG GAC GAC G) and the inserted HIV-1 sequence (GAG GGA TCT CTA GTT ACC AGA GTC ACA AGC C) using Phusion polymerase (Invitrogen) (98°C for 10 s, 55°C for 10 s, and 72°C for 15 s; 30 cycles). Amplified DNA was purified using a gel extraction kit (Qiagen) and cloned into pCR-Blunt II-TOPO according to the manufacturer's instructions (Invitrogen). To confirm the complete elimination of transfected DNA, we attempted to amplify plasmid sequence using primers targeting the plasmid backbone sequence and the untranslated region of HERV-K using similar PCR conditions. Also, to confirm that the amplified sequences were derived from de novo reverse-transcribed DNA, HERV-K particles containing a mutationally inactivated reverse transcriptase were subjected to the same infection procedure and PCR analysis. Twelve clones were sequenced for each APOBEC3 protein studied, as well as the empty vector control, and compared to the original CCGBX-P sequence for evidence of hypermutation.
Western blot analysis. To generate an anti-CA polyclonal antiserum, the N-terminal cleavage site of CA was first determined by Edman sequencing of the putative CA protein isolated from purified HERV-KCON virion particles. As CA is known to be around 30 kDa, the position of the C-terminal CA cleavage site was estimated using this molecular mass and the determined position of the CA N terminus. The deduced CA-encoding sequence was cloned into pGEX-6P-1 (GE Healthcare Life Sciences) to express a glutathione S-transferase (GST)-tagged CA protein that was purified using glutathione-agarose beads. The GST tag was eliminated by PreScission protease cleavage as per the manufacturer's instructions (GE Healthcare Life Sciences). The purified recombinant CA protein was used to generate the antiserum (Covance).
Transfected 293T cells were lysed in sodium dodecyl sulfate-polyacrylamide gel electrophoresis loading buffer and separated on 4 to 20% sodium dodecyl sulfate-polyacrylamide gel electrophoresis gradient gels. Proteins were transferred onto nitrocellulose membranes and probed with anti-HERV-K Gag, anti-HIV-1 CA, or anti-hemagglutinin (anti-HA; Covance) primary antibody followed by anti-rabbit or anti-mouse antibody-peroxidase conjugate.
|
|
|---|
![]() View larger version (46K): [in a new window] |
FIG. 1. Effects of APOBEC3 proteins on HERV-KCON infection. (A) Anti-HA (top panel) and anti-HERV-K CA (bottom panel) Western blot of 293T cell lysates transfected with HERV-KCON, VSV-G, and APOBEC-HA plasmids. (B) Infection of CEM cells with HIV-1 (left panel) or 293T cells with HERV-KCON virions, generated in the presence of the indicated APOBEC3-HA proteins. Infectious units were quantified as GFP+ cells using fluorescence-activated cell sorter analysis 2 days postinfection, and are expressed as a percentage of the number of infected cells (typically 15 to 30%) that were obtained in the absence of an APOBEC3 protein.
|
Fresh 293T cells were infected with HERV-K virions generated in the presence of each of the APOBEC3 proteins. As can be seen in Fig. 1B, hA3A, hA3B, and hA3F inhibited HERV-K-CON infection by approximately fivefold at the highest concentration tested. Only marginal inhibition of infection restriction was seen with hA3G, while hA3C, hA3DE, and hA3H did not inhibit infection. Clearly, the relative sensitivity of HERV-K to the various APOBEC3 proteins differed from that of HIV-1 (Fig. 1B).
Hypermutated HERV-K proviruses in modern human DNA. While several APOBEC3 proteins appeared capable of inhibiting HERV-K infection in vitro, to determine whether restriction of HERV-K infection might have occurred in vivo, we investigated whether evidence of APOBEC3-induced mutation could be found in HERV-K proviruses that are present in modern human DNA. Specifically, we inspected 16 human-specific full-length HERV-K(HML-2) proviruses and looked for biases in the patterns of mutation therein, relative to the pseudo-ancestral HERV-KCON sequence. Overall, we found that G-to-A and C-to-T substitutions were the most abundant changes in the proviruses as a whole (Fig. 2A), as is typical of the general pattern of nucleotide substitutions that are found in the human genome.
Fourteen HERV-K proviruses showed a comparatively minor increase in the frequency of G-to-A changes and C-to-T changes relative to other changes (Fig. 2A). However, two proviruses, HERV-K60 and HERV-KI, were exceptional in the total quantity and type of changes. Overall, they exhibited similar frequencies of C-to-T mutation as did the other 14 proviruses, but both HERV-K60 and HERV-KI exhibited a very high frequency of G-to-A changes relative to the HERV-KCON (Fig. 2A and B). Indeed, each of these individual proviruses had more G-to-A mutations than the other 14 proviruses combined; two-thirds and one-half of all the changes in K60 and KI, respectively, were G-to-A mutations (Fig. 2A).
To ensure that the exceptional properties of the two apparently hypermutated proviruses were not due to their insertion into an unusually hypermutated region of the human genome, we examined 2 kb of flanking genomic sequence at each end of the two proviruses for evidence of hypermutation. Specifically, we examined changes between the equivalent loci in humans compared to chimpanzees. This is a conservative approach, since the HERV-K insertions are absent in chimpanzees and have, therefore, been resident in the human genome for less time than the flanking sequences have been diverging. We found, as expected, that G-to-A, C-to-T, and reciprocal A-to-G and T-to-C changes were the most abundant in comparisons of the human and chimpanzee flanking sequences (Fig. 2A). However, G-to-A changes did not greatly outnumber other changes, suggesting that that the apparent hypermutation in the inserted proviruses occurred independently of the genomic context sequence, likely prior to their integration.
In addition to the 16 aforementioned proviruses, additional human-specific partial HERV-K sequences that lacked LTRs, as well as HERV-K proviruses (defined as group N by Romano et al.) that were not human specific, and group N CERV-K proviruses (chimpanzee counterparts of HERV-K), including four chimpanzee-specific insertions, were also examined, as were related group O proviruses. No evidence of hypermutation was evident in these sequences (data not shown).
Flanking nucleotide characteristics of G-to-A changes in hypermutated HERV-K proviruses. A common cause of G-to-A and C-to-T substitutions in genomic DNA is spontaneous cytosine deamination. This occurs most often after methylation of cytosines in CG dinucleotides, followed by spontaneous deamination of 5-methylcytosine to a thymine (CG to mCG to TG). This series of events would lead to an overabundance of plus-strand C-to-T changes with G in the +1 position relative to the C-to-T mutation (CG to TG). Conversely, the same deamination event on the minus strand would lead to plus-strand G-to-A changes with an overabundance of C at the –1 position relative to the mutated nucleotide (CG to CA). Indeed, we found that in most of the HERV-K proviruses examined (the 14 nonhypermutated proviruses), G-to-A changes were significantly enriched for C in the –1 position (Fig. 3A), suggesting most G-to-A changes were a result of spontaneous cytosine deamination, as would be expected of DNA elements that are long-term residents of the human genome.
![]() View larger version (28K): [in a new window] |
FIG. 3. Nucleotide sequence context bias accompanying G-to-A mutations in endogenous HERV-K proviruses. (A and B) Nucleotide occurrence at five positions 5' and 3' to G-to-A changes was catalogued, using the HERV-KCON sequence as a reference. The absolute number of times that each nucleotide occurred at each position relative to each G-to-A mutation in the nonhypermutated proviruses (A) as well as HERV-K60 and HERV-KI (B) is plotted. The P value in panel B for deviation from random nucleotides at +1 and +2 positions was <0.0001, calculated using a chi-square test of independence. (C and D) Frequencies of each di- and trinucleotide for all G-to-A and GG-to-AG changes, respectively, in HERV-K60 (C) and HERV-KI (D) are plotted as black bars. The expected numbers of G-to-A mutations in each sequence context, based on di- and trinucleotide composition, of HERV-KCON is represented as a horizontal gray line. The P value, for deviation from random di- and trinucleotide preference, was <0.0001, calculated by a chi-square test of independence.
|
Hypermutation of HERV-K by APOBEC3 proteins during in vitro replication. To test whether the in vivo hypermutation changes could be recapitulated in vitro, we generated HERV-K virions in the presence of each human APOBEC3 protein using the HERV-K packaging construct CCGBX-P, which contains HIV-1 sequence in the 3' LTR to allow selective amplification of newly synthesized HERV-K in a background of existing HERV-K proviruses present in human cells. A 762-bp sequence of nascent HERV-K DNA was thus amplified using primers targeting the EGFP insert in the vector genome and the inserted HIV-1 sequence. Two controls were done to establish the success of this strategy and to show that DNase treatment reduced contaminating plasmid DNA in the virion preparations to subdetectable levels. First, PCR amplifications were done using primers targeting the plasmid backbone. Second, infections were done using virions harboring an inactivating point mutation in the HERV-K reverse transcriptase. In both cases, PCR products were not detected, indicating that the sequences generated following HERV-K infection genuinely represented infection-dependent, de novo-synthesized HERV-K DNA.
We found that hA3A, hA3B, hA3F, and hA3G were capable of inducing hypermutation of HERV-K during in vitro infection. However the patterns and frequency of hypermutation were different (Fig. 4A). In particular, nearly all HERV-K clones generated in the presence of hA3G had G-to-A mutations, but each had a low to moderate number of changes (median of six G-to-A changes per clone). Conversely, hA3A, hA3B, and hA3F each induced hypermutation in a minority of clones, but hA3A and hA3B hypermutated clones had a very high burden of mutations (37 changes for the sole hA3A hypermutated clone, and a median of 43.5 G-to-A changes for hA3B hypermutated clones). Only a few changes were seen in HERV-K DNA generated in the presence of hA3F (median of four changes per clone).
Of the four human APOBEC3 proteins found to hypermutate HERV-K in vitro, only hA3G exhibited the GG dinucleotide and GGG trinucleotide bias for the generation of G-to-A mutations (Fig. 4A and B), as has previously been reported for hA3G mutation of HIV-1. Moreover, hA3F and hA3B exhibited the expected GA dinucleotide bias at positions where G-to-A mutations were generated (Fig. 4A and B). hA3A also showed the same GA bias, with the caveat that only a single HERV-KCON clone was found to be mutated by hA3A (Fig. 4A and B). Hence, among all seven of the human APOBEC3 proteins, four appear intrinsically capable of inducing hypermutation in HERV-K, but only hA3G was capable of hypermutating HERV-K during in vitro infection with a characteristic bias that very closely mimicked mutations found in the endogenous HERV-K60 and HERV-KI proviruses (compare Fig. 3B and 4B). Moreover, when we compared the 273 bp of HERVKCON sequence that was analyzed in the hA3G mutagenesis assay with the corresponding sequence in HERV-K60 and HERV-KI, 10 out of the possible 75 G-to-A mutations were found in either or both HERV-K60 and HERV-KI, while 15 out of the possible 75 G-to-A mutations were represented in corresponding sequences that were mutated by hA3G during in vitro infection (Fig. 5A and B). Notably, six of these G-to-A mutations were at identical positions, a highly significant correlation (P = 0.003, Fisher's exact test), lending further supporting to the notion that hA3G was responsible for hypermutation of HERV-K60 and HERV-KI.
![]() View larger version (18K): [in a new window] |
FIG. 5. Comparison of mutations relative to HERV-KCON that appear in naturally hypermutated proviruses (HERV-K60 and HERV-KI) and in the 273 nucleotides of mutated HERV-K sequence generated in vitro in the presence of hA3G. Changes relative to HERV-KCON are represented graphically on horizontal lines and are color coded according to the nucleotide appearing at the +1 site for HERV-K60 and HERV-KI and 12 HERV-K clones generated during infection in the presence of hA3G. Mutations are indicated as vertical lines. Red, GG to AG; cyan, GA to AA; green, GC to AC; magenta, GT to AT; black, non-G-to-A transitions; yellow, gaps in sequence. (B) The 75 G nucleotides in the HERV-K-derived portion of the in vitro-hypermutated sequence and the number of positions at which the G residue was universally unaltered or sometimes mutated to A in the naturally and experimentally hypermutated HERV-K sequences.
|
|
|
|---|
When comparing full-length human-specific HERV-K proviruses to HERV-KCON, which approximates an ancestral sequence for the most recent expansion of HERV-K proviruses in humans, we found an abundance of G-to-A and C-to-T substitutions. These substitutions, the most common change found in genomes, can occur when DNA methyltransferase methylates cytosines in CG dinucleotides to 5-methylcytosine, which spontaneously deaminates to thymine, resulting in a CG-to-TG change. These methylation events, important for development via genomic imprinting and X chromosome inactivation, can also silence retroelements. This effect has been demonstrated in mice, where knocking out DNA methyltransferase Dnmt1 or Dnmt3L leads to transcriptional activation of mouse retroelements intracisternal A particles and LINE-1 (9, 39). In addition, previous studies of a selection of HERV-K LTRs in the teratocarcinoma cell line Tera-1 showed that methylation and transcription are inversely related (20). Given these two facts, one would expect to find that HERV-K proviruses would be cytosine methylated by the host, and consequently G-to-A and C-to-T mutations might be abundant. This was indeed the case, and in 14 of 16 HERV-K proviruses examined, the CG dinucleotide methylation/spontaneous deamination pathway appeared to be the major source of G-to-A and C-to-T mutations.
Another common cause of G-to-A, and less frequently C-to-T (7), changes in viral DNA is APOBEC3-mediated cytidine deamination. Fortuitously, the two events are easily distinguished. DNA methyltransferases methylate cytosines in CG dinucleotides, while APOBEC proteins deaminate cytosines in XC dinucleotides, where X can differ depending on the APOBEC protein that is responsible for deamination. Hence, by examining the nucleotides immediately 5' and 3' to the mutated cytosine, we can largely assign G-to-A and C-to-T changes to either mechanism. One exception is when both DNA methyltransferase and APOBEC preferred nucleotides flank the altered cytosine, such as CCG trinucleotides, where CC represents the dinucleotide preference of hA3G and CG the preference of DNA methyltransferase. However, exclusion of these ambiguous samples in our analyses did not alter the conclusions.
The characteristics of HERV-K hypermutation found both in vivo and in vitro match several of the characteristics previously observed in the context of APOBEC-induced mutations in other retrovirus infections. First, G-to-A mutations constituted a large fraction of the total mutations in HERV-K60 and HERV-KI, as has previously been found for hypermutated viral sequences in HIV- and HBV-infected patients (17, 31, 36, 38). Furthermore, the GGG trinucleotide preference found during in vitro HERV-K infection in the presence of hA3G has been documented in HIV-1 infection assays by several groups (6, 41, 43). The combination of these two major characteristics found in HERV-K60 and HERV-KI, plus the failure of any other human APOBEC3 protein to induce a similar pattern of mutation during HERV-K replication in vitro, makes a strong argument for hA3G as the sole source of hypermutation in ancient HERV-K proviruses.
Another reported characteristic of viral hypermutation by hA3G is the gradient of changes along the viral genome. This characteristic is thought to derive from the position-dependent length of time that the nascent viral DNA is in the form of single-stranded DNA, the preferred nucleic substrate of hA3G. However, this was not observed in the HERV-K60 and HERV-KI sequences (data not shown). The reasons for this are unclear at present. Nonetheless, our ability to fairly precisely recreate the hypermutation patterns present in ancient proviruses specifically using hA3G during in vitro HERV-K replication assays suggests that the interaction between this protein and HERV-K occurred and was physiologically and evolutionarily relevant. HERV-K is therefore an eminently reasonable candidate for an infectious agent that has applied selective pressure on A3G during primate evolution. However, we do note that HERV-K is but one of a number of agents that could potentially have imposed selective pressure on antiretroviral defenses. Other abundant endogenous retroelements, such as Alu and LINE-1 elements in humans, have also been shown to be restricted by APOBEC3 proteins (10, 16, 25, 33), and these elements as well as other exogenous and endogenous retroviruses may also have contributed to the expansion and positive selection that is evident in APOBEC3 genes. Indeed, among the ancient retroviruses, only those that colonized the germ line are accessible to this type of analysis, and it is completely unknown what fraction of ancient retroviruses that replicated in ancestral primates are fossilized in modern DNA. Nonetheless, A3G has been under positive selection for many millions of years (28), and HERV-K could, potentially, have contributed to this pressure.
Given that HERV-K(HML-2) appears intrinsically mutable by hA3G and a hypermutated provirus is less likely to be harmful (and hence more likely to become fixed in a host genome) than an intact provirus, it is perhaps surprising that only 2 out of 16 HERV-K human-specific proviruses and 0 of 4 chimpanzee-specific proviruses were clearly hypermutated. Several factors may have contributed to this, and perhaps the most important influence would be viral tropism. The appearance of a hypermutated provirus in human DNA indicates that HERV-K likely replicated in an A3G-expressing maternal or paternal tissue immediately prior to deposition of the provirus into the germ line. Conversely, the deposition of a nonhypermutated virus suggests that the preceding generations involved replication in APOBEC3G-negative tissues. The simplest explanation for the appearance of hypermutated and nonhypermutated proviruses in the human genome is that HERV-K replicated in both A3G-expressing and nonexpressing somatic cells prior to germ line infection.
Moreover, the frequency of hypermutated proviruses in modern genomes may not reflect the frequency at which hypermutation occurred during ancient infections. Indeed, while hypermutation would generally inactivate a particular provirus, hypermutation itself is unlikely to be always necessary or sufficient to result in fixation of the element, where chance-influenced mechanisms, such as drift or bottlenecking, may play a dominant role in provirus fixation. Of note, older HERV-K sequences belonging to group O as defined by Romano et al. (27) did not exhibit signs of hypermutation compared to HERV-KCON. Moreover, a previous study of endogenous murine leukemia viruses also found that only a minority of proviruses were overtly hypermutated, perhaps for the same aforementioned reasons.
While there was a reasonable qualitative correlation between the appearance of APOBEC3-induced G-to-A mutations and infection inhibition during in vitro HERV-K replication, there was a notable lack of a quantitative correlation between the burden of mutations and the extent to which infection was inhibited. Specifically, hA3A, hA3B, and hA3F caused mutation in a minority of nascent HERV-K reverse transcripts yet inhibited infection to a greater degree than hA3G, which mutated the majority of nascent HERV-K DNA molecules. Since we found no evidence of hA3A, hA3B, or hA3F hypermutation in endogenous proviruses, inhibition of HERV-K infection by these cytidine deaminases may be physiologically irrelevant or might occur primarily via mechanisms (e.g., inhibition of DNA synthesis or integration) that would not leave remnants of the viral encounter with the APOBEC protein (5, 12, 14, 23, 42).
The lack of strong inhibition of HERV-K infection by hA3G in the single-cycle replication assay should not be overinterpreted as suggesting that hA3G lacks anti-HERV-K activity in vivo. As documented here and elsewhere, APOBEC3G appears to have evolved to target GG dinucleotides, especially GGG trinucleotides. This property makes it a particularly efficient mutator of tryptophan codons. G-to-A mutation of tryptophan codons invariably leads to the generation of new stop codons, which would almost always be lethal to a retrovirus, even if a provirus were successfully established with a relatively low burden of APOBEC3G-induced mutation. Importantly, the HERV-K infection assay requires a single cycle of infection by a reporter virus that encodes the commonly used EGFP as the reporter gene. EGFP contains only a single tryptophan, and thus a moderate level of hA3G-induced mutations might not score as strong inhibition during an in vitro single-cycle infection assay but would abolish further rounds of replication in an in vivo spreading infection. Indeed, HERV-K60 and HERV-KI represent clear examples of viral sequences that have been fossilized in the human genome in defective form as a consequence of hA3G-induced hypermutation.
This work was supported by NIH grant R01AI64003 (P.D.B.) and the UK Medical Research Council (M.H.M.). P.D.B. and M.H.M. are Elisabeth Glaser Scientists.
Published ahead of print on 18 June 2008. ![]()
|
|
|---|
A hypermutation during negative-strand viral DNA synthesis. Virology 208:601-609.[CrossRef][Medline]
identifies a critical species-specific retroviral restriction domain. Proc. Natl. Acad. Sci. USA 102:2832-2837.
exhibits lineage-specific length and sequence variation in primates. J. Virol. 79:6111-6121.
restricts HIV-1 infection in Old World monkeys. Nature 427:848-853.[CrossRef][Medline]
A hypermutation of human immunodeficiency virus type 1 genomes. J. Virol. 65:1779-1788.This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»