Previous Article | Next Article ![]()
Journal of Virology, October 2008, p. 9591-9599, Vol. 82, No. 19
0022-538X/08/$08.00+0 doi:10.1128/JVI.02471-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Department of Biochemistry, Molecular Biology, and Biophysics,1 Institute for Molecular Virology,2 Beckman Center for Genome Engineering, University of Minnesota, Minneapolis, Minnesota 554553
Received 16 November 2007/ Accepted 16 July 2008
|
|
|---|
|
|
|---|
Members of the A3 family possess DNA cytosine deaminase activity, but they can be differentiated on the basis of other activities (see, for example, references 5, 7, 11, 21, 31, 34, 39, 52, 56, and 61). For instance, some A3s such as A3F and A3G are potent inhibitors of HIV-1 replication, while others such as APOBEC3B (A3B) have modest anti-HIV activity (see, for example, references 5, 22, 52, and 60). The subcellular distribution of A3 proteins is also a distinguishing feature (1, 2, 7, 8, 10, 12, 25, 26, 28, 37, 41-43, 45, 47, 52, 54, 55). For instance, A3G is predominantly cytoplasmic, whereas A3B is primarily nuclear. These differences in subcellular localization presumably reflect differences in the sets of proteins and/or nucleic acids with which the A3s interact, and they also suggest that the A3s perform distinct functions within the cell.
The A3 proteins are evolutionarily related to the mRNA editor APOBEC1 and the antibody gene DNA deaminase AID. AID is present in all vertebrates, but A3s are found only in mammals, suggesting that an ancestral AID gene duplicated and diverged to give rise to the present day A3 genes (19, 30, 35, 40, 50, 53). It was therefore reasonable to hypothesize that the A3s and AID share conserved properties in addition to DNA cytosine deaminase activity. For instance, both AID and APOBEC1 are nucleocytoplasmic shuttling proteins that are exported from the nucleus by the CRM1-dependent nuclear export pathway (9, 15, 33, 44, 58, 59). A3G also has a putative leucine-rich CRM1 nuclear export sequence spanning residues 369 to 379 (Fig. 1). However, localization studies indicated that A3G is not subject to CRM1-dependent nuclear export (1). Rather, its cytoplasmic localization was ascribed to a 16-residue peptide, dubbed a cytoplasmic retention signal (CRS), spanning residues 113 to 128 (2).
![]() View larger version (59K): [in a new window] |
FIG. 1. Alignment of A3B, A3F, and A3G amino acid sequences, highlighting regions of concentrated identity. Blue and green shading indicates regions of concentrated identity between A3G and A3F and between A3B and A3F, respectively. Gray-shaded residues are identical between at least two of the proteins. The crossing arrows indicate the junctions of chimeric proteins. Dashed boxes outline the putative CRS and NES of A3G and the putative NLS of A3B. The histidine, glutamic acid, and cysteine residues of the two zinc-coordinating motifs are indicated in boldface. The amino acids deleted in A3G1-369 are underlined. Additional details can be found in the text.
|
|
|
|---|
PCR was used to create the truncated A3B and A3G/GFP fusion constructs. The primers were as follows: A3G1-369 (5'-NNN NGA GCT CAG GTA CCA CCA TGA AGC CTC ACT TCA GAA AC [RSH431] and 5'-NNN NGT CGA CTT GGC TGT GCT CAT CTA GTC C), A3B-NTD (5'-NNN NGA GCT CGG TAC CAC CAT GAA TCC ACA GAT CAG AAA T [RSH567] and 5'-NNN NGT CGA CCA TCC TTC CCA GGT ATC TGA GAA TCT CCT TTA G), A3B-CTD (5'-NNN NGT CGA CCA TCC TTC ACA GGT ATC TGA GAA TCT CCT TTA G and 5'-NNN NGT CGA CCA TCC TTC CGT TTC CCT GAT TCT GGA G [RSH580]), A3G-NTD (RSH431 and 5'-NNN NGT CGA CCG AGT GTC TGA GAA TCT CCC CC), and A3G-CTD (5'-NNN GAG CTC AGG TAC CAC CAT GGA TCC ACC CAC ATT CAC TTT C and 5'-NNN GTC GAC TCC GTT TTC CTG ATT CTG GAG AAT [RSH498]). PCR products were digested with SacI and SalI and ligated into similarly digested pEGFP-N3.
The A3B/G chimeric constructs were created using overlapping PCR. The primers used were A3B1-60G60-384 (RSH567, 5'-CTG GGT GGT ACT TAA GTT CGG AAT ACA CCT GGC CTC GAA AGA C, 5'-TCC GAA CTT AAG TAC CAC CCA G, and RSH498), A3G1-59B61-382 (RSH431, 5'-GCG TGG TAC TGA GGC TTG AAA TAC ACC TGG CCT CGA AAG AC, 5'-TTC AAG CCT CAG TAC CAC GC, and RSH580), A3B1-190G195-384 (RSH567, 5'-GTG GGT GGA TCC ATC GAG TGT CTG AGA ATC TCC TTT AGC G, 5'-CAC TCG ATG GAT CCA CCC AC, and RSH498), and A3G1-194B191-382 (RSH431, 5'-GTG TCT GGA TCC ATC AGG TAT CTG AGA ATC TCC CCC AGC A, 5'-TAC CTG ATG GAT CCA GAC AC, and RSH580). PCR products were digested with SacI and SalI and ligated into a similarly digested pEGFP-N3.
The GFP-tagged truncated and chimeric constructs were subcloned into C-terminal hemagglutinin (HA)-tagged mammalian and bacterial expression vectors. The A3 coding region was cut out of the pEGFP-N3-based plasmids by using KpnI and SalI and ligated into KpnI/XhoI-digested pcDNA3.1-HA and pTrc99-HA (52).
Amino acid substitution mutants were generated by using site-directed mutagenesis (QuikChange, Stratagene; primer sequences available on request). A3G double mutants were created by site-directed mutagenesis using single mutants as PCR templates. pEGFP-N3-A3G-NTD mutants were subcloned as KpnI/RsrII fragments into similarly digested full-length A3G expression constructs pEGFP-N3-A3G and pcDNA3.1-A3G-HA, reported previously (52). All constructs were confirmed by restriction digestion and DNA sequencing.
Cell culture. Human embryonic kidney 293T and HeLa cells were maintained in Dulbecco modified Eagle medium supplemented with 10% fetal bovine serum and 25 U of penicillin/ml and 25 µg of streptomycin/ml at 37°C and 5% CO2. All transfections used FuGENE (Roche Applied Science) or TransIT-LT1 (Mirus Bio Corp.) according to the manufacturers' protocols.
Microscopy. HeLa or 293T cells were seeded on eight-well LabTek chambered cover glasses (Nunc). After 24 h of incubation, the cells were transfected with the indicated pEGFP-N3-based constructs. After an additional 24 h of incubation, images of the live cells were captured on a Zeiss Axiovert 200 microscope. Images were cropped, and their brightness and contrast were adjusted linearly using ImageJ software (http://rsb.info.nih.gov/ij/). For leptomycin B (LMB) experiments, cells were treated for 2 h with 20 ng of LMB (LC Laboratories)/ml or mock treated with an equivalent dilution of ethanol (the vehicle).
For immunofluorescence experiments, HeLa cells were seeded onto sterilized cover glasses set in culture dishes. After 24 h of incubation, the cells were transfected with the indicated pcDNA3.1-HA-based constructs. After an additional 24 h of incubation, cells were fixed with 4% formaldehyde, permeabilized in 0.2% Triton X-100, incubated first with anti-HA antibody (Covance) and then with fluorescein-conjugated anti-mouse secondary antibody (Jackson Immunoresearch), and imaged as described above.
Escherichia coli mutation assays. An E. coli-based rifampin-resistance assay was used to quantify the intrinsic DNA cytosine deaminase activity of A3B, A3G, and chimeric and mutant derivatives. This assay has been described previously (see, for example, references 29 and 31).
Immunoblotting. 293T cells were plated in six-well plates. After 24 h of incubation, the cells were transfected with APOBEC expression plasmid (or appropriate control plasmids). At 36 h later, the cells were harvested, and total proteins were extracted in a buffer containing 50 mM Tris (pH 7.5), 150 mM NaCl, 1% Nonidet P-40, 0.1% sodium dodecyl sulfate, 0.5% sodium deoxycholate, and a protease inhibitor cocktail (Roche). The extracts were clarified by centrifugation for 10 min at 20,800 x g at 4°C. The extracted proteins were fractionated by sodium dodecyl sulfate-polyacrylamide gel electrophoresis, transferred to a polyvinylidene difluoride membrane (Millipore), and probed with monoclonal anti-HA (Covance) or monoclonal anti-GFP (Clontech) antibody. Primary antibodies were detected by incubation with horseradish peroxidase-coupled anti-mouse immunoglobulin G (Bio-Rad) or anti-rabbit immunoglobulin G (Bio-Rad), followed by chemiluminescence (Roche Applied Science). Ponceau S (Sigma) staining of the polyvinylidene difluoride membrane following immunoblotting served as a protein loading control.
Amino acid alignments. The amino acid sequences of A3B, A3F, and A3G were aligned by using CLUSTAL W and edited manually (Fig. 1). The protein sequences correspond to GenBank accession numbers NP_004891, NP_660341, and NP_068594, respectively.
Structural modeling of A3Gntd. The primary amino acid sequences of A3G-NTD (residues 1 to 196) and A3G-CTD (residues 197 to 384) were aligned by using the homology modeling module of the InsightII program (Accelrys) (see Fig. S1 in the supplemental material). These amino acid sequences are 35% identical and 52% similar (according to the bl2seq program [National Center for Biotechnology Information]). The secondary structure elements of A3G-NTD were predicted by projecting the actual secondary structure of A3G-CTD (PDB 2jyw[13]) over the A3G-NTD/A3G-CTD primary amino acid sequence alignment. Because of amino acid conservation in the corresponding regions, the secondary structure elements of A3G-NTD were predicted to correspond directly with the actual elements in the A3G-CTD (see Fig. S1 in the supplemental material). Finally, this information was used to construct a three-dimensional model of the A3G-NTD (InsightII program; Accelrys). Images of the model were created by using MacPyMol (DeLano Scientific).
|
|
|---|
![]() View larger version (47K): [in a new window] |
FIG. 2. Predicted nucleocytoplasmic shuttling signals in A3G and A3B are not required for their subcellular distributions. Images of representative live HeLa cells transiently transfected with the indicated A3-GFP expression plasmids are shown. Scale bar, 10 µm. (A) LMB caused AID-GFP but not A3G-GFP to accumulate in the nucleus. Minus (–) LMB indicates mock-treated cells. (B) Localization of A3G-GFP and the A3G1-369-GFP mutant lacking the putative C-terminal NES. (C) Localization of wild-type A3B and a mutant A3B with four alanines replacing residues in the putative NLS (A3B4Ala).
|
The N-terminal halves of A3G and A3B recapitulate the localization of the full-length proteins. A3G and A3B are termed "double-domain" A3s because each has two zinc-binding domains. The genes of the double-domain proteins are likely to have arisen from the duplication of an ancestral "single-domain" AID-like APOBEC3 gene (19, 30, 35, 53). We therefore examined the localization of the N- and C-terminal halves of A3B and A3G by themselves to map the determinants of the localization of the full-length protein. Previous structural studies on the C-terminal domain (CTD) informed the selection of the dividing line between the domains, which was chosen so as to not disrupt secondary structure and to ensure that the CTD remained catalytically intact (Fig. 1 and see Fig. S1 in the supplemental material) (13, 14).
The N-terminal domain (NTD) of A3G (A3G1-196-GFP) localized predominantly to the cytoplasm, and the NTD of A3B (A3B1-192-GFP) localized predominantly to the nucleus, much like their respective full-length parental proteins (Fig. 3A). In contrast, their CTDs, A3G197-384-GFP and A3B193-382-GFP, were distributed throughout the cell in a manner indistinguishable from that of GFP alone. Similar data were obtained with HA-tagged proteins in fixed cells (Fig. 3B). Thus, the N-terminal halves of the proteins clearly contained their primary localization determinants. These data are largely consistent with prior deletion studies (2, 7).
![]() View larger version (67K): [in a new window] |
FIG. 3. The N-terminal halves of A3B and A3G recapitulate the localization patterns of the full-length proteins. Representative images of HeLa cells transfected with the indicated A3-GFP or A3-HA expression plasmids are shown. The NTD of A3B and A3G consists of amino acids 1 to 192 and amino acids 1 to 196, respectively. The CTD of A3B and A3G consists of amino acids 193 to 382 and amino acids 197 to 384, respectively. (A) Live cells expressing the indicated GFP-fusion proteins. (B) Fixed cells expressing the indicated HA-fusion proteins. Scale bar, 10 µm.
|
5 in the A3G-NTD model structure; Fig. 1 and see Fig. S1 in the supplemental material). Similar crossover points were previously used to generate functional A3G/F chimeras and an A3G-A3A chimera (4, 28, 29). Like A3B, the chimera consisting of the NTD of A3B and the CTD of A3G localized predominantly to the nucleus (Fig. 4A and B). In contrast, like A3G, the chimera fusing the NTD of A3G and the CTD of A3B localized predominantly to the cytoplasm (although we noted that this chimera exhibited a higher degree of nuclear fluorescence than wild-type A3G). Virtually identical results were obtained with GFP-tagged proteins in live cells and HA-tagged proteins and immunofluorescence in fixed cells (Fig. 4A and B). These data support the conclusion that the N-terminal halves of A3B and A3G provide the primary determinants of their subcellular localization.
![]() View larger version (34K): [in a new window] |
FIG. 4. The first 60 amino acids of A3B and A3G possess essential subcellular localization determinants. Representative images of HeLa cells transfected with the indicated chimeric-APOBEC3 expression plasmids are shown. The GFP-tagged constructs were visualized in live cells, and the HA-tagged proteins were visualized in fixed cells. (A) Cells expressing A3B or A3G. (B) Cells showing the localization of chimeras that fuse the NTD of A3B with the CTD of A3G and vice versa. (C) Cells showing the localization of chimeras that swap the first 60 amino acids of A3B for those of A3G and vice versa. Scale bar, 10 µm.
|
We therefore constructed chimeras that replaced the first 60 amino acids of A3G with the corresponding residues or A3B and vice versa (Fig. 1). These chimeric proteins are also likely to be structurally sound, because the fusion junctions were predicted to lie in a flexible loop between the β2 strand and
1 helix (see Fig. 1 and Fig. S1 in the supplemental material) (13). To verify their structural integrity, we performed E. coli-based mutation assays. All of the chimeras exhibited intrinsic DNA deaminase activity equal to or greater than that of the proteins from which they were derived (data not shown). We next assessed the localization patterns of these chimeras. Like A3G, the A3G1-59B61-382 chimera exhibited a predominantly cytoplasmic localization in the majority of cells (Fig. 4C). In contrast, the A3B1-60G60-384 chimera localized predominantly to the nucleus in most cells. Similar results were again obtained with GFP- and HA-tagged proteins (Fig. 4C). We noted that these results were less clear-cut than those for the chimeras crossing over at the midway point. Nevertheless, the first 59 residues of A3G conferred an A3G-like localization pattern to the rest of A3B. Moreover, replacing the first 59 residues of A3G with those of A3B caused A3G to adopt an A3B-like localization pattern. These results supported the conclusion that the first 60 amino acids of A3G harbor a major subcellular localization determinant.
Single amino acid substitutions can disrupt the subcellular distribution of A3G. To more finely map the region of A3G responsible for its cytoplasmic localization, we constructed a panel of amino acid substitution mutants within this critical 60-amino-acid N-terminal region. A3G and A3B differ at 27 residues within their first 63 amino acids (Fig. 1). At 25 of these differing positions, we generated a mutant with the A3G residue replaced by the corresponding A3B residue. For instance, we created an A3G mutant with lysine 2 replaced by asparagine 2 of A3B (A3G-K2N). We did not generate two possible mutants because they were deemed conservative (M9/V9, I53/V54).
We first examined the subcellular distribution of the single amino acid substitution mutants, initially as A3G-NTD-GFP derivatives. The localization of most of the mutants was predominantly cytoplasmic and indistinguishable from the wild-type protein (e.g., A3G-NTD-S18Y-GFP in Fig. 5A; see also Fig. S2 in the supplemental material for representative fields of cells expressing each mutant). However, several mutants, such as Y19D and Y22E, exhibited a clearly disrupted localization pattern (Fig. 5A). For instance, 86% of the Y19D-expressing cells and 64% of the Y22E-expressing cells had a cell-wide or predominantly nuclear fluorescence localization pattern (Fig. 5B). Several other mutants, such as the R24E, S28Y, T32Y, and S60F mutants, also mislocalized (Fig. 5B). As a control, we characterized the expression level of the A3G-NTD-GFP mutants by immunoblotting (Fig. 5C). Some of the mutations resulted in an apparent decrease in steady-state protein expression level or a decrease in protein solubility. Immunoblotting also revealed the presence of cross-reacting bands of higher mobility that could correspond to fragments of the A3G-NTD-GFP proteins. However, there was no apparent correlation between mislocalization and expression level or fragmentation. These data therefore defined single amino acids required for A3G-NTD localization and confirmed that the first 60 amino acids form a region critical for determining the protein's subcellular distribution.
![]() View larger version (68K): [in a new window] |
FIG. 5. Single amino acid substitutions within the first 60 amino acids of A3G disrupt cytoplasmic localization. (A) Representative images of live HeLa cells transiently transfected with the indicated A3G-NTD-GFP or derivative expression plasmids (see Fig. S2 in the supplemental material for a full set of representative fields). Scale bar, 10 µm. (B) Quantification of the extent of disrupted localization for the mutants. Individual cells expressing the indicated proteins were scored and grouped into three categories based on their overall pattern of cellular fluorescence: those with more apparent nuclear than cytoplasmic fluorescence (C < N), equivalent nuclear and cytoplasmic fluorescence (C = N), or more apparent cytoplasmic than nuclear fluorescence (C > N). The percentage of cells falling into each category is indicated for each mutant. Cells from two to four independent experiments were scored, and the tallies were combined. The total number of cells (N) scored for each construct is indicated. (C) Anti-GFP immunoblot showing the expression of the indicated A3G-NTD-GFP proteins. An arrowhead indicates the band corresponding to A3G-NTD-GFP and mutant derivatives. Ponceau S staining of the membrane served as a protein loading control.
|
![]() View larger version (46K): [in a new window] |
FIG. 6. Two regions within the N-terminal half of A3G cooperatively determine the cytoplasmic localization of the full-length protein. Representative images of live HeLa cells transfected with the indicated A3-GFP expression plasmids. Scale bar, 10 µm. Representative cells showing the localization of the indicated full-length A3G single mutants (A) or full-length A3G double mutants (B) are shown.
|
![]() View larger version (39K): [in a new window] |
FIG. 7. Predicted three-dimensional structure of the N-terminal half of A3G highlighting residues that influence cytoplasmic localization. A3G residues 1 to 65 are colored light green. The conserved zinc-coordinating residues H65, E67, C97, and C100 are colored cyan and labeled (Zn2+). Residues within the first 60 amino acids whose mutation results in a mislocalization of A3G-NTD-GFP are highlighted in red, orange, and yellow according to the degree of mislocalization (>50%, 25 to 50%, or 10 to 25%, respectively). The previously reported CRS (2) is tan, and critical residues within this motif are indicated (F126 and W127). Surface and ribbon representations of the model structure are shown with secondary structures numbered according to the A3G-CTD structure from the N to the C terminus as β1-β2/2' (not visible here)- 1-β3- 2-β4- 3-β5- 4- 5 (Fig. S1 in the supplemental material) (13). Two residues with a significantly altered localization pattern [L49W and ins(R42)] were not predicted to be on the same surface, but it is probable that these mutations perturb the protein's structure.
|
|
|
|---|
Several lines of evidence clearly showed that the first 60 amino acids of A3G contribute to cytoplasmic localization. First, N-terminal amino acid conservation between cytoplasmic A3G and A3F, but not nuclear A3B, implicated this region in such a role. Second, replacing the first 60 amino acids of A3G with the corresponding A3B residues caused the chimeric protein to redistribute to the nucleus. Third, replacing the first 60 residues of A3B with this portion of A3G caused the resulting chimera to become predominantly cytoplasmic. Fourth, single amino acid substitutions in this region of A3G caused the N-terminal half of the protein, which is normally cytoplasmic, to redistribute throughout the cell. Fifth, the critical amino acids so identified clustered on a predicted solvent-exposed surface with the potential to mediate an interaction required for cytoplasmic localization. These data therefore combined to demonstrate that this N-terminal portion of A3G provides key determinants of cytoplasmic localization.
However, there was also evidence that this 60-residue region alone is not solely responsible. In particular, the single amino acid substitutions that disrupted the localization of the N-terminal half of the protein failed to mislocalize full-length A3G. A recent study from the Smith group indicated that A3G residues 113 to 128 may also be involved in cytoplasmic localization (2). Our data showed that, like the Y19 region, this region alone is not sufficient for cytoplasmic localization of full-length A3G because mutation of five of the predicted solvent-exposed residues within this region had no effect on the protein's cellular distribution. However, two double mutants that combined substitutions within the two critical regions, Y19D-F126S and Y19D-W127A, caused full-length A3G to distribute cell-wide. These data therefore indicated that these two regions cooperate to determine the cytoplasmic localization of full-length A3G. It was further interesting that these regions were predicted to be juxtaposed on the same solvent-accessible surface (Fig. 7). Taken together with prior studies demonstrating the propensity for A3G to bind RNA and to form ribonucleoprotein complexes (17, 18, 26, 34, 35, 38), it is likely that the surface defined here mediates at least one of these important macromolecular interactions and thereby controls A3G's subcellular distribution.
M.D.S. was supported in part by a University of Minnesota Cancer Biology Training Grant (CA009138) and a 3M Science and Technology Graduate Fellowship. H.M. was supported in part by NIH grant AI073167. R.S.H. was supported in part by a Searle Scholarship, a University of Minnesota McKnight Land Grant Assistant Professorship, and NIH grants AI064046 and GM080437.
Published ahead of print on 30 July 2008. ![]()
Supplemental material for this article may be found at http://jvi.asm.org/. ![]()
|
|
|---|
5' on single-stranded DNA. Nat. Struct. Mol. Biol. 13:392-399.[CrossRef][Medline]
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»