Previous Article | Next Article ![]()
Journal of Virology, August 2006, p. 7636-7644, Vol. 80, No. 15
0022-538X/06/$08.00+0 doi:10.1128/JVI.00567-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Thermal Biology Institute, Montana State University, Bozeman, Montana 59717,1 Department of Chemistry and Biochemistry, Montana State University, Bozeman, Montana 59717,2 Physiologisch-Chemisches Institut der Universität Tübingen, Hoppe-Seyler-Strasse 4, D-72076 Tübingen, Germany,3 Department of Plant Sciences and Plant Pathology, Montana State University Bozeman, Montana 59717,4 Department of Microbiology, Montana State University, Bozeman, Montana 597175
Received 17 March 2006/ Accepted 15 May 2006
|
|
|---|
|
|
|---|
Sulfolobus turreted icosahedral virus (STIV) was the first hyperthermophilic virus described with icosahedral capsid architecture. STIV infects Sulfolobus species resident in the acidic hot springs (pH 2.9 to 3.9 and 72 to 92°C) of Yellowstone National Park (54). It encapsidates a circular, double-stranded DNA genome of 17,663 bp containing 36 open reading frames, whose predicted products generally exhibit a lack of similarity to sequences in the publicly available databases, thus precluding functional assignment. The major capsid protein (B345), however, has been isolated from purified virus by sodium dodecyl sulfate-polyacrylamide gel electrophoresis and identified by mass spectrometry (54). Purified virus has also allowed an electron cryomicroscopy reconstruction of STIV revealing a pseudo-T=31 icosahedral capsid with large turret-like projections extending from the vertices and an apparent internal membrane (54). The capsid architecture of STIV has generated much interest because it is common to viruses from all three domains of life, suggesting that this viral lineage shares a common ancestor that predates the fundamental evolutionary events that gave rise to the Archaea, Bacteria, and Eukarya (6, 29, 54). The recent crystal structure of the STIV major capsid protein (29) shows stunning structural homology to the major capsid proteins of bacteriophage PRD1 (7) and the eukaryotic viruses Paramecium bursaria Chlorella virus 1 (PBCV-1) (41) and adenovirus (56), further supporting this evolutionary lineage (29).
It is highly unlikely that most of the STIV proteome consists of unique protein folds serving novel functions. Rather, like the major capsid protein, the general lack of sequence similarity in the STIV open reading frames is probably due to the unique environment in which these viruses live and to the evolutionary distance between them and their bacterial and eukaryotic homologues. With respect to protein evolution, it is well established that similarity between related protein structures will persist longer than similarities in their corresponding amino acid sequences. While the structure of the STIV major capsid protein is an excellent example of this, there are many other examples in which protein structure has suggested functional and evolutionary relationships (6, 7, 11, 29, 31, 32, 39, 55, 61, 62). To this end, we have initiated crystallographic studies of the STIV proteome. Like similar studies of Sulfolobus spindle-shaped virus 1 (31, 32), structural analysis of the STIV gene products is expected to make specific suggestions regarding the functions and roles that these proteins play in the STIV life cycle.
Here we report on the structure of A197, one of the first STIV proteins to lend itself to crystallographic study. A197 is a 197-residue protein whose sequence provides little insight into its function. For example, standard BLAST (3) and CD searches (36) with default settings fail to identify proteins with similar sequence or structure. We have determined the crystal structure of STIV A197 and identify it as a glycosyltransferase. To our knowledge, this represents the first structure of a viral or archaeal glycosyltransferase displaying the GT-A fold. Potential roles for A197 in the STIV life cycle and their evolutionary implications are discussed.
|
|
|---|
Expression, purification, and characterization of A197. Typically, pEXP14-A197 was transformed into E. coli BL21-CodonPlus(DE3)-RIL (Stratagene), and a single colony was used to inoculate 5 ml of LB medium and grown overnight, with subsequent serial expansion to a 10-liter fermentor (New Brunswick; BIOFLO 2000) batch culture, all at 37°C. Medium for batch fermentation was as recommended by the manufacturer (53). All media contained 100 µg/ml ampicillin. Fermentor batch cultures were supplemented with filtered air at a flow rate of 10 liters/minute and stirred at 300 to 500 rpm. Cells were grown to an optical density at 600 nm of 0.8 to 1.5, and protein expression was then induced by addition of 1 mM IPTG (isopropyl-ß-D-thiogalactopyranoside). After an additional 4 to 6 hours of growth, cells were harvested by centrifugation (IEC PR-7000 M) at 6,000 x g for 20 min and pellets were stored at 80°C until needed.
For purification, cell pellets were thawed and resuspended in lysis buffer (10 mM Tris-HCl, pH 8.0, 400 mM NaCl) at 10 ml/g of wet cell pellet mass. Phenylmethylsulfonyl fluoride (0.1 mM) was added to the cell suspension, and cells were lysed by passage through a microfluidizer (Microfluidics Corporation, Newton, MA) or a Power laboratory press (American Instrument Co., Inc., Silver Spring, MD), depending on the volume of the cell suspension. The cell lysate was incubated at 65°C for 20 min to denature many of the contaminating E. coli proteins and clarified by centrifugation (Beckman; Avanti J-30I) at 22,000 x g for 20 min. The resulting cleared lysate was applied to a column containing a 3- to 5-ml bed volume of HIS-Select nickel affinity gel (Sigma-Aldrich), washed with at least 10 column volumes of wash buffer (lysis buffer plus 5 mM imidazole), and eluted with elution buffer (10 mM Tris-HCl, pH 8.0, 50 mM NaCl, 200 mM imidazole). Fractions containing A197 were then pooled and loaded onto a Superdex 75 gel filtration column (Amersham Biosciences) for further purification and buffer exchange into a minimal buffer (10 mM Tris-HCl, pH 8.0, 50 mM NaCl) for crystallization. Peak fractions were pooled and concentrated using a 10-kDa molecular mass cutoff (Amicon ultracentrifugal filter devices; Millipore) to 10.5 mg/ml. Protein concentrations were determined by Bradford assay (9) using "protein assay reagent" (Bio-Rad) and bovine serum albumin as a standard. Protein yield was typically 2 to 5 mg/g of cell pellet. Molecular weight and purity were assessed by 12% sodium dodecyl sulfate-polyacrylamide gel electrophoresis. Dynamic light scattering (DLS) measurements were made on a Brookhaven Instruments ZetaPals (phase analysis light scattering) particle size/
potential analyzer. DLS was measured at 90o using a 661-nm diode laser, and the correlation functions were fitted using a constrained least-squares analysis as described previously (2).
For expression and purification of selenomethionine-incorporated A197, pEXP14-A197 was transformed into E. coli B834(DE3)pLysS (Novagen), a methionine auxotroph. Methionine auxotrophy was confirmed for a single colony, which was then used to inoculate 5 ml of medium, essentially as described previously (60), but supplemented with 1 ng/ml biotin, 50 ng/ml L-(+)-selenomethionine, and 100 µg/ml ampicillin, followed by serial expansion to a 10-liter batch culture (New Brunswick; BIOFLO 2000) in the same medium. After growth to optical density at 600 nm of 0.6 to 0.8, protein expression was induced and the protein purified as described above for native protein.
Crystallization and data collection. Purified A197 was crystallized by hanging drop vapor diffusion at 22°C. Drops were assembled with 2 µl of A197 (10.5 mg/ml in 10 mM Tris-HCl, pH 8.0, 50 mM NaCl) mixed with 2 µl of well solution (0.1 M MES [morpholineethanesulfonic acid], pH 6.5, 1.2 to 1.6 M ammonium sulfate, 10% 1,4-dioxane). Selenomethionine-incorporated A197 crystals were grown under identical conditions. Single rod-shaped or wedge-shaped crystals were isolated, and cryoprotectant was introduced with a quick soak (10 to 60 seconds) in well solution supplemented with 22.5 to 25% glucose. Crystals were then flash frozen in liquid nitrogen.
Multiwavelength anomalous diffraction was used to solve the structure. Data were collected to 2.28-Å resolution at the Se-K edge (peak, remote, and inflection wavelengths) on beamline 11-1 at the Stanford Synchrotron Radiation Laboratory (SSRL) (Table 1). An additional single-wavelength data set to 1.86-Å resolution was collected on beamline 1-5 at SSRL and used for refinement (Table 1). Data were integrated, scaled, and reduced in space group I222 (Table 1) using the HKL2000 software package (43).
|
View this table: [in a new window] |
TABLE 1. Data collectionc
|
X-rays and a MAR345 image plate detector. The data were integrated and scaled using HKL2000 (43).
Structure determination and refinement.
SOLVE (59) was used to determine the positions of the selenium substructure and to calculate initial phases. Four Se sites were identified, corresponding to one A197 molecule per asymmetric unit. The SOLVE output was then used in RESOLVE (57, 58) for density modification and initial model building. The resulting electron density map was of good quality, allowing the best parts of various models output by RESOLVE to be manually assembled into a composite model using O (27), followed by refinement with REFMAC5 (5, 40). The model was then further refined against the 1.86-Å data set. The refinement included the use of TLS (translation/libration/screw) parameters in which the model was broken into 12 TLS groups (1, 1 to 27; 2, 28 to 41; 3, 42 to 52; 4, 53 to 67; 5, 68 to 81; 6, 82 to 91; 7, 92 to 107; 8, 108 to 150; 9, 151 to 166; 10, 167 to 176; 11, 177 to 189; 12, 190 to 198) as suggested by the TLS motion determination home (http://skuld.bmsc.washington.edu/
tlsmd/) (44-46). Iterative rounds of model building and refinement with O and REFMAC5 resulted in a final model with an Rcryst of 17.1% and an Rfree of 20.4% (Table 2). The final model has good stereochemistry, with all residues in allowed regions of the Ramachandran plot (33) (Table 2). Amino acids 139 to 147 were not modeled due to the lack of interpretable electron density, presumably due to disorder in this region of the protein. The protein quaternary structure server (http://pqs.ebi.ac.uk/) was used to calculate the surface area at the putative dimer interface. Structural comparisons were performed using the DALI (http://www.ebi.ac.uk/dali) (26) and VAST (http://www.ncbi.nlm.nih.gov/Structure/VAST/) (22) servers. Structural figures were generated with PYMOL (http://www.pymol.org) (17).
|
View this table: [in a new window] |
TABLE 2. Model refinement
|
|
|
|---|
A197 crystallizes in space group I222 with one copy of the A197 polypeptide in the asymmetric unit. The structure was initially solved at a resolution of 2.28 Å by multiwavelength anomalous diffraction using selenomethionyl-incorporated protein. The final model was refined against data collected on a second crystal to a resolution of 1.86 Å. Details on data collection and model refinement are presented in Tables 1 and 2.
Structure of A197.
The structure of the A197 monomer reveals a six-stranded, predominantly parallel,
/ß/
sandwich that is flanked by a four-stranded antiparallel ß-sheet with an extended C terminus. This C-terminal tail shows significant interaction with a neighboring molecule in the crystal (Fig. 1). Residues 139 to 147, which connect helices
4 and
5, are disordered in the crystal structure and thus are not included in the model. The topology of the central ß-sheet is 3, 1, 4, 7, 6, 8, with strand 7 running in the antiparallel direction, while the flanking four-stranded antiparallel ß-sheet is composed successively of ß-strands 5, 9, 10, and 11.
![]() View larger version (77K): [in a new window] |
FIG. 1. (A) Stereo image of the A197 monomer. The ribbon diagram depicts secondary structural elements of A197 (ß-strands are blue, -helices are red, and loops are wheat), which are labeled in ascending order from the N terminus to the C terminus. *, N- and C-terminal ends of the disordered region within the 4- 5 loop (residues 139 to 147). (B) Ribbon diagram of the A197 homodimer looking into the putative active site. One monomer is blue, and the second is red. The orientation of the blue monomer is rotated slightly counterclockwise with respect to that depicted in Fig. 1A. Secondary structural elements and the termini of the disordered region within the 4- 5 loops are labeled as in panel A. (C) Relative to panel B, the A197 homodimer has been rotated 90° about the depicted axis. The putative active site now runs along the top of the figure. (D) Stereo image of the electrostatic surface of the A197 homodimer. The orientation is identical to that in panel B, serving to highlight the putative active site. *, termini of the disordered region within the 4- 5 loop. The putative donor substrate-binding sites are marked with stick representations of the superpositioned donor substrate of GnT I. The electrostatic potential was mapped to the surface of A197 with SPOCK (13), using a probe radius of 1.4 Å, a temperature of 353°K, an ionic strength of 0.15 M, and protein and solvent dielectric constants of 4 and 80, respectively. The color ramp of the surface is from 15 kT/e (red, acidic) to 15 kT/e (blue, basic). All images were prepared and rendered using PyMOL (17).
|
/ß/
sandwich to seven strands, showing a topology of 12, 3, 1, 4, 7, 6, 8, with strands 12 and 7 running antiparallel to the remaining strands. These interactions suggest that the dimer seen in the crystal is related to the higher-order oligomer indicated by dynamic light scattering. This putative dimer exhibits overall dimensions of approximately 80 Å by 45 Å by 40 Å. Relative to known dimer interfaces, this putative dimer interface is extensive, with occlusion of 1,950 Å2 of solvent-accessible surface area per monomer and a buried surface that is rich in hydrophobic residues.
The top edge of the central ß-sheet, along with ß-strand 2, creates the "floor" of a large pocket that is lined by elements of helix
2, helix
5, and the ß4-ß5 loop on one side and by the N-terminal half of the long ß6-ß7 loop, ß-strand 5, ß-strand 9, and much of the ß9-ß10 loop on the other. Formation of the crystallographic dimer results in an extended cleft that spans the dimer interface (Fig. 1D). The largest pocket on a protein surface is usually involved in ligand recognition and, in the case of enzymes, will mark the active site (34). Thus, this pocket is likely to play an important role in the activity of A197.
Structural homology to glycosyltransferases. A DALI search (26) for structurally similar proteins identifies members of the glycosyltransferase superfamily as the closest structural homologues. Glycosyltransferases catalyze the transfer of a sugar moiety from an activated donor sugar onto saccharide and nonsaccharide acceptors (15). Members of the glycosyltransferase superfamily exhibit extreme diversity in their amino acid sequences, reflecting the large number of different donor and acceptor molecules that are utilized by this class of enzymes. Sequence similarity and biochemical characterization have been used to group the glycosyltransferase superfamily into 85 distinct families (as of May 2006) designated GT1 to GT85 (14, 15). However, this number is certain to grow as more of these genes are identified and their products characterized. Characteristics of the various families along with current membership are available from the carbohydrate-active enzyme (CAZy) database (http://afmb.cnrs-mrs.fr/CAZY/).
In contrast to the diverse sequence composition of these enzymes, structural studies have identified only two predominant folds, designated GT-A and GT-B, though a third fold related to the GT-A fold has been found in sialyltransferase CstII from Campylobacter jejuni (10, 12, 16). Recent structural work in a number of laboratories has allowed the GT-A and GT-B folds to be assigned to many of the GT families (10). Importantly, our DALI search with A197 identified glycosyltransferases that exhibit the GT-A fold. Interestingly, three of the four most similar matches are to eukaryotic glycosyltransferases. These show highly significant Z scores (
9) and root mean square deviations (RMSDs) near 3.0 Å for superposition of structurally equivalent C
atoms, despite less than 15% sequence identity over equivalent residues.
The structure of A197 differs from the canonical GT-A fold in a few minor ways. First, the canonical fold shows a seven-stranded central ß-sheet, whereas there are only six strands in A197. The ß-strand running along the outside edge of the central ß-sheet and the preceding
-helix are not present. In A197, however, dimer formation supplies a replacement in the form of ß-strand 12, though it runs in the opposite direction (antiparallel). A second minor difference is the addition of ß-strands 10 and 11 to extend the size of the flanking antiparallel ß-sheet, though extended versions of this flanking ß-sheet have been seen before. For example, the nucleotide-diphospho-sugar transferase from Bacillus subtilis, SpsA (11a) also shows a four-stranded antiparallel ß-sheet, though the connectivity differs from that in A197. Apart from the addition to this flanking ß-sheet in A197, there is little decoration of its GT-A fold. In contrast, most enzymes with glycosyltransferase activity are significantly larger than A197 and often contain additional domains. Thus, this putative viral glycosyltransferase represents a minimal form of this large, extremely diverse, and highly important class of enzymes.
A197 shows greatest structural similarity to the catalytic domain of rabbit N-acetylglucosaminyltransferase I (GnT I) (62), an inverting glycosyltransferase in CAZy family GT13 (Fig. 2A and B). DALI identifies 149 structurally equivalent residues, with 9% sequence identity, whose C
atoms superpose with an RMSD of 2.9 Å. A refined superposition of the two proteins with LSQMAN (30) identifies a highly conserved structural core consisting of 129 equivalent C
atoms with 12.4% identity and an RMSD of 1.78 Å.
![]() View larger version (66K): [in a new window] |
FIG. 2. Ribbon diagrams of A197 and the equivalent domains of two structural neighbors. The relative orientations are the same in all panels and are identical to that in Fig. 1A. (A) Residues 1 to 197 of the A197 monomer. (B) Residues 106 to 317 of rabbit GnT I (PDB identifier [ID] 1FOA) (62). Secondary structural elements that are shared with A197 are labeled accordingly. The donor substrate (UDP-GlcNAc) is depicted with sticks, and the coordinated divalent manganese cation (Mn2+) is shown as a pink sphere in the active site. The C-terminal end of the structure is marked with dots to indicate the continuation of the polypeptide leading to an additional domain that is not present in A197. (C) Residues 75 to 310 of human GlcAT-I (PDB ID 1FGG) (48). Labels are consistent with those in panels A and B. UDP in the donor-binding site and Galß1-3Gal in the acceptor-binding site are shown as sticks. The coordinated manganese cation is shown as a pink sphere. (D) Superposition of A197 (blue), GnT I (orange), and GlcAT-I (yellow) with detail of key GT-A glycosyltransferase active-site features. Residues constituting the DXD motif (Asp61, Glu62, and Asp63) and catalytic base (Asp151) of A197 are shown in sticks, as are the equivalent residues in GnT I (Glu211, Asp212, Asp213, and Asp291, respectively) and GlcAT-I (Asp194, Asp195, Asp196, and Glu281, respectively). For clarity, only the ribbon diagram of A197 is shown. The green mesh depicts difference electron density (contoured at 6 ) for the manganese binding site; it is found adjacent to the DXD motif and coincides nicely with the expected metal binding site. In GnT I and GlcAT-I, the DXD motif is involved in coordination of the diphosphate moiety of the donor substrate through the intervening Mn2+ ion, while the catalytic base is appropriately positioned for proton extraction from the hydroxyl moiety of the acceptor sugar substrate. Superpositions were prepared using LSQKAB (28). All images were prepared and rendered using PyMOL (17).
|
![]() View larger version (45K): [in a new window] |
FIG. 3. Structure-based sequence alignment of A197 (PDB identifier [ID] 2C0N) with GT-A domains of close structural neighbors. GnT I, residues 106 to 317 of rabbit N-acetylglucosaminyltransferase I (PDB ID 1FOA; family GT13; inverting mechanism) (62); GlcAT-I, residues 75 to 310 of human ß1,3-glucuronyltransferase I (PDB ID 1FGG; family GT43; inverting mechanism) (48); MGS, residues 2 to 99 of Rhodobacter marinus mannosylglycerate synthase (PDB ID 2BO4; family GT78; retaining mechanism) (19); ß4Gal-T1, residues 180 to 347 of bovine ß1,4-galactosyltransferase T1 (PDB ID 1FGX; family GT7; inverting mechanism) (21). The secondary structural elements of A197 are mapped above the alignment (arrows denote ß-strands, and rounded rectangles denote -helices). Long stretches of sequence that are not structurally equivalent to A197 have been removed from the alignment for conciseness and are marked by double shills. The missing residues of the A197 4- 5 loop are in lowercase in the alignment and are marked with dashes in the secondary structure map. Identical residues are highlighted in gray, and the DXD motif and catalytic base are in boldface and are boxed. The structure-based sequence alignment was created with the 3DCoffee@igs web server (http://www.igs.cnrs-mrs.fr/Tcoffee/) (51), using Msap_pair to compute the library and slight manual adjustment around the gaps with consideration of the pairwise structural alignments created by LSQMAN (30).
|
5 as the putative catalytic base in A197. Thus, active-site features that are hallmarks of glycosyltransferase activity, the DXD motif and a putative catalytic base, are preserved within the proper structural context of the GT-A fold, strongly suggesting that A197 is indeed a glycosyltransferase.
Donor substrate binding site.
The structure of GnT I (62) was solved as a complex with its donor substrate, UDP-N-acetylglucosamine, and Mn2+ (Fig. 2B). Thus, superposition of GnT I and the donor substrate upon A197 suggests probable binding sites for an activated sugar substrate and a metal cofactor in the active site of A197. The donor substrate is predicted to sit within the extended active-site pocket that runs across the surface of A197, with the uridine moiety situated over the C-terminal ends of ß1 and ß3, adjacent to the N-terminal end of
2 (Fig. 1D). The diphosphate moiety is predicted to span the C terminus of ß4 and the N terminus of ß7, with the sugar moiety situated over the N terminus of ß7, adjacent to helices
2 and
5. The superposed Mn2+ ion is positioned between the ß4-ß5 loop and the phosphodiester bond of the UDP, a position that corresponds nicely with the major peak seen in the low-resolution difference maps from Mn2+-soaked crystals of A197 (Fig. 1D and 2D).
Interestingly, there is a loop adjacent to the active site in GnT I that remains disordered until the donor substrate is bound. Closure of the loop over the donor substrate not only serves to hold it in place but also serves an important role in the binding of the acceptor substrate, providing a structural explanation for the ordered substrate binding that is seen in GnT I (62). Similarly, nine residues of the loop connecting helices
4 and
5 of A197 are disordered in the absence of substrate. However, relative to GnT I, this disordered loop is situated on the opposite side of the substrate-binding pocket. Nevertheless, its position would allow it to fulfill a similar role in closing over the substrate, possibly serving to complete the formation of an acceptor-binding site.
Acceptor substrate binding site. The DALI search also identified human ß1,3-glucuronyltransferase I (GlcAT-I) (47, 48), an inverting glycosyltransferase in CAZy family GT43, as a close structural homologue (3.1-Å RMSD and 13% identity over 148 structurally equivalent residues) (Fig. 2C). Once again, a refined superposition of GlcAT-I onto A197 with LSQMAN identifies the same highly conserved structural core (120 residues, 15% sequence identity, 1.83-Å RMSD). Interestingly, while the structure of the C-terminal extension in A197 differs significantly from that seen in GlcAT-I, both proteins utilize the C-terminal tail in the formation of an extensive dimer interface. However, unlike A197, the active-site clefts of each monomer do not merge across the dimer interface.
Importantly, the GlcAT-I structure was solved in complex with UDP, Mn2+, and an acceptor substrate analog, Galß1-3Galß1-4Xyl (48). The superposition of GlcAT-I on A197 suggests that the acceptor substrate for A197 will bind adjacent to strand ß6, in a groove formed predominantly by the long intervening region between ß6 and ß7. The acceptor substrate will then be positioned adjacent to the N terminus of
5, an optimal arrangement relative to the catalytic base and the expected position of the donor substrate. A second structure of GlcAT-I has been solved with UDP-glucuronic acid, the full donor substrate, in the active site (47). Not surprisingly, the glucuronic acid moiety binds in a manner similar to that seen for the N-acetylglucosamine in GnT I, further indicating that the relative positions of the donor and acceptor substrates in A197 are reliably indicated by superposition of its closest structural homologues.
|
|
|---|
Due to the large number of potential substrates, limited sequence similarity and three-dimensional structural information are generally insufficient to allow reliable prediction of substrates for a given glycosyltransferase. In the absence of biochemical characterization with the known substrates, it is also difficult to predict whether the enzyme will catalyze sugar transfer with inversion or retention of stereochemistry at the anomeric carbon. For example, all of the structural homologues aligned in Fig. 3 utilize different substrates, and three of the four are inverting enzymes (GnT I, GlcAT-I, and ß4Gal-T1), while mannosylglycerate synthase (MGS) is retaining. Yet the GT-A fold, the DXD motif, and the catalytic base are common to all. MGS illustrates the difficulty in assigning specific properties to a glycosyltransferase in the absence of biochemical characterization. Sequence-based assignment had placed it in inverting family GT2; however, subsequent characterization showed that it utilizes a retaining mechanism; hence MGS was reclassified as the founding member of family GT78 (19). Thus, in the case of A197, the identity of donor and acceptor substrates and elucidation of mechanistic details await biochemical and genetic studies.
A197 is one of the smallest glycosyltransferases that we are aware of, composed of only the core catalytic GT-A fold and lacking additional functional domains. Thus, its structure may define the minimal components necessary for glycosyltransferase activity. The presence of minimal, "stripped-down" structural motifs is common in viruses (37), where the need for genetic efficiency within the viral genome is one common explanation for their occurrence. A197 has yet to be assigned to a GT family in the CAZy database. However, the lack of sequence similarity to other known glycosyltransferases suggests that A197 represents yet another sequence-based glycosyltransferase family.
The CAZy database includes a number of viral glycosyltransferases, including annotated glycosyltransferase genes from other crenarchaeal viruses AFV1 (8), AFV2 (24), ARV1 (63), SIFV (4), SIRV1 (49, 50), SIRV2 (49), and STSV1 (65). These crenarchaeal viral glycosyltransferases have been placed in family GT4, a family that utilizes the retaining mechanism of sugar transfer and lacks structural representation (15). We note that GT4 also includes a glycosyltransferase from PBCV-1 (23, 37), a eukaryotic virus that is believed to be evolutionarily related to STIV. Thus, glycosyltransferases are not uncommon to viral genomes and are frequently found in viruses infecting the Crenarchaea.
Viruses commonly decorate their proteins with sugars as a means of regulating interactions with their hosts. While viruses can accomplish this task by utilizing the host's glycosylation machinery, it is apparent that many viruses also encode their own proteins for specialized glycosylation needs (37). Virally encoded glycosyltransferases are known to serve a variety of functions and target a wide variety of acceptor substrates, including DNA, proteins, and small molecules. For example, some lytic bacteriophages glycosylate their DNA to protect it from host restriction enzymes while others alter the glycosylation of cell surface antigens to induce serotype conversion of the host bacteria during lysogeny or, in the case of lytic phage, to induce receptor conversion, preventing retention of progeny on the host cell debris (37).
In eukaryotic systems, many baculoviruses utilize virally encoded glycosyltransferases to glycosylate host ecdysteroids. This modification alters the biological activity of the hormone, suppressing development of the infected host. Glycosyltransferase-encoding genes are also common in the phycodnavirus family. This includes PBCV-1, where these virally encoded enzymes have been implicated in glycosylation of its major capsid protein, Vp54. While eukaryotic viral proteins are typically glycosylated by host-encoded enzymes located in the endoplasmic reticulum (ER) and Golgi apparatus, Vp54 is apparently glycosylated in an ER and Golgi apparatus-independent manner by virally encoded glycosyltransferases that are expressed in the cytosol. Thus, it has been suggested that glycosylation of Vp54 reflects an ancestral pathway that existed prior to ER and Golgi apparatus formation (23, 37).
This is particularly interesting in light of the putative evolutionary relationship between PBCV-1 and STIV. The similarities between these viruses are not limited to their overall capsid architecture and the structures of their major capsid proteins. In this issue Maaty et al. show that, like Vp54 in PBCV-1, the STIV major capsid protein (B345), is also glycosylated (35). While A197 might serve any number of functional roles, the similarities to PBCV-1 with respect to capsid architecture, fold of the major capsid protein, glycosylation of the major capsid protein, and the occurrence of a virally encoded glycosyltransferase suggest that A197 participates in glycosylation of the major capsid protein in STIV and is likely involved in particle maturation.
Maaty et al. also observe that the STIV major capsid protein gives a horizontal spot train on two-dimensional gels, indicating a series of increasingly acidic species compared to the bacterially expressed protein, which is not glycosylated (35). This suggests that glycosylation introduces negative charge through addition of acidic sugar moieties such as galacturonic acid, glucuronic acid, and iduronic acid (18). Alternatively, the spot train might represent sulfation of neutral saccharides (18).
In addition to the obvious structural and functional roles for protein glycosylation, increased glycosylation has been shown to correlate with environmental extremes of temperature, pH, and salt concentration (1, 18, 25, 38, 64, 66). Thus, it has been suggested that increased protein glycosylation may stabilize extracellular proteins in these harsh environments (1, 18, 25, 38, 64, 66). In this light, it is not surprising that many crenarchaeal viruses apparently encode glycosyltransferases.
Glycosyltransferases are an enormous and extremely diverse class of enzymes that are thought to have evolved from a common ancestor, particularly those of the GT-A superfamily (20). As glycosyltransferases are ubiquitous to the three domains of life, they probably predate the evolutionary events that established these domains. Likewise, there is accumulating evidence that STIV is part of a viral lineage that predates the split of the three domains. Identification of a glycosyltransferase in STIV extends the observed similarities among double-stranded DNA viruses inhabiting the three domains of life beyond those of capsid architecture and the conserved fold of the major capsid protein. The last virus common to the STIV/PBCV-1 lineage is likely to have carried a glycosyltransferase gene in its genome. With the development of immunity, however, descendant viruses infecting multicellular eukaryotic organisms would have found it advantageous to lose this feature, relying instead on enzymes supplied by their hosts.
In summary, the structure of A197 provides substantial insight into its function and its relevance to the STIV life cycle. It is yet another demonstration that structure can reveal functional and evolutionary relationships.
|
|
|---|
, K. Schulstad, J. Hilmer, L. Liepold, B. Weidenheft, T. Douglas, M. Young, and B. Bothner. 2006. Characterization of the archaeal thermophile Sulfolobus turreted icosahedral virus validates an evolutionary link among double-stranded DNA viruses from all domains of life. J. Virol. 80:7625-7635.This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»