Previous Article | Next Article ![]()
Journal of Virology, August 2006, p. 7625-7635, Vol. 80, No. 15
0022-538X/06/$08.00+0 doi:10.1128/JVI.00522-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
,3
Katie Schulstad,1
Jonathan K. Hilmer,1
Lars Liepold,1
Blake Weidenheft,2
Reza Khayat,4
Trevor Douglas,1
Mark J. Young,2 and
Brian Bothner1*
Department of Chemistry and Biochemistry, Montana State University, Bozeman,1 Departments of Microbiology and Plant Sciences, Montana State University, Bozeman,2 Department of Microbiology, Montana State University, Bozeman, Montana 59717,3 Department of Molecular Biology, The Scripps Research Institute, La Jolla, California 920374
Received 13 March 2006/ Accepted 17 May 2006
|
|
|---|
|
|
|---|
STIV was isolated from Sulfolobus enrichment cultures that were established from a high-temperature acidic hot spring (
80°C, pH
3) in Yellowstone National Park (38). The virus was subsequently shown to infect virus-free isolates of Sulfolobus solfataricus strain P2, for which the complete genome has been sequenced. The electron cryomicroscopy (cryo-EM) reconstruction revealed a capsid with pseudo-T=31 icosahedral symmetry that is composed primarily of a 37-kDa major capsid protein, plus at least three additional capsid proteins at the fivefold vertices (38). Two features in particular stand out: the turrets, which are proposed to function in host recognition and DNA translocation, and two electron-dense layers sandwiched between the protein capsid and the packaged genome that may be composed of lipids (30) (Fig. 1). While not common, internal lipid layers are present in a number of dsDNA viruses. Paramecium bursaria Chlorella virus 1 (PBCV-1), which replicates in unicellular Chlorella-like green algae (41, 46), and the bacteriophage PRD1 (13) have a lipid layer beneath the glycoprotein capsid shell. The spherical halophilic euryarchaeon Haloarcula hispanica virus (SH1) also has an internal lipid layer that is selectively acquired from the host (4).
![]() View larger version (113K): [in a new window] |
FIG. 1. Model of an STIV particle. Shown is a cutaway view of the T=31 icosahedral capsid of STIV based on cryo-EM reconstruction (30). Extending from each of the fivefold vertices are turret-like projections. The protein shell is blue and has been removed from one quarter of the particle to reveal the inner lipid layers (yellow). The capsid diameter is 70 nm, and the turrets extend 13 nm above the surface.
|
Characterization of a virus necessarily involves the identification and analysis of the components that assemble to form the particle. Proteomics-based approaches are a powerful tool for dissecting macromolecular complexes such as virus particles (44). In this report STIV particle composition was characterized using mass spectrometry. Nine viral and two host proteins in purified preparations of STIV were identified by mass spectrometry. One of the viral proteins originates from a noncanonical reading frame, confirming that standard translation rules are not sufficient to generate the entire proteome of archaeal organisms. Even though primary sequence-based searches failed to find homologous proteins, fold recognition-based searches suggested potential roles for many of the virion-associated proteins. Structural prediction indicates that four of the proteins are likely to be part of the turrets, including an ATPase involved in DNA packaging. We confirm the presence of an internal lipid layer and show that the virus selectively incorporates a subset of total host lipids. We also demonstrate that the major capsid protein (B345) is glycosylated.
|
|
|---|
6,000 x g for 10 min), and the supernatant was filtered through a 0.22-µm Seritop filter (Millipore). The filtrate was concentrated over Amicon membranes (Millipore) with a molecular mass cutoff of 100 kDa until the retained volume was
15 ml. The filter retentate was subsequently loaded onto Cs2SO4 (39%, wt/vol) and centrifuged at 247,600 x g for 16 h. Virus bands were removed and dialyzed against 50 mM citrate buffer (pH 3.2). Purity was gauged by banding characteristics on Cs2SO4 gradients, negative-stain transmission electron microscopy, and UV-visual spectroscopy. SDS-PAGE analysis and in-gel digestion for protein identification. Purified virus was denatured and separated on 4 to 20% SDS-PAGE gels and stained with Coomassie brilliant blue R250 (Bio-Rad), and all visible protein bands were excised. The gel slices were destained using 50% acetonitrile in 50 mM ammonium bicarbonate (pH 7.9) and vacuum dried. Samples were rehydrated with 1.5 mg/ml dithiothreitol (DTT) in 25 mM ammonium bicarbonate (pH 8.5) at 56°C for 1 h, subsequently alkylated with 10 mg/ml iodoacetamide (IAA) in 25 mM ammonium bicarbonate (pH 8.5), and stored in the dark at room temperature for 1 h. The pieces were subsequently washed with 100 mM ammonium bicarbonate (pH 8.5) for 15 min, washed twice with 50% acetonitrile in 50 mM ammonium bicarbonate (pH 8.5) for 15 min each, vacuum dried, and rehydrated with 4 µl of proteomics grade modified trypsin (100 µg/ml; Sigma) in 25 mM ammonium bicarbonate (pH 8.5). The pieces were covered in a solution of 10 mM ammonium bicarbonate with 10% acetonitrile (pH 8.5) and incubated at 37°C for 16 h. Solution digestion of viral capsid proteins using thermolysin and LysC followed by trypsin was also performed. The purified capsid buffer consisted of 15 µl 50 mM Tris (pH 8.8) and 10 µl 1.5 M NaOH. This buffer was added to 200 µl of virus in citrate buffer for a final pH of 8.4, the optimum alkalinity for LysC. A volume of 1 µl of LysC (proteomics grade; Sigma) was added to 10 µl of virus solution in the presence or absence of 8 M urea, DTT, and IAA; the reaction proceeded at 37°C for 12 h. A threefold volume of 50 mM Tris (pH 8.8) was added, followed by the addition of 2 µl of trypsin and 1.4 µl of 0.1 M CaCl2. The samples were incubated at 37°C for 2 h.
MALDI analysis.
The matrix for matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass spectrometry was prepared by mixing twice-recrystallized
-cyano-4-hydroxy cinnamic acid in a solution of 50% acetonitrile with 0.1% trifluoroacetic acid. Volumes of 1 µl sample and 0.7 µl matrix were spotted onto an aluminum MALDI-TOF plate and allowed to air dry. Calibration was performed using bradykinin and insulin peptide standards. Spectra were collected by using a Biflex III MALDI-TOF mass spectrophotometer (Bruker Daltonics, Billerica, MA) and internally calibrated by using known tryptic digest peptides. Results were compared with theoretical trypsin digests of viral ORFs by using a protein analysis work sheet from ProteoMetrics Inc. (http://65.219.84.5/paws.html).
Nanospray LC-MS/MS. Peptides eluted from the in-gel digests were analyzed on an integrated Agilent 1100 liquid chromatograph (LC) (XCT-plus) mass-selective detector (MSD) controlled by ChemStation LC 3D (Rev A.10.02). The Agilent XCT-plus ion trap mass spectrometer is fitted with an Agilent 1100 CapLC and nano-LC sprayer under the control of MSD trap control version 5.2 Build no. 63.8 (Bruker Daltonic GmbH). Injected samples were first trapped and desalted isocratically on a Zorbax 300 SB- C18 precolumn (5 µm, 5x 300-µm inside diameter; Agilent) for 5 min with 0.1% formic acid delivered by the auxiliary pump at 3 µl/min. The peptides were then reverse eluted from a precolumn and loaded onto an analytical C18 capillary column (15-cm by 75-µm inside diameter, packed with 3.5-µm Zorbax 300, SB C18 particles; Agilent) connected in-line to the mass spectrometer with a flow of 300 nl/min. Peptides were eluted with a 15 to 50% acetonitrile gradient over 50 min. Data-dependent acquisition of collision induced dissociation tandem mass spectrometry (MS/MS) was utilized. Parent ion scans were run over the m/z range of 400 to 2,000 at 8,100 m/z-s. MGF compound list files were used to query an in-house database using Biotools software version 2.2 (Bruker Daltonic) with 0.2-Da MS/MS ion mass tolerance. Samples were run in parallel on a Waters Q-TOF Premier mass spectrometer equipped with a nano-Acquity LC system and MassLynx 4.0 software under similar conditions.
Homology searches for the STIV structural proteins.
The amino acid sequences of the identified proteins were analyzed using BLAST and PSI-BLAST (40) homology searches against the nonredundant protein database at NCBI. An automated procedure implemented in SAM (28) was used to align all identified sequences and to generate profile hidden Markov models. Sensitive profile-profile comparisons (43) were used for alignments with known structures. Putative transmembrane domains in each structural protein sequence were predicted by TMpred (http://www.ch.embnet.org/software/TMPRED_form.html) and DNASTAR (Lasergene Inc., Madison, WI). Signal peptide prediction for each protein was performed using SignalP 3.0 (http://www.cbs.dtu.dk/services/SignalP/) and Psort 2.0 (http://bioweb.pasteur.fr/seqanal/interfaces/psort2-simple.html) against the SwissProt database, with each search submission consisting of less than 80 amino acids taken from the N terminus of the protein. Sequence repeats were analyzed using EMBL-EBI rapid automatic detection and alignment of repeats (http://www.ebi.ac.uk/Radar/). Protein fold recognition was done using one-dimensional (1D) and three-dimensional (3D) sequence profiles coupled with secondary structure information as implemented in the 3D position-specific scoring matrix (PSSM) (29) (http://www.sbg.bio.ic.ac.uk/
3dpssm/index2.html).
Protein modeling and docking into the cryo-EM map. Sequence profiles generated as described above were used for profile-profile comparisons (43) with various databases of protein families, including the collection of protein structures in PDB (15). Alignments obtained in this fashion were used for 3D modeling (39). Based on model evaluations (39), the alignments were adjusted manually and used for additional cycles of model building until no further improvements were observed. Structural models of B164 were docked into a reconstructed cryo-EM map using 3SOM (11).
Glycoprotein analysis. The Pro-Q Emerald 488 glycoprotein gel and blot stain kit (P21875; Molecular Probes) were used to visualize glycosylated proteins in 1D and two-dimensional (2D) gels. Briefly, purified virus solutions were adjusted with 1/4 volume of 4X Pro-Q buffer. The samples were then boiled for 2 min at 100°C before being loaded onto a 4 to 20% SDS-PAGE gel. Molecular weight markers, the CandyCane glycoprotein molecular weight marker (Molecular Probes; C21852), and SDS-PAGE standard broad range (Bio-Rad) were used as standards. The gel was run for 44 min at 120 V, rinsed in distilled H2O (twice for 10 min), and fixed with a solution of 50% methanol and 5% glacial acetic acid (twice for 1 h). The gel was then washed twice with 3% glacial acetic acid and incubated in oxidizing solution (component C, Pro-Q Emerald glycoprotein gel and blot stain kit) for 40 min. The gel was washed again (three times for 15 min). The gel was stained in the dark using 25 ml of Pro-Q Emerald 488 staining solution for 4 h, followed by two more 15-min washes. Glycoprotein gels were analyzed using a Typhoon scanner (blue laser, 488 nm) and stained using Sypro Ruby (Bio-Rad) and Coomassie brilliant blue R250 (Bio-Rad).
For an analysis of potential N glycosylation 20 µl of STIV and S. solfataricus P2 was solubilized in 7 M urea-2 M thiourea-4% CHAPS {3-[(3-cholamidopropyl)-dimethylammonio]-1-propanesulfonate}-50 mM Tris-HCl (pH 8.8) buffer. Solubilized capsid proteins were acetone precipitated and resuspended into 40 µl phosphate-buffered saline (PBS) buffer, pH 7.4. An equal volume of glycoprotein-denaturing buffer (BioLabs) was added. The solution was boiled at 100°C for 10 min. A 1/10 volume of NEBuffer for G7 (BioLabs) was added, followed by a 1/10 volume of 10% NP-40 (BioLabs). Of this solution, 40 µl was kept as the enzyme control. The remaining solution received 2 µl of the enzyme PNGase F (BioLabs). This digestion solution was then split, with one half being incubated at 37°C for 1 h. To test for the presence of O glycosylation, STIV and S. solfataricus P2 proteins were resuspended in 50 mM PBS, pH 7.4, mixed with 2 µl of O-glycosidase (Roche Applied Science), and incubated at 37°C overnight.
Lipid analysis. Viral and host cell lipids were extracted and analyzed as described previously (18). Briefly, each sample was acetone precipitated and the air-dried pellet was solubilized with chloroform-methanol (1:3). Virus samples were diluted 100-fold and analyzed by direct infusion electrospray in both positive and negative modes on a micro-TOF (Bruker Daltonics). Instrument settings were as follows: capillary, 4,000 V; capillary exit, 240 V; dry gas (N2), 3 liters/min; temperature, 200°C. The negative mode had a more intense and consistent signal in the expected m/z range for tetraether lipids (900 to 2,000), as previously reported (33).
|
|
|---|
40 and 45 kDa on the 2D gel (spots 17 to 20 and 8 to 15, respectively) represent multiple forms of two viral proteins (Fig. 2B). The isoforms differ in pI, but their molecular weights are the same or nearly the same. Posttranslational modifications which alter the pI but do not significantly change the molecular weight offer a possible explanation for the observed pattern. Nomenclature for STIV proteins is based on the reading frame (A, B, or C) and predicted number of amino acids. To investigate this further, the major capsid protein of STIV, B345, was expressed in Escherichia coli and this protein was analyzed with a 2D gel (30). The E. coli-expressed protein comigrated with the most basic isoform of B345 found in infectious virus particles (spot 14). Analysis of the spot pattern using the protein modification screening tool ProMost (23) (http://prometheus.brc.mcw.edu/promost/) indicated that differential phosphorylation and/or modifications to basic side chains were likely low-molecular-weight posttranslational modifications that could account for the observed gel shifts. Alternative explanations include heterogeneous glycosylation with sulfate or acidic carbohydrates and isoelectric focusing artifacts.
![]() View larger version (48K): [in a new window] |
FIG. 2. Analysis of STIV capsid proteins by SDS-PAGE. Purified virus was denatured and separated (A) on a 4 to 20% SDS-PAGE gel or (B) on isoelectric focusing (3 to 10 nonlinear) gel strips and then separated in the second dimension on an 8 to 18% SDS-PAGE gel. Gels stained with Coomassie brilliant blue and individual protein spots and bands were then excised and digested with thermolysin and/or trypsin, and the peptides were analyzed using mass spectrometry. The unmodified major capsid protein expressed in E. coli runs similarly to 14 (*).
|
|
View this table: [in a new window] |
TABLE 1. Virus-associated proteins
|
![]() View larger version (32K): [in a new window] |
FIG. 3. Location of capsid proteins on the STIV genome map. ORFs are named according to frame (A to F) and number of predicted amino acids. Proteins identified by protease mass mapping (solid black arrows), in general, cluster together. There are eight proteins from annotated ORFs and one additional protein, A78. This protein lacks a start methionine and was not included in the original annotation. The map was created using Vector NTI Advance 10.1.1.
|
Structural predictions of the first 347 N-terminal residues of C557 show high similarity to an ankyrin repeat (3D PSSM confidence > 95%) that mediates protein-protein interaction (16) and the presence of an
30-amino-acid repeat. The sequence (VEKVIQKDITHPPKFYLPPVHLPNIHQIEAGIGH) is repeated three times between residues 32 and 130 (see supplemental material at www.chemistry.montana.bothner/jvi_stiv2006). Each repeat contains a conserved YLP motif (underlined), which is involved in protein-protein interaction in many organisms. For example, this motif has been found in receptor tyrosine kinases and members of the ErbB family of human epidermal growth factor receptors (20, 50). The C-terminal region is marked by a proline- and serine-rich motif, the functional significance of which is currently unknown. In an attempt to determine relative capsid composition a densitometry analysis of the 1D gel using ImageJ was performed. This shows a 15:1 ratio between the major capsid protein (B345) and C557. However, this ratio may not be accurate due to posttranslational modifications (see below).
PSI-BLAST (40) searches with B164 identified significant matches to a large class of P-loop ATPases. Profile-profile comparisons (43) showed the most significant similarity (E = 1 x 1033) to poxvirus A32 proteins (Pfam accession number PF04665; Fig. 4), which are thought to be involved in viral DNA packaging (10). This group of proteins, however, has no close homologue of known structure. For that reason, several less-related ATPases (with E values between 107 and 1013) were used as templates for B164 model building in order of their match significance (39). After several cycles of manual realignment and building, we generated models that were of good quality regardless of the starting template. It should be pointed out, however, that even though these models are likely to have the correct overall fold, they are not expected to be atomic-quality models because the starting templates were considerably different at sequence level from B164. Each of these models was fitted into the cryo-EM reconstruction of STIV (30) by surface overlap maximization (11). This procedure is specifically designed to place partial structures or models into low-resolution density maps by maximizing surface overlap. All models docked into the same general area at the base of the turret on the fivefold axis of symmetry. Depending on the template used for modeling, the surface overlap between models and the map was between 0.6 and 0.78, with the corresponding correlation coefficients in the 0.5 to 0.6 range. Importantly, when we attempted to dock more than 12 models at a time, the new solutions overlapped with the existing ones, thus indicating that the position at the base of turrets is the only one compatible with B164 models.
![]() View larger version (56K): [in a new window] |
FIG. 4. B164 alignment with several proteins from the family Poxviridae. SwissProt accession numbers are separated by an underscore from species abbreviations (FOWPV, Fowlpox virus; 9POXV, Vultur gryphus poxvirus; POXVV, Vaccinia virus; MCV1, Molluscum contagiosum virus subtype 1; SWPV, Swinepox virus; YMTV, Yaba monkey tumor virus; RPOXV, Rabbit fibroma virus). Residues are shaded according to 90% consensus, with white letters on black background signifying perfect conservation. Abbreviations on the consensus line are as follows: h, hydrophobic; s, small; l, aliphatic, b, big; a, aromatic; c, charged.
|
Docking of the X-ray model of B345 into the cryo-EM reconstruction revealed that the C terminus of this protein interacts with the proposed lipid layer (30). In addition, a search using TMpred indicates that STIV proteins B130 (residues 110 to 127), C381 (354 to 373), C557 (365 to 383 and 538 to 556), and A55 (37 to 54) also possess possible transmembrane domains. However, known sequence motifs for signal peptides in bacteria and eukaryotes were not detected by either Psort version II or SignalP v3.0. This does not appear to be due to the archaeal origin of these proteins as these algorithms have proven valuable in accurately identifying both transmembrane regions and signaling domains in large-scale analysis of archaeal species (6).
Glycosylation of STIV proteins. The description of a glycosyltransferase in the genome of STIV (31) prompted an investigation into the glycosylation status of virion-associated proteins. We tested for glycosylation by staining 1D and 2D gels with the carbohydrate-specific fluorescent stain Pro-Q (Fig. 5). The band corresponding to the major capsid protein was clearly positive for glycosylation. In an attempt to determine the nature of the protein-polysaccharide linkages, samples were treated with enzymes specific for O- and N-link glycosidic bonds. A variety of reaction conditions were used, without significant deglycosylation (lanes 4 and 5 in Fig. 5). Control reactions using total isolated SP2 protein indicate that the enzymes were active under the reaction conditions used, and both O- and N-link treatments led to dramatic reductions in the overall level of glycosylation (lanes 7 and 8 in Fig. 5). At the present time, the nature of the viral carbohydrates is not known. It is possible that, due to the highly stable nature of these thermophilic proteins, attempts at denaturation where not successful or that these virus-associated glycans are not recognized by standard enzymes. Interestingly, glycosylation has been shown to increase protein thermal stability (47) and modulate recognition and is present on Vp54 from PBCV-1.
![]() View larger version (87K): [in a new window] |
FIG. 5. Glycosylation of STIV proteins. Purified virus and whole-cell lysates of S. solfataricus P2 were digested with O- or N-deglycosidase enzymes and loaded onto a 4 to 20% SDS-PAGE gel. The proteins were analyzed for glycoproteins and visualized using a Pro-Q glycoprotein stain. CC Mr, CandyCane glycoprotein marker. The glycosylated major capsid protein ran at 41 kDa (lanes 3, 4, and 5). The band at 35 kDa is the N-link-specific glycosidase and not a deglycosylated product (lane 5). In contrast to the viral proteins, total Ssp2 proteins show a significant reduction in glycosylation after treatment (lanes 7 and 8).
|
![]() View larger version (22K): [in a new window] |
FIG. 6. Total lipid extracts from host and viral membranes. Lipids isolated from host cells (upper) and STIV (lower) were analyzed by negative-mode electrospray mass spectrometry. Viral lipids are a subset of those found in the host.
|
A close-up view of the peak at m/z 1,858 reveals the isotope pattern found in the STIV lipid (Fig. 7A). The monoisotopic peak for the major component was at m/z 1,858, and there was a minor component at m/z 1,856. The unit molecular weight suggested that this is either PGL1 or PGL2 with four cyclopentane groups (C104H194O24P1). However a closely related neutral molecule with a chloride molecule attached (C104H190O24Cl1) differs by only 0.0362 amu. Theoretical isotope distributions for these molecules clearly show the effect chloride has on a molecule (Fig. 7B and C). The influence is largest 2 mass units above the monoisotopic peak. A comparison of the STIV sample and the two theoretical spectra reveals that this is an acidic lipid. The measured m/z of 1,858.3745 ± 0.009 also matches that for the acidic species (m/z 1,858.3703) but not that for the neutral lipid (m/z 1,858.3341). The second major component of the STIV lipid layer, m/z 1,534.2670 ± 0.0077, is consistent with PL1 having six cyclopentane groups (C92H170O14P1; m/z 1,534.2647). The expected isotope ratio also matches that for the acidic species (Fig. 8). The structures of these interesting cyclic lipids are shown in Fig. 8C. Small numbers of both PGL1/2 and PL1 molecules have higher degree of cyclopentane incorporation. Each cyclized component reduces the mass by two, and these minor species are seen to the left of the monoisotopic peak in Fig. 7 and 8.
![]() View larger version (17K): [in a new window] |
FIG. 7. Experimental and theoretical isotope distributions for the lipid at m/z 1,858. A) Close-up of peak at m/z 1,858 in the STIV sample. The minor component present at m/z 1,856 has an additional cyclopentane group in the iospranyl chain. Based on the unit molecular weight, this could be PGL1 or PGL2 with four cyclopentane groups (B). However, a closely related neutral molecule that has become ionized by chloride differs by only 0.0362 amu (C). Theoretical isotope distributions for the acidic (B) and neutral (C) lipids clearly show the effect chloride has on the pattern. The influence is largest 2 mass units above the monoisotopic peak. A comparison of the STIV sample and the two theoretical spectra reveals that STIV has the acidic lipid. The measured m/z of 1,858.3745 ± 0.009 also matches that for the acidic species (1,858.3703m/z) but not that for the neutral molecule plus Cl (1,858.3341m/z).
|
![]() View larger version (14K): [in a new window] |
FIG. 8. Isotope pattern for the ion at m/z 1,534. Shown are the experimental (A) and theoretical (B) patterns for the STIV predominant lipid molecule. The acidic lipid has a measured mass that is consistent with PL1. Approximately 30% of this lipid is present with an additional cyclopentane unit (m/z 1,532), and a small amount has a second cyclopentane unit (m/z 1,530). (C) Lipids found in STIV. The cyclic tetraether lipids found in Sulfolobus have the general backbone shown above. The number of cyclopentane groups per chain is variable.
|
|
|
|---|
Proteomic analysis of gradient-purified virus particles in solution and after 1D and 2D gel analysis led to the identification nine virally encoded proteins. One of these proteins, A78, lacks a canonical initiation signal and was not listed in the original genome annotation (38) (Fig. 3). Recoding of genetic information is well documented in archaea, including the use of nonstandard amino acids, frameshifts, read-through of amber codons, and alternative sites for the initiation of transcription (12, 26). To assure that virally expressed proteins were not overlooked, we reformatted our protein sequence database such that ORFs began immediately after stop codons and allowed for read-through products of the amber codon. The identification of A78 illustrates the utility of a proteomics-based approach. Due to the short lengths of the A78 ORF (234 bp) and the preceding A55 ORF (165 bp), these two ORFs may encode a single protein in which the C terminus is a read-through product. It is noteworthy that these ORFs are separated by an amber stop codon, a common site of recoding in archaea (26). The combined ORF has been tentatively annotated A137b, and this is under further investigation.
The genes coding for seven of the nine virally encoded proteins detected in the particle (Table 1) are clustered together on the viral genome map (Fig. 3). The sequential order of these proteins is indicative of a shared transcriptional regulatory mechanism, likely in an operon organization. The use of operons to express and regulate related clusters of genes is common in Archaea and has also been observed in other viruses that replicate in S. solfataricus (36).
A223 and C381 had apparent molecular masses of
200 kDa on 1D SDS-PAGE reducing gels (Fig. 2A); however, their predicted masses are 24 kDa and 42 kDa, respectively. These proteins have significant hydrophobic and aromatic compositions: A223, 42% and 10%; C381, 24% and 12%. Their anomalously high molecular masses in 1D gel analysis suggest that they may reside in a complex that resists disruption. This behavior has been described for viral structural proteins from mesophilic organisms, such as PBCV-1 (34). Under the conditions used for 2D gel analysis, the proteins were detected close to their expected masses (Fig. 2B), lending support to the presence of highly stable noncovalent multimers.
It is tempting to speculate that a pentameric arrangement of C381 and A223 would be
320 kDa and could be involved with formation of the turret structures present at the fivefold vertices. The turret structures are the most striking feature of STIV (Fig. 1). It has been suggested that these are used for cell entry and DNA translocation (30, 38). As described above, proteins A223 and C381 form highly stable oligomers. Results from the fold recognition algorithm PSSM (29) indicate that both A223 and C381 are structurally similar to the vertex protein P5 of PRD1 (32). Since the vertex of PRD1 is responsible for host cell binding and dsDNA delivery (21), we propose that putative turret components A223 and C381 have similar functions. Because icosahedral averaging was used during refinement of the cryo-EM data, there is currently no evidence to suggest that a unique vertex is present, as has been shown for PRD1 (21).
A third STIV protein possibly involved in turret formation is C557, which has a predicted molecular mass of 58.6 kDa. As described above, this protein has three repeats of a 30-amino-acid sequence in the N terminus and a second proline-rich repeat in the C terminus along with two potential transmembrane domains. Based on the size of this protein and the structural model of the capsid, C557 would be expected to be part of the turret. Threading results indicate that an ankyrin-like fold is likely (PSSM score > 95%) for the N-terminal repeat region. This fold is abundant in the genomes of a wide range of viruses (2, 3, 27); however, to our knowledge it has not been found in a structural model of archaeal proteins to date. It is of interest to note that the topology of ankyrin domains resembles a curved cylinder, like the petals attached to the turrets. The estimated mass of the turret structure, minus the cementing protein, is 637 kDa (30). We propose that each pentamer subunit is composed of one copy of C557, C381, and A223. Together, these three proteins have an expected mass of 125 kDa, close to the 127 kDa estimated from the electron density present at each fivefold vertex. The ratio of C557 to the major capsid protein (B345), based on protein gel densitometry, was determined to be 1:15. This ratio is not consistent with the predicted ratio of
1:30, based on the T=31 architecture of the STIV capsid. This underrepresentation of B345 is likely due to inefficient protein staining because of the glycosylation and the fact that on a molar basis large proteins stain more intensely than small ones (19).
Both sequence- and structure-based analyses suggested that B164 is a virally encoded ATPase. This 19-kDa protein is highly basic and has a nucleotide binding sequence motif found in all ATPases. This was confirmed by model building based on four known ATPase structures, each of which gave good models. Docking experiments consistently positioned a single copy of the protein at the base of the turret complex. It should be noted that we used homology models of B164, not actual B164 structures, and that the map was of relatively low resolution (30). This leaves some uncertainty about the fine details of B164 position within the capsid. This issue will be addressed when the B164 structure or better-homology models become available. Nevertheless, we are confident that the placement of B164 is largely accurate to the extent allowed by our homology models and the resolution of the map. There are several indirect lines of evidence that support this observation. First, this position of B164 is consistent with the expected location of a protein involved with nucleic acid packaging and/or release (1, 10). Second, profile-profile comparisons (43), arguably the most sensitive sequence-based approach to remote homology detection, identified putative viral packaging ATPases (10) as the closest relatives of B164 (Fig. 4), further strengthening the possibility that B164 may be involved in nucleic acid packaging. Finally, the central axis of the turret lacks substantial density except at the ends and its channel has a 3-nm opening (38) that could accommodate dsDNA, again consistent with the transport of nucleic acid (8, 21, 38).
Two host proteins were consistently found to copurify with STIV particles. SSO7D is a small (7-kDa), basic (pI 10.12) DNA-binding protein (22) that was identified with high confidence. We speculate that this protein plays a direct role in stabilizing and packaging the viral DNA within the virion. It has been suggested that the same 7-kDa protein plays the role of a histone-like protein in association with the S. solfataricus genome. A protein of similar size and charge, VP2, is found in purified preparations of the SSV1 (S. solfataricus spindle-shaped virus 1) particle (52). VP2 binds viral DNA in a cooperative manner and is suspected of playing a role in packaging the viral genome. The role of VP2 in packaging was very compelling until homologues were not found in other SSV genomes (49). An explanation for the absence of this protein is that the viruses have usurped a host protein to aid in the packaging of their genomes. Identification of proteins similar to SSO7D in SSV2, SSVKm, and SSVRH particles would provide compelling evidence for this hypothesis. A second host-derived protein, SSO0881, is a conserved hypothetical protein similar to yeast and mammalian VPS24, which has been implicated in cell sorting and trafficking (48, 51). It seems unlikely that the two detected host proteins in STIV preparations are merely contaminants because they consistently cosediment with particles, although this cannot be ruled out. Further experimentation will be required to determine if these host proteins play a functional role in the life cycle of STIV.
The composition of the suspected internal lipid layer of STIV has been unknown. Extraction of lipids from host cells and purified virus particles and subsequent analysis using negative-mode electrospray ionization showed that STIV does contain lipids and that this layer is enriched in a subpopulation of those present in the host (Fig. 6). A similar approach to lipid extraction was used to show that SH1 virus particles from Haloarcula hispanica selectively acquire a subset of lipids from the host pool (4). We were able to make identifications of the two major lipid components found in the STIV particles. Accurate mass measurements and isotope distribution were consistent with the acidic tetraether molecules PL1 and PGL1-like (Fig. 8) (33). The major capsid protein, B345, has a basic C-terminal helix that likely interacts with the inner lipid leaflet. The acidic nature of the lipids is therefore consistent, as this would stabilize the protein-lipid interface. This is also consistent with the lipid and protein charge complementation described for PRD1 (13).
The biological function of the internal lipid layer of STIV is not known. Liposomes composed of tetraether lipids such as those isolated from the particles are thermally stable (45), and this layer could help to protect the viral genome. Alternative roles for the lipid could be in particle assembly and or disassembly. The internal lipid layer of PRD1 is believed to function as a mediator of capsid assembly, recruiting structural proteins to the lipid surface (1). As described above, four of the viral proteins identified in this study contain potential transmembrane domains (C557, A55, C381, and B130), and B345 is known to interact with the lipid (30). Interactions between the protein domains and a preassembled lipid layer could initiate formation of the protein capsid, or, conversely, the protein domains could serve as nucleation sites for formation of the lipid layer.
The exterior surface of the virus particle formed by B345 is the likely region for glycosylation. Protein glycosylation has a number of functional roles, one of which is to increase thermal stability (47). Consequently it is not surprising that many of the proteins found in S. solfataricus are glycosylated (17). We have shown here that B345 is glycosylated (Fig. 5). Based on SDS-PAGE migration, B345 has a mass of
41 kDa whereas the expected mass for the apoprotein is 37,790 Da. Even though the protein contains carbohydrate, it runs as a single band on a 1D gel, whereas 2D analysis clearly demonstrates that differentially charged forms are present. We cannot rule out the possibility that the multiple forms of B345 visualized on the 2D gels are an artifact of the technique. The exact sites of glycosylation are not known; however, predictions using standard rules map the potential sites to the surface of B345 that faces the exterior of the capsid. B345 and VP54, the major capsid protein of PBCV-1 (33), have strikingly similar tertiary structures (30), and the predicted sites of glycosylation on B345 overlay well with the identified positions on VP54. Interestingly, the sites of glycosylation on VP54 are noncannonical, and the PBCV-1 genome contains ORFs encoding glycosyltransferases. It has now been confirmed that STIV also encodes a glycosyltransferase (31). As reported here, deglycosylation experiments failed to identify the nature of the carbohydrates on B345. Taken together, these results suggest that B345 of STIV may have a novel form of glycosylation. The purpose of encoding a glycosyltransferase in the viral genome is unclear. This may be due to protein/carbohydrate structure or may be a remnant of an ancestral pathway of glycosylation.
The detailed characterization of STIV particles involved the analysis of protein, lipid, and carbohydrate. Assembly of this data has revealed a picture of STIV akin to tail-less dsDNA viruses from the other domains of life. The evolutionary relationship of PBCV-1, PRD1, SH1, and STIV was not immediately apparent upon comparison of their genomes. Separation on an evolutionary time scale will lead to sequence divergence, but in the case of viruses this is exaggerated by necessary adaptations to unique viral lifestyles, unique environmental parameters (variable pH, temperatures, and salinity), variable host G/C ratios, and different mechanisms of transcriptional and translational regulation. It is therefore not surprising that the relationships between viral proteins from different domains of life can be reliably detected only at the level of structure (9, 30). While this makes the annotation of viruses much harder, it is deeply satisfying to uncover these shared architectural principles that have persisted both at the level of individual proteins and whole viral capsids over millions of years (5). The continued analysis of archaeal viruses, such as STIV and SH1, continues to strengthen the evidence for the existence of a common viral ancestor that predates the divergence of Eukarya, Bacteria, and Archaea.
We thank Rachel Neal for assistance with the mass spectrometry.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»