Previous Article | Next Article ![]()
Journal of Virology, November 2002, p. 11645-11658, Vol. 76, No. 22
0022-538X/02/$04.00+0 DOI: 10.1128/JVI.76.22.11645-11658.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907-13921
Received 23 May 2002/ Accepted 8 August 2002
|
|
|---|
-helical coiled coils that were consistent with T=4 symmetry. The known structure of the capsid protein was fitted into the density corresponding to the nucleocapsid, revising the structure published earlier. |
|
|---|
Alphaviruses are among the simplest of enveloped viruses on account of both their relatively small genome and their icosahedral symmetry. They have a number of structural and functional similarities to flaviviruses, suggesting a possible common primordial origin of some of their components (24, 27, 37, 52), although their genomes differ significantly. Alphaviruses have been favored for structural studies because of the higher yields compared with other enveloped RNA virus propagations. Many alphaviruses have the further advantage for laboratory studies that they mostly produce inapparent or mild symptoms in human hosts, compared with the frequently severe effects of flavivirus infections.
Alphaviruses have a positive-strand RNA genome with about 11,700 bases enclosed in a protein core consisting of 240 identical subunits. A subgenomic mRNA encodes a polyprotein that is posttranslationally cleaved into five individual proteins: capsid, E3, E2, 6K, and E1 (6, 17, 40, 49). Glycoproteins E1 and E2 assemble into heterodimers that are glycosylated in the Golgi and are transported to the plasma membrane. During membrane trafficking, the E1-E2 heterodimers organize themselves into 80 (E1-E2)3 trimeric spikes to form an icosahedral scaffold on the surface of the virus (Fig. 1) (14, 27, 37). The nucleocapsid cores are assembled in the cytoplasm of infected cells and are then transported to the plasma membrane, where they interact with the cytoplasmic domains of the glycoprotein spikes. During the budding process of the virion from the cell, the glycoproteins and a part of the plasma membrane are used to form the viral envelope to produce virions in which both the internal core and external glycoproteins have matching T=4 icosahedral quasisymmetry (7, 8, 15, 33, 57). The mature Sindbis virus is composed only of the E1, E2, and capsid proteins.
![]() View larger version (122K): [in a new window] |
FIG. 1. CryoEM density distribution for Sindbis virus. Top left, surface of the virus at 20-Å resolution. The red triangle marks the boundary of an icosahedral asymmetric unit. The numbers show the positions of icosahedral two-, three-, and fivefold axes limiting the asymmetric unit. Note that there is one trimeric spike associated with an icosahedral threefold axis plus a second trimeric spike in a general position in the icosahedral asymmetric unit. Top right, cross section through the 11-Å map along the black line shown in the top left and bottom right panels. Note the clearly defined lipid bilayer and the transmembrane domains crossing the membrane. The nucleocapsid (NCP) is seen inside the membrane. Bottom left, a central cross-section through the 11-Å resolution map, showing the glycoproteins (blue), the lipid bilayer (green), the nucleocapsid (red), the mixed RNA-protein region (orange), and the internal RNA (magenta). The orientation of the icosahedral (two-, three-, and fivefold) as well as quasi-threefold (q3) axes is shown in yellow. Below the cross section is shown the radially averaged mean density (blue) and the root mean square deviation of the density from the mean (magenta). Right bottom are radial sections, for the 11-Å resolution map, at r1 = 199 Å (nucleocapsid core), r2 = 243 Å (transmembrane), r3 = 261 Å (base of the glycoproteins), r4 = 298 Å (includes the base of the spike), and r5 = 324 Å (leaf-like structure of E2) with grey scale used to denote density levels, black being the highest density. Note the T=4 organization of the transmembrane region in r2 and r3.
|
The structure of a large fragment of the ectodomain component of E1 has been determined and was suggested to be homologous to the E glycoprotein found in flaviviruses (27), although there is no remaining trace of any common heritage in the amino acid sequence. The overall structure of the flavivirus tick-borne encephalitis virus E protein has three domains (39) (Fig. 2A). The second domain (DII) can be described as a six-stranded ß-barrel with one of its end loops being a peptide required for fusion with the host cell (23, 28). The third domain (DIII) has an immunoglobulin-like ß-barrel fold and may be important for binding to cellular receptors.
![]() View larger version (50K): [in a new window] |
FIG. 2. Comparison of the backbone structures of Semliki Forest virus (SFV) E1 with tick-borne encephalitis virus (TBEV) E. (A) Ribbon diagrams, with domain I in red, domain II in yellow, fusion peptides in green, and domain III in blue. (B) Stereodiagram showing superposition of Semliki Forest virus E1 (blue) on tick-borne encephalitis virus E (red). (C) Alignment of Sindbis virus (SINV) and Semliki Forest virus E1 with tick-borne encephalitis virus E, based on the structural superposition of Semliki Forest virus E1 on tick-borne encephalitis virus E. Residues of tick-borne encephalitis virus E that do not match the structure of Semliki Forest virus E1 have been omitted. Residues in domains I, II, and III are colored red, yellow, and blue, respectively, consistent with the coloring scheme used above and throughout. Shown is the complete amino acid sequence of Sindbis virus and just those residues of tick-borne encephalitis virus that could be structurally aligned with Semliki Forest virus E1. Sequences underlined with a solid bar are ß-strands, and those underlined with dashed bars are -helices. Secondary structural nomenclature is taken from Rey et al. (39).
|
Here, we report a systematic analysis of an 11-Å resolution cryoEM map of a deglycosylated mutant of Sindbis virus. The program EMfit (8, 42, 45) was used to orient and position the known structures of Semliki Forest virus E1 (27), of the Sindbis virus capsid protein (25), and of the GCN4 dimeric coiled coil (12) into the cryoEM map. A difference map showed that the molecular envelope of the E2 monomer is remarkably similar to that of the E1 monomer, suggesting a possible similarity in fold. This difference map also demonstrated the close association of E1 and E2 in the formation of a heterodimer and organization of the trimeric spike. The virus map showed clearly the dimensions of the lipid bilayer and the coiled-coil structure of E1 and E2 in the transmembrane region. When put together, these observations give a fairly complete look at an RNA-enveloped virus structure in quasiatomic detail, allowing some insight into the assembly and fusion processes. The virus structure is one of the first visualizations of a lipid bilayer in situ and also shows that the external icosahedral symmetry of the glycoproteins is in register with the internal symmetry of the nucleocapsid.
|
|
|---|
Electron microscopy.
Small (2.8-µl) aliquots of purified N318Q mutant Sindbis virus sample at a concentration of approximately 6 mg/ml equilibrated in TNE buffer (50 mM Tris-HCl [pH 7.5], 200 mM NaCl, and 1 mM EDTA) were adhered to holey carbon-coated grids and frozen in liquid ethane (3). Images were recorded on Kodak SO-163 films in a Philips CM200 field emission gun transmission electron microscope (Philips, Eindhoven, The Netherlands) under low-dose conditions (
18 e-/Å2) at a nominal magnification of x38,000. Micrographs were digitized on a Zeiss SCAI scanner with 7-µm intervals, and sets of four pixels were averaged to give effective sampling steps of 3.68 Å at the specimen.
A total of 10,868 particles were selected from 27 micrographs whose defocus levels ranged from 1.10 to 2.58 µm underfocus. The self-common-lines method (16) was used to determine an initial model of the virus. The orientation and origin of each particle image were determined through use of the model-based, polar-Fourier transform method (2). Orientations were further refined with a modified cross-common-lines procedure (16), which compares phase differences between the model projections and viral images. The microscope contrast transfer function (CTF) was determined for each viral image (see reference 3 for the definition of CTF). Corrections to compensate in part for the effects of the CTF on each image were applied in Fourier space by modifying structure factors (obs) computed from the image according to the following formula:
![]() |
![]() |
![]() |
The resolution of the final reconstruction density map was estimated by comparing the structure factors computed from two reconstructions, each based on independent halves of the whole image data set. All densities in the two reconstruction maps were used for comparison. The Fourier shell correlation coefficient dropped from 0.6 to 0.29 between 11.4- and 11.1-Å resolution. The phase residues at these resolutions were
40 and 58°, respectively. The final map was calculated from 4,931 viral images to a Fourier limit of 10.0 Å with a parallelized version (C. Xiao, unpublished data) of the EM3DR reconstruction program, which uses the original Fourier-Bessel procedure of Crowther (10). The density map was sharpened through the application of an inverse temperature factor of 1,000 Å2 (21). Some features of the map had negative densities as a result of the modifications and omission of the zero-order Fourier term.
Interpretation of the 11-Å map. All fitting of X-ray crystallographic structures of components was performed with the EMfit program (8, 42, 45). The final atomic coordinates were deposited in the PDB under accession number 1LD4.
|
|
|---|
The fitting process (Table 1) depended upon finding the orientation and position of the Sindbis virus E1 model that simultaneously maximized the fit of the atoms into all four quasiequivalent cryoEM densities and minimized the distance between the center of the glycosylation sites in the map and the corresponding glycosylated Asn residues in the atomic model (45). Both in positioning E1 and also in positioning the nucleocapsid protein (see below), only the C
atoms were used initially, but all atoms were used for the final fitting (Table 1). The positions of the Sindbis virus E1 glycosylation sites for residues 139 and 245 were used as constraints by minimizing their distance from the respective C
atoms when merely fitting the C
backbone or with respect to the Asn N
atoms when fitting all atoms of the Sindbis virus E1 model. The known positions of the glycosylation site for residue 141 in Ross River virus and Semliki Forest virus were also used as a restraint for fitting the Sindbis virus E1 model (37). In this case, the restraint was made with respect to the C
atoms both for fitting the C
backbone and for fitting the complete Sindbis virus E1 model.
|
View this table: [in a new window] |
TABLE 1. Results obtained by fitting the crystallographic structure components into the cryoEM map
|
![]() View larger version (36K): [in a new window] |
FIG. 3. T=4 quasi-symmetry generation depicted diagrammatically in terms of the E1 glycoprotein. E1 molecules related by icosahedral symmetry are all of the same color (purple, red, blue, or green). Quasi-symmetry operators relate molecules of different color. The initial molecule is placed in position 1. The adjacent icosahedral threefold axis (marked by a 3) then generates molecules 2 and 3, and the adjacent quasi-twofold axis (marked by an orange oval) generates molecule 4. A quasi-threefold axis (marked by q3) generates molecules 5 and 6 from molecule 4. Subsequently, the icosahedral fivefold axis (marked by a 5) generates molecules 7 through 18 from molecules 4, 5, and 6, and the icosahedral twofold axis (marked by a 2) generates molecules 19, 20, and 21 from molecules 1, 5, and 7, respectively. Note that molecules 1, 5, 7, 19, 20, and 21 form a hexamer and molecules 6, 15, 16, 17, and 18 form a pentamer, as in the nucleocapsid. The thick black outlined triangle marks one of the 60 icosahedral asymmetric units.
|
|
View this table: [in a new window] |
TABLE 2. Polar angles defining quasisymmetry axesa
|
![]() ![]() View larger version (137K): [in a new window] |
FIG. 4. Stereodiagrams showing the glycoproteins E1 and E2. (A) Surface-shaded representation of the E1-E2 glycoprotein spike at the quasi-threefold axis. Note the pore at the center of the spike, consistent with the results of Parades et al. (34). (B) Fit of E1 monomer C backbone into density (top view) around the quasi-threefold axis, viewed as in A. The molecules, represented by their C backbone, are colored as in Fig. 3. The cryoEM density is grey. (C) E1 i3 and q3 trimers (top view), color coded and oriented as in Fig. 3. Part of the triangular icosahedral asymmetric unit is outlined. The E1 carbohydrate difference densities (139 grey, 245 brown) are also shown, as well as their corresponding C atoms (black circle for 139 and black dot for 245). (D) Heterodimer showing the E1 structure and the E2 difference density (side view). The density corresponding to the lipid bilayer is shown in green. (E) The E2 difference density for molecule 5 (purple) fitted with the E1 C backbone (black). Note the central hole in the E2 density visible in the mauve orientation, probably as a result of an -helix. (F) E2 difference density (side view) around a q3 axis colored blue (molecule 4), mauve (molecule 5), and brown (molecule 6). The E2 carbohydrate difference densities (196 mauve, 318 red) are also shown. The lipid bilayer density is shown in green. The fitted E1 molecules are shown as their C backbones, colored as in Fig. 3.
|
The results of the restrained fitting procedure were essentially the same whether only the C
atoms or all atoms were used (Table 1). There was a remarkably good agreement between the height of the cryoEM density at quasiequivalent positions (Table 3), showing that the properties of the T=4 symmetry operators had been well optimized. The correlation of densities at C
atoms between pairs of quasiequivalent molecules varied from 0.84 to 0.92. The quality of fit, measured in terms of the average density of the fitted atoms (Sumf), was roughly the same for each of the three E1 domains in each of the four quasiequivalent positions (Table 3), demonstrating that the crystallographically determined Semliki Forest virus structure could be fitted to the cryoEM density without resorting to adjustment of interdomain hinge angles. Independent verification of these results was obtained by using the SITUS program (64).
|
View this table: [in a new window] |
TABLE 3. Sumf values of the separate domains when fitting T=4-equivalent molecules into the cryoEM mapa
|
Plots (Fig. 5) confirmed the exact orientation of the quasi-symmetry axes and showed that the transmembrane regions obeyed, within experimental error, the same quasisymmetry as the E1 and E2 surface glycoproteins. These plots also showed that the quasi-threefold axes and quasi-twofold axes were almost radial. The quasisymmetry axes used for the fitting operations (Table 1) were assumed to be radial in order to minimize the number of parameters. Although this assumption is not completely accurate, the very small deviation shown by the plots (Fig. 5) can have only a very small impact on the fitting procedure.
![]() View larger version (18K): [in a new window] |
FIG. 5. Plots of x and y versus z, representing the trace of the quasi-threefold and quasi-twofold axes provided by the positions of the carbohydrate and transmembrane sites. Coordinates are for the mass center of sets of three (tracing the quasi-threefold axis) and two (tracing the quasi-twofold axis) carbohydrate sites and the transmembrane region. Equations for the least-squares fitted lines are shown. The appropriate Sindbis virus carbohydrate sites (318, 245, 139, and 196) and transmembrane site are shown at the top.
|
or N
atom was 15.9 Å and 13.7 Å, respectively. These distances are reasonable to account for about two well-ordered sugar moieties at each glycosylation site. E2 glycoprotein. The cryoEM density associated with E2 was calculated by setting the density to zero at every grid point within a 4.5 Å radius of every atomic position, as determined by the restrained fitting of the E1 structure to the 11-Å resolution Sindbis virus map. Thus, the remaining density outside the lipid bilayer should belong to the E2 glycoprotein. It was immediately obvious that E2 was a long, thin molecule, somewhat similar in shape to and twined around E1 (Fig. 4), but the boomerang-shaped E2 molecule is far more radial in its disposition than is E1. E2 is known to contain the receptor attachment site near residue 218, situated in the large, protruding, external leaf-like surface (51). The external surface also contains the alphavirus glycosylation site 196 (in Sindbis virus) and 200 (in Semliki Forest virus and Ross River virus). The glycosylation site at residue 262 (in Semliki Forest virus and Ross River virus) is at the beginning of the stem leading from the external leaf-like surface. The Sindbis virus glycosylation site at residue 318, nearer the carboxy terminus of E2, is close to the membrane surface (37). Thus, all the E2 amino-terminal residues to at least residue 218 probably form the external large, leaf-like domain. From about residue 218 to the transmembrane region, the E2 polypeptide forms a long, narrow stem running roughly parallel to E1 and entering the viral membrane on the inside of the E1 trimer. The highly exposed leaf-like surface of E2 is followed by the narrower stem, which twists around the more tangentially disposed E1 molecule.
E1 and E2 are in contact with each other at both ends of the heterodimer, leaving a small gap between the two contact regions (Fig. 4D). Three E1-E2 heterodimers lean against each other, making a three-start, right-handed helix in the formation of each spike. The distal end of one heterodimer covers the proximal end of the neighboring heterodimer within the helical spike organization. The leaf-like structures of the three distal ends of the E2 molecules cover the fusion peptides of the three E1 molecules within the trimer. The only E2-to-E2 contacts are between the E2 leafs. There is a gap of about 30 Å between the base of neighboring E1-E2 heterodimers (Fig. 4), which would allow E2 to move out of the center of the spike at low pH to permit the exposure of viral membrane in the middle of each spike during fusion, as suggested by Kuhn et al. (24). At the same time, the three E1 molecules within each spike are reorganized into a trimer (19, 59), altering the surface-accessible residues of both E1 and E2. The fit of E1 is consistent with the suggested conformational changes as probed by proteolytic digestions and antibody accessibility (1, 36). The organization of the trimeric structure has some similarities to that of the flavivirus E and membrane (M) proteins in tick-borne encephalitis virus recombinant subviral particles (13).
Although the model of Sindbis virus E1 based on the Semliki Forest virus C
atoms is unlikely to be accurate, it should be sufficient to determine, at least roughly, which residues of E1 make contact with E2. As described above, a map showing the E2 density was calculated with an atomic radius of 4.5 Å for each of the atoms in the Sindbis virus E1 homology model. In the next step, the average density within a radius of 6.5 Å was calculated for each atom. Those atoms of E1 with large average densities must thus be in contact with E2. The E1-to-E2 contacts at the membrane-proximal ends of the heterodimer involve charged residues in domain III of E1. The E1-to-E2 contacts between heterodimers involve primarily charged residues in domain II. This may account for the conformational changes that occur prior to fusion in an acidic environment (11, 15).
Frequently, a primordial gene is tandemly duplicated, and then each gene evolves separately. A typical example is the three tandem viral proteins VP2, VP3, and VP1 of picornaviruses, which have similar three-dimensional structures but no remaining, detectable sequence similarity (44). Thus, the remarkably similar long, thin shapes of E1 and E2, as well as their tandem organization in alphavirus genomes, might suggest that E1 and E2 have similar folds. Based on this speculation, a three-dimensional orientation search was made with the EMfit program for the best fit of the available Semliki Forest virus E1 backbone structure into the difference density representing E2. The fit to the density was good (Fig. 4E), with a mean C
atom density of 30.3% of the maximum map density (Table 1). However, the tip of domain II penetrates the outer leaflet of the lipid bilayer. The central portion of the density ascribed to E2 looks like a donut in which the loops surround areas of low density (Fig. 4D, E, and F). The fit of the assumed E2 structure placed an
-helix into one of the loops. The same helix, when fitted to the E1 density, showed a similar but less distinct feature. The model fitted into the E2 density had the opposite direction from E1, with domains I and III forming much of the broad leaf-like external surface of E2 and the thin and long domain II forming the lower end, near the viral membrane. This proposed orientation is significant because domain III in tick-borne encephalitis virus E protein functions as the receptor attachment site. Domain III would form the leaf-like structure of E2 on the outside of the virus, a position known to be associated with alphavirus cell recognition and attachment (51). Furthermore, domain II, with the equivalence of a fusion peptide at the tip of domain II, is now inserted into the outer leaflet of the viral membrane.
At 11-Å resolution, the fit of Semliki Forest virus E1 to the E2 density establishes similarity not of fold, but only of shape. Indeed, as the carboxy-terminal domain III of the E1 structure fitted into the E2 density forms much of the external leaf-like structure, and as it has already been established that the leaf-like structure consists of the amino-terminal section of E2, there must be some differences in fold between E1 and E2. Nevertheless, the glycosylation sites corresponding to E2 residues 196, 200, 262, and 318 align approximately with residues 21, 23, 153, and 225 on the fitted Semliki Forest virus structure, respectively. The correspondence between E2 glycosylation sites and residues on the fitted Semliki Forest virus E1 structure suggests that the sequence of the domains on E2 is DIII-DI-DII, placing at least 110 amino acids of a probable immunoglobulin-like domain at the amino end of the E2 polypeptide, although the spatial arrangement of the domains in E1 and E2 is the same. Similar domain rearrangements are found in other protein structures (5, 47). Verification of the proposed E2 structure will require determination of the atomic resolution structure of E2 by X-ray crystallography.
Structural similarity between Semliki Forest virus E1 and tick-borne encephalitis virus E.
It has long been established that the rate of change of amino acids is usually much faster than that of the three-dimensional fold in the evolution of proteins (48). Therefore, it is not surprising that there is no obvious sequence similarity between Semliki Forest virus E1 and tick-borne encephalitis virus E, although the folds are clearly rather similar (27). A rough scale of similarity can be established by measuring the number and percentage of C
atoms that can be sequentially aligned in three dimensions (22, 31, 56).
Structural alignment based on the C
backbone (38, 43) showed that 63% of the residues in domain I and 61% of the residues in domain II of the Semliki Forest virus E1 structure could be equivalenced to residues in the tick-borne encephalitis virus E structure, accounting for the superposition of nearly all the secondary structural elements in domains I and II (Fig. 2). However, only 14% of the residues in domain III could be structurally aligned. Separate alignments of the domains produced essentially the same alignment for domains I and II, demonstrating that these form a single rigid body. Separate alignment of domain III made it possible to increase the number of equivalenced residues to 23%, a level of similarity still well below a significant structural relationship (31). In contrast, in a comparison of the viral proteins VP1, VP2, and VP3 of picornaviruses, the number of residues that can be aligned structurally is between 50 and 70%, although there is no easily recognizable amino acid sequence similarity (29).
The absence of a significant similarity between Semliki Forest virus E1 and tick-borne encephalitis virus E in domain III suggests that it has had a different history or is evolving faster than the other two domains. Thus, these domains may have been derived from different sources for these two families of viruses. This would also be consistent with domain III being fused to the amino instead of the carboxy end of E2 (see above).
The fitting procedure was also used to position tick-borne encephalitis virus E (instead of Semliki Forest virus E1) into the 11-Å cryoEM map of Sindbis virus with only the C
atoms. Restraints were established by equivalencing the alphavirus E1 glycosylation sites with the tick-borne encephalitis virus flavivirus E amino acid sequence, based on the structural alignment of tick-borne encephalitis virus with Semliki Forest virus E1 (Fig. 2). This showed that the tick-borne encephalitis virus E structure resulted in an average density (Sumf) for C
atoms of only 29%, as opposed to 43% for the Semliki Forest virus E1 structure (Table 1), demonstrating how well the Sindbis virus map is able to differentiate between the distantly similar tick-borne encephalitis virus E structure and the closely related Semliki Forest virus E1 structure.
Lipid membrane bilayer. The most striking feature in the 11-Å resolution cryoEM map is the characteristic appearance of the lipid bilayer between 217-Å and 261-Å radii (Fig. 1). The pair of high, roughly spherical, uniform densities centered at 222 Å and 256 Å are generated by the phosphate head groups and are separated by much lower density, centered around a radius of 239 Å, produced by the aliphatic region associated with the phosphate layers. The mean density of the phosphate head groups is about 75% of the average density in the glycoproteins, a ratio closely similar to that observed in a 24-Å resolution cryoEM map of dengue virus (24).
The membrane is penetrated by a number of higher-density regions (Fig. 1 and 6), whose location is consistent with the T= 4 symmetry of the virus. The three transmembrane densities related by a quasi-threefold axis (Fig. 3 and 6) can be superimposed on themselves after a
120° rotation to within a root mean square deviation of 1.9 Å. These, in turn, can be superimposed onto the three icosahedrally related transmembrane densities with a root mean square deviation of 1.6 Å. These densities locate the positions of the quasi-twofold and quasi-threefold axes, which are in excellent agreement with the quasi-threefold and quasi-twofold symmetry axes defined by the site of the previously determined carbohydrate positions (Fig. 5) (37).
![]() View larger version (37K): [in a new window] |
FIG. 6. (A) Side view of one of the four T=4 related transmembrane regions fitted with a 28-residue coiled-coil segment derived from the GCN4 structure (C , trunc, in Table 1). (B) Sequence of the E1 and E2 glycoproteins fitted to the A and B chains of GCN4, respectively. Ectodomain residues are in blue, residues in phospholipid leaflets are in green, residues in the aliphatic center of the membrane are in yellow, cytoplasmic residues are in red, and residues that bind to capsid protein (CP) are in black.
|
The heterodimeric coiled-coil component of GCN4 (PDB accession number 1YSA) was used as a model to fit into the four quasiequivalent transmembrane densities (Table 1) with the program EMfit. The fitting process was constrained by the established T=4 symmetry (Table 2). The fitting was also restrained by minimizing the distances between the amino terminus of the truncated GCN4 A chain and the carboxy-terminal residue 380 of E1. The fit shows a good agreement between the coiled-coil component of GCN4 and the Sindbis virus transmembrane densities (Fig. 6). The amino ends of the A and B chains of the GCN4 structure are well separated from each other but, nevertheless, are in or near
-helix-like density. The amino end of helix B is close to the density representing the E2 glycoprotein. Thus, the GCN4 structure is a mimic of E1 and E2 in the vicinity of the viral membrane.
There are 28 amino acids in the membrane section in both the A and B chains, with 17 residues of each chain associated with the central aliphatic region of the membrane (Fig. 6). Building of the E1 transmembrane amino acid sequence into the A chain of GCN4 shows that this region extends roughly from residues Thr409 to Ser435, leaving four residues on the cytoplasmic side, in contrast to previous suggestions (4). The extension of E1 and especially E2 towards the nucleocapsid is clearly visible in the cryoEM map. Furthermore, the B chain has nine residues in the cytoplasmic region of E2, emerging from the viral membrane at residue Cys390 and finishing close by the amino terminus of the peptide (residues 400-Tyr-Ala-Leu; see below) associated with the capsid protein binding site (26), consistent with expectations. Thus, chain A of GCN4 runs from near the carboxy terminus of the fitted E1 fragment (probably residue Thr390; see below) to the E1 carboxy terminus at Arg438 on the cytoplasmic side of the viral membrane, whereas chain B runs from the vicinity of the E2 ectodomain to the E2 binding site on the nucleocapsid.
There are 28 amino acids between the carboxy terminus of the fitted E1 fragment at residue 380 and the entrance of the E1 polypeptide into the membrane at residue 409. However, the amino end of the fitted GCN4 A chain is only about 10 Å away from the final E1 ordered carboxy-terminal residue Cys380. There are 18 residues in the GCN4 A chain before it enters the transmembrane region. Thus, there are about 10 amino acids between the end of E1 and the beginning of the GCN4 A helix, for which there is no density and little available space.
The Sindbis virus E1 model was based on the crystallographic determination of the homologous Semliki Forest virus E1 structure. The carboxy terminus of the crystallized Semliki Forest virus E1 had been determined to be at Tyr390 (62), although the final 10 residues (381 to 390) were found to be disordered. In view of the difficulty encountered in assigning the amino acid sequence to the X-ray electron density (27) in domain III, it is possible that the final short, disconnected amino acid sequence might have been incorrectly assigned and that the residues leading to Semliki Forest virus E1 Cys380 really should have been residues leading to Tyr390.
Nucleocapsid. The density on the inside of the lipid bilayer, between 220-Å and 180-Å radii, also has large positive and negative density values, resulting in the largest root mean square deviations in the map, corresponding to the nucleocapsid core region (Fig. 1). The average density of the core is almost equal to the average density in the glycoprotein annulus, demonstrating good structural order in both. In comparison, the core region in dengue virus is only half the height of the glycoprotein (24), demonstrating a very different structural character of the nucleocapsid in these two families of enveloped RNA viruses.
The X-ray crystallographic structure of Sindbis virus capsid protein (9) was fitted into the 11-Å resolution cryoEM map of Sindbis virus (Table 1) with the EMfit program. The procedure was the same as had been used to interpret the capsid structure of Ross River virus at 24-Å resolution (8) except for the resolution of the maps and the parameters used to define the quasisymmetry (see above). The orientation of the capsid protein was found to be completely different from that reported previously (Table 1). Whereas the new orientation has a quality of fit similar to that of placing the E1 glycoprotein into the 11-Å map, the old orientation gives a significantly worse fit (Table 1). The new orientation fills more of the cryoEM density, including the intersubunit region, has few clashes between neighboring subunits, and is consistent with the higher-resolution details of the 11-Å map. Furthermore, in the new orientation, the surface binding pocket (26) is close to the carboxy end of the GCN4 B chain. Thus, a final fit was established by adding a peptide (corresponding to E2 residues 400-Tyr-Ala-Leu) to the binding pocket of the search model and by minimizing the distance between the amino end of the bound peptide and the carboxy end of the fitted GCN4 B chain (Table 1 and Fig. 7).
![]() View larger version (76K): [in a new window] |
FIG. 7. Fit of the capsid protein (CP) crystal structure into the cryoEM map. (A) Top view, showing the hexamer and pentamer around an icosahedral twofold axis and a fivefold axis, respectively. Molecules are colored as in Fig. 3. The electron density is colored grey. (B) Side view of one capsid protein crystal structure (black) fitted into the cryoEM density. The membrane density is colored green. Tyr180 in the peptide binding pocket is shown in blue. (C) CryoEM difference density after subtracting the density due to the capsid protein. Note that the amino-terminal residues 106 to 114 are out of density but that there is an obvious density region where these residues are positioned in the virus as opposed to the crystal structure. This density leads from the mixed protein-RNA region into the capsid protein shell.
|
The charge distribution on the pentameric and hexameric rings of capsid protein subunits is largely positive on the external side facing the negatively charged membrane surface, confirming that the present orientation of the capsid protein is chemically reasonable. The internal face of these oligomeric rings, facing the RNA and the positively charged amino-terminal capsid domain, is more neutral.
The revised capsid structure places the NG atom of Lys250 on the inside of the capsid hexamers and pentamers, which would allow adjacent subunits within the same ring to be cross-linked by a 12-Å-long dimethyl suberimidate cross-linker. It has been shown that Sindbis virus cores can be self-assembled in vitro in the presence of suitable RNA or DNA oligomers and isolated capsid protein (53). Furthermore, assembly can occur from dimers of cross-linked capsid dimers (54, 55). This made sense in terms of the previous interpretation of the capsid protein orientation, which placed Lys250 on the outside of hexamers and pentamers separated by a distance suitable for cross-linking between hexamers and pentamers. The capsid orientation that now fits the density better would make it impossible to assemble pentamers from sets of dimers. Presumably, therefore, the in vitro-assembled cores, with pairwise cross-linked protein subunits, would be imperfect, missing one of five subunits in each of the 12 pentamers. However, a deficiency of 12 monomers out of a total of 240 monomers per particle would probably have gone undetected in the electron microscopy used to assay the assembly reaction. It is noteworthy that only about 20% of dimers could be isolated when in vitro-assembled cores or cores isolated from cytoplasm were cross-linked (54).
Previous experiments that introduced site-specific mutations in the residues thought to create the interface between subunits did not eliminate virus particle formation (K. E. Owen and R. J. Kuhn, unpublished data). Thus, our finding an alternative solution to the structure of the nucleocapsid was consistent with the previous negative results. The new structure has a remarkably small subunit-to-subunit interface, suggesting that formation of the interface is not the major driving force that creates the observed pentamers and hexamers. Instead, it is probably the association of RNA with the amino-terminal domain (residues 1 to 113), as well as the formation of a coiled-coil association between the amino-terminal domain of the capsid proteins (35), that drives the assembly. Nevertheless, the presently proposed structure does place the guanidinium group of Arg174 within 2.3 Å of the carboxy group of Glu259 in the neighboring subunit. Arg174 is a conserved basic residue, and Glu259 is completely conserved in all alphaviruses sequenced, suggesting that this interaction might be important for assembly.
RNA structure. Within the core, between a radius of 150 and 180 Å, is a region whose average density is only half the height of the glycoprotein or nucleocapsid densities and whose character is more uniform, with a smaller root mean square deviation (Fig. 1). As mentioned above, the difference density produced by subtracting the fitted capsid protein clearly shows the path of the capsid protein residues from about 100 to 114 leading from this layer of partially disordered structure to the beginning of the well-ordered capsid structure (Fig. 7).
It has been shown that residues 76 to 113 interact specifically with RNA (32, 61), whereas residues 1 to 75 in the N-terminal domain of the capsid protein interact nonspecifically with the predominantly negatively charged RNA (18, 60). This is consistent with the assumption that the lower, more uniform density inside the ordered capsid structure is a mixed protein-RNA region. The ratio of the volume of the capsid protein density compared to the volume of the mixed protein-RNA region is 2.0. However, the ratio of the number of amino acids in the corresponding regions is 1.50 (assuming that all the 100 amino acids of the capsid's amino-terminal domain are in the mixed region). Thus, the mixed protein-RNA region has space for an ample amount of RNA as well as all the protein in the amino-terminal region of the capsid.
The lack of extensive capsid-capsid contacts suggests that the RNA-capsid association has to play a critical role in assembly, as was also recognized by the requirement of nucleic acid for the in vitro self-assembly of cores (53). The density distribution of the mixed protein-RNA region features higher densities running towards the quasi- and icosahedral-threefold axes (Fig. 7C). Thus, each quasi-threefold axial position associates one capsid protein pentamer with two hexamers, and each icosahedral threefold position associates three capsid hexamers. This implies that the association of pentamers and hexamers is determined in the same way by their relationships with the quasi- and icosahedral spikes as they are with their association with each other in forming a complex with the genomic RNA.
Finally, inside the 150-Å radius is a region of moderately uniform density, corresponding to the remainder of the genomic RNA. The features in the RNA region are long and narrow, similar to the RNA region in dengue virus (24).
We thank Robert Ashmore for his RobEM graphic user interface program, Chuan Xiao for writing several programs that helped to speed up the calculations, and Sharon Wilder as well as Cheryl Towell for help in the preparation of the manuscript. We are grateful to Paul Chipman, who participated in some of the cryoEM data collection.
The work was supported by an NIH Program Project Grant (AI45976) to T.S.B., R.J.K., and M.G.R. as well as NIH grants to T.S.B. (GM33050) and to R.J.K. (GM5627).
|
|
|---|
helices: crystal structure of the protein-DNA complex. Cell 71:1223-1237.[CrossRef][Medline]
-helix important for core assembly. J. Virol. 75:1-10.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»