Journal of Virology, August 2006, p. 7902-7908, Vol. 80, No. 16
0022-538X/06/$08.00+0 doi:10.1128/JVI.00483-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Zhiyong Lou,1,2,
Fei Sun,1,2
Yujia Zhai,1,2
Haitao Yang,1,2
Rongguang Zhang,2,3
Andrzej Joachimiak,3
Xuejun C. Zhang,2,4
Mark Bartlam,1,2 and
Zihe Rao1,2*
"Tsinghua-IBP Joint Research Group for Structural Biology," Tsinghua University, Beijing 100084, China,1 National Laboratory of Biomacromolecules, Institute of Biophysics (IBP), Chinese Academy of Sciences, Beijing 100101, China,2 Biosciences Division, Argonne National Laboratory, Argonne, Illinois 60439,3 Crystallography Research Program, Oklahoma Medical Research Foundation, Oklahoma City, Oklahoma 731044
Received 8 March 2006/ Accepted 9 May 2006
Prior to the global severe acute respiratory syndrome (SARS) outbreak in 2003, researchers paid scant attention to coronaviruses, as this genus of viruses causes severe diseases predominantly in animals and only comparatively mild diseases in humans. In the wake of the SARS outbreak, greater attention has been focused on the replicase proteins with a view to understanding the replication/transcription machinery and identifying new therapeutic targets. To date, the three-dimensional structures of a series of nsp proteins have been reported (1). nsp5, also called the main protease or 3C-like protease, was the first SARS protein structure to be determined, in 2003 (24), and has since been the focus of concerted efforts for the design of antiviral inhibitors. Last year, a broad-spectrum inhibitor was reported with efficient in vitro inactivation of multiple coronavirus main proteases, potent antiviral activity, and extremely low cellular toxicity (23). The structure of nsp9 was determined in 2004, and it was found to be a single-stranded RNA binding protein (4, 17). More recently, the complex structure between nsp7 and nsp8 revealed a hexadecameric assembly that should constitute a processivity factor for nsp12, an RNA-dependent RNA polymerase (25). Also determined recently was the ADP ribose 1-phosphatase domain of nsp3 (14).
nsp10 and nsp11 immediately precede nsp12 in the pp1ab sequence and have been implicated in RNA synthesis, although the mechanism remains to be elucidated through both structural and functional studies. To improve our understanding of the functions of CoV nsp proteins, we carried out systematic structural studies on this group of proteins. Here we report the crystal structure of nsp10. This protein appears to have a novel fold, featuring two zinc fingers with C-(X)2-C-(X)5-H-(X)6-C and C-(X)2-C-(X)7-C-(X)-C motifs. Furthermore, 12 identical subunits assemble into a novel spherical dodecameric architecture which is proposed to be a functional form of nsp10.
Crystallization. SARS nsp10-nsp11 crystals were grown at 291 K by the hanging drop vapor diffusion method from a sodium chloride system in desalting buffer supplemented with 65 mM zinc chloride. The optimum brick-like crystals were obtained using a reservoir solution of 1.8 M NaCl, 0.1 M NaH2PO4, 0.1 M KH2PO4, 0.1 M DTT, and 2-(N-morpholino)ethanesulfonic acid (MES; pH 6.5). A single crystal was transferred to the reservoir solution (supplemented with 5 M sodium formate) for 1 h prior to submersion in a cold nitrogen stream.
Data collection and processing. The "frozen" nsp10-nsp11 crystal diffracted to a resolution of 2.1 Å. Single-wavelength anomalous dispersion (SAD) data using the bound zinc were collected for the native nsp10-nsp11 crystal at 100 K using an SBC2 (3,000 by 3,000) CCD detector on beamline BL19-ID at the Advanced Photon Source (Argonne National Laboratory). The intensity data were processed and scaled using the software HKL2000 (11). Data collection statistics are summarized in Table 1.
TABLE 1. Data collection and refinement statistics
Initial tracing of the monomer polypeptide chain was performed manually using the program O (7), and a starting dodecameric model was generated using noncrystallographic symmetry (NCS) relations generated from CNS. Initial refinement was performed using simulated annealing in CNS with very tight NCS restraints. During the later stages of positional refinement, restraints were relaxed and a bulk solvent correction was applied under the guidance of Rfree. Model geometry was verified using the program PROCHECK (9). Solvent molecules were located from stereochemically reasonable peaks in the σA-weighted Fo-Fc difference electron density map, where Fo is the observed structure factor amplitude and Fc is the calculated structure factor amplitude. Final refinement statistics are shown in Table 1.
Protein structure accession numbers. Coordinates and structure factors for the SARS-CoV nsp10 crystal structures have been deposited in the RCSB Protein Data Bank (PDB) under accession numbers 2G9T (for the 2.1-Å structure) and 2GA6 (for the 2.7-Å structure).
root mean square deviations (RMSD) being less than 0.5 Å. Residues before Ala9 (including five leading residues left from the tag) and Pro86-Gly88 and after Ser129 could not be traced due to lack of interpretable electron density. Since the latter region includes the entire nsp11 peptide, the crystallized protein is nsp10 per se; it is called such hereafter. The absence of Cys130 and subsequent residues is a result of incidental cleavage, as indicated by (i) the reduced molecular weight band from sodium dodecyl sulfate polyacrylamide gel electrophoresis analysis compared with that of a freshly prepared protein sample and (ii) matrix-assisted laser desorption ionization-time of flight mass spectrometry analysis (data not shown). The cause of the loss of Cys130 and its C-terminal residues remains unclear. However, analysis of the crystal packing suggests that the C-terminal peptide deletion is required for the formation of the crystal lattice. The 24 independent copies of nsp10 assemble into two identical spherical dodecamers related by a local twofold symmetry. The novel dodecameric sphere possesses a tetrahedral symmetry and can be viewed as the assembly of four nsp10 trimers (Fig. 1). The dodecamer is hollowed in the center, with an outer radius of 42 Å and an inner radius of 18 Å (Fig. 2). Two zinc binding sites were identified in the three-dimensional structure of each nsp10 monomer. The first one is located in the N-terminal region and on the inner surface of the dodecamer, and the second one is located at the C terminus on the outer surface. The missing C tail of nsp10-nsp11 peptide would also be located on the outer surface of the dodecamer. Both inner and outer surfaces of the dodecamer have a dominantly positive electrostatic potential (Fig. 2), although the calculated pI of nsp10 is 5.8.
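The shell dimensions quoted above can be turned into a few derived quantities. The following Python sketch is only an illustration: it assumes the dodecamer can be idealized as a smooth hollow sphere, which the real molecular surface is not, and the function and variable names are hypothetical.

```python
import math

OUTER_RADIUS_A = 42.0  # outer radius of the dodecamer reported in the text (Å)
INNER_RADIUS_A = 18.0  # inner radius of the dodecamer reported in the text (Å)


def sphere_volume(radius):
    """Volume of an ideal sphere of the given radius."""
    return 4.0 / 3.0 * math.pi * radius ** 3


def shell_summary(outer, inner):
    """Derived dimensions of an idealized hollow spherical shell (assumption)."""
    outer_volume = sphere_volume(outer)
    inner_volume = sphere_volume(inner)
    return {
        "outer_diameter": 2 * outer,
        "inner_diameter": 2 * inner,
        "wall_thickness": outer - inner,
        "cavity_volume": inner_volume,
        "shell_volume": outer_volume - inner_volume,
        "cavity_fraction": inner_volume / outer_volume,
    }


summary = shell_summary(OUTER_RADIUS_A, INNER_RADIUS_A)
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

Under this idealization the wall is 24 Å thick and the central cavity occupies (18/42)³, a little under 8%, of the enclosed volume.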
FIG. 1. Ribbon representation of the nsp10 crystal structure. A. One nsp10 trimer is viewed along the noncrystallographic threefold axis from the outside of a dodecamer. The three protomers are colored in magenta, gold, and green. B. The remaining three trimers of the dodecamer are shown in the same orientation as that for panel A using the same color scheme. C. The same as panel B but rotated by 90°. D. The relationship between threefold axes. The twist angle between pairs of threefold axes is approximately 108°. E. The complete dodecamer structure of SARS-CoV nsp10. The three protomers in a trimer are colored in magenta, gold, and green. This figure was prepared with the programs Molscript (8), Bobscript (5), and Raster3D (10).
FIG. 2. Electrostatic potential of the nsp10 crystal structure. A. The electrostatic potential of a complete dodecamer is mapped on its outer molecular surface. Negatively charged regions are colored in red, positively charged regions are colored in blue, and neutral regions are colored in gray. The C-terminal-bound zinc ions, which are located on the outer surface, are depicted as red spheres. The position of the front trimer is highlighted by a triangle. B. The inner surface electrostatic potential of the dodecamer is shown in the same orientation as that of panel A. C. A cross-section of the nsp10 dodecamer to illustrate the minimum and maximum radii of the shell. D. The electrostatic potential surface of an isolated trimer. This figure was drawn with the program CCP4 mg (12, 13).
α/β fold comprised of five α-helices (α1 to α5), one 3₁₀-helix, and three β-strands (β1 to β3) (Fig. 3). The monomer peptide model consists of residues Ala9 to Ser129 and is continuous, except that the region Pro86-Gly88 is missing in the α4-β3 connecting loop. Nevertheless, in the 2.7-Å structure which was solved using crystals grown in the absence of additional zinc, we were able to build the Pro86-Gly88 loop region for some nsp10 monomers, suggesting that this region is flexible in general. The central core of the nsp10 monomer is an antiparallel β-sheet formed by strands β1 (residues 55 and 56), β2 (65 to 69), and β3 (96 to 100). The central β-sheet is flanked on one side by helices α3 (residues 70 to 73) and α4 (75 to 79), while helices α1 (residues 10 to 18), α2 (23 to 32) at the N terminus, helix α5 (107 to 113), and the extended C-terminal coil shy away from the central core. Residues on the α4 helix and α4-β3 loop constitute the N-terminal zinc binding site, and the C-terminal coil contributes to the C-terminal zinc binding site. A DALI (http://www.ebi.ac.uk/dali/) search indicated no similar match to the nsp10 monomer in the current Protein Data Bank (PDB), suggesting a novel fold for nsp10.
FIG. 3. nsp10 monomer fold. A. Stereo diagram showing a Cα trace of the nsp10 monomer. Two zinc ions are shown as gray spheres, and one chelating water molecule is shown as a smaller red sphere. B. Stereo ribbon diagram of the nsp10 monomer with β-strands in purple and α-helices in gold. Secondary structure elements are labeled. This view is rotated by 90° relative to that of panel A. This figure was prepared with the programs Molscript (8), Bobscript (5), and Raster3D (10).
Sγ-Sγ distances of 5.5 Å and 4.7 Å, respectively, they do not form disulfide linkages. This observation is consistent with the presence of high concentrations of reducing agents in the crystallization reservoir.
FIG. 4. Stereo views of the two independent zinc binding sites. A. The N-terminal zinc binding site with electron density. Key residues interacting with the zinc ion are labeled and depicted as sticks with carbon, nitrogen, oxygen, and sulfur atoms colored in yellow, blue, red, and green, respectively. The zinc ion is shown as a gray sphere. A refined-model-phased 2Fo-Fc electron density map is shown for the zinc ion and chelated residues at 1.3σ. Secondary structure elements are labeled. B. The C-terminal zinc binding site with electron density. The turn between C117 and C120 is labeled. This figure was prepared with the programs Molscript (8), Bobscript (5), and Raster3D (10).
α1-α2 loop of one monomer forms hydrophobic interactions with residues Val57 and Thr58 in strand β1 of an adjacent monomer; residues Thr115 and Thr118 of one monomer also form hydrophobic interactions with Thr118 and Val119 of an adjacent monomer. Additional hydrogen bonds are formed between the Nζ atom of Lys25 in one molecule and the Oε2 atom of Glu60 in an adjacent molecule, with a distance of 2.6 Å, and between the main chain O atom of Pro84 in one molecule and the Nζ atom of Lys95 in an adjacent molecule, with a distance of 3.3 Å. Furthermore, the three monomers are oriented such that their C-terminal zinc fingers are clustered around the threefold axis, with their zinc ions separated by approximately 14 Å.
Trimer-trimer interactions and the dodecamer architecture.
The assembly of the 24 copies of nsp10 into two identical dodecamers indicates that the dodecamer is a stable structural unit of nsp10 under the crystallization conditions. The four nsp10 trimers (named trimers 1 to 4) in a dodecamer are related by a tetrahedral symmetry (Fig. 1). Any combination of three trimers is related by a local threefold symmetry, and any pair of trimers is related by a local twofold symmetry. The buried solvent-accessible surface (SAS) of a trimer in the dodecamer is about 3,040 Å² (or about 16% of the total SAS of an nsp10 trimer), with 69% contributed by hydrophobic atom groups. The crystal structure shows that helix α1 and residues in the α2-β1 loop (residues 42 to 46) play a key role in trimer-trimer interactions. In the first (threefold symmetry-related) interaction region, Leu14, Cys17, Ala18, and Cys79 of trimer 1 form a hydrophobic base, which is directed towards Phe19 of trimer 2. At the same time, Phe19 of trimer 1 forms hydrophobic interactions with the equivalent hydrophobic base of trimer 3, and so on. In the second (twofold symmetry-related) interaction region, Met44 and Leu45 of trimer 1 are oriented to interact directly with their counterparts in trimer 2; Val42 and Tyr96 of trimer 1 interact with Tyr96 and Val42 of trimer 2, respectively. Both Cys17 and Cys79 contribute to stability of the dodecamer architecture, although they do not form a disulfide bridge in the reduced conformation state. Through these interactions, the 12 molecules can assemble to form a pseudododecahedron.
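The tetrahedral arrangement described above can be illustrated with a short Python sketch that enumerates the proper rotations of an ideal tetrahedral point group. This is not the authors' procedure: the axis convention (threefold axes along cube body diagonals, twofold axes along the coordinate axes) and all function names are assumptions chosen for illustration. The sketch shows why a subunit in a general position yields 12 copies, why a point on a threefold axis yields only 4 (one per trimer), and what the angle between threefold axes is in the ideal case.

```python
import math
from itertools import product


def tetrahedral_rotations():
    """Return the 12 proper rotations of the ideal tetrahedral group T.

    Each rotation is a signed permutation matrix built from a cyclic
    permutation of the axes and an even number of sign changes.
    """
    cyclic_perms = [(0, 1, 2), (1, 2, 0), (2, 0, 1)]
    even_signs = [s for s in product((1, -1), repeat=3) if s[0] * s[1] * s[2] == 1]
    ops = []
    for perm in cyclic_perms:
        for signs in even_signs:
            matrix = tuple(
                tuple(signs[row] if col == perm[row] else 0 for col in range(3))
                for row in range(3)
            )
            ops.append(matrix)
    return ops


def apply_rotation(matrix, point):
    """Multiply a 3x3 matrix by a 3-vector."""
    return tuple(sum(matrix[r][c] * point[c] for c in range(3)) for r in range(3))


def angle_between_deg(u, v):
    """Angle between two vectors in degrees."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return math.degrees(math.acos(dot / norm))


ops = tetrahedral_rotations()
# A subunit at a general position is copied to 12 distinct positions.
subunit_copies = {apply_rotation(m, (1, 2, 3)) for m in ops}
# A point on a threefold axis has only 4 images: one axis per trimer.
threefold_axes = {apply_rotation(m, (1, 1, 1)) for m in ops}
ideal_angle = angle_between_deg((1, 1, 1), (1, -1, -1))
print(len(ops), len(subunit_copies), len(threefold_axes), round(ideal_angle, 2))
```

For an ideal tetrahedron the angle between any two threefold axes is arccos(-1/3), about 109.5°, which can be compared with the value of approximately 108° given in the legend to Fig. 1.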
The zinc ion binding sites.
Zinc binding is a major structural feature of nsp10. Spectral analysis of the nsp10 crystal during synchrotron data collection clearly demonstrated the presence of zinc, evident from a clear peak near the zinc absorption edge. This allowed the nsp10 crystal structure to be solved by the SAD method. Furthermore, the 2.7-Å resolution structure of nsp10 determined from crystals prepared in the absence of additional zinc shows the same monomer fold, dodecameric architecture, and conformation in the zinc binding sites, with an RMSD of 0.4 Å for all Cα atoms in a dodecamer.
Two bound zinc ions were identified from the crystal structure of nsp10 with unambiguous electron density: one located in the N-terminal region on the inner surface of the dodecamer and the other one at the C-terminal region on the outer surface (Fig. 2 and 4). The first zinc binding site is formed by residues on helix α4 and the α4-β3 loop. This zinc ion is tetrahedrally chelated by the Sγ atoms of three cysteine residues (Cys74, Cys77, and Cys90) and the Nε2 atom of His83, which have bond distances of 2.3 Å, 2.3 Å, 2.4 Å, and 2.2 Å, respectively (Fig. 4A; Table 2). This binding site constitutes a CCHC-type zinc finger with a C-(X)2-C-(X)5-H-(X)6-C sequence motif.
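A sequence motif written in this notation maps directly onto a regular expression, in which each (X)n becomes n arbitrary residues. The Python sketch below is an illustration only: the helper name `find_motifs` and the toy sequence are hypothetical (the sequence is invented for the example and is not the nsp10 sequence), and the second pattern encodes the C-(X)2-C-(X)7-C-(X)-C motif reported for the C-terminal site.

```python
import re

# C-(X)2-C-(X)5-H-(X)6-C: the CCHC-type motif described in the text.
CCHC_MOTIF = re.compile(r"C.{2}C.{5}H.{6}C")
# C-(X)2-C-(X)7-C-(X)-C: the CCCC-type motif reported for the C-terminal site.
CCCC_MOTIF = re.compile(r"C.{2}C.{7}C.C")


def find_motifs(sequence, pattern):
    """Return (1-based start, matched text) for every match, overlaps included."""
    # A lookahead with a capture group lets overlapping matches be reported.
    lookahead = re.compile(f"(?=({pattern.pattern}))")
    return [(m.start() + 1, m.group(1)) for m in lookahead.finditer(sequence)]


# Hypothetical toy sequence, invented for illustration; NOT the nsp10 sequence.
TOY_SEQUENCE = "GSCAACDEFGKHLMNPQRCGS"

cchc_hits = find_motifs(TOY_SEQUENCE, CCHC_MOTIF)
cccc_hits = find_motifs(TOY_SEQUENCE, CCCC_MOTIF)
print(cchc_hits)
print(cccc_hits)
```

In the toy sequence the ligands fall at positions 3, 6, 12, and 19, the same spacing (+3, +6, +7) as Cys74, Cys77, His83, and Cys90 in the structure. A pattern match is only suggestive; whether the residues actually chelate zinc has to be established structurally, as done here.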
TABLE 2. Zinc chelation
atoms. The fourth ligand is a water molecule, with clearly defined electron density, whose distance to the zinc ion is 2.5 Å. This chelating water molecule also interacts with Ser129 through a perfect hydrogen bond to form a stabilized hydrogen bond network. All bond lengths related to zinc binding are summarized in Table 2. Noteworthy is the observation that Cys130 neighbors Ser129 in the nsp10 sequence but did not exist in the crystallized protein. Otherwise, the presence of Cys130 might suitably position it to chelate the zinc ion, suggesting the second zinc binding site should also be a zinc finger of a C-(X)2-C-(X)7-C-(X)-C sequence motif. This CCCC-type zinc finger is distributed on the surface of the dodecamer and is relatively flexible in our structure compared to the CCHC-type zinc finger (Table 2). However, the flexibility of this region may not reflect the conformation of the full-length protein. A multiple-sequence alignment of nsp10 from SARS-CoV with coronaviruses from groups I, II, and III of the genus Coronavirus indicates that all seven observed zinc-chelating residues, plus Cys130, are strictly conserved (Fig. 5), implying their importance in the functional replicase-transcriptase complex. In contrast, a number of other cysteine residues in SARS nsp10 are not conserved at all.
FIG. 5. Multiple-sequence alignment of SARS-CoV nsp10 with representatives from all three groups of the genus Coronavirus. HCOV-229E, human coronavirus strain 229E; MHV, mouse hepatitis virus; IBV, avian infectious bronchitis virus. The secondary structure for SARS-CoV is shown at the top of the alignment; arrows indicate β-strands, and helical curves denote α- or 3₁₀-helices. Residues highlighted in red are identical among the compared proteins; residues highlighted in yellow are conserved. Residues important for zinc binding are marked with green triangles, and residues important for stability of the dodecamer are marked with blue vertical arrows. The alignment was generated by ClustalX (20) and drawn with ESPript (6).
Although the DALI search did not find any candidate structure similar to SARS-CoV nsp10 from PDB, a PFAM search (http://www.sanger.ac.uk/Software/Pfam) for similar sequence motifs identified several members of the HIT-type zinc finger family as nsp10 homologous candidates. Named after the first protein that originally defined the domain, the yeast HIT1 protein, the HIT-type zinc finger contains seven conserved cysteines and one histidine that can potentially coordinate two zinc atoms. While the function of the HIT-type zinc finger is unknown, this motif is mainly found in nuclear proteins involved in gene regulation and chromatin remodeling. To date, there are no three-dimensional structures of HIT-type zinc finger domains reported in the PDB. Therefore, our nsp10 crystal structure serves as the first example of a three-dimensional structure of this novel class of double zinc finger-containing motif.
This finding provides strong structural evidence that, like other better studied nsp proteins, nsp10 also likely plays a role in RNA synthesis, as suggested by other researchers (26). nsp10 is involved in network interactions with other nsp proteins, and the integrity of its zinc fingers seems important for such interactions. Experiments on mouse hepatitis virus (MHV), a group II coronavirus along with SARS-CoV, demonstrated the colocalization of nsp10 with nsp7, nsp8, and nsp9, providing solid evidence for their interaction in the coronavirus life cycle (2). Our chemical cross-linking experiment further demonstrated that SARS-CoV nsp10 can be cross-linked with nsp9 (data not shown), which itself interacts with nsp8 (17). Furthermore, an MHV ts mutant, Alb ts6, encoding a mutant form of nsp10 with a Gln65-to-Glu mutation, was shown to have a defect in negative-strand RNA synthesis (15). The Gln65 residue, conserved in all three groups of the genus Coronavirus, is located on strand β2 of the SARS nsp10 structure and hydrogen bonds via the Nε2 atom to the main chain carbonyl oxygen of Gly52. Gln65 is thus important for the conformational stability of nsp10 and particularly for the α4 helix which forms part of the N-terminal CCHC zinc finger. Therefore, mutation of Gln65 might be expected to perturb the folding of pp1a into a less productive conformation that would prevent it from participating in the formation of a replicase-transcriptase complex with negative-strand activity.
Conclusions. The scientific significance of the SARS-CoV nsp10 structure is at least threefold. First, nsp10 has a novel protein fold. A search with the DALI web engine (http://www.ebi.ac.uk/dali/) for structural homologs failed to yield any match to the nsp10 fold, suggesting a novel function for the nsp10 family members. Second, nsp10 possesses two zinc fingers, located in the N-terminal region and at the C-terminal region, with C-(X)2-C-(X)5-H-(X)6-C and C-(X)2-C-(X)7-C-(X)-C sequence motifs, respectively. These motifs are conserved in all three groups of the genus Coronavirus, and our crystal structure illustrates for the first time the significance of these conserved residues. Further sequence analysis suggests that nsp10 is related to the HIT-type zinc finger family, which is often found in nuclear proteins involved in gene regulation and chromatin remodeling. Thus, our nsp10 crystal structure becomes the first of a new class of zinc finger protein three-dimensional structures to be revealed experimentally. Third, the molecular assembly of nsp10 is a hollow dodecamer with an outer diameter of 84 Å and an inner diameter of 36 Å. Twelve C-terminal zinc fingers stick out from the outer surface of the sphere, and another 12 zinc fingers are distributed around the inner surface. The strong positive electrostatic potential found on both the inner and outer surfaces of the dodecamer is intriguing, consistent with the probable function of nsp10 in the RNA synthesis machinery.
To date, the structures and functions of several components of the replication/transcription machinery have been determined, including the nsp3 ADP ribose 1-phosphatase domain (14); nsp5 (23, 24), nsp7, and nsp8 in complex (25); and nsp9 (4, 17). nsp5 is the main protease for cleavage of the replicase polyproteins, nsp7 and nsp8 are proposed to function as processivity factors for nsp12 (the RNA-dependent RNA polymerase), and nsp9 is a single-strand RNA binding protein. The crystal structure reported here will help to clarify the function(s) of nsp10, in which the presence of two zinc fingers should enable it to play an important role in RNA synthesis. Elucidation of the nsp10 structure will provide further insights into the sophisticated replication/transcription mechanism of SARS-CoV and other coronaviruses, such as mouse hepatitis virus (MHV), human coronavirus strain 229E, and human coronavirus strain HKU1.
This work was supported by Project 973 of the Ministry of Science and Technology of China (grant number 2004CB720000), the NSFC (grant number 30221003), the Sino-German Center [grant number GZ236(202/9)], and the "Sino-European Project on SARS Diagnostics and Antivirals" (SEPSDA) of the European Commission (grant number 003831).
These authors made equal contributions.