Journal of Virology, April 2007, p. 3151-3161, Vol. 81, No. 7
0022-538X/07/$08.00+0 doi:10.1128/JVI.01939-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.

Department of Molecular Biology, Skaggs Institute for Chemical Biology, Consortium for Functional and Structural Proteomics of the SARS-CoV, and Joint Center for Structural Genomics, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037,1 Institut für Molekularbiologie und Biophysik, ETH Zürich, CH-8093 Zürich, Switzerland2
Received 5 September 2006/ Accepted 19 December 2006
α/β-fold formed by a mixed parallel/antiparallel six-stranded β-barrel, an α-helix covering one opening of the barrel, and a 3₁₀-helix alongside the barrel. We further characterized the full-length 179-residue protein and show that the polypeptide segments of residues 1 to 12 and 129 to 179 are flexibly disordered. The structure is analyzed in a search for possible correlations with the recently reported activity of nsp1 in the degradation of mRNA.
The large number of mature proteins produced from the polyprotein indicates a high level of complexity of the viral replication process. Some of the enzymatic activities that were detected or predicted in SARS-CoV to date include the main protease (nsp5), a papain-like proteinase (nsp3d; PLpro), an RNA-dependent RNA polymerase (nsp12), an RNA helicase (nsp13), an endoribonuclease (nsp15), an ADP-ribose-1"-phosphatase (nsp3b), a deubiquitinase (nsp3d), a 3'→5' exoribonuclease (nsp14), and a ribose-2'-O-methyltransferase (nsp16) (3, 5, 18, 21, 22, 44, 59, 70). For the two proteases, the endoribonuclease, and the ADP-ribose-1"-phosphatase, three-dimensional structures have been solved, which together with biochemical data have revealed some aspects of the enzyme mechanisms (27, 54, 56, 58, 63, 67; reviewed in references 4, 35, and 65). The physiological functions of several of the other nonstructural proteins remain to be determined, but high-resolution structure determinations have allowed the identification of possible functional sites and provided the basis for further biochemical studies (15, 30, 52, 61, 62, 68). Other replicase proteins still remain to be characterized.
Nsp1 is the N-terminal cleavage product of the replicase polyprotein and is produced by the action of PLpro. It is among the least well-understood nsps, and other than in coronaviruses, no viral or cellular homologs are known. Levels of sequence conservation among the different coronaviruses are highest at the 3' end of the genome, and the sequences are very divergent at the 5' end, especially in nsp1 to nsp3, which are products of PLpro cleavage. nsp1 has been proposed to be useful as a group-specific marker (59). In the group 1 coronaviruses, nsp1 (also known as p9) is a protein of about 110 residues, with 20 to 50% sequence identity among all group 1 CoVs. The viruses of subgroup 2a, such as murine hepatitis virus (MHV) and human coronavirus OC43, encode an nsp1 protein of about 245 residues, also known as p28, while the group 3 viruses (avian) do not encode an nsp1. The nsp1 of SARS-CoV, which has been classified as the only member to date of the subgroup 2b (19, 20, 59), comprises 180 residues, with a molecular mass of 20 kDa. nsp1 sequences are divergent between groups 2a and 2b, and no sequence similarity between SARS-CoV nsp1 and group 2a nsp1 proteins could be identified using standard searching tools such as BLAST.
Biochemical experiments demonstrated interactions between MHV nsp1 and two other replication proteins (nsp7 and nsp10) and colocalization with nonstructural proteins and the nucleocapsid protein at viral replication complexes in the cytoplasm during the early stages of infection (6). In contrast, during the later stages of infection, MHV nsp1 was found to colocalize with structural proteins at virion assembly sites (6). Mutations at the nsp1/nsp2 cleavage site of MHV that prevented the cleavage of nsp1 from the polyprotein caused slower growth and reduced RNA synthesis relative to wild-type viruses (13). Deletion of the nsp1-coding region in infectious clones of MHV yielded viruses that were unable to productively infect cultured cells (7). Furthermore, exogenous expression of MHV nsp1 in mammalian cells arrested the cell cycle in the G0/G1 phase and inhibited cell proliferation (8). A point mutation in the proteolytic cleavage site between nsp1 and nsp2 in the full-length genome, and in minigenomes of the group 1 CoV porcine transmissible gastroenteritis virus, blocked the release of nsp1 from the nascent polyprotein and caused a dramatic reduction in virus viability (17). SARS-CoV nsp1 was shown to specifically accelerate the degradation of mRNA and thus lead to a reduction in cellular protein synthesis, which may provide a survival advantage for the virus (31). Overall, these observations indicate that nsp1 might participate in multiple stages of the coronavirus life cycle, and they implicate this protein as a potentially important virulence factor.
This paper describes the nuclear magnetic resonance (NMR) structure of SARS-CoV nsp1. Before this study, no information about the three-dimensional structure of nsp1 was available, and SARS-CoV nsp1 does not have significant amino acid sequence similarity with any protein with known three-dimensional structure. SARS-CoV nsp1 was therefore selected for NMR structure determination by the Consortium for Functional and Structural Proteomics of SARS-CoV-Related Proteins (http://sars.scripps.edu). The availability of a high-resolution solution structure will help to guide further investigations of the biochemical and physiological functions of nsp1.
Protein preparation. Large-scale expression of uniformly 15N-labeled or 13C- and 15N-labeled nsp1(13-128) in E. coli BL21(DE3) cells was carried out at 18°C in 500 ml of M9 minimal medium containing either 0.5 g 15NH4Cl or 0.5 g 15NH4Cl and 2 g [13C6]-D-glucose as the sole nitrogen and carbon sources, respectively. For the protein purification, the cells were disrupted by sonication in the presence of 25 mM HEPES at pH 8.0, 250 mM NaCl, 2 mM dithiothreitol, 0.03% NaN3, and EDTA-free Complete protease inhibitor tablets (Roche). The cell lysate was loaded onto a 10-ml HisTrap FF column equilibrated with 50 mM imidazole in the same buffer system as mentioned above. The retained proteins were eluted with a 50 to 500 mM imidazole gradient and incubated with recombinant tobacco etch virus protease at 22°C for 2 days. The resulting solution was loaded onto a 300-ml Superdex 75 column equilibrated with 25 mM sodium phosphate at pH 7.0, 250 mM NaCl, and 0.03% NaN3. The protein eluted with a retention volume equivalent to about 13 kDa. The solution was concentrated with ultrafiltration centrifugal devices and supplemented with 10% D2O to a final sample volume of about 300 µl.
NMR spectroscopy and structure calculation. The NMR samples contained 2 mM of nsp1(13-128). NMR spectra were collected at 298 K with Bruker Avance 600-MHz and Avance 800-MHz spectrometers equipped with TXI HCN z-gradient probes. The sequence-specific resonance assignment (66) has been described elsewhere (1). The input for the structure calculation consisted of the chemical shift list obtained from the resonance assignment, a 3D 15N-resolved 1H,1H nuclear Overhauser effect spectroscopy (NOESY) spectrum, and two 3D 13C-resolved 1H,1H NOESY spectra optimized for the aliphatic and aromatic 13C regions. The nuclear Overhauser effect (NOE) data were measured at 800 MHz with a mixing time of 60 ms. For the peak picking of the NOESY spectra, NOE assignment, and structure calculation, the stand-alone ATNOS/CANDID program (24, 25) was used in conjunction with the CYANA torsion angle dynamics algorithm (23). The standard protocol with seven cycles of peak picking, NOE assignment, and 3D structure calculation with simulated annealing in torsion angle space (24, 25) was applied. Backbone φ and ψ dihedral angle constraints derived from the Cα chemical shifts (40, 60) were used as supplementary data in the structure calculation. The 20 conformers with the lowest residual CYANA target function values obtained from cycle 7 of the ATNOS/CANDID/CYANA calculation were energy minimized in a water shell with the program OPALp (34, 39), using the AMBER force field (9). The program MOLMOL (33) was used to analyze the protein structure and to prepare the figures showing the NMR structures. Analysis of the stereochemical quality of the models was accomplished using the Joint Center for Structural Genomics validation central suite (http://www.jcsg.org) and the Protein Data Bank validation server (http://deposit.pdb.org/validate).
Steady-state 15N{1H} NOEs were measured with transverse relaxation-optimized spectroscopy (TROSY)-based experiments (55, 69) on a Bruker Avance 600-MHz spectrometer, using a saturation period of 3 s and an interscan delay of 5 s.
Accession numbers. The chemical shifts have been deposited in the BioMagResBank (http://www.bmrb.wisc.edu) under accession number 7014. The atomic coordinates of the bundle of 20 conformers used to represent the nsp1 structure have been deposited in the Protein Data Bank (http://www.rcsb.org/pdb) with the code 2GDT, and those of the conformer closest to the mean coordinates have the code 2HSX.
Since nsp1 has no identifiable sequence similarity with proteins with known three-dimensional structures, it was not possible to predict the domain structure of this protein based on sequence comparisons. However, the presence of flexibly disordered regions in the protein identified by 1H NMR spectroscopy (see "Characterization of the full-length SARS-CoV nsp1" below) was consistent with the results of secondary structure prediction, which indicated that a few residues at the N terminus, as well as a greater number of residues in the C-terminal one-third of the protein, would not adopt regular secondary structure. To investigate the boundaries of the globular domain and to optimize conditions for protein expression, sample preparation, and NMR structure determination, we designed a set of truncated variants of nsp1, bearing in mind the results of secondary structure predictions for this protein. The variant constructs have molecular masses of 12.7 kDa to 19.5 kDa, not including the N-terminal tag of 3.5 kDa (Table 1). These constructs were used to transform E. coli strains Rosetta(DE3), BL21(DE3) RIL, and BL21(DE3), and the recombinant proteins were expressed in a microshaker at 37°C, 27°C, and 18°C. The best growth rates and expression levels were obtained with the strain BL21(DE3). Table 1 provides a survey of the expression results with six different nsp1 constructs. Most of the protein in the samples expressed at 37°C was insoluble for all six variants. For two constructs, higher yields of soluble protein were obtained at 27°C, but the best results were achieved with expression at 18°C, where most of the expressed protein was in the soluble fraction.
TABLE 1. Summary of the recombinant production of nsp1 variants in BL21(DE3) E. coli cells
Since cysteine residues are susceptible to oxidation and formation of intermolecular disulfide bonds, which can lead to unstable and heterogeneous protein samples, we also investigated variant constructs of nsp1(13-128) with Cys 52 replaced by Ala, Ser, Arg, or Asp as part of our initial target optimization strategy, using 1D 1H NMR and circular dichroism spectroscopy to evaluate their foldedness and stability. The variants with Ser 52, Asp 52, or Arg 52 were thus found to be unstable. The variant with Cys 52 replaced by Ala led to a stable, folded protein. However, after observing excellent sample stability of the wild-type protein despite the single Cys residue, we chose the wild-type protein for the structure determination.
The parameters in Table 2 show that a well-defined NMR structure of nsp1(13-128) was obtained. Above-average local disorder is limited to the C-terminal heptapeptide segment of residues 122 to 128 and to a disordered loop of residues 77 to 86 (Fig. 1a). The structure of intact nsp1 includes a globular domain of residues 13 to 121 and the disordered regions of residues 1 to 12 and 122 to 179 (Fig. 1b).
TABLE 2. Input for the structure calculation and characterization of the bundle of 20 energy-minimized CYANA conformers representing the NMR structure of nsp1(13-128)
FIG. 1. (a) Bundle of 20 energy-minimized CYANA conformers of nsp1(13-128). In this stereo view, the polypeptide backbone is shown as a gray spline function through the Cα positions. Selected sequence positions are identified by numerals. (b) Ribbon representation of the closest conformer to the mean coordinates of the bundle of 20 conformers used to represent the NMR structure. The β-strands are cyan, the helices are red, and polypeptide segments with nonregular secondary structure are gray. The regular secondary structures are further identified by lettering. The polypeptide segments shown in green represent the additional, structurally disordered polypeptide segments of the full-length nsp1.
The regular secondary structure elements of nsp1(13-128) follow the sequence order β1-α1-β2-3₁₀-β3-β4-β5-β6. There is a mixed parallel/antiparallel six-stranded β-barrel, where the spatial arrangement of the β-strands is β1-β2-β5-β3-β4-β6, and β1 makes contact with β6 (Fig. 2 and 3). The β-strands consist of residues 15 to 21, 52 to 56, 69 to 73, 87 to 92, 104 to 110, and 117 to 124. The helix α1 with residues 36 to 49 is located across one barrel opening, and the 3₁₀-helix of residues 62 to 64 is positioned alongside the barrel. A search of the Protein Data Bank using the structure of nsp1 as input for the DALI server (26) did not indicate statistically significant structural similarity to any other protein described to date.
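The sequential order of the secondary structure elements follows directly from the residue ranges listed above. The following Python sketch makes that bookkeeping explicit. The short labels (b1 to b6, a1, 3_10), the dictionary layout, and the helper names are illustrative choices for this sketch and are not part of the original analysis; the spatial strand order is copied from the text rather than computed from coordinates.

```python
# Residue ranges of the regular secondary structure elements of nsp1(13-128),
# as listed in the text. Labels are shorthand chosen for this sketch:
# b = beta-strand, a = alpha-helix, 3_10 = 3(10)-helix.
ELEMENTS = {
    "b1": (15, 21),
    "a1": (36, 49),
    "b2": (52, 56),
    "3_10": (62, 64),
    "b3": (69, 73),
    "b4": (87, 92),
    "b5": (104, 110),
    "b6": (117, 124),
}

# Spatial arrangement of the strands around the barrel, copied from the text.
SPATIAL_ORDER = ["b1", "b2", "b5", "b3", "b4", "b6"]


def sequence_order(elements):
    """Return the element labels sorted by their first residue number."""
    return [name for name, _ in sorted(elements.items(), key=lambda kv: kv[1][0])]


def element_lengths(elements):
    """Return the number of residues in each element (inclusive ranges)."""
    return {name: last - first + 1 for name, (first, last) in elements.items()}


def barrel_neighbors(order):
    """Return adjacent strand pairs in a closed barrel (last strand meets first)."""
    return [(order[i], order[(i + 1) % len(order)]) for i in range(len(order))]
```

Sorting by first residue recovers the order β1-α1-β2-3₁₀-β3-β4-β5-β6, and closing the spatial order into a ring yields the β6/β1 contact mentioned in the text.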
FIG. 2. Two stereo views of the globular domain of nsp1. (a) Ribbon presentation of the closest conformer of nsp1 to the mean coordinates of the bundle in Fig. 1a, shown in the same orientation as in Fig. 1a. The organization of the β-strands in the barrel is indicated by the labels. (b) Same as panel a after rotation about a horizontal axis, so that one looks at one side of the β-barrel; the axes of the β-barrel and the helix α1 are nearly perpendicular to each other, and those of the barrel and the 3₁₀-helix are nearly parallel to each other.
FIG. 3. Two topology diagrams of the nsp1 mixed parallel/antiparallel six-stranded β-barrel (see text). The numbering indicates the first and last residues of each β-strand.
A β-barrel can be characterized by the number of strands, n, and the shear number, S. These two parameters determine the mean tilt (α) of each strand, which is the angle between the barrel axis and the line adjusted for best fit to the N, Cα, and C' atoms of each strand (43, 46, 47). S must be an even integer because of the hydrogen bonding pattern between the β-strands (46). Using standard values for the mean Cα-Cα distance along the strands (a = 3.3 Å) and between the strands (b = 4.4 Å), the following geometric relations characteristic of β-barrel structures in proteins have been proposed (43):
$$\tan \alpha = \frac{S a}{n b} \tag{1}$$
$$R = \frac{b}{2 \sin(\pi/n) \cos \alpha} \tag{2}$$
In equation 2, R is the mean radius of the barrel, which is determined from the Cα atoms of the three residues in opposite strands that are closest to the central part of the barrel (47).
The β-barrel in nsp1 contains six strands and has a shear number (S) of 10. The measured tilt of the strands to the barrel axis (α) ranges from 38° (β2) to 78° (β4), with an average value of 60°. The wide variation among the tilt angles of the individual strands shows that the β-barrel of nsp1 is markedly irregular (Fig. 2a and b).
The residues used to calculate the radius of the nsp1 barrel are 18 to 20, 53 to 55, 71 to 73, 86 to 88, 106 to 108, and 122 to 124, which give a mean barrel radius (R) of 7 ± 1 Å. Overall, for nsp1 the theoretical tilt angle of 51° calculated from equation 1 differs from the observed average of 60°, whereas the theoretical mean barrel radius of 7 Å calculated from equation 2 is in close agreement with the observed value of 7 ± 1 Å.
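The two theoretical values quoted above can be checked by hand. The worked calculation below assumes that equations 1 and 2 have the standard form of the β-barrel relations from reference 43, tan α = Sa/(nb) and R = b/[2 sin(π/n) cos α], and inserts n = 6, S = 10, a = 3.3 Å, and b = 4.4 Å. It is a consistency check under that assumption, not a reproduction of the authors' own calculation.

```latex
\begin{align*}
\tan\alpha &= \frac{S\,a}{n\,b}
            = \frac{10 \times 3.3}{6 \times 4.4}
            = \frac{33}{26.4} = 1.25
            \quad\Rightarrow\quad \alpha \approx 51.3^\circ \\[4pt]
R &= \frac{b}{2\sin(\pi/n)\cos\alpha}
   = \frac{4.4}{2 \times \sin 30^\circ \times \cos 51.3^\circ}
   \approx \frac{4.4}{2 \times 0.5 \times 0.625}
   \approx 7.0\ \text{\AA}
\end{align*}
```

Both results agree with the theoretical values of 51° and 7 Å stated in the text.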
The interior of the nsp1 barrel and the interfaces between the two helices and the barrel surface consist primarily of hydrophobic residues. The arrangement of the side chains inside the barrel is highly compact, as expected for a barrel of six strands, but the inspection of space-filling models suggests that there is a tight cavity along the center of the barrel, with a radius of about 1.2 Å (not shown). The inside of the barrel consists of 17 side chains, which are contributed by all six strands and which are arranged in three layers. One layer contains L105 and the four hydrophilic residues E56, R74, K85, and R120. The four peripheral hydrophilic groups mediate the contacts with the solvent at the barrel opening opposite to helix α1 (Fig. 4) (in the orientation of Fig. 2b, these residues would be at the bottom of the structure). The charged groups of the side chains of these residues are fully solvent exposed, and E56 makes a salt bridge with R120. The side chain of V21 in the second layer and the βCH2-γCH2 fragment of R120 are located between this first layer and the other side chains of the second layer, which is in the narrowest portion of the barrel and includes the all-hydrophobic side chains of residues L54, I72, V87, and V122. A third layer consists of the side chains of residues L17, L19, V70, L89, A91, L108, and L124, which make hydrophobic contacts with the side chains of residues V36, A39, L40, A43, and L47 from the amphipathic helix α1, and the side chains of residues C52, F32, P110, and P68. The second and third layers of β-strand side chains thus combine with the inner side of the helix α1 to form a large hydrophobic core (Fig. 4). It is worth noting that the variant proteins with Cys 52 replaced by Ser, Asp, or Arg were unstable, which is consistent with a disruption of the β-barrel core, as one would predict from the NMR structure.
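The layer assignment described above can be tallied in a few lines of Python as a consistency check on the stated total of 17 interior side chains. The layer names and the simple one-letter-code test for hydrophilic residues are assumptions made for this sketch; V21 is counted with the second layer, as the text describes.

```python
# Interior side chains of the nsp1 barrel, grouped into the three layers
# described in the text. Layer names are labels chosen for this sketch.
LAYERS = {
    "layer1": ["L105", "E56", "R74", "K85", "R120"],
    "layer2": ["V21", "L54", "I72", "V87", "V122"],
    "layer3": ["L17", "L19", "V70", "L89", "A91", "L108", "L124"],
}

# Simplified set of hydrophilic residue types (one-letter codes); this is an
# illustrative classification, not the criterion used in the original work.
HYDROPHILIC_TYPES = set("DEKRHNQST")


def total_side_chains(layers):
    """Count all interior side chains across the layers."""
    return sum(len(residues) for residues in layers.values())


def hydrophilic_by_layer(layers):
    """Return, per layer, the residues whose type is in HYDROPHILIC_TYPES."""
    return {
        name: [res for res in residues if res[0] in HYDROPHILIC_TYPES]
        for name, residues in layers.items()
    }
```

With this grouping, the total comes to 17 and the hydrophilic residues are confined to the first layer, in line with the description of a hydrophobic core formed by the second and third layers.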
FIG. 4. Stereo view of nsp1(13-128) in the same orientation as in Fig. 2b. The side chains in the interior of the barrel are differently colored to visualize their arrangement in three layers, as discussed in the text. The polypeptide backbone is shown as a gray spline function through the Cα positions. Amino acid side chains are shown as stick drawings. Color code: red, residues of layer 1 at the barrel opening opposite to helix α1, where the four hydrophilic residues are in solvent contact; green, residues in the central layer 2; blue, residues of the third layer, which make hydrophobic contacts to the residues shown in magenta at the top, where V36, A39, L40, A43, and L47 originate from the amphipathic helix α1.
β-barrel (38), and the acid protease fold (51). In addition to the apparently unique β-strand topology and the irregular β-barrel geometry, another interesting feature of the nsp1 fold is that the polypeptide chains connecting the β-strands run along the side of the barrel, except for the loop between β3 and β4 (Fig. 2 and 3). This is a rare feature for barrels with n = 6 and S = 10, and besides nsp1, it has been observed only between two β-strands in the ribosomal protein L25-like fold. It is intriguing that none of the aforementioned folds are quite as irregular as that of nsp1. The distortion of the nsp1 structure seems to be related to the polypeptide segments connecting the β-strands across the side of the barrel. Interestingly, although the adjoining ends of strands β5 and β6 are the furthest apart in space of all strand combinations in nsp1 (approximately 15 Å between P110 and I117), they are connected by the shortest polypeptide segment across the side of the barrel (Fig. 2 and 3). This imposes a lower limit on the shear between these strands. The shear number of 10 seems to be the result of a balance between tight hydrophobic packing inside the nsp1 barrel, which is favored by lower shear numbers, and unstrained arrangement of the linker polypeptide segments on the outside of the barrel, which is favored by larger shear numbers. We discuss the β-barrel topology in such detail in order to advance the hypothesis that the outstanding irregularity of the nsp1 β-barrel might be related to a so-far-unknown, possibly entirely novel physiological function of nsp1.
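The trade-off between shear number and packing can be illustrated with the idealized barrel geometry. The Python sketch below assumes that equations 1 and 2 take the standard form tan α = Sa/(nb) and R = b/[2 sin(π/n) cos α], with a = 3.3 Å and b = 4.4 Å as given in the text; the function names are hypothetical. It describes a regular barrel only, so it cannot capture the irregularity of the nsp1 barrel.

```python
import math

A_ALONG = 3.3    # mean Calpha-Calpha distance along a strand, in angstroms (from the text)
B_BETWEEN = 4.4  # mean Calpha-Calpha distance between strands, in angstroms (from the text)


def ideal_tilt_deg(n, s, a=A_ALONG, b=B_BETWEEN):
    """Ideal strand tilt in degrees for n strands and shear number s (assumed eq. 1)."""
    return math.degrees(math.atan((s * a) / (n * b)))


def ideal_radius(n, s, a=A_ALONG, b=B_BETWEEN):
    """Ideal mean barrel radius in angstroms for n strands and shear s (assumed eq. 2)."""
    alpha = math.atan((s * a) / (n * b))
    return b / (2.0 * math.sin(math.pi / n) * math.cos(alpha))


def scan_shear(n, shear_numbers):
    """Return (S, tilt in degrees, radius in angstroms) for each even shear number."""
    rows = []
    for s in shear_numbers:
        if s % 2 != 0:
            # The text notes that S must be an even integer.
            raise ValueError(f"shear number must be even, got {s}")
        rows.append((s, ideal_tilt_deg(n, s), ideal_radius(n, s)))
    return rows
```

Calling `scan_shear(6, [8, 10, 12])` gives tilts of roughly 45°, 51°, and 56° and radii of roughly 6.2, 7.0, and 7.9 Å. In this idealized picture a lower shear number produces a narrower barrel, which is consistent with the statement that tight hydrophobic packing is favored by lower shear numbers.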
The arrangement of the linker polypeptide segments on the outside of the barrel is puzzling also with regard to the folding pathway of nsp1. For example, if the strand β1 formed hydrogen bonds with β2 early during translation, this would also fix the first linker across the barrel, which might limit the ease with which β6 could make hydrogen bonds with β4 and β1. Schemes representing the topology of the β-barrel (Fig. 3) would intuitively suggest that folding starts midway during translation with the formation of a β-hairpin of the strands β3 and β4. In the folded protein, this pair of β-strands forms the least distorted part of the β-barrel, with highly regular hydrogen bonds, and the loop between β3 and β4 is the only one that does not run along the barrel surface. In subsequent folding steps the two-stranded sheets of β2 and β5 and of β1 and β6, respectively, might be formed, which also have quite regular hydrogen bonding in the nsp1 structure. The linkers between β2 and β3 and between β4 and β5 have almost the same lengths, which should help to position β5 close to β2 if β4 is arranged close to β3. The three regular two-stranded β-sheets (Fig. 3a) are connected in the barrel by the formation of irregular hydrogen bonding patterns.
Characterization of the full-length SARS-CoV nsp1. The full-length nsp1 was characterized by comparison of the numbers of backbone 15N-1H correlation peaks and the HN and 15N chemical shifts with those of nsp1(13-128) and by heteronuclear NOE measurements of the truncated and full-length nsp1. The truncated construct nsp1(13-128) has an NMR spectrum with large 1H and 15N chemical shift dispersion (Fig. 5a), which is typical for a well-folded globular domain, where the atoms of different individual amino acid residues experience different local microsusceptibilities due to the nonperiodic nature of the interiors of globular proteins. The spectrum of the full-length protein, nsp1(1-179) (Fig. 5b), contains a set of peaks that overlays very closely with those of nsp1(13-128), showing that the globular domain is contained in both constructs. All the additional peaks have HN chemical shifts of 7.9 to 8.5 ppm, which is the region characteristic of "random-coil" polypeptide chains (66).
FIG. 5. (a) 2D 15N,1H heteronuclear single-quantum coherence (HSQC) spectrum of nsp1(13-128). (b) 2D 15N,1H HSQC spectrum of full-length nsp1(1-179). (c) 2D TROSY-based 15N{1H} NOE experiment with full-length nsp1(1-179), with negative peaks shown in red. The spectra were recorded at a 1H frequency of 600 MHz at 298 K.
0.8 identify residues in the folded cores of small and medium-size globular proteins, with mobility of the individual 15N-1H moieties restricted to the overall rotational tumbling of the molecule. This is illustrated with the 15N{1H} NOE data for nsp1(13-128) (Fig. 6), which also serve as a reference for assessing the state of the additional chain segments in nsp1(1-179). 15N{1H} NOE values of about 0.8 are seen for most of the residues in the regular secondary structure elements (Fig. 6). Increased flexibility of the polypeptide chain that causes reduced NOE intensities is found in the disordered loop between residues 75 and 87 and in the region of residues 94 to 103, which forms nonregular secondary structure with one γ-turn of residues 97 to 99 and a type II β-turn of residues 98 to 101 (Fig. 1). Most of the resonances in full-length nsp1 that are not present in nsp1(13-128) have either small positive or negative 15N{1H} NOEs (Fig. 5c), showing that the polypeptide segments of residues 1 to 12 and 129 to 179 are best described as a short N-terminal and a long C-terminal flexibly disordered tail, respectively (Fig. 1b). Interestingly, it has been determined that the carboxy-terminal half of the related protein MHV nsp1 is not needed for viral replication in culture but is important for efficient proteolytic cleavage between nsp1 and nsp2 and for optimal viral replication (7).
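The reasoning in this paragraph amounts to sorting residues by their 15N{1H} NOE value and counting the residues in the two tails. The Python sketch below illustrates both steps. The cutoff values of 0.65 and 0.3 are arbitrary assumptions chosen for illustration and do not come from the original work, which quotes only values of about 0.8 for the folded core and small positive or negative values for the tails; the function names are likewise hypothetical.

```python
FULL_LENGTH = 179                        # residues in nsp1(1-179), from the text
DISORDERED_TAILS = [(1, 12), (129, 179)]  # flexibly disordered segments, from the text


def classify_noe(value, rigid_cutoff=0.65, disordered_cutoff=0.3):
    """Label a 15N{1H} NOE value; both cutoffs are illustrative assumptions."""
    if value >= rigid_cutoff:
        return "rigid"
    if value >= disordered_cutoff:
        return "flexible"
    return "disordered"


def segment_length(first, last):
    """Number of residues in an inclusive residue range."""
    return last - first + 1


def disordered_fraction(segments, full_length):
    """Fraction of the chain that lies in the listed segments."""
    return sum(segment_length(a, b) for a, b in segments) / full_length
```

With the segment boundaries from the text, the two tails contain 12 and 51 residues, so about 35% of the full-length chain is flexibly disordered.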
FIG. 6. Plot of the 15N{1H} NOE intensities versus the sequence of nsp1(13-128). The data were collected at a 1H frequency of 600 MHz at 298 K. The positions of the regular secondary structure elements are indicated. Each point represents the mean of three measurements, and the error bars represent the standard deviations of the three measurements.
FIG. 7. (a) Amino acid sequence of nsp1, with solvent-exposed residues highlighted in green. A residue is considered to be exposed if at least one atom of its side chain has more than 50% surface accessibility to the solvent. For glycines, the CO and HN exposure is considered. (b) Surface views of nsp1 in a space-filling representation. In the surface view shown on the left, the structure has the same orientation as in Fig. 2b. Some of the surface-exposed side chains discussed in the text are identified with the one-letter amino acid code and the residue number. Color code: gray, hydrophobic and polar residues; red, negatively charged; blue, positively charged. (c) Sequence alignment between SARS-CoV nsp1 and MHV nsp1 identified with the FFAS server. Identical residues are shown in red. Arrows indicate single-amino-acid replacements in MHV p28 that were generated and studied by Brockway et al. (7). Mutations that are detrimental to the viral replication are identified by boldface, while those that are not detrimental are in italic. Residues removed in the truncated variant protein MHV1 nsp1ΔC are shown in lowercase (see text). Residues in β-strands and in helical secondary structures are underlined with solid and dashed lines, respectively.
The most striking result of the alignment of SARS-CoV nsp1 with the polypeptide fragment consisting of residues 46 to 247 of MHV p28 is the observation of a consensus sequence, LRKxGxKG, positioned at the end of strand β6 of the globular domain of SARS-CoV nsp1, which is conserved not only in MHV p28 (Fig. 7c) but also in human CoV OC43 p28. It includes the two residues R125 and K126, which contribute to the positively charged patch on the nsp1 molecular surface (Fig. 7b). If future studies of the p28 proteins of group 2a CoVs should show that these proteins share mRNA degradation activity with SARS-CoV nsp1, this conserved region could be a candidate for mRNA interaction.
Analysis of the nsp1 structure also provides indications for functional differences between the p28 proteins and SARS-CoV nsp1. For example, the motif K109-R110-L111 in MHV p28 was identified by Chen et al. as a potential cyclin-binding motif (8), and SARS-CoV nsp1 lacks residues corresponding to R110 and L111. In addition, Chen et al. identified residues 30 to 33 (S/NPER) of p28 as a potential site for phosphorylation by cyclin-dependent kinases (8). These residues occur in an N-terminal 45-residue segment of p28 that appears not to be homologous to SARS-CoV nsp1. The propensity to induce cell cycle arrest may therefore be unique to MHV p28, or possibly to the group 2a p28 proteins in general, and it might not be shared by SARS-CoV nsp1 even if it turned out that these proteins all share a similar fold.
In other comparisons, no significant sequence identity between SARS-CoV nsp1 and the nsp1 (p9) proteins of the group 1 CoVs could be detected. These results are consistent with the analysis by Snijder et al. (59), who described nsp1 as a specific marker of group 2 CoVs. The p9 proteins of group 1 CoVs most likely differ from those of group 2 CoVs in both structure and function.
The MHV1 p28 protein was subjected to a mutagenesis study by Brockway et al. (7), who generated single-amino-acid replacements and truncated versions of this protein and studied their impact on viral replication in cultured cells. Among the mutations found to affect viral replication, only some occur in residues conserved between MHV1 p28 and SARS-CoV nsp1 (Fig. 7c). Deletion of the entire p28 protein or of the polypeptide segment from residue 87 to 164 of MHV p28 prevented the virus from productively infecting cultured cells (7). If MHV p28 and SARS-CoV nsp1 did indeed share a similar fold, the latter construct would lack most of the globular domain. In contrast, the carboxy-terminal half of MHV p28 (residues 124 to 241) has been shown to be dispensable for replication in culture, but it is important for efficient proteolytic cleavage of the protein and for optimal viral replication. If MHV p28 were to contain regular secondary structures similar to those of SARS-CoV nsp1, removal of the polypeptide segment from residue 124 to 241 would correspond to the loss of the strands β3, β4, β5, and β6, as well as of the flexibly disordered C-terminal tail, which would appear to entail a considerable disruption of the protein fold. The following considerations might help to resolve the apparent ensuing discrepancies. First, the increased flexibility and lack of a globular fold in the C-terminal region of the protein may ensure accessibility of the protease recognition site between nsp1 and nsp2 but may not be directly involved with the activity exerted by the protein. Second, it appears that the strands β1 and β2 and the helix α1 might provide for a sufficiently stable fold to maintain the so-far-unidentified biological activity, in particular if one assumes that the additional N-terminal 45-residue segment of MHV p28, which is not homologous to SARS-CoV nsp1, could participate in a globular fold and help to stabilize the shortened protein.
In conclusion, this paper shows that the SARS-CoV protein nsp1, which is encoded at the 5' terminus of the genome, forms a previously unknown complex β-barrel fold with several unique structural features. We hypothesize that the uniqueness of the irregular β-barrel fold may be related to a so-far-unknown, unique biological function of nsp1. The definition of the globular region of nsp1 and the identification of residues on the molecular surface likely to contribute to mRNA degradation activity may provide a platform for continued research on the role of this protein in SARS-CoV and in other coronaviruses.
This study was supported by NIAID/NIH contract no. HHSN266200400058C "Functional and Structural Proteomics of the SARS-CoV" to P. Kuhn and M. J. Buchmeier and by the Joint Center for Structural Genomics through NIH/NIGMS grant no. U54-GM074898. Additional support was obtained for M.S.A. through the Pew Latin American Fellows Program in the Biological Sciences and the Skaggs Institute for Chemical Biology and for M.A.J. through a fellowship from the Canadian Institutes of Health Research and the Skaggs Institute for Chemical Biology. Kurt Wüthrich is the Cecil H. and Ida M. Green Professor of Structural Biology at TSRI and a member of the Skaggs Institute for Chemical Biology.
Published ahead of print on 3 January 2007.
chemical shifts in protein structure determination. J. Magn. Reson. 109:229-233.
5' exoribonuclease that is critically involved in coronavirus RNA synthesis. Proc. Natl. Acad. Sci. USA 103:5108-5113.
and Cβ 13C nuclear magnetic resonance chemical shifts. J. Am. Chem. Soc. 113:5490-5492.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»