Previous Article | Next Article ![]()
Journal of Virology, February 2009, p. 1823-1836, Vol. 83, No. 4
0022-538X/09/$08.00+0 doi:10.1128/JVI.01781-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.

Departments of Molecular Biology,1 Molecular and Integrative Neurosciences,2 Cell Biology,3 Chemistry,4 Skaggs Institute for Chemical Biology, The Scripps Research Institute, 10550 North Torrey Pines Rd., MB-44, La Jolla, California 92037,5 School of Biological Sciences, University of Reading, Whiteknights, RG6 6AJ Reading, United Kingdom6
Received 22 August 2008/ Accepted 23 November 2008
|
|
|---|
|
|
|---|
Although the SARS outbreak has been contained by public health measures, a vaccine against the virus is still elusive, and the continued search for effective drug treatments is tightly linked to ongoing research on the virus and the proteins associated with it. We determine atomic resolution three-dimensional (3D) structures of proteins encoded by the SARS viral genome to provide a basis for the design of biochemical assays that might unravel some or all of the protein functions and establish structure-function relationships for SARS-CoV proteins. A special focus is on the 213-kDa protein nsp3 (27), which is the largest nonstructural SARS-CoV protein, with 1,922 amino acid residues that correspond to the segment 819 to 2740 of pp1a (GenBank accession number NP_828862; gi:34555776) (44). Based on considerations of phylogenetic conservation and amino-acid-sequence-based secondary structure prediction, SARS-CoV nsp3 has been annotated as a multidomain protein (27) consisting of a minimum of seven domains, nsp3a to nsp3g (27, 38). So far, three SARS-CoV nsp3 domains have been structurally and biochemically characterized: nsp3a (residues 1 to 183) has a ubiquitin-like fold and is an RNA-binding protein with affinity for single-stranded RNA (ssRNA) (37), nsp3b (residues 184 to 351) is a poly(ADP-ribose)-binding protein and has ADP-ribose-1"-phosphatase activity (7, 33), and nsp3d (residues 723 to 1037) contains a ubiquitin-related fold and is a papain-like protease involved in the proteolytic processing of the polyproteins pp1a and pp1ab (30). This paper describes a nuclear magnetic resonance (NMR) structure determination and a preliminary functional annotation for part of the region described as the "SARS-unique domain" (SUD), nsp3c.
Nsp3c, which is the polypeptide segment of the nsp3 residues 366 to 722, has been termed the SUD to reflect its apparent uniqueness to the SARS-CoV (38). From previous work, there have been indications that the SUD may comprise more than one structural domain (4, 42), and nucleic acid-binding activity has been attributed to the carboxy-terminal region of SUD, which is conserved among several bat coronaviruses (27, 42, 51). In this paper, we describe the structure of a globular domain, SUD-M, in the center of the SUD, which has been shown to fold independently and has long-term stability in aqueous solution (4). This sequence segment has less than 30% amino acid identity with known proteins, except for the corresponding polypeptide segments in SARS-like and HKU3-like bat coronaviruses (27), which have more than 90% sequence identity but for which no 3D structures have as yet been determined. It is, however, of interest that SARS-CoV SUD-M shows 16 to 28% sequence identity with homologous regions in group IIc and group IId bat coronaviruses, as such an identity, although small, might indicate an evolutionary development of the SUD. The NMR structure determination of SUD-M residues 513 to 651 [SUD-M(513-651)] now reveals that this polypeptide forms a globular domain of residues 528 to 648, which is flanked by a flexibly extended N-terminal tail of residues 513 to 527 and a C-terminal flexible tail of residues 649 to 651. To investigate possible effects of the unstructured N-terminal tail on the globular domain, we then also determined the NMR structure of the construct SUD-M(527-651). A search of the Protein Data Bank (PDB) for folds homologous to SUD-M and NMR screening of likely reaction partners of SUD-M were then performed for an initial functional annotation.
|
|
|---|
NMR data acquisition and chemical shift assignment.
NMR measurements were performed at a temperature of 298 K with Bruker Avance 600, DRX 700, and Avance 800 spectrometers equipped with TXI-HCN-z- or TXI-HCN-xyz-gradient probe heads. The NMR experiments acquired for obtaining the sequence-specific resonance assignments of SUD-M(513-651) were described previously (4). For the SUD-M(527-651) protein, the new automated projection spectroscopy (APSY) technology was used. Four-dimensional APSY-HNCOCA, four-dimensional APSY-HACANH, five-dimensional APSY-CBCACONH, and five-dimensional APSY-HACACONH data sets were recorded and analyzed with the software GAPRO (9, 13, 14). The resulting peak lists were used as input for the software MATCH (45) for automated polypeptide backbone assignments. The side-chain assignments for the nonaromatic residues were based on 3D 15N-resolved [1H,1H]-total correlation spectroscopy (TOCSY) (
m = 60 ms), 3D HC(C)H-TOCSY (35), 3D 15N-resolved [1H,1H]-nuclear Overhauser effect spectroscopy (NOESY) (
m = 60 ms) (41), and 3D 13C-resolved [1H,1H]-NOESY (
m = 60 ms) (24) experiments. The assignment of the aromatic side-chain resonances was based on 3D 13C-resolved [1H,1H]-NOESY (
m = 60 ms) and two-dimensional (2D) [13C,1H]-heteronuclear single quantum coherence spectroscopy (HSQC) experiments (24, 52). Proton chemical shifts were referenced to internal 3-(trimethylsilyl)-1-propanesulfonic acid sodium salt (DSS). The 13C and 15N chemical shifts were referenced indirectly to DSS using the absolute frequency ratios (49).
Steady-state 15N{1H}-nuclear Overhauser enhancements (NOEs) for studies of high-frequency dynamics were measured using transverse relaxation optimized spectroscopy (TROSY)-based experiments (31, 54) with a Bruker Avance 600 spectrometer with a saturation period of 3.0 s and a total interscan delay of 5.0 s.
The interaction of SUD-M(527-651) with ADP-ribose and ssRNA was evaluated by comparison of the 2D [15N,1H]-HSQC spectra of SUD-M(527-651) recorded in the presence and absence of ssRNA or ADP-ribose using the uniformly 15N-labeled protein at a 0.4 mM concentration. The ssRNAs used were the homodecamers of uridine [poly(U10)], guanosine [poly(G10)], and adenosine [poly(A10)].
Structure calculation from the NMR data.
The structure calculations were performed with the software ATNOS/CANDID/DYANA (10-12). The standard protocol of seven cycles of automated NOESY peak picking and NOE cross-peak identification with ATNOS (12), automated NOE assignment with CANDID (11), and structure calculation with the torsion angle dynamics algorithm contained in CYANA (10) were performed. In the second and subsequent cycles, the intermediate protein structure was used as an additional guide for the interpretation of the NOESY spectra (11, 12). Backbone
and
dihedral angle constraints derived from the 13C
chemical shifts were used as supplementary data for the NOE upper distance constraints in the input for the structure calculation (22, 39). The 20 conformers with the lowest residual CYANA target function values obtained from the seventh ATNOS/CANDID/CYANA cycle were energy minimized in a water shell with the program OPALp (17, 21) using the AMBER force field (6). The program MOLMOL (18) was used to analyze the ensemble of 20 energy-minimized conformers. The stereochemical quality of the models was analyzed using the Joint Center for Structural Genomics validation central suite (http://www.jcsg.org) and the PDB validation server (http://deposit.pdb.org/validate).
Enzyme assays. The NTPase activity was evaluated by monitoring the phosphate released when ATP or GTP was added to SUD-M(527-651) by using an Enzchek assay (Molecular Probes Inc., Eugene, OR) according to the manufacturer's instructions. This assay uses a method described previously by Webb (47), in which the release of inorganic phosphate is monitored by the coupling of the phosphatase reaction with the purine nucleoside phosphorylase conversion of the substrate 2-amino-6-mercapto-7-methyl purine riboside (MESG) to 2-amino-6-mercapto-7-methyl purine and ribose-1-phosphate. MESG has an absorbance maximum of 330 nm, whereas that of the product is 360 nm. The reaction mixture contained 50 mM Tris (pH 7.5), 1 mM MgCl2, 0.1 mM sodium azide, 200 mM MESG, 1 U purine nucleoside phosphorylase, and 5 µM SUD-M(527-651). It was checked for activity by adding variable amounts of ATP or GTP. No phosphate release was detected by monitoring the absorbance at 360 nm.
Bioinformatics. The following amino acid sequences were used for alignments: transmissible gastroenteritis virus (GenBank accession number NP_840002), bat coronavirus BtCoV-HKU8 (accession number YP_001718611), BtCoV-1B (accession number YP_001718596), BtCoV-HKU2 (accession number YP_001552234), porcine epidemic diarrhea virus (accession number NP_598309), BtCoV-512/2005 (accession number YP_001351683), human coronavirus HCoV-229E (accession number NP_073549), HCoV-NL63 (accession number YP_003766), SARS-CoV (accession number AAP41036), BtCoV-Rm1 (accession number YP_001382397), BtCoV-HKU5 (accession number YP_001039961), BtCoV-HKU9-1 (accession number YP_001039970), and BtCoV-HKU9-3 (accession number ABN10926). Homology searches were carried out using BLASTP 2.2.18+ (40). Alignments were performed using ClustalW 2.0 (19) and displayed using JalView (5). Coronavirus naming and abbreviation follow ICTV conventions where possible or follow the abbreviation proposed in the first publication of each virus otherwise.
Electrophoretic mobility shift assays. Purified SUD-M(527-651) was mixed with ssRNA substrate in an assay buffer containing either 150 mM NaCl (physiological salt concentration) or 56 mM NaCl (low salt) in addition to 50 mM sodium phosphate at pH 6.5, 7% glycerol, and 4 mM MgCl2. The following custom-synthesized RNA oligomers (Integrated DNA Technologies, Inc., San Diego, CA) were tested: (ACUG)5; the homopolymers A10, A15, C10, and U10; 5'-CCCGAUACCC-3', which contains the core GAUA sequence that was shown to bind to nsp3a (37); 5'-CUAAACGAAC-3', which is the leader transcription regulatory sequence from the SARS-CoV genome [TRS(+)]; and 5'-GUUCGUUUAG-3', which is the leader transcription regulatory sequence from the SARS-CoV antigenome [TRS(–)]. Protein-nucleic acid mixtures were incubated for 45 min either at room temperature or at 37°C and then analyzed by native electrophoresis on precast 6% acrylamide DNA retardation gels (Invitrogen). Nucleic acid was detected by SYBR gold poststain (Invitrogen) and photographed using a UV light source equipped with a digital camera. SYBR gold was rinsed out, and protein was subsequently detected by SYPRO ruby poststain (Invitrogen).
Protein structure accession numbers. The chemical shifts of SUD-M(527-651) have been deposited in the BioMagResBank (http://www.bmrb.wisc.edu) under accession number 15618. The atomic coordinates of the two ensembles of 20 conformers used to represent the SUD-M(513-651) and SUD-M(527-651) structures have been deposited in the PDB (http://www.rcsb.org/pdb) under accession numbers 2RNK and 2JZD, respectively, and a single representative conformer for each protein (the conformer with the lowest root mean square deviation [RMSD] from the mean coordinates of the ensemble) have been deposited under accession numbers 2JZF and 2JZE, respectively.
|
|
|---|
For each of the two proteins, the input for the NMR structure determination consisted of a 3D 15N-resolved [1H,1H]-NOESY spectrum and two 3D 13C-resolved [1H,1H]-NOESY spectra optimized for the aliphatic and the aromatic spectral regions and of the chemical shift lists taken from the previously reported sequence-specific resonance assignments for SUD-M(513-651) (4) and from the presently obtained assignments for SUD-M(527-651) (BioMagResBank accession number 15618). The near identity of the overlapping parts in the two sets of chemical shifts is visualized in Fig. 1 by the [15N,1H]-HSQC spectra of SUD-M(513-651) and SUD-M(527-651).
![]() View larger version (35K): [in a new window] |
FIG. 1. Superposition of the 2D [15N,1H]-HSQC spectra of SUD-M(513-651) (red) and SUD-M(527-651) (blue). The protein concentrations were 1.2 mM and 1.4 mM for SUD-M(513-651) and SUD-M(527-651), respectively. The solvent contained 25 mM sodium phosphate buffer at pH 6.5, 150 mM NaCl, and 2 mM NaN3. The spectra were recorded at a 1H frequency of 600 MHz and a temperature of 25°C, with 256 increments in the 15N dimension and 4 scans/increment. The resonance assignments for SUD-M(527-651) are marked in blue, where the assignments for the crowded central region are shown as an insert in the lower right corner. Residue –1 indicates the methionine residue of the tetrapeptide segment –4GSHM–1 that is left after thrombin cleavage (see the text). The side-chain amide resonances of asparagine and glutamine are connected by blue horizontal lines.
|
|
View this table: [in a new window] |
TABLE 1. Input for NMR structure calculations of the SUD-M(513–651) and SUD-M(527–651) proteins, statistics of the convergence of the CYANA structure calculations, and characterization of the bundle of 20 conformers used to represent the NMR structures
|
-helices, and one 310-helix in the sequential order β1-
1-β2-
2-β3-β4-
3-β5-310-
4-β6-
5 (Fig. 2). The first regular secondary structure element is the short-strand β1 formed by residues 528 to 530, which is in SUD-M(513-651), preceded by a flexible N-terminal tail of residues 513 to 527 (Fig. 3). The lengths of all regular secondary structures are marked in Fig. 2d. It is interesting that the structure contains two well-defined long loops with nonregular secondary structure, containing residues 564 to 571 between helix
2 and strand β3 and residues 606 to 616 between strand β5 and helix
4.
![]() View larger version (37K): [in a new window] |
FIG. 2. NMR structure of SUD-M(527-651). (a) Stereo view of the polypeptide backbone of a bundle of 20 energy-minimized conformers superimposed for the minimal RMSD value of the backbone atoms of residues 528 to 648. The helical regular secondary structures are red, the β-strands are green, and the polypeptide segments with no regular secondary structure are gray. Selected sequence positions are identified by numerals. (b) Stereo view in the same orientation as described above (a), of a ribbon presentation of the closest conformer of SUD-M(527-651) to the mean coordinates of the bundle above (a). The regular secondary structures are identified. (c) Same as above (b) after a 90° rotation about a horizontal axis. (d) Topology of the regular secondary structures in SUD-M(527-651). β-Strands are shown as gray arrows, helices in the front of the β-sheet are in black, and helices behind the β-sheet are represented by white rectangles. The numbers represent the starts and the ends of the individual regular secondary structure elements.
|
![]() View larger version (28K): [in a new window] |
FIG. 3. (a) NMR structure of SUD-M(513-651). The polypeptide backbone of a bundle of 20 energy-minimized conformers has been superimposed for the minimal RMSD value calculated for the backbone atoms of residues 528 to 648. The flexibly extended N-terminal tail of residues 513 to 527 and the C-terminal flexible tail of residues 649 to 651 are red. (b) Relative 15N{1H}-NOE intensities plotted versus the sequence of SUD-M(513-651). Diamonds represent the experimental measurements, which are linked in sequential order by straight lines. Gaps represent either proline residues or residues for which the 15N-1H correlation peak could not be integrated because of spectral overlap. The experiment was recorded at a 1H frequency of 600 MHz using a saturation period of 3.0 s and a total interscan delay of 5.0 s. The red line represents a cutoff at 0.6; residues with values below this cutoff value are identified as having high-frequency intramolecular mobility. Positions of the regular secondary structures are indicated at the bottom of the figure.
|
Internal mobility in SUD-M(513-651). The intramolecular flexibility on the subnanosecond time scale was characterized for the polypeptide backbone of SUD-M(513-651) by 15N{1H}-NOE measurements (Fig. 3b). For residues 528 to 648, positive NOE values of 0.6 or larger show that the mobility of the backbone 15N-1H moieties is essentially limited to the overall tumbling of the molecule. For residues 513 to 527 and 649 to 651, NOE values in the range of –0.6 to 0.5 indicate increased high-frequency mobility. These results were interpreted to indicate that the central SUD-M region with residues 528 to 648 forms a compact globular domain with flexibly extended polypeptide segments attached at both chain ends.
SUD-M(527-651) is an RNA-binding protein with affinity for purine bases. As part of a search for a functional annotation of SUD-M(527-651), we performed NMR chemical shift perturbation experiments by comparing the [15N,1H]-HSQC spectrum of SUD-M(527-651) in the absence of potential ligands to that in the presence of potential ligands, such as ssRNA, ATP, GTP, and ADP-ribose. The motivation for choosing ssRNAs came from recent studies by Neuman et al. (27) and Tan et al. (42), which showed that the SUD binds RNA. Nucleoside triphosphates (NTPs) and ADP-ribose were selected based on the observation of structural homology between SUD-M and various NTPases, with the closest structural similarity to SARS-CoV nsp3b, which displays ADP-ribose-binding activity (7, 33).
The addition of the ssRNA poly(G10) led to extensive precipitation of the protein, which may be rationalized by the fact that poly(G) ssRNA is much less water soluble than single-stranded poly(A) or poly(U). The addition of poly(U10) had a measurable effect only on the chemical shifts of residue L533 (Fig. 4b). The addition of poly(A10) resulted in significant shifts of 11 peaks corresponding to the residues G527, W531 to L533, I556 to Q561, and V611 (Fig. 4a). These residues are marked by green lines above the sequence in Fig. 8 and highlighted in magenta in the space-filling model of the structure shown in Fig. 5a. It is seen that all the perturbed residues are located at or near a putative ligand-binding cleft (see also below), with residues N532, L533, I556, T559, and V611 within the cleft and residues M557, A558, I560, and Q561 in helix
2, adjacent to the cleft. None of the 15N-1H correlation peaks in the HSQC spectrum of SUD-M(527-651) showed significant chemical shift changes upon the addition of ATP, GTP, or ADP-ribose (Fig. 4c). From these data, we conclude that SUD-M(527-651) is a poly(A) ssRNA-binding protein and does not bind either NTPs or ADP-ribose.
![]() View larger version (12K): [in a new window] |
FIG. 4. Superposition of pairs of 2D [15N,1H]-HSQC spectra of 0.4 mM SUD-M(527-651) (solvent composed of 25 mM sodium phosphate buffer at pH 6.5, 150 mM NaCl, and 2 mM NaN3) recorded in the absence (red peaks) and presence (blue peaks) of 0.4 mM of three different ligands: poly(A10) ssRNA (a), poly(U10) ssRNA (b), and ADP-ribose (c). (a and b) The peaks that show chemical shift changes after the addition of the ligand are identified. The spectra were recorded on a Bruker DRX 700 spectrometer with a 1.7-mm TXI HCN z-gradient probehead at a temperature of 25°C; 256 increments in the 15N dimension were accumulated, with 32 scans per increment.
|
![]() View larger version (42K): [in a new window] |
FIG. 8. 3D structure-based sequence alignment of the SUD-M protein with its closest structural homologues, as identified through a DALI (16) search of the PDB with SUD-M(527-651). PDB accession numbers are given in parentheses. Above the sequence, the locations of the regular secondary structures in SUD-M are indicated by cylinders for helices and by arrows for β-strands. The aligned residues are highlighted in red, and conserved sequence motifs described in the text are indicated in boldface type. The residues that form the adenosine-binding cleft in nsp3b and the corresponding residues in SUD-M are boxed. The loops that form the ribose-binding cleft in nsp3b (see the text) and the corresponding regions in SUD-M are underlined (SUD-M has a three-residue deletion at the location corresponding to loop 3). The residues that show chemical shift changes upon the addition of poly(A10) ssRNA (see the text and Fig. 4a) are marked by green lines above the sequence.
|
![]() View larger version (30K): [in a new window] |
FIG. 5. Space-filling models of the NMR structure of SUD-M(527-651). (a) Regions affected by poly(A10) ssRNA binding (data from Fig. 4a) are highlighted in magenta. (b) The residues in positions structurally corresponding to those that contact the ADP-ribose ligand in nsp3b are highlighted. (c) Display of the electrostatic surface potential, with positive and negative electrostatic charges represented in blue and red, respectively. (d) Nsp3b (PDB accession number 2ACF). Shown is the same presentation of the electrostatic surface potential as that in panel c. (c and d) The black circle surrounds the ligand-binding clefts discussed in the text. Selected residues within the cleft of SUD-M(527-651) and in the active site of nsp3b are identified.
|
![]() View larger version (64K): [in a new window] |
FIG. 6. Investigation of RNA binding by electrophoretic mobility shift assay (see the text). Data are given for poly(A15), poly(U10), poly(A10), (ACUG)5, TRS(+), TRS(-), and 5'-CCCGAUACCC-3' (GAUA). These single-stranded oligonucleotides were incubated with various concentrations of SUD-M either at room temperature (a and d) or at 37°C (b and c) before analysis by native polyacrylamide gel electrophoresis. Lane designations indicate the final concentration of protein and RNA or the presence of a double-stranded DNA marker (DNA). The binding assays in a and b were carried out in low-salt buffer (50 mM phosphate at pH 6.5 containing 56 mM NaCl, 7% glycerol, and 4 mM MgCl2), and those in c and d were carried out using buffer containing physiological salt concentrations (50 mM phosphate at pH 6.5 with 150 mM NaCl, 7% glycerol, and 4 mM MgCl2). Nucleic acid was detected by SYBR gold staining (left), and protein was detected by SYPRO ruby staining (right). White arrowheads indicate the electrophoretic mobility of SUD-M, and black arrowheads indicate free nucleic acid. Complexes of intermediate mobility are indicated by a gray filled bracket.
|
|
|
|---|
SUD-M(527-651) forms a macrodomain fold.
In a search of the PDB for proteins with 3D structural similarity to SUD-M(527-651), the program DALI (15, 16) identified more than 300 structures with a z score larger than 2.0, which is a value that indicates "overall fold similarity" (16). The closest match was found for macrodomains, and DALI z scores of
5 were also obtained for various helicases and NTP-binding proteins (Table 2). As a first result from our homology studies, we thus found that the polypeptide fold of SUD-M(527-651) corresponds to a macrodomain fold (1, 3, 25): the six β-strands in the arrangement 165243 form the protein core, whereby the third β-strand is oriented antiparallel to the other strands (25) and the
-helices form an outer layer of the protein architecture (Fig. 2b and c).
|
View this table: [in a new window] |
TABLE 2. Protein structures with z values of 5 identified by a DALI search of the PDB with SUD-M(527–651)
|
-helix inserted between helix
2 and strand β3, and a 310-helix inserted between strands β4 and
3. These differences are highlighted in Fig. 7 with yellow coloring of the sequence insertions in nsp3b (Fig. 8) that have no matching residues in SUD-M(527-651). The close 3D structure homology visualized in Fig. 7 is remarkable considering that the sequence homology in the matching segments amounts to only 5% (Fig. 8).
![]() View larger version (38K): [in a new window] |
FIG. 7. Stereo view of a ribbon presentation showing a superposition of the NMR structure of SUD-M(527-651) (red) and the X-ray structure of nsp3b (33) (gray). The following residues for the superposition were identified with the software DALI (15), yielding an RMSD value of 2.9 Å for the C atoms of these residues: residues 527 to 541, 544 to 552, 553 to 556, 557 to 566, 567 to 570, 572 to 575, 576 to 579, 580 to 586, 588 to 597, 599 to 625, and 626 to 649 in SUD-M(527-651) and residues 199 to 213, 214 to 222, 227 to 230, 232 to 241, 244 to 247, 263 to 266, 268 to 271, 274 to 280, 290 to 299, 300 to 326, and 328 to 351 in nsp3b. The insertions in the sequence of nsp3b are highlighted in yellow (see the text). Selected sequence positions are identified by black numerals for nsp3b and red numerals for SUD-M(527-651).
|
![]() View larger version (50K): [in a new window] |
FIG. 9. Conservation of SUD-M in bat coronavirus lineages. (a) Multiple-sequence alignment of domains homologous to SARS-CoV SUD-M. Homologies are highlighted with clustalx conservation coloring, and sequences are numbered from the first residue of nsp3. (b) Schematic representation of the homology between macrodomains found in coronavirus nsp3 and eukaryotic and prokaryotic organisms. Genetic homology (BLAST) (blue), structural homology (DALI) (red), and combined homology (violet) are indicated. Coronavirus subgroup nomenclature was taken from http://www.ncbi.nlm.nih.gov/Taxonomy/Browser. PEDV, porcine epidemic diarrhea virus; TGEV, transmissible gastroenteritis virus; FCoV, feline coronavirus.
|
In the complex with nsp3b, the adenosine moiety of ADPR is located in a cleft surrounded by residues D204, I205, V231, A234, P307, A336, and N338 (7). The corresponding residues in SUD-M (marked by boxes in Fig. 8) lead to a quite different surface topology than that of the corresponding area in nsp3b. It was further observed that the ribose-binding site adjacent to the adenosine-binding area in the catalytic center of nsp3b is surrounded by three loops, with residues 310SAGIF314, 228GGG230, and 282LNA284 (underlined in Fig. 8). These three loops form a groove that accommodates the ribose moiety. In SUD-M, the residues corresponding to the first two loops are 609GYVTH613 and 553VRA555, and the third loop is deleted (Fig. 8), which again contributes to differences in the protein surface topology compared to that of nsp3b. Furthermore, in nsp3b, the hexapeptide segment consists of the second loop and the following residues: 228GGGVAG233, which is reminiscent of a Walker A motif (46), and GX4GK[T/S] (X can be any residue), which forms the NTP-binding site in P-loop NTPases (34). In SUD-M, it is the residues 609GYVTHG614 and 548PICMD552 that are reminiscent of Walker A and Walker B motifs (46), but although these sequence motifs are part of the presently discussed putative SUD-M active site, they are not in the same relative positions as in P-loop NTPases (34).
Overall, it is interesting to note that the comparisons with the well-characterized nsp3b and its substrate complex revealed the presence of potentially functional sequence motifs in SUD-M(527-651), although these potentially functional elements are not properly arranged in the 3D structure to confer nsp3b-like enzymatic activity to the SUD. The outcome of this part of the homology investigations is fully compatible with data for the experimental functional assays, which showed that SUD-M does not have either NTPase activity or affinity for the binding of ADP-ribose.
3D structure homology of SUD-M(527-651) with non-SARS-CoV proteins. Although SARS-CoV nsp3b is its closest structural homologue, SUD-M also shows significant similarity to other classes of NTP-binding proteins. Thus, for example, comparison with the hepatitis C virus helicase (PDB accession number 1HEI) (53) yielded a DALI z score of 3.5 and revealed similarity to the catalytic domain of the helicase. However, a comparison of the sequences and the 3D structures of the two proteins shows that SUD-M lacks the characteristic "DEXH" (where X can be any residue) helicase sequence (36). Similar conclusions resulted from comparisons with other proteins so that a putative functional assignment for SUD-M remains elusive also on the basis of comparisons with non-SARS-CoV proteins.
Progress with the structural coverage of nsp3. The data in this paper are yet another step toward a complete structural characterization of SARS-CoV nsp3, which was initially annotated as consisting of seven domains, nsp3a to nsp3g (27, 38). Figure 10 summarizes the current structural coverage of the N-terminal half of nsp3, which includes globular domain structures solved by NMR and by X-ray crystallography, as well as flexibly disordered linker segments characterized by NMR 15N{1H}-NOE measurements. Two white boxes in Fig. 10 indicate polypeptide segments with unknown folds that are currently under investigation.
![]() View larger version (14K): [in a new window] |
FIG. 10. Structural coverage of the N-terminal half of the 1,922-residue nsp3. Initially annotated domains are marked above the thick line, and the numbers below this line represent the residues that bound the individual domains. Circles indicate globular folds, with the NMR structures in green and the crystal structures in blue. The curved thick lines represent flexibly disordered segments that were characterized by NMR 15N{1H}-NOE measurements either as an N-terminal attachment of the nearest globular domain (red) or as a C-terminal tail (black). The white boxes indicate domains that are currently being investigated by NMR. The residues binding the individual structural entities are indicated below the circles and the boxes. The dotted line represents SUD-M(513-651), and the broken line represents SUD-M(527-651). UB1 and UB2 are ubiquitin-like folds. AC is a region rich in acidic residues. ADRP is an ADP-ribose-1"-phosphatase. SUD-N (MBD) (27), SUD-M, and SUD-C represent three structural regions of the SARS-unique domain. PL2pro is a papain-like protease.
|
K.W. is the Cecil H. and Ida M. Green Professor of Structural Biology at the Scripps Research Institute.
This is the Scripps Research Institute manuscript 19411.
Published ahead of print on 3 December 2008. ![]()
|
|
|---|
chemical shifts in protein structure determination. J. Magn. Reson. 109:229-233.[CrossRef]
and Cβ 13C nuclear magnetic resonance chemical shifts. J. Am. Chem. Soc. 113:5490-5492.[CrossRef]This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»