Previous Article | Next Article ![]()
Journal of Virology, September 2008, p. 8647-8655, Vol. 82, No. 17
0022-538X/08/$08.00+0 doi:10.1128/JVI.00298-08
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Tsinghua-Nankai-IBP Joint Research Group for Structural Biology, Tsinghua University, Beijing 100084, China,1 College of Life Sciences and Tianjin State Laboratory of Protein Sciences, Nankai University, Tianjin 300071, China,2 National Laboratory of Biomacromolecules, Institute of Biophysics (IBP), Chinese Academy of Sciences, Beijing 100101, China3
Received 11 February 2008/ Accepted 10 June 2008
|
|
|---|
|
|
|---|
Group 2A HCoVs were less widely studied prior to the global SARS epidemic in 2003. However, they are closely associated with a wide range of acute or chronic respiratory syndromes (3, 4, 7-9, 11, 12, 15, 20, 22, 35, 39, 40, 47). In the wake of the SARS outbreak, several novel HCoVs have been discovered, one of which is HCoV-HKU1 (9, 39). HCoV-HKU1 has achieved global distribution since it was first identified in 2005: infections were first characterized in Hong Kong (26), followed by the identification of several strains of the virus in Korea (9), Europe (5, 17), Australia (31), and North America (14). In contrast to the lethal SARS-CoV, infection by HCoV-HKU1 usually leads to self-limiting syndromes affecting the lower respiratory tract. Nevertheless, the consequences could be more severe in patients with a compromised or immature immune system, such as asthma sufferers or newborn infants (24). Genome sequencing has confirmed that the HCoV-HKU1 virus belongs to CoV group 2A and shares high sequence homology with MHV and BCoV (39).
The functional components of the CoV replication machinery are released via posttranslational cleavage by two or three proteases. These proteases were first designated the papain-like protease (PLP) and 3C-like protease (3CL) for their respective sequence homology to the papain and rhinovirus 3C proteases. The 3CL protease also is commonly known as the main protease (Mpro) because of the major role it plays in the proteolytic pathway, which makes it the most attractive pharmacological target for anti-CoV drug design. CoV Mpros have been intensively studied, and crystal structures have been determined for the Mpros from the following CoVs: HCoV strain 229E (HCoV-229E) (2), porcine TGEV (1), avian IBV (41), and SARS-CoV (44). These structures are representative of group 1 (HCoV-229E and TGEV), group 2B (SARS-CoV), and group 3 (IBV) CoVs. However, no structure of the Mpro from a group 2A CoV (MHV, HCoV-HKU1, and HCoV-OC43) has been determined to date. The absence of structural data presents a major obstacle for structure-aided drug optimization targeting group 2A CoVs.
The Mpros from different CoV groups are homologous in both sequence and main-chain architecture. They share a similar substrate binding sequence, with a requirement for glutamine at the P1 position and a strong preference for leucine/methionine at P2. Based on this information, broad-spectrum lead compounds (43) with micromolar Ki values have been designed that target CoV Mpros. However, structural data for the Mpros from classical group 2A CoVs still are not available, posing a problem for further optimization.
Although CoV Mpros exhibit absolute specificity for glutamine in the P1 position, recent research (38) has shown that the Mpro from HCoV-HKU1 may possess an unusual substrate preference at P1 site quite different from that of other CoV Mpros. Here, we report the structure of HCoV-HKU1 Mpro, which serves as a model for group 2A CoVs in complex with a synthetic peptidomimetic inhibitor, N3. The structure and subsequent enzyme activity assays help to resolve the issue of the relative conservation at the P1 position based on genome sequencing. Moreover, this complex structure provides further structural data for rational drug design against HCoVs.
|
|
|---|
Protein expression and purification. The plasmid was first transformed into the commercial Escherichia coli strain BL21(DE3) Rosetta (Invitrogen). After incubation at 37°C overnight on an Amp+ algae Luria-Bertani (LB) plate, fresh transformants were inoculated into 5 ml LB medium in the presence of 100 µg/ml ampicillin. After growth for 12 h, the incubation system was scaled up to 1 liter LB medium with the same concentration of antibiotics in a 2-liter flask, and the solution was shaken vigorously at 37°C until the optical density at 600 nm reached 0.6. Cells were induced by 0.5 mM isopropyl-β-D-thiogalactopyranoside (Sigma) at 16°C overnight.
Cell pellets were harvested by centrifugation, resuspended in 40 ml phosphate-buffered saline buffer with 2 mM dithiothreitol and 7 mM β-mercaptoethanol, and sonicated on ice for 25 min. The supernatant was collected after the centrifugation of the sonicant at 15,000 rpm for 40 min.
Affinity purification was achieved by letting the supernatant flow through 2 ml glutathione S-transferase affinity medium twice. On-column digestion lasted for 16 h at 4°C with thrombin (New England BioLabs), and the protein of interest was harvested and concentrated to 30 mg/ml. The N3 inhibitor then was added to a final molar ratio of 1:1 and incubated at 4°C overnight. Finally, the HCoV-HKU1 Mpro-inhibitor complex was purified by gel filtration using a Superdex 200 (10/30) column (GE Healthcare). The protein concentration was adjusted to 20 mg/ml for crystallization trials.
Crystallization and structure determination. Crystals of HCoV-HKU1 Mpro were grown in 0.1 M imidazole, pH 6.0, and 0.6 M sodium acetate by the hanging drop vapor diffusion method. Synchrotron X-ray diffraction data were collected on beamline BL-5A of the Photon Factory (Tsukuba, Japan) and processed to 2.5-Å resolution, using HKL2000 (29) for data indexing and scaling. Molecular replacement using the SARS Mpro structure (Protein Data Bank entry 2AMQ; 48% identity) as a template was performed with PHASER (32). The manual rebuilding of the structure was performed using Coot (13), and the structure was refined using REFMAC in the CCP4 suite (10). Final modification was carried out using CNS (6). The volume of the S1 cavity was calculated using VOIDOO (25).
Enzyme activity assays. Substrates and analogs were designed through three rounds of affinity optimization (42) by substrate mimicry and from a library of substrate analogs. The substrate and analogs were synthesized by Dawei Ma from the Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai, China.
The strategy employed for enzyme activity assays of HCoV-HKU1 Mpro has been described previously (43). Activity assays for HCoV-HKU1 Mpro against the CoV consensus substrate and the HCoV-HKU1-specific substrate followed a similar protocol, which is described briefly below. The consensus substrate and HCoV-HKU1-specific substrates were fluorescent compounds with the sequences MCA-AVLQSGFR-Lys(Dnp)-Lys-NH2 and MCA-PRLHCTTN-Lys(Dnp)-Lys-NH2, respectively (greater than 95% purity; GL Biotech Shanghai Ltd., Shanghai, China). A P1 single-mutant substrate also was synthesized with sequence MCA-AVLHSGFR-Lys(Dnp)-Lys-NH2.
The excitation and emission wavelengths of the fluorescent substrates were 320 and 405 nm, respectively. A buffer consisting of 50 mM Tris-HCl (pH 7.3) and 1 mM EDTA was used for enzyme activity assays at 30°C. The reaction was initiated by adding protease (final concentration, 2 µM) to a solution containing different final concentrations of the substrate (3.2 to 40 µM). Strict kinetic parameters for the inhibition assay were determined according to the previously reported protocol (43). All results from enzyme activity assays were calculated using data based on at least three independent parallel experiments.
Coordinate accession number. Coordinates and structure factors for the HKU1 Mpro in complex with inhibitor N3 have been deposited in the Protein Data Bank under entry ID 3D23.
|
|
|---|
-helical cluster that is unique to CoV Mpro. The catalytic site, including the Cys-His dyad, and the relatively shallow substrate binding pocket of HCoV-HKU1 Mpro are located in the cleft between domains I and II. The substrate-binding pocket features two deeply buried sites (P1 and P2) and several sites with different levels of solvent exposure (P3, P4, and P5) (Fig. 1b). X-ray data collection and refinement statistics are summarized in Table 1.
![]() View larger version (34K): [in a new window] |
FIG. 1. (a) Structural overview of four protomers (A, green; B, cyan; C, magenta; and D, yellow) in one asymmetric unit, represented as cartoons. N3 inhibitors are shown as blue sticks. (b) Structural overview of the enzyme-inhibitor complex of one monomer unit. The main chain of the enzyme is represented as blue cartoons, and the synthetic inhibitor is shown as yellow sticks. The three domains are labeled.
|
|
View this table: [in a new window] |
TABLE 1. X-ray data-processing and refinement statistics
|
atom of Cys145 in the inhibitor-bound HCoV-HKU1 Mpro structure. We conclude that this reaction can be categorized as an electrophilic addition mediated by a Michael acceptor, obeying the Ki – k3 kinetics, where Ki is the dissociation constant and k3 is the turnover number, according to the following scheme:
![]() | (1) |
![]() | (2) |
![]() View larger version (59K): [in a new window] |
FIG. 2. (a) Details of the interaction between the P1 side chain and the defined oxyanion loop, shown in stereo representation. Side chains are shown as sticks, and the crucial hydrogen bond between His163 and the substrate side chain is shown by a cyan dashed line. (b) Details of the substrate-binding pocket. The inhibitor is shown in the following color scheme: C, white; O, red; and N, blue. The crucial residues of the enzyme are shown in the following color scheme: C, cyan; O, red; N, blue; and S, yellow. Hydrogen bonds are shown as red dashed lines.
|
atom of Cys145, we observe well-defined amides from the loop from residues 142 to 145. Similarly to human rhinovirus 3C proteases (28), these amide dipoles construct a tetrahedral oxyanion hole. From native and complex structures of SARS-CoV Mpro, the correct orientation of these backbone amides is triggered and maintained by substrate binding, in particular by the binding of the P1 residue and interaction between the N finger and the substrate (2, 44), in which the backbone carbonyl of Leu141 is hydrogen bonded to the side chain oxygen of Ser144. The correct position of Leu141 is maintained by a hydrogen bond between the carbonyl group of Phe140 and the amide group of the P1 side chain and by hydrophobic stacking between His163 and Phe140. Although the analysis of the HCoV-HKU1 Mpro structure in complex with N3 (Fig. 2a) shows that the P1 side chain exerts no direct influence on the residues forming the oxyanion hole, its side chain oxygen atom forms a strong hydrogen bond (2.6 Å) with His163 and helps to strengthen the stacking interaction with Phe140. Furthermore, the nitrogen atom of the P1 side chain also forms a hydrogen bond (3.1 Å) with the backbone of Phe140, thus helping to maintain the oxyanion loop (Phe140-Cys145) in its proper conformation. For the above reasons, we conclude that the P1 side chain is important for the network of interactions stabilizing the oxyanion hole.
The S1 pocket has a smaller size to accommodate P1 histidine.
Given its crucial role in the catalytic process, glutamine outperforms other residues as the signature of the Mpro substrate at the P1 position. In addition to this advantage, the side chain of glutamine in the P1 position suitably fits with residues forming the S1 subsite via Van der Waals interactions (Fig. 2b). From the HCoV-HKU1 genome sequence, 11 out of 12 Mpro recognition sites have Gln at the P1 position. In our structure, the N3 molecule has a lactam ring as an analog to the glutamine residue (the cross-linking between the C
and N atoms helps to select the stretching conformation from the ensemble of rotamers and better occupy the binding cleft). In HCoV-HKU1 Mpro structures, the lactam ring protrudes into the S1 pocket via a hydrogen bond to the imidazole ring NH of His162 at a distance of 2.6 Å. However, unlike the SARS-CoV Mpro structure in complex with N3, the NH of the HKU1-N3 lactam ring fails to recruit a water molecule to satisfy a second S1 hydrogen bond. Instead, the N-terminal O
atom might provide a weak electronegative interaction to stabilize the NH atom; the interaction likely is stronger due to the presence of redundant residues as a cloning artifact, hindering the N-terminal Ser from coming any closer to the NH of P1 side chain.
Nevertheless, compared with the Mpros from other CoV groups, the structure of the HCoV-HKU1 Mpro has an S1 pocket with a relatively smaller volume of
18.1 Å3. In contrast, the volume of the S1 pocket of TGEV Mpro is
19.1 Å3, that of of IBV Mpro is
21.7 Å3, and that of SARS-CoV Mpro is
19.5 Å3. The reduced size of the S1 pocket might be caused by the position of the loop Leu167-Cys171, which is bent upward by about 90°. As a result, the smaller S1 pocket might tolerate mutation to short-chain residues at the P1 position, in which case a weakened oxyanion hole is to be expected. Novel substrate specificity already has been found in the HCoV-HKU1 genome, in which the Mpro recognition site between the helicase and exonuclease utilizes histidine instead of glutamine at the P1 position. Mimicking proteolysis in the cell, enzyme activity assays using a synthetic fluorogenic substrate confirm the existence of such a cleavage event in vitro and exhibit novel enzymatic properties not seen with the consensus substrate (Table 2).
|
View this table: [in a new window] |
TABLE 2. Activity assay of HCoV-HKU1
|
The S2 pocket presents group-specific features but no group-specific substrate preferences. The P2 side chain of the ligand protrudes into the S2 pocket via interactions with the hydrophobic side chains of Met25, Pro52, and Tyr54 (Fig. 2B). The lid of the pocket is covered by a short 310 helical region from Ser45-Asn51. To compare the diversity of the S2 pockets of all three CoV groups, the backbones of Mpro complex structures from all groups were superimposed (Fig. 3): for group 1, TGEV Mpro in complex with the inhibitor N1, an ancestor of N3; for group 3, IBV Mpro in complex with N3; and for group 2B, SARS-CoV in complex with inhibitor N3. We observed three modes of secondary structure: the 310 helix (HCoV-HKU1 and SARS-CoV), a loose loop (IBV), and a tight loop (TGEV). Interestingly, the clustering of the secondary structure correlates with the temporary classification of CoVs. We then explored the natural recognition sequences to examine whether the group-specific features could result in different substrate specificities at the P2 site (Table 3). After summarizing the P2 residue type in the protease recognition sites of the HKU1 PP1ab genome, we observe that Mpros prefer a hydrophobic residue at this position, which is also the case for SARS-CoV, IBV, and TGEV. Although there are a few exceptions, such as asparagine or valine residues, leucine/methionine are the most abundant. This is consistent with the observation from our structure that the hydrophobic P2 side chain extends into the deep S2 site without clashing with the Van der Waals surface of the pocket (Fig. 2b). Therefore, on the one hand, considering the flexibility of S2 pocket as well as the residual space after occupation by the P2 residue, the optimal choice for leucine or methionine might be related to the size of the S2 pocket. On the other hand, the similar preferences on S2 sites among group 1, 2a, 2b, and 3 CoV Mpros does challenge the efficacy of designing group-specific inhibitors by altering only P2 moieties.
![]() View larger version (53K): [in a new window] |
FIG. 3. S1 and S2 binding sites of HCoV-HKU1 Mpro main chains of four Mpro structures are superimposed and displayed in the neighborhood of the substrate-binding site. The S1 and S2 binding sites are highlighted by light green shadows. The main chains are represented in worm forms. Different colors are used to represent the strain of CoV. Lemon, synthetic compound; magenta, HCoV-HKU1; light green, SARS-CoV; light blue, TGEV; and yellow, avian IBV.
|
|
View this table: [in a new window] |
TABLE 3. P2 residues in different CoV genomes
|
![]() View larger version (66K): [in a new window] |
FIG. 4. (a) Overview of the P3 pocket. The inhibitor resides between the interface of molecules B and D. A cyan surface model is shown covering protomers B and D. The inhibitor is shown in magenta. (b) P3 interaction site of substrate in detail. Neighboring residues within 4 Å of the S3 site are colored green. An Fo-Fc map contoured at 1.5 around the inhibitor is displayed in cyan. For the inhibitor, C atoms are colored yellow, N atoms are colored blue, and O atoms are colored red. The protein carbon atoms are colored gray. Neighboring main chains are displayed as white ribbons.
|
|
|
|---|
![]() View larger version (52K): [in a new window] |
FIG. 5. (a) Sequence alignment of a typical Mpro from CoV group 2A exhibits high homology. The alignment was performed with ClustalW (33), and the final figure was generated with ESPript1.0 (18). White letters with red backgrounds refer to identical residues, red letters with white backgrounds refer to conservative variation, and black letters with white backgrounds refer to nonconservative mutations. (b) Three-dimensional representation of nonconserved mutations in group 2A CoV Mpros mapped onto the HCoV-HKU1 Mpro structure. Identical residues are colored white, conserved mutations are colored yellow, and nonconserved mutations are colored red. The inhibitor is colored blue.
|
![]() View larger version (65K): [in a new window] |
FIG. 6. Superposition of representative CoV Mpro in complex with Michael acceptor-based inhibitors of group 1 (TGEV; blue), group 2A (HCoV-HKU1; green), group 2B (SARS-CoV; magenta), and group 3 (IBV; red). The color of each inhibitor is consistent with that of its host.
|
Considering the high identity shared by group 2A CoVs, these structural features of HCoV-HKU1 Mpro, together with corresponding enzyme activity assays, will help to profile HCoV-HKU1 and other newly emerging etiologic agents from this group of CoVs.
This work was supported by Project 973 of the Ministry of Science and Technology of China (grant numbers 2006CB806503 and 2007CB914301), the National Natural Science Foundation of China (grant numbers 30221003 and 30730022), the Sino-German Center [grant number GZ236(202/9)], the Sino-European Project on SARS Diagnostics and Antivirals (SEPSDA) of the European Commission (grant number 003831), and the Tsinghua University Ph.D. student innovation fund.
Published ahead of print on 18 June 2008. ![]()
Supplemental material for this article may be found at http://jvi.asm.org/. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»