Previous Article | Next Article ![]()
Journal of Virology, June 2002, p. 5949-5958, Vol. 76, No. 12
0022-538X/02/$04.00+0 DOI: 10.1128/JVI.76.12.5949-5958.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Department of Virology II, National Institute of Infectious Diseases, Shinjuku, Tokyo 162-8640, Japan
Received 7 November 2001/ Accepted 20 March 2002
|
|
|---|
|
|
|---|
The NLV genome contains three open reading frames (ORFs). ORF1 encodes a large polyprotein (nonstructural protein), which is processed by the viral protease into six proteins, including putative 2C-like RNA helicase (with NTPase activity, but not RNA helicase activity, proven [35]), 3B VPg, 3C-like protease, and 3D RNA-dependent RNA polymerase. ORF2 encodes a capsid protein (structural protein), and 180 molecules of the protein are organized into a native NLV virion (36). ORF3 encodes a small basic protein, which was recently shown to be a minor structural protein included in a virion (13). When expressed in insect cells by using the baculovirus system, ORF2 alone is capable of producing virus-like particles lacking the genomic RNA (21, 30).
The 3C-like proteases from NLVs and the 3C proteases from picornaviruses are assumed to have chymotrypsin-like folds (2, 10, 14, 32, 39). The X-ray structures of picornaviral 3C proteases from poliovirus (34), rhinovirus (33), and hepatitis A virus (HAV) (1, 3) support this assumption. However, while the active-site nucleophile of chymotrypsin is a serine residue within the GDSG sequence, that of 3C(-like) proteases is a cysteine residue within the GDCG or GMCG sequence. Therefore, 3C(-like) protease is called chymotrypsin-like protease or serine-like protease (10, 14). 3C(-like) proteases constitute a clearly distinct group in the chymotrypsin-like protease family and are different from papain-like cysteine proteases.
Several studies by site-directed mutagenesis have been addressed to identify active-site amino acid residues of picornavirus 3C proteases (7, 15, 17-19, 25, 29), which corresponded to the catalytic triad (His57, Asp102, and Ser195) of chymotrypsin (12). The X-ray crystal structures revealed that amino acid residues His40, Glu71, and Cys147 formed the catalytic triad in the 3C poliovirus protease (34), and His40, Glu71, and Cys146 formed the catalytic triad in the rhinovirus 3C protease (33). However, in the HAV 3C protease, the side chain of Asp84 corresponding to Asp102 of chymotrypsin was directed away from the active site (1, 3). Therefore, it was proposed that the catalytic dyad (His44 and Cys172) was mandatory for the proteolysis.
On the other hand, as for 3C-like protease in the family Caliciviridae, the active-site amino acid residues of 3C-like protease from rabbit hemorrhagic disease virus (RHDV) were determined by site-directed mutagenesis study (5). It was indicated that the catalytic triad composed of His27, Asp44, and Cys104 was critical for the protease activity.
The Chiba virus (Hu/NLV/Chiba407/1987/JP) is one of the NLVs that was isolated in Chiba Prefecture, Japan (43). We have already cloned and sequenced the whole genome of the Chiba virus (40). The Chiba virus 3C-like protease expressed in Escherichia coli has been shown to retain enzymatic activity and substrate specificity (40). In this study, we attempted to identify active-site amino acid residues of the Chiba virus 3C-like protease by using the 3BC fragment derived from Chiba virus ORF1 to assess the protease activity. Thirty-seven charged amino acid residues and putative nucleophile Cys139 (designated by the amino acid number of the 3C-like protease) within the GDCG sequence were subjected to site-directed mutagenesis, and the mutant 3BC construct was tested to see whether or not it was separated into 3B and 3C when expressed in E. coli. It was revealed that the catalytic dyad composed of His30 and Cys139 played a critical role in the proteolysis.
|
|
|---|
Bacterial strains. E. coli JM109 (45) and TG1 (42) were used for genetic manipulation. E. coli CJ236 (27) was used for the preparation of deoxyuracil-containing single-stranded DNA. E. coli BL21-CodonPlus-RIL (Stratagene) was used for expression of the constructs derived from the Chiba virus ORF1.
Plasmids. The gene fragment encoding the 3B (VPg)-3C (protease) region was amplified by PCR with pUCCVORF1 (40) as a template. The forward primer (5'-ACTAGCGCTGGTAAAAACAAAGGAAAGACCAAG-3') was designed so as to introduce an Aor51HI site prior to the first codon for the Gly1 of VPg. The reverse primer (5'-TTTGCATGCTTACTCTAGGGTGGTTTCACCTTC-3') was designed in order to introduce a stop codon (TAA) and a PaeI (SphI) site after the last codon for the Glu181 of protease. After verification by DNA sequencing, the Aor51HI-PaeI fragment containing the entire 3BC region was cloned into the corresponding sites of pUCHisNd (40) to construct pUCHis3BC, on which the expression of the His-tagged 3BC construct was controlled by the lac promoter (Fig. 1B). The SacI-PaeI fragment encoding part of the VPg and the entire protease was subcloned in pUC119 (45), and the resultant plasmid, designated pUCCV3BC3059, was used for site-directed mutagenesis.
![]() View larger version (22K): [in a new window] |
FIG. 1. Schematic representation of the Chiba virus ORF1 polyprotein and construction of the expression plasmids. (A) Map of the putative proteolytic cleavage sites within ORF1. The numbers indicate the positions of the amino acids. Slashes indicate the cleavage sites. The calculated molecular masses of the cleavage products are also indicated. (B) Plasmid pUCHis3BC expressing the His-3BC fragment. An extra sequence (underlined) containing six consecutive histidine residues was fused to the N terminus of VPg. The His-3BC protein ends with the last residue (Glu181) of the 3C-like protease. (C) Plasmid pUC3CD expressing the 3CD fragment and its related N-terminal truncation mutant plasmids. The 3CD protein starts with the first residues (Ala1) of the 3C-like protease after the initial Met residue and ends with the last residue (Val508) of the 3D RNA-dependent RNA polymerase. The 3CD- N5, 3CD- N8, and 3CD- N11 proteins start with Trp6, Val9, and Phe12, respectively.
|
N5, pUC3CD-
N8, and pUC3CD-
N11, respectively (Fig. 1C). Site-directed mutagenesis. Site-directed mutagenesis was based on the method of Kunkel (27) obtained with the Mutan-K mutagenesis kit (Takara). Deoxyuracil-containing single-stranded DNA of pUCCV3BC3059 was prepared with the E. coli CJ236 strain and used as a template. Mutagenic primers are listed in Table 1. Usually, mutations were first detected by the appearance of a new restriction site or the disappearance of the original restriction site. These mutations did not affect the desired amino acid sequences, which were finally verified by DNA sequencing. The SacI-PaeI fragment from mutated pUCCV3BC3059 was cut out and ligated with the SacI-PaeI vector fragment from pUCHis3BC, resulting in mutant pUCHis3BC plasmids.
|
View this table: [in a new window] |
TABLE 1. Mutagenic primers used for site-directed mutagenesis
|
Expression of the 3BC and 3CD constructs. E. coli BL21-CodonPlus-RIL was transformed with each of the wild-type pUCHis3BC plasmids, mutant pUCHis3BC plasmids, or pUC3CD derivatives. E. coli cells were grown at 37°C. The transformant was first grown in 3 ml of 2xYT broth (3ba). One hundred microliters of the culture was inoculated into 30 ml of fresh 2xYT broth, and the culture was further grown overnight (about 16 h.). Isopropyl-ß-D-thiogalactopyranoside (IPTG) was added to a final concentration of 0.5 mM and incubated for 3 h. Cells were harvested, washed with 20 mM Tris-HCl (pH 7.4), and resuspended with 3 ml of the same buffer. After sonication, the cell lysate was centrifuged to remove unbroken cells, and the supernatant was used as a sample. The protein concentration was determined by the Bradford method (6), with bovine serum albumin as a standard.
Western blotting. The supernatant containing 10 µg of proteins was used for sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE), followed by electroblotting onto a nitrocellulose membrane. The His-tagged proteins were detected by mouse anti-His monoclonal antibody (Sigma, St. Louis, Mo.) and alkaline phosphatase-linked goat anti-mouse immunoglobulin G (IgG) (Jackson ImmunoResearch Laboratories, West Grove, Pa.). The Chiba virus 3C-like protease was detected by rabbit antiserum raised against the purified His-tagged 3C-like protease (40) and alkaline phosphatase-linked goat anti-rabbit IgG (Vector Laboratories, Burlingame, Calif.). Rabbit antiserum raised against keyhole limpet hemocyanin conjugates of the synthetic peptide derived from RNA polymerase (40) was used for the detection of the Chiba virus RNA polymerase.
|
|
|---|
![]() View larger version (67K): [in a new window] |
FIG. 2. Amino acid alignments of the Chiba virus 3C-like protease. (A) Alignment of the 3C-like proteases of NLVs for which whole genomic sequences had been determined. Chiba virus has 37 charged amino acid residues; 22 out of 37 residues indicated by large letters are conserved in all NLV 3C-like proteases. The putative nucleophile Cys139 is also indicated by a large letter. Asterisks indicate conserved residues. Dots indicate similar amino acid residues. The numbers in parentheses indicate the amino acid identities with the Chiba virus 3C-like proteases. Accession numbers: Chiba virus, AB042808; Norwalk virus, M87661; Southampton virus, L07418; BS5 strain, AF093797; Jena virus, AJ011099; Lordsdale virus, X86557; Camberwell virus, AF145896; Hawaii virus, U07611. (B) Alignment with the RHDV 3C-like protease and the picornaviral 3C proteases whose structures had been determined. The Chiba virus 3C-like protease was aligned with the RHDV (M67473), the HAV (P14553), rhinovirus (P03303), and poliovirus (P03300) 3C(-like) proteases. The roles of the residues whose background color is black or gray were discussed in the text.
|
![]() View larger version (46K): [in a new window] |
FIG. 3. Western blot analyses of alanine-scanning mutants of the His-3BC constructs. Ten micrograms of the proteins from E. coli BL21-CodonPlus-RIL cells harboring the wild-type or mutant pUCHis3BC plasmid were separated by SDS-PAGE, followed by electroblotting. Relevant proteins were detected by antiprotease antiserum (A), or anti-His monoclonal antibody (B). Mutations were represented by using one-letter codes of amino acids. Numbers indicate the positions of amino acid residues. Letters preceding the number and following the number indicate the wild-type amino acid and the introduced amino acid, respectively.
|
To investigate the role of Cys139, this residue was replaced with Ser, Thr, Tyr, and Met. As shown in Fig. 4, only the Ser mutant had protease activity comparable to that of the wild-type enzyme. Although Thr and Tyr, like Ser, have the OH group and Met has a lone pair of electrons on the sulfur atom, these mutants exhibited no activity. Since the Thr residue has an additional methyl group bound to the ß carbon compared with the Ser residue, rotation of the side chain is assumed to be limited in the folded molecule. The side chain of the Tyr residue is much bulkier than those of the Cys and Ser residues. Therefore, the OH group of Thr or Tyr would not be able to work as a nucleophile. Even if a lone pair of electrons on the side chain of the Met residue might work as a nucleophile under certain conditions and a nucleophilic attack against a peptide carbonyl happened, a methyl cation, compared with a proton, would not be stabilized by its environment and would not be suitable as a leaving group. Thus, the presence of the flexible SH or OH group is required for the side chain of the amino acid residue at position 139, supporting the idea that Cys139 is a nucleophile of the active site.
![]() View larger version (46K): [in a new window] |
FIG. 4. Western blot analyses of the Arg8, His30, Lys88, Arg89, Asp138, Cys139, and His157 mutants of the His-3BC constructs. Experiments were performed as described in the legend to Fig. 3.
|
His30 was replaced with Tyr, Gln, Asn, Glu, Asp, and Arg. All of the mutants exhibited no protease activity (Fig. 4). The presence of the imidazole ring was essential for the proteolysis, since any chemical nature (aromaticity, hydrophilicity, the ability to accept a proton, or having a positive charge) of the side chain could not be substituted for a tautomeric property of imidazole. It was thought that both the basic nature of an imidazole ring contributed to accepting a proton released from the nucleophile after an attack against the peptide carbonyl, and the aromaticity of a protonated imidazole ring contributed to delocalization of a positive charge. These results were consistent with the idea that His30 was the general base of the active site.
His157 in the S1 specificity pocket. As described above, His157 was thought to be one of the residues forming the S1 specificity pocket. The corresponding histidine residues in poliovirus and rhinovirus 3C proteases were shown to be involved in recognition of the side chain of the P1-position Gln residue at the cleavage sites (4, 7, 17, 19, 29).
The H157A mutation (CAT
GCT), performed with the H157A primer (Table 1), was not successful. Instead, the Ser and Pro mutants of His157, which had undesired nucleotide changes (CAT
TCT and CAT
CCT, respectively), were spontaneously obtained and characterized here. Furthermore, His157 was replaced with Tyr, Gln, and Arg. All of the His157 mutants, as well as H157A, produced a very slight amount of His-3B (Fig. 4B), although the cleaved protease fragment was not clearly detected by the antiprotease antiserum (Fig. 4A). These indicated that His157 was not essential for the protease activity. Severely reduced protease activity might be due to impaired substrate recognition. The relationship between the substrate specificity of His157 and the amino acid sequence at the cleavage site will be the subject of a future study.
Asp138. Asp138 was the only negatively charged residue found to be essential for the proteolysis by alanine-scanning mutagenesis. When Asp138 was replaced with Asn, Glu, and Met, all mutants had no protease activity (Fig. 4), indicating that the Asp residue is essential at this position in the Chiba virus 3C-like protease. Asp138 is the amino acid residue next to a nucleophile, Cys139. In the X-ray structures of chymotrypsin and picornavirus 3C proteases, the side chain of the residue preceding a nucleophile was oriented away from the catalytic residues in the active site (1, 3, 33, 34, 41) and therefore may not be able to participate in the catalysis. This observation is consistent with the fact that the Asp residue was not conserved at the corresponding position. Asp appears at this position in the chymotrypsin, RHDV 3C-like protease, and poliovirus and rhinovirus 3C proteases, while Met occupies the HAV 3C protease and Tyr occupies the FMDV 3C protease (Fig. 2B). Asp138 may therefore be involved in stabilizing the protein structure through the ionic interaction or hydrogen bonding, although the details of this will not be clear until information on the high-order structure is obtained.
Site-directed mutagenesis performed here did not find the third member carboxylate of the active site corresponding to Asp102 in chymotrypsin. Amino acid alignment shown in Fig. 2B indicated that either of the following two residues of the Chiba virus 3C-like protease might work as the third member carboxylate: Glu54, which corresponded to the third member Asp44 of the RHDV 3C-like protease; or Asp67, which corresponded to Glu71 of poliovirus and rhinovirus 3C proteases. As shown by the crystal structure of the HAV 3C protease (1, 3), its side chain might point away from the active site and might not interact with the general base histidine (His30) in the Chiba virus 3C-like protease. This feature of these two proteases is in marked contrast with those of the chymotrypsin, poliovirus, and rhinovirus 3C proteases. However, in the HAV 3C protease, since Tyr143 is almost perpendicular to the plane of the imidazole of His44 and seems to be deprotonated and negatively charged, it might contribute to the electrostatic stabilization and the delocalization of a positive charge on the imidazole ring (3). Therefore, we cannot rule out the possibility that an anion, as served by the side chain of a tyrosine residue, may stabilize a positive charge on the imidazole ring of His30 or withdraw a proton from the imidazole ring as the third member of the active site in the Chiba virus 3C-like protease.
Lys88 and Arg89. Lys88 was replaced with Arg, Gln, and Leu, and Arg89 was replaced with Lys, Gln, and Leu. As shown in Fig. 4, both the Gln88 and Gln89 mutants were not expressed substantially in E. coli. The Arg88 mutant had significant protease activity, although uncleaved His-3BC remained. In contrast, the Leu88 mutant had no activity. Therefore, it was likely that a positively charged residue at position 88 was important for the protease activity. The Arg residue was likely to be essential at position 89, since the Lys and Leu mutants of Arg89 did not show any protease activity.
When amino acid sequences of the Chiba virus 3C-like protease are aligned with that of the HAV 3C protease, Lys88 and Arg89 of the Chiba virus are likely to correspond to Lys69 and Lys70 of RHDV and Lys105 and Lys106 of HAV (Fig. 2B), which were located on the surface of the enzyme distant from the active site according to the crystal structure of the HAV 3C protease and were implicated in RNA recognition and/or the maintenance of the integrity of the RNA recognition site (3). Although these two Lys residues are not found in the poliovirus and rhinovirus 3C proteases, this may be explained by the difference in local structures (
-helix in HAV [3] versus nonstructured in poliovirus [34] and rhinovirus [33]). In any case, Lys88 and Arg89 of the Chiba virus 3C-like protease are likely to be far from the active site and are not involved in the catalysis.
Effects of Arg8 mutations on 3B/3C and 3C/3D cleavage.
Arg8 was replaced with Lys, Gln, and Leu. None of the mutants showed protease activity (Fig. 4). Therefore, Arg was essential for the proteolysis. According to the amino acid sequence alignment with the picornavirus 3C proteases, Arg8 is likely to correspond to the first or second positively charged residue in the highly conserved sequence motif K(/R)-R(/K)-N-I(/L) (Lys12 or Arg13 in poliovirus 3C, Arg12 or Lys13 in rhinovirus 3C, and Arg10 or Lys11 in HAV 3C), although the motif was not conserved in NLVs and RHDV (Fig. 2B). The crystal structures revealed that this sequence motif formed the last turn of the N-terminal
-helix. Khan et al. (26) previously suggested the role of the motif and proposed the intriguing model of autocatalytic (intramolecular) cleavage at the 3B/3C junction by using the structure of the poliovirus 3C protease. In this model, before the intramolecular or cis cleavage at the 3B/3C site occurs, the region including the C terminus of the 3B VPg and the N terminus of the 3C protease is in an extended conformation, so as to reach into the active site of the protease. After the cleavage, the N terminus dissociates out of the active site and then forms a final conformation identical to the crystal structure. The idea was derived from the crystal structure of the precursor of the
-lytic protease, a serine protease, which underwent autocatalytic cleavage during maturation (37). This model implies that the Arg8 residue is important for the 3B/3C cleavage, but is not necessary for cleavage at the other junctions.
To examine whether or not Arg8 is important for the 3C/3D cleavage, first, the R8A, R8K, R8Q, and R8L mutants were constructed on the basis of the 3CD precursor (Fig. 1C). E. coli cells expressing the wild-type 3CD precursor generated the 20-kDa 3C and the 56-kDa 3D fragments, as detected by the respective antisera (Fig. 5), indicating that the 3C-like protease moiety was enzymatically active. The 60-kDa band in Fig. 5A and B and the 33-kDa band in Fig. 5B were nonspecific ones because they also appeared in the lysate of plasmidless host E. coli cells. As for four Arg8 substitution mutant 3CD precursors, not any 3C-like protease-specific proteolytic events were observed, since only the 75-kDa bands were detected by both antiprotease and antipolymerase antisera (Fig. 5). This result seemed to suggest that the Arg8 residue was also involved in the 3C/3D cleavage.
![]() View larger version (48K): [in a new window] |
FIG. 5. Western blot analyses of the 3CD precursor and its derivatives. Ten micrograms of the proteins from E. coli BL21-CodonPlus-RIL cells harboring the wild-type or mutant pUC3CD plasmid were separated by SDS-PAGE, except that 50 µg of proteins was used for the 3CD- N8 and 3CD- N11 mutants, followed by electroblotting. Relevant proteins were detected by antiprotease antiserum (A) or antipolymerase antiserum (B). Arrows in panel A indicate the 3C, 3C- N5, or 3C- N8 fragments, and those in panel B indicate the 3D RNA polymerase fragments.
|
N5), 8 (3CD-
N8), or 11 (3CD-
N11) amino acids, were constructed and examined (Fig. 1C). As shown in Fig. 5, the 3CD-
N5 truncation mutant and even the 3CD-
N8 mutant lacking Arg8 could significantly generate about 18.5-kDa N-terminal-truncated 3C and the 56-kDa 3D peptides, although both mutants had less proteolytic activity than the wild-type 3CD, because a portion of the mutant 3CD precursors remained uncut (Fig. 5). In the 3CD-
N8 mutant, the amount of the truncated 3C fragment was quite low, consistent with the low level of expression of the 3CD-
N8 precursor in E. coli cells. It was clear that the cleavage of the 3CD-
N5 and 3CD-
N8 mutant proteins was exerted by the proteolytic activity of the respective truncated 3C moiety, since the truncated mutants carrying an active-site mutation, C139A, produced only the 75-kDa species, exhibiting no proteolytic activity (data not shown). The 3CD-
N11 truncation mutant was inactive as to the protease activity, since only the 75-kDa precursor band was observed (Fig. 5). These results indicated that the N-terminal region containing 8 amino acids rather than the Arg8 residue itself was not necessary for the 3C/3D cleavage and that the N-terminal portion itself of the 3C-like protease was not involved in the proteolysis. The effect of the Arg8 substitution on the protein structure might be more detrimental than that of the N-terminal truncation, and/or the Arg8 mutation might alter the conformation of the N terminus, resulting in the fact that the N-terminal region containing the Arg8 mutation hindered access of the cleavage site to the active center of the protease. However, whether the model proposed by Khan et al. (26) is valid and is the explanation for the NLV 3C-like proteases and picornavirus 3C proteases cannot be judged from these results. To judge if this is true or not, we will have to await information on the highly ordered structure of the protease. Conclusion. First, in analogy to chymotrypsin and picornavirus 3C proteases, the catalytic dyad composed of His30 and Cys139 functions as active-site residues in the Chiba virus 3C-like protease. While the catalytic triad works in the proteolysis in chymotrypsin (12) and 3C proteases from poliovirus (34) and rhinovirus (33), it was found that the HAV 3C protease was the first chymotrypsin-like protease in which the catalytic dyad was likely to work (1, 3). Therefore, it is very likely that the dyad composed of a nucleophile (Ser or Cys) and a general base (His) is mandatory for the protease activity. In the HAV 3C protease and the Chiba virus 3C-like protease, the protonated imidazole ring carrying a positive charge might have been strongly stabilized by its environment through an effect such as a stack of aromatic rings. However, we cannot rule out the possibility that an anion such as a tyrosine phenolate rather than a carboxylate might participate in the proteolysis as the third member of the active site.
Second, the substitution of Arg8, which seemed to be located on the N-terminal
-helix, affected both the cleavage at the 3B/3C junction and the cleavage at the 3C/3D junction, whereas the deletion of at least eight residues of the N terminus, including Arg8, did not affect the cleavage at the 3C/3D junction. These results indicated that the Arg8 residue itself was not directly involved in the proteolysis. It was also suggested that the Arg8 substitution, but not the deletion of the N-terminus region containing Arg8, altered a conformation of the protease, leading to inactivation of the protease as to both the cleavage at the 3B/3C junction and the cleavage at the 3C/3D junction.
This study was the first evaluation of the active site of the NLVs at the molecular level. The information obtained in this study is expected to provide the basis for drug design in diarrheal therapy for humans. Determination of the high-order structures of the protease and its related precursors will not only elucidate the accuracy of results described here, but also greatly contribute to and accelerate computer-based drug design.
This work was supported by a grant for Research on Emerging and Re-emerging Infectious Diseases from the Ministry of Health, Labour and Welfare and a grant-in-aid from the Ministry of Education, Culture, Sports, Science and Technology, Japan.
|
|
|---|
-lytic protease complexed with its pro region. Nat. Struct. Biol. 5:945-950.[CrossRef][Medline]
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»