Previous Article | Next Article ![]()
Journal of Virology, April 2006, p. 4157-4167, Vol. 80, No. 8
0022-538X/06/$08.00+0 doi:10.1128/JVI.80.8.4157-4167.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Virology Division, Department of Infectious Diseases and Immunology, Faculty of Veterinary Medicine, Utrecht University, Utrecht, The Netherlands,1 Molecular Virology Laboratory, Department of Medical Microbiology, Center of Infectious Diseases, Leiden University Medical Center, Leiden, The Netherlands2
Received 12 October 2005/ Accepted 2 February 2006
|
|
|---|
(S, A) as the substrate consensus sequence. EToV
Mpro combines several traits of its closest relatives. It
has a predicted three-domain structure, with two catalytic
ß-barrel domains and an additional C-terminal domain of unknown
function. With respect to substrate specificity, the EToV
Mpro resembles its coronavirus homologue in its preference
for P1-Gln, but its substrate-binding subsite, S1, more closely
resembles that of arteri- and ronivirus Mpros, which prefer
P1-Glu. Surprisingly, in contrast to the Mpros of corona-
and roniviruses, but like that of arterivirus, the torovirus
Mpro uses serine instead of cysteine as its principal
nucleophile. Under the premise that the Mpros of corona- and
toroviruses are more closely related to each other than to those of
arteri- and roniviruses, the transition from serine- to cysteine-based
proteolytic catalysis (or vice versa) must have happened more than once
in the course of nidovirus evolution. In this respect, it is of
interest that a mutant EToV Mpro with a
Ser165
Cys substitution retained partial enzymatic
activity. |
|
|---|
Many RNA virus proteinases
resemble, in their mechanism of peptide hydrolysis and in their
three-dimensional structure, either one of the prototypical cellular
proteases papain and chymotrypsin. Papain and related proteinases are
characterized by a two-domain
+ß-fold with
typically a cysteine and downstream histidine as the catalytic
residues. Chymotrypsin has a distinctive two-ß-barrel structure
in which the principal nucleophile, a serine residue, is part of a
catalytic His-Asp-Ser triad
(20,
49). Chymotrypsin-like
serine proteinase domains have been identified within the polyproteins
of among others flavi-, alpha-, luteo-, astro-, and sobemoviruses
(13,
17,
29,
38,
42,
53,
66).Intriguingly, however, many viruses, for instance picorna-, como-,
poty-, and caliciviruses, encode proteinases with a (predicted)
chymotrypsin-like fold in which, instead of serine, a cysteine acts as
the main catalytic residue
(1,
5,
6,
27,
28,
31,
52,
55).
The order Nidovirales comprises a group of evolutionarily related enveloped (positive) stranded RNA viruses (corona-, toro-, arteri-, and roniviruses) (10, 15, 19, 26, 56, 58, 62), which have taken the polyprotein strategy to the extreme. The 5'-most two-thirds of the nidovirus genome is occupied by two large overlapping open reading frames (ORFs), ORF1a and ORF1b, the most downstream of which is only translated upon ribosomal frameshifting (9, 15, 18, 58). These ORFs specify two huge polyproteins, pp1a and the C-terminally extended frameshift protein pp1ab. In the case of the coronaviruses, pp1ab measures up to 7,200 amino acids in length, representing the largest polyprotein known to date. Among nidoviruses, the largest degree of sequence conservation is found within the ORF1b-encoded part of pp1ab, which contains the chief replicative domains (15, 19, 30, 57), i.e., the RNA dependent RNA polymerase (RdRp) and the helicase, as well as up to three additional domains, ExoN, NendoU, and 2'-O-methyltransferase (2'-O-MT), which display sequence identity to cellular RNA-modifying enzymes (8, 37, 57). Of the latter, the ExoN and 2'-O-MT domains are lacking in the arterivirus pp1ab.
Processing of the arteriviral and coronaviral polyproteins gives rise to 12 to 13 and 15 to 16 mature products, respectively, and an as yet unknown number of functional intermediates (57, 76). The N termini of the replicase polyproteins are processed by "accessory" papain-like proteinases (31). The vast majority of cleavages, however, are carried out by a single chymotrypsin-like main proteinase (Mpro), which is located in the C-terminal half of pp1a, flanked by hydrophobic regions that are thought to be involved in targeting the replicase to intracellular membranes (30, 54, 67, 71). Recent X-ray crystallographic analysis of the structures of corona- and arterivirus Mpros confirmed the predicted chymotrypsin-like two-ß-barrel fold (2, 4, 73). Interestingly, the corona- and arteriviral enzymes both possess an additional C-terminal domain, a feature not commonly seen in other viral and cellular chymotrypsin-like proteinases (2, 4). Also for the ronivirus Mpro, such an additional C-terminal domain was predicted (74).
One might surmise that the nidoviral Mpros, given their pivotal role in viral replication, are well conserved. Conversely, however, the primary sequences of the Mpros of corona-, roni-, and arteriviruses have diverged almost beyond recognition, and sequence identity is restricted to the regions immediately surrounding the main catalytic residues. Moreover, whereas the arteriviral Mpro is a serine proteinase with a canonical His-Asp-Ser catalytic triad (4, 64), the Mpros of the corona- and roniviruses are cysteine proteinases (28, 30, 43, 74). In the latter, the acidic catalytic residues have been lost and, apparently, these enzymes solely rely on the remaining catalytic His-Cys dyad (2, 73, 74).
Toroviruses have been poorly studied, but because of their relatively close phylogenetic relationship to coronaviruses they may in many respects represent "missing links" in comparative nidovirus studies. Here, we are the first to report the biochemical characterization of a torovirus Mpro. Upon analysis of the C-terminal half of pp1a of equine torovirus (EToV) strain Berne, we identified a chymotrypsin-like proteinase domain, which, like the Mpros of the arteri-, corona-, and roniviruses, is flanked by hydrophobic regions. When expressed in prokaryotic and mammalian cells, the proteinase domain autocatalytically released itself from its surrounding sequences. Surprisingly, the toroviral Mpro seems no more related to that of coronaviruses than to the Mpros of arteri- and roniviruses. Most saliently, in contrast to the Mpros of corona- and roniviruses, but like that of arteriviruses, the torovirus Mpro is a serine proteinase. A tentative processing scheme for the torovirus pp1a/pp1ab polyproteins is presented, and the implications of our findings for nidovirus evolution are discussed.
|
|
|---|
Isolation of intracellular EToV RNA, cDNA synthesis, and PCR amplification. Total cytoplasmic RNA, isolated from the equivalent of 4 x 106 EToV-infected Ederm cells (68), was subjected to reverse transcription-PCR (RT-PCR) with RNase H-free Moloney murine leukemia virus reverse transcriptase (Superscript II; Invitrogen) and with the Expand long template PCR system (Roche Diagnostics GmbH), according to the instructions of the manufacturers. Oligonucleotide primers 1071 and 1072 (Table 1) were designed after sequences located at the 5' end and 3' end of EToV ORF1A, respectively (59, 61). Primers 1288 and 1325 were designed after sequences from cDNA clones p133 and p135. These clones had been identified previously as EToV specific after screening of a genomic cDNA library (61) and had been tentatively mapped in ORF1a (E. J. Snijder, unpublished). Amplicons of approximately 5.5 kb were obtained, which were gel purified, blunt ended using large fragment DNA polymerase I (LFDP) according to the instructions of the manufacturer (Invitrogen), and cloned into an EcoRV-digested vector pWSK29 (70). Sequence analysis was performed commercially (BaseClear Labservices) using the ABI PRISM BigDye Terminators v3.0 cycle sequencing kit (Applied Biosystems) on at least two independent clones in both orientations.
|
View this table: [in a new window] |
TABLE 1. Oligonucleotides
used in this study for amplification or mutagenesis of EToV sequences
|
Antisera specific for the N terminus of EToV Mpro (K270), the C terminus of the Mpro (K272), or the N terminus of pp1a/pp1ab (K274) were prepared by repeated subcutaneous immunization of New Zealand White rabbits with 200 µg of the bovine serum albumin-coupled synthetic peptides NH2-SVFSKATSPFTLHARPPK-COOH, NH2-KPLQYFHVPSFWQPFKKQ-COOH, and NH2-MFRILKNNTRETEQHLSSSK-COOH, respectively, as described previously (63).
Construction of prokaryotic and eukaryotic EToV Mpro expression vectors. For prokaryotic expression of EToV Mpro, residues 10376 to 11472 of the EToV genome were RT-PCR amplified with oligonucleotide primers 1606 and 1607 (Table 1). A product of the anticipated size of 1,112 bp was obtained, which was gel purified, inserted into plasmid pQE9, and cloned in E. coli M15/pREP4 cells (QIAGEN) as described above. The resulting expression vector, pQE-SS-Mpro, codes for a 42-kDa polypeptide, encompassing the EToV Mpro domain and flanking sequences.
The expression construct pQE9-SS-Mpro-GST is a pQE-SS Mpro derivative, which consists of nucleotides (nt) 10376 to 11473 of EToV ORF1a and nt 258 to 935 of the glutathione S-transferase (GST) gene from plasmid pGEX2T (Amersham Pharmacia Biotech). This expression cassette was produced via PCR with oligonucleotides 2124 and 2125 and conventional cloning procedures. It codes for a 60-kDa fusion protein consisting of the 366-amino-acid-residue Mpro domain with flanking sequences linked by a proline residue to amino acid residues 1 to 226 of the GST protein.
For mammalian expression, we employed the recombinant vaccinia virus-based vTF7-3 system. To generate a suitable expression vector, the 1606/1607 PCR amplicon, digested with BamHI and EcoRV, was cloned into SacI/BamHI-digested pBluescript K/S+ (Strategene) together with a double-stranded linker, consisting of oligonucleotides 1721 and 1722. The resulting plasmid, pBS-SS-Mpro, contains residues 10376 to 11472 of the EToV genome, placed under the control of the bacteriophage T7 promoter and provided in frame with an initiation codon in an optimal context for translation initiation (40).
Site-directed mutagenesis of (putative) active-site residues was performed by splicing overlap extension-PCR mutagenesis (35) and conventional cloning techniques; mutations introduced in Mpro and the primers used are listed in Table 1.
The
dual-expression construct pBS-SS-Mpro-dual, is a
pBS-SS-Mpro derivative in which, downstream of the
Mpro sequences, a second expression cassette is inserted,
which is placed under the translational control of the
encephalomyocarditis virus (EMCV) internal ribosomal entry site (IRES;
taken from plasmid pTN2)
(69) and which consists
of nt 821 to 1128 of EToV ORF1a fused in frame to nt 10.378 to 11.472
of the EToV genome (encoding Mpro and surrounding sequences)
and nt 3 to 483 of the EToV N gene. This chimeric expression cassette
was produced via a series of splicing overlap extension-PCRs with
oligonucleotides 1638, 1640, 1902, and 1914 through 1918 (Table
1) and conventional
cloning procedures. It codes for a 66-kDa fusion protein, consisting as
listed from the N terminus to C terminus of (i) residues 1 to 103 of
pp1a; (ii) the 65 aa residues immediately upstream of Mpro;
(iii) the 292-residue Mpro domain; (iv) the 8 residues
downstream of Mpro, including the cleavage site; and (v)
residues 2 to 160 of the EToV N protein. In this fusion protein, the
N-terminal Mpro cleavage site is abolished (QS
AG)
and the Mpro proteinase is inactivated by an active site
Ser-to-Ala substitution.
Purification and N-terminal sequence analysis of Mpro cleavage products. EToV Mpro and Mpro-GST were expressed from plasmids pQE-SS-Mpro and pQE9-SS-Mpro-GST, respectively, in E. coli M15/pREP4 as described above. Of the 1-ml bacterial culture, the cells were collected by centrifugation at 14,000 rpm for 30 s, resuspended in 100 µl Laemmli sample buffer, and incubated for 5 min at 95°C. Proteins were separated in 15% SDS-polyacrylamide gel electrophoresis (PAGE) gels (20 µl cleared supernatant/lane), transferred to polyvinylidene difluoride membranes (Bio-Rad Laboratories), and subsequently stained with Coomassie brilliant blue R-250. The regions containing the cleavage products of interest were excised. N-terminal sequence analysis was performed by subjecting the proteins to 5 to 8 rounds of Edman degradation using standard procedures with a pulse-liquid protein sequencer (model 476A; Applied Biosystems).
Infection/transfection of mammalian cells and metabolic labeling.
Monolayers of Ederm cells (
1
x 106 cells), grown in 35-mm wells, were washed once
with PBS containing 50 µg/ml DEAE-dextran and then infected in
PBS-DEAE with EToV at a multiplicity of infection of 5 PFU/cell. At
1 h postinfection (p.i.), the inoculum was replaced by 1 ml
DMEM10 and incubation at 37°C continued.
Subconfluent
monolayers of OST7-1 cells (
2 x 106 cells),
grown in 35-mm wells, were infected with recombinant vaccinia virus
vTF7-3 and, at 1 h p.i., transfected with 2 µg
plasmid DNA as described previously
(46).
Prior to metabolic labeling, the cells were depleted for methionine and cysteine for 30 min by replacing the tissue culture supernatant with 600 µl of prewarmed starvation medium (DMEM without L-cysteine and L-methionine [Invitrogen] supplemented with 5% FCS, 10 mM hydroxyethylpiperazine-N'-2-ethanesulfonic acid [HEPES; pH 7.4], 1 mM Glutamax). Then 100 µCi of Tran35S-label (ICN) was added and the cells were labeled for the indicated length of time at 37°C.
After labeling, the
cells were washed once with ice-cold PBS and then lysed on ice in 600
µl ice-cold lysis buffer (20 mM Tris-HCl, pH 7.5, 1 mM EDTA,
100 mM NaCl, 1% Triton X-100, and 1 µg/ml aprotinin, leupeptin,
and pepstatin A each). Nuclei and cell debris were pelleted by
centrifugation for 5 min at 14,000 rpm and 4°C, and the
supernatant was subjected to radioimmunoprecipitation assay (RIPA). To
this end, the lysates were adjusted to 2% SDS, diluted 10 times with
detergent solution (50 mM Tris-HCl, pH 8.0, 62.5 mM EDTA, 0.4% Na
deoxycholate, 1% NP-40, and 1 µg/ml of aprotinin, leupeptin,
and pepstatin A each). RIPA was performed in a final volume of 1 ml
with 3 µl of antiserum K158, K270, K272, K274 (see below), or
R-
EToV N (14).
The immune complexes were adsorbed to Pansorbin cells (Calbiochem) for
30 min at 4°C, collected by centrifugation for 1 min at 14,000
rpm, and washed three times with detergent solution containing 0.25%
SDS. The pellets were then resuspended in 25 µl Laemmli sample
buffer and heated for 5 min at 95°C. Proteins were separated in
15% SDS-PAGE gels, which were fixed for 30 min in 50% methanol-10%
acetic acid and dried for 60 min at 80°C. Labeled proteins were
visualized by fluorography using Kodak intensifying
screens.
|
|
|---|
The EToV serine proteinase domain, comprising residues 3278 to 3430, displayed sequence similarity with the SpIB serine protease of Staphylococcus aureus and with the pp1a serine proteinase of turkey astrovirus. Similarity was particularly evident in the areas surrounding the predicted active-site residues, with His75, Asp113, and Ser193 of S. aureus SplB (50); GenBank accession no. AF271715) aligning with His3304, Glu3347, and Ser3416 in the EToV sequence (Fig. 1A). The putative EToV proteinase domain resides within pp1a/pp1ab at a relative position similar to that of the Mpros of other nidoviruses and, likewise, is flanked by hydrophobic regions (HR-1 and -2; Fig. 2A). Based upon these observations and upon the fact that no other chymotrypsin-like proteinase motifs were evident in the EToV pp1a/pp1ab sequence, we postulated the identified domain to represent the toroviral Mpro.
![]() View larger version (52K): [in a new window] |
FIG. 1. Identification
and sequence comparison of the toroviral Mpro. (A)
Alignment of EToV strain Berne Mpro with SplB of
Staphylococcus aureus. Standard and PHI and PSI BLAST searches
with pp1a residues 2385 to 4570 as a query revealed sequence identity
across an extensive region (residues 3278 to 3430) with serine
proteinase SplB of Staphylococcus aureus. Active-site residues
predicted for SplB and corresponding residues in the EToV sequence are
boxed and indicated by #; predicted key residues of the S1 pocket are
shown in bold and indicated by @. Residues of EToV
Mpro are presented as numbered from the initiating
Met1 of pp1a. (B) Multiple sequence alignment of
corona-, arteri-, roni-, and torovirus Mpro domains. Clustal
X-based alignments of coronavirus, arterivirus, and gill-associated
okavirus (GAV) Mpros, generated previously
(3,
4,
74), were used to align
the EToV and BToV sequences. Abbreviations of virus names and GenBank
accession numbers for the sequences are as follows: TGEV, porcine
transmissible gastroenteritis virus (group 1;
CAB91144); MHV, mouse hepatitis virus strain A59
(group 2; NC_001846); SARS, SARS
coronavirus (group II;
NC_004718); IBV, avian infectious bronchitis virus (group 3;
AY692454); EToV, equine torovirus strain Berne;
BToV, bovine torovirus strain Breda 1
(AY427798); GAV, gill-associated okavirus
(AF227196); EAV, equine arteritis virus
(NC_002532); LDV; lactate
dehydrogenase-elevating virus neurovirulent type C
(NC_02534); and PRRSV, porcine
reproductive and respiratory syndrome virus strain Lelystad
(AY588319). Residues shared by a torovirus
Mpro and any of the other sequences are indicated by dark
shading; residues conserved between Mpros of different
nidoviruses are indicated by light shading. Residue numbers for the
mature Mpros of arteri-, corona-, and roniviruses are given;
residues of EToV Mpro are numbered from the initiating
Met1 of pp1a. Residues of the catalytic center and the S1
specificity pocket are boxed. Squares indicate the catalytic His and
Ser/Cys residues, as well as the catalytic Asp residues of arterivirus
Mpros; the latter are aligned with SARS coronavirus and TGEV
residues, which in the tertiary structure of the coronavirus
Mpro occupy an equivalent main chain position. Triangles
indicate key residues involved in P1 substrate
binding.
|
![]() View larger version (39K): [in a new window] |
FIG. 2. Autocatalytic
processing of torovirus Mpro in E. coli.
(A) Schematic representation of the EToV polymerase
polyprotein pp1ab with the ORF1a- and ORF1b-encoded polypeptides shown
as boxes (top). Also shown is the domain organization of the C-terminal
half of pp1a with hydrophobic regions depicted as gray boxes. The
Mpro domain and flanking hydrophilic sequences (indicated as
a white box) were cloned in pQE9 and placed under the control of an
IPTG-inducible bacteriophage T5 promoter (indicated by the black box
and arrow). Furthermore, the structure of a construct encoding
Mpro fused to GST is shown. White arrowheads indicate
(predicted) Mpro cleavage sites (bottom). (B)
Prokaryotic expression of the EToV Mpro domain. (Left panel)
E. coli M15/pREP4 cells, transformed with
pQE-SS-Mpro (lanes 1 and 2) or
pQE-SS-Mpro/Gly3304 (lanes 3 and 4), were grown
in the presence (+) or absence () of 1 mM IPTG for
5 h. Total cell lysates were analyzed in 15%
SDS-polyacrylamide gels. Proteins were stained with Coomassie brilliant
blue. Positions of the 42-kDa precursor (P42) and the 33-kDa
Mpro and 16-kDa internal cleavage products (IC) are
indicated. (Right panel) Expression of an EToV Mpro-GST
chimera in E. coli. E. coli M15/pREP4 cells,
transformed with pQE-SS-Mpro (lanes 1 and 2) or
pQE9-SS-Mpro-GST (lanes 3 and 4), were grown in the presence
(+) or absence () of 1 mM IPTG for 5 h.
Total cell lysates were analyzed in 15% SDS-polyacrylamide gels.
Proteins were stained with Coomassie brilliant blue. Positions of the
33-kDa Mpro and 16-kDa internal cleavage products are
indicated as well as the 26-kDa GST product. wt, wild
type.
|
Proteolytic activity of EToV Mpro. The bioinformatical analyses of the putative EToV Mpro provided a theoretical framework for subsequent experimental studies. The hydrophilic region between HR-1 and -2, which spans the predicted proteinase domain, was expressed in E. coli (Fig. 2). A construct in which the predicted active-site His residue was replaced by Gly was used as a control (Fig. 2B). IPTG-induced expression of these constructs should yield a 42-kDa primary translation product. Indeed, a protein of this size was detected in cells expressing the His-to-Ala mutant. However, in cells expressing the intact proteinase domain, two products with apparent molecular masses of 33 and 16 kDa were found instead. These results suggested that the EToV pp1a region comprising residues 3186 to 3550 indeed contains a functional proteinase domain mediating autocatalytic processing.
The 33- and 16-kDa products were purified from E.
coli and N-terminally sequenced by performing 5 or 8 cycles of
Edman degradation, respectively. Analysis of the 33-kDa product
indicated that cleavage had occurred at the sequence
3246SNFSFQ
SVFSKAT3258,
thus identifying Ser3252 as the N-terminal residue of EToV
Mpro (Fig.
3) (Note that, throughout this article, amino acid residues flanking the
scissile bond, indicated by
, are given from N to C terminus
in the single-letter code, where "x" indicates any
residue.) The 16-kDa product proved to be a mixture of two protein
species. For one of these, the N terminus was identical to that of the
33-kDa product. The other product apparently arose from proteolytic
cleavage at an internal sequence,
3392SEFATQ
AWQTVN3403(data not shown). Indeed, cleavage of Mpro at this site
would cut the proteinase effectively in half, thus giving rise to two
equal-size (
16.5 kDa) products. These results tentatively
identified FxxQ
(S, A) as the EToV Mpro substrate
consensus sequence (residues are given in parentheses to indicate the
variation occurring at a particular position within the cleavage
site).
![]() View larger version (26K): [in a new window] |
FIG. 3. Characterization
of the N-terminal EToV Mpro autoprocessing site by protein
sequencing. The 33-kDa cleavage product, purified from bacterial
lysates (Fig. 2B, lane 2),
was subjected to Edman degradation. Phenylthiohydantoin (PTH)-amino
acids generated during each reaction cycle were detected by their
absorbance at 269 nm (expressed as milliabsorption units [mAU];
y axis) and identified by their characteristic retention times
(in minutes; x axis) on a reversed-phase high-pressure liquid
chromatography support. (A) Chromatogram of PTH-amino acid
standards. (B to F) Chromatograms of PTH-amino acids from reaction
cycles 1 to 5. Specific peaks are indicated by the single-letter
code.
|
S3544. In the
full-length precursor product encoded by plasmid
pQE-SS-Mpro, this sequence is located only 5 residues
upstream of the C terminus. Hence, cleavage at this site could not be
confirmed. As the hydrophobic region HR-2, C-terminally flanking
Mpro, might interfere with efficient expression in E.
coli, we did not extend the expression product with EToV
sequences, but instead designed an Mpro-fusion protein,
which contained the predicted cleavage site including the downstream
seven residues, linked via a newly introduced proline residue to
residues 1 to 226 of GST (Fig.
2A). Expression in
E. coli yielded, as anticipated, two major products
of 33 kDa and 26 kDa (Fig.
2B), apparently
corresponding to mature Mpro and the cleaved-off GST moiety.
N-terminal sequence analysis of the 26-kDa product confirmed this
notion and indicated that proteolytic cleavage indeed had occurred at
the predicted site 3543Gln
Ser3544 (data
not shown). Cleavage at the sites upstream and downstream of the Mpro domain should yield a 33-kDa protein. Accordingly, in lysates from metabolically labeled EToV-infected cells, a product of this size, comigrating exactly with the bacterial cleavage product, was detected by RIPA with Mpro-specific antisera (Fig. 4) (data not shown). The 16-kDa products, however, were not detected in EToV-infected cells.
![]() View larger version (57K): [in a new window] |
FIG. 4. Detection
of Mpro in EToV-infected cells. Ederm cells, infected
(+) or mock-infected () with EToV, were metabolically
labeled with [35S]Met plus Cys from 6 to 9 h p.i.
Cell lysates were supplemented with SDS to a final concentration of 2%
in order to dissociate protein-protein complexes. The lysates were then
diluted 10-fold with detergent solution and subjected to RIPA with
polyclonal antiserum K274, directed against the C-terminal 17 residues
of Mpro (post) or with preserum (pre). Similar
results were obtained with polyclonal antisera K158 and K272, directed
against a peptide comprising residues 5 through 196 or the N-terminal
17 residues of EToV Mpro, respectively (not shown). The
samples were analyzed in 15% SDS-polyacrylamide gels. Molecular size
markers are given in kilodaltons. The 33-kDa
Mpro species is indicated. High-molecular-mass
productspresumably Mpro precursorsare
evident at the top of the
gel.
|
Cys substitution retained partial activity
(Fig. 5B).
![]() View larger version (39K): [in a new window] |
FIG. 5. Mutagenesis
of predicted active-site residues of EToV Mpro.
(A) Mammalian expression constructs for the EToV
Mpro domain. The upper panel shows the organization of the
C-terminal half of pp1a with hydrophobic regions depicted as gray boxes
and the Mpro domain as an oval. The Mpro domain
and flanking hydrophilic sequences (represented by a white box) were
cloned in pBluescript and placed under the control of the bacteriophage
T7 promoter (depicted by the black box and arrow). The relative
locations of the predicted active-site His3304 and
Ser3416 residues are indicated. White arrowheads indicate
Mpro cleavage sites. Also shown schematically are the
structure of the P42 precursor and that of the mature 33-kDa
Mpro species. (B) vTF7-3 expression of EToV
Mpro. OST7-1 cells were infected with recombinant vaccinia
virus vTF7-3 and transfected with pBluescript-based expression vectors
encoding either the wild-type EToV Mpro (wt) or mutants in
which the predicted active-site His3304 and
Ser3416 residues had been replaced by Arg or Gly or Cys or
Ala, respectively. Mock-transfected cells were taken along as a
negative control (m). Cells were metabolically labeled from 5 to
6 h p.i. Cell lysates were subjected to RIPA with a
polyclonal antibody directed against Mpro (K158).
Precipitates were analyzed in 15% SDS-polyacrylamide gels, and
labeled proteins were visualized by fluorography. The positions of the
42-kDa precursor (P42) and of the 33-kDa mature Mpro species
are indicated. Molecular masses of marker proteins are given in
kilodaltons. (C) Mutagenesis of putative active-site acidic
residues. The effect of conservative and nonconservative substitutions
of Asp3338, Glu3347, Glu3354, and
Asp3358 was assessed in the vTF7-3 system as described
above. Amino acid residues are indicated in the single-letter
code.
|
![]() View larger version (53K): [in a new window] |
FIG. 6. Effect
of substitutions of possible active-site acidic residues as determined
in a trans-cleavage assay. (A) Schematic outline of
the trans-cleavage assay. The upper panel shows the structure
of the expression constructs with the ORFs encoding the Mpro
enzyme and substrate depicted as boxes. The expression of the ORFs was
under the control of the T7 promoter (black box and arrow). Translation
of the downstream ORF for the substrate was driven by the EMCV IRES
(indicated by a black box). For the Mpro enzyme, the
relative locations of the active-site His and Ser residues are
indicated. Cleavage sites in the Mpro precursor and in the
substrate are indicated by white arrowheads. Substrate sequences
derived from ORF1a/pp1a and from EToV N are indicated by hatching or
dots, respectively. In the substrate, the N-terminal cleavage site was
abolished (QS AG) and the enzyme was inactivated by a
Ser3416 Ala substitution. Shown below are the
structures of the various precursor and cleavage products.
(B) Mutagenesis of possible active-site acidic residues. The
effect of conservative and nonconservative substitutions of
Asp3338, Glu3347, Glu3354, and
Asp3358 was assessed in the vTF7-3 system essentially as
described in Fig. 5.
Substitutions are indicated in the single-letter code. Metabolic
labeling was from 3.5 to 4.5 h p.i., and RIPA was performed
with antisera K270 ( -pp1a), K158 ( -Mpro),
and rabbit anti-EToV N ( -EToV N). The positions of the various
precursors and mature products are indicated on the right. Molecular
masses of marker proteins are given in kilodaltons on the left. Amino
acid residues are indicated in the single-letter code. m,
mock-transfected negative control; wt, wild
type.
|
AG
substitution, while the Mpro was inactivated by changing the
active site Ser3416 to Ala. However, the cleavage site C
terminal of Mpro was left intact and hence remained
available for proteolytic cleavage in trans. This should give
rise to products of 52 kDa (the pp1a-Mpro fusion protein)
and of 19 kDa (the cleaved-off N moiety), which are readily
distinguishable from the enzymatically active Mpro and its
precursors. Processing was monitored by RIPA with antisera against the
N terminus of pp1a, Mpro, and the N protein and the effect
of substitutions of the selected acidic residues was assessed (Fig.
6B). A construct in which
the Mpro enzyme was inactivated by a
Ser3416
Ala substitution was taken along as a
negative control. Mpro derivatives, in which Asp3338, Glu3354, and Asp3358 had been altered, behaved like the wild-type enzyme in the trans-cleavage assay. Substitution of Glu3347, however, reduced proteolytic activity as indicated by accumulation of the 71-kDa noncleaved substrate precursor and of the Mpro precursors of 42 and 34 kDa; the latter protein species apparently represented an intermediate generated by processing of the 42-kDa Mpro precursor at the N-terminal but not at the C-terminal cleavage site (Fig. 6B).
|
|
|---|
Within the order Nidovirales, the toroviruses seem to be most closely related to the coronaviruses (10, 25, 58, 60). We therefore anticipated at the onset of this study that the torovirus Mpro might bridge the large evolutionary gap between the main proteinase of the coronaviruses and those of the other nidoviruses (76). Surprisingly, however, from its primary sequence, the Mpro of EToV Berne seems no more related to the coronaviral proteinase than to those of the arteri- and roniviruses (Fig. 1B). The characteristics of the torovirus Mpro in relation to those of the other nidoviruses are discussed in the paragraphs below.
Mpro-mediated processing and cleavage site specificity. Nidoviral Mpros and related proteinases in other RNA viruses are released from adjoining polyprotein domains by autocatalytic processing (21, 76). Likewise, when pp1a segments, containing the predicted EToV Mpro domain, were heterologously expressed in prokaryotic and in mammalian cells, the proteinase liberated itself by cleaving both at an N-terminal site and at a C-terminal site. This yielded a mature Mpro of 33 kDa, a protein of which size was also detected in EToV-infected cells by RIPA with Mpro-specific antisera. Interestingly, in EToV-infected cells, autocatalytic release of Mpro might be inefficient as during a 3-h continuous metabolic labeling period a considerable proportion of the proteinase seemed to remain part of a large precursor (Fig. 4). Similar observations were reported for the arteri- and coronavirus Mpros (32, 54, 63, 71). It has been suggested, that the long-lived Mpro intermediates are membrane associated via the flanking hydrophobic domains. Anchoring of proteinase precursors to intracellular membranes might well be a central phenomenon in nidovirus polyprotein processing and/or replication (39, 54, 71).
N-terminal
sequence analysis of mature EToV Mpro showed that
autocatalytic cleavage occurred at the location
3246SNFSFQ
SVFS3255.
The sequence
3538QPFKKQ
SVSN3547was identified as the C-terminal cleavage site. Unexpectedly, in
prokaryotic cells, processing occurred also at a site internal to
Mpro,
3392SEFATQ
AWQT3401,
effectively cleaving the proteinase in half. It is tempting to
speculate that, during EToV infection, Mpro might inactivate
itself as part of a feedback autoregulatory mechanism or, otherwise,
that processing at the internal site gives rise to an alternative
cleavage pathway (71).
However, we have no evidence that internal processing actually occurs
in mammalian cells. The 16-kDa cleavage products were detected neither
in the vTF7-3 expression system nor in EToV-infected cells. Hence, it
cannot be excluded that internal cleavage, as observed in E.
coli, is an artifact resulting from overexpression and possibly
aided by protein misfolding, which might have exposed cleavage sites
that are normally inaccessible. Intriguingly, however, the internal
cleavage site is also conserved in BToV Mpro
(22).
Catalytic center and substrate specificity. Although sequence identity between EToV Mpro and its nidovirus homologues or other viral and cellular chymotrypsin-like enzymes is limited, sequence alignments clearly implicated Ser3416 and His3304 as the main catalytic residues. Consistent with this notion, their replacement by Ala and by Gly or Arg, respectively, abrogated enzymatic activity. The Mpros of corona- and roniviruses (2, 73, 74), like the hepatitis A virus 3C proteinase (1, 7), employ a catalytic Cys-His dyad within the context of the chymotrypsin-fold. Under the assumption that the EToV Mpro is a classical serine proteinase, a third acidic catalytic residue was expected (20). Although sequence alignments of EToV Mpro with other proteinases were far from robust, particularly in the region that should contain this third residue, our analyses suggested Glu3347 as the most likely candidate. However, neither substitution mutagenesis of Glu3347 nor that of the proximal Asp3338, Glu3354, and Asp3358 noticeably affected autocatalytic release of Mpro in heterologous expression studies. Yet, in an assay that tested for processing in trans, the efficacy of Mpro cleavage of an artificial substrate was clearly reduced by substitution of Glu3347, but not by that of the other acidic residues. Although our findings do not prove that Glu3347 is included in a catalytic triad with Ser3416 and His3304, they would at least be consistent with this notion. This view might seem at odds with observations made for various other viral and cellular chymotrypsin-like proteinases, where site-directed mutagenesis of the third residue resulted in a far more dramatic loss of proteolytic activity (11, 16, 33, 41, 64, 65). Still, in the case of severe acute respiratory syndrome (SARS) coronavirus Mpro, replacement of the active-site Cys by Ser yields a functional enzyme. Although this mutant proteinase relies exclusively on a Ser-His catalytic dyad, its activity is merely 40-fold reduced as compared to that of wild-type SARS coronavirus Mpro (36). A reduction in activity of this magnitude might have only a modest effect on processing even in our trans-cleavage assay under conditions of vTF7-3-driven EToV Mpro overexpression. As the mapping of active-site residues by the combination of comparative sequence analysis and site-directed mutagenesis is inherently fraught with a measure of uncertainty, definitive answers with respect to catalytic center composition and the presence/identity of a third catalytic residue will require the elucidation of the torovirus Mpro crystal structure.
Comparison of the sequences flanking the
scissile bonds of the internal and the N- and C-terminal cleavage sites
tentatively identifies FxxQ
(S, A) as the substrate consensus
sequence of EToV Mpro. Its cleavage site specificity thus
resembles that of other nidoviral Mpros, which, as is
typical for 3CLPs, all require Gln or Glu at the P1 position
(74,
76). The specificity for
P1-(Gln, Glu) of the picorna-, arteri-, and coronavirus enzymes is
determined by a strictly conserved His, which via its imidazole side
chain hydrogen bonds to the carbonyl/carboxylate oxygens of the P1 side
chain (1,
2,
4,
7,
44,
45,
47,
73,
74,
76). A corresponding His
residue is also present in the torovirus Mpro (EToV
His3430) as part of a Gly-X-His sequence (Fig.
1B), a motif well
conserved in most 3CLPs (including arteri- and ronivirus
Mpro), but saliently absent in the Mpros of
coronaviruses; the latter have a Tyr-Met-His motif instead
(2,
4,
34,
64,
74,
75). In many 3CLPs,
conserved Ser/Thr residues, located five residues upstream of the
catalytic Ser/Cys, are also part of the substrate-binding subsite S1
and assist in binding by forming additional hydrogen bonds with the
carbonyl/carboxylate oxygen atoms of the P1 side chain
(4,
44,
47,
48,
74). We propose that, in
the torovirus Mpro, Thr3411 plays a similar role.
This would again be a difference from the coronavirus Mpro,
which lacks a corresponding Ser/Thr residue
(2,
74).
The arteri- and ronivirus Mpros prefer glutamic acid at the P1 position of their substrates, whereas coronavirus Mpro requires glutamine (64, 74, 76). In arterivirus Mpros, this substrate preference is presumably conferred by a Ser residue (located three residues downstream of the S1 subsite His; S137 in EAV Mpro; see Fig. 1B), which would hydrogen bond with the carboxylate oxygen atom (that is, the one not interacting with the subsite S1 His/Thr pair) of the P1-Glu side chain (4). This Ser is not conserved in the corona- and torovirus Mpros; instead, at the corresponding positions, Glu and Leu residues, respectively, are found (Fig. 1b). The S1 specificity site of coronavirus Mpro consists of the main chain atoms of Met165, Glu166, and His172 and the side chains of Phe140 and His163, with the imidazole of the latter residue interacting with the P1-Gln side chain (2, 3, 73) (residue numbering according to SARS-CoV Mpro). Also with respect to S1 subsite composition, the torovirus main proteinase differs from that of coronaviruses. Although EToV Mpro, like its coronavirus homologues, apparently prefers P1-Gln, none of the coronavirus S1 subsite residues are conserved, except for His163. As yet, it is not clear from the EToV Mpro sequence how the P1-Gln is accommodated.
Tentative Mpro processing scheme of torovirus pp1a/pp1ab.
Although each of the three established
EToV Mpro cleavage sites conforms to the consensus
FxxQ
(S, A), we believe, on the basis of alignments of
pp1a/pp1ab of coronaviruses and the toroviruses EToV and BToV, that
Tyr/Met/Leu and Gly/Lys may also be tolerated at the P4 and P1'
positions, respectively. Although the picture is still incomplete, the
substrate specificity of the torovirus Mpro is reminiscent
of and yet distinct from those of the corona-, arteri-, and
roniviruses, which mainly conform to LQ
(S, A), E
(G,
S, A), and VxHE
(L, V), respectively
(74,
76). In particular, a
preference for a bulky hydrophobic residue at P4 and no apparent
predilection for certain residues at P2 would give the EToV
Mpro a unique cleavage site formula. A tentative
Mpro processing scheme of EToV pp1a/1b is presented in Fig.
7, with the sequences and locations of potential cleavage sites listed in
Table
2. Several points are of note. (i) All cleavage sites predicted for EToV
are conserved in BToV
(22). (ii) A putative
cleavage site, 2870FKKQ
SV2875 (Table
2, site 1), identical in
sequence to the one C terminal of the Mpro domain (Table
2, site 4), is found
halfway in EToV pp1a at a position topologically similar to that of an
accessory papain-like proteinase 2 site within coronavirus
pp1a. (iii) Of the predicted sites in the ORF1b-encoded
part of the replicase (Table
2, sites 10 to 14), only
one, located between the RdRp and zinc finger/helicase domains,
conforms to the proposed consensus FxxQ
(S, A) sequence (site
10; Table 2). Future
studies should determine whether the predicted sites within pp1a/pp1ab
are actually cleaved by Mpro. (iv) At the very C terminus of
torovirus pp1a, there is a domain that is absent from all other
nidovirus pp1a proteins and that shares identity with a predicted
cyclic phosphodiesterase (CPD) motif in the ns2A protein of group 2
coronaviruses and in the rotavirus VP3 guanylyl transferase (Fig.
7)
(57,
60). Puzzlingly, a
cleavage site C terminal of this domain is not immediately apparent,
raising the question whether Mpro utilizes a noncanonical
site, whether another viral or host proteinase is involved, or whether
processing of pp1ab yields an RdRp with the CPD homologue attached;
note that in the latter case, processing of pp1a would still produce an
additional, "free"
CPD.
![]() View larger version (39K): [in a new window] |
FIG. 7. Comparisonof the pp1a/pp1ab domain organization in different nidoviruses and a
tentative Mpro processing scheme of the torovirus replicase
polyproteins. The C-terminal half of pp1a (dark gray) and the
ORF1b-encoded part of pp1ab (light gray) are shown schematically, to
scale, for corona-, toro-, arteri-, and roniviruses. The border between
amino acids encoded in ORF1a and ORF1b is indicated as RFS (ribosomal
frameshift). The locations of domains that have been
identified as structurally or functionally related are highlighted and
indicated as follows: TM, putative transmembrane domains;
Mpro, main proteinase; RdRp, RNA-dependent RNA polymerase
motif; Z, zinc finger motif; Hel, helicase; Exo, (putative)
3'-to-5' exonuclease (ExoN); N, uridylate-specific
endoribonuclease (NendoU); MT, (putative) 2'-O-methyl
transferase (26,
30; see also reference
57 and references
therein). Also indicated is the (predicted) cyclic
phosphodiesterase domain (CPD) that resides near the C terminus of the
torovirus pp1a. Arrowheads represent known cleavage sites for accessory
papain-like proteinase (cyan) or Mpro (red), as established
for corona-, arteri-, and roniviruses and as established casu quo
predicted here for toroviruses (see also Table
2). The figure was adapted
from reference
56.
|
|
View this table: [in a new window] |
TABLE 2. Tentative
cleavage sites in pp1a/pp1ab for the torovirus main proteinase
|
Corona- and
toroviruses share many traits (e.g., with regard to virion composition,
genome organization, and the extent of colinearity in the ORF1b-encoded
region of pp1ab), which would justify their grouping together, separate
from the arteri- and roniviruses
(15,
19,
25,
26). However, in a recent
study of Gonzalez et al.
(25), entailing a
systematic quantitative analysis of sequence conservation among
nidoviral proteins, the expected sequence affinity between corona- and
toroviruses was not evident. Thus, with the topology of the nidovirus
tree undecided, it was suggested that "the nidovirus phylogeny
must be verified later when the number and diversity of torovirus and
ronivirus sequences will match those of arteriviruses and
coronaviruses"
(25). If it were to be
firmly established that of the nidoviruses, corona- and toroviruses
are, after all, the most closely related, and if this evolutionary
relationship also holds for the Mpro (i.e., if no RNA
recombination-based exchange of proteinase domainsor parts
thereofhas occurred during nidoviral divergence), our
observations would have one more major implication: namely that the
transition from serine- to cysteine-based proteolytic catalysis (or
vice versa) has happened more than once in the course of nidovirus
evolution. In this respect, it is of interest that a mutant EToV
Mpro with a Ser3416
Cys substitution
retained enzymatic activity and that, conversely, the catalytic site
Cys
Ser substitution in SARS-CoV Mpro also yields an
active enzyme
(36).
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»