Previous Article | Next Article ![]()
Journal of Virology, November 2008, p. 10477-10486, Vol. 82, No. 21
0022-538X/08/$08.00+0 doi:10.1128/JVI.01113-08
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Stefano Gastaldello,1,2,
Sebastian Hildebrand,1
Nikolay Zinin,1
Gerco Hassink,1
Kristina Lindsten,1
Juergen Haas,3
Bengt Persson,1,4 and
Maria G. Masucci1*
Department of Cell and Molecular Biology, Karolinska Institutet, S-171 77 Stockholm, Sweden,1 Department of Biomedical Sciences, University of Padua, 35121 Padua, Italy,2 Division of Pathway Medicine, School of Biomedical Sciences, College of Medicine, University of Edinburgh, Edinburgh, United Kingdom, and Max von Pettenkofer-Institute, LMU-München, Munich, Germany,3 IFM Bioinformatics, Linköping University, S-581 83 Linköping, Sweden4
Received 27 May 2008/ Accepted 8 August 2008
|
|
|---|
|
|
|---|
Conjugation of Ub or UbLs is achieved through the activity of a cascade of enzymes, including activating enzymes (E1s), conjugating enzymes (E2s), and specific ligases (E3s) that catalyze the covalent linkage of the modifier to Lys residues in the substrate (11). In addition, deconjugating enzymes act as regulators of the system by maintaining the pool of free Ub and UbLs and by determining the rate of turnover of the conjugates (14, 50). In line with this proposed regulatory function, recent evidence points to a critical role of Ub deconjugases (deubiquitinating enzymes [DUBs]) in intracellular signaling, as exemplified by the activation of the transcription factor nuclear factor-
B (NF-
B) by cylindomatosis tumor suppressor, A20, Cezanne, and ubiquitin-specific protease 31 (25, 33) and by the initiation of DNA repair by USP1 (37).
Based on similarity with known family members, the human DUBs have been classified in five subfamilies including: ubiquitin C-terminal hydrolases (UCHs), ubiquitin-specific proteases (USPs), Machado-Joseph disease proteases (MJDs or Josephins), ovarian tumor proteases (otubains, OTUs), and the JAMM (Jab1/MPN/Mov34 metalloenzyme) motif proteases (2, 38). With the exception of JAMMs, the DUBs are cysteine proteases identified by a catalytic triad of Cys, His, and Asp/Asn residues and by the presence of conserved amino acid domains known as Cys- and His-boxes (C- and H-boxes, respectively) that are unique for each family (38). Mutational studies have shown that the Asp/Asn residue is not essential for catalytic activity, although it may contribute to the enzymatic activity by stabilizing the active-site thiolate and imidazolium ion pair (26, 35). The JAMM metalloproteases lack a C-box, while two conserved His residues in the JAMM motif were shown to be essential for activity (49).
Proteins with DUB activity are encoded in the genome of human adenovirus, herpesvirus, coronavirus, and bunyavirus (5, 6, 18, 29, 42). Furthermore, DUB activity was recently demonstrated in proteins encoded by some pathogenic bacteria that lack an intrinsic UPS (40, 52), suggesting that these enzymes play specific roles in the regulation of both viral and bacterial infection. Although these findings make the systematic identification of microbial DUBs a worthy endeavor, the task is complicated by the wide sequence variation of the known enzymes. Moreover, viral and bacterial proteins are often considerably different in sequence and domain organization compared to their mammalian counterparts, which further hampers the identification of distant functional homologues.
We have used a bioinformatics strategy here to identify putative virus-encoded DUBs. To overcome the difficulty posed by sequence variation, we focused on the conserved C- and H-boxes. Family-specific sequence alignments, pattern searches, and hidden Markov models (HMMs) were generated based on the amino acid sequences of known members of the five DUB subfamilies and then used to search a custom-made database of open reading frames (ORFs) from Epstein-Barr virus (EBV), a human gamma herpesvirus that establishes latent infections and is associated with a broad spectrum of malignancies of lymphoid and epithelial cell origin.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Data sets of DUBs used in this study
|
10 and containing Cys or His residues were selected for further analysis. (ii) Pattern search. C- and H-box specific patterns were constructed to search for putative catalytic domains. The C- and H-boxes of each DUB family were aligned with respect to the conserved Cys and His residues, and search patterns were manually derived from the sequence alignments based on the physicochemical properties.
(iii) HMM search. HMMs were constructed by using the HMMER (17) suite of programs. For each DUB subfamily the HMMs were trained with multiple sequence alignments of C- and H-boxes from a nonredundant set of sequences (Table 1, HMM training set). The HMM results were further categorized based on E-values and raw scores. In every search with subfamily-specific HMMs, matches with E-values of <25 were selected and manually screened for the presence of Cys or His residues in the alignment. Candidates containing putative C- or H-boxes were further categorized based on the HMM scores of the domains. The HMM score is a base 2 log of the probability of alignment to the HMM divided by the probability of random alignment. A score of zero represents 50% likelihood that the sequence is a true match to the model. Since the EBV ORF database contains 106 entries, scores greater than 6.7 correspond to a 99% probability to detect true homologs while a score of –6.6 corresponds to 1% probability. Due to the fact that the genomes of viral and other pathogenic organisms undergo higher mutation rates (16), low thresholds of statistical parameters were used to detect catalytic domains of DUBs.
(iv) Conserved Cys and His residue search.
Homologous proteins that are conserved between members of at least one subfamily of human herpesviruses (
-HHVs: herpes simplex virus type 1 [HSV-1], HHV1, HSV-2, HHV2, varicella-zoster virus, and HHV3; β-HHVs: human cytomegalovirus, HHV5, HHV6, HHV7; and
-HHVs: EBV, HHV4, Kaposi sarcoma herpesvirus, and HHV8) were identified by using BLAST with a cutoff E-value of 0.01, corresponding 99% probability to be a true homolog. Homologous proteins were then aligned by using Vector NTI (32), and the multiple alignments of each cluster were scanned for the presence of conserved Cys and His residues. Sequences flanking the conserved residues were searched for putative C- and H-boxes using the HMMs.
Functional validation of candidate DUBs: cloning of EBV ORFs into prokaryotic expression vector.
ORFs derived from the B95-8 virus were transferred from the GATEWAY expression vector pDONR207 (48) to the prokaryotic expression vector pDEST-GST (ampicillin resistance), and E. coli DH5
was used for transformation.
Ub-GFP construct. Green fluorescent protein (GFP) was amplified by PCR from pEGFP-N1 vector (Clontech, Palo Alto, CA) with the sense primer 5'-GTTTTCAGATCTAAAGGAGAAGAGCTGTTCACCGGCGTGAGCA AGGGCGAGGAGCTGTTCACC-3' and the antisense primer 5'-GTTCTCGAGCTTGTACAGCTCGTCCATGCCGAGAGTGAT-3' and cloned in the BglII and XhoI sites of the pACYCDuet-1 vector (Novagen, Darmstadt, Germany). Ubiquitin was amplified by PCR from an Ub-βGal plasmid with the primer 5'-GTTGAATTCAATGCAAATCTTCGTGAAGACTCTGACTGGTA-3' and the antisense primer 5'-GTTAGATCTGAACCCACCTCTGAGACGGAGTACCA-3' and inserted in frame in the EcoRI and BglII sites of the pACYCDuet-1 vector containing GFP. The underlined sequences represent restriction sites.
DUB assays. The GST-ORFs and Ub-GFP reporter plasmids were cotransfected by electroporation in competent BL21(DE3) bacteria that lack endogenous DUBs (9). Antibiotic-resistant colonies were selected in LB agar supplied with 100 µg of ampicillin/ml and 30 mg of chloramphenicol/ml. Exponentially growing bacteria (OD at 600 nm of 0.5) were induced for 5 h at 30°C with 0.5 mM IPTG (isopropyl-β-D-thiogalactopyranoside) and then lysed by sonication in phosphate-buffered saline supplemented with protease inhibitors (Complete Mini protease inhibitor cocktail tablets; Roche, Germany). The lysates were clarified by centrifugation for 10 min at 13,000 rpm, and cleavage of the reporter was analyzed by fractionation in acrylamide bis-Tris 4 to 12% precasted gradient gel (Invitrogen), followed by Western blotting. After transfer to polyvinylidene difluoride membrane (Millipore), the filter was blocked in phosphate-buffered saline supplemented with 5% nonfat milk and 0.1% Tween 20 and incubated for 1 h with anti-rabbit-GFP serum (Abcam, Cambridge, United Kingdom) diluted 1:10,000. After incubation for 1 h with peroxidase-conjugated goat anti-rabbit serum, the complexes were visualized by enhanced chemiluminescence (GE Healthcare, United Kingdom). For the enzyme kinetic assays, GST-USP19, GST-USP19mut, GST-BPLF1-N, GST-BSLF1, and GST-BXLF1 were purified from bacteria lysates (50 mM Tris-Cl [pH 7.5], 150 mM NaCl, 5 mM MgCl2, 1% Triton X-100, 1 mM phenylmethylsulfonyl fluoride, 1 mM dithiothreitol) by incubation for 90 min at 4°C with a solution of 10 mM glutathione in 50 mM Tris-Cl (pH 8.0), and the protein concentrations were determined with a Bio-Rad protein assay. Hydrolysis of the fluorogenic substrate ubiquitin-AMC (Ub-AMC, SE211; Biomol) was assayed at 30°C in 100 µl of assay buffer (50 mM Tris-Cl [pH 7.5], 150 mM NaCl, 2 mM EDTA, 2 mM dithiothreitol) containing 200 nM concentrations of the enzyme, and the Vmax and Km of the reactions were calculated by titrating the substrate at concentrations ranging between 0 and 20 mM. The fluorescent product was measured by using a spectrofluorimetry plate reader at 380-nm excitation and 460-nm emission wavelengths. To block the DUB activity, 10 mM N-ethylmaleimide (NEM; Sigma) was added to the assay buffer.
Site-directed mutagenesis. Point mutations of the predicted catalytic Cys residues of BPLF1, BSLF1, and BXLF1 genes were engineered by using a QuikChange site-directed mutagenesis kit according to the manufacturer's protocol (Stratagene). Two BPLF1 mutants (C61A and C65A), three single mutants (C462A, C824A, and C819A), and one double mutant (C824/819A) of BSLF1 and two mutants of BXLF1 (C84A and C491A) were constructed. All constructs were then sequenced to verify that only the desired mutations had been introduced.
|
|
|---|
Sequence alignment searches.
A first attempt to identify putative DUBs was performed using CLUSTAL W to align the EBV ORFs with the human DUB data set. Three ORFs clustered with specific DUB subfamilies, BGLF4 and BBLF4 clustered with the OTU subfamily and BcLF1 ORF aligned with JAMMs (not shown), but the homologous regions did not contain Cys or His residues, and these viral ORFs are therefore unlikely to possess enzymatic activity. Since alignment of the entire amino acid sequence may not detect homology restricted to short domains that are critical for enzymatic activity, BLAST was used to identify EBV ORFs containing sequences of homology with the conserved C- and H-boxes of each DUB family using an arbitrary cutoff E-value of
10. Twenty-five candidates containing Cys or His residues in homologous regions were found in this search (see Table S1 in the supplemental material), including for example, the BBRF3 ORF that contains a region of strict homology with the USP C-box (E-value = 0.018).
Pattern search. Additional search strategies were conducted to achieve a more stringent assessment of the likelihood that the identified domains might be true C- or H-boxes homologues. Family-specific search patterns were derived from the aligned C- and H-box sequences of the known DUBs. To increase the stringency of the search, when fewer than 10 family members were found in the human data set, homologues from other species were also included in the alignment. The derivation of search patterns for the C- and H-boxes of the UCH family is illustrated in Fig. 1, and the search patterns for USP, OTU, MJD and JAMM families are given in Fig. S1A to D in the supplemental material. In a first attempt, stringent search patterns were derived for each DUB family (UCH, C-box –11/+11 and H-box –7/+16; USP, C-box –8/+7 and H-box –17/+3; OTU, C-box –8/+10 and H-box –8/+5; MJD, C-box –10/+10 and H-box –6/+7; and JAMM motif –7/+17) by including at each position all residues observed in the data set (Fig. 1B, boldface residues) and then used to search the EBV ORF data set, allowing for only a single mismatch. The derived patterns were in all cases able to identify the DUBs belonging to the family but failed to identify any candidate in the EBV ORFs (results not shown), suggesting that they may be too stringent to recognize distant homologues. Less-stringent search patterns were then derived by allowing at each position additional amino acids with similar physicochemical characteristics or, when residues with different physicochemical characteristics were observed, all amino acids (indicated by an "X" in Fig. 1B and in Fig. S1A to D in the supplemental material). Regions of similarity with the C- and/or H-boxes of different DUB families were identified in 30 EBV ORFs (see Table S2 in the supplemental material). Four ORFs contained at least one putative C-box and one H-box, while three ORFs contained only one putative C-box and twenty-three ORFs contained one or more H-box-like domain.
![]() View larger version (54K): [in a new window] |
FIG. 1. Derivation of search patterns for the UCH subfamily catalytic domains. (A) The aligned amino acid sequences of the catalytic Cys and His-box regions of UCH (peptidase C12) family members are shown. The numbers above the alignment correspond to the amino acid position relative to the catalytic Cys and His residues. Polar residues are colored blue, hydrophobic residues are gray, and small residues (Gly and Ala) are white. (B) Stringent search patterns were derived by including at each position all amino acids observed in the aligned DUB sequences (indicated in bold). Less-stringent patterns were then obtained by allowing amino acids with similar physicochemical properties (indicated in italics). When different types of amino acids were observed at a given position, all amino acids were allowed in the pattern (indicated by X).
|
25 were therefore considered for further analysis. All hits containing Cys or His residues were further categorized based on the HMM scores (Table 2). Scores of
0, corresponding to more than 50% likelihood to be a true match, were considered high. Intermediate scores were between 0 and –6.6 (down to 1% probability to be a true match), while scores less than –6.6 were considered low. Seven EBV ORFs—BALF2, BALF5, BBRF3, BcLF1, BERF3-BERF4, BRLF1, and the predicted ORF88—contained at least one C- or H-box domain with score above 0, suggesting a potential homology with the corresponding DUB domains. Two ORFs—BNRF1 and BPLF1—contained both C- and H-box-like domains with scores
–6.6, and eighteen additional ORFs contained either a C- or an H-box-like domain with scores of less than –6.6. |
View this table: [in a new window] |
TABLE 2. Putative DUB catalytic boxes identified by the HMM searcha
|
- or β-HHVs were detected in two and seven ORFs, respectively; thirteen EBV ORFs shared conserved Cys or His residues only with homologues encoded by the other member of the
-HHV subfamily, KSHV, that resembles EBV in cell tropism and oncogenic capacity (see Table S3 in the supplemental material). The sequences adjacent to the conserved residue were searched for putative C- and H-boxes using the DUB family specific HMMs. Nine EBV ORFs were found to contain conserved Cys or His residues located in regions homologous to the DUB C- or H-boxes, and one ORF, BGRF1/BDRF1, contained both C- and H-box homologous domains (see Table S3 in the supplemental material).
DUB score and exclusion criteria.
A relatively large number of candidate DUBs were identified by at least one search, and several were identified by more than one. To select the most likely candidates for functional studies, scoring criteria were devised for each search, and these were then compiled in a global DUB score. The scores assigned to each search reflected the likelihood of identifying candidates that possess critical features associated with DUB activity (see Table S4 in the supplemental material). Candidates identified by the BLAST search were given a score of 1, 2, or 3 depending on the degree of similarity with DUB sequences. Candidates identified by the pattern search were scored 1 or 2 based on the presence of C- or H-box-like domains or both, while the score of candidates identified by the HMM considered both the presence of C- and H-box-like domain and their HMM scores (high = >0, intermediate = between 0 and –6.6, or low = <–6.6). The same scoring system was used for candidates identified by the presence of conserved Cys and His residues. The scores of 58 candidates identified by at least one search are shown in Table S4 in the supplemental material. Two ORFs—BGRF1/BDRF1 and BPLF1—were identified by the four searches, nine ORFs were identified by three searches, twenty-two ORFs were identified by two searches, and twenty-five ORFs were found by only one search. All candidates identified in at least three searches and five candidates identified by two searches received scores of
4. This group included 16 ORFs, all of which were identified by HMM, and 7 were shown to contain conserved Cys or His residues.
In order to further restrict the number of candidates to be validated by functional studies, ORFs with scores of
4 were scrutinized for the presence of structural constraints that could prevent DUB activity. The locations of the putative C- and H-boxes were mapped relative to each other and to other protein domains of known function (Fig. 2). Based on this domain analysis, the BBRF3 and BILF1 ORFs could be excluded since their putative C- and H-boxes are located in transmembrane regions, which would preclude DUB activity. Analysis of the site of expression in EBV-infected cells, expression during different phases of the virus cycle, and functional annotation (Table 3) revealed that some of the high-scoring candidates are structural proteins or glycoproteins associated with the virus capsid and envelope. These proteins are unlikely to possess enzymatic activity, although this cannot be excluded a priori. The remaining candidates include immediate-early, early, late, or latent proteins that are expressed in the nucleus or, in one case, in the cytoplasm of the infected cells or are components of the viral tegument. It is noteworthy that two tegument proteins were found in the group with the highest DUB score and one of them, the protein encoded by the BPLF1 ORF, was previously shown to possess DUB activity (23, 42).
![]() View larger version (28K): [in a new window] |
FIG. 2. Mapping of the putative C- and H-box domains in ORFs with high DUB scores. The positions of the putative C- and H boxes identified by the different searches and the Cys and His residues that are conserved in homologues encoded by other HHV family members are indicated. Putative transmembrane domains were identified by a TMHMM search in the candidate ORFs and are indicated as yellow hexagonal regions. In BBFR3, BILF1, and ORF88 the C- and/or B-boxes overlap with or flank transmembrane domains.
|
|
View this table: [in a new window] |
TABLE 3. ORF annotation and filtering criteria
|
4 and an approximately equal number of ORFs with scores of
3 were tested for their capacity to hydrolyze a Ub-GFP reporter plasmid coexpressed in bacteria. The results of a representative assay wherein the ORFs were tested alongside the human USP19 and a catalytic mutant that lacks enzymatic activity are shown in Fig. 3.
![]() View larger version (32K): [in a new window] |
FIG. 3. Functional validation of the candidate DUBs. (A) Representative Western blot illustrating cleavage of the Ub-GFP reporter by a restricted panel of EBV ORFs. BL21 cells were cotransfected with Ub-GFP, and the indicated ORF and protein expression was induced by treatment with IPTG. Cell lysates were fractionated on 4 to 12% sodium dodecyl sulfate-polyacrylamide gel electrophoresis gradient gels, and GFP was detected by Western blotting. Cleavage of the reporter in lysates expressing active DUBs is visualized by appearance of a free GFP band. (B) Enzymatic activity of candidate EBV DUBs (score 4) and controls (score 3). The intensity of the bands corresponding to the uncleaved reporter and free GFP was determined by densitometry, and the percent specific cleavage was calculated as follows: intensity of the free GFP band - background/total GFP (uncleaved + cleaved) - background x 100. A cutoff activity value was calculated as 3 x the mean percent specific cleavage in cells expressing ORFs with a score of 3. The mean of three experiments in which all of the ORFs were tested in parallel is shown in the figure. (C) Time course of Ub-AMC hydrolysis. Ub-AMC (1.2 µM) was mixed with 200 nM concentrations of purified GST-USP19wt, GST-USP19mut, GST-BPLF1-N, GST-BSLF1, and GST-BXLF1 in the absence (empty symbols) or presence (filled symbols) of NEM, and the reactions were monitored for 180 min. The release of fluorescent AMC, expressed in arbitrary units (AU), shows the activity of these enzymes to cleave Ub-AMC. The Km of the reactions was calculated in separate experiments and is indicated in the Figure. (D) Time course of Ub-AMC hydrolysis of BPLF1, BSLF1, and BXLF1 expressed in mammalian cells. The ORFs were cloned in frame in the 3XFlag tag vector and expressed in HEK293 cells. The proteins were immunoprecipitated from the transfected cells using antibodies to the Flag tag, and the cleavage of Ub-AMC was assayed as described for panel C. Equal amounts of proteins were used in all enzymatic assays. The faster kinetics of Ub-AMC cleavage (compare panels C and D) indicates that the enzymatic activity of mammalian expressed BSLF1 and BXLF1 is significantly improved compared to the proteins expressed in bacteria.
|
3 [Fig. 3B]). The expression of the fusion proteins was in all cases confirmed in Western blots probed with anti-GST antibodies (data not shown). In order to further characterize their DUB activity, bacterially expressed GST-BPLF1-N, GST-BSLF1, and GST-BXLF1 were purified, and their enzyme activity was assayed by cleavage of the fluorogenic substrate Ub-AMC (Fig. 3C). The three EBV ORFs hydrolyzed Ub-AMC with different kinetics and Km values, suggesting very different affinities for the substrate, but the specificity of the reaction was in all cases confirmed by blocking with the cysteine protease inhibitor NEM. Interestingly, the enzymatic activity of BSLF1 and BXLF1 relative to BPLF1 was significantly improved when equal amounts of immunoprecipitated proteins expressed in mammalian cells were compared for the cleavage of Ub-AMC (compare Fig. 3C and D), suggesting that posttranslational modifications that are not achieved in bacteria cells may be required for optimal activity. In order to further validate the DUB activity of the candidates, the predicted catalytic Cys residues in BPLF1, BSLF1, and BXLF1 were mutated to Ala by using PCR-mediated site-directed mutagenesis, and the enzymatic activity of recombinant GST tagged proteins was tested against the Ub-AMC substrate (Fig. 4). Two putative C-boxes were predicted in the BSLF1 and BXLF1 ORFs, while a single domain containing two Cys residues was predicted in BPLF1. Alignment of the corresponding sequences in other members of the HHV family revealed that, while BPLF1 Cys61 and BSLF1 Cys819 and Cys824 are conserved in all HHVs, other potentially catalytic Cys residues identified by the bioinformatics search are unique for EBV or shared only by some of the family members. Mutation of the conserved C61 of BPLF1 to Ala abolished the enzymatic activity, while the C65A mutation had no effect, confirming that C61 is the catalytic residue in the C-box (Fig. 4). Single mutation of the nonconserved C462 and conserved C824 of BSLF1 did not affect the enzymatic activity, while the activity of the C819A mutant was significantly reduced. Interestingly, the hydrolysis of Ub-AMC was further decreased in the double mutant C819/824A, suggesting that C824 may partly substitute for the catalytic C819 residue. Homologs of BXLF1 were present only in alpha- and gammaherpesviruses, and the putative catalytic cysteine C491 is conserved only in gammaherpesvirus. Mutation of this residue abolished the enzymatic activity of BXLF1, while mutation of the nonconserved C84 had no effect (Fig. 4). Thus, catalytic Cys residues were identified by mutation analysis in BPLF1, BSLF1, and BXLF1, confirming that the three EBV ORFs are bona fide viral DUBs.
![]() View larger version (48K): [in a new window] |
FIG. 4. Identification of the catalytic Cys residues by site-directed mutagenesis. Sequence alignments of the putative catalytic C-boxes in BPLF1, BSLF1, and BXLF1 and their homologs encoded by other HHV members shows conservation of Cys residues that are involved in DUB activity. The conserved residues are shown in white text with a black background, while similar residues in the alignment are shaded in gray. The asterisk above the alignment indicates the Cys residues identified in the bioinformatics search. Mutations were introduced into the predicted catalytic Cys residues, and the ability of the mutants to hydrolyze Ub-AMC was assayed as described in the legend to Fig. 3.
|
4, and several scored even higher than the EBV-encoded protein (Table 4) . This is in line with the reported DUB activity of several BPLF1 homologues. In agreement with the very low sequence similarity of the BXLF1 homologues expressed in alpha- and gammaherpesvirus these ORFs received very low DUB scores, suggesting that this protein may have different functions in different viruses. |
View this table: [in a new window] |
TABLE 4. DUB scores for the herpesvirus homologs of the identified EBV DUBs
|
|
|
|---|
We reasoned that, in spite of strongly divergent sequences, a higher degree of conservation might be expected in regions that are important for enzymatic activity. Thus, although the homology between the catalytically relevant regions of viral and cellular enzymes falls below the statistically significant cutoffs used in conventional searches, distant homologues may be identified using less-stringent search criteria. The human genome encodes for approximately 95 DUBs subdivided in five families (38). Although the sequences of individual DUBs vary widely, the members of each family share a high degree of homology in regions surrounding conserved Cys and His residues that form the catalytic core of the enzymes. We therefore focused on the identification of EBV ORFs containing a region of homology with the C- and H-boxes of each DUB family and scored the candidates based on the degree of homology with the consensus sequences and on the presence of only one or both types of consensus domains. Four different strategies based on sequence alignment, pattern search, HMM motifs, and detection of conserved Cys and His residues in the context of HMM motifs were used to search for putative C- and H-boxes. The use of multiple searches provides a distinct advantage since each search is biased by different criteria. Thus, while sequence alignment scores for the presence of a continuous sequence of homologous residues, the pattern search restricts the hits to sequences that share similar residues at critical positions in the motif and HMM gives a global score based on the likelihood that a given residue will be found in each position of the motif. A further refinement of the HMM search was based on the assumption that functionally relevant proteins will be conserved among members of a virus family, and therefore conserved Cys and His residues will be found in the putative catalytic domains. By assigning individual scores to each of these searches and then combining the scores into a global DUB score, we have identified 16 candidates with DUB scores of
4 out of 106 annotated or predicted ORF in the EBV genome. Five of the candidates were excluded from further analysis because the putative catalytic domains are located within or at the opposite sides for transmembrane domains or because they are either known structural components of the virus capsid or envelope glycoproteins and are therefore unlikely to possess DUB activity.
A previously identified viral DUB encoded by the BPLF1 ORF was found among the high score candidates. BPLF1 is the EBV homologue of UL36, a presumably multifunctional protein of the alphaherpesvirus HSV-1. DUB activity was previously mapped to the N-terminal portion of UL36 by using a functional proteomic approach based on labeling of total cell extracts with and epitope-tagged, Ub-based suicide substrates followed by immunoprecipitation, and identification of the labeled enzyme by mass spectrometry (23). Using a reporter substrate that carries ubiquitin fused to GFP in a conformation that mimics the ubiquitin precursors and, to some extent, a true ubiquitinated substrate, we have now confirmed that the N terminus of BPLF1 is a very potent ubiquitin deconjugase. Interestingly, two additional ORFs in the high-score group, BSLF1 and BXLF1, were also capable of cleaving Ub-GFP at levels significantly above background. The enzymatic activity of the new putative DUBs was in both cases confirmed by the capacity of the purified GST fusion proteins to cleave the fluorogenic substrate Ub-AMC. Furthermore, the catalytic Cys residues were identified in both proteins by mutation analysis, confirming that these EBV ORFs are bona fide DUBs. It is noteworthy that bacterially expressed full-length BSLF1 and BXLF1 exhibited a poor enzymatic activity compared to both the N-terminal portion of BPLF1 and the human DUB USP19 that was included as control. Comparison of the enzymatic activity of purified proteins expressed in mammalian cells suggests that this may be due to the requirement for posttranslational modifications that are not achieved in bacteria. Indeed, only the N terminus of UL36 was identified in the functional screen performed in HSV-1-infected cells, suggesting that the protein may be processed during infection.
The new viral DUBs identified in the present study are expressed in the nucleus of EBV-infected cells during the early phases of the productive virus cycle and, based on their annotated functions, are involved in DNA replication and nucleotide metabolism. It is noteworthy that the efficiency and fidelity of DNA replication is regulated by posttranslational modification of several components of the replication complex, including for example, monoubiquitination, polyubiquitination, and sumoylation of PCNA (36). The BSLF1 gene product shares 23% sequence identity with HSV UL52, the primase component of the trimolecular helicase-primase complex was shown to be essential to HSV replication (12, 13). In EBV the complex is made up of BSLF1, BBLF4 (UL5 homolog), and BBLF2/3 (UL8 homolog) (51), but its activity in viral replication has not been directly confirmed. Similar to UL52, BSLF1 contain several conserved amino acid domains, such as the DxD motif (aa 481 to 498) and a putative zinc finger (Znf; aa 782 to 829) motif at the C-terminal end of protein that are also found in prokaryotic and eukaryotic primases (7, 15, 24). Mutational analysis of the Znf of HSV UL52 showed that this region is required for the virus replication (7, 8, 10). The Znf of BSLF1 partially overlaps with the predicted catalytic C-box.
BXLF1 gene encodes a protein of 607 aa that was annotated as the EBV thymidine kinase (TK) gene due to its capacity to complement TK-negative strains of E. coli (30) and partial sequence homology with regions of HSV-1 TK (19). We found that the regions that identify BXLF1 as a putative DUB and the catalytic Cys residue are not conserved in other herpesvirus homologues. Moreover, in contrast to other family members, BXLF1 was shown to localize to the centrosomes, where it encircles the tubulin-rich centrioles in a microtubule-independent manner (20). It remains to be seen whether this atypical localization may reflect a dual enzymatic activity that is not present in the homologues.
In conclusion, we have shown that it is possible to use bioinformatics tools to identify distant homologues of DUBs encoded by viruses. Our strategy of choosing candidates based on a global DUB score obtained by combining the results of multiple low stringency searches identified 16 high scoring ORFs in a database of 106 putative or confirmed EBV ORFs. Enzymatic activity was confirmed for 3 of 11 ORFs tested, whereas ORFs with low score and ORFs lacking putative C- or H-boxes were uniformly negative. Thanks to its relatively good predictive capacity, our approach could provide a valuable complement to the functional screening described by Kattenhorn et al. (23), which, due to the requirement for validation by mass spectroscopy, is strongly biased toward enzymes that are overexpressed in the infected cells. Further studies will be required to assess how the DUB activity of BPLF1, BSLF1, and BXL1 contributes to their role in virus replication and to the remodeling of the cellular environment during productive EBV infection.
This study was supported by grants from the Swedish Cancer Society, the Swedish Medical Research Council, and the Karolinska Institutet, Stockholm, Sweden (to M.G.M. and K.L.); by the Carl Trygger Foundation and Linköping University (to B.P.); by Bayerisches Genomeforschungsnetzwork and MRC G0501453 (to J.H.); and by the European Community Integrated Project on Infection and Cancer, project LSHC-CT-2005-018704 (to M.G.M.).
Published ahead of print on 20 August 2008. ![]()
Supplemental material for this article may be found at http://jvi.asm.org/. ![]()
R.S. and S.G. contributed equally to this study. ![]()
|
|
|---|
B signalling by deubiquitination. Nature 424:801-805.[CrossRef][Medline]
B roadmap. Mol. Interv. 4:144-146.
B activation. J. Exp. Med. 202:1327-1332.This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»