Previous Article | Next Article ![]()
Journal of Virology, June 2003, p. 6167-6177, Vol. 77, No. 11
0022-538X/03/$08.00+0 DOI: 10.1128/JVI.77.11.6167-6177.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Viral Immunology Center, Department of Biology, Georgia State University, Atlanta, Georgia 30303,1 School of Biology and School of Biomedical Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-023022
Received 19 November 2002/ Accepted 3 March 2003
|
|
|---|
134.5 gene, which encodes a neurovirulence factor. Absence of this gene was verified in two low-passage clinical isolates derived from a rhesus macaque and a zoonotically infected human. This finding suggests that B virus most likely utilizes mechanisms distinct from those of HSV to sustain efficient replication in neuronal cells. Despite the considerable differences in G+C content of the macaque and B virus genes (51% and 74.2%, respectively), codons used by B virus are optimal for the tRNA population of macaque cells. Complete sequence of the B virus genome will certainly facilitate identification of the genetic basis and possible molecular mechanisms of enhanced B virus neurovirulence in humans, which results in an 80% mortality rate following zoonotic infection. |
|
|---|
B virus (Cercopithecine herpesvirus 1, monkey B virus) is a member of the subfamily Alphaherpesvirinae, which together with human herpes simplex virus types 1 and 2 (HSV-1 and HSV-2) constitutes the genus Simplexvirus. B virus generally causes only mild localized or asymptomatic infections in its natural hosts, Asian monkeys of the genus Macaca (33, 34, 74). In contrast, B virus infections in foreign hosts, humans or monkey species other than macaques, often result in encephalitis, encephalomyelitis, and death (53, 73, 74).
The genome organization of B virus is similar to that of HSV-1 and HSV-2: the unique long (UL) and unique short (US) segments flanked by inverted long (RL) and short (RS) repeat sequences are covalently joined in four possible isomeric configurations (26). Sequence analysis of the partial US regions of B virus and simian agent 8 virus demonstrated that human and primate viruses are colinear in this genomic segment (51). However, only a limited number of B virus gene sequences from the UL region have been published (3, 36, 62, 63), and nothing has been reported about the structure and the gene content of the repeat genomic elements.
The size of the B virus genome was estimated previously as 165 kb (26) and 162.5 kb (40), which is significantly larger than the HSV-1 and HSV-2 genomes (152 and 155 kb, respectively). Extra DNA might contain additional genes that are not present in human viruses and may provide insight about B virus pathogenicity in foreign hosts.
In this study, the complete genomic sequence of the B virus reference strain E2490, isolated from a rhesus macaque (Macaca mulatta), was determined, and viral DNA termini were located. The genomic structure and gene content were analyzed and compared to those of human herpesviruses HSV-1 and HSV-2.
|
|
|---|
DNA cloning and sequencing. The BamHI, KpnI, SalI, PstI, and XhoI restriction fragments of B virus genomic DNA were cloned into the pUC19 vector digested with the same enzyme. The following three techniques were employed to generate full sequences of the clones: primer walking, unidirectional deletions, and random transposon insertions (Epicentre Technologies).
To obtain the sequence of the IRL-IRS regions, two shotgun libraries of random overlapping clones were prepared from the adjacent PstI fragments P13 and P1, covering this region (Fig. 1). DNA fragments of 13 kb (P13) and 4.8 kb (P1) were isolated after separation of PstI-digested genomic DNA on a 0.8% agarose gel. Each DNA fragment was then treated with DNase I from a DNaseI Shotgun kit (Novagen) according to the manufacturer's suggested protocol. The DNase I-treated fragments were fractionated on a 1.4% agarose gel, and then 300- to 1,000-bp fragments were eluted from the gel, blunt ended with T4 DNA polymerase, dA-tailed with Tth polymerase and a dA tailing kit (Novagen), and ligated into the pScreen T vector (Novagen). The ligation mixtures were transformed into E. coli NovaBlue cells (Novagen).
![]() View larger version (24K): [in a new window] |
FIG. 1. Molecular cloning of B virus genome. The first letter in the clone designations indicates the cloning site: B, BamHI; K, KpnI; P, PstI, S, SalI; X, XhoI. The number in the clone designations reflects the order of clone isolation. Double-sided arrows indicate PCR fragments used for verification of clone junctions. The map scale is in kilobases.
|
Cloning of genomic DNA termini.
B virus genomic DNA (
0.5 µg) was blunt ended by treatment with 5 U of T4 DNA polymerase (Novagen) as suggested by the manufacturer. After purification on a Qiaquick spin column (Qiagen), the blunt-ended DNA was digested overnight with restriction enzyme SphI and ligated into SmaI- and SphI-digested vector pUC19. E. coli NovaBlue (Novagen) competent cells were transformed with the ligation mixture.
Southern blots. SphI-digested genomic DNA was transferred from a 0.8% agarose gel onto a nylon membrane (Roche) by alkali blotting according to the instructions supplied with the membranes. Plasmid DNA was labeled with digoxigenin with a HighPrime labeling kit (Roche). Prehybridization, hybridization, and washing steps were carried out according to standard protocols (59). Chemiluminescence detection of the bound probe was performed with an ECL detection kit (Amersham Pharmacia Biotech).
DNA sequence analysis. Identification of open reading frames (ORFs), repeats, and DNA regulatory sequences was performed with DNAStar suite of programs. GenBank database searches were carried out with BlastN, BlastP, and BlastX with default settings. Multiple alignments between B virus, HSV-1, and HSV-2 genes and proteins were performed with and analyzed by DNAStar MegAlign program (version 4.0.3) with the PAM250 amino acid substitution matrix and Joltun Hein method (30) and the following parameters: gap opening penalty = 11 and gap extension penalty = 3. Identity values were determined from the generated alignments.
The GeneMark and GeneMark.hmm gene-finding programs were used to refine the procedure of gene identification (4, 41). The parameters of the statistical models were defined by training on the set of B virus ORFs encoding protein products homologous to known HSV-1 and HSV-2 proteins. These statistical methods are able to identify frameshifts and unusual gene starts in addition to detecting genes not found by similarity search. To prove that a homolog of the HSV neurovirulence factor is absent in the B virus genome, BlastP and PSI-Blast searches with ICP34.5 protein as the query were used against the whole B virus genome translated in six frames. In addition, a Hidden Markov Model profile was created from the conservative domain of the ICP34.5 protein and used to scan the translated B virus genome.
In a search for unique genes, the PSI-Blast and RPS-Blast programs were used to search for possible conserved domains in proteins predicted by statistical methods. Predicted proteins shorter than 20 residues were excluded from the analysis. Analyses of protein structures were performed with the DNAStar Protean program (version 4.0.3), signal peptide prediction program SignalP version 2.0.b2 (http://www.cbs.dtu.dk/services/SignalP-2.0/), and transmembrane prediction programs TMHMM version 2.0 (http://www.cbs.dtu.dk/services/TMHMM/), DAS (http://www.sbc.su.se/~miklos/DAS/), and TMpred (http://www.ch.embnet.org/software/TMPRED_form.html).
GenBank accession numbers. The complete sequence of the B virus genome (GenBank accession number AF533768) and the L-terminal sequences of two clinical B virus isolates (GenBank accession numbers AY230747 and AY230748) have been deposited in the NCBI database.
|
|
|---|
Location of B virus genomic termini.
To determine the first and last nucleotides in the B virus genome, genomic termini were defined. Initially, the locations of the internal a sequence and the joint between the B virus long and short internal repeat regions (L/S joint) were estimated after aligning the HSV-1 and B virus IRL-IRS sequences (not shown). Two SphI restriction sites were located approximately 1.8 kb (in the IRL region) and 1.5 kb (in the IRS region) from the predicted internal a sequence. Since the RL and RS internal and terminal copies in alphaherpesvirus genomes are identical, SphI restriction sites were expected to be located
1.8 kb and
1.5 kb from the B virus L and S genomic termini, respectively, and therefore could be used for cloning the B virus genomic ends.
Blunt-ended B virus genomic DNA was then digested with SphI and ligated into SmaI- and SphI-digested vector pUC19. All internal genomic fragments had SphI sticky ends on both sides, and consequently only terminal genomic fragments, which had one blunt end and another SphI end, were compatible with the prepared vector and successfully cloned by this procedure. As predicted, two sets of recombinant clones were generated with inserts of the estimated sizes. To confirm that the clones isolated included viral DNA ends, plasmid DNA from clones containing 1.5-kb and 1.8-kb inserts was used to probe SphI-digested B virus genomic DNA on Southern blots. Theoretically, if these clones indeed contain genomic ends, they will hybridize to the two SphI genomic fragments of 1.5 kb or 1.8 kb (terminal) and 3.1 kb (internal from the L/S junction). These fragments and two additional 2.1-kb and 3.3-kb fragments of lower intensity were detected (Fig. 2A). The observed cross-hybridization patterns indicated that the two genomic ends had common sequences. The additional fragments are most likely L-terminal (2.1 kb) and junction (3.3 kb) fragments containing an extra copy of a sequence. The variable number of a sequence repeats has also been described at the L terminus and L/S junction in HSV-1 (57).
![]() View larger version (39K): [in a new window] |
FIG. 2. Identification of B virus genomic termini. (A) Southern blot of SphI-digested genomic DNA hybridized with digoxigenin-labeled plasmids containing 1.6-kb (lane 1) and 1.8-kb (lane 2) terminal fragments. An autoradiograph of the membranes is shown. The positions of DNA size markers are shown on the left. Arrows indicate terminal small (TS), terminal large (TL), and junction (J) fragments. (B) Cloned genomic fragments containing B virus termini. The structural organization of the B virus genome is shown, with the UL and US regions represented by solid lines and the TRL, IRL, TRS, and IRS regions represented by open boxes. Terminal a sequences and the oppositely oriented internal a' sequence are indicated. Below the genome diagram, a schematic alignment of the isolated terminal fragments is shown. Arrows denote the locations of a sequence copies and their orientations in the genome. The numbers in parentheses indicate the number of clones sequenced. (C) Alignments of the a sequences of B virus (BV), HSV-1, and HSV-2. Arrows indicate genomic ends. The conserved motifs of pac1 and pac2 signals are shown in bold.
|
Although overall sequence homology between the a sequences of B virus, HSV-1, and HSV-2 was low, the sequence composition of the cleavage and packaging signals was very well conserved (Fig. 2C). An A-rich motif followed by three CGCGGCG motifs formed the B virus pac2 signal. Interestingly, most herpesviruses have only one copy of the CGCGGCG motif, which contributes significantly to the efficiency of genome cleavage and packaging (45). The B virus pac1 contained three conserved motifs, an A-rich region flanked by two stretches of seven C residues and 13 C+G residues. Both pac1 and pac2 are located at conserved distances from the genomic ends (17), 31 bp from the L terminus and 34 bp from the S terminus of B virus DNA. There were no DR2 repeats between the pac1 and pac2 signals in the B virus a sequence, which were thus noticeably shorter than the HSV-1 a sequence.
Overall genome organization.
The complete B virus genomic sequence was assembled according to the HSV-1 prototype genome structure (57). The genome of B virus is 156,789 bp in length, considerably shorter than earlier predictions based on the constructed B virus physical map (26) or direct measurements by electron microscopy (40). The 74.5% G+C base composition of the genome is very similar to earlier estimates of 75% determined by DNA buoyant density centrifugation (40). Table 1 presents a comparison of the length and G+C content of B virus, HSV-1, and HSV-2 by genomic region. The G+C composition was elevated in all B virus regions compared to that in HSV types 1 and 2, with unique regions demonstrating the highest G+C content among known herpesviruses. The sizes of genomic segments were comparable among the viruses with the exception of RS, which in the B virus genome was
1.5 kb longer.
|
View this table: [in a new window] |
TABLE 1. Comparison of B virus, HSV-1, and HSV-2 genomic regions
|
|
View this table: [in a new window] |
TABLE 2. Sets of reiterated sequences in the B virus genome
|
![]() View larger version (19K): [in a new window] |
FIG. 3. Comparison of B virus and HSV-1 origins of replication. The DNA sequences of the oriL and oriS core elements are shown. BV/L1, B virus oriL1; BV/L2, B virus oriL2; BV/S, B virus oriS1 and oriS2; HSV1/L, HSV-1 oriL; HSV1/S, HSV-1 oriS. The OBP-binding sites (box I, box II, and box III) are boxed. Residue substitutions in all B virus origins relative to HSV-1 oriL are shown in bold. Residue substitutions in B virus oriL2 relative to oriL1 are underlined.
|
(i) HSV gene homologs. All but one B virus gene were identified on the basis of sequence homology to HSV-1 and HSV-2 genes and named correspondingly. Table 3 summarizes their locations in the genome, predicted sizes, and percent identities with the corresponding HSV-1 and HSV-2 gene products. Seventy-two genes existed as a single copy within unique genomic regions, whereas two genes, ICP0 and ICP4, appeared twice due to duplication of the large and small repeat regions where they reside. Two ORFs (RL2 and UL15) are predicted to have introns matching ones in the corresponding HSV genes.
|
View this table: [in a new window] |
TABLE 3. ORFs and other features of the B virus genome
|
The extent of amino acid identity between B virus and HSV polypeptides varied from 26.6% (US5) to 87.7% (UL15). The conservation of specific protein domains observed in HSV-1 and HSV-2 was mirrored when HSV-1 and B virus were compared. The following proteins were significantly conserved in B virus: DNA cleavage and packaging proteins, i.e., UL15, UL28, UL32, and UL33; capsid proteins, i.e., UL18, UL19, and UL38; proteins involved in DNA replication, i.e., UL2, UL5, UL29, and UL30; and glycoprotein B. The three least-conserved proteins in B virus were US4, US5, and US12. Similar levels of conservation have been described for homologous proteins in many other mammalian herpesviruses (58).
The B virus proteins could be divided into three groups based on the degree of similarity to HSV-1 and HSV-2 proteins. The largest group (46 proteins) showed greater similarity to HSV-2 proteins, e.g., DNA cleavage and packaging proteins. The second group (20 proteins), with capsid proteins among others, showed higher levels of similarity to HSV-1 proteins. In seven proteins, differences in similarity to HSV-1 or HSV-2 homologs were marginal, and these proteins formed the third group. The proteins that enable HSV-1 to replicate and reactivate more efficiently at orofacial sites and HSV-2 at genital sites are unknown, but the fact that B virus replicates and is reactivated with similar efficiencies at both sites may be explained by the similarities of selected B virus proteins with either HSV type 1 or 2 proteins. These hybrid properties in B virus raise many questions about alphaherpesvirus evolution.
(ii) Gene homologs absent. In recent years, the existence of additional HSV-1-specific genes has been proposed and substantiated by experimental data. UL20.5 is located between UL20 and UL21, ORF P and ORF O map antisense to the 34.5 gene, and UL43.5 and UL27.5 map antisense to UL43 and UL27, respectively (8, 55, 56, 71, 72). Like HSV-2 (18), B virus has no equivalents to these proposed genes.
One well-established gene, the HSV
134.5 (RL1) homolog, was observed to be absent in B virus. This conclusion was reached after repeated searches with state-of-the-art computational genome analysis tools (see Materials and Methods). To determine that this observation was not limited to the laboratory strain of B virus, we cloned and sequenced L-terminal SphI restriction fragments from two low-passage clinical isolates, one of which was derived from a rhesus macaque (54) and the other post mortem from a zoonotically infected human patient (16). Sequence comparison of these fragments with the corresponding fragment of the laboratory strain did not reveal any significant differences: only single nucleotide substitutions and variations in the number of copies of short reiterations were found in the clinical isolates relative to the laboratory strain. Since the absence of the
134.5 gene homolog was verified in three independent B virus strains, we concluded that it was a genuine feature of the B virus genome.
The protein product of the HSV-1
134.5 gene, infected-cell protein 34.5 (ICP34.5), is a neurovirulence factor with at least two known, distinct functional activities (9, 10, 12-14, 64, 66). One function, encoded by the carboxyl-terminal domain, negates the antiviral effect of induced protein kinase R by redirecting the host protein phosphatase 1
to dephosphorylate translation initiation factor eIF2
, preventing protein synthesis shutoff in infected cells (9, 11, 12, 29, 39). Another function, mapped to both amino-terminal and carboxyl-terminal domains, somehow enables the virus to replicate in the peripheral and central nervous systems of experimentally infected animals (10, 64). Deletion of
134.5 leads to complete neuroattenuation of highly neurovirulent HSV-1 strains. In the absence of
134.5, other genes may supply these functions, given the striking similarities between the replication characteristics of B virus and the human simplex viruses.
We predict that B virus currently uses compensatory strategies to block host responses to infection, similar to those described for HSV-1. For example, the HSV-1 US11 protein can inhibit protein kinase R activation and compensate for the absence of the ICP34.5 function in deletion mutants if expressed early in infection (5, 6, 28, 48). It was proposed that the US11 gene encodes an alternative mechanism to preclude the shutoff of protein synthesis that is currently inactive in HSV-1 (5). In addition, B virus might have evolved unique mechanisms to prevent termination of protein synthesis and elude the DNA replication blocks imposed by neuronal cells. To examine this possibility, a search for B virus-specific genes was performed.
(iii) Unique genes in B virus genome.
Two major sequence differences between the B virus and HSV-1 and HSV-2 genomes were detected. The B virus RS region contains an additional
1.5 kb of sequence between the S terminus and the ICP4 gene homolog, while the RL region is shorter in B virus than in the human viruses, with no sequence homology to the HSV ICP0 flanking regions. However, no potential genes were identified in these regions despite rigorous analyses.
A putative two-exon gene (UL53A) was identified by statistical analysis in the B virus UL region on a complementary strand (positions 112186 to 112495 and 112755 to 113344). This putative gene had a codon usage pattern compatible with codons present in established B virus genes. The UL53A protein is a hydrophilic basic protein (pI 11.9) with an amino acid composition biased to Pro and Arg residues. The first 21 residues were predicted to be a signal peptide by the SignalP program, and residues 222 to 240 were predicted to be a transmembrane domain by two out of three programs applied (TMpred and DAS), but probability scores were not high enough to classify this protein as a membrane protein. PSI-Blast analysis detected sequence similarity to domains in three neuronal proteins encoded in mammalian genomes: neural cell adhesion molecule NCAM-180 (GenBank accession number P13595, E value of 0.16), calcineurin inhibitor cabin 1 (GenBank accession number AAD40846, E value of 0.014), and brain calcium channel
1A subunit (GenBank accession number AAB64179, E value of 0.007). However, the similarity was insufficient to confidently characterize the gene at this time, and pending experimental evidence will be critical to determine whether this gene, which is missing in HSV-1 and HSV-2, provides B virus with a unique mechanism to attack neural cells.
B virus genome high G+C content and optimization of codon usage. The G+C content of the protein-coding regions in B virus, 74.4%, is only slightly lower than the G+C content of the noncoding regions, 75.4%. These data indicate a strong mutation pressure toward G and C nucleotides in B virus DNA evolution. Obviously, the genetic drift toward the abundance of G's and C's has been compensated for by positive selection for conservative A's and T's in the first (26% A and T) and second (44% A and T) positions of a codon in protein-coding regions and by positive selection for A's and T's in a few evolutionarily conserved regions (promoters, repeats, and other regulatory sites) in noncoding DNA. The G+C content of the third position of codons in B virus genes is extremely high, 93.1%. This bias, created by mutation pressure, seems to be a driving force in the formation of the codon usage pattern in B virus.
Genomes of primates tend to have low G+C content, and the logical question arises whether the highly G+C-biased codon usage in B virus matches the proportions of the isoacceptor tRNA in host cells. Unfortunately, no experimental data are currently available regarding the tRNA pool in macaque cells. Still, relative amounts of the isoacceptor tRNAs could be predicted based on the frequencies of codon use in rhesus macaque genes because a strong direct correlation between these parameters has been demonstrated in a number of organisms (23, 32, 49).
A codon usage catalog of rhesus macaque genes was generated based on 288 protein-coding sequences from the GenBank database and compared with the codon usage in B virus genes (Table 4). As expected, the overall codon usage patterns of B virus and the macaque hosts differed substantially. However, remarkably, the favored codons of the B virus genes mirrored the most frequently used codons in the rhesus macaque genes (except Ser and Arg, both encoded by six codons). Moreover, these most frequent codons in B virus were almost the only codons used for each amino acid. This observation correlates with intriguing data from a number of unicellular and multicellular organisms that highly expressed genes utilize preferable codons in each synonymous group, the optimal codons (20, 21, 23, 61). Thus, the combined effects of directional mutation pressure and selection for preferential codons put the B virus genome in a favorable state to efficiently employ the host cell translation machinery for expression of viral proteins.
|
View this table: [in a new window] |
TABLE 4. Codon usage in rhesus macaque and B virus genesa
|
134.5 gene homolog, acquisition of this gene by human viruses is an even more recent evolutionary event than was suggested previously (6, 7). Comparative analysis of the B virus and HSV genomes is essential to understand the mechanisms of B virus pathogenesis in humans. The complete sequence of the B virus reference strain also supplies much-needed information to determine the genetic basis of phenotypic and pathogenic differences among B virus isolates derived from different macaque subspecies. With this information, new antiviral and vaccine strategies can be designed to target critical viral components in order to control this deadly zoonotic agent.
|
|
|---|

134.5 herpes simplex virus 1 recombinant blocks the shutoff of protein synthesis induced by activated protein kinase R and partially restores neurovirulence. J. Virol. 76:942-949.
134.5 gene if present before activation of protein kinase R by precluding its phosphorylation and that of the alpha subunit of eukaryotic translation initiation factor 2. J. Virol. 72:8620-8626.
134.5 genes precludes shutoff of protein synthesis by blocking the phosphorylation of eIF-2
. J. Virol. 72:7005-7011.
2 gene antisense to the herpes simplex virus 1 gene encoding glycoprotein B. J. Virol. 72:6056-6064.
and premature shutoff of protein synthesis after infection with
134.5- mutants of herpes simplex virus 1. Proc. Natl. Acad. Sci. USA 92:10516-10520.
134.5, a gene nonessential for growth in culture. Science 250:1262-1266.
134.5 gene of herpes simplex virus. J. Virol. 68:8304-8311.
134.5 gene of herpes simplex virus 1 precludes neuroblastoma cells from triggering total shutoff of protein synthesis characteristic of programmed cell death in neuronal cells. Proc. Natl. Acad. Sci. USA 89:3266-3270.
134.5 gene function, which blocks the host response to infection, maps in the homologous domain of the genes expressed during growth arrest and DNA damage. Proc. Natl. Acad. Sci. USA 91:5247-5251.
134.5- herpes simplex virus 1: failure of activated RNA-dependent protein kinase to shut off protein synthesis is associated with a deletion in the domain of the
47 gene. J. Virol. 71:6049-6054.[Abstract]
134.5 protein of herpes simplex virus 1 complexes with protein phosphatase 1
to dephosphorylate the alpha subunit of the eukaryotic translation initiation factor 2 and preclude the shutoff of protein synthesis by double-stranded RNA-activated protein kinase. Proc. Natl. Acad. Sci. USA 94:843-848.
134.5 gene and may account for the attenuation of the mutant virus. J. Virol. 71:7750-7757.[Abstract]
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»