ABSTRACT
Human herpesvirus 6 (HHV-6), which belongs to the betaherpesvirus subfamily and infects mainly T cells in vitro, causes acute and latent infections. Two variants of HHV-6 have been distinguished on the basis of differences in several properties. We have determined the complete DNA sequence of HHV-6 variant B (HHV-6B) strain HST, the causative agent of exanthem subitum, and compared the sequence with that of variant A strain U1102. A total of 115 potential open reading frames (ORFs) were identified within the 161,573-bp contiguous sequence of the entire HHV-6 genome, including some genes with remarkable differences in amino acid identity. All genes with <70% identity between the two variants were found to contain deleted regions when ORFs that could not be expressed were excluded from the comparison. Except in the case of U47, these differences were found in immediate-early/regulatory genes, DR2, DR7, U86/90, U89/90, and U95, which may represent characteristic differences of variants A and B. Also, we have successfully typed 14 different strains belonging to variant A or B by PCR using variant-specific primers; the results suggest that the remarkable differences observed were conserved evolutionarily as variant-specific divergence.
Human herpesvirus 6 (HHV-6) was first isolated in 1986 from the peripheral blood of patients with lymphoproliferative disorders (53). By comparison with other human herpesviruses, molecular and immunological analyses revealed its distinct nature (22). The virus replicates predominantly in CD4+ lymphocytes (33, 60) and may establish latent infection in cells of the monocyte/macrophage lineage (23). Infection with this virus results in exanthem subitum (ES), a common illness of infancy (74), but has not yet been clearly linked to any disease in adults. Two variants of HHV-6 have been identified on the basis of differences in epidemiology, in vitro growth properties, reactivity with monoclonal antibodies, restriction endonuclease mapping, and nucleotide sequence (1, 2, 8, 15, 55, 72, 73), leading to adoption of the nomenclature HHV-6A (variant A) and HHV-6B (variant B). HHV-6A has not yet been clearly linked to any disease, and HHV-6B is the causative agent of ES (3). Gompels and coworkers (16) have determined the complete DNA sequence of HHV-6A strain U1102. In the present report, to examine which gene(s) could be related to pathogenicity and other viral properties, we present the complete DNA sequence of HHV-6 strain HST, which belongs to variant B, and compare the sequence with that of HHV-6A. We also report that some of the HHV-6 genes show remarkable differences in deduced amino acid sequence identity compared with variant A. Such differences could represent characteristic, or possibly pathogenic, differences between the two variants.
MATERIALS AND METHODS
Cells and viruses.Umbilical cord blood mononuclear cells (CBMCs) were separated on a Ficoll-Conray gradient, cultured in RPMI 1640 medium containing 10% fetal calf serum, and stimulated with phytohemagglutinin (5 μg/ml) for 2 or 3 days. HHV-6 strain HST, which was isolated from a patient with ES and belongs to the variant B group, was propagated in fresh human peripheral mononuclear cells (74). To prepare virus stocks, virus was propagated in stimulated CBMCs. When more than 80% of the cells showed cytopathic effects, the culture of infected cells was frozen and thawed twice; after centrifugation at 1,500 × g for 10 min, the supernatant fraction was stored at −80°C as a cell-free virus stock. To infect cells with virus, stimulated CBMCs (107 cells) were washed twice with phosphate-buffered saline, suspended in 1 ml of virus solution with 107 50% tissue culture infective doses per ml, and spun at 1,500 × g for 40 min at 37°C for adsorption. To prepare the viral DNA, the infected cells were cultured for 2 or 3 days in RPMI 1640 medium supplemented with 10% fetal calf serum and harvested when approximately 50% of the cells showed cytopathic effects. For analysis of independently isolated viruses by PCR, CBMCs infected with the viruses were used. The following viruses were kindly supplied by other researchers: strain Z29 by C. Lopez (32), strains GS and DA by D. V. Ablashi (1), strain U1102 by R. Honess (11), and strain St. W. by G. Enders (12). Other HHV-6 strains (MNM1, MNM2, TMK1, TMK2, NKT2, OKC1, OKC3, and MCZ2) were isolated in our laboratory from patients with ES. All strains were confirmed as HHV-6 by using specific monoclonal antibodies to HHV-6 (data not shown). To test the specificities of the primers used for conducting virus strain typing, herpes simplex virus type 1 (HSV-1; strain F), HSV-2 (strain G), varicella-zoster virus (strain Oka), human cytomegalovirus (HCMV; strain AD169), Epstein-Barr virus (strain B95-8), and HHV-7 (strain KHR) were used for PCR.
Preparation and purification of HHV-6 DNA.To determine the DNA sequence of the entire HHV-6 HST genome, viral genomic DNA was purified as described elsewhere (35, 43). To amplify HHV-6 genome DNA by PCR, DNA from HHV-6-infected CBMCs (106cells) 2 days after infection was extracted in 0.5 ml of lysis buffer (0.45% Nonidet P-40, 0.45% Tween, 10 mM Tris-HCl [pH 8.3], 50 mM KCl, 3 mM MgCl2, 10 μg of protease K per ml) at 65°C overnight and at 98°C for 10 min to denature the proteinase K.
PCR.For assembling the various DNA sequences into the complete genome DNA sequence, the junction regions of nonoverlapping sequences, except the origin of DNA binding (Ori) region, the 3′ end of the unique region, and the concatemer junction region, were amplified by PCR using genomic viral DNA prepared as described above with corresponding oligonucleotide primers, which were synthesized based on the previously determined HHV-6 sequence at the fragment termini. PCR was carried out with approximately 10 ng of the viral DNA template as follows: 25 cycles of denaturation (95°C for 1 min), annealing (55°C for 1 min), and extension (72°C for 2 min), using a PCR Thermal Cycler TP480 (Takara Shuzo, Kyoto, Japan). For the Ori region and the right-hand end of the unique region, amplified DNA fragments were obtained from 10 ng of purified strain HST DNA by shuttle PCR as described below, using LA Taq (Takara Shuzo), an appended buffer, 200 μM each deoxynucleoside triphosphate, and two pairs of PCR primers, 06-3end (5′-AAGACACCATACTCGGAAGAATCATGTTACGACGTT)–22-5end (5′-TTGGCCTGAGACCTTACAACGGAGACGCCCGTGATC) and 07-5end (5′-GAGATAGAGTCGGCCTTGGAACACGTCGTGGCAGAAC)–45-3end (5′-GATAAAAAAGTAATTCGTTCATATGAGTAGTCGGAT), respectively. In the case of the concatemer junction region, the DNA fragment was obtained from extracted viral DNA by shuttle PCR as described below, using primers 5Ter-JuncC.LA (5′-CAGGGCGTGGCTGACACGCGCCGGCAGCGGCAT) and 3Ter-JuncF.LA (5′-ACGACGCCAAGGGAAGCCTCTGGCGCAATCTAT). The PCR product was then cloned into vector pCR2.1 (Invitrogen, San Diego, Calif.). Shuttle PCR conditions were as follows: incubation at 98°C for 3 min for hot start, 30 cycles of 98°C for 30 s, and 68°C for 10 min, with a final extension at 72°C for 10 min.
DNA sequencing and sequence analysis.Purified viral genomic DNA was digested with restriction enzymes PstI,BamHI, HindIII, or KpnI (Takara Shuzo). The resultant fragments were cloned into the corresponding restriction enzyme sites of pUC19. To make deletion libraries, pUC19 clones carrying PstI-DNA fragments (designated pSTY clones) and the other restriction fragments (Fig.1) were subjected to unidirectional progressive deletions, except for pSTY57, which contained an insert of approximately 200 bp. This procedure was carried out with a deletion kit for kilo-sequencing (Takara Shuzo), and the DNAs were subsequently transformed into Escherichia coli to obtain the corresponding deletion mutants. These inserts, as well as the normal PCR products described above, were sequenced as described by Isegawa et al. (20). Their corresponding sequences were confirmed by using the opposite unidirectional progressive deletions of the pSTY clones, the cassette-ligation-mediated PCR method, or direct sequencing of PCR products amplified from DNA (20). The resultant shuttle PCR products mentioned above were directly sequenced by cassette-ligation-mediated PCR (20) using a Li-Cor (Lincoln, Neb.) 4000L DNA sequencer. The nucleotide sequence of the entire HHV-6 genome thus obtained was analyzed to identify potential open reading frames (ORFs), using the software packages MacDNASIS version 3.7 (Hitachi Soft Engineering Co., Ltd., Yokohama, Japan) and ALIGN (Genome Information Research Center, Osaka University). Sequence comparisons were carried out using the sequences of HHV-6A (strain U1102) (16), HHV-6B (strain Z29) (9, 26, 27, 48), and HHV-7 (strain JI) (44).
Restriction fragments used for the sequencing of HHV-6 strain HST. The HHV-6 genome is represented diagrammatically, together with the positions of BamHI, HindIII,KpnI, and PstI sites. Various plasmid-cloned restriction fragments are prefixed with Ba, H, Kp, or pSTY (indicating their terminal restriction sites). Sequenced restriction fragments are indicated by shading.
Nucleotide sequence accession number.The complete sequence of HHV-6 strain HST has been deposited in the GenBank, EMBL, and DDBJ Data Libraries under accession no. AB021506 .
RESULTS
Assembling the contiguous HHV-6B sequence.The HHV-6 genome was sequenced by using the PstI, BamHI,HindIII, and KpnI deletion libraries described above. The fragments used for the project are shown in Fig.1. The resultant PstI-DNA fragments of 15.3, 13.1, 9.7, 6.6, 6.5, 5.5, 5.3, 5.3, 5.2, 4.8, 4.6, 4.5, 4.4, 4.4, 4.1, 3.8, 4.0, 2.5, 2.7, 2.1, 1.7, 1.7, 1.5, 1.5, 1.5, 1.4, 1.2, 1.2, 1.1, 1.0, 1.0, 0.8, 0.7, and 0.2 kb were cloned into pUC19 and designated pSTY01, -02, -03, -05, -06, -07, -08, -09, -10, -13, -15, -16, -22, -23, -24, -27, -28, -29, -32, -34, -36, -37, -41, -42, -43, -45, -46, -47, -48, -49, -50, -52, -56, and -57, respectively. These pSTY clones covered about 85% of the HHV-6 genome. Junctions between nonoverlapping restriction fragments were determined by directly sequencing PCR fragments spanning the junction regions (see Materials and Methods). The sequences of the terminal direct repeat (DR) elements were derived by compiling partial sequences of DRL and DRR to generate the complete DR sequences. The positions of the left and right genome termini were determined by sequencing PCR fragments that had been produced with 5′- and 3′-terminal concatemer junction primers (see Materials and Methods). As shown in Fig.2, the HHV-6 genome was composed of the DRs at the left and right ends of the genome (DRL and DRR) and the unique region (UR; containing the repeat regions [R1, R2, and R3] and Ori), for a total of 161,573 bp, including 8,231 bp in the DRs (DRL, nucleotide positions 1 to 8231; DRR, nucleotide positions 153,333 to 161573) and 145,101 bp in the UR (nucleotide positions 8232 to 153332). The genome length of the HST strain was 2,244 bp larger than that of the U1102 strain (144 and 1,966 bp larger at DR and UR, respectively). The overall GC content of the HST strain was 43%, the same as that of the U1102 strain. When the nucleotide sequence was compared with that of the U1102 strain, marked differences were observed only at specific regions of the DR and at the junction between the UR and DR. Higher similarity was observed in the UR. The nucleotide sequence variation of these regions is attributable differences between strains rather than differences between variants (43).
Predicted ORF organization of HHV-6 strain HST. Repeat regions (DRL, DRR, R1, R2, and R3) are boxed; telomeric repeat regions (T1 and T2) are denoted by striped bars; UR is indicated by a solid line. Protein coding regions are indicated as open arrows and are numbered DR1, DR2, DR3, DR6, DR7, DR8, DRHN1, and DRHN2 within the direct repeats and HN1, HN2, HN3, and U1 to U100 (excluding U78, U88, U92, U93, and U96) within the UR. The Ori is indicated by an asterisk. These regions are further described in Table 1 and in the text. The US22 gene family is shaded. Abbreviations not given in text: GCR, G-protein-coupled receptor; Ig, Ig immunoglobulin superfamily; RR, large subunit of ribonucleotide reductase; mCP, minor capsid protein; CA, capsid assembly protein; Teg, tegument protein; Pol, DNA polymerase; tp, transport protein; mDBP, major single-stranded DNA binding protein; TA, conserved herpesvirus transactivator; dUT, dUTPase; Pts, protease/assembly protein; pp65/71, phosphoprotein 65/71K; MCP, major capsid protein; PT, phosphotransferase; Exo, exonuclease; OBP, origin binding protein; Hel, helicase; UDG, uracil-DNA glycosylase; Che, chemokine; AAV rep1, adeno-associated virus replication protein homolog.
Identification of HHV-6B ORFs.The contiguous 161,573-bp sequence of the entire HHV-6 genome was analyzed to identify potential ORFs, using DNASIS version 3.7 (Hitachi Soft Engineering). Potential genes in the HHV-6 strain HST genome were selected by the following criteria: (i) ORFs larger than 300 nucleotides with initiating methionine codons (ATG) were generally considered to be genes encoding proteins; and (ii) ORFs smaller than 300 nucleotides but larger than 80 nucleotides were considered to be genes if their deduced amino acid sequences showed significant homology to those of U1102 and other herpesviruses or cellular proteins. The ORFs and homologous regions thus identified are illustrated in Fig. 2 and listed in Table 1. Within the DR, there are several ORFs (LT1, DR1, DR2, DR3, DR6, DR7, DR8, DRHN1, and DRHN2), and numerous ORFs are present within the UR. In general, the degrees of identity between corresponding ORFs were very high between variants A and B (Table 1). However, variant-specific ORFs were observed in the genome of each strain: HN1, HN2, HN3, DRHN1, and DRHN2 in the HST strain and U78, U88, U92, U93, U96, DR4, and DR5 in the U1102 strain. HN1 and HN3 turned out to be exons of the U89-IE1 (immediate-early 1) and spliced glycoprotein gp82/105 genes, respectively, when their cDNAs were analyzed. The functions of these variant-specific ORFs are not clear. It is possible that some ORFs may not encode functional proteins at all. However, it is conceivable that the others are associated with biological and pathogenic differences between variants A and B.
ORF identity between HST and U1102 strains of HHV-6
Comparisons of ORFs between HHV-6 variants.Sequence alignments of ORFs in the UR between the HST and U1102 strains indicated that the amino acid identity between these viruses was about 94% on average, except for the variant-specific ORFs. Putative functions of the various HHV-6 genes implicated by their corresponding homologs are listed in Table 1, which shows that 67 of the 103 ORFs compared exhibited >90% identity between these strains. As shown in Table 1, ORFs with 80 to 90% identity were DR1, DR3, DR6, U8, U11, U12, U14, U15, U18, U21, U23, U65, U71, U80, and U83. Of these, U8 and U18 were IE/regulatory genes, U80 was a replication gene, U12, U21, and U23 encoded glycoproteins, U11 encoded a viral protein, and U83 encoded a chemokine (76). ORFs with 70 to 80% identity were U54, U79, U91, U97, U98, U99, and U100. U79 was a replication gene, U97, U98, U99, and U100 encoded glycoproteins, and U54 was a viral protein gene. ORFs with <70% identity were LT1, DR2, DR7, DR8, LJ1, RJ1, U1, U47, U86, U89, U90, and U95. It is possible that high divergence in amino acid sequence leads to biological and pathogenic differences between variants A and B. Some of these 12 divergent genes could be directly involved in such functional differences. The ORFs with different starting or termination sites were classified as having different sizes if the difference was more than five deduced amino acid residues. The ORFs with different starting sites were DR2, DR7, DR8, LJ1, U1, U2, U3, U7, U8, U21, U23, U47, U55, U83, U98, and U100. The ORFs with different termination sites were DR7, DR8, LJ1, U1, U10, U20, and U47. The ORFs containing insertions were DR2, DR3, DR7, U11, U47, U86, U89, and U95. Finally, in the case of the chemokine receptor (U12) and gp82/105 (U97, HN3, U98, U99, and U100), the differences were due to differential splicing.
(i) IE/regulatory genes.IE proteins control the temporal cascade of gene expression and are a second line for determining the cellular specificity of infection. Based on in vitro chloramphenicol acetyltransferase or luciferase assays, at least eight genes that might serve as transcriptional activators have been defined: DR7, U3, U16/U17, U18/U19, U27, U86, U89, and U94 (13, 34, 42, 46, 66, 67, 75). There are multiple US22 genes in HCMV, HHV-6, and HHV-7 (5), and this feature appears to be a characteristic of the betaherpesvirus subgroup (7, 16). The translation products of these genes in strain HST differed greatly in length (89 to 1,197 amino acid residues), but they each contained at least one of four US22 amino acid sequence motifs (7, 16). This US22 gene family of HCMV is transcribed with IE kinetics and possesses transactivating functions (57). Strain HST contained 11 ORFs of the US22 gene family: DR1, DR2, DR6, DR7, U2, U3, U7, U8, U16, U25, and U95 (Table 1 and Fig. 2). cDNA analysis showed that HN1 and U90 corresponded to exons of both the IE1 and IE2 genes, which also contained U89 and U86, respectively (18). Amino acid sequence comparisons also suggested that other regions, including U42 and U54, could be involved in transcriptional activation. U42 is a member of the only family of genes involved in transactivation that is conserved among all of the herpesviruses. An HCMV homolog, UL69, has been shown to transactivate gene expression in vitro (71), but studies of the HSV-1 homolog showed that it also has roles in transcriptional termination and in snRNP distribution (37, 51). U54 has similarity to the duplicated genes UL82 and UL83 of HCMV, and UL82 has been shown to be a transcriptional activator and a component of the tegument (29). It is noteworthy that the adeno-associated virus 2 rep gene homolog identified in HHV-6, capable of specifying transregulatory and adeno-associated virus 2 DNA replication functions, was absent in other herpesviruses (67, 68). Among the IE/regulatory genes (DR1, DR2, DR6, DR7, U2, U3, U7, U8, U16, U17, U18, U19, U25, U27, U42, U54, U86/U90, U89/U90, U94, and U95), amino acid sequence comparisons revealed the following amino acid sequence identities between strains HST and U1102: U2, U3, U7, U16, U17, U19, U25, U27, U42, and U94, >90%; DR1, DR6, U8, and U18, 80 to 90%; U54, 79.5%; and DR2, DR7, U86, U89, U90, and U95, <70%.
(ii) Replication genes.Each herpesvirus that has been sequenced contains a set of conserved replication genes that are essential for viral DNA replication during productive infection. The core replication genes in HHV-6 were those encoding DNA polymerase (U38), DNA polymerase processivity factor (U27), a major DNA binding protein (a single-stranded DNA binding protein; U41), and components of the heterotrimeric helicase/primase complex (U43, U74, and U77). Other genes necessary for DNA replication were specific to herpesvirus subgroups and/or particular viruses. The alphaherpesvirus origin binding protein homolog encoded by HHV-6 (U73) presumably is also functionally analogous (17, 25). The UL84, UL112, and UL113 genes of HCMV likely encode proteins essential for replication functions, such as those enhancing the expression of the replication genes (21). HHV-6 encoded U55, U79, and U80, which were homologous to UL84, UL112, and UL113, respectively (16). Comparison of the deduced amino acid sequences of the replication genes (U27, U38, U41, U43, U55, U73, U74, U77, U79, and U80) and revealed that U27, U38, U41, U43, U55, U73, U74, and U77 showed >90% identity between strains HST and U1102 and that U79 and U80 showed 77.4 and 81.0% identities, respectively. cDNA analysis showed that U80 was an exon of U79, and five different cDNAs were expressed from this region in infected cells, due to alternative splicing (65). All replication genes except U79 and U80 were highly conserved between the two variants. These results suggest that replication genes may not contribute to the biological differences between these two variants.
(iii) Nucleotide metabolism and DNA repair.We identified HHV-6 homologs of herpesvirus gene products that are involved indirectly in DNA replication by acting as nucleotide substrates or through DNA repair. These gene products included ribonucleotide reductase (U28), dUTPase (U45), phosphotransferase (U69), alkaline exonuclease (U70), and uracil-DNA glycosylase (U81). The ribonucleotide reductase homolog of HHV-6 was encoded by a single ORF specifying a single protein (7, 15, 44), as is the case in the other betaherpesviruses (HHV-7 and HCMV), while the ribonucleotide reductase homolog of alphaherpesvirus, gammaherpesvirus, and cells is formed with subunits. HHV-6 was sensitive to ganciclovir (6, 41), though we do not know which gene(s) is associated with the phosphorylation of ganciclovir. The phosphotransferase gene product of HCMV (UL97), which is a homolog of HHV-6 U69, can phosphorylate nucleotides and the antiviral nucleoside analogue ganciclovir (28, 58). The HHV-6 dUTPase and uracil-DNA glycosylase homologs presumably specify enzymatic activities involved in the excision of uridine residues from DNA, like the U81 gene product, which possesses uracil-DNA glycosylase activity when expressed in E. coli (54). Among HHV-6 ORFs, all of the genes involving nucleotide metabolism and DNA repair (U28, U45, U69, U70, and U81) were strictly conserved, exhibiting more than 94% identity between strains HST and U1102. These results suggest that nucleotide metabolism and DNA replication genes may not be involved in the functional divergence between variants A and B.
(iv) Glycoproteins.In all sequenced herpesviruses, there are conserved glycoprotein genes encoding glycoprotein B (gB), gH, gM, and gL, all of which were also conserved in HHV-6 (U39, U48, U72, and U82, respectively). gB and gH are structurally highly conserved between herpesviruses, are membrane bound, and appear to play roles in virus-cell fusion and cellular spread of virus infection. Both gB and gH of HHV-7 are encoded by late genes (56), whereas gB and gH of HHV-6 are encoded by IE and early-late genes, respectively (19, 39). However, the actual translation of the gB gene of HHV-6B occurs at the late phase (63). These results suggest that the gB gene of HHV-6 is regulated posttranscriptionally. gL physically associates with the gH precursor protein and may be required for gH transport and/or processing (30, 31). gM is fairly conserved between herpesviruses and was shown to be a component of the virus particle in HSV-1 and equine herpesvirus type 1 (4, 52).
HHV-6 contained two genes (U12 and U51) encoding G-protein-coupled receptor homologs. U12 functionally encodes a calcium-mobilizing receptor for the β-chemokines RANTES, MIP-1α and -1β, and MCP-1 but not for the α-chemokine interleukin-8 (19). HHV-6 U51 was a positional and structural homolog of the HHV-7 and HCMV UL78, showing relatively high sequence similarity to the G-protein-coupled receptor of herpesvirus saimiri (15, 45). This similarity could be an indication that they were derived from the same gene within a progenitor herpesvirus genome. Interestingly, the HHV-6 U51 gene product showed the closest sequence similarity to a cellular opioid receptor.
A highly spliced gene at the right end of the UR of the genome has been characterized by analyzing a cDNA resulting from multiple splicing, and it encodes the glycoprotein gp82/105 (50). In the case of strain GS, this gene contains 12 exons, and the single long ORF begins within exon 3 and ends in exon 12. In contrast, in strain HST, this gene contained eight or nine exons, and the single long ORF began within exon 2 and ended in exon 8 (61). The single long ORF encoding gp82/105 in strain GS is longer than in strain HST. The differential splicing may play a role in the generation of various mRNA species that encode some protein components in the gp82/105 complex of the two variants. Furthermore, the amino acid sequence of each exon was divergent between variants A and B (Table 1), and some monoclonal antibodies for gp82/105 are variant A-specific neutralizing antibodies (49). Taken together, these findings suggest that the high degree of divergence in gp82/105 may lead to differences in the cell tropism between variants A and B.
HHV-6 ORFs U20, U21, U23, U24, and U85 were predicted to encode glycoproteins. U20 and U85 encode homologs of the immunoglobulin E (IgE) C chain and OX-2 membrane antigen, both of which are members of the Ig superfamily (10, 16). U24 of strain HST lacked an N-terminal signal sequence and may represent an exon (15).
Genes encoding glycoproteins were U12, U20, U21, U22, U23, U39, U48, U51, U72, U82, U85, U97, HN2, U98, U99, and U100. Among these, the ORFs with >90% identity were U20, U22, U39, U48, U51, U72, U82, and U85; ORFs with 80 to 90% identity were U12, U21, and U23; and those with 70 to 80% identity were U97, U98, U99, and U100. Of these glycoproteins, gp82/105 exhibited the most divergence.
(v) Capsid, tegument, and virus assembly proteins.Several ORFs of HHV-6 were homologous to herpesvirus genes encoding characterized or candidate structural proteins. Table 1 shows that these include homologs of the major capsid protein (U57), minor capsid protein (U29), large tegument protein (U31), and virion proteins (U33, U34, U36, U50, U56, and U76) (7, 15, 36, 44). Betaherpesvirus-specific and conserved structural proteins are encoded by HHV-6 U11 (major antigenic phosphoprotein, pp100) (47, 48) and U54 (tegument transactivator), which are homologs of HCMV UL32 (antigenic phosphoprotein, pp150) and UL82/U83 (tegument transactivator, pp65/72K), respectively. The gene products involved in DNA packaging and capsid assembly corresponded to HHV-6 U29, U30, U53, and U60/U66. The U53 gene encoded the protease/assembly protein (assemblin) and the scaffolding protein. Assemblin and scaffolding protein are derived from proteolytic cleavage and internal initiation (15, 59, 69).
The genes U11, U29, U30, U31, U33, U34, U36, U49, U50, U53, U54, U56, U57, U60/U66, and U76 encoded viral structural proteins, and all except U11 and U54 showed >90% identity between strains HST and U1102. In contrast, U11, a viral protein, and U54, a tegument transactivator, showed rather low identity (80.1 and 79.5%, respectively). These results suggest that the replication genes, except those for the U11 viral protein and the U54 tegument transactivator, may not be involved in the functional differences between variants A and B.
Major repetitive elements.It has been observed that a remarkable divergence in the nucleotide sequence between strains JI and RK is present only in the DR (38, 44). Similarly, between strains HST and U1102, marked differences were observed only in specific regions of the DR and the junction between the UR and DR. In the DRs of both HST and U1102, the basic element of regions T1 and T2 (TAACCC) was related to human telomeric sequences (Fig. 2). These reiterations consisted of arrays of the basic element interspersed with 11 types of human telomere-related elements, all hexonucleotides. In the case of strain HST, T1 was the longer reiteration, consisting of 54 copies of the telomeric element: 26 copies of the basic telomeric element and 28 copies of the human telomere-related elements. In contrast, T2 consisted of 26 copies of the basic telomeric element alone. In the case of strain U1102, T1 consisted of 54 copies of the telomeric element: 14 copies of the basic telomeric element and 40 copies of the related telomeric elements, most of which were also hexonucleotides but some of which were larger. Furthermore, T2 of U1102 consisted of 59 copies of the basic element alone. Thus, T2 of HST was half the size of that of U1102, and T1 of HST contained more conserved telomeric elements than that of U1102.
Repeat region R1 contains 52 copies of a dodecameric repeat element, consisting of its basic element (GAGGCCCTGCTG) and variants. However, it was difficult to determine the copy number of the dodecameric repeat element in R1 of U1102, because the reiterations in R1 consist of remarkably variant elements. R1 was located in U86, the putative IE2 locus of HHV-6. R2 consisted of two types of repeats, a 79- to 80-bp-long tandem repeat and a G/T repeat. The tandem repeat region in R2 of strain HST contained five copies of a 79- to 80-bp repeat element, while that of U1102 contained two copies of imperfect repeats. R3 (Kpn repeat region) contained 24 copies of tandemly repeated 104- to 107-bp-long elements and 10 recognition sites for KpnI in strain HST (24), while R3 of U1102 contained 28 copies of tandemly repeated 104- to 107-bp-long elements, all of which contained recognition sites for KpnI. Because of these differences, the region corresponding to the U92- and U93-homologous ORFs in strain HST was divided by six small ORFs and a small part of HN2. On the whole, the repeat elements of strain HST were more highly conserved than those of strain U1102.
Genetic divergence between HHV-6 variants.The marked divergence of variants A and B permitted the design of variant-specific oligonucleotide primers to detect DNA fragments that differed in size between variants A and B HHV-6 DNA using PCR. To analyze the genetic polymorphism of HHV-6 isolates, we examined 14 different strains belonging to HHV-6 and one HHV-7 strain, KHR (Fig.3K). In this experiment, viral DNAs were extracted from human umbilical CBMCs that were infected with each virus using the protease K treatment method. PCR was performed as described elsewhere (19), using the appropriate pairs of primers for U95, U86, U47, U11, and DR2, as shown in Fig. 3. No positive PCR products were detected with uninfected cells or cells infected with other human herpesviruses, including HSV-1 and -2, varicella-zoster virus, HCMV, and Epstein-Barr virus (data not shown). In the case of HHV-7 (Fig. 3B, D, F, H, and J, lanes 15), however, among the genes tested, a positive PCR product was detected only for the U47 gene, which was larger than those of HHV-6 (Fig. 3F). HHV-6 DNA was detected in all 14 isolates by this method, and PCR products were clearly separated into two groups in the case of the U95, U86, U47, and U11 genes: variants B and A showed fragments of 342, 311, 362, and 168 bp and of 264, 209, 209, and 204 bp, respectively (Fig. 3K; also see Fig.3B, D, F, and H). After amplification of the DR2 genes, however, PCR products were clearly separated into three groups: 361 bp (variant B), 175 bp (U1102), and approximately 265 bp (GS and DA) (Fig. 3J). These differences observed in variant A may be associated with the characteristic divergence of variant A viruses.
PCR typing of HHV-6. (A, C, E, G, and I) Schematic primary structures of ORFs U95, U86, U47, U11, and DR2 of strains HST and U1102. Positions and directions of the PCR primers are indicated by arrowheads at vertical lines. (B, D, F, H, and J) PCR amplification of DNAs from isolates 1 to 15 (K) of HHV-6 and HHV-7. Templates were purified from peripheral blood mononuclear cells infected with several laboratory strains and clinical isolates and were amplified with PCR primers (L). The products were fractionated by electrophoresis on 6% polyacrylamide gels and stained with ethidium bromide. The PCR products are indicated by arrowheads in panels B, D, F, H, and J. (K) Viruses used in the experiment. CFS, chronic fatigue syndrome.
DISCUSSION
We have determined the complete DNA sequence of HHV-6 variant B strain HST, the causative agent of ES, and identified 115 potential ORFs within the 161,573-bp contiguous sequence of the entire HHV-6 genome. When the sequence was compared with that of the variant A strain U1102, some genes with remarkable differences in amino acid identity were identified.
We conclude that the LT1, DR3, DR8, LJ1, RJ1, and DR3′ ORFs probably do not encode proteins because LT1 and DR3 did not contain poly(A) signals, the ORFs of DR8, LJ1, and RJ1 contained telomeric sequences, and DR3′ contained a telomeric sequence between the ORF and the poly(A) signal. As variant A-specific ORFs DR4, DR5, U78, U88, U92, and U93 were small and did not exhibit homology with HHV-6B, HHV-7, other herpesvirus, and/or cellular genes, it is not likely that these genes encode functional proteins either. In contrast, HN1 and HN3 were variant B-specific genes and corresponded to exons of the IE1/IE2 and gp82/105 genes, respectively. We do not know whether HN2, DRHN1, and DRHN2 have homologous cellular genes or encode functional proteins.
All of the ORFs with <70% identity between the two variants were found to include deleted regions when the above-mentioned ORFs were excluded from the comparison (Fig. 3 and reference73). The remarkable differences except for U47 accumulated in some of the IE/regulatory genes (DR2, DR7, U86/90, U89/90, and U95) and may lead to characteristic differences between variants A and B, although point mutations that we have not detected could be associated with these differences. The variation in IE proteins may be relevant to variant divergence because IE proteins control the temporal cascade of gene expression and are a second line in determining the cellular specificity of infection. Chou and Marousek (9) reported that variant B isolates segregate into two groups with even less divergence between them (92.6% identity). U89 derived from clinical isolates and strain Z29, which belongs to group 1, show more than 99.4% amino acid identity. In contrast, U89 of strain HST had 92.3% amino acid identity compared with strain Z29 (Table 1). Clinical isolates, which belong to group 2, have more than 99.4% amino acid identity compared to strain HST. These results suggest that strain HST belongs to group 2, although we do not know what function this divergence is associated with. In the case of U47, we cannot predict its function, but homologous genes exist in HHV-7 and HCMV (U47 and UL74, respectively).
HHV-6 genes transcribed under IE conditions, that is, in the absence of prior protein synthesis, are U16/U17, U39, U42, U81, U89/U90, U91, and U94 ORFs (39). As some of these transcripts are not translated under IE conditions, they may be regulated in the posttranscriptional phase. If this is the case, some regulatory gene(s) likely plays an important role, and this mechanism would be quite different from the known transcriptional regulation of the herpesviruses.
In all sequenced herpesviruses, conserved glycoprotein genes encode gB, gH, gM, and gL, and the existence of neutralizing antibodies for gB and gH suggests that gB and gH/gL are essential for infection (62, 64). HSV-1 encodes an additional glycoprotein, gD, which recognizes cellular receptors (HveA, HveB, and HveC) (14, 40, 70), while HHV-6 encodes a different additional glycoprotein, gp82/105, which is recognized by neutralizing antibodies (49). This finding suggests that gp82/105 may recognize a cellular receptor(s). In addition, the gp82/105 glycoprotein of HHV-6 showed the highest divergence in amino acid sequence between the two variants. It is possible that gp82/105 leads to the difference of cell tropism between variants A and B.
U83 of strain HST encodes a chemokine which functions as a chemoattractant for monocytes (76). Similarly, U83 of strain U1102 encodes a chemokine homolog, but it does not contain a signal peptide sequence. Thus, U83 of strain U1102 may not be secreted and thus cannot exert its chemotactic activity. If U83 functions as a chemoattractant in vivo, it is possible that HHV-6 can easily spread through cell-to-cell infection, even in the presence of neutral antibodies. Although we do not know whether the U83 gene product of HHV-6A contains a signal peptide, if it does not, we predict that variant B viruses could easily spread in blood, whereas variant A viruses should not.
In analyzing the divergence in the functional proteins of HHV-6A and HHV-6B, we must consider that splicing would affect the gene layout and the level of relatedness between the HHV-6A and HHV-6B proteins. We have already obtained the splicing patterns of some mRNAs. These patterns are not as simple as predicted by Megaw et al. (38). For example, in the case of U79 and U80, there are a few more donor and acceptor splice sites than predicted by Megaw et al., and at least five kinds of mRNAs encoded by them can be generated by alternative splicing (65). In the case of IE1 and IE2, U90 is linked to U89 or U86 by alternative splicing. In the case of gp82/105, although there are donor and acceptor motifs, some signals on the mRNA appear not to be functional (50, 61). We currently have no information about splicing in HHV-6A, except in the case of gp82/105. Further studies of splicing patterns showing functional proteins may provide new insights into biological divergence between HHV-6A and HHV-6B and molecular analysis of the proteins.
Finally, we would like to point out that the divergence we have observed with IE/regulatory genes, gp82/105, and U83 between variants A and B may lead to biological and pathogenic differences between the two variants. Further functional studies of the genes showing such remarkable divergence between variants may provide new insights into mechanisms for the molecular pathogenesis and cell tropism of HHV-6 infection.
ACKNOWLEDGMENTS
We are grateful to P. L. Ward for critical reading of and valuable comments on the manuscript.
This study was partly supported by a grant-in-aid for general scientific research from the Ministry of Education, Science and Culture of Japan.
FOOTNOTES
- Received 19 April 1999.
- Accepted 14 June 1999.
- Copyright © 1999 American Society for Microbiology