| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Previous Article | Next Article ![]()
Journal of Virology, December 2007, p. 13218-13229, Vol. 81, No. 23
0022-538X/07/$08.00+0 doi:10.1128/JVI.00846-07
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
University Hospital Freiburg, Internal Medicine 2/Molecular Biology, Hugstetter Str. 55, D-79106 Freiburg, Germany,1 European Molecular Biology Laboratory, Meyerhofstr. 1, D-69117 Heidelberg, Germany2
Received 20 April 2007/ Accepted 11 September 2007
| ABSTRACT |
|---|
|
|
|---|
-helices but is decorated at its tip with an extra element that contains at least one helix and that is made up only in part by the previously predicted insertion sequence. All subelements are interconnected, such that structural changes at one site are transmitted to others, resulting in an unexpected variability of particle morphologies. Key features of the model are independently supported by the accompanying epitope mapping study. These data should be valuable for functional studies on the impact of core protein structure on virus replication, and some of the mutant proteins may be particularly suitable for higher-resolution structural investigations. | INTRODUCTION |
|---|
|
|
|---|
However, although the DHBV genome is even smaller than that of HBV (3.0 kb versus 3.2 kb), its core protein (DHBc) is substantially larger (262 versus 183 or 185 amino acids) than that of HBV (HBc). Both core proteins are the sole building blocks for the viral capsid shell. The capsids are actively involved in reverse transcription (21, 33, 55) and genome trafficking (23); are the substrate for various phosphorylation and dephosphorylation events (1, 17, 25, 32, 37, 57); and provide interaction sites, regulated by the maturation state of the packaged genome (47), for envelopment by the surface proteins (9). Evidently, the short HBc sequence fully supports these multiple functions; hence, the biological reasons behind the larger size of the avihepadnavirus core proteins are enigmatic. Knowledge of the DHBc structure would be crucial to understand this unresolved issue, and it might help to exploit the experimental advantages of DHBV (43) for tackling the structural dynamics of the hepadnaviral nucleocapsid. Presently, however, such information is scarce.
In contrast, the structure of the HBc protein and of assembled HBV capsids is known in detail from biochemical (4, 26, 27, 36) and biophysical (46) investigations of recombinant HBV capsid-like particles (CLPs). The first about 140 amino acids (aa) constitute the assembly domain (4, 53); this is followed by a 9-aa morphogenic linker (53) that affects the distribution between a larger (triangulation number T = 4) and a smaller (T = 3) class of particles. The C-terminal domain (CTD) downstream of position 149 contains clusters of R residues that bind nucleic acid. Most of the CTD is required for pregenomic RNA encapsidation and reverse transcription (25, 30, 34); similarly, the RNA content of recombinant CLPs containing at least part of the CTD is much higher than if the CTD is deleted (4, 36). The T = 4 particles consist of 120 HBc dimers, and the T = 3 particles consist of 90 HBc dimers (14, 24). The HBc assembly domain (Fig. 1A) contains five
-helices (6, 12, 54), of which
3 and
4, composed of
4a and
4b, form a hairpin, which at its tip harbors the immunodominant c/e1 B-cell epitope (3, 11, 13). Association of two such hairpins into a four-helix-bundle, protruding as a spike from the capsid surface, provides for most intradimer contacts, with the N termini wrapping around the base of the spike. The interdimer contacts are mainly provided by the "hand region" (6) consisting of
5 (residues 112 to 127) onto which downstream residues to about position 140 fold back. Although the individual interdimer contacts are weak (58), the intact particles are so stable that even complete foreign proteins can be inserted into the c/e1 epitope (28, 35, 44); this is achieved by an inherent flexibility within the subunits, as well as in their arrangement on the icosahedral lattice (5, 7). Such structural plasticity may be crucial for the active role of the capsid in reverse transcription, although only subtle differences between HBV CLPs and genome-containing nucleocapsids were detected in a recent cryo-electron microscopic (cryo-EM) study (40).
|
2a, and a second one encompassing HBc residues 96 to 143, i.e., helix
4b, the kink at G111, and the hand region; the homologous segment in DHBc encompasses aa 142 to 189. Second, the DHBc sequence from about position 200 to the C terminus, although nearly twice as long as the HBc CTD, also contains many basic residues. Deletions downstream of about position 225, caused, in DHBV-transfected cells, similar replication defects (42, 56), as did CTD truncations in HBc (25, 30, 33), and recombinant DHBc truncated after positions 229 or 237 still formed particles (55). This suggests that DHBc has a CTD that is dispensable for assembly, but its borders, as well as the existence of a morphogenic linker as in HBc, are not known. A low-resolution cryo-EM reconstruction of DHBc CLPs showed T = 4 and T = 3 particles with a HBc-like clustered-dimer architecture (24) with, however, laterally wider surface spikes, suggesting that they comprise more residues than in HBc. According to an earlier alignment (8), DHBc contains an "insertion sequence" comprising residues P86 to K130. The alignment routine used in the PFAM database also predicts such an insertion (Fig. 1B), although with somewhat shifted borders. Finally, secondary structure algorithms such as PHD (41) and PSIPRED (22) predict several
-helices in DHBc, some of which could correspond to those in HBc, most significantly in the second highly conserved region (Fig. 1B). Accordingly, DHBc residues 142 to 189 might adopt a structure similar to that of HBc helices
4b and
5 kinked at G111 (43). A recent PepScan (18) analysis identified six antigenic regions (AR1 to AR6) in DHBc that were recognized by sera from DHBV-infected and, in part, from DHBc-immunized ducks (49); of these, AR2 (aa 64 to 84), AR3 (aa 99 to 112), and AR5 (aa 183 to 210) were proposed to be surface exposed. However, the structural state of the antigen as encountered by the ducks' immune system is unclear, and isolated peptides may or may not mimic the authentic protein structure. Directly testing surface exposure was not possible due to the polyclonality of the antisera.
In the present study we used an extensive mutagenesis approach to identify primary sequence constraints for the ability of DHBc to assemble into particles. We generated a large panel of DHBc mutants, including C-terminal and internal deletions variants plus a transposon-derived library of variants containing 5 aa insertions throughout the protein's primary sequence. Exploiting an efficient Escherichia coli expression system, we determined their assembly properties by velocity sedimentation, native agarose gel electrophoresis, and negative-staining EM. This enabled us to define the domain structure of DHBc, to demonstrate the existence of a morphogenic linker region that is much more extended than in HBc and, eventually, to combine the data into a plausible model for the DHBc fold. Although structural evidence obtained via mutagenesis may still be considered indirect, key topological features predicted by the model were independently verified by the accompanying epitope mapping study.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Plasmid constructs.
The expression vector for DHBc, pET28a2-DHBc, was generated by replacing the HBc gene in plasmid pET28a2-HBc (52) by the DHBc gene from the DHBV16 genome (31) in plasmid pCD-16 (38); pET28a2-HHBc was obtained similarly using a plasmid encoding the genome of HHBV4 (45). A C-terminal His7 tag was added, via PCR, to yield plasmid pET28a2-DHBc_H7. C terminally truncated DHBc variants and the point mutant DHBc_R124E with residue R124 replaced by glutamic acid were also generated via PCR; of the truncated variants, named DHBcn, with "n" indicating the position of the last authentic DHBc amino acid, DHBc230 contained a C-terminal unrelated peptide of the sequence YKGEPLKA, and DHBc195 contained a single nonauthentic L residue. Internal deletion variants were obtained by cutting plasmid pET28a2-DHBc at the unique EcoRI site overlapping the codons for R124, I125, and H126 (AGA ATT CAT; the EcoRI site is in italics), limited Bal31 nuclease digestion, and subsequent digestion with AlwNI in the vector part. Fragments of the appropriate size range were ligated either with the 5'-terminal or the 3'-terminal AlwNI-EcoRI fragment of the unmodified pET28a2-DHBc plasmid in which the EcoRI overhang had been blunted. This yielded a collection of plasmids lacking DHBc sequence upstream, or downstream of the I125 codon; in one clone, termed (I125)2, this codon was fortuitously duplicated. The sequences of the proteins analyzed are indicated by "
" followed by the positions of the deleted residues; for instance,
121-124 lacks DHBc aa 121 to 124.
Transposon linker scanning mutagenesis. Transposon mutagenesis was performed with plasmid pCD16 as a template using the GPS-LS linker scanning system as recommended by the vendor (New England Biolabs). Transformants were selected on kanamycin containing agar, and plasmid DNA was isolated from the pooled colonies. Next, the DHBc gene was amplified by PCR; the inserted transposon increases its size from 0.8 kb to about 2.5 kb. The 2.5-kp products were cut with either NcoI (overlapping the DHBc start codon) plus EcoRI (overlapping codons 124 to 126), or EcoRI plus AvrII (overlapping codons 260 and 261), or NcoI plus AvrII. The corresponding restriction fragments were cloned into the appropriately cut pET28a2-DHBc_H7 vector. DNAs from randomly picked colonies showed restriction patterns indicative of random insertions. The body of the transposon sequence was removed by PmeI digestion and religation of plasmid DNA from about 2,000 pooled colonies, yielding about 1,200 colonies after religation. The positions of the insertions sites were determined by DNA sequencing. Individual constructs with in-frame insertions, and their encoded proteins, are designated by the prefix "i" followed by the position of the insertion site. C-terminal deletion variants, arising from about one-third of the integration events that introduce a premature translational stop, are correspondingly designated by the prefix "st." Inherently, these stop codons are preceded by a transposon-encoded V residue.
Recombinant expression of DHBc proteins. The pET28a2 plasmids were transformed into E. coli BL21 Codonplus cells (Stratagene), and protein expression and purification were performed essentially as described for HBc (52). In brief, for expression screening 2-ml cultures of the transformed bacteria were grown at 25°C in the presence of 100 µM IPTG (isopropyl-ß-D-thiogalactopyranoside) for about 5 h, and then the pelleted cells were boiled in sodium dodecyl sulfate (SDS) sample buffer (100 µl per ml of culture). Aliquots (5 µl) of these SDS lysates were analyzed by SDS-polyacrylamide gel electrophoresis (PAGE) and Coomassie blue staining. For large-scale preparations (200-ml cultures, induced at 25°C for 12 to 16 h), 5 ml of cleared lysates (52) was subjected to sedimentation in 38-ml sucrose gradients (in steps of 10, 20, 30, 40, 50, and 60% sucrose (wt/vol) in TN300 buffer (50 mM Tris-HCl, 300 mM NaCl [pH 7.5]) in an SW28 rotor run for 4 h at 20°C and 28,000 rpm. For small-scale preparations (20-ml cultures), 200 µl of 1 ml of cleared lysate was sedimented through 1.4-ml 10 to 60% sucrose gradients in a TST-55 rotor (45 min at 20°C; 55,000 rpm). Gradients were harvested in 14 fractions from the top, and the distribution of the recombinant proteins was determined by SDS-PAGE analysis of 5-µl aliquots from each gradient fraction followed by Coomassie blue staining.
Native agarose gel electrophoresis. Electrophoresis was performed as previously described (51, 52) in 1% agarose gels containing 0.5 µg of ethidium bromide/ml to visualize encapsidated RNA. Protein was subsequently detected by staining with Coomassie blue.
EM. Aliquots (2 µl) from the relevant gradient fractions were applied to glow-discharged carbon-coated EM grids, incubated for 2 min to allow specimen adsorption, washed with water, blotted, and stained with 2% uranyl acetate for 3 min. Excess liquid was removed by blotting, and the grids were air dried. Micrographs were recorded on a Morgagni 268 instrument and run at an acceleration voltage of 100 kV at nominal magnifications of 71,000- or 140,000-fold, respectively. Approximate particle size distributions were manually determined by measuring the diameters of between 100 and 150 well-formed particles from the topmost three gradient fractions containing the bulk of the corresponding protein. For class 4 mutants, characterized by the formation of two distinct peaks in the gradients, both peak fractions were analyzed accordingly.
| RESULTS |
|---|
|
|
|---|
|
DHBc contains an extended morphogenic linker region that encompasses part of the basic CTD. To correlate the different sedimentation profiles of the truncated DHBc variants with potential differences in particle morphology, we next used the electron micrographs to determine the approximate diameters of between 100 and 150 individual particles per construct. The accuracy of these measurements is limited by the staining procedure, potential flattening of particles as they stick to the grid, by the different particle orientations that sometimes displayed prominent surface spikes and sometimes did not, and by the number of particles analyzed. However, wt-DHBc clearly produced two main classes of particles with diameters of about 37 and 32 nm, plus some smaller particles (ca. 10%) about 28 nm in diameter; representative examples are shown in Fig. 2C. DHBc230 produced wt-like 37- and 32-nm particles, but an increased fraction of smaller particles with mean diameters of about 26 to 28 nm and 22 to 24 nm were observed. DHBc226 and DHBc218 contained a further increased proportion of particles in the 22- to 28-nm range, plus even smaller particles with diameters of about 17 to 19 nm. DHBc195, finally, essentially lacked 37-nm particles; besides a few 32-nm particles, the majority had diameters between 22 and 28 nm, and particles <20 nm in diameter were also abundant. Despite the limitations of the negative-staining method, particles with distinctly different diameters were frequently seen side by side on one micrograph, as is particularly evident on the enlarged view shown for DHBc226 (Fig. 2A, rightmost panel). Together with the distinct sedimentation profiles, these data strongly suggest that DHBc can assemble into a larger range of differently sized particles than HBc and that a much larger sequence in DHBc (residues 195 to 226) than the morphogenic linker peptide in HBc (residues 141 to 149) affects particle morphology.
Linker scanning precisely maps the C-terminal border of the DHBc assembly domain and identifies internal regions important for folding and assembly. Transposon mutagenesis provides a means to randomly integrate short peptide sequences into a target gene. The system used here produces eventually a 15-nucleotide (nt) insertion; 10 nt are derived from the transposon and contain the recognition sequence of the restriction enzyme PmeI preceded by an A and followed by a T (aGTTTAAACt), or vice versa (tGTTTAAACa), depending on the insert orientation; additional five nt are duplicated from the insertion site. Two out of six possible insertion events introduce a premature translational stop (GTT TAA; the stop codon is underlined); the other four create 5 aa insertions containing a limited set of different amino acids. Individual mutants were derived from three plasmid pools containing the insertions in the N-terminal half or C-terminal half of DHBc or throughout the entire sequence (see Materials and Methods for details). Constructs containing an insertion in the full-length DHBc context are designated by the prefix "i" plus the amino acid position of the insertion, constructs with a premature stop codon by the prefix "st" plus the position after which the stop codon was introduced.
Of about 150 clones sequenced more than 90% showed insertions at differing positions and in different orientation. Few sequences occurred twice, and very few occurred thrice. About 100 of the plasmids were transformed into the E. coli expression strain, and aliquots from 2-ml induction cultures were analyzed by SDS-PAGE (results not shown). A major fraction of plasmids led to the expression of proteins that comigrated with wt-DHBc or its His-tagged derivative DHBc_H7; another fraction produced distinctly smaller proteins, a finding indicative of premature translational stops.
We first exploited such truncated variants to more precisely map the C-terminal border of the DHBc assembly domain by sedimentation analysis. Five variants with translational stops after position 232 showed a wt-DHBc-like distribution in the gradient, and variants st213 and st210 produced profiles similar to those of DHBc226 and DHBc220 (data not shown). Most informative were variants st203 (not shown), st199, st187, st183, and st167. The first three all produced gradient profiles like variant DHBc195, with a distinct peak in fractions 5 to 7 (Fig. 3); the further truncated variants st183 and st167 did not form any fast-sedimenting material. Hence, the C-terminal border of the DHBc assembly domain is located between aa 183 and 187. Notably, I186 of DHBc is homologous to HBc aa L140 (Fig. 1B), which must be present to allow for HBc assembly (53, 59). Upon gel filtration on Superdex 75, most of the material from variant st183 present in the top gradient fractions eluted in the void volume, indicative of larger aggregates, but a detectable amount eluted at about the same volume as the 44-kDa marker protein ovalbumin (not shown), as expected for a dimer with a calculated mass of 43 kDa. Notably, a pronounced aggregation tendency was also observed for the assembly incompetent HBc1-139 (53).
|
|
|
The presumed insertion sequence is partly unstructured but contains structured elements that are important for folding and multimerization.
Which part of the central DHBc region actually represents the insertion cannot reliably be predicted from alignments because of the low sequence homology between HBc and DHBc upstream of the highly conserved region II (Fig. 1B). The previous experiments showed that the region between positions 80 to about 100 was largely tolerant toward peptide insertions, whereas the following 30-aa segment was not (Fig. 5). To further modify its primary sequence, we used recombinant HHBV core protein (HHBc) as a natural DHBc variant with several amino acid exchanges in this region (Fig. 6A), and we generated a collection of internal DHBc deletion variants upstream or downstream of I125. HHBc formed CLPs with sedimentation properties similar to those of wt-DHBc (Fig. 6E). Two large deletions upstream I125 (variants
86-124 and
82-124) were still competent for multimer formation, however, of the mixed class 4 sedimentation profile. Smaller deletions of 4, 7, and 13 aa (variants
121-124,
118-124, and
112-124) all produced clear class 2 profiles (Fig. 6B); hence, deletion of this predicted unstructured part had no negative impact on particle formation. In contrast, even very small deletions on the C-terminal side of I125 (
126-127,
126-128,
126-130, and
126-136) and the duplication of a single amino acid [I125 in (I125)2], all produced class 4 profiles (Fig. 6C), often with a large proportion of the protein in the bottom fractions, a finding indicative of heavy aggregation. Hence, this region contributes importantly to overall folding. All of the class 2 deletion variants lacked R124, the only charged residue between positions 112 and 129, whereas R124 was present in all mutants with deletions downstream of I125. To test for a potential structural influence of this residue, we finally exchanged it against a negatively charged E residue. The corresponding variant DHBc_R124E was well expressed and generated a clear-cut class 2 phenotype (Fig. 6D).
|
121-124), or the single R124E exchange. Like wt-DHBc CLPs, most HHBc CLPs (>70%) had diameters of around 37 and 32 nm; in addition, particles about 28 nm in diameter plus a possibly distinct 25-nm species, not seen for DHBc, were observed (Fig. 7A). In contrast, all of the class 2 variants analyzed, regardless of the underlying mutations, produced regular CLPs consisting mainly (>70%) of the largest (about 37 nm in diameter) and a smaller fraction of the second largest (about 32 nm) size class. Particles with diameters of less than 30 nm were essentially absent (Fig. 7B). This increased homogeneity was also reflected in native agarose gel electrophoresis, where the class 2 mutants migrated as more distinct bands than wt-DHBc (52a). Thus, the class 2 mutations do cause a distinct structural phenotype characterized by a reduced particle size heterogeneity and by a reduced tendency of the particles to aggregate.
|
| DISCUSSION |
|---|
|
|
|---|
|
Recombinant DHBc CLPs contained comparable amounts of RNA as HBc CLPs (4, 36, 40); the ratio of ethidium bromide versus Coomassie blue staining in native agarose gels did not significantly change by truncation to position 218 (Fig. 2B). Hence, a fraction of the basic CTD residues is sufficient for RNA packaging, as with HBc (4, 21). Whether the single basic CTD residue in DHBc195, R191, is sufficient for RNA encapsidation is not clear; although essentially no ethidium bromide staining was seen, the more diffuse protein band could also indicate instability of the DHBc195 particles. Certainly, however, and in accord with previous cell culture data (42, 56), the DHBc CTD serves as a nucleic acid binding domain.
A marked difference from HBc was the strong impact of C-terminal truncations on particle morphology. In HBc the morphogenic linker encompasses just the nine residues (positions 141 to 149) from the end of the assembly domain to the first R cluster, and its absence versus presence affects the ratio between two distinct particle forms, i.e., T=3, predominating in HBc1-140, and T=4, predominating in HBc1-149 (36, 53, 59). Other forms have not been observed. In contrast, DHBc truncations from positions 230 to 195 led to gradual shifts toward a spectrum of smaller particle sizes as determined by negative-staining EM (Fig. 2C), and this was paralleled by a corresponding upward shift in the sucrose gradients. By analogy to HBc, we hence refer to this part of the DHBc sequence as the morphogenic linker region, although it encompasses part of the CTD (Fig. 8A). Notably, the removal of various basic amino acid residues in the truncated variants could affect the way RNA is packaged and this, in turn, could influence particle morphology.
Model for the fold of DHBc.
The N-terminal about 185 aa of DHBc are functionally equivalent to the HBc assembly domain, and most of the predicted DHBc helices would have direct counterparts in HBc, providing the potential for an HBc-like framework that somehow accommodates the presumed insertion sequence. The most likely candidate segments for actually folding into a HBc-like structure are the conserved regions I and II. As a starting point, we first modeled the structure of a hypothetical chimeric HBc protein in which these two regions (HBc positions L19 to Y38 and positions K96 to L143) were exchanged for the homologous DHBc sequences (l14 to Y37 and R142 to I189). As shown in Fig. 8B, the N-proximal DHBc segment can indeed substitute for part of the arm around the base of the spike, including helix
2, and the second segment can substitute for helices
4b and
5, including the kink around the conserved G111 residue (G157 in DHBc). The addition of appropriate DHBc counterparts to HBc helices
3 and
4a would then generate an HBc-like architecture, with the extra DHBc sequence at a location equivalent to the c/e1 epitope in HBc. This is the basic concept of the model in Fig. 8C onto which the mutational data, shown in linear form in Fig. 5, are superimposed, together with a summary of the epitope mapping data derived as reported in the accompanying study (52a).
For the sake of simplicity, we consider sites where mutations caused class 1 or class 2 phenotypes as tolerant and those causing either insolubility, or a class 3 or 4 phenotype, as sensitive toward tertiary or quaternary structure perturbation. Class 1 and class 2 insertion mutations were located in three distinct segments, i.e., at the very N terminus, between positions 34 and 48, and interspersed between positions 72 and 97; small internal deletions upstream of I125 and the R124E replacement also generated class 2 profiles. In the model, nearly all of these insertions are placed in loops or at the very ends of predicted helices. The only exception is the predicted helix D
1; hence, either this segment is not helical or perturbing its structure is innocuous. Both interpretations are compatible with an HBc-like arrangement wherein the very N-terminal residues can be deleted or replaced without affecting particle formation (39, 50). Furthermore, the first 15 aa of HBc were identified as one of two regions ("domain I") where various mutations did not negatively affect expression in E. coli; about one-half of the mutants remained assembly competent (27). In the second such domain (residues F24 to P50) various mutations between amino acids L31 and E46 allowed particle formation, again in accord with our data for the DHBc segment from aa 34 to 48. Lastly, few mutations between HBc positions 15 and 30 allowed stable expression, and all prevented assembly; congruently, DHBc was sensitive to insertions at positions 14, 22, and 28. This strikingly similar pattern strongly supports that the first about 50 aa of DHBc adopt a structure similar to that in HBc.
The next DHBc segment from about positions 50 to 76 should contain a long helix homologous to HBc
3; indeed, after a class 2 insertion site at position 48, class 4 mutations occurred at positions 53, 54, 65, and 76, with a single class 2 mutation at position 72. This indicates that the predicted helix D
3 does exist but, possibly, is not contiguous to the very end. Alternatively, the part around position 72 may not be structurally crucial. Apart from the small helix Dins
1, the following segment down to position 103 is predicted not to have a defined structure, and indeed it harbored three class 2 mutations. Thereafter followed, densely clustered, highly sensitive sites until the end of the assembly domain. Such a crucial structural role is fully compatible with DHBc residues 137 and 185 being structurally homologous to HBc helices
4b and
5. Completion of this framework structure requires one more helix as a counterpart to HBc
4a; this missing link must be provided by either Dins
2 or Dins
3 in the presumed insertion sequence.
DHBc insertion sequence.
The start of the second highly conserved region around DHBc position 135, and the experimentally supported existence of helix D
3, confine the presumed insertion sequence to somewhere between positions 77 and 135. However, an exact assignment solely based on primary sequence is not possible. In fact, the alignment of DHBc aa 80 to 90 to HBc
4a, as in Fig. 1B, appears highly unlikely because the entire DHBc segment between aa 77 and 100 bears hallmarks of surface-exposed loops, such as a high frequency of P, and of polar (T) and charged (E) residues. In contrast, the DHBc sequence from 125 to 134 produces very high scores (8 or 9 for all seven central positions) in the PHD
-helix prediction, and it is nearly contiguous with the DHBc equivalent, D
4b, to HBc helix
4b. We therefore strongly favor that helix Dins
3 is structurally equivalent to HBc
4a and consequently should be named D
4a, as in Fig. 8C. A key structural role for D
4a is supported by the strong negative impact of even small sequence modifications downstream, but not upstream, of I125. We therefore propose that the segment between the end of helix D
3 around position 77, and the beginning of D
4a around position 122, constitutes the actual DHBc insertion sequence.
Within this segment two stretches, positions 81 to 97 and positions 112 to 124, tolerated insertions or deletions, in accord with the predicted lack of defined structure. No definite statements are possible for the putative short helix Dins
1. In contrast, the existence of helix Dins
2 between positions 105 and 114 is strongly supported by three class 4 plus an insolubility-causing insertion. This is in line with a recent cell culture study of a fortuitously isolated core protein variant lacking the codon for H107 (20). This protein was unstable and did not form detectable nucleocapsids, a prerequisite for virus replication. Replication was, however, rescued by reintroduction of helix-compatible amino acids but not proline.
Implications for DHBV nucleocapsid assembly and replication.
A similar structural framework in DHBc and HBc implies similar assembly properties. For wild-type DHBc and HHBc, this was largely confirmed by the predominant formation of two major capsid size classes as with HBc. The largest,
37-nm-diameter DHBc particles definitely conform to T=4 symmetry (B. Böttcher and M. Nassal, unpublished data), and the
32-nm-diameter particles are compatible with T=3 symmetry (see below). However, we also found two striking differences, namely, a pronounced tendency of wild-type and various mutant DHBc particles to aggregate and an apparently much wider spectrum of particle sizes. Wild-type DHBc already contained a significant fraction (ca. 10%) of smaller (
28-nm) particles, and wild-type HHBc seemed to produce an additional, possibly distinct, class of 25-nm particles. This size variability was further extended by C-terminal truncations and also occurred in the fast-sedimenting material from class 4 mutants (see Fig. S2 in the supplemental material). Both size variability and aggregation were strongly reduced in class 2 mutants.
HBc with its ability to form T=3 and T=4 particles, also in vivo (15, 24, 40), represents one of few examples of T number polymorphism (29). Hence, the wide size range of the DHBc particles is unusual. Despite the limitations of the negative-staining approach (see above), we consider it highly unlikely that the apparently different particle sizes are a mere artifact of the procedure. First, differently sized particles were frequently seen side by side on one micrograph (Fig. 2C). Second, smaller-appearing particles were predominantly seen for mutants which also displayed slower sedimentation profiles. Finally, very little size variation was observed for the class 2 mutants using the same technique. Quasi-equivalence theory (10) predicts that icosahedral capsids should conform to one of the allowed T numbers, defined by the equation T=(h2 + hk + k2), with h and k being integers. In the simplest, T=1 form, 60 subunits with identical conformations make up the capsid. For the next allowed numbers, T=3 and T=4, the constituent capsid protein subunits must adopt three and four, respectively, similar but nonidentical conformations. Because the surface (F) of a sphere is defined as F = 4
x r2, where r is the radius, and assuming each subunit contributes an equal proportion of surface area, the ratio of the diameters of a T=4 to a T=3 particle is
4/3 to 1, or 1.15:1, and that of a T=4 to a T=1 particle is
4/1 to 1, or 2:1. Accordingly, the T = 3 form of a 37-nm T=4 DHBc capsid should be around 32 nm in diameter, as observed, whereas T=1 particles are expected to be in the 18.5-nm diameter range, but not between 25 and 28 nm as observed. Nominally, this size could correspond to the nonallowed triangulation number T=2 (calculated diameter of 26 nm). Recently, the in vitro formation of T=2 particles has been shown for Brome mosaic virus capsids (29, 48). Another example is the VP3 subcore of bluetongue virus (19). Such nonquasiequivalent assemblies require substantial distortions within the subunits. Moreover, the nearly continuous size range of some of the mutant DHBc particles (Fig. 2C) suggests that not all can conform to strict icosahedral symmetry. Hence, DHBc can apparently use a whole variety of different interdimer contacts, indicating an inherent structural flexibility exceeding that of HBc (5, 7). In addition, the degree of flexibility in wt-DHBc appears to be intermediate between that of class 4 mutants and that of class 2 mutants.
In HBc, most interdimer contacts are provided by the hand region, plus by small N-proximal segments encompassing aa 14 to 17 and aa 29 to 36 (54). For various DHBc intra-assembly domain mutations, their impact on the interdimer contacts is easily imagined by their location in the model. However, mutations at nearly all other sites, including the remote Dins
2, also caused changes in particle morphology. Hence, structural alterations at any one site are conveyed through the body of the protein, as in a mechanical system of interconnected rods.
This invokes two tempting speculations. First, the flexibility of the wild-type DHBc structure may be an adaptation to the multiple core protein functions in viral replication. The panel of DHBc mutants causing distinct assembly properties will now allow us to systematically address this aspect in cell culture and even in vivo. Second, structural alterations at the inner capsid face could cause corresponding alterations on the capsid surface, such as those proposed to trigger selective envelopment of mature hepadnaviral nucleocapsids (47). Magnified by to the ordered extra elements such as helix Dins
2, they may be more easily detectable than in HBV capsids (40). Lastly, although the flexibility and variable particle morphologies disfavor high-resolution structural analyses of wt-DHBc, this problem would be alleviated by using class 2 mutants.
| ACKNOWLEDGMENTS |
|---|
We thank Bettina Böttcher for many helpful discussions and D. D. Loeb and H. Will for providing cloned HHBV genomes.
| FOOTNOTES |
|---|
Published ahead of print on 19 September 2007. ![]()
Supplemental material for this article may be found at http://jvi.asm.org/. ![]()
| REFERENCES |
|---|
|
|
|---|