Previous Article | Next Article ![]()
Journal of Virology, March 2006, p. 2808-2814, Vol. 80, No. 6
0022-538X/06/$08.00+0 doi:10.1128/JVI.80.6.2808-2814.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Department of Microbiology, School of Medicine, University of Alabama at Birmingham, 1025 18th Street South, Birmingham, Alabama 35294
Received 29 September 2005/ Accepted 13 December 2005
|
|
|---|
helix, and another ß hairpin. The
helix provides the stabilizing force for forming a homodimer, while the two ß hairpins add additional stabilization by forming a four-stranded ß sheet through domain swapping between two molecules. This central dimer positions the N- and C-terminal domains of P to interact with the N and L proteins, allowing the L protein to specifically recognize the nucleocapsid-RNA template and to progress along the template while concomitantly assembling N with nascent RNA. The interdimer interactions observed in the noncrystallographic packing may offer insight into the mechanism of the RNA polymerase processive reaction along the viral nucleocapsid-RNA template. |
|
|---|
The N-RNA forms a supercoil structure beneath the viral envelope. Upon entering the host cell, the N-RNA is released and serves as the template for transcription and replication. The N-RNA template is recognized by the viral polymerase, L, and by the P protein that tethers L to the N-RNA template (13, 14). During the early stage of the virus replication cycle, the L and P proteins, which are carried into the host by the virion, initiate transcription of mRNAs of each viral gene. The relative abundance is highest for the N protein, followed by the P protein, simply because they are located near the 3' end of the viral genome, while the L protein, which is nearest the 5' end of the viral genome, is produced in the least abundance (38). When sufficient amounts of the N and P proteins are produced, the polymerase, L, switches from transcribing mRNAs to replicating the viral genome. In genomic replication, a complementary positive-strand genome is first synthesized from the negative-strand genomic template and is then used as the template for producing more copies of the negative-strand genome (16). The newly synthesized genome is then transferred to the budding site to assemble progeny virions. During the entire process, the viral genome, both negative and positive, is always assembled with the N protein, and the L/P polymerase does not bind the viral genomic RNA unless it is in the form of an N-RNA complex.
The P protein plays an essential role in many steps during the replication cycle. The P protein binds both the N and L proteins specifically, which makes it the determining factor in template recognition (15, 26). The P protein also binds the RNA-free form of N, called N0, and brings it to the replication complex so the newly synthesized genomic RNA can be enwrapped by N as genomic replication progresses (5, 27, 32). P also appears to be required for productive virion assembly (11). The P protein can be divided into three domains: the N-terminal domain, the central domain, and the C-terminal domain (12) (Fig. 1). The N-terminal domain contains the residues (Ser60, Thr62, and Ser64) that are phosphorylated by host enzymes and is indispensable for the transcriptional activity of the P protein (8, 30). The C-terminal domain also contains two phosphorylation sites (Ser226 and Ser227) and is necessary for optimal replication (8, 24). The C-terminal domain is the main association site for the N and L proteins (31). The C-terminal domain can still bind N even after the first two domains are deleted (22). Mutations in this domain abolish the replication activity of the P proteins (11). The central domain seems to be the oligomerization domain (12). Experimental data have shown that the ability to form a tetramer is required for the replication activity of the P protein (18, 19). Deletion mutations within the central domain do not reduce the production of the P protein but lead to loss of replication activities (11).
![]() View larger version (20K): [in a new window] |
FIG. 1. Schematic representation of domains and their amino acid sequences in the P protein. Three domains have been mapped previously by limited proteolysis (12). Phosphorylation sites are also indicated. The amino acid sequence of the phosphoprotein is derived from VSV serotype Indiana, strain Mudd-Summers (DDBJ/EMBL/GenBank accession no. CAA27788). The sequence of the N-terminal domain is shown in blue letters. The linker region between the N-terminal domain and the central domain has not been identified, so it is included in the N-terminal domain in this representation. The sequence of the central domain (also named P3) is shown in red letters. Residue Leu139 was mutated to methionine for structure determination. The sequence of the C-terminal domain is shown in green letters. ß strands are indicated by arrows, and the helix is indicated by a cylinder above the amino acid sequence. Residue numbers defining the ß strands and helix are on top of the secondary-structure markers. circles mark the 310 helix.
|
|
|
|---|
helix. Leu139, predicted to be located in the hydrophobic part of the
helix, was the residue most suited for mutation into methionine for generating a mutated P3 protein (referred to as P3mut139), since methionine is likely to maintain the hydrophobic interaction of leucine if there is any. The mutation was completed by use of the expression clone P3 and the QuikChange XL site-directed mutagenesis kit (Stratagene). The expression and purification of the P3mut139 protein proceeded according to the protocol established for the wild-type protein as described in reference 12, with changes according to the protocol for seleno-methionine protein production. The resultant recombinant protein has the amino acid sequence shown in Fig. 1, which allowed us to prepare a seleno-methionine-substituted protein for crystallization. The elution profile of the recombinant P3mut139, purified by the gel filtration column Superdex-75 (Amersham Biosciences, Sweden), suggested that it may form an oligomer in the buffer (20 mM HEPES, 150 mM sodium chloride [pH 8.0]). Confirmation of selenium incorporation by the P3mut139 protein was performed by electrospray-time-of-flight mass spectrometry at the UAB Comprehensive Cancer Center Mass Spectrometry Shared Facility. Crystals of P3mut139 were obtained at 291 K with the hanging-drop method by mixing equal volumes of reservoir buffer (2.65 to 2.73 M ammonium sulfate, 100 mM ammonium nitrate, 7 to 8% ethylene glycol) and 25 mg/ml protein. Data collection and processing. Crystals were cryoprotected in reservoir buffer in which ethylene glycol was added up to a final concentration of 15% before flash-freezing of the crystals in liquid nitrogen. Crystals belong to space group P41212 with unit cell dimensions a = b = 74.39 Å and c = 157.31 Å, with six molecules per asymmetric unit. Diffraction data were collected at beamline 22-ID (or 22-BM) in the facilities of the South East Regional Collaborative Access Team at the Advanced Photon Source, Argonne National Laboratory, using a MAR 300 CCD (or MAR 225 CCD) area detector. Beams with series of energy from 12,661.47 to 12,663.00 eV were chosen to collect single-wavelength anomalous diffraction data. Integration, scaling, and merging of data were performed with the HKL2000 package (29) (Table 1).
|
View this table: [in a new window] |
TABLE 1. Crystallographic statistics
|
Refinement of the atomic coordinates was carried out with program CNS (6) and data set Se-Met2. With this initial refined model, the atom update and refinement procedure of Arp/Warp was used to further improve the electron density maps (33). Up to seven residues were added at the C terminus in the improved map. The final refined structure resulted in the statistics presented in Table 2. The model was refined against the more complete data set Se-Met2. The refined model contains the core residues 109 to 170 in each of the six molecules; residue 1 and 107 to 108 additional ordered residues at the N terminus (where residue 1 is the last residue of the histidine tag); and 171 to 177 additional ordered residues at the C terminus in different molecules. These additional ordered residues were stabilized by crystal packing or interdimer interactions. Eighty-four water molecules were also included in the final refined model. The protein data bank identification number for the coordinates is 2FQM.
|
View this table: [in a new window] |
TABLE 2. Crystallographic refinement
|
|
|
|---|
helix (residues 131 to 151), and a one-turn 310 helix (residues 154 to 156) followed by ß3 (residues 158 to 160 and 162) and ß4 (residues 165 to 167 and 169) (Fig. 2). ß1/ß2 and ß3/ß4 can be described as typical ß-hairpin motifs. Five residues in ß1 form main-chain hydrogen bonds with five residues in the antiparallel ß2, with four residues (116 to 119) at the tip of the ß hairpin forming a type I turn. Similarly, five residues in ß3 form main-chain hydrogen bonds with the antiparallel ß4, with a loop at the tip of the hairpin formed by residues 162 to 165. The tips in each ß hairpin appear to be more flexible than the main body of the structure unless stabilized by crystal packing contacts (see below).
![]() View larger version (26K): [in a new window] |
FIG. 2. Ribbon drawings of the structure of the central domain of the P protein (A) and the central domain dimer (in stereo) (B). The helix is represented by the spiral ribbon (in cyan), and the ß strands are represented by flat arrows (in green). A one-turn 310 helix is represented by a purple twist. Coils are represented by brown tubes. Residues corresponding to the termini and secondary-structure elements are numbered in the top panel. (C) Close-up view of the dimeric interactions near residue Trp138 presented as a stereo drawing at 90° to the structure shown in the middle panel. The ribbon drawings in this figure and the following figures were prepared by use of the program RIBBONS (7).
|
helices from each molecule interact with each other in a parallel orientation. The major dimer interactions are located at the N-terminal end of the
helix. Trp138 is at the core of this hydrophobic interaction. Side chains of Leu126, Pro127, and Leu130 on the loop leading to the
helix are packed with the side chain of Trp138. This cluster of hydrophobic side chains keeps the side chain of Trp138 in a conformation that is perpendicular to the side chain of Trp138 in the other helix of the dimer. This causes the side chain of Trp138 in the second helix to have a different conformation, resulting in the formation of a hydrogen bond between the N7 of the Trp138 side chain in the second helix and the hydroxyl group of the Thr141 side chain in the first helix. The Trp138 side chain in the second helix has a slightly different packing arrangement, with the side chains of Leu126, Pro127, and Leu130 in its own molecule. The interactions of side chains are therefore asymmetric in this region even though the two helices are oriented in parallel. Farther down the helix, the side chains of Ile142 and Val145 have more-symmetric interactions between the two helices. Interactions between the ß strands provide additional force to stabilize the dimer. ß2 in one molecule forms main-chain hydrogen bonds with ß4 in the other molecule in an antiparallel fashion. A total of seven hydrogen bonds are formed by this interaction. ß1/ß2 in one molecule and ß3/ß4 in the other molecule therefore constitute a four-stranded antiparallel ß sheet. This hydrogen bond network in the ß sheet may provide forces to stabilize the dimer that are as substantial as the hydrophobic interactions between the helices. Furthermore, the side chains of the four-stranded ß sheet also offer hydrophobic interactions on each side of the helices. The side chains of Leu113, Lys120, Leu122, and Leu124 from ß1/ß2 interact with the side chains of Leu140, Ile142, Ala144, and Val145 in the second helix, whereas the side chains of Phe159, Ala161, Val166, and Ile168 in ß3/ß4 interact with the side chains of Trp138, Met139 (Leu139 in the wild type), Ile142, Lys143, and Val146 in the second helix. A number of hydrophobic residues, including Trp138, Ile142, and Val145, participate in both helix-helix and ß sheet-helix interactions. The side chains of Trp108, Pro111, Pro127, Trp152, and Leu154 on loops are also packed with the hydrophobic side chains involved in the ß sheet-helix interactions.
In addition, there are three hydrogen bonds formed between the side chain of Glu128 and the side chain of Gln137 in the second molecule, between the side chain of Gln134 and the main-chain amide nitrogen, and between the side chain of Gln137 and the carbonyl oxygen of Glu128 in the second molecule. There is one intermolecular salt bridge between the side chain of Glu164 and the side chain of Lys135 in the second molecule.
Within this compact dimer, the intermolecular interactions are extensive. There are hydrophobic interactions between the helices, as well as between the helix and the ß sheets on both sides of the helices. Hydrogen bonds are formed between the ß hairpins, to form a continuous ß sheet between the two molecules. In addition, there are hydrophobic interactions, hydrogen bonds, and charge-charge interactions contributed by residues on the loops. The total surface area buried by the interactions between the two molecules is 4,514 Å2 when the dimer is formed. For the protein molecule of 78 residues, the extensive interactions may provide strong forces to make this dimer exceptionally stable, which may be the essential requirement for the functions of the central domain within the P protein.
Interdimer interaction. In the asymmetric unit of the crystal, there are three individual dimers (Fig. 3). The ß1 strand in one dimer forms main-chain hydrogen bonds with the ß1 strand in the neighboring dimer in an antiparallel fashion, resulting in an eight-stranded antiparallel sheet between two neighboring dimers. There are four hydrogen bonds formed between residues 113 and 115 in the first ß1 strand and residues 113 and 115 in the second ß1 strand. This ß-sheet interaction extends throughout the crystal so that long strings of the dimers are constructed by interdimer interactions involving the twofold crystallographic symmetry. In addition, the main-chain amide nitrogen and carbonyl oxygen of Gln110 in the dimer-related molecule form two hydrogen bonds with the side chain of Gln147 from the neighboring helix, while the side-chain nitrogen of the same Gln110 forms a hydrogen bond with the carbonyl oxygen of residue 140, also from the neighboring helix. These additional hydrogen bonds further stabilize the interdimer interactions. The twofold axis in each dimer is approximately perpendicular to the central axis of the dimer string. The dimer axes are rotated 120° from each other about the central axis. Three dimers, therefore, make one complete rotation in the dimer string. Both the N and C termini are quite flexible in each dimer. The first ordered residue ranges from 1 (one leftover tag residue prior to residue 107) to 109 at the N terminus. Residues prior to residue 109 become ordered only when they are stabilized by interactions with residues in a neighboring dimer. On the other hand, the C-terminal residues become ordered by extending the last residue (Val177) into a hydrophobic pocket in the adjacent dimer strings. The hydrophobic pocket is formed by the side chains of Leu113 and Lys120 in one molecule and of Trp152 in the other dimer-related molecule. In full-length P, the regions corresponding to the termini in the central domain fragment are likely to be very flexible, to allow a wide range of movements by the N-terminal and C-terminal domains as required by their functions (see Discussion). The ordered termini observed in this crystal structure are the result of crystal packing.
![]() View larger version (26K): [in a new window] |
FIG. 3. Three dimers found in the noncrystallographic asymmetric unit are presented in panel A and are shown rotated 90° in panel B. The central axis through the three dimers is evident in panel B. An eight-stranded ß sheet can be seen between two neighboring dimers in panel A. A proposed tetramer is isolated in panel C, with separate molecules represented in green, yellow, blue, and magenta.
|
Functions of the central domain in RNA polymerization. The C-terminal domain of VSV P plays an essential role in binding the N-RNA template and in viral replication (31). The N-terminal domain mainly influences transcription (8, 30). The central domain positions these two functional domains at the opposite ends and brings multiple copies of P (such as a tetramer) in close proximity through oligomerization. It is not clear whether the central domain interacts directly with any other viral proteins. In a recent study, a number of insertion and deletion mutations were introduced into the central domain and their effects on P activities were studied (11). Four Tn5-based insertion mutations of 16 amino acids were introduced at residues 136, 148, 159, and 167, respectively. The insertions at residues 136, 148, and 159 greatly reduced the replication activity, with the insertion at residue 136 being the most severe (about 90% reduction), while the insertion at residue 167 reduced the replication activity by about 40%. Only the insertion at residue 136 significantly reduced the transcription activity. Residue 167 is 3 amino acids away from the last ordered residue at the C terminus of the central domain. The inserted amino acids should have minimum effects on the integrity of the dimer structure of the central domain, probably simply extending the flexible region linking the C-terminal domain to the central domain. On the other hand, residue 136 is at the center of the helix-helix interaction in the dimer. Insertions should be strongly destructive to the P structure, resulting in a significant loss of both replication and transcription activity. Similarly, a deletion from residues 130 to 151 (two-thirds of the helix) significantly reduced replication and transcription activity, while various deletions between residues 140 and 201 had only limited reduction effects on replication and transcription activity. Residues 107 to 139 were able to retain substantial intermolecular interactions through the hydrophobic core around residue Trp138, as well as the potential tetramer interactions by the ß1 strands (Fig. 4). This stripped-down dimer may still preserve the basic oligomerization function of the central domain. It is therefore plausible to postulate that the essential role of the central domain is to maintain the dimeric structure to properly position the N-terminal and C-terminal domains for their functional activities. The central domain probably would not have significant specific, direct interactions with any other viral proteins. The fact that insertions and deletions that disrupted most of the helix-helix interactions, as well as intermolecular ß sheets, reduced but did not completely diminish replication and transcription activity supports the suggestion that the central domain has the isolated function of oligomerization.
![]() View larger version (12K): [in a new window] |
FIG. 4. Hypothetical dimer constructed for mutant P 7 (b). Residues 141 to 200 were deleted from the full-length P (a) in this mutant (described in reference 16). Residues 141 to 177 were deleted from the central domain, and the remaining -helix portion (residues 131 to 140) was manually translated to match the position of residue 140 to that of residue 151 in the original dimer structure. The ß1/ß2 hairpin was kept in its original position. The polypeptides connecting the two secondary-structure elements were manipulated to reconnect them. The N-terminal and C-terminal domains are represented by triangles and ovals, respectively.
|
|
|
|---|
The P central domain not only forms a stable dimer but also shows a potential tetrameric form, as observed in the noncrystallographic interaction. The ability to form a tetramer has been shown to be required for VSV replication activity (18, 19). The initiation site for viral replication has been shown to be separate from that for transcription (34, 39), and the replicase was shown to be formed by L/P/N with unspecified stoichiometry while the transcriptase is composed of L/P and host factors (34). The P molecules in the replicase must interact with both the N-RNA template and the additional N protein in the replicase complex. Our previous results showed that the N proteins are aligned in parallel when assembled with RNA (9). Since P also forms a parallel dimer, its twofold relationship prohibits the two C-terminal domains from binding N molecules that are not related by a twofold axis. However, the C-terminal domain of P is linked to the central domain by a very flexible linker. There is a possibility that one C-terminal domain is rearranged in the dimer in order to bind N in an orientation parallel to the first C-terminal domain. This type of interaction would introduce significant constraints on the P structure, although the possibility cannot be ruled out. The P dimer in the replicase complex may be essential for the processivity of the replicase along the N-RNA template. The two C-terminal domains in the P dimer may act as two feet to allow the complex to walk along the N-RNA template through an unclear mechanism. Since the formation of a P tetramer is required for replication activity, the tetramer structure observed in the noncrystallographic asymmetric unit may exist in the replicase complex. There are at least four C-terminal domains of P available in the replicase complex to bind both the N-RNA template and the additional N protein.
At the early stage of virus infection, the L and P proteins are carried into the cytoplasm by the ribonucleoprotein core. Since the virion genome does not carry any N0, there is no P dimer-N0 complex present to form the active replicase complex. It is likely that the L protein is associated with P proteins and host factors to form the transcriptase. The first isolation of the transcriptase complex, by Qanungo et al. (34), showed that a number of host factors, such as translation elongation factor 1
, heat shock protein 60, and guanylyltransferase, are present but that the N protein is not. The dimeric P proteins in the transcriptase are not required to recruit N0. Instead, the P proteins mainly function to tether the transcriptase to the N-RNA template and to play a role in processive transcription.
Use of the Advanced Photon Source was supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, under contract no. W-31-109-Eng-38. This work is supported in part by a grant to M.L. (NIH AI050066).
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»