Structure of the Nucleoprotein Binding Domain of Mokola Virus Phosphoprotein

ABSTRACT Mokola virus (MOKV) is a nonsegmented, negative-sense RNA virus that belongs to the Lyssavirus genus and the Rhabdoviridae family. MOKV phosphoprotein P is an essential component of the replication and transcription complex and acts as a cofactor for the viral RNA-dependent RNA polymerase. P recruits the viral polymerase to the nucleoprotein-bound viral RNA (N-RNA) via an interaction between its C-terminal domain and the N-RNA complex. Here we present a structure for this domain of MOKV P, obtained by expression of full-length P in Escherichia coli, which was subsequently truncated during crystallization. The structure has a high degree of homology with P of rabies virus, another member of the Lyssavirus genus, and to a lesser degree with P of vesicular stomatitis virus (VSV), a member of the related Vesiculovirus genus. In addition, analysis of the crystal packing of this domain reveals a potential binding site for the nucleoprotein N. Using both site-directed mutagenesis and yeast two-hybrid experiments to measure P-N interaction, we have determined the relative roles of key amino acids involved in this interaction to map the region of P that binds N. This analysis also reveals a structural relationship between the N-RNA binding domain of the P proteins of the Rhabdoviridae and the Paramyxoviridae.

Like rabies virus (RABV), Mokola virus (MOKV) belongs to the Lyssavirus genus and the Rhabdoviridae family. Lyssaviruses often cause lethal encephalitis and are a significant global health risk. RABV alone is responsible for an estimated 55,000 deaths a year, primarily in poor rural areas in Africa and Asia (26), in spite of the availability of effective vaccines. In addition, poor surveillance and the emergence of novel variants that appear to be insensitive to traditional rabies vaccines (such as MOKV) mean that the true risk that these viruses pose remains underappreciated (44). Lyssaviruses belong to the order Mononegavirales and have a nonsegmented, negative-sense (NNS) RNA genome that contains five common genes in the following order: N (nucleoprotein), P (phosphoprotein), M (matrix protein), G (glycoprotein), and L (large protein) (9). During all stages of the virus life cycle, their RNA genome and RNA antigenome are encapsidated by N, which forms a large helical polymer known as the nucleocapsid (13,22). The replication/transcription complex is formed by L, P, and N. The L protein harbors the RNA-dependent RNA polymerase (RdRP) catalytic activity but also caps and polyadenylates the viral mRNA as shown for vesicular stomatitis virus (VSV) (1,33). The P protein fulfils a crucial role during RNA synthesis, as it is an essential cofactor for the RdRP activity of L. P contains a dimerization domain located between residues 91 and 131 (RABV P numbering) (15,16,23). Both structural and biochemical analyses suggest that RABV P forms elongated dimers in solution, which is in line with other evidence suggesting an important role for dimerization in replication (15). In addition, P binds two distinct forms of N, i.e., N° and N-RNA; N° refers to a soluble form of N that is not associated with RNA molecules, and N-RNA corresponds to the multimerized form of N associated with genomic RNA.
There is evidence that N° interacts with two distinct regions of P located at the N terminus (RABV P residues 4 to 40) (7,31) and C terminus (RABV P residues 185 to 297) of the phosphoprotein (7,14). In contrast, only the C-terminal domain of P interacts with N when it is bound to genomic RNA in the nucleocapsid (16). The lyssavirus phosphoprotein is also phosphorylated (20). Experimental evidence obtained for VSV further suggests that the differential phosphorylation of the N and C termini of P may be involved in regulating viral transcription and replication (8). Conversely, N is also phosphorylated, but the exact role of phosphorylation of these two proteins in regulating their binding remains controversial (29,45). Binding of P to N-RNA probably involves the C-terminal region of N (RABV N residues 376 to 450) (38). A recent model proposes that during replication the L protein forms a complex with P, which in turn binds to the N-RNA polymer and acts as a bridge to allow access of L to the RNA. The model further suggests that the P-N° complex may bind to the replicating L-P-N-RNA complex and feed the newly formed RNA strand with uncomplexed N for immediate encapsidation (1). However, it is important to note that many of the details of the interactions between L, P, and N in the context of the replication or transcription complexes are not well understood. This is due primarily to the lack of high-resolution structural data about these proteins and their complexes. To date the structures of the VSV and RABV N-RNAs have been determined (2,19), as well as individual domains of the VSV and RABV P proteins (specifically the dimerization domain of VSV P and the N-RNA binding domains of VSV and RABV P) (10,30,36). Very recently the structure of VSV N-RNA complexed with the C-terminal domain of VSV P has been determined, which reveals that P binds in the cleft between two adjacent N molecules in the nucleocapsid (18) primarily via helix α4 and residues in β2.
In contrast, no high-resolution structural data are available for L. The structure of the RABV P N-RNA binding domain in combination with previous studies of P-N complexes has led to a tentative model for the interaction between RABV N-RNA and P (1,37). However, the lack of structural data means that the exact nature of the interactions remains uncertain.
Because the functional analysis of P in the L-P-N-RNA complex remains incomplete, we crystallized full-length MOKV P to better define the structure of this protein. Here we present a structure of the C-terminal domain of MOKV P, which is involved in the interaction with N, obtained by expression of full-length P in Escherichia coli, which was subsequently truncated during crystallization. The structure shows a high degree of homology with the RABV P. The details of the contacts between molecules of P in the crystal reveal that the C-terminal acidic tail specific to MOKV P and absent in RABV P packs against a positively charged surface area of P that may represent the N-RNA binding site. On the basis of this structure, we tested a number of amino acid residues potentially involved in P-N-RNA interaction by yeast two-hybrid analysis. Using this strategy, we have been able to define a number of residues in the C-terminal domain of P, mutations of which block binding to N.

MATERIALS AND METHODS
Cloning. For in-fusion cloning, the cDNA of the gene encoding the phosphoprotein of Mokola virus was obtained by reverse transcription-PCR (RT-PCR) from total RNA isolated from infected BSR cells (a clone of BHK-21) using hexamer primers (Roche Boehringer). PCR was carried out with the primers 5′-AGGAGATATACCATGAGCAAAGATTTGGTGCATCCTAGTC-3′ and 5′-GTGATGGTGATGTTTCTCTGCCTCCTCGAGCCGGGC-3′, where the gene-specific sequence is indicated in bold. The PCR product was cloned into pOPINE using the in-fusion cloning methodology (6), yielding a full P construct with a C-terminal 6×His tag.
For cloning of the target and bait proteins for the yeast two-hybrid experiments the same cDNA was also used to amplify by PCR the DNA sequence encoding full-length N and PΔ176, which corresponds to P deleted of the first 176 amino acids. The following pairs of primers were used: 5′-GGGGACAACTTTGTACAAAAAAGTTGGCATGGAGTCTGACAAGATTGTG-3′ and 5′-GGGGACAACTTTGTACAAGAAAGTTGGTTATGATCCATCTGCATACAC-3′ for N and 5′-GGGGACAACTTTGTACAAAAAAGTTGGCATGCACACACCCGATCACGATG-3′ and 5′-GGGGACAACTTTGTACAAGAAAGTTGGTTACTCTGCCTCCTCGAGC-3′ for PΔ176. The gene-specific sequence is indicated in bold. PCR products were cloned into pDONR207 (Invitrogen) using Gateway.
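As a quick sanity check on the primer design, the reverse primer listed for PΔ176 can be reverse-complemented and translated to confirm that it encodes the acidic C-terminal tail of P followed by a stop codon. The sketch below is illustrative only and is not part of the published protocol. It assumes that the space inside the printed primer is a typesetting artifact, and the reading-frame offset is chosen by hand so that the stop codon falls in frame.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate a DNA sequence codon by codon; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Reverse primer for P-delta-176 as listed in the text, written 5' to 3'
# (assumption: the internal space in the printed primer is not real).
REVERSE_PRIMER = "GGGGACAACTTTGTACAAGAAAGTTGGTTACTCTGCCTCCTCGAGC"

coding_strand = reverse_complement(REVERSE_PRIMER)

# Assumption: offset picked by hand so that the stop codon is in frame.
FRAME_OFFSET = 1
tail = translate(coding_strand[FRAME_OFFSET:FRAME_OFFSET + 18])
print(tail)
```

Under these assumptions the translated fragment ends in EEAE followed by a stop codon, which agrees with the C-terminal residues E300, E301, A302, and E303 described in the structure analysis.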
Site-directed mutagenesis was performed with the QuikChange site-directed mutagenesis kit (Stratagene) following the manufacturer's instructions, using PΔ176 cloned into pDONR207 as a template for PCR. Primers used for site-directed mutagenesis were the following: PΔ176(
Expression and purification. The P protein was expressed and purified using standard methods as described previously (35). Briefly, E. coli expressing MOKV P was grown in 0.5 liter of TB Overnight Express (Novagen) for 5 to 6 h at 37°C followed by 20 h at 25°C. The cell pellet was harvested and frozen at −80°C. For protein purification, the pellet was defrosted on ice and resuspended in buffer A (25 mM Tris [pH 7.5], 500 mM NaCl, 30 mM imidazole) with 0.1% Tween 20, protease inhibitors, and DNase I added. The suspension was passed through a cell disruptor at 30 kpsi and then centrifuged for 30 min at 30,000 × g at 4°C. The soluble fraction was transferred to an ÄKTA Express equipped with an Ni-nitrilotriacetic acid (NTA) column connected in-line to a Hiload 16/60 Superdex 75 gel filtration column (GE Healthcare). The Ni-NTA column was washed using buffer A and eluted using buffer B (25 mM Tris [pH 7.5], 250 mM NaCl, 250 mM imidazole) before transfer to the gel filtration column equilibrated in buffer C (20 mM Tris [pH 7.5], 200 mM NaCl). Peak fraction collection was performed using the ÄKTA Express software. Peak fractions were pooled, and the protein was concentrated to 12 mg/ml. The integrity of the protein was verified by electrospray ionization (ESI) mass spectrometry as described previously (35).
Crystallization, data collection, and structure determination. The concentrated protein was subjected to the OPPF automated sitting-drop vapor diffusion crystallization pipeline as described previously (48). Crystallization trials were attempted at 20.5°C in sitting drops containing 100 nl protein and 100 nl precipitant solution equilibrated against 95-μl reservoirs in 96-well plates. In total, 576 conditions were tested. Diffraction-quality crystals appeared after 15 days in a condition containing 30% (vol/vol) pentaerythritol ethoxylate (15/4 EO/OH), 50 mM ammonium sulfate, and 100 mM bis-Tris, pH 6.5 (Hampton Index screen).
Crystals were cryoprotected by brief immersion in reservoir solution supplemented with 20% (vol/vol) glycerol before flash-cryocooling in a stream of cold (100 K) nitrogen gas. The diffraction quality of the crystal was significantly improved by annealing, i.e., blocking the cryogenic gas stream with a credit card for 5 s (3). Diffraction data were recorded from a single crystal at the I03 Diamond beamline (λ = 0.890 Å) on a Quantum 315 detector (Area Detector Systems Corporation) and processed using HKL2000. The structure was solved by molecular replacement using the structure of the RABV P N-RNA binding domain (Protein Data Bank [PDB] accession no. 1VYI) with Molrep (46), and TLS and restrained refinement at 2.1 Å (0.21 nm) was completed using REFMAC (32). Refinement statistics are shown in Table 1. Structure-based superpositions were performed using SHP (40), and sequence alignments were carried out with ClustalX (43) or MUSCLE (12). Graphics were produced using PyMOL (http://www.pymol.org/).
Yeast two-hybrid procedure. Yeast culture media were prepared as previously described (47). PΔ176 or the appropriate mutants were transferred by in vitro recombination from pDONR207 into a Gateway-compatible pDEST32 yeast expression vector (Invitrogen) to be fused with the DNA binding domain of Gal4 (Gal4DB). N protein was transferred from pDONR207 into pPC86 (Invitrogen) to be fused to the activation domain of Gal4 (Gal4AD). These constructs were transformed into yeast strain AH109 (Clontech). Yeast cells were plated on a selective medium lacking histidine and supplemented or not with the indicated amount of 3-aminotriazole (3-AT) (Sigma-Aldrich) to test the interaction-dependent transactivation of the HIS3 reporter gene.
Protein structure accession number. The final refined coordinates and structure factors have been deposited with the PDB under accession number 2wzl.
1090 ASSENBERG ET AL. J. VIROL.

RESULTS
Crystallization of MOKV P and structure determination. We expressed full-length MOKV P in E. coli and, following purification, subjected it to nanodrop crystallization trials. Diffraction-quality crystals formed within 15 days, and the 2.1-Å structure was determined by molecular replacement using the structure of the RABV P N-RNA binding domain (PDB accession no. 1VYI [30]) and refined with residuals Rwork/Rfree of 0.220/0.266. The crystal structure belongs to space group P2₁2₁2₁, with cell dimensions a = 25.8 Å, b = 48.9 Å, and c = 86.0 Å, and reveals one copy of a truncated form of P in the asymmetric unit. The region of P crystallized is the C-terminal, N-RNA binding domain (residues 197 to 303), and the experimental electron density shows a sharp cutoff at residue 197, suggesting that this is where the truncation occurred.
The refined model for residues 197 to 303 is complete and has good stereochemistry (Table 1), with more than 90% of residues in the most favored region of the Ramachandran plot (28). The data show that P had been truncated during the crystallization process, as mass spectrometry analysis of the protein immediately before the crystallization trials showed the protein to be intact at that point (data not shown). It is interesting to note that the previously published structure of the rabies virus C-terminal domain of P was obtained by expression of full-length P in insect cells, which was subsequently also truncated during the crystallization process (30).
Analysis of MOKV P structure. The structure of the MOKV P C-terminal domain is formed by six tightly folded α helices, a 3₁₀ helix, and a small β sheet consisting of two β strands (Fig. 1), with a morphology previously described as resembling a sliced pear, with a flat and curved face. It shows a high degree of structural homology with RABV P (PDB accession no. 1VYI) (Fig. 1B) (101 amino acids aligned with a root mean square deviation [RMSD] of 0.66 Å using SHP [40]), consistent with the high degree of sequence identity (68%). As the first residue visible in the electron density is residue 197, this means that the MOKV P structure is 10 amino acids shorter at the N terminus than the available RABV P structure (which starts at residue 187), yielding a shorter α1 helix. In addition, there are five extra residues at the C terminus compared to RABV P, leading to an extended α6 helix. Apart from the N- and C-terminal extremities of the P structures, the MOKV and RABV P proteins have very similar structures (Fig. 1) with similar surface charge properties. As observed for RABV P, a clear positively charged cluster is present on the round face of MOKV P, which is implicated in binding N (23), whereas for the flat face the charge is more evenly distributed. The so-called "W hole," present on the flat face, which has been implicated in binding N (although no direct evidence exists to support this [30]), is formed by three residues (C262, F266, and I288 in MOKV P and C261, W265, and M287 in RABV P). In the Mokola virus structure, however, this hole is not as pronounced as it is in the RABV P structure, where W186, from a neighboring molecule, packs into the hole and displaces W265. The role of this hydrophobic region, if any, remains unclear.
There is also clear structural similarity to P of VSV, although the fragment of VSV P solved by nuclear magnetic resonance (NMR) is considerably smaller. In this case SHP aligns 57 residues out of 70 in VSV, with an RMSD of 2.73 Å, despite the aligned residues sharing less than 11% sequence identity.
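For readers less familiar with the RMSD values quoted for these superpositions, the following sketch shows how an RMSD is computed once two structures have been superposed and equivalent residues have been paired. It is a minimal illustration, not a reimplementation of SHP, which also determines the residue equivalences and the optimal superposition itself. The function name and the toy coordinates are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between paired 3D points (same units in, same units out).

    Assumes the two coordinate sets are already superposed and that
    coords_a[i] and coords_b[i] are equivalent atoms (e.g. C-alpha atoms).
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))


# Toy example (hypothetical coordinates in angstroms): every pair is offset by 1 A in z.
structure_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
structure_b = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
print(rmsd(structure_a, structure_b))
```

Because the RMSD is averaged only over the aligned pairs, it should be read together with the number of residues aligned, as in the values quoted for RABV P and VSV P.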
[Figure legend fragment] The same view as in panel B but with the green monomer shown as an electrostatic surface representation, generated using the PDB2PQR server (using the AMBER force field [11]) in combination with APBS tools (4) and PyMOL (http://www.pymol.org/).
Mapping the P-N binding site. Analysis of the crystal structure of MOKV P using PISA (27) reveals that the major crystal packing interface involves the positively charged cluster implicated in N-RNA binding (23). Specifically, the interface is established primarily by the interactions of the C-terminal α6 helices of two symmetry-related monomers (symmetry operator: x−1/2, −y+3/2, −z), orientated in an inverted fashion relative to each other (Fig. 2A). The C-terminal four residues (E300, E301, A302, and E303) of one monomer (drawn in yellow in Fig. 2) interact, via a combination of both polar and hydrophobic contacts, with a positively charged region consisting of residues located on β1, β2, α4, and α6 (K212, K213, Y214, K215, I223, L225, R261, and Y295) in the other monomer (drawn in green in Fig. 2A and B). K212, K213, K215, R261, and Y295 engage in both main- and side-chain polar contacts with the acidic tail, with Y214, K215, I223, L225, and R261 contributing to additional hydrophobic interactions (Table 2). This interaction probably does not occur under physiological conditions because the interaction surface is small, with a total buried surface area between the two monomers of 757.4 Å², of which 392.3 Å² is contributed by the acidic tail. This is further supported by the observation that the acidic extension of MOKV P is poorly conserved among the lyssaviruses. A total of 384 GenBank sequences of P that encompass at least the motif described here and represent all the described genotypes of lyssaviruses were aligned using ClustalX [...] (23).
Moreover, the acidic tail bears a striking resemblance to the acidic region in N (DEED in MOKV N versus EEAE in MOKV P) immediately following S389, whose phosphorylation status may modulate the interaction of N with P (45).
To precisely map which amino acid residues of the MOKV P domain are involved in this interaction with N, we mutated the eight amino acids (K212 to K215, I223, L225, R261, and Y295) implicated in the interaction with the acidic tail of P. F210 and S211, which were previously postulated to play a role in P-N interaction, were also tested (23). Yeast cells were transformed with Gal4DB-P and Gal4AD-N, and the absence of toxicity when expressing these two proteins was assessed on synthetic medium without leucine and tryptophan (−L−W). P-N interaction was monitored by measuring yeast growth on the same medium but without histidine (−L−W−H) and supplemented or not with 3-AT to increase stringency. The interaction between P and N was tested by yeast two-hybrid experiments between full-length MOKV N and truncated MOKV P lacking amino acids 1 to 176 (PΔ176) to remove both the P-N° interaction site in the N-terminal part of P and the dimerization domain (7,16,31). This approach is similar to the experiments conducted previously by Jacob et al. (23) but with the important difference that we introduced specific, single mutations whereas Jacob et al. used a random mutagenesis approach yielding constructs with multiple substitutions.
As shown in Fig. 3, mutations of Y214, K215, and R261 resulted in the most significant growth defects of the yeast strain, while for K212 and L225 higher-stringency conditions were necessary to reveal an effect on N binding. For each of these residues, the number of bonds formed and the reduction in accessible surface area upon binding the acidic tail show a good overall correspondence with the yeast two-hybrid (Y2H) results (Table 2).
These results suggest that these residues are involved in the interaction between MOKV P and N. However, it is quite possible that the Y214E mutation causes incorrect protein folding, given the packing of this aromatic side chain in the core of the molecule. Table 2 shows that the accessible surface area of Y214 in the unbound state is low (36.74 Å²) and is reduced only slightly upon binding the acidic tail (27.25 Å²), indicative of a weak interaction with the tail and burial of a significant portion into the core of the protein. Similarly, increasing the stringency by the addition of 3-AT revealed that the Y214A mutation did impair yeast growth, but whether this is due to the effects on P-N complex stability or destabilization of P remains to be seen.
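The accessible surface area (ASA) argument for Y214 can be made concrete with a short calculation. The sketch below uses only the two Table 2 values quoted in the text; the helper name is hypothetical, and PISA reports these quantities directly, so this is an illustration of the arithmetic rather than a substitute for the PISA analysis.

```python
def asa_change(asa_free, asa_bound):
    """Return (area buried on binding, fraction of the free ASA that is buried)."""
    buried = asa_free - asa_bound
    return buried, buried / asa_free


# Y214 values quoted from Table 2, in square angstroms.
buried, fraction = asa_change(36.74, 27.25)
print(f"{buried:.2f} A^2 buried ({fraction:.1%} of the unbound ASA)")
```

For Y214 this gives about 9.5 Å² buried on binding the acidic tail, roughly a quarter of an already small exposed surface, which is consistent with the interpretation that this side chain contributes little direct contact and is mostly packed into the protein core.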
For residues K212, K215, L225, and R261 the MOKV and RABV P structures show that the effects observed are not likely due to a defect in P folding but are likely due to disruption of the interaction with N. Figure 3 clearly reveals the importance of the electrostatic environment in the interaction, as mutations of these residues to glutamic acid had a much more substantial effect on disrupting the interaction with N than alanine substitution. This further supports the idea that the interaction observed for P in the crystal with its acidic tail mimics the interaction with N, as the crystal structure showed that the interaction with the acidic tail was largely electrostatically driven and involved primarily main-chain atoms of the acidic tail.

DISCUSSION
We have crystallized and solved the structure of the N-RNA binding domain of Mokola virus P via expression of full-length P which was subsequently truncated during the crystallization process. The MOKV P structure shows a high degree of structural similarity with the homologous domain of RABV P, consistent with the high degree of sequence identity (68%). Due to the longer sequence of MOKV P compared to RABV P, the C-terminal alpha helix (α6) is six residues longer than that of RABV P. Whether the MOKV P α1 in the intact protein is longer than observed here remains to be established, as the sequence prior to residue 197 is poorly conserved between the RABV and MOKV P proteins and secondary structure prediction methods do not predict the presence of a helix even for RABV P in this region.
Analysis of the crystal packing interactions revealed that a positively charged region of P, previously shown to be involved in the interaction with N, interacts with four acidic C-terminal amino acids in a symmetry-related P molecule that may resemble part of the putative P binding region in N. The interaction is electrostatically driven and, together with the small buried surface area, suggests that the interacting regions in P and/or N are likely to be larger than can be ascertained on the basis of the MOKV P crystal structure alone, consistent with the recent structure of VSV N-RNA in complex with P indicating that the opposing face of the domain in P, involving helices α4 and α5, is involved in the interaction with N (18,37,38). The involvement of the positively charged region in P in binding N was further confirmed by a combination of site-directed mutagenesis and yeast two-hybrid analysis to study the effects of the mutations on the interaction of MOKV P with MOKV N. The analysis revealed that out of 10 amino acids in P potentially involved in binding N, five negatively affected the P-N inter- [...] (23), we confirmed that K212, K215, and perhaps Y214 are involved in P-N binding.
We further show that a positive charge at positions 212 and 215 is crucial for P-N interaction. Two other positions not previously described, L225 and R261, were also shown to be necessary for the P-N interaction to occur. However, our work suggests that the role of amino acids 210, 211, and 213, recognized as potentially involved in P-N binding, is less significant (or at least they contribute less to the interaction with N), as their mutation did not alter the interaction. This further reinforces the hypothesis of potential core packing destabilization (30) by some mutations when generated at random, since only a small number of the residues of P identified are shown here to have a clear effect on N binding in the yeast two-hybrid screen when tested in isolation. The high degree of conservation of these positions in lyssaviruses despite the high variability of this region of P further supports their importance in the interaction with N.
The observation that at all sites alanine substitutions are tolerated in the interaction with N indicates that the acidic tail-positive cluster interactions are probably weak and therefore mutations in this region may be compensated for by the presence of other interacting regions in P and/or N. The region of N interacting with P probably involves residues in the vicinity of S389, as its phosphorylation state may be important for the interaction with P (45) and trypsin removal of RABV N residues 376 to 450 abolishes P binding (16,38). S389 is followed by four acidic residues with a high degree of similarity with the acidic C-terminal extension of MOKV P (DEED in MOKV N versus EEAE in MOKV P). However, mutation of all four acidic residues in MOKV N (D390A, E391A, E392A, and D393A) did not affect binding of N to PΔ176 in a yeast two-hybrid screen (not shown). Another acidic region in N is also highly conserved among N proteins (ELEE in MOKV N, positions 371 to 374). Here again mutations E373A and E374A did not alter P-N binding in yeast two-hybrid experiments (not shown). One or both of these sites may still interact with P, since it has recently been shown that measles virus (MeV) N has two boxes of amino acid residues that are involved in P-N binding (5). However, MOKV N mutated at both sites (residues 373 to 374 and 390 to 393) still binds to MOKV PΔ176 (not shown). This lack of effect on binding of P by mutating individually or in combination the two acidic regions of N may reflect the fact that these regions are simply not involved in P-N binding or that the P binding region involves more than just the acidic residues mutated in this study, as suggested by the structure of the VSV N-RNA P complex (18).
Given that, in our model, much of the molecular interaction of the negatively charged tail involves main-chain atoms, it remains possible that replacement of even the entire acidic region with alanine is not sufficient to destabilize the interaction with P, as a significant number of main-chain contacts with P are likely to remain in place, as shown for VSV (18).
None of the MOKV P residues identified in this study as being important for the interaction with MOKV N appear to be conserved in the VSV P structure, suggesting that VSV P interacts differently with N than the lyssavirus P. In addition, biochemical data also indicate that P-N and P-L interactions are likely to be different (41,42). This has been confirmed by the structure of VSV N-RNA complexed with the C-terminal domain of VSV P, which reveals that those residues of VSV P that are involved in binding lie close to but do not map directly onto the residues that we predict are involved in MOKV N-P binding (18). This lack of agreement is probably due primarily to differences in the sizes of the P proteins (the C-terminal domain of MOKV P is 50% larger than the equivalent C-terminal domain of VSV).
Alongside the structures of the N-RNA binding domains from P proteins of members of the Rhabdoviridae (MOKV and RABV for the Lyssavirus genus and VSV P for the Vesiculovirus genus), the structures of functionally equivalent domains of the P proteins from other members of the order Mononegavirales have been solved. The structures of the N-RNA binding domains of Sendai virus (SeV) and measles virus (MeV) P proteins consist of an antiparallel triple-helix bundle, and, as for the Rhabdoviridae, the N-RNA binding domains among paramyxoviruses are also structurally conserved in spite of low sequence conservation (5,21,24,25). Careful analysis of the structures reveals a structural relationship between the N-RNA binding domains of the P proteins of the Rhabdoviridae and the Paramyxoviridae, where the three-helix bundles of the SeV and MeV P proteins align with helices α3, α4, and α6 of MOKV (and RABV) (Fig. 4B). Structural superposition using SHP (40) shows that 43 and 40 residues of MOKV P (out of 107) can be aligned with SeV P and MeV P, with RMSDs of 3.3 Å and 3.2 Å, respectively, over the aligned residues, despite 14% and 0% sequence identity. The similarity between VSV P and the paramyxovirus and lyssavirus P proteins is much poorer due to a shorter C-terminal helix α4 and the absence of a helix equivalent to MOKV α3 in VSV P, although the alignment suggests that at least VSV P α3 (α4 in MOKV P) is conserved. This low but detectable degree of structural similarity would suggest that the P proteins of Rhabdoviridae and Paramyxoviridae originated from a common ancestor in spite of the high degree of amino acid divergence.
Finally, it is of interest to note that the MOKV P positively charged cluster (K212, K213, and K215) in combination with R261 was recently identified as a functional nuclear localization signal (NLS) (34). Although the possible role of P nuclear localization remains to be established, the structures of the RABV and MOKV P proteins show that these regions do not conform to the classic NLS structure (39), as they are not part of a disordered or flexible loop but rather are part of an ordered region that includes beta sheet β1, the (ordered) loop separating α1 and β1, and helix α4. As noted by Pasdeloup et al. (34), the overlap with the NLS suggests that the N-RNA binding region in P may additionally act to mask the NLS upon binding to N. If verified, this would constitute another example of the modulation of the biological activities achieved by a lyssavirus protein by sequestering a biologically active protein interface during morphogenesis and/or the viral life cycle, as already postulated for the matrix protein (17).