Previous Article | Next Article ![]()
Journal of Virology, May 2009, p. 5109-5116, Vol. 83, No. 10
0022-538X/09/$08.00+0 doi:10.1128/JVI.02667-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
,
Departments of Molecular Microbiology and Pathology & Immunology, Washington University School of Medicine, St. Louis, Missouri 63110,1 Department of Pathology, University of Texas Medical Branch, Galveston, Texas 775512
Received 28 December 2008/ Accepted 26 February 2009
|
|
|---|
|
|
|---|
Midway virus (MIDWV) was first isolated in 1966 from seabird ticks of two species [Ornithodoros (Alectorobius) spp.] collected on the Midway, Kure, and Manana islands in the Central Pacific and from northern Honshu, Japan. In addition, on Aomatsushima Island, nestling seabirds of two seabird species, Larus crassirostris and Nycticorax nycticorax, were found to have antibody to MIDWV. This virus is pathogenic for newborn Swiss mice but not 4-week-old Swiss mice injected intracranially. Also, the virus is cytopathic in BHK-21 cells and produces plaques in Vero cells. Efforts to classify MIDWV have revealed only that MIDWV is antigenically related to NYMV in cross-box complement fixation (CF) assays (22).
To date, conventional approaches, such as serological analysis and electron microscopy (EM), have not yielded definitive characterization of MIDWV or NYMV. Heretofore, it was not known whether MIDWV and NYMV are DNA viruses or RNA viruses or to which virus family they belong. In this paper, we describe the application of unbiased high-throughput sequencing to define the genome sequences of NYMV and MIDWV. By analysis of their genomes, NYMV and MIDWV were determined to be negative-stranded RNA viruses highly divergent from all known viruses but most closely related to viruses in the order Mononegavirales. On the basis of the analysis presented herein, we propose that NYMV and MIDWV define a novel taxon within the order Mononegavirales.
|
|
|---|
Transmission EM. Infected monolayers of BHK-21 cells were fixed 6 days after infection in a mixture of 2.5% formaldehyde, 0.1% glutaraldehyde in 0.05 M cacodylate buffer (pH 7.2) containing 0.03% trinitrophenol and 0.03% CaCl2. The monolayers were washed in 0.1 M cacodylate buffer (pH 7.2), scraped off the plastic, pelleted, and further processed as a pellet. The cells were postfixed in 1% OsO4 in 0.1 M cacodylate buffer. Infected cells were stained en bloc with 1% uranyl acetate in 0.1 M maleate buffer, dehydrated in ethanol, and embedded in Poly/Bed 812 epoxy resin (Polysciences, Warrington, PA). Ultrathin sections were cut on a Reichert-Leica Ultracut S ultramicrotome, stained with 2% aqueous uranyl acetate and lead citrate, and examined with a Philips 201 or CM 100 EM at 60 kV.
CF test. Suckling mouse brain, sucrose-acetone extracted antigens, and hyperimmune mouse ascitic fluids were prepared for NYMV and MIDWV as described previously (25). CF tests were performed by the microtiter technique (8), using 2 units of guinea pig complement and overnight incubation of the antigen and antibody at 4°C. The CF titers were recorded as the highest dilutions giving CF levels of 3+ or 4+ on a scale of 0 to 4+.
RNA extraction. Total RNA was extracted from Vero cells mock infected or infected with MIDWV or NYMV by using TRIzol-LS (Invitrogen, Carlsbad, California). The sample was mixed with chloroform (4 volumes chloroform:1 volume original sample) and centrifuged at 12,000 x g for 15 min at room temperature. The aqueous phase was transferred and mixed with an equal volume of isopropyl alcohol and centrifuged at 12,000 x g for 10 min at room temperature. The RNA pellet was washed with 70% ethanol and then dissolved in RNase-free water. Extracted total RNA was quantified using UV spectrophotometry.
Reverse transcription, amplification, and shotgun sequencing. Random amplification of the extracted RNA was performed as described previously (29). In brief, RNA was reverse transcribed with primer A (5'-GTTTCCCAGTCACGATANNNNNNNNN), and second-strand DNA synthesis was carried out with Sequenase (United States Biochemical, Cleveland, OH). Subsequently, this material was used as the template for 40 cycles of PCR with primer B (5'-GTTTCCCAGTCACGATA) by using the following profile: 30 s at 94°C, 30 s at 40°C, 30 s at 50°C, and 60 s at 72°C. Primer B-amplified nucleic acid was cloned into pCR4-TOPO and sequenced using standard Sanger chemistry with an ABI 3730xl (Applied Biosystems, Foster City, CA) sequencer. The resulting sequence reads were trimmed to remove vector and primer B sequences and then assembled using Phred/Phrap. Contigs with sequence similarity to known viral sequences were identified using tBLASTx (1).
Genome sequencing. Assembled contigs from the shotgun reads were confirmed by reverse transcription-PCR (RT-PCR), followed by sequencing. Gaps between contigs were closed by designing PCR primers from the existing contigs to span each gap. PCR amplicons were cloned into pCR4 and sequenced. To extend the sequences, 5' rapid amplification of cDNA ends (RACE) and 3' RACE were performed. The virus-specific primers used can be found in Table S1 in the supplemental material.
Northern blot analysis of MIDWV. Five micrograms of total RNA from MIDWV-infected Vero culture was electrophoresed through a 1% agarose-formaldehyde gel, transferred to a nylon membrane, and hybridized under standard conditions to a radiolabeled double-stranded DNA probe generated by PCR amplification of a fragment of the MIDWV L (polymerase) gene, using the primers MIDWV_5F (5'-GTCACGAAAGGAGGGTTGTTGAGCT-3') and MIDWV_5R (5'-CCCTGAGGAAGATCCTTACTCTCAA-3').
Gene prediction and analysis. The open reading frames (ORFs) of NYMV and MIDWV were predicted using the NCBI ORF finder (30). The initial predictions were then refined based on the identification of putative transcription initiation and termination signals (see "Noncoding region analysis" below). For each ORF, signal peptides were predicted using SignalP 3.0 (5). Transmembrane regions were predicted using PolyPhobius (15). Predicted ORFs were analyzed via ProtParam (11) to determine the isoelectric points of their products, NetPhos 2.0 (6) to determine potential phosphorylation sites, and NetOGlyc 3.1 (13) and NetNGlyc 1.0 (R. Gupta, E. Jung, and S. Brunak, unpublished data) to determine potential glycosylation sites.
Noncoding region analysis. The presence of conserved transcription termination and initiation sequences was determined using MEME (2). To define any potential transcriptional termination motifs, the intervening sequences from each putative stop codon to the first downstream poly-U tract were used as input for MEME. For the terminal ORF VI (L or polymerase) gene, 50 bp of sequence upstream of the stop codon was included in the analysis, since it is known for other viruses in the Mononegavirales that the transcriptional termination signals for the polymerase genes are within the coding region. In order to search for any conserved transcriptional initiation motifs, MEME was used to analyze the following set of sequences from each ORF: (i) 1 nucleotide (nt) downstream of the previous poly-U tract to the 1st in-frame ATG; (ii) the first in-frame ATG to the second in-frame ATG; and (iii) the second in-frame ATG to the third in-frame ATG. Thus, the first three in-frame ATGs were considered candidates for the true ORF start.
Phylogenetic analysis. Amino acid sequence alignments were performed using ClustalX V1.83 (26) with default parameters. Maximum parsimony phylogenetic trees were constructed from the ClustalX alignments by using PAUP 4.0b with 1,000 bootstrap repetitions, using default settings (21). Maximum likelihood analyses were performed using RAxML with 100 repetitions, using BLOSUM matrices.
Nucleotide sequence accession numbers. The sequences for MIDWV and NYMV have been deposited in GenBank under accession numbers FJ554525 and FJ554526, respectively.
|
|
|---|
![]() View larger version (26K): [in a new window] |
FIG. 1. EM of NYMV and MIDWV. (A) EM of extracellular NYMV. Shown is an ultrathin section of BHK cells infected with NYMV embedded in epoxy resin. Arrows indicate the viral particles. (B) EM of extracellular MIDWV. Shown is an ultrathin section of BHK cells infected with MIDWV embedded in epoxy resin. Arrows indicate the viral particles.
|
|
View this table: [in a new window] |
TABLE 1. Results of CF tests showing serologic relationship between NYMV and MIDWV
|
The assembled viral contigs were experimentally confirmed by sequencing of RT-PCR products generated by contig-specific primers. The contigs were then joined by RT-PCR with primers designed to span the gaps between the contigs. The contigs were extended first through 3' RACE reactions targeting the viral mRNAs. Finally, the sequences at the extreme 5' and 3' ends of the genome were obtained through a series of 5' RACE reactions to obtain the noncoding sequence from the viral genome and antigenome. Multiple 5' RACE reactions for NYMV yielded consensus sequences suggesting that the entire genome of NYMV was 11,631 nt. For MIDWV, 5'RACE targeting the genome strand yielded a consensus sequence for the 5' untranslated region of MIDWV. Multiple 5'RACE reactions targeting the antigenomic polarity strand of MIDWV gave multiple products containing various deletions. Despite many efforts, we were unable to resolve these discrepancies in the 3' untranslated region of MIDWV. The 3' end of the existing contig was defined using the regions of the 5' RACE products that were in agreement in multiple experiments. The near-complete genome of MIDWV is 11,752 nt.
Experimental confirmation of MIDWV genome size. To experimentally confirm the size of the MIDWV genome, we performed Northern blot analysis on total RNA from MIDWV-infected Vero cell culture. The MIDWV probe, which targets a portion of the MIDWV RNA polymerase, yielded two bands of approximately 11 kb and 6 kb (Fig. 2, lane 1). The observation of a band of approximately 11 kb was consistent with the 11,752-nt contig assembled to date, suggesting that we had sequenced the majority of the MIDWV genome. The smaller, 6-kb band was presumed to represent the mRNA for ORF VI.
![]() View larger version (34K): [in a new window] |
FIG. 2. Northern blot hybridization using radiolabeled MIDWV cDNA as a probe. Lane 1, total RNA from Vero cells infected with MIDWV; lane 2, total RNA from uninfected Vero cells. The position of the single-stranded RNA marker segments are indicated.
|
![]() View larger version (5K): [in a new window] |
FIG. 3. Genome organization of NYMV and MIDWV. Diagrammatic representation of the 3'-to-5' arrangement of ORFs of the genomes of NYMV (A) and MIDWV (B). The putative ORFs are shown as open rectangles.
|
![]() View larger version (53K): [in a new window] |
FIG. 5. Phylogenetic analysis of the viral polymerase (L) proteins. Phylogenetic trees were constructed with the protein sequences by using the maximum parsimony method. The first and second numbers at the branches show the bootstrap percentage values obtained from maximum parsimony (1,000 repetitions) and maximum likelihood (100 repetitions) analyses, respectively. The type species for each genus in the order Mononegavirales was included in the analysis. Genus names are shown in italics. Abbreviations: BDV, Borna disease virus; ZEBOV, Zaire Ebola virus; LVMV, Lake Victoria marburgvirus; NDV, Newcastle disease virus; HeV, Hendra virus; NiV, Nipah virus; MeV, measles virus; HPIV-1, human parainfluenza 1 virus; SeV, Sendai virus; MuV, mumps virus; aMPV, avian metapneumovirus; hMPV, human metapneumovirus; RSV, respiratory syncytial virus; LNYV, lettuce necrotic yellows virus; BEFV, bovine ephemeral fever virus; VSIV, vesicular stomatitis Indiana virus; RABV, rabies virus; IHNV, infectious hematopoetic necrosis virus; MMV, maize mosaic virus; RYSV, rice yellow stunt virus.
|
|
View this table: [in a new window] |
TABLE 2. Predicted ORFs of NYMV and MIDWVa
|
|
View this table: [in a new window] |
TABLE 3. Calculated and predicted properties of NYMV and MIDWV ORF II, III, IV, and V proteins
|
The matrix protein (M) of mononegaviruses is a basic, nonglycosylated membrane protein that ranges in size from 16.2 kDa (for Borna disease virus) to 43.1 kDa (for human parainfluenza 4a virus). Either ORF III or ORF IV of NYMV and MIDWV could potentially encode the matrix protein. They are all basic proteins, but the ORF III proteins are larger than the known M proteins (NYMV, 44.6 kDa; MIDWV, 49.1 kDa), while ORF IV proteins are smaller than the known M proteins (NYMV, 15.1 kDa; MIDWV, 15.3 kDa). None of these ORF proteins were predicted to be glycosylated, as only MIDWV III had any potential glycosylation sites and none of the ORF proteins had the signal peptide necessary for targeting to the N or O glycosylation machinery. No transmembrane region was predicted for either ORF protein, which is consistent with the role of Borna disease virus M as a peripheral membrane-associated protein (17). We cannot conclude at this time whether ORF III or ORF IV encodes the M protein.
The glycoproteins of rhabdoviruses, filoviruses, and bornaviruses are classical type I transmembrane proteins containing a cleavable signal sequence and an exterior N terminus and cytoplasmic C terminus, while the glycoproteins of paramyxoviruses are type II transmembrane proteins with a cytoplasmic N terminus, an exterior C terminus, and a noncleavable signal peptide (7). Using SignalP 3.0 (5) and PolyPhobius (15), we looked for the presence and cleavability of a signal peptide and the presence and orientation of transmembrane regions in the NYMV and MIDWV ORF proteins. Only the NYMV and MIDWV ORF V proteins were predicted to possess a cleavable signal peptide and to be type I transmembrane proteins. In addition, ORF V proteins contained multiple predicted glycosylation sites (Table 2). These data, coupled with the observation that these proteins were located immediately preceding the viral polymerase, suggested that ORFs V of NYMV and MIDWV encode the viral glycoproteins (G).
Analysis of noncoding viral RNA. To better assess the relationship between NYMV, MIDWV, and the viral families in the order Mononegavirales, we compared the 5' and 3' untranslated regions and intergenic regions of NYMV and MIDWV to those of members of the Mononegavirales. Specifically, we searched for the putative transcription initiation and termination signals for each ORF in each virus. By use of MEME (2), a computational algorithm for identifying conserved sequence motifs, a transcription initiation motif of 3'-AGUUGG(G/A)(G/A)GAA-5' was identified upstream of every putative ORF for NYMV (Fig. 4A). The motif was present 3 to 68 nt before the start codon. MEME also detected a transcription initiation motif of 3'-GGUUGG(A/G)GGG(G/A) –5' in MIDWV (Fig. 4B). This motif was positioned 1 to 68 nt upstream of every putative ORF. Similarly, conserved transcription termination motifs of 3'-AGAAAUGUUUUU-5' and 3'-AGAA(U/A)UGUUUUU-5' were detected for NYMV (Fig. 4C) and MIDWV (Fig. 4D), respectively.
![]() View larger version (36K): [in a new window] |
FIG. 4. Analysis of the noncoding regions of NYMV and MIDWV. Putative transcription initiation sequences for NYMV (A) and MIDWV (B) and transcription termination sequences for NYMV (C) and MIDWV (D) are shown. The consensus motif is shown in the 3'-to-5' orientation below the sequences for each viral ORF.
|
![]() View larger version (47K): [in a new window] |
FIG. 6. Phylogenetic analysis of the viral nucleocapsid (N) proteins. Phylogenetic trees were constructed with the protein sequences by using the maximum parsimony method. The first and second numbers at the branches show the bootstrap percentage values obtained from maximum parsimony (1,000 repetitions) and maximum likelihood (100 repetitions) analyses, respectively.
|
|
|
|---|
Additional analyses of the noncoding regions of MIDWV and NYMV identified conserved transcription initiation and termination motifs among the ORFs of each virus. Typical viruses in the order Mononegavirales have transcriptional termination signals that contain several features, including a U tract of 4 to 8 nt, which serves as a template for the addition of the poly(A) tail to the nascent mRNAs through reiterative transcription, a conserved C residue immediately preceding this poly(U) tract and an A/U-rich tract (31). In some rhabdoviruses and paramyxoviruses, the A/U-rich tracts are conserved among the genes within a particular virus (3, 4, 18, 20, 27, 28). In contrast, within the Borna disease virus genome, the transcriptional termination motif contains an A followed by six or seven U's (19). Although this motif is preceded by an A/U-rich tract, the residues within the tract are not conserved among the Borna disease virus genes. Thus, the presence of these conserved motifs in each ORF of MIDWV and NYMV was more similar to the pattern found in the rhabdoviruses and paramyxoviruses than to that found in the bornaviruses.
Other physical properties of MIDWV and NYMV were not useful classifiers of these viruses. The genome sizes of NYMV (11.6 kb) and MIDWV (
11.8 kb) were most consistent with that previously described for rhabdoviruses (11 to 15 kb) but are not significantly greater than that for Borna disease virus (9 kb), while those for the paramyxoviruses (15.1 to 19.2 kb) and the filoviruses (
19 kb) are generally much greater (10). The shape of the virions seen by EM was more similar to those typically seen with some paramyxoviruses or Borna disease virus (9, 32) than to those seen with the classic bullet-shaped virions usually associated with rhabdoviruses or with the thread-like particles of filoviruses. Clearly, experimental data are needed to definitively classify NYMV and MIDWV. For example, key properties of Borna disease virus that differentiate it from the rhabdoviruses include the facts that it replicates in the nucleus and that its genes undergo extensive splicing. It is unknown at this point where NYMV and MIDWV replicate and whether splicing of their transcripts occurs.
In conclusion, we have sequenced the genomes of NYMV and MIDWV, two previously unclassified viruses. Based on our analysis, MIDWV and NYMV clearly define a novel genus, tentatively named Nyavirus, in the order Mononegavirales. Because they are highly divergent from all members of known virus families, it is also possible that they define a novel family, although additional data will be necessary to clarify the relationship of the nyaviruses to other mononegaviruses. This study underscores the power of high-throughput sequencing strategies to identify highly divergent viruses and to resolve long-standing mysteries such as the identities of NYMV and MIDWV as well as the tremendous virus diversity in the world that remains unexplored to date.
We thank Fred Murphy for assistance with interpretation of the electron micrographs and for reading the manuscript.
Published ahead of print on 11 March 2009. ![]()
Supplemental material for this article may be found at http://jvi.asm.org/. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»