Previous Article | Next Article ![]()
Journal of Virology, November 2004, p. 12576-12590, Vol. 78, No. 22
0022-538X/04/$08.00+0 DOI: 10.1128/JVI.78.22.12576-12590.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Department of Biological Sciences,1 Tropical Marine Science Institute, National University of Singapore, Singapore2
Received 19 March 2004/ Accepted 29 June 2004
|
|
|---|
|
|
|---|
Iridovirus pathogens have been regarded as a cause of serious systemic diseases among feral, cultured, and ornamental fish in the recent years. Mortalities of fish due to systemic iridovirus infection reaching 30 to 100% were observed. Histopathological signs in iridovirus-infected fish may include enlargement of cells and necrosis of the renal and splenic hematopoietic tissues (28). In 1994, a novel viral disease called sleepy grouper disease (SGD) resulted in significant economic losses in Singapore marine net cage farms. Finally, this novel iridovirus of the genus Ranavirus, designated Singapore grouper iridovirus (SGIV), was successfully isolated in 1998 from brown-spotted grouper (6, 29). Further, it was successfully grown in an alternate grouper embryonated egg (Epinephelus tauvina) cell line, with good resultant titers (9) and was used as a source to purify SGIV. The physiochemical properties of SGIV have been reported previously (28). At the molecular level, only a partial sequence encoding the highly conserved major capsid protein in SGIV has been reported (28). Due to its relevance in the aquaculture industry, it is important to study the molecular mechanism of viral infection and virus-host interaction in grouper. As an initial part of these studies, we have determined the complete genomic sequence of SGIV. We have also confirmed the authenticity of some open reading frames (ORFs) using the proteomic approach and reverse transcription-PCR (RT-PCR).
|
|
|---|
Construction of libraries.
Soluble genomic DNA was quantified by spectrophotometry (UV-1600; Shimadzu). Sixty micrograms of genomic DNA was diluted with TM buffer (5 mM Tris-HCl [pH 8.0], 1.5 mM MgCl2) to a final volume of 200 µl and ultrasonicated (3-s bursts) using an ultrasonic liquid processor (model XL2020; Misonix Inc., Farmingdale, N.Y.). The appropriate viral DNA fragments (500 to 800 bp) were excised from the 1.0% agarose gel and extracted using the QIAquick gel extraction kit (QIAGEN). Genomic DNA fragments were end repaired with T4 DNA polymerase, followed by phosphorylation with T4 polynucleotide kinase. DNA fragments were purified using a High Pure PCR product purification kit (Roche) before the next enzymatic reaction. Sonicated fragments were ligated by incubation at 16°C overnight to the pUC19 vector, which had been prelinearized by SmaI followed by dephosphorylation. After purification, chimerical plasmids were transformed into electrocompetent-cell DH5
. More than 1,000 recombinants were selected from the library by the blue/white screening assay. To construct the restriction library, DNA fragments were obtained by restriction digestion with BamHI and cloned into the corresponding site of pBluescriptII KS(+) vector. Both libraries were used to scaffold the SGIV genome.
Assembly and analysis of SGIV genome. Sequencing of the viral fragments was carried out following the standard protocol supplied by Applied Biosystems. All cycle sequencing products were loaded onto the ABI PRISM 3100 genetic analyzer to acquire nucleotide sequences from both directions. Before the scaffolds were created, high-throughput BLAST analysis was performed for all nucleotide sequences to eliminate contamination reads, followed by vector screening with the InterPhace program (University of Washington). A software package, Vector NTI Suite 7.1 (InforMax Inc., Frederick, Mass.), was applied to create the contigs, assemble the genome, identify ORFs, analyze presumptive genes, and draw the genomic map. The whole genome was also submitted to http://www.softberry.com (Softberry Inc., Mount Kisco, N.Y.) for identification of all potential ORFs. These ORFs were searched against the mirror site of National Center for Biotechnology Information (NCBI) nucleotide database at the Singapore Bioinformatics Institute. The presumptive genes were submitted to the NCBI network service to search for conserved domains. Protein motifs were analyzed by using the PROSITE database, release 18.17 (8). Signal peptides and signal anchors were predicted with SignalP V2.0 (24, 25). Signal anchors exist in certain membrane proteins (type II membrane proteins) attaching to the membrane by an N-terminal sequence which shares many characteristics with a signal peptide sequence but is not cleaved. Transmembrane domains were predicted with TMpred (15).
Mass spectrometric analysis of SGIV proteins.
The protein pellet of the lower band from sucrose gradient ultracentrifugation was separated by one-dimensional sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE). Thirty-nine well-separated protein bands were excised, reduced, alkylated, and digested with trypsin (31). To extract the peptides, the gel particles were twice treated with 20 mM NH4HCO3 and 5% formic acid in 50% acetonitrile, respectively. All supernatants were combined and dried in a vacuum centrifuge. Dried peptides were dissolved in 3 to 20 µl of 0.1% trifluoroacetic acid in 50% acetonitrile. Dissolved peptides (0.5 µl) were spotted onto a target plate, followed by an equal volume of 10-mg/ml
-cyano-4-hydroxycinnamic acid in 50% acetonitrile-0.1% trifluoroacetic acid. After the spots had dried, the target plate was loaded into a Voyager-DE STR BioSpectrometry workstation mass spectrometer (PerSeptive Biosystems, Framingham, Mass.). Mass spectra were acquired with 20.5 kV, 73.5% of grid, and a delayed time of 380 ns under a positive-ion reflector mode. The resulting peptide mass fingerprints were searched against the SGIV ORF database using the AutoMS-Fit search program (version 1.2.18; PerSeptive Biosystem).
RT-PCR. Total RNA was extracted from viral cultures at different infective stages using an RNeasy Mini kit (QIAGEN). After the treatment of the total RNA with the RNase-free DNase I (QIAGEN), gene-specific primers were used to amplify the target genes by using the OneStep RT-PCR kit (QIAGEN). All the steps were followed according to the manufacturer's manual. Briefly, cDNA was reverse transcribed at 50°C for 30 min. The PCR amplification segment was started with an initial heating step at 95°C for 15 min (in order to simultaneously deactivate omniscript and sensiscript reverse transcriptases). After the activation of the HotStarTaq DNA polymerase, PCR amplification reactions were performed for 30 cycles under conditions of 95°C for 30 s, 51 to 58°C for 15 s, and 72°C for 1 min per cycle. The annealing temperature was optimized for different target genes. RT-PCR products were analyzed with 1% agarose gel and also subjected to nucleotide sequencing.
Virus abbreviations. ALIV, African lampeye iridovirus; ATV, Ambystoma tigrinum virus; BIV, Bohle iridovirus; BVDV, bovine viral diarrhea virus; CIV, Chilo iridescent virus; CV, chlorella virus; CZIV, Costelytra zealandica iridescent virus; EHDV, epizootic hemorrhagic disease virus; EHNV, epizootic hematopoietic necrosis virus; EHV-1, equine herpesvirus; FPV, fowlpox virus; FV3, frog virus 3; GIV, grouper iridovirus; GSIV, giant seaperch iridovirus; HVAV, Heliotis virescens ascovirus; IMRV, Ictalurus melas ranavirus; ISKNV, infectious spleen and kidney necrosis virus; LBIV, largemouth bass iridovirus; LCDV-1, lymphocystis disease virus 1; LYCIV, large yellow croaker iridovirus; MSEPV, Melanoplus sanguinipes entomopoxvirus; OMRV, Oncorhynchus mykiss ranavirus; PBCV, Paramecium bursaria chlorella virus; RGV, Rana grylio virus; RRV, Regina ranavirus; RSBI, Red Sea bream iridovirus; SBIV, sea bass iridovirus; SCV, Siniperca chuatsi virus; SFAV, Spodoptera frugiperda ascovirus; SGIV, Singapore grouper iridovirus; SIV, Simulium iridescent virus; SOV, Sciaenops ocellatus virus; TFV, tiger frog virus; TIV, Tipula iridescent virus; WIV, Wiseana iridescent virus.
Nucleotide sequence accession number. The complete SGIV genome sequence has been deposited in GenBank under accession no. AY521625. Accession numbers of 162 annotated ORFs are from AAS18016 to AAS18177, consecutively.
|
|
|---|
![]() View larger version (52K): [in a new window] |
FIG. 1. Organization of the SGIV genome. The SGIV genome is shown in a linear format. A total of 162 ORFs, predicted by the FGENESV program (available through: http://www.softberry.com), supplemented with Vector NTI suite 7.1, are indicated by their locations, orientations, and putative sizes. Blue arrows represent ORFs with known function, while red arrows represent ORFs detected by RT-PCR. "M" represents an ORF whose expressed product was identified by MALDI-TOF mass spectrometry. Yellow lines represent repetitive sequence regions. The scale is in 5 kbp.
|
|
View this table: [in a new window] |
TABLE 1. Listing of potential expressed ORFs in SGIV
|
|
View this table: [in a new window] |
TABLE 2. Positions of repetitive sequences in SGIV genome
|
![]() View larger version (67K): [in a new window] |
FIG. 2. Sequence alignment of selective SGIV ORF060R, ORF152L, and ORF076L with other known proteins. The homologous regions are shaded (black represents identical, grey represents conservative). The positions of the amino acid sequence are indicated on the left of the sequence. (A) Alignment of deduced amino acids of SGIV, ORF060R, accession no. AAS18075; TFV, ORF009L, accession no. NP_571991; ATV, ORF007L, accession no. AAP33184; RRV, accession no. AAK53744; LCDV, ORF132L, accession no. NP_078720; ISKNV, ORF063L, accession no. NP_612285; and CIV, ORF022L, accession no. NP_149485. (B) Alignment of deduced amino acids of SGIV, ORF152R, accession no. AAS18167; ATV, ORF050R, accession no. AAP33229; EHNV, accession no. CAB37349; TFV, ORF056L, accession no. NP_571999; and CIV, ORF161L, accession no. NP_149624. (C) Alignment of deduced amino acids of SGIV, ORF076L, AAS18091; human, Homo sapiens, accession no. NP_000261; cattle, Bos taurus, accession no. AAB34886; and mouse, Mus musculus, accession no. BAB25491.
|
Transcription and mRNA biogenesis. The putative SGIV gene products that are related to DNA transcription comprise the two largest subunits of DNA-dependent RNA polymerase II (ORF073L and ORF104L), one transcription elongation factor, TFIIS (ORF085R), and one RNase III enzyme (ORF084L; RNase III).
In addition, ORF063L exhibits similarity to one of the rat transcription factors which are important for transcriptional initiation. It may normally act to repress transcription at a variety of loci and may also play a role in chromatin structure or assembly (32). ORF061R encodes a TFIIF-interacting CTD phosphatase motif. It includes an NLI-interacting factor involved in RNA polymerase II regulation. ORF102L contains a fusion protein domain consisting of ubiquitin at the N terminus and ribosomal protein L40 at the C terminus. It also contains a zinc finger-like domain and is located in the cytoplasm (4). Ubiquitin is a highly conserved nuclear and cytoplasmic protein that has a major role in targeting cellular proteins for degradation by the 26S proteosome. It is also involved in the maintenance of chromatin structure, the regulation of gene expression, and the stress response.
Nucleotide metabolism.
Predicted amino acid sequences of proteins required for the nucleotide transport and metabolism contain
and ß subunits of ribonucleoside-diphosphate reductase (ORF064R and ORF047L), a ubiquitous cytosolic enzyme with a key role in DNA synthesis as it catalyzes the biosynthesis of deoxyribonucleotides. ORF049L encodes a dUTPase which is critical for the fidelity of DNA replication and repair. It also decreases the intracellular concentration of dUTP so that uracil cannot be incorporated into DNA (7). Purine nucleoside phosphorylase, which is involved in nucleotide transport and metabolism and encoded by ORF076L and which exists widely in mammals, was first identified in the family of Iridoviridae (Fig. 2).
Cell signaling. ORF078L and ORF081L encode two protein kinases that share a conserved catalytic core common with both serine/threonine and tyrosine protein kinases. There are a number of conserved regions in the catalytic domain of protein kinases. The protein corresponding to ORF067L belongs to the family of deoxynucleoside kinases that consists of various cytidine, guanosine, adenosine, and thymidine kinases (which also phosphorylate deoxyuridine and deoxycytosine). These enzymes catalyze the production of deoxynucleotide 5'-monophosphate from a deoxynucleoside.
Immune evasion function. ORF028L, ORF029L, ORF031L, ORF033L, ORF035L, and ORF131R encode homologs of the immunoglobulin (Ig)-like domains. Cellular members of the Ig superfamily include secreted and membrane-bound receptors and cell adhesion proteins (ORF029L and ORF035L) (39). ORF005L encodes a homolog of a mammalian amino acid transporter. It is also comprised of a C-type lectin signature which may bind to major histocompatibility complex (MHC) class I complex antigens and may promote or inhibit immune activity through intracellular signaling pathways. Thus, it is possible that ORF005L may interfere with normal immune surveillance or host responses (2). ORF068L is composed of an Ig-MHC signature ([FY]-x-C-x-[VA]-x-H). It is known that Ig constant domains and a single extracellular domain in each type of MHC chain are related. These homologous domains are approximately 100 amino acids long and include a conserved intradomain disulfide bond (26). These genes may function in host immune evasion, immune modulation, and aspects of cell and/or tissue tropism or perform other cellular functions (2).
ORF070R encodes a thiol oxidoreductase that impels the formation of disulfide bond. The correct formation of disulfide bonds is important for the folding and function of many secretory and membrane proteins. Organisms from all kingdoms of life have evolved a diverse range of thiol oxidoreductases (21).
ORF155R exhibits homology to mammalian semaphorin homologue. The sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor (41).
ORF053R encodes a prokaryotic membrane lipoprotein lipid attachment site found in prokaryotes. To our knowledge, this is a first report of this motif in iridovirus. Membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognizes a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached (12).
Cellular function. ORF003L is similar to 3-ß-hydroxysteroid dehydrogenase from TFV and other poxviruses. It catalyzes the oxidative conversion of both 3-ß-hydroxysteroid and ketosteroids, playing a critical role in biosynthesis of all classes of steroid hormones. ORF130L encodes a TonB-dependent receptor that interacts with outer membrane receptor proteins that carry out high-affinity binding and energy-dependent uptake of specific substrates into the periplasmic space. These substrates are either poorly permeative through porin channels or are encountered at very low concentrations. In the absence of TonB, these receptors bind to their substrates but do not carry out active transport. ORF115R encodes a homolog of a Bak protein, a member of the B-cell lymphoma (32% identity over 152 amino acids). Bcl-2 and related cytoplasmic proteins are key regulators of apoptosis, the cell suicide program critical for development, tissue homeostasis, and protection against pathogens. Bcl-2 family members are essential for maintenance of major organ systems to prevent a cellular apoptotic response to viral infection (1). ORF019R is composed of a glycoprotein hormone ß chain signature. The function of ORF019R in the viral replication cycle is unknown.
Phylogenetic analysis. Iridoviruses are large cytoplasmic DNA viruses where each type has a specific insect or vertebrate host (38). One of the unifying features of this virus group is the presence of a major capsid protein (MCP) that is approximately 50 kDa in size. MCP is a suitable target for the study of viral evolution, since it contains highly conserved domains, but is sufficiently diverse to distinguish closely related iridovirus isolates (34). The amino acid sequences of the known MCPs are used in comparative analyses to elucidate the phylogenic relationships between different cytoplasmic DNA viruses.
ORF072R encodes SGIV MCP. Phylogenetic analysis indicated that SGIV is distinct from all known iridoviruses (Fig. 3), but it is much closer to the genus Ranavirus. Within the MCP, amino acid identities of 73.0 (BIV), 72.8 (TFV), 72.8 (FV3), 72.4 (ATV), and 72.1% (ENHV) are noted. However, it only shows amino acid identities of 52.2 (LCDV), 45.7 (CIV), and 44.4% (ISKNV). This suggested that SGIV is a novel member of the genus Ranavirus within the family Iridoviridae. Generally, viruses with sequence identities within a given gene of less than 80% are considered members of different species rather than strains of the same species (37). The conserved protein sequence of the ATPase was also used to determine the relationship of SGIV with other iridoviruses (Fig. 3). The phylogenic tree of ATPase supports the view that SGIV is a novel species of the genus Ranavirus.
![]() View larger version (9K): [in a new window] |
FIG. 3. Phylogenetic relationship of SGIV with representative iridoviruses. The analysis was based on the multiple alignments of the protein sequences of the major capsid protein and ATPase of iridoviruses. (A) SGIV, ORF072R, accession no. AAS18087; ATV ORF014L, accession no. AAP33191; ISKNV, ORF006L, accession no. AAL72276; TFV, ORF096R, accession no. AAK55105; FV3, accession no. AAB01722; EHNV, accession no. AAO32315; LCDV-1, ORF147L, accession no. AAC24486; CIV ORF274L, accession no. AAK82135; CZIV, accession no. AAB82569; RSBI, accession no. AAP74204; WIV, accession no. AAB82568; TIV, accession no. VCXFTI; and SIV, accession no. VCXFSI. (B) SGIV, ORF134L, accession no. AAS18149; TFV, ORF016R, accession no. AAL77796; ATV, ORF083L, accession no. AAP33264; FV3, accession no. AAA43823; SOV, accession no. AAN77575; ISKNV, ORF122R, accession no. 98847; GIV, accession no. AAL68652; RSBI, accession no. BAA28670; SCV, accession no. AAL73346; LCDV, ORF054R, accession no. NP_078656; and CIV, 075L, accession no. AAB94422.
|
To make comparisons between SGIV and five other iridovirus genomes (ATV, TFV, LCDV, ISKNV, or CIV), we shifted the starting coordinates and set the start codon (ATG) of MCPs as the first base for all viral genomes. We also altered sense and antisense strands on ATV, LCDV, ISKNV, and CIV genomes in order to get the same nucleotide order on MCPs individually. However, none of the annotated ORFs were affected.
Comparing the SGIV genome to the LCDV, ISKNV, or CIV genome does not show possible clustering of genes in spite of the fact that SGIV shares 43, 22, or 29 real or annotated ORFs with the LCDV, ISKNV, or CIV genome, respectively. Although only 20 ORFs of SGIV reveal similarities to those of TFV genomes, it appears that some genes are located at similar map positions. In contrast, comparison of the SGIV genome with those of other iridoviruses shows that SGIV is much closer to ATV than other iridoviruses whose genomes are known. The sequenced genomes of the two closely related iridoviruses SGIV and ATV were compared with emphasis on genome organization and coding capacity (Fig. 4). The genome size and ORF numbers of the SGIV genome are much larger than those of ATV, which has a genome of 106,332 bp and contains 91 ORFs. Seventy-one ORFs of SGIV and ATV showed close homologies. There were some discrepancies in annotation, but inspection of DNA sequences showed that the corresponding genes are always present. Twenty-two corresponding ORFs between these two genomes are putative genes, but all remaining ORFs have no known function (Table 1). At least eight regions of conserved synteny containing more than three genes or annotated ORFs were also examined. Interestingly, TFIIS, RNase III, and one ORF (SGIV 086R, ATV 023L, and TFV 087R) are arranged in succession among SGIV, ATV, and TFV (Fig. 4). This cluster of genes may become a useful gene marker to distinguish unknown viruses from the genus Ranavirus. Scrambling of gene blocks was also observed between these two genomes. Two continuous conserved regions (blocks 4 and 5) in the SGIV genome were located at two separate gene blocks in the ATV genome, in which blocks 2 and 6 inserted. Orthologous genes between SGIV and ATV are quite similar in sequence conservation and also in gene order. Conserved linkages between SGIV and ATV indicate that they evolved from a common ancestor.
![]() View larger version (28K): [in a new window] |
FIG. 4. Conserved segments between the SGIV and ATV genomes. Both genomes are linearized and shifted genes encoding MCP as the start point. Only linked genes or annotated ORFs are indicated. Straight lines represent the gene linkages between two species. Black bars indicate the conserved syntenic regions of both genomes.
|
![]() View larger version (29K): [in a new window] |
FIG. 5. SDS-PAGE of SGIV proteins. Viral proteins were purified and separated via one-dimensional SDS-PAGE. Thirty-nine visible gel-separated protein bands were excised and digested enzymatically, and their mass spectra were obtained and automatically searched against the SGIV ORF database. Twenty-six proteins were identified by MALDI-TOF mass spectrometry. However, peptide signals from bands 26, 35, 36, 37, 38, and 39 were too low to give satisfactory identification.
|
|
View this table: [in a new window] |
TABLE 3. Identification of SGIV proteins corresponding to ORFs by MS
|
All 20 novel genes mentioned above were further checked and verified at the RNA level by RT-PCR. Total RNA (including virus and host) was extracted at 0-, 6-, 12-, 24-, 48-, and 72-h infective stages. Several genes started transcription early after the cell line was inoculated, 12 h (i.e., ORF090R and ORF093R) (data not shown). All novel genes were detected by RT-PCR after 48 h of infection (Fig. 6). Full lengths of 14 novel genes were amplified by reverse transcriptase and HotStarTaq DNA polymerase (Fig. 6A and 6). However, only partial sequences of ORF012L (2,107 to 3,075 bp), ORF039L (1 to 900 bp), ORF046L (17 to 747 bp), ORF050L (9 to 600 bp), ORF055R (12 to 588 bp), and ORF057L (7 to 832 bp) were amplified (Fig. 6C). Furthermore, RT-PCR products were used for DNA sequencing to confirm their respective authenticity.
![]() View larger version (92K): [in a new window] |
FIG. 6. Amplification of 20 novel genes of SGIV via RT-PCR. Total RNA (harvested after 48 h of infection) was isolated by using the RNeasy Mini kit and amplified by using the OneStep RT-PCR kit. Full lengths of 14 genes were amplified (A and B). Partial sequences were acquired from another six genes (C). Lanes C, control; lane M, 1-kb DNA ladder (Promega).
|
helices. Another 17 proteins were categorized as
/ß proteins. The prediction of transmembrane regions and orientation was also done via TMpred on the ISREC server and is listed in Table 1. We intend to elucidate the three-dimensional structures of these novel proteins by analyzing structural biology and their functions by using small interfering RNA and other related technologies. |
|
|---|
This work was financially supported by the grant "Establishment of a Laboratory of Excellence in Aquatic and Marine Biotechnology (LEAMB)" to Choy Leong Hew.
|
|
|---|
This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»