Previous Article | Next Article ![]()
Journal of Virology, August 2006, p. 7699-7705, Vol. 80, No. 15
0022-538X/06/$08.00+0 doi:10.1128/JVI.00491-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Plymouth Marine Laboratory, Prospect Place, The Hoe, Plymouth, PL1 3DH, United Kingdom,1 Scottish Centre for Genomic Technology and Informatics, University of Edinburgh, Chancellor's Building, College of Medicine, 49 Little France Crescent, Edinburgh, EH16 4SB, United Kingdom,2 Marine Biological Association, Citadel Hill, Plymouth, PL1 2PB, United Kingdom3
Received 9 March 2006/ Accepted 10 May 2006
|
|
|---|
|
|
|---|
We have previously sequenced the 407,339-bp genome of EhV-86 and predicted that it encodes 472 protein coding sequences (CDSs) (22). This is the largest Phycodnaviridae genome sequenced to date. The functions of the 472 CDSs are largely unknown: only 66 have been annotated with functional product predictions on the basis of sequence similarity or protein domain matches (22). Genes of known function encode DNA polymerase subunits, RNA polymerase subunits, sphingolipid biosynthesis enzymes, and eight proteases. The Phycodnaviridae family was previously thought to lack RNA polymerase; thus, the detection of six RNA polymerase subunits in EhV-86 suggested a lifestyle distinct from those of other Phycodnaviridae (1-3, 22). A lytic-phase transcriptional profile generated from E. huxleyi infected by EhV-86, 33 h postinfection, detected the presence of 65% of the predicted virus CDS transcripts (22).
In many well-studied biological systems, viral transcripts are commonly assigned into a kinetic class by their dependence on protein synthesis or viral DNA replication (5). This is usually achieved by monitoring the sensitivity of virus gene transcript levels upon exposure to inhibitory drugs (5). Through the inhibition of translation or DNA replication with chemicals such as cycloheximide and phosphonoacetic acid, transcripts can be assigned into kinetic classes such as immediate-early (
), early (ß), early-late (
1), and late (
2). Kinetic stratification strongly correlates with functional classification. Classically, immediate-early proteins perform regulatory functions; early proteins, metabolic roles; and late proteins, structural and morphogenic roles. The culturing conditions needed to sustain optimum growth in tandem with the number of cells needed to extract total RNA in useable quantities (experiments typically involve tens of liters of culture medium) limit what can be done presently with the E. huxleyi experimental system. The application of inhibiting drugs to E. huxleyi cultures is both impractical and costly. Furthermore, the molecular biology of E. huxleyi is almost as poorly understood as the molecular biology of its virus, EhV-86. The use of drugs may have unknown affects on both host and virus systems. To build on previous genome sequencing work and to begin the task of characterizing all the EhV-86 CDSs, the majority of which are of unknown function, an expression profile of the first 4 h of infection was determined. We know from experience that mature virions are released from infected cells 4 h postinfection (data not shown); hence, this initial period was profiled in detail. By exploiting a synchronous infection and sampling early on, we avoided the need for inhibitor drugs and profiled a natural infection cycle.
|
|
|---|
Total RNA was extracted 0, 1, 2, and 4 h postinfection, using a QIAGEN midiprep kit, and used for microarray analysis as described previously (22). Briefly, cell cultures (150 ml) were filtered (0.45-µm Supor-200 membrane filter; Pall Corporation), resuspended in 2 ml phosphate-buffered saline solution, pelleted by centrifugation (20,000 x g, 1 min), resuspensed in 2 ml RNAlater (QIAGEN), and stored at 20°C until processed with the QIAGEN midiprep kit.
As a control, virus particles were checked for the presence of RNA. The virus particles were purified and concentrated as described previously (14). Briefly, virus lysate (5 liters) was filtered (0.45-µm Supor-200 membrane filter; Pall Corporation), concentrated by a tangential flow ultrafiltration unit with a molecular weight cutoff of 50,000 (Vivaflow 50; Sartorius) to a final volume of 20 ml. Aliquots (3.5 ml) of the concentrate were adjusted with CsCl to densities of 1.1, 1.2, 1.3, and 1.4, and gradients from 1.1 and 1.4 were created by ultracentrifugation at 25,000 rpm at 22°C for 2 h in a SW40 Ti Beckman rotor. Virus bands were removed with a syringe and dialyzed against 4 x 1-liter volumes of 10 mM Tris-HCl, pH 7, at 4°C. To remove any contaminating RNA associated with the outside of the virus particles, the concentrate was incubated with RNAseA (final concentration, 50 µg ml1) for 1 h at 37°C and treated with Proteinase K (final concentration, 50 µg ml1) and sodium dodecyl sulfate (SDS) (final concentration, 0.5%), and the samples were incubated at 60°C for 1 h. Samples were then treated to extract total RNA as described previously. Purified total RNA was then treated with RNase free DNase (QIAGEN) in a 50 µl volume. Following DNase treatment, RNA was cleaned and purified, using the Roche Target Purification kit (Roche), and eluted in 50 µl RNase-free water and stored at 80°C. RNA quantity and quality were assessed using the Agilent Bioanalyzer 2100 system. All RNA samples were checked for virus genomic DNA contamination by PCR, using virus major-capsid protein gene primers (14). All samples tested negative for viral genomic DNA contamination.
Fluorescent labeling of cRNA. Random amplification of the entire mRNA population was achieved using the Microarray Target Amplification kit (Roche). This requires the use of a special primer containing a random sequence with no significant homology to any known sequences in public databases (Target Amplification Sequence [TAS]) in addition to a T7 promoter and oligo(dT) sequences for first-strand cDNA synthesis. The TAS region generates a 3' anchor on the cDNA for subsequent PCR amplification with a TAS-PCR primer. First- and second-strand cDNA synthesis was performed from 200 ng of template RNA and 10 pg of spike mRNA (Stratagene), and the product was purified using the Target Purification kit (Roche). Purified cDNA was then randomly amplified by 24 cycles of PCR with TAS primers. PCR products were purified using the Microarray Target Purification kit (Roche) and then concentrated using YM30 Microcon concentration columns (Millipore). The resulting PCR-amplified cDNA was then transcribed into Cy3 fluorescently labeled cRNA, using the T7 Microarray RNA Target Synthesis kit (Roche). Labeled cRNA was then purified using the Microarray Target Purification kit. The EhV-86 microarray has been described previously (22). Briefly, 75-mer oligonucleotides are spotted in triplicate for 425 of the 472 predicted CDSs of EhV-86. For hybridization to microarrays, 7.5 µl 20 x SSC (Sigma), 1.0 µl 10% SDS, and the labeled cRNA sample were combined in a total volume of 50 µl. After being incubated at 100°C for 2 min (lid temperature, 110°C), the samples were cooled (room temperature, 5 min) and the volume was checked (and made up to 50 µl when necessary) prior to loading the samples onto the microarray slide covered by a LifterSlip (Erie Scientific Company). Microarray hybridization was performed in a microarray hybrid chamber (Camlab) at 65°C for 18 h. The microarray slides were given three posthybridization washes. The first wash was in 50 ml of 1 x SSC-0.1% SDS for 5 min with constant agitation, followed by a second wash for a further 5 min (with constant agitation) in 50 ml of 0.1 x SSC-0.1% SDS. A third wash was performed by plunging the slides 20 times into 50 ml of 0.1 x SSC. The microarray slides were immediately centrifuged to remove residual liquid (200 g, 1 min) and stored in the dark prior to being scanned. Three biologically independent samples for each of the four main study conditions (uninfected Emiliania huxleyi, postinfection at 0 h, 1 h, 2 h, and 4 h) were hybridized to one array each. For validation purposes, three further arrays were hybridized with RNA extracted from purified EhV-86 virion particles, and one array was hybridized with just the Stratagene spike RNA.
Microarray data processing and analysis. (i) Scanning and image processing. Hybridized arrays were scanned with an Affymetrix 418 array scanner. Six scans at incremental settings (gains of 0, 5, 10, 15, 20, and 25) were performed for each array in order to determine and select the optimal scan setting, producing a high dynamic signal range without saturation (9). Images from all scans were quantified using GMS Scanner software, version 1.51.0.42, and ImaGene, version 5.6.1.
All subsequent data processing and analysis steps were carried out with the R version 2.1.1 statistical programming environment (www.r-project.org) and Bioconductor version 1.6 microarray modules (www.bioconductor.org).
(ii) Background correction. Assessment of global noise and signal distributions showed evidence for a background noise gradient across some arrays. The effect was only noticeable at very low fluorescence levels and had no corresponding visible effect on signal levels. A log-linear interpolation method (Edwards) was used to correct lower-intensity values for any background carryover effects (8).
(iii) Normalization. A targeted viral array is likely to violate the assumptions that are made before applying global normalization methods; in this case it can be expected that 10% or more of the probes on the array will change expression levels between different biological conditions or time points. In anticipation of this, a large set of positive-control probes were included on the array (10 Stratagene alien probes, printed in triplicate on each of the 16 subgrids). The chosen normalization method was scaling of subset (n = 480 probes) medians to a common reference value, after an initial log2 transformation of all data values.
(iv) Expression determination. Expressed and/or not expressed calls for each gene and time point were generated on the basis of a detection threshold determined by the ninetieth percentile value of negative-control probes. A probe with an intensity value above this threshold was considered to be "on."
(v) Differential expression and hypothesis testing. Prior to further analysis, a nonspecific filter was applied to remove biologically irrelevant genes (control probes). This process reduces problems with statistical testing on multiple variables simultaneously. Of a total of 2,496 probes, 1,440 remained in the analysis set. The threshold for the filtering was in this case based on the slightly less stringent eightieth percentile of negative-control probe intensities. Differential expression was determined in two ways: (i) from an uninfected baseline sample to each of the postinfection time points and (ii) between postinfection time points. A statistical test with relative robustness for small sample sizes (simple linear model enhanced by empirical Bayes) was used to test the null hypothesis of nondifferential expression for each individual gene. Simultaneous testing on large numbers of variables (genes) leads to an increased number of potentially false-positive results; a Benjamini and Hochberg P-value correction was therefore applied.
(vi) Explorative analysis. Prior to explorative analysis, a further nonspecific filter was applied to remove genes which were not "on" in at least 3 out of 18 arrays, reducing the number of probes to 1,288. Absolute expression data were grouped and visualized using a hierarchical biclustering algorithm on expression profiles of genes and samples, combined with a heat map. The clustering parameters were "1 minus correlation" as the distance measure and "average" as the linkage method. The gene clusters were standardized to have means and standard deviations of 0 and 1, respectively, across arrays.
Microarray data accession number. Microarray data (including microarray design, hybridization, and analysis) were stored and curated in maxdLoad2 prior to submission in MAGE/ML format to the EBI Array Express database (http://www.ebi.ac.uk/arrayexpress) under accession number E-MAXD-8 (10). This data is also available at the EGTDC data catalog (http://envgen.nox.ac.uk/) under accession number egcat:000010.
|
|
|---|
![]() View larger version (30K): [in a new window] |
FIG. 1. Grouping of EhV-86 CDSs. EhV-86 CDSs are listed at the experimental time point at which they first show statistically significant (P < 0.01) differential expression compared to that of baseline uninfected samples (T1, T2, and T4 indicate 1, 2, and 4 h postinfection, respectively). CDSs are included in this table only if they match two criteria. The first criterion requires CDSs to have signal intensity values above the detection threshold for at least two out of three print replicates (asterisks indicate two probe replicates in agreement). The detection threshold is defined nonparametrically as the ninetieth percentile of human negative-control probes. The second criterion requires the differential expression compared to that of uninfected samples to be statistically significant for all three print replicates of a CDS.
|
Genes in the 104-kb central region not associated with the putative promoter show a different absolute expression profile to those associated with the putative promoter (Fig. 2). Two apparently non-promoter-associated CDSs can be seen to cluster with the promoter-associated CDSs; these correspond to ehv349, which encodes a putative protease. This CDS is associated with the promoter element but was grouped with the proteases for the purpose of this analysis. The three replicate probes for ehv247 (which is not directly associated with the promoter) can be seen to cluster with the promoter-associated CDSs. It is likely that this CDS is cotranscribed with the immediately adjacent ehv248, which does have the promoter.
![]() View larger version (30K): [in a new window] |
FIG. 2. Scatterplots of absolute CDS expression for uninfected versus each of the infected samples. The center diagonal lines represent "no difference in expression," and the two diagonal lines parallel to the center line represent the threshold for twofold-up and twofold-down regulation. The rectangular box outlined with a dashed line indicates the detection threshold for each of the two biological conditions; i.e., probes inside this box are not expressed or not measurable in uninfected and infected samples. The three replicate values for each CDS are shown.
|
Unexpectedly, expression of RNA polymerase subunit genes (ehv064, ehv108, ehv167, ehv399, and ehv434) does not appear to occur until later on in the infection process (transcripts for RNA polymerase subunits are first detected at 2 h postinfection) (Fig. 1), suggesting that the host RNA polymerase is responsible for expression of T1 genes or that the virus packages its own RNA polymerase into the virion.
Differential expression. An explorative analysis using biclustering and heat map visualization of absolute expression values (Fig. 3) reveals that the uninfected transcriptional profile (Tu) is, as would be expected, very similar to the infected profile immediately after infection (T0), with only a handful of host genes expressed on the array. Host genes display a general trend of down regulation (although this down regulation is not statistically significant) which accounts for the minor differences between the Tu and T0 profiles. These changes are likely to have been caused by the unavoidable delay between sampling cells and freezing of the RNA profile (approximately 5 min), indicating that the virus may have an effect on the host cell within minutes of infection. Two-hour and 4 h postinfection transcriptional profiles (T2 and T4, respectively) are very similar and cluster tightly together with transcripts for the majority of EhV-86 CDSs expressed (Fig. 1 to 3). The 1 h postinfection transcriptional profile (T1) fails to associate closely with either of these two clusters but appears closer to the Tu and T0 profiles than to T2 and T4 (Fig. 3).
![]() View larger version (47K): [in a new window] |
FIG. 3. Biclustering of CDS and sample expression profiles in combination with heat map visualization. Each column represents the averaged log2 expression data for three independent samples for a biological condition. Each row in the heat map represents one CDS color coded for expression level from low (blue) to medium (white) to high (red). Branches on the top and side of the heat map show the identified clusters in the sample, i.e., samples with the most-similar expression profiles across all genes and genes with similar expression profiles across all samples. As expected, results for uninfected (Tu) samples and samples at 0 h postinfection (T0) are very closely related. The later time points T2 and T4 are also closely related. T1 is the most interesting, showing a cluster of CDSs which go from "off" in the Tu and T0 samples to "on."
|
Virion message. When a target was labeled from a total RNA extraction performed on purified EhV-86 virions as a control, the only transcript detected above threshold levels on the microarray was that of ehv315 (P value < 0.01) (data not shown). CDS ehv315 is a proline-rich putative membrane protein of unknown function (22). Proline-rich proteins have previously been implicated in calcium binding (22), which is of particular interest since the host, E. huxleyi, is well known for the sequestration of calcium carbonate onto its surface (in the form of coccoliths) during active growth (21). Whether the protein EHV315 plays a role in disrupting this pathway is yet to be determined but clearly warrants further investigation.
CDSs not expressed. Many of the EhV-86 CDSs where no transcript was detected at 4 h postinfection have been assigned a putative function (22) (Fig. 4). Noteworthy is the lack of detection of transcripts for five of the eight proteases predicted to be encoded in the EhV-86 genome (ehv021, ehv109, ehv160, ehv349, and ehv361) (22). Virally encoded proteases have previously been shown to be involved in the maturation of infectious virions, suggesting further that at 4 h postinfection the full virus replication cycle may not yet have been completed (15). At 4 h postinfection, the cell has only just started releasing virus particles and we know it continues to release viruses for up to 2 days while remaining intact (data not shown). This leaves scope for a range of new transcripts to be up-regulated at specific time points after this initial 4-h period. Moreover, the EhV-86 genome also encodes key components of sphingolipid biosynthesis, a pathway that leads to the production of ceramide (22), and these genes are expressed at 2 h postinfection (Fig. 1). It is known that there is a connection between protease activation and ceramide-induced apoptosis (17). A virus-controlled apoptotic mechanism is unlikely to induce cell death in the early stages of infection; consequently, protease expression will most likely be up-regulated closer to the onset of apoptosis and cell disintegration, up to 2 days later.
![]() View larger version (29K): [in a new window] |
FIG. 4. EhV-86 CDSs for which expression of the transcript could not be detected above threshold levels, confirmed, or tested. CDSs that have previously been detected in a 33-h postinfection lytic-phase transcriptional profile are in bold (22). A transcript is defined as "not detected" if at least two out of three print replicates for this gene have signal intensity values at or below the ninetieth percentile for the signal intensity value of human negative-control probes.
|
Closing discussion. The infection cycle of EhV-86 can be divided into two broad stages: a primary stage in which a distinctive subgroup of localized CDSs associated with a putative promoter element are transcribed and a secondary stage during which CDSs are transcribed regardless of their genomic location. The function of the primary stage is difficult to ascertain, since the vast majority of the CDSs expressed have little or no database homologues. CDSs from this region have been shown to have some of the highest levels of expression during the infection process (22), presumably due to their early and then constant high levels of expression, suggesting that they are of vital importance to the infection strategy.
The presence of RNA polymerase genes in the EhV-86 genome implies that the virus has the capacity to transcribe its own genes from within the cytoplasm during infection. It is not implausible to suggest that the biphasic stages of expression that occur during EhV-86 infection of E. huxleyi are compartmentalized. There is no evidence that transcription of virus RNA polymerase genes occurs until at least 1 h into the infection; therefore, there are two main possibilities that could account for the expression of the primary genes: (i) a functional viral RNA polymerase that recognizes family A promoters is packaged into the mature virion and causes expression of primary CDSs in the cytoplasm, or (ii) following infection, viral DNA targets the host nucleus where host RNA polymerase recognizes family A promoters, leading to transcription of primary genes. Targeting of virus genomic DNA to the nucleus has previously been suggested to occur during the infection of Chlorella by PBCV-1 (18). The change from nuclear to cytoplasmic intracellular compartments (or vice versa) could account for the distinctive change in transcriptional profile observed in these experiments. If promoters similar to the family A repeats are identified during the genomic sequencing of E. huxleyi (currently under way), the second option above should certainly gain greater credence. If this is the case, subsequent breakdown of host genomic DNA (ehv041 encodes an endonuclease and is expressed between 1 and 2 h postinfection) and the nuclear envelope (a viral protease, encoded by ehv349, is expressed during the first hour of infection) could initiate the secondary stage of infection, where the remaining virus genes are expressed.
The presence of RNA polymerase in the EhV-86 genome, so far unique among the Phycodnaviridae, suggested a unique replication cycle for this giant virus. The virally encoded RNA polymerase may be intrinsically linked to the virus-encoded sphingolipid biosynthesis pathway (22). Sphingolipids are well documented as playing a crucial role in controlling cell death (12). If degradation of host DNA does occur during infection, then the ability to make new host-encoded RNA polymerase would be lost. Furthermore, if cell death is delayed extensively by the manipulation of sphingolipid biosynthesis, the active production of a virus-encoded RNA polymerase during the prolonged infection would prove vital to the replication strategy of the virus. Consequently, this could account for the retention of RNA polymerase function in coccolithoviruses (2).
The NCLDV group is composed of the Asfariviridae, Poxviridae, Mimiviridae, Iridoviridae, and Phycodnaviridae families (2). These diverse families are likely to have shared a common ancestor which was likely to have had both nuclear and cytoplasmic phases in its life cycle (11). Lineage-specific gene loss and gain within the NCLDV families is thought to contribute to the highly diverse characteristics of present-day forms. Poxviruses, asfarviruses, and iridoviruses encode their own transcription and replication machinery and undergo their replication cycles entirely in the cytoplasm (poxviruses) or start it in the nucleus and complete it in the cytoplasm (asfariviruses and iridoviruses) (11). Prior to the sequencing of EhV-86, the phycodnaviruses were thought to be characterized by their nuclear-dependent replication cycles. This interesting coccolithovirus appears to have a replication cycle more similar to that of the ancestral virus, a cycle distinct from all currently known Phycodnaviridae.
We acknowledge help with microarray data submission from the NERC Environmental Bioinformatics Centre, Centre for Ecology and Hydrology, Oxford, United Kingdom.
M.J.A. designed and performed the research, analyzed the data, and wrote the manuscript. W.H.W. conceived, designed, and coordinated the research. T.F. analyzed the data and helped to draft the manuscript. D.C.S. designed the microarray. D.R. and P.G. helped to design the experimental protocol. M.H. aided in sample collection.
Supplemental material for this article may be found at http://jvi.asm.org/. ![]()
|
|
|---|
B regulation by blocking its recruitment on the I
B
promoter and directing the factor on viral genes. J. Biol. Chem. 281:7110-7117.This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»