Previous Article | Next Article ![]()
Journal of Virology, September 2006, p. 8989-8999, Vol. 80, No. 18
0022-538X/06/$08.00+0 doi:10.1128/JVI.01158-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
I-Shou Chang,2
Lin-Wei Huang,1
Po-Cheng Chen,1
Chi-Chung Wen,3
Shu-Chen Liu,1
Li-Chu Chien,3
Chung-Yen Lin,3
Chao A. Hsiung,3 and
Jyh-Lyh Juang1*
Division of Molecular and Genomic Medicine, National Health Research Institutes, Miaoli, Taiwan,1 Institute of Cancer Research, National Health Research Institutes, Miaoli, Taiwan,2 Division of Biostatistics and Bioinformatics, National Health Research Institutes, Miaoli, Taiwan3
Received 5 June 2006/ Accepted 5 July 2006
|
|
|---|
|
|
|---|
The early transcripts of AcMNPV are synthesized by host RNA polymerase II, while its late transcripts are mediated by virus-encoded RNA polymerase (15, 21). To utilize the host cell transcriptional machinery, baculoviruses have evolved promoters that closely resemble cellular RNA polymerase II-responsive promoters containing the TATA element (3, 4, 10, 16, 45). Other cis-acting regulatory elements, such as the initiator element CAGT motif (3, 44) and downstream activating region elements (24, 45), have also been found in promoters of several early AcMNPV genes. Some unconventional promoter motifs, e.g., CGTGC (33, 42, 53), are highly responsive to transactivators expressed by the baculovirus. In DNA viruses, the potent viral transactivators are synthesized in an early phase to stimulate early viral gene expression, while the host cell viral DNA concentration is still low. Four baculovirus transactivators have been found to stimulate early gene transcription, i.e., IE0, IE1, IE2, and PE38 (13). The pe38-deficient baculovirus has a reduced capacity for genome replication, budded-virus (BV) production, and virulence in Heliothis virescens larvae (39). The roles of the other early transactivators in viral infection, however, still need to be specified.
Late-phase infection is divided into two stages, late and very late, which extend from around 6 to 76 h postinfection (hpi). In late-stage infection (6 to 12 hpi), studies have identified the genes involved in replication of the viral genome and production of BVs, both known as late expression factors (lef genes) (30, 36, 46). Also from these late RNA transcripts, a TAAG sequence motif has been identified and found to constitute a primary element for late and very late promoter activity (35). In the very late stage of infection (18 to 76 hpi), also known as the occlusion phase, proteins associated with occlusion, lysis of the cell, and release of mature occluded viruses are expressed (35). At this stage, other gene products, some not essential for BV production, are also hyperexpressed. One such product, polyhedrin (polh), for example, functions as a major matrix protein for mature occluded viruses.
Although many temporally expressed viral genes have been identified, very little is known about the kinetics of genome-wide expression and regulation during the virus replication cycle in their host lepidopteran cells. To better understand how transcription regulatory mechanisms operate in cells, the temporal dynamic transcription program that occurs during the baculovirus infection cycle can be investigated. To do this, we took advantage of the recent advances in genome sequences of several baculovirus species (1, 18) and developed a whole-genome microarray analysis system. By using AcMNPV to infect Sf21 cells, we examined the viral transcription cascade in natural host cells. Our analysis of the array profile suggests that expression kinetics correlate with the viral infection, replication, and pathological effect in infected cells. An early-stage baculovirus transactivator pe38 deletion mutant virus strain was used to study its interface with the regulatory network. A comprehensive view of the baculovirus temporal transcription program in different virus genetic backgrounds would makes it possible to decipher the molecular mechanisms underlying the regulation of gene expression during infection cycles, which could help in making better use of this virus in biomedical research.
|
|
|---|
Several recombinant baculoviruses were used for temporal gene expression profiling. Bac-PH-EGFP, in which the coding region of the polh gene was replaced with a sequence encoding enhanced green fluorescent protein, was used to infect Sf21 cells. The recombinant baculovirus's titer was determined on and it was propagated in Sf21 cells as previously described (28). Wild-type AcMNPV (strain E2) and pe38 deletion virus
pe38-E9/E9, generously donated by David Theilmann (39), were applied in our differential gene expression profile analysis.
Baculovirus microarray preparation. To prepare PCR-amplified probes to be spotted onto microarray slides, 156 pairs of primers (shown at http://www.nhri.org.tw/nhri_org/mm/jljuang/supplementary/AcMNPV-Sf21/Table_SI.pdf) were designed with PROBEWIZ (41). PCR amplification was performed to produce 155 nonredundant cDNA probes representing the potential coding ORFs (1) in the genome of AcMNPV strain C6 (NC_001623) as shown at the NCBI website (http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?val=NC_001623). Genomic DNA of wild-type AcMNPV was extracted with the Viral RNA Mini kit (Viogen). It was then used as the PCR template. PCR products were purified with the QIAquick Mini kit (QIAGEN), dried in a SpeedVac, and resuspended in a spotting solution containing 50% dimethyl sulfoxide. The concentration of each amplicon was adjusted to 0.2 to 0.5 mg/ml, and then it was spotted onto aminosilane-coated UltraGAPS glass slides (Corning) by a homemade robotic microarray spotter.
In order to obtain replicates from each microarray hybridization experiment, each cDNA probe was printed four times on the glass slide. Additionally, a series of control DNA probes in the Lucidea Universal ScoreCard (Amersham Bioscience), which were derived from sequences of yeast intergenic regions, were also spotted onto different regions of the baculovirus microarray slide. These were used to generate a calibration curve for universal reference and normalization (see below).
RNA extraction, spiked-in targets, and fluorescent dye labeling. Cells infected by a specified recombinant baculovirus were harvested at various times postinfection and washed with culture medium. Total RNA of each sample was then extracted with the RNeasy Mini kit (QIAGEN) without further purification. To reduce the possibility of RNA degradation during preparation before fluorescent dye labeling, the integrity of all RNA samples was examined with a Bioanalyzer 2100 (Agilent) with an RNA Nano Labchip (Agilent).
The samples were then labeled with fluorescent dye; 20 µg of total RNA harvested from infected cells was reverse transcribed to Cy3- or Cy5-labeled cDNA with the CyScribe First Strand labeling kit (Amersham Bioscience) in the presence of Cy3-dUTP or Cy5-dUTP (Amersham Bioscience). We used oligo(dT)18-20 as the primer for reverse transcription.
It was important to generate a calibration curve for normalization of gene expression data across experiments. To do this, the mixture of spiked-in poly(A) RNA control targets of the Lucidea Universal ScoreCard (Amersham Bioscience) was also included in the labeling reaction mixtures, according to the manufacturer's instructions.
Microarray hybridization. Cy3- or Cy5-labeled cDNA was hybridized to a baculovirus array (at 16 h and 42°C in 25% formamide-5x SSC [1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate]-0.1% sodium dodecyl sulfate [SDS]-0.1 mg/ml salmon sperm DNA). For posthybridization processing, the array was washed twice (5 min each time, 42°C) in buffer I (2x SSC, 0.1% SDS), followed by a wash (10 min, room temperature) in buffer II (0.1x SSC, 0.1x SDS). Finally, the array was washed four times (1 min each time) in buffer III (0.1x SSC). With a GenePix 4000B laser scanner (Axon), the microarray image was then retrieved and processed with GenePix Pro software (Axon).
Real-time quantitative RCR. Templates for real-time quantitative PCR analysis were made from samples (1 µl) of a 10-fold dilution of cDNA. Superscript II (Invitrogen) was used for this cDNA synthesis based on total RNA (2 µg) samples harvested at 12 specific time points throughout the course of the viral infection. Real-time PCRs were carried out with the LightCycler DNA master SYBR green I kit (Roche) in the LightCycler (Roche) machine. All but one of the real-time PCR primers for the genes tested were the same as those used to amplify viral probes for microarray production. The one exception was the orf-68 gene, for which 5'-AATCTATCCGATGGCAAATAC was used as the forward primer and 5'-CGGACGGGTCTGTTCA was used as the reverse primer.
Data analysis. To normalize microarray data, fluorescence signal intensities of the external RNA spiked-in controls (see above) with predetermined quantities of input mRNA were used to fit a Bayesian isotonic regression model. This regression curve was regarded as the plot of the expected intensity versus the relative expression level and was used to transform each probe's signal intensity to obtain its normalized expression level. We note that this approach uses a Bayesian model to take into account the smoothness and the monotonicity of the regression function and falls into the framework outlined in our previous work (8). This has recently been detailed in work emphasizing its application in microarray data analysis (7). For each gene at a specific postinfection time, the mean of the four normalized expression levels was used as the basis for further data analysis.
We analyzed the gene expression kinetics of Bac-PH-EGFP in Sf21 cells throughout the infection time course (0 to 72 hpi). For mathematical convenience, we transformed unequal intervals into a time scale (0 to 1) with equal intervals between adjacent time points. Calibrated signal intensities were used to fit a piecewise polynomial regression model with the Markov chain Monte Carlo algorithms.
This gene-specific regression function represents the expression pattern of a gene during the time course of infection. Generally speaking, this function begins with a value of 0 and at some point increases until it reaches its maximum. Two key features of each gene-specific regression function were used to characterize the expression kinetics of that gene during the time course of infection. One was the time to onset (Ton), i.e., the point where the expression level of a gene first exceeds the background noise. The time to maximum (Tmax) was estimated on the basis of the time when the expression of a gene was at its highest. To group genes according to distinct kinetic features, a two-dimensional clustering of Ton and Tmax was performed with the algorithm described by Hall and Heckman (17).
Dynamically regulated gene expression patterns were also clustered by the K-means algorithm. The expression levels shown were normalized by a constant so that the sum of the 12 squares of the expression levels (at 12 time points) for each gene equals 1. The average Ton for each gene cluster was expressed as the mean ± the standard deviation.
We used GeneSpring software (Agilent) to identify possible common upstream regulatory sequences in a given set of clustered genes. The common upstream regulatory sequence denoted sequences ranging from 15 to 190 bases upstream of an ORF, and the entire span of the sequences was about 5 to 6 oligonucleotides in length. Only the exact match of a consensus sequence was retrieved. A cutoff value of 0.1 for the probability of a false-positive result was used.
|
|
|---|
![]() View larger version (30K): [in a new window] |
FIG. 1. AcMNPV microarray design and calibration method. (A) Illustration of a small region in an array with two out of four replicates of virus-specific and external control probes (represented by V1 to -18 and C1 to -23, respectively). (B) Array image of a mock infection experiment revealing virtually no cross hybridization to virus-specific probes. (C and D) Array assays of Sf21 cells infected with Bac-PH-EGFP at an MOI of 10 at 6 and 48 hpi, respectively. (E) Nonlinear calibration curve showing the relationship between signal intensities obtained from external control probes and predetermined quantities of spiked-in control targets.
|
Probe specificity was validated by hybridizations with cellular targets derived from mock-infected Sf21 cells. There was no evident hybridization signal detected by those baculovirus probes for mock-infected cells, compared to those for Bac-PH-EGFP-infected cells at 6 and 48 hpi (Fig. 1B to D). This difference suggests that all of the 155 predicted potentially coding ORFs were transcribed in Sf21 cells and that the selected probes on the custom-made microarray did not cross-hybridize with those control targets (external spiked-in yeast mRNAs) or nontarget genes (mRNAs of Sf21 cells).
Viral gene expression cascade in permissive cells. To obtain a comprehensive view of the gene expression profile throughout the course of infection, microarray analysis was performed. This was done with RNA extracted from cells at 12 postinfection time points. Bac-PH-EGFP recombinant baculovirus (28) was employed to infect the fully permissive Sf21 cells at a multiplicity of infection (MOI) of 10. Total RNAs from cell lysates were harvested at 12 different time points between 0 and 72 hpi. Array analysis was done with each of these samples. The normalized hybridization signals for each gene at different time points were then obtained (shown at http://www.nhri.org.tw/nhri_org/mm/jljuang/supplementary/AcMNPV-Sf21/Table_SII.pdf). A temporal ordering of expression profiles was created from a two-dimensional heat map (Fig. 2A) to illustrate a genome-wide view of specific gene expression patterns observed at each of the times during the course of infection. If the global baculovirus gene expression pattern in Sf21 cells can be viewed as a sequential array of cellular events, the microarray results revealed a highly ordered, orchestrated transcription program process. We detected approximately 20% of the transcripts in the viral genome from 3 to 6 hpi (Fig. 2A, group I), 50 to 60% from 6 to 9 hpi (Fig. 2A, group II), and the remaining 20 to 30% from 12 to 15 hpi (Fig. 2A, group III).
![]() View larger version (46K): [in a new window] |
FIG. 2. Time course microarray analysis of recombinant AcMNPV gene expression patterns in Sf21 cells. (A) Sf21 cells infected with Bac-PH-EGFP at an MOI of 10 were harvested at the indicated time points for microarray analysis. The patterns of viral gene expression during the course of infection are depicted as a two-dimensional heat map. Black blocks represent zero expression, and yellow blocks symbolize different expression levels corresponding to the scale bar at the bottom. (B) Quantitative real-time PCR confirmation of microarray results for five randomly selected genes. For data comparison, results derived by both methods were adjusted to 10,000 (lef-3, orf-68, iap1, and gp37) or 1,000 (sod) arbitrary units. All PCR data are averages of duplicate tests, and microarray data are averages of four replicates.
|
Viral gene expression kinetics. To allow us to further clarify the events within the expression cascade that are essential for AcMNPV infection, we introduce two parameters, Ton and Tmax, representing the postinfection time of onset of gene expression and the time to the maximal expression level, respectively (shown at http://www.nhri.org.tw/nhri_org/mm/jljuang/supplementary/AcMNPV-Sf21/Table_SII.pdf). For example, in chitinase (orf-126) gene expression kinetics (Fig. 3A), Ton is 12.7 hpi and Tmax is 23.9 hpi. While Ton points to the time when a viral transcript is produced in a given host cell type, Tmax reflects the time for the ultimate steady-state accumulation of a viral transcript during the infection process. The genome-wide Ton and Tmax were calculated and scatter plotted in two dimensions (Fig. 3B). Ton varied from 2 to 16 hpi, while Tmax had a much wider range of 13 to 53 hpi. However, most transcripts were centered between 13 and 30 hpi. By 30 hpi, more than 95% (149 out of 156) of the AcMNPV genes had reached their highest levels of expression. However, the coefficients of correlation between Ton and Tmax (r = 0.20) for all of the viral genes were weak, meaning that there was no constant lag between the onset time of transcription and the time to the maximal expression level. This indicates that the onset of transcription and time to the maximal accumulation of a transcript in host cells were not necessarily related, a finding similar to that for human herpesvirus 8 (43). Collectively, these data suggest that Ton and Tmax are the two key kinetic parameters that show the dynamic gene expression pattern in the infection process.
![]() View larger version (18K): [in a new window] |
FIG. 3. Gene expression kinetics and classification. (A) Ton (time to onset of gene expression) and Tmax (time to maximum expression level) were demonstrated by using chitinase (orf-126) as an example. (B) A classification methodology for AcMNPV genes based on the criteria of Ton and Tmax kinetics. Selected known genes in each classified group are listed at the bottom.
|
The late stage of the infection life cycle initiates at the onset of viral genome replication, which is about 5 to 6 hpi. During this stage, the genes for late structural proteins in groups II and IV, including vp39, vp80, and p24, together with the newly synthesized viral DNA, were assembled to form BV for cell-to-cell transmission of the virus within a single host or in cell culture. As a final point in the very late phase (Tmax = 49 to 53 hpi), genes in group III encoding envelope- or capsid-related structural proteins, such as p10, orf-1629, and p74, were heavily transcribed for virus packaging. Therefore, the expression cascade of viral transcripts in the late and very late phases appeared to be reflected in our transcription kinetic model.
Gene expression pattern and cytogenetic distribution. Genes participating in the same biological process may be transcriptionally coregulated (2, 27, 38, 54). Thus, baculovirus genes having the same temporal expression pattern during the time course of infection may imply sets of functionally correlated genes or suggest a common regulatory sequence motif in the genome. On the basis of K means clustering analysis, the 155 viral gene expression patterns in Sf21 cells were grouped into five clusters (Fig. 4A). These clusters were rearranged in order of increasing average Tons, ranging from top to bottom as 3.3 ± 0.5, 5.0 ± 0.8, 5.5 ± 0.7, 8.8 ± 1.1, and 12.5 ± 1.4 for clusters 1, 2, 3, 4, and 5, respectively. The time course expression profile cluster seemed to be dependent on Ton, because the amplitude of variability of Ton within the same cluster was much smaller than that among different clusters. Also, many genes in the same cluster appeared to be functionally correlated, although the factors behind this are not known. For example, many genes classified in cluster 1 were functioning as viral transcriptional activators, DNA replication factors, and apoptotic suppressors, whereas many genes grouped in clusters 4 and 5 were viral capsid-associated proteins or occlusion-derived virus envelope proteins (shown at http://www.nhri.org.tw/nhri_org/mm/jljuang/supplementary/AcMNPV-Sf21/Table_SII.pdf).
![]() View larger version (48K): [in a new window] |
FIG. 4. Cluster analysis of gene expression patterns and colocalization in baculovirus genomic map. (A) Five gene clusters obtained by K means clustering analysis were arranged in an increasing order of the average Tons. (B) Correlation between the frequency of promoter sequence motifs and gene clusters. (C) Genome map view of the five gene clusters color coded in the AcMNPV genome. Gene clusters that colocalized to nearby loci are indicated. A colocalized cluster is defined as a region that contains at least five consecutive genes from the same gene cluster where no more than one interruption occurs by a gene from other gene clusters in either the plus or the minus strand. ID, identification.
|
Besides the previously described promoter motifs, there are at least 23 genes in different expression clusters whose regulatory sequence motifs are still unidentified. It is also possible that there are other novel regulatory motifs or machineries coexisting with the identified motifs in the same gene cluster and that they also regulate the transcription pattern. In support of this idea, we found that the expression patterns of some genes appeared to be uncorrelated with their upstream sequence motifs. For example, some early-expressed genes, including orf-81, orf-66, tlp, and orf-43, possessed only late promoter motifs (http://www.nhri.org.tw/nhri_org/mm/jljuang/supplementary/AcMNPV-Sf21/Table_SII.pdf), whereas some late-expressed genes, such as orf-92 and orf-107, possessed only early motifs. These data, together with other recent evidence (37), indicate that the finding of an early or late promoter motif does not necessarily predict gene expression patterns because there might be other motifs or trans-acting factors involved in the regulation of transcription.
In addition to trying to discover if there is a correlation between the expression pattern and upstream sequence motifs, we also wanted to explore if there are any chromosomally adjacent genes that are transcriptionally coregulated. Marking the AcMNPV cytogenetic loci of genes with the color of their specific expression clusters, we found most genes to be randomly interspersed throughout the genome, with no obvious correlation with their specific patterns of expression (Fig. 4C). Nevertheless, some genes displaying synchronized expression patterns were clustered in a particular region of the genome map. There were about six such regional clusters (Fig. 4C), including the orf-1-to-orf-12 cluster, at which 9 out of 12 genes were grouped in cluster 3, and the orf-141-to-orf-150 cluster, where 9 out of 10 genes were members of cluster 5.
Physically adjacent genes could possibly share similar patterns of expression because they may be subject to the same regulatory control when transcribed. Exploring this possibility, we attempted to identify a potential common upstream regulatory sequence that might correlate with the regional clusters in the AcMNPV genome. Of the regional clusters we examined, only one was found to have a putative coregulatory motif sequence. A novel GTGCG sequence motif, located between upstream positions 15 and 190, was found in seven (orf-5, ctx, orf-1629, ptp, orf-4, lef-2, and orf-603) out of nine genes (P = 0.05) in the orf-1-to-12 cluster (data not shown), but this motif was far less common in the upstream sequences of other ORFs of the AcMNPV genome. The biological significance of this common sequence for the regulation of gene expression is not clear. The clustered genes that had very short conserved upstream sequence motifs may offer a spatial or positional advantage, making possible the efficient production of functionally related gene products that facilitate host infection.
Transcription network revealed by mutation of a very early-transcribed transactivator gene.
The transcription network in a baculovirus infection was closely coordinated by a sequential expression of early, late, and very late genes (Fig. 2A). Genes may have been interdependent along the time course, with later genes relying on other genes previously expressed. By microarray analysis, we studied the effect of a mutant baculovirus,
pe38-E9/E9 (39), whose earliest transcribed baculovirus gene, pe38 (Ton = 2.4 hpi), was deleted, on the transcription network. A parallel global gene expression analysis of wild-type AcMNPV versus the
pe38-E9/E9 mutant virus was conducted at six different time points (0 to 72 hpi) over the infection period. In theory, a scatter plot analysis of expression levels for wild-type and mutant viruses would reveal that pe38-independent genes fall along the diagonal and differentially expressed genes deviate from the diagonal. At an early stage (3 hpi), only five viral transcripts, me53, ie-2, orf-116, orf-72, and tlp, were clearly expressed in cells infected with the
pe38-E9/E9 mutant virus. These were also coincidently expressed in the same manner as the wild-type virus at later stages (Fig. 5A and G, group a), suggesting that the expression of these five genes was essentially pe38 independent. The above possibility of independent transcription of me53 was supported by a previous study using transient expression of a reporter construct containing the me53 promoter to demonstrate that the transcription of me53 may be independent of other trans-acting viral factors (23). In contrast, 12 other genes (Fig. 5G, group b), including pe38, orf-68, gp64, he65, p35, and orf-16, were expressed only in cells infected with the wild-type virus and not in cells infected with the
pe38-E9/E9 virus at 3 hpi. This suggests that expression of pe38 is vital to the regulation of many early genes in AcMNPV.
![]() View larger version (31K): [in a new window] |
FIG. 5. Differential gene expression profiling of a pe38 deletion mutant virus in strain Sf21 cells. (A to E) Scatter plot analysis for the dynamic differential gene expression delay of the pe38-E9/E9 mutant. Both wild-type and mutant viruses at an MOI of 10 were used to infect Sf21 cells. The gene expression levels shown here have been subjected to normalization. (F) Statistics of the delayed gene expression patterns at various times postinfection. An expressed gene is defined as one whose normalized expression level at the indicated time postinfection was greater than zero. (G) Temporal expression profiles of selective AcMNPV genes that were expressed in the wild-type virus in an earlier phase (<4.5 hpi) and their counterparts in the pe38-E9/E9 mutant virus. Genes in groups a and b are pe38 independent and pe38 dependent, respectively.
|
pe38-E9/E9-infected cells at early stages of infection, their expressions were deferred only at the beginning. Later, they gradually returned to normal levels at later stages of infection (Fig. 5B to F). Differences in gene expression patterns between these two viruses became less noticeable at 72 hpi (Fig. 5E). Expression of early genes (e.g., ie-1, lef-3, DNA-pol, he65, lef-11, and several novel ORFs) was not found until 4.5 to 9 hpi. Some (e.g., p143, p35, and orf-13) were only detectable at 9 to 15 hpi (Fig. 5G). Most affected by the mutation of pe38 was the expression of gp64, which was transcribed only from 15 to 72 hpi in Sf21 cells (Fig. 5G). These findings suggest that the very early-transcribed pe38 gene product may be critical at the beginning of infection but less important in the later stages in Sf21 cells. Alternatively, one might speculate that there are redundant viral or cellular factors, compensating for the deficiency of pe38 in regulation of the transcription network. |
|
|---|
We did find, however, a low correlation between raw array data and the real quantitative results of virus transcripts during infection (data not shown). Therefore, direct interpretation of raw array data should be done cautiously. Appropriate transformations are needed for the interpretation of data, particularly with regard to early-expressed genes whose expression levels are relatively low at the onset of expression. We found that an objective benchmark is needed in order to calibrate experimental and biological deviations when trying to characterize gene expression kinetics at different time points over the infection period from a set of array data that are inherently noisy. This need has not been demonstrated by previous related DNA virus microarray studies (6, 43, 49). We used external control probes plus spiked-in targets to create a calibration system and used an innovative statistical method (Fig. 1) to systematically process and normalize large amounts of gene expression data. As shown in Fig. 2B, the results of the array analysis, after data transformation, were very similar to the results obtained by the generally accepted real-time PCR method. The calibration and normalization schemes established in this study can be applied to other, related microarray studies for analysis of viral gene expression profiles.
This study also proposes a new strategy for statistically investigating the kinetics of global baculovirus transcription. The kinetics of a specific viral gene was highlighted from two independent indexes, Ton and Tmax. These were individually generated through modeling procedures with array data obtained at different time points over the infection period. On the basis of the results obtained for previously characterized genes, the expression kinetics provided clues to possible viral strategies, which may be involved in the activation or inactivation of downstream genes in target cells. For example, several members of group I in Fig. 3B were early transactivators, including ie-1, ie-2, and pe38, and the DNA synthesis regulator gene me53. The products of these genes are critically needed at the very beginning of the regulatory events, when the baculovirus aggressively utilizes host RNA polymerase for viral gene transcription. Later, they quickly become less important, when viral RNA polymerase starts to take over the transcription machinery. Therefore, it is reasonable to assume that genes in group I are transcribed early and quickly reach a maximum expression level. In contrast, several genes in group II are needed throughout the whole period until the later stages of infection, ensuring proper DNA replication. Four such genes are gp64, which encodes an envelope fusion protein essential for BV production; the helicase p143 gene (9, 34); lef-2 (25); and lef-11 (32), which are required for DNA replication. Thus, the groups of expression kinetics seem to be functionally associated with the baculovirus replication program in permissive cells.
The study of baculovirus transcription kinetics can characterize the global transcription program of known genes, as well as provide clues to potential functions of previously uncharacterized ORFs in the network. For example, a putative ORF, orf-68, located upstream of an early gene, lef-3, was shown to have Tons and Tmaxs similar to those of the three most immediate-early transcripts (i.e., pe38, me53, and lef-3), suggesting that orf-68 may play a parallel role in the early infection cycle.
Another approach used to portray the transcription program, unsupervised clustering analysis, is typically used for extracting particular subsets of expression patterns from microarray data. By this approach, we discovered two major findings regarding the regulation of baculovirus transcription programs. First, transcription onset timing was strongly influenced by the conserved promoter sequence motifs. Second, the onset time of expression might predict the subsequent viral gene expression pattern. Furthermore, because functionally correlated or coregulated genes in an operon of a bacterial genome may be located in nearby loci of the physical genome (29), we thought it might be interesting to investigate whether a similar gene organization exists in the AcMNPV genome. We tagged these gene clusters into the cytogenetic map and analyzed the relationship between the gene clusters in the viral genome and the temporal expression patterns. Most of the gene clusters were randomly dispersed in the baculovirus genome, although a few gene groups were located in adjacent regions. To our surprise, two putative late-operon-like regions were found in the orf-89-to-orf-94 and orf-141-to-orf-150 regions, where most (n = 5 and n = 7 in orf-89 to orf-94 and orf-141 to orf-150, respectively) gene products function as structural proteins or display a late expression pattern. To further confirm the existence of two late-operon-like regions in AcMNPV, we conducted a comparative genomic analysis of these sequences in three phylogenetically closely related baculoviruses: AcMNPV, Bombyx mori nucleopolyhedrosis virus, and Orgyia pseudotsugata multicapsid nucleopolyhedrosis virus. Both regions were found to be evolutionarily conserved (data not shown), supporting the argument that they had grouped together in common regulatory clusters. Although similar operon-like structures have been reported in the human herpesvirus 8 (Kaposi's sarcoma virus) genome (11, 51), it is still unclear whether an operon-like strategy is indeed used by baculoviruses to coregulate the transcription of genes clustered in the neighboring viral genomic loci. Therefore, it is important to gather information that may help answer the question of whether the observed gene organization in the AcMNPV genome actually affects viral gene expression. In addition to two gene cassettes, we discovered a common GTGCG sequence motif in the upstream coding sequences of many genes in the orf-1-to-orf-12 region. Still further studies are needed to determine whether such a motif may also coregulate transcription.
To uncover the role of a viral gene in the regulatory pathway, a viral gene needs to be mutated or deleted and the functional consequence evaluated. We analyzed the differential transcription profile of a mutant virus defective in pe38 (
pe38-E9/E9), an immediate-early viral gene presumed to be vital but probably not essential in viral DNA synthesis and BV production (39). As expected, global gene expression patterns were indeed greatly altered at the early and late stages (Fig. 5A to D) while the delayed transcripts were gradually returned to the same levels as those of the wild-type control in the very late stage (Fig. 5E), suggesting that although the pe38-encoded gene product plays a critical role in the regulation of early transcription units, it might not be as important for all regulatory cascades, especially later stages. Furthermore, the delayed temporal expression pattern we observed also might give clues helpful in understanding why the mutant virus has the phenotype of delayed BV production but still produces infectious virus particles similar to the wild-type virus (39). Most importantly, the sequential expression map in the pe38-deficient background revealed a regulatory hierarchy. Expression of the several early-transcribed genes at the top of the hierarchy (e.g., me53, tlp, and ie-2) was not altered by the mutation of pe38. An early transactivator, ie-1, was only affected in the early stage, whereas the expression levels of most of the early-transcribed genes were greatly affected by the mutation of pe38. Data from earlier studies also support this. For example, the expression of the helicase p143 (33) and gp64 (39) genes is regulated by pe38. To display the putative regulatory cascade mediated by pe38, we extracted 16 early-transcribed genes and compared the differences in temporal expression patterns between wild-type and mutant viruses. pe38 seemed crucial for the transactivation of five genes (lef-3, DNA-pol, he65, p143, and lef-11) (Fig. 5G). These five, in turn, seemed to affect the transcription of p35 and gp64. The regulatory cascade involving the pe38-deficient virus may well be complemented by another early transactivator, possibly ie-1, to modulate downstream gene expression.
This is the first study to use array analysis to elucidate the regulatory pathway controlled by a specific baculovirus early-expressed gene. Systematic mutation of other transactivator genes in the AcMNPV genome will allow further exploration of the transcription regulatory network. The same strategy can possibly be used to uncover novel roles of known genes or putative functions of uncharacterized ORFs in the genome. In the present study, for example, we identified the delayed expression of he65, a gene encoding a putative RNA ligase (19), in cells infected with the pe38 mutant virus. On the basis of this finding, it is tempting to investigate whether PE38's modulation of he65 expression is a counterreaction to the host defense. After all, he65 has been implicated in the repair of essential RNA molecules damaged by the host defense mechanism to block infection. Although this is speculation, further investigation of this possibility may forge a path into the heart of the regulatory network involving pe38. Moreover, this array-based strategy for the study of the transcription regulatory network may be particularly useful for those randomly generated mutant viruses that display specific phenotypes although the underlying genes or mechanisms need elucidation (20).
By introducing an array strategy to systematically characterize the previously established baculovirus mutants, this study points to a possibly very productive direction for constructing a baculovirus gene regulation network. The discovery and collection of these insights can allow us to build an integrated database that can greatly enhance the research of baculoviruses and their future applications.
This work was supported by NHRI.
Present address: Institute of Cancer Research, National Health Research Institutes, Miaoli, Taiwan. ![]()
|
|
|---|
-Amanitin-resistant viral RNA synthesis in nuclei isolated from nuclear polyhedrosis virus-infected Heliothis zea larvae and Spodoptera frugiperda cells. J. Virol. 38:916-921.This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»