Previous Article | Next Article ![]()
Journal of Virology, February 2005, p. 2506-2516, Vol. 79, No. 4
0022-538X/05/$08.00+0 doi:10.1128/JVI.79.4.2506-2516.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
José L. Moreno,1,
Sonia Zúñiga,1
Sara Alonso,1 and
Luis Enjuanes1*
Department of Molecular and Cell Biology, Centro Nacional de Biotecnología, CSIC, Campus Universidad Autónoma, Cantoblanco, Madrid, Spain1
Received 13 July 2004/ Accepted 23 September 2004
|
|
|---|
G values was observed, suggesting that a more precise definition of the TRS limits might be proposed, specifically that it consists of the central CS and around 4 nucleotides flanking 5' and 3' the CS. Sequences downstream of the CS exert a stronger influence on the template-switching decision according to a model of polymerase strand transfer and template switching during minus-strand synthesis. |
|
|---|
Coronavirus transcription, and in general transcription in the Nidovirales order, is an RNA-dependent RNA synthesis that includes a discontinuous step during the synthesis of subgenomic mRNAs (sgmRNAs) (16, 30). This transcription process ultimately generates a nested set of sgmRNAs that are 5'- and 3'-coterminal with the virus genome. The common 5'-terminal leader sequence of 93 nucleotides (nt), derived from the genome 5' terminus, is fused to the 5' end of the mRNA coding sequence (body) by a discontinuous transcription mechanism. Sequences preceding each gene represent signals for the discontinuous transcription of sgRNAs. These are the transcription-regulating sequences (TRSs) that include a conserved core sequence (CS; 5'-CUAAAC-3'), identical in all TGEV genes (the CS of the body sequence [CS-B]), and the 5'- and 3'-end-flanking sequences (5' TRS and 3' TRS, respectively) that regulate transcription (2). Since this CS sequence is also found at the 3' end of the leader sequence (CS-L), it may base pair with the nascent minus strand complementary to each CS-B (cCS-B). In fact, the requirement for base pairing during transcription has been formally demonstrated to occur in arteriviruses (25, 38) and coronaviruses (44) by experiments in which base pairing between CS-L and the complement of CS-B was engineered in infectious genomic cDNAs. Subgenomic RNA (sgRNA) synthesis in CS-L and CS-B mutants was regulated by changing only the base pairing between these two elements. Moreover, alternative mRNAs were synthesized in TGEV from noncanonical CSs, provided that their flanking sequences extended complementarity with TRS-L (34, 44). In this report, the role in transcription of nucleotides immediately flanking the CS-B has been analyzed using infectious genomic TGEV cDNAs. Base pairing between leader sequences and the nascent negative RNA strand beyond the canonical CS sequence (5'-CUAAAC-3') has been shown in this report to be a determinant factor in coronavirus transcriptional regulation.
Although two major models have been proposed to explain the discontinuous transcription in Nidovirales (16, 30), current experimental data favor the model of discontinuous transcription during negative-strand synthesis (28, 29, 31, 32). This concept was reinforced by demonstrating for arterivirus and coronavirus that the CS included in the sgmRNA was derived from the CS preceding each gene and not from the CS present at the 3' end of the leader sequence (25, 38, 44). In this model, the TRS-B acts as an attenuation and dissociation signal for the transcription complex during the synthesis of the RNA negative strand. Template switching at the sites of RNA-dependent RNA polymerase (RdRp) pausing resembles a high-frequency similarity-assisted copy choice RNA recombination (3, 20, 23) in which the noncontiguous TRS-B and TRS-L sequences are probably brought into physical proximity by RNA-protein and protein-protein interactions (44). Also in this model, the nascent negative RNA with the TRS complement at its 3' end (cTRS-B) (donor molecule) dissociates from the genomic-RNA (gRNA) template at the CS domain to join the TRS-L (acceptor) and resume synthesis of sgRNA. This sgRNA serves as a template for multiple rounds of sgmRNA synthesis. It has been experimentally proven that CS in TRS-L is exposed in a stem-loop in arteriviruses (35) and is, probably, also exposed in a similar secondary structure in coronaviruses (I. Sola, S. Alonso, S. Zúñiga, and L. Enjuanes, unpublished data), as shown by RNA structure predictions, making the TRS-L accessible as an acceptor during template switching.
Transcription in coronavirus is probably regulated by two main factors: (i) base pairing between the TRS-L and the nascent negative strand and (ii) RNA-protein and protein-protein interactions involving TRS sequences and viral and cellular proteins. The proximity of body TRSs to the 3' end of the genome probably influences the relative amount of sgmRNA, because the RdRp finds less attenuation and dissociation TRS-B signals during the synthesis of sgRNAs of smaller size. This is the case for other viruses that produce multiple sgRNAs (19) and, in general, for coronaviruses. Nevertheless, a perfect correlation between mRNA abundance and the relative position of TRS in the genome could not strictly be established (27, 37). The relevance in transcription of other potential factors, such as relative TRS position in the genome and the TRS-B secondary structure, could not be confirmed either for arteriviruses or for coronaviruses (21, 22, 24).
Previous studies of TGEV have shown that the canonical CS was nonessential for the generation of subgenomic mRNAs, but its presence led to transcription levels at least 103-fold higher than those in its absence (44). It was also shown that the synthesis of sgmRNAs proceeds only when the CS is located in an appropriate context (2). That seems to be the case with CS-S2, a canonical CS sequence within the S gene (152 nt downstream of CS-S1, the CS leading to the synthesis of mRNA S coding for S protein) that does not lead to the synthesis of any detectable amount of the corresponding sgmRNA (2), probably because it is located in a nonfavorable sequence context.
In a previous work, the role of CS nucleotides (5'-CUAAAC-3') in coronavirus transcription was analyzed (44). In this report, the role in transcription of four nucleotides immediately flanking the CS both at the 5' and the 3' end has been studied using as a model the transcriptionally inactive canonical CS-S2. The rationale for selecting 5' and 3' TRS flanking sequences consisting of four nucleotides comes from the results of an in silico analysis (see below) showing that to predict both viral mRNAs and alternative mRNAs at noncanonical junction sites, an optimal TRS-L should include the CS plus four nucleotides flanking the CS both at the 5' and 3' ends. Furthermore, these predictions have been supported by experimental data. Using TGEV infectious genomic cDNAs, we have shown that CS-S2 inactivity is a genome position-independent phenomenon. A good correlation between mRNA S2 levels and the free energy (
G) of duplex formation between TRS-L and cTRS-S2 was observed in mutants that extended complementarity with TRS-L, indicating that this base pairing during the synthesis of nascent RNA is a key factor for transcription regulation in coronavirus, leading to detectable transcription levels only when a minimum threshold is reached. It has also been shown that for similar
G values, mutations extending complementarity with TRS-L by the 3' TRS sequences led to larger amounts of mRNA-S2 than mutations at the 5' TRS region, indicating that sequences downstream of the CS exert a stronger influence on the template-switching decision. This observation is most consistent with a model of polymerase strand transfer and template switching during minus-strand synthesis in which the CS is probably the major nascent chain detachment signal.
|
|
|---|
Plasmid constructs.
TGEV cDNAs including mutated sequences of the inactive TRS-S2 transcription unit in the place previously occupied by the nonessential genes 3a and 3b were generated by PCR-directed mutagenesis. The 918-bp BmgBI-BlpI fragment, including the 3a gene and the first 320 nt of the 3b gene, was removed from intermediate plasmid pSL-TGEV-AvrII, comprising nt 22965 to 25865 from the TGEV genome (GenBank accession no. AJ271965), and replaced by mutant TRS-S2 transcription units consisting of CS-S2 and 30 nt from both the 5' TRS and 3' TRS, including a series of nucleotide substitutions that extend complementarity with TRS-L (Fig. 1). An 80-bp product corresponding to wild-type (wt) or mutant TRS-S2 transcription units was generated by PCR with the oligonucleotides described in Table 1, which included BmgBI and BlpI restriction sites, using the plasmid pSL-SC11-3EMN/C8-BGH (8) as the template. PCR products were digested with BmgBI and BlpI and cloned into the same restriction sites of plasmid pSL-TGEV-AvrII, leading to plasmid pSL-TGEV-AvrII-
3-TRS-S2wt and the collection of TRS-S2 mutants M1 to M10. To introduce mutations in the TGEV infectious cDNA, the 1,982-bp AvrII-AvrII fragment from plasmid pSL-TGEV-AvrII-
3-TRS-S2 with the corresponding mutations was inserted into the same sites of plasmid pBAC-TGEV
Cla (nt 22976 and 25867 of the TGEV genome), leading to plasmid pBAC-TGEV
Cla-
3-TRS-S2. To obtain the full-length TGEV-cDNA, the toxic ClaI-ClaI fragment (5,198 bp) was introduced as previously described (1). All cloning steps were checked by sequencing the PCR-amplified fragments and cloning junctions.
![]() View larger version (61K): [in a new window] |
FIG. 1. Sequences of TRS-S2 mutants that extend the complementarity with the leader TRS. Mutations introduced into the nucleotides immediately flanking the CS-S2 are shown in the figure (mutants M1 to M10). The right column shows the titers of each mutant when it was grown on ST cells after three plaque purification steps. L, sequences adjacent to the CS-L in the leader region; S1, sequences adjacent to the native CS-S1; S2, sequences adjacent to the native CS-S2. Virus titers (PFU per milliliter) are indicated in the right column.
|
|
View this table: [in a new window] |
TABLE 1. Oligonucleotides used for site-directed mutagenesis
|
RNA analysis by Northern blotting. Total intracellular RNA was extracted at 16 h postinfection (hpi) from virus-infected ST cells using the RNeasy mini kit (QIAGEN) according to the manufacturer's instructions. RNAs were separated in denaturing 1% agarose-2.2 M formaldehyde gels and blotted onto positively charged nylon membranes (BrightStar-Plus; Ambion) as described previously (2). The 3'-untranslated region-specific single-stranded DNA probe was complementary to nt 28300 to 28544 of the TGEV strain PUR46-MAD genome (27). Probe labeling was performed with the BrightStar psoralen-biotin nonisotopic labeling kit (Ambion), and Northern hybridizations were performed according to the manufacturer's instructions. Detection was done with the BrightStar BioDetect kit (Ambion).
RNA analysis by RT-PCR. RNA analysis of the rTGEVs was performed by reverse transcription-PCR (RT-PCR). Total intracellular RNA was extracted at 16 hpi from rTGEV-infected cells as previously described. cDNAs were synthesized at 50°C for 1 h with avian myeloblastosis virus reverse transcriptase (Reflectase) (Active Motif) and the antisense primer CS-RS (5'-ATCACCATTGAGAAGTTCAACTGCT-3'), complementary to nt 375 to 351 of ORF 3b. The cDNA generated was used as a template for specific PCR amplification of mRNAs using the reverse primer CS-RS and the forward primer SP (5'-GTGAGTGTAGCGTGGCTATATCTCTTC-3'), complementary to nt 15 to 39 of the TGEV leader sequence. For RT-PCR amplification of genomic sequences in the regions of the 3a and 3b genes, oligonucleotides CS-RS and S-4310-VS (5'-ATTACGAACCAATTGAAAAAGTGC-3'), complementary to nt 4316 to 4339 of gene S, were used. RT-PCR products were separated by electrophoresis in 2% agarose gels, purified, and used for direct sequencing with the same oligonucleotides used for PCR.
Real-time RT-PCR was used for quantitative analysis of gRNA (used as an endogenous standard) and mRNA-S2 from different rTGEVs with mutated TRS-S2 sequences. Oligonucleotides used for RT and PCRs, described in Table 2, were designed by the Primer Express software. Forward oligonucleotide mRNACS1 JS4 was used to amplify mRNA-S1. Forward oligonucleotide mRNACS2 JS3 amplified mRNA-S2 from mutants M1, M2, M3, and M4. Forward oligonucleotide mRNACS2 JS7 was used to amplify mRNA-S2 from mutants M5 to M10. SYBR-green PCR master mix (Applied Biosystems) was used in the PCR step, according to the manufacturer's specifications. Detection was performed with an ABI PRISM 7000 sequence detection system (Applied Biosystems). Data were analyzed with ABI PRISM 7000 SDS version 1.0 software.
|
View this table: [in a new window] |
TABLE 2. Oligonucleotides used for RT-PCR analysis of RNA from rTGEV-infected cells
|
|
|
|---|
![]() View larger version (43K): [in a new window] |
FIG. 2. Synthesis of sgmRNA-S2 by TRS-S2 mutants. (A) Diagram showing the insertion site in the TGEV genome (at the position of nonessential genes 3a and 3b) of the TRS-S2 transcription unit, which consists of CS-S2 and 30 nt from both the 5' TRS and the 3' TRS, including a series of nucleotide substitutions that extend the complementarity with the leader TRS. The structure of the expected sgmRNA-S2 is shown under the gRNA structure. Arrows indicate the approximate positions of primers used for RT-PCR. UTR, 3' untranslated region. (B) Specific detection by RT-PCR of mRNA-S2 in TRS-S2 mutants. The mRNA 3a.2 species corresponds to the alternative mRNA generated from a noncanonical leader-to-body junction located at the 3' end of ORF S, 57 nt upstream of the CS-S2 engineered at the 3ab site. TRS-S1, transcription unit containing the sequence of the active TRS that drives the synthesis of sgmRNA S; TRS-S2 wt, transcription module containing the wt sequence of the inactive TRS, including CS-S2. (C) Sequence analysis of the body-to-leader junction sites in the two mRNA species detected. As an example, the case of M8 is shown. The sequence on the top corresponds to gRNA in the fusion site. The bottom sequence in the light-gray box corresponds to the leader (L) sequence. The CS appears as white letters in a black box. CGAA and GAAA motifs are shown in dark-gray boxes. Vertical bars represent the identity between the sequences, with thick bars at the possible fusion site. Dotted vertical bars represent the possible non-Watson-Crick interactions. Crossover should occur in any nucleotides above the arrow.
|
Prediction of subgenomic mRNA synthesis by potential base pairing between the TRS-L and the nascent negative RNA sequences. It was previously shown that potential base pairing between TRS-L and the nascent negative RNA sequences of TGEV was an important regulatory factor of discontinuous transcription (44). Potential base pairing was estimated using an in silico approach, based on a local alignment algorithm that analyzed the identity between TRS-L and genomic RNA sequences and predicted not only the TGEV sequences leading to the synthesis of structural and nonstructural TGEV mRNAs, but also the noncanonical sites involved in the generation of alternative mRNAs. Analysis of the potential base pairing between TRS-L and sequences complementary to gRNA in the region of insertion of TRS-S2 showed two peaks of high identity corresponding to the canonical and noncanonical leader-to-body junction sites found in all TRS-S2 mutants that led to mRNAs S2 and 3a.2, respectively (Fig. 3). In the rTGEV mutant including the wt TRS-S2 sequence, the peak with the highest TRS-L identity to gRNA (score, 45) corresponded to the junction site 3a.2, whereas a low potential base pairing value (score, 32) was associated with the CS-S2 site. This result may explain the detection of a unique mRNA species generated at the most favorable junction site, 3a.2 (Fig. 2B). In contrast, for all engineered TRS-S2 mutants, in addition to the peak corresponding to the 3a.2 junction site, a second peak with increasing levels of identity between TRS-L and gRNA (score between 35 and 70) was also predicted. This peak corresponds to the CS-S2 junction site in mutants with different sequences immediately flanking CS-S2. This result may explain the detection of two mRNA species generated at the 3a.2 and CS-S2 sites, with varied relative intensities according to the different values of potential base pairing at the CS-S2 junction site (Fig. 2B). In the TRS-S1 mutant, the smallest peak was associated with CS-S1 (score 35).
![]() View larger version (37K): [in a new window] |
FIG. 3. In silico analysis of the identity between TRS-L (5'-CGAACUAAACGAAA-3') and the TGEV genomic sequences in the region of TRS-S2 insertion, the place previously occupied by genes 3a and 3b. A graphical plot of the potential base-pairing score versus the genome position is shown. Each of the three-dimensional lines represent the identity between TRS-L and the sequences corresponding to TRS-S1, TRS-S2, or the different TRS-S2 mutants with increasing complementarity with the leader (M1 to M10). The peaks assigned to the two sgmRNA species are indicated.
|
![]() View larger version (101K): [in a new window] |
FIG. 4. Northern blot analysis of intracellular RNA from TRS-S2 mutants. ST cells were infected with rTGEVs at a multiplicity of infection of 3. Total RNA was extracted at 16 hpi and analyzed by Northern blotting with a probe complementary to the 3' end of the gRNA as described in Materials and Methods. The difference in the sizes of mRNA S and wt TGEV is due to the deletion of ORFs 3a and 3b (918 bp), which have been replaced by the TRS-S1 or TRS-S2 transcription unit (80 bp). Viral mRNAs are indicated on the left side of the figure, and the mRNA-S2 detected in mutants M9 and M10 is indicated on the right.
|
G on sgmRNA-S2 synthesis.
To study the effect of base pairing between the nascent negative sgRNA and the TRS-L on the transcriptional activity of TRS-S2, mRNA-S2 levels were quantified in TRS-S2 mutants by real-time RT-PCR using specific oligonucleotides and the gRNA as an internal standard. The amount of mRNA-S2 in TRS-S2 mutants was expressed in relation to that of an rTGEV including wt TRS-S2 sequences (Fig. 5). Subgenomic mRNA-S2 was expressed at different levels in all TRS-S2 mutants, with relative abundances ranging from 10- to 7 x 104-fold the amount of mRNA-S2 produced from wt TRS-S2 sequences. Just by extending complementarity to TRS-L with a unique nucleotide immediately flanking the CS-S2 at either the 5' or the 3' end (mutant M1 or M5, respectively), mRNA-S2 expression was significantly increased, suggesting that a minimum complementarity of TRS-L to cTRS-S2 was necessary to promote the synthesis of the corresponding sgmRNA from a canonical CS that was previously inactive due to the insufficient complementarity to TRS-L of its adjacent sequences. A good correlation was observed between the mRNA-S2 amount and the value of
G for the duplex formation between the TRS-L and the cTRS-S2 in the mutants, confirming that this thermodynamic parameter is a decisive factor for template switching during the synthesis of coronavirus subgenomic RNAs. In general, for similar
G values, mutants extending complementarity with TRS-L through nucleotides within the 3' TRS synthesized larger amounts of mRNA-S2 than mutants whose complementarity with TRS-L was extended within the 5' TRS (Fig. 5). As an example, M8 extended complementarity with TRS-L through the 5'-GAAA-3' motif within 3' TRS sequences (5'-CUAAACGAAA-3') and expressed 15-fold more sgmRNA-S2 than M4, with similar
G values for duplex formation, but extended complementarity with TRS-L through the 5'-CGAA-3' motif within 5' TRS sequences (5'-CGAACUAAAC-3'). These results indicated that nucleotides adjacent to CS-S2 by the 3' region are more decisive for mRNA synthesis than nucleotides in the 5' region. The differential behavior of mutants that extended complementarity with TRS-L by the 5' TRS or 3' TRS nucleotides is also shown in the graphical representation of the
G values for duplex formation between TRS-L and cTRS-B versus the relative amount of mRNA-S2 (Fig. 6). Two separate representations for 5' TRS and 3' TRS mutants were required, with the mRNA-S2 level scale significantly shifted, suggesting that complementarity with the TRS-L at the 3' TRS region was a stronger determinant for template switching than a similar complementarity at the 5' TRS. In both representations, a tendency to reach a plateau was observed, indicating that during discontinuous transcription, the
G for duplex formation between TRS-L and cTRS-B varied towards a minimum value associated with maximum levels of mRNA synthesis. These data are compatible with the proposed transcription mechanism model postulating a template switch during synthesis of the negative strand and suggest that the CS sequence would behave as a detachment signal for the transcription complex, providing that a minimum
G in the duplex between TRS-L and cTRS-B is reached.
![]() View larger version (41K): [in a new window] |
FIG. 5. Quantification of sgmRNA-S2 by real-time RT-PCR. Shown are the amounts of sgmRNA-S2 in TRS-S2 mutants relative to the wt mRNA-S2 level. The log levels of mRNA-S2 are represented. The data presented are the averages of results from six independent experiments carried out in triplicate in each case. Error bars represent standard deviations. The G value (shown as - G) of the formation of the TRS-L-cTRS-S2 duplex is represented for each virus.
|
![]() View larger version (24K): [in a new window] |
FIG. 6. Relation between mRNA-S2 transcription levels and the G of the formation of the TRS-L-TRS-B duplex. Shown is a graphical representation of the G of the formation of the TRS-L-cTRS-S2 duplex (shown as - G) versus the amount of mRNA-S2 in TRS-S2 mutants, relative to the wt mRNA-S2 level. The upper curve represents the TRS-S2 mutants in which the complementarity with the TRS-L was extended by replacing nucleotides at the 5' TRS (mutants M1, M2, M3, and M4). The lower curve represents the mutants in which the complementarity with the TRS-L was extended by replacing the nucleotides at the 3' TRS (M5, M6, M7, and M8) or at both the 5' and 3' TRSs (M9 and M10). The scale for the relative amounts of mRNA in both representations is significantly shifted.
|
|
|
|---|
Within Nidovirales, it has also been postulated that the RNA secondary structure in the TRS-B regions regulates transcription. Regions with a characteristic secondary structure were proposed either as polymerase-pausing signals during transcription (14) or as binding sites for host proteins (41). However, the role of the predicted TRS secondary structures could not be confirmed for the arterivirus equine arteritis virus (EAV) (22). Similar results were obtained with a bovine coronavirus defective interfering RNA system, in which the inactive canonical TRS of sgmRNA5 was buried in the stem of a stable hairpin. Mutations predicted to unfold the stem and make the canonical TRS accessible for base pairing did not result in its transcriptional activation (21). Predictions of RNA secondary structures for all TGEV TRSs, including the inactive TRS-S2, by the M-fold algorithm (17, 43) did not show any differential features between functional and nonfunctional TRSs (data not shown). A similar analysis was performed with the TRS-S2 mutants described above, and no correlation between the predicted secondary structure and the different degrees of transcriptional activity could be established, indicating that the secondary structure surrounding a body CS motif might not be a strong determinant factor in transcription. According to the proposed model for coronavirus transcription, the helicase activity of the RdRp complex may unwind the double-stranded RNA structures during the synthesis of the negative RNA strand, making the RNA secondary structure of the body TRS not decisive in transcription.
In a previous work on TGEV TRSs, the effect of nucleotide substitutions within the conserved core sequence CS (5'-CUAAAC-3') of leader and body TRSs was analyzed (44). It was shown that the canonical body CS was not absolutely essential for the generation of sgmRNA, since alternative sgmRNAs generated from noncanonical CSs were detected in virus mutants. However, point mutations in the body CS nucleotides reduced by more than 103-fold sgmRNA synthesis compared to that of the wt virus, confirming the requirement of complementarity between CS-L and cCS-B for transcription. This concept was reinforced by showing that sgmRNA synthesis was partially or completely restored by the introduction of nucleotide substitutions allowing the formation of non-Watson-Crick or Watson-Crick base pairs. The extent of sgmRNA synthesis was related to the base-pairing potential between CS-L and cCS-B, calculated as
G values. From these results, it was proposed that the lack of transcription observed for the canonical CS-S2 could be explained by the relatively low potential base pairing and
G value between the TRS-L and cTRS-B, as a consequence of the context sequence surrounding the CS. The results described in this report confirm and extend this hypothesis in which the
G associated with duplex formation between TRS-L and cTRS-B is a determinant factor in the transcriptional regulation of the TGEV coronavirus. Using the TGEV genomic cDNA clone, the TRS-S2 cassette with the wt sequence or an alternative one including mutations that extended complementarity with TRS-L, were inserted at the site of nonessential genes 3a and 3b. We have shown that a minimal increase in the complementarity between TRS-L and cTRS-B by a single substitution in the nucleotides immediately flanking 5' or 3' the CS-S2 (M1 or M5, respectively), promoted the synthesis of detectable amounts of sgmRNA-S2, indicating that sgmRNA synthesis in coronaviruses requires a minimum thermodynamic stability in the TRS-L and cTRS-B duplex. This result implies that complementarity limited to CS-L and cCS-B is not sufficient to drive sgRNA synthesis (Fig. 7), as observed in the arterivirus EAV (26). Therefore, the nucleotides adjacent to the CS are decisive in the duplex formation between the leader and the nascent negative RNA. In fact, in all TGEV genes, complementarity with TRS-L includes at least one additional nucleotide adjacent to the CS (Fig. 7). In addition, mutant M1 (Fig. 1) extended complementarity with TRS-L by one adenine immediately upstream of the CS, generating a sequence similar to that of TRS-S1, which promotes the synthesis of sgmRNA S. A good correlation was observed between the
G of the formation of the TRS-L and cTRS-B duplex and the relative amounts of sgmRNA-S2 in TRS-S2 mutants, confirming that the
G value, as a measure of duplex stability, is a determinant factor during sgRNA synthesis. A similar conclusion was reached for the EAV arterivirus (26), suggesting common elements in the transcriptional regulation of both coronaviruses and arteriviruses, despite the relatively large evolutionary distance.
![]() View larger version (75K): [in a new window] |
FIG. 7. Identity between the TRS-L and TRS-Bs of all TGEV sgmRNAs. The CS sequence is in white letters in a black box. White boxes highlight the identity in the sequences immediately flanking the CS both at the 5' and 3' ends. The transcriptional activity of each TRS is shown in the right column.
|
G values with the increase of sgmRNA levels was observed. Since
G values are directly proportional to the number of complementary nucleotides, that plateau would correspond to a limited extension in the TRS-L region involved in duplex formation with cTRS-B. This observation suggests that during the synthesis of sgmRNA, only a restricted number of nucleotides within the TRS is relevant for the transcription complex. Since there is a correlation between
G values and the amount of sgmRNA, a limited sgmRNA synthesis would also be reached. These results contribute to a more precise definition of the TRS limits in TGEV, consisting of the central CS and about four nucleotides flanking the CS 5' and 3'. However, future knowledge of RNA-protein interactions regulating the transcription process in the TRS region might modify the prediction of the TRS extension based strictly on the base pairing between TRS-L and cTRS-B. These observations are in agreement with previous results reported for MHV (36) and TGEV (2) minigenomes showing that sgRNA levels were not indefinitely increased by extending duplex formation between TRS-L and cTRS-B. This concept could be supported by the RNA secondary structure shown for the leader region of EAV and predicted for other coronaviruses (35). According to this prediction, TRS-L would be exposed in the loop of a hairpin structure. It has been proposed that extending complementarity with TRS-L would be relevant only for nucleotides residing within the loop and not for nucleotides base paired in the stem (26). RNA secondary-structure predictions for TGEV TRS-L show that the 5'-AACUAAA-3' sequence would be exposed in a small loop but that some adjacent nucleotides would participate in a 3-bp stem connecting the small loop with another large loop. Similar structures were predicted for other coronaviruses, in contrast to the unique large loop in the leader region of the EAV genome (35).
It has also been observed that for similar
G values, mutants increasing complementarity with TRS-L by the 3' TRS-flanking nucleotides led to higher levels of mRNA-S2 than mutants that extended complementarity by means of the 5' TRS region, indicating that sequences downstream of the CS are more decisive in the RdRp choice during discontinuous sgRNA synthesis. A similar observation was derived from a bovine coronavirus DI system (21). This feature is most consistent with a model of sequence similarity-assisted, polymerase copy choice strand switching during minus-strand synthesis.
A fundamental aspect of the discontinuous transcription model is the identification of the mechanism promoting RdRp to undergo strand transfer and template switching. According to the model that postulates the RdRp jump during minus-strand synthesis, the CS sequence and most probably the TRS sequences exert an attenuating effect on the RdRp. At this postulated site of RdRp pausing, the transcription complex should evaluate the base pairing between the nascent negative RNA strand and the TRS-L. It can be postulated that the CS within the TRS acts as a dissociation signal from the genomic RNA template in order to stimulate a strand transfer to the leader region, providing that duplex formation between TRS-L and cTRS-B has reached a minimum
G. This model would explain that, for similar
G values, mutants extending complementarity with TRS-L at the 3' TRS synthesize larger amounts of mRNA-S2 than 5' TRS mutants. Since negative RNA synthesis proceeds from the 3' TRS to the CS and then to the 5' TRS, when the transcription complex reaches the CS, in 3' TRS mutants providing additional base pairing within the 3' TRS-B flank, nascent RNA will include an extended complementarity with the TRS-L, leading to a template switch at the CS site with a higher frequency. In contrast, in mutants with an increased complementarity at the 5' TRS, when the transcription complex arrives at the CS, the complementarity of the nascent negative chain with the TRS-L will be restricted to the six CS nucleotides, providing an insufficient
G to promote strand transfer to the leader. When 5' TRS nucleotides are added to the nascent negative RNA, increasing complementarity to TRS-L, the dissociation signal CS will have been overcome, and strand transfer will be a less frequent event.
This hypothesis can be integrated into the previously proposed working model of coronavirus transcription (Fig. 8) (44), including three steps. The first step is the formation of 5'-end-3'-end complexes mediated by protein-RNA and protein-protein interactions, which locate the TRS-L in close proximity to sequences at the 3' end of genomic RNA. In the MHV system several host proteins, such as the heterogeneous nuclear ribonucleoprotein A1 and the polypyrimidine tract binding protein, probably involved in these interactions, have been identified (4, 10, 33). Alternatively, complexes between the TRS-L and the TRSs-B present along the coronavirus genome may have previously been formed based on sequence identity and protein-RNA interactions. The second step is scanning of the nascent negative chain by the TRS-L, which looks for complementary sequence domains leading to a favorable
G. The third step is the template switch of the nascent negative RNA strand to the leader TRS to resume the synthesis of negative sgRNA, when complementarity is above a certain threshold.
![]() View larger version (39K): [in a new window] |
FIG. 8. Three-step working model of coronavirus transcription for subgenomic mRNA synthesis in TRS-S2 mutants extending complementarity with TRS-L either at the 3' TRS or 5' TRS. (A) 5'-3' complex formation step. Proteins binding the 5'- and 3'-end TGEV sequences are represented by the green ovals. The leader sequence, represented as the predicted secondary structure, is in red, and CS sequences are in yellow. (A)n, poly(A) tail. (B) Base-pairing scanning step. Negative-strand RNA is in a lighter color than positive-strand RNA. The replication complex is represented by the hexagon. Vertical dotted bars represent the base-pairing scanning by the TRS-L during synthesis of nascent mRNA at attenuation sites. Vertical solid bars indicate complementarity between gRNA and the nascent negative-strand. (U)n, poly(U) tail. (C) Template switch step. The thick arrow indicates the switch in the template made by the transcription complex to complete the synthesis of negative sgRNA.
|
This work was supported by grants from the Comisión Interministerial de Ciencia y Tecnología (CICYT), la Consejería de Educación y Cultura de la Comunidad de Madrid, Fort-Dodge Veterinaria, and the European Communities (Frame V, Key Action 2, Control of Infectious Disease Projects). I.S., J.L.M., and S.Z. received fellowships from the Community of Madrid and the European Union (Frame V, Key Action 2, Control of Infectious Disease Projects QLRT-2000-00874, QLRT-2001-00825, and QLRT-2001-01050).
I.S. and J.L.M contributed equally to this work. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»