Previous Article | Next Article ![]()
Journal of Virology, May 2005, p. 5288-5295, Vol. 79, No. 9
0022-538X/05/$08.00+0 doi:10.1128/JVI.79.9.5288-5295.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Ji'an Pan,
,
Yu Chen,
Yalin Yang,
Jing Xu,
Yu Peng,
Ying Wu,
Zhaoyang Li,
Ying Zhu,
Po Tien, and
Deyin Guo*
Modern Virology Research Centre and National Key Laboratory of Virology, College of Life Sciences, Wuhan University, Wuhan, People's Republic of China
Received 13 October 2004/ Accepted 14 December 2004
|
|
|---|
|
|
|---|
A unique feature for coronaviruses and some related viruses in the order Nidovirales is that the viral sgRNAs contain a common leader sequence of 55 to 92 nucleotides (nt), which is derived from the 5' end of the genomic RNA (11). It has been shown that the synthesis of each subgenomic mRNA involves a discontinuous step by which the so-called 3' body sequence is fused to the genomic 5' leader sequence (22). The fusion of leader and body sequences during discontinuous transcription is determined, at least in part, by cis-acting elements termed transcription-regulatory sequences (TRS). These elements are located both at the 5' end of the genome and at 5'-proximal sites corresponding to the individual transcription units (5). Although the mechanism for synthesis of sgRNAs is not fully understood, several models have been proposed. Two major models are leader-primed transcription (10, 12) and discontinuous transcription during minus-strand synthesis (19, 21), and the latter model has gained more support from recent evidence for the existence of transcriptionally active, subgenome-sized minus strands containing the antileader sequence and a transcription intermediate active in the synthesis of mRNAs (21, 22, 23, 24).
The genomes of many SARS CoV isolates have been sequenced, and they consist of approximately 29,700 nucleotides (13, 15, 16). Fourteen ORFs have been identified, of which 12 are located in the 3'-proximal one-third of the genome (13, 25). The exact mechanisms of expression of the 3'-proximal ORFs are unknown, but by analogy with other coronaviruses, these ORFs are expressed through a set of sgRNAs (15). Rota and colleagues could readily identify six sgRNAs, and later Thiel et al. demonstrated the existence of eight sgRNAs in SARS CoV-infected cells (15, 26). However, the exact number and molecular mechanism underlying the synthesis of SARS CoV sgRNAs have not been clarified yet. Therefore, identification of new sgRNAs and characterization of the molecular details of the leader-body fusion in the sgRNAs will help elucidate the regulatory mechanism of SARS CoV transcription and replication, and this knowledge could further be used for development of antiviral therapeutic agents and a vaccine for the cure and prevention of this newly emerged disease.
In this study, we showed the coexistence of both plus- and minus-strand sgRNAs in SARS CoV-infected cells and identified 10 sgRNAs, including two novel subgenomic mRNAs (named 2-1 and 3-1) with noncanonical leader-body fusion sites.
|
|
|---|
Northern blotting. The total cellular RNA from SARS CoV-infected Vero E6 cells was extracted by using Trizol reagent (Invitrogen) according to the manufacturer's instructions. Twenty micrograms of extracted RNA was fractionated in a 1.2% denaturing agarose gel containing 2.2 M formaldehyde with 1x MOPS (morpholinepropanesulfonic acid) buffer (17), transferred to a nylon membrane (Hybond-A; Amersham Pharmacia), and UV cross-linked. The Northern blot was probed overnight at 42°C with 32P-labeled strand-specific single-stranded DNA probes according to the protocol of the manufacturer (Amersham Pharmacia). The signals were detected and analyzed with a PhosphorImager and Image Quant software (Molecular Dynamics). The membrane was stripped of the first probe according to the protocols provided with the Hybond A membrane and was reprobed with the second probe.
The negative probe was complementary to the 3' ends (positions 29421 to 29725) of SARS CoV mRNAs and was used to detect plus-sense sgRNAs. The positive probe, complementary to the 5' ends of viral antisense RNAs, was used to detect minus-strand subgenomic RNAs.
Reverse transcriptase PCR (RT-PCR) of SARS CoV minus-strand RNA and subgenomic mRNAs. One microgram of total cellular RNA, extracted from SARS CoV-infected Vero cells, was reverse transcribed into single-stranded cDNA with Moloney murine leukemia virus reverse transcriptase (Promega). Oligo(dT)15 or the strand-specific oligonucleotide SR18 (Table 1) was used to prime cDNA synthesis from plus-sense RNAs, while oligonucleotide SF8 (Table 1), which is complementary to the antileader sequence, was used for cDNA synthesis from minus-sense viral RNAs under the conditions recommended by the manufacturer (Promega). A 0.2-µl amount of cDNA product from the RT step was used for PCR. Primers for PCR (Table 1) were originally designed on the basis of the published SARS virus genome sequences of strains BJ01 (accession number AY278488), HKU (accession number AY278491), and Urbani (accession number AY278741) by using OLIGO 4.1 (National Biosciences).
|
View this table: [in a new window] |
TABLE 1. Oligonucleotides used for RT-PCR analysis of SARS CoV subgenomic RNAs
|
competent cells. Screening was done by colony PCR and restriction endonuclease digestion, and multiple independent cDNA clones were selected and sequenced for each species of subgenomic RNA. The sequencing reaction was carried out by using AmpliTaq DNA polymerase and universal primers with the Big Dye Terminator cycle sequencing ready reaction kit (PE Applied Biosystems) and analyzed on an ABI Prism 377 DNA sequencer (PE Applied Biosystems). |
|
|---|
|
View this table: [in a new window] |
TABLE 2. Names of SARS-CoV mRNAs and ORFs or genes
|
![]() View larger version (14K): [in a new window] |
FIG. 1. Schematic representation of the genomic and subgenomic organizations of SARS CoV. The genome organization is based on the sequence of the SARS WHU isolate. In the upper panel, the genomic structure is shown. Known and potential ORFs are indicated by open boxes and are not to scale. The leader region is represented by a small solid box, and the poly(A) tail is represented by AAA. ORF 8a of isolate WHU contains a 2-nucleotide deletion and thus gives rise to a small ORF of 24 amino acids. Positions of forward (SF8 and -9) and reverse (SR11 to -18) primers used for cDNA synthesis and PCR amplification of different subgenomic RNAs are indicated by arrows under the genome. The bottom panel illustrates the 3'-coterminal nested set of mRNAs detected in this study. The small black boxes at the 5' ends of the genomic and subgenomic RNAs represent the common leader sequence. The first (grey boxes) and second (open boxes) ORFs that are located at the 5'-proximal end and may be expressed from the mRNAs are shown.
|
![]() View larger version (53K): [in a new window] |
FIG. 2. Northern blot analysis of SARS CoV subgenomic mRNAs Twenty micrograms of total cellular RNA from SARS CoV-infected Vero E6 cells was separated by electrophoresis through a 1.2% denaturing agarose gel containing 2.2 M formaldehyde. The resolved RNA was transferred to a nylon membrane, and 32P-labeled antisense and sense probes containing 305 nucleotides (positions 29421 to 29725) from the 3' end of the SARS CoV genome were used to detect subgenomic mRNAs and minus-strand subgenomic RNAs, respectively. The mRNA designations and their approximate sizes (in parentheses) are indicated on the right. (A) Subgenomic mRNAs detected by the negative probe; (B) minus-strand subgenomic RNAs detected by the positive probe.
|
The rationale for the cloning is that the subgenomic mRNAs of coronaviruses are 3' coterminal to the viral genome and possess a common leader sequence of about 70 nucleotides derived from the 5' end of the viral genome. Thus, cloning of the junction region of the RNA leader and body sequences would reveal the existence of the corresponding sgRNAs. To clone each possible sgRNA, we designed two common primers (SF8 and SF9) complementary to the antileader sequence and reverse specific primers (SR11 to -16) 200 to 600 nt downstream from the start codon of each ORF (Table 1 and Fig. 1). For cloning of the leader-body junction regions of plus-sense mRNAs, the cDNA was synthesized from total RNA of SARS CoV-infected cells with oligo(dT) and the strand-specific primer SR18. Using the common primer SP8 and one downstream specific primer, the junction sequences of various sgRNAs were amplified by PCR (Fig. 3A). As shown in Fig. 3A, all of the primer combinations amplified at least one major band of the expected size. It was not surprising that some primer combinations gave rise to multiple PCR bands, because the same primer combination could produce larger PCR fragments which correspond to the next-larger sgRNAs; e.g., primer SR14, intended to amplify the leader-body fusion sites of mRNA 7, could also amplify those of mRNAs 6 and 5 (Fig. 3A, lane 4), and primer SR15 could amplify the junction sequences of mRNAs 8, 7, and 6 (Fig. 3A, lane 5). It was expected that primer SR16, located in ORF N, could gave rise to multiple bands, but only one major band corresponding to the junction region of mRNA 9 was observed (Fig. 3A, lane 6). This was probably due to the high abundance of mRNA 9 and its preferred binding with the primer SR16.
![]() View larger version (115K): [in a new window] |
FIG. 3. RT-PCR analysis of SARS CoV subgenomic RNAs. One microgram of total cellular RNA from SARS CoV-infected Vero E6 cells was reverse transcribed and amplified by PCR with different combinations of forward (SF9) and reverse (SP11 to SP16 [lanes 1 to 6, respectively]) primers. The bands representing the specific SARS CoV sequences are indicated by arrowheads. The bands which revealed two novel subgenomic RNAs are boxed. Lanes 1, mRNA 2 (arrowhead) and mRNA 2-1 (boxed faint band in panel A); lanes 2, mRNA 3 (major band) and mRNA 3-1 (boxed); lanes 3, mRNA 5 (lower major band) and 4 (upper minor band); lanes 4, mRNA 7 (lower band), mRNA 6 (middle band), and mRNA 5 (upper band); lanes 5, mRNA 8 (lower band), mRNA 7 (middle band), and mRNA 6 (upper band); lane 6, mRNA 9. (A). The cDNA used for PCR was made with oligo(dT) or SR18 primer, and thus the sequence of corresponding plus-strand RNA was amplified. (B). The cDNA used for PCR was made with primer SF8, which is complementary to the antileader sequence, and therefore the sequence of corresponding minus-strand RNA was amplified.
|
![]() View larger version (38K): [in a new window] |
FIG. 4. Leader-body fusion sites of subgenomic mRNAs and their corresponding intergenic sequences. The 5' genomic leader TRS is in italic. The hexanucleotide core sequence of the TRS is indicated in boldface, and the mismatched nucleotides with the leader core sequence (ACGAAC) are in lowercase. (A) Leader-body junction sites of subgenomic mRNAs in comparison with the genomic leader TRS. The junction sequences in subgenomic RNAs are underlined. (B) The TRS in the intergenic regions. The body sequences that are fused with the 5' leader are underlined.
|
Identification of novel subgenomic RNAs. The Northern blots could readily detect eight subgenomic RNA bands, and the sequencing confirmed their existence. PCR amplification is generally more sensitive to detect sgRNAs with low abundances. By carefully sequencing the minor bands amplified by RT-PCR (Fig. 3), two novel sgRNAs which could not be revealed in Northern blots were found, and these were named 2-1 and 3-1 (Fig. 1 and 4). sgRNA 2-1 (minor band in Fig. 3A, lane 1) was obtained with primer SR11, which amplified the S junction region as the major band (Fig. 3A). The leader-body fusion site (ACGAGC) of sgRNA 2-1 (Fig. 4A) is located inside the S gene, 384 nucleotides downstream from the authentic core sequence (ACGAAC) for mRNA 2/S. In sgRNA 2-1, the first AUG codon is followed immediately by a stop codon, UAA, and the second AUG is 43 nt downstream and in the same reading frame of the S gene, which could result in the synthesis of a truncated S protein (named S') missing the N-terminal 143 amino acids. The corresponding ORF is named ORF 2b. By fusion of the green fluorescent protein gene with the 5' part of sgRNA 2-1, a fusion protein could be detected, indicating the translatability of this sgRNA (data not shown). However, the existence of protein S' in infected cells is yet to be determined.
The second novel sgRNA (3-1) (minor band in Fig. 3A, lane 2, and B, lane 2) corresponded to ORF 3b, which had been predicted to be expressed from mRNA 3 (25, 26). The leader-body fusion site (AAGAAC) for subgenomic mRNA 3-1 is 10 nucleotides upstream of the AUG start codon of ORF 3b and has a mismatch (underlined) with the leader core sequence (CS-L) (ACGAAC) of SARS CoV. Therefore, the existence of sgRNA 3-1 may indicate that ORF 3b could be expressed from a separate mRNA other than mRNA 3. The expression of 3b from sgRNA 3-1 was subsequently verified by fusion with the green fluorescent protein gene (data not shown).
The leader-body fusion sites of both sgRNA 2-1 (ACGAGC) and sgRNA 3-1 (AAGAAC) (Fig. 4A) have one nucleotide difference (underlined) from the core sequence (ACGAAC) in the leader TRS (TRS-L) of SARS CoV (26) but are identical to the core sequence of TRS-B (Fig. 4), which is consistent with previous findings that the core sequence in subgenomic mRNAs is derived from the body TRS but not from the leader TRS (20, 27, 30).
After sequencing of 12 independent clones of the junction region of mRNA 3-1, one clone showed a variant fusion site within or upstream of the AAA sequence motif (Fig. 5B), which is three nucleotides preceding the body core sequence (CS-B) (AAGAAC) for mRNA 3-1. This variant sequence was confirmed by sequencing another set of independent clones and may not represent a random event of template switching around the CS-B (AAGAAC) during the discontinuous transcription.
![]() View larger version (40K): [in a new window] |
FIG. 5. Junction sequences of SARS CoV mRNA 3-1 and models for template switch. The upper strand in the alignments represents the intergenic region of mRNA 3-1, and the lower strand is the genomic leader sequence. Dots indicate identity between the sequences. The conserved hexanucleotide core sequence is shaded, and the possible site for the template switch is indicated by arrow. The nucleotides in color are derived directly from the sequence profiles. (A) Leader-body fusion site of mRNA 3-1; (B and C) junction sequence and models of template switch of a rare variant of mRNA 3-1.
|
We also analyzed viral RNAs prepared from viruses at different passages for 4 months. Sequencing of multiple cDNA clones of each mRNA showed that the fusion sites were stable for all mRNAs and did not change over time with virus passage.
|
|
|---|
The synthesis of subgenomic mRNAs of coronaviruses involves a discontinuous step in which the 5' leader and 3' body sequences of mRNA are joined through the transcription-regulating sequences in the 3' end of the leader and in the intergenic region preceding each mRNA body (30). Although the molecular details of the discontinuous RNA transcription are not completely known, the discovery of transcriptionally active, subgenomic-size minus strands containing antileader sequence (23, 24) favors the model of discontinuous transcription during the minus-strand synthesis (19). In this report, we have shown the coexistence of both plus- and minus-strand subgenome-length RNAs in SARS CoV-infected cells, consistent with previous findings for other coronaviruses (1, 2). Therefore, the present data are more compatible with the discontinuous minus-strand synthesis model.
The proposed coronavirus discontinuous transcription mechanism implies a close interaction between leader TRS (TRS-L) and complementary body TRS (cTRS-B) in the intergenic region (30). The eight sgRNAs (mRNAs 2 to 9) of SARS CoV, which are easily detected by Northern blotting, possess junction sequences of 6 to 12 nucleotides, all containing the canonical core sequence (5'-ACGAAC-3'). The 100% identity of leader and body core sequences for these eight sgRNAs made it impossible to judge the origin of the junction sequences (from TRS-L or TRS-B) and the template switch site within the TRS. However, identification of two novel SARS CoV sgRNAs with noncanonical fusion sites shed light on these questions. The CS-Bs of both mRNA 2-1 (ACGAGC) and mRNA 3-1 (AAGAAC) (Fig. 4) contain one-nucleotide mismatches (underlined) with the CS-L (ACGAAC), but the sequence patterns of CS-B were retained in the junction region, indicating that the junction sequences of coronavirus sgRNAs originate from the CS-B, and this, in turn, supports the discontinuous minus-strand synthesis model. While the mismatch in mRNA 3-1 is at the second position of the hexanucleotides, the template switch can be envisaged to take place at the 3' end of the nascent minus-strand RNA (Fig. 5A), again reinforcing the model of discontinuous transcription at minus-strand synthesis (19).
In this study, a rare species of mRNA 3-1 which contains a junction of only three nucleotides (AAA) was discovered (Fig. 5B and C). Although transcription of this subgenomic RNA could represent a rare event for SARS CoV, it did give more evidence for the use of noncanonical transcriptional signals in synthesis of sgRNAs. The template switching takes place at the sequence motif AAA, just preceding the leader core sequence (CS-L) and 3 nt upstream from the complementary body core sequence of mRNA 3-1. Two models could be proposed for the synthesis of this rare RNA: (i) the AAA is used directly as a transcription-regulating signal, and the complementarity between CS-L and CS-B takes place in the AAA region during the template switch step (Fig. 5B), and (ii) the interaction and complementarity between CS-L and CS-B are the same as those of mRNA 3-1, but the RNA polymerase can slide 4 nucleotides back on the leader template (Fig. 5C). In another coronavirus, mouse hepatitis virus, the UUA sequence was characterized as a noncanonical site for subgenomic RNA synthesis (28), implying that the AAA sequence in a specific sequence context in SARS CoV might also suffice for subgenomic synthesis and thus supporting the former model.
Although we have shown that the mRNAs 2-1 and 3-1 identified in this study could be functional messages, we have not identified their natural expression product in SARS CoV-infected cells due to the strict control on, and later prohibition against, using living SARS virus. According to the sequence of mRNA 2-1, it can lead to translation of a truncated S protein (S'). A similar truncated S protein has been reported for porcine respiratory coronavirus (3, 18). Currently, we are making efforts to construct an infectious cDNA clone of SARS CoV, and the use of reverse genetics will be helpful to elucidate the molecular mechanism of the discontinuous transcription and to reveal the biological functions of the new sgRNAs and their encoded proteins in the viral life cycle and pathogenesis.
This study was supported by China 973 basic research program grant 2003CB514102, China NSFC grant 30270313, and special SARS research funding from Hubei Province and Wuhan University. S.H. is supported by the China Scholarship Council and HEC Pakistan. D.G.'s lab is supported by the startup package and Luojia professorship program of Wuhan University.
S.H. and J.P. contributed equally to this work. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»