Previous Article | Next Article ![]()
Journal of Virology, September 2005, p. 11443-11456, Vol. 79, No. 17
0022-538X/05/$08.00+0 doi:10.1128/JVI.79.17.11443-11456.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Laboratoire de Biologie Moléculaire, Département des Sciences Biologiques, Université du Québec à Montréal, Québec, Canada
Received 14 March 2005/ Accepted 1 June 2005
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Retrovirus-induced tumors are interesting models to study those different pathways. Indeed, retrovirus pathogenesis mimics classical events in the development of a tumor cell but with a shorter latency. The leukemogenic potential of nontransforming retroviruses is a direct consequence of the random integration of their proviral DNA genome in the host cell DNA during the replicative cycle. Retroviral integration sites, when common to several tumors, are considered significantly associated with tumor induction and/or development. As those loci generally contain oncogenes, retroviruses have been widely used over the years as molecular tools to identify cancer-associated genes. Recently, mouse genome sequencing demon-strated powerful use of retroviruses with the emergence of high-throughput retroviral tagging (30, 32, 45, 50, 68). When performed on a transgenic mouse model overexpressing a known oncogene or a mutated tumor suppressor gene, collaborating oncogenes from a particular pathway could be identified (27, 45, 50). All these data are now listed in the Mouse Retroviral Tagged Cancer Gene Database (http://genome2.ncifcrf.gov/RTCGD).
The radiation leukemia virus (RadLV) is a radiation- induced retrovirus that became leukemogenic after serial passages in mice (61). Originally thymotropic, weakly leukemogenic, nondefective, and ecotropic when isolated from X-ray-induced T-cell lymphomas in C57BL/6 mice (40, 42, 60), serial passages in newborn C57BL/Ka mice gave rise to a nonfibrotropic virus preparation with a very high leukemogenic potential (31, 40, 60). Several lymphoid cell lines were established from tumors induced by the passaged RadLV, including the BL/VL3 cell line (41). Several molecular clones were obtained from the BL/VL3 cell line as proviruses and several variants were characterized (59, 61). More particularly, the highly leukemogenic molecular clone V-13 provides an interesting T-cell lymphoma model, mostly because the disease latency is less than 3 months (60, 61).
Until now RadLV/VL3 has been rarely used in retroviral tagging. Previous studies report common integration near the cyclin D2 gene (Vin1 locus), Pim1, and c-myc in 5%, 30%, and 15%, respectively, of tumors (23, 70). Pim1 and c-myc are both known to be involved in T-cell lymphomas (9, 64, 65, 71). Common integrations in the Kis1 locus on mouse chromosome 2 have also been reported, but, so far, no gene has been associated with this locus (35).
In the present study, we used the CpG island technique of Li et al. (39) in an attempt to identify new proto-oncogenes in RadLV-induced T-cell lymphomas. We provide evidence of RadLV integration in the Notch1 gene. We report the identification of a new common integration site named Kis2 (Kaplan Integration Site 2) which is located on the mouse X chromosome. Retroviral integration was observed in 11% of the tumors and it modifies the expression of two genes in the region.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Genomic DNA and RNA analysis. Genomic DNA was extracted from frozen normal and leukemic tissues by standard procedures as described previously (63). For Southern analysis, 15 µg of genomic DNA was digested overnight with suitable restriction enzyme, separated on a 0.8% agarose gel, and transferred to a nylon membrane (Osmonics, Fisher, Ottawa, Ontario, Canada).
Total RNA was extracted from frozen normal and leukemic tissues with the TRIzol reagent (Invitrogen, Burlington, Ontario, Canada) according to the manufacturer's protocol. For Northern analysis, 15 µg of total RNA was separated on a 1% formaldehyde agarose gel as described previously (63), and transferred to a nylon membrane (Osmonics).
All the membranes were prehybridized for 1 h at 42°C in 50% formamide-5x SSPE (3 M NaCl, 20 mM EDTA, 20 mM sodium phosphate pH 6.8)-5x Denhardt's-0.5% sodium dodecyl sulfate (SDS),-0.1 mg/ml denatured salmon sperm DNA. Hybridization was performed overnight at 42°C in the same buffer containing 10% dextran sulfate. The probes were labeled by the random primer extension method using oligohexamers (GE Healthcare, Baie d'Urfé, Quebec, Canada), and approximately 2 x 106 cpm/ml were used. The membranes were washed for 10 min at 42°C in 2x SSC (1x SSC is 0.15 M NaCl-0.015 M sodium citrate), 30 min at 42°C in 2x SSC-0.1% SDS, and finally for 15 min at 65°C in 0.1x SSC-0.1% SDS. Membranes were revealed by exposure and scanning with a phosphorimager (Bio-Rad, Mississauga, Ontario, Canada).
RadLV integration site amplification by inverse PCR. The viral insertion site amplification involving inverse PCR was performed essentially according to the method of Li et al. (39). Briefly, 2 µg of tumor DNA was digested with SacII and allowed to circularize with T4 DNA ligase (GE Healthcare). Due to the presence of a SacII site in the RadLV/VL3 gag region, DNA molecules that contained a provirus portion and flanking cellular sequences were specifically PCR amplified with a pair of oligonucleotides complementary to the U3 long terminal repeat and the gag region upstream of the SacII site, respectively.
Confirmation of retroviral integration at the Kis2 locus was done by PCR with the Taq polymerase kit (QIAGEN, Mississauga, Ontario, Canada) according to the manufacturer's protocol. Briefly, 200 ng of genomic DNA from tumors was used as the template in a 50-µl reaction, with 10 pmol of either the U3 sense RadLV primer (U3624, 5'-TGGGCCCCGGCTCCGTTAGACATA-3') or the U3 reverse RadLV primer (U3nsens, 5'-GGCTGGGACTTTCCAGAAACTGTTG-3'), and 10 pmol of either Kis2 genomic primer sense (255.4s, 5'-CATGCGCACTACCTCACAGCTAGAT-3') or Kis2 genomic primer reverse (255.4ns, 5'-ATCTAGCTGTGAGGTAGTGCGCATG-3'). The PCR program was one cycle of 94°C for 2 min and 30 cycles of 94°C for 20 seconds, 57°C for 30 seconds, 72°C for 5 min, and one cycle of 72°C for 10 min.
5' and 3' RACE, circularized RNA RT-PCR, and RT-PCR. Both 5' and 3' rapid amplification of cDNA ends (RACE) on Kis2 RNAs were performed using the GeneRacer kit (Invitrogen), according to the manufacturer's protocol. To isolate 3' ends, the reverse transcription reaction was primed with the oligo(dT) anchor of the kit, and PCR was performed with a combination of the 3' anchor primer and a Kis2 primer (oligonucleotide 2, 5'-GTGACATCTCTAGGCCATCTGTGCGGTATGTG-3'). To isolate 5' ends, the reverse transcription reaction was primed with a primer internal to Kis2 (oligonucleotide D, 5'-CTATTCCCGCATCTTCAGCAGCGGCTCCAG-3'), and PCR was performed with the primer specific to the 5' anchor and a nested Kis2-specific primer (oligonucleotide IS, 5'-CAAAACGCGCCAAGCTTAAGCCTCTCCCTCCATTCTC-3'). PCR conditions for both 5' and 3' RACE were 94°C for 5 min, and 30 cycles of 94°C for 30 seconds, 63°C for 30 seconds, 72°C for 1 min 30 seconds, and 72°C for 10 min.
Circularized RNA RT-PCR was performed as described by Kuhn (34). Briefly, 10 µg of total RNA from Kis2 tumors was dephosphorylated, the cap was removed, and the resulting RNA was circularized by ligation. Reverse transcription (Omniscript, QIAGEN) was primed with two different Kis2 primers, either oligonucleotide D or oligonucleotide IS described above, followed by PCR with nested primers. PCR products were loaded on 1% agarose gel and the bands were eluted, cloned, and sequenced.
RT-PCR for detection of Phf6 in tumors was performed on 300 ng of total RNA as follows. Reverse transcription was primed with an oligo(dT) and PCR was performed with primers g49rik1 (5'-CTGTCCCACGAACCTAGTTTCCATTAAG-3') and g49rik2 (5'-CAGGAAATCAAACAGGTACCCAGGAAAATG-3'). PCR was performed with one-half of the reverse transcription product reaction: 94°C for 5 min, and 25 cycles of 94°C for 30 seconds, 62°C for 30 seconds, 72°C for 1 min 30 seconds, and 72°C for 10 min.
Nested-RT-PCR to detect Kis2 expression in normal tissues was performed as follows. Total RNA from normal tissues extracted by TRIzol reagent were subsequently cleaned by phenol-chloroform extraction followed by chloroform extraction to eliminate phenol contamination and ensure reverse transcription reproducibility. reverse transcription was performed with 300 ng of total RNA and primed with the Kis2 primer (oligonucleotide B, 5'-GACATCGCCAAGGCATACTCCAGGAGTGTAACC-3'). The first PCR was done with oligonucleotide B and oligonucleotide IS4bis (5'-ATCGGGCTTTGGGACGCAGCTCGAG-3'), and the following program was used: 94°C for 5 min, and 15 cycles of 94°C for 30 seconds, 62°C for 30 seconds, 72°C for 2 min, and 72°C for 10 min. The PCR products were then cleaned on Qiaquick PCR purification columns (QIAGEN), and 1/30 of the eluate was used in the nested PCR performed with oligonucleotide IS4bis and a Kis2 oligonucleotide nested to oligonucleotide B (oligonucleotide B2, 5'-TCTTCCACGAACTCTTCTTTGCCAGATCTGCAG-3'), following the PCR program 94°C for 5 min, and 25 cycles of 94°C for 30 seconds, 64°C for 30 seconds, 72°C for 2 min, and 72°C for 10 min. The PCR products were analyzed on a 2% agarose gel.
Nested RT-PCR to detect Phf6 transcripts in normal tissues was performed as for Kis2. Reverse transcription was primed with oligo(dT). The primers used in the first PCR were oligonucleotide 1-3R1 (5'-TGGAATTCTCGAAACTACCCCTGGCGACAG-3') and oligonucleotide 5-4Sma1 (5'-TACTGACCCGGGTGTTTCCATTAAGCTGCTG-3'). The two oligonucleotides contained an EcoRI site and a SmaI site, respectively. The PCR program was 94°C for 5 min, and 15 cycles of 94°C for 30 seconds, 56°C for 30 seconds, 72°C for 1 min 30 seconds, and 72°C for 10 min. For the nested PCR, the primers were oligonucleotides 1-3R1 and Phf6TCSma (5'-TACCCCGGGTCCACATTTAGGCCTGGTAT-3') but the PCR program was for 25 cycles. The PCR products were run on a 2% agarose gel.
The ß-actin gene was used as control in all the RT-PCRs performed. The reverse primer was either oligo(dT) or an internal primer (ßact2, 5'-GACGGGGTCACCCACACTGTGCCCATCTA-3'). PCR was performed with 1/10 of the reverse transcription product reaction in the presence of primers ßact2 and ßact1 (5'-CTAGAAGCACTTGCGGTGCACGATGGAGGG-3'). The PCR program was 94°C for 5 min, and 25 cycles of 94°C for 30 seconds, 62°C for 30 seconds, 72°C for 1 min, and 72°C for 10 min.
cDNA cloning of Phf6. A cDNA library was prepared from the Kis2-rearranged tumor T3 with the Superscript Plasmid System with Gateway Technology for cDNA synthesis and cloning kit (Invitrogen). The library was made using 4 µg of polyadenylated RNA, and cDNA ligation product was precipitated prior to electroporation in electrocompetent Escherichia coli DHB10 (Invitrogen); 1 µl of the ligation was used to transform 25 µl of bacteria. A total of 5 x 105 clones were plated on L-broth agar plates (22,5 by 22,5 cm; 8 x 104 clones per plate) containing 100 µg/ml ampicillin, grown overnight at 30°C, transferred on nylon membranes in duplicate and screened with a Phf6-specific probe (RT-PCR product eluted from agarose gel). Positives clones were identified on the original plates, isolated and sequenced.
PHF6 localization in NIH 3T3 cells. NIH 3T3 cells were maintained in Dulbecco's modified Eagle's medium supplemented with 10% calf serum (Invitrogen). The DNA sequence corresponding to the protein encoded by transcripts t2 and t7 from Phf6 was PCR amplified using primers 1-3R1 (5'-TGGAATTCTCGAAACTACCCCTGGCGACAG-3') and 5-4Sma1 (5'-TACTGACCCGGGTGTTTCCATTAAGCTGCTG-3') for Phf6-t7, and primers 5-4R1 (5'-TGGAATTCTCCTTAAAGTGGCAGTCTCTAAAC 3') and 5-4Sma1 for Phf6-t2. PCR products were eluted from agarose gel, and cloned in pEGFP-N1 and pEGFP-C2 (BD Pharmingen, Mississauga, Ontario, Canada). NIH 3T3 cells were seeded at 2 x 105 cells per plates and transfected with the different constructs mixed with Polyfect reagent (QIAGEN) according to the manufacturer's protocol. Fluorescent cells were analyzed 24 h post transfection by confocal microscopy.
Sequence analysis. DNA sequences were performed at Canadian Molecular Research Services Inc. (www.cmrsinc.com; Montreal, Quebec, Canada). Sequence analysis was done with the BLAST program of both databases Ensembl and NCBI. Sequence alignment were performed by CLUSTAL.
| RESULTS |
|---|
|
|
|---|
|
For RadLV/VL3, integrations were clustered in the last exon of the gene. This corresponds to the type II integrations described by Hoemann et al. (24) although, contrary to their study, we found proviruses in both orientations. RadLV/VL3 integrations occur, in terms of protein, before the OPA motif (tumors 153.1 and 271.1), within the OPA motif (tumor 255.6), after the PEST motif (tumor 255.7), or 400 nucleotides downstream of the stop codon in the 3' untranslated region (tumor 137.2). Notch1 activation in leukemia has been reported to be often associated with c-myc (15, 24) but none of our Notch1-rearranged tumors was also rearranged for c-myc (results not shown).
Kis2, a new common integration site. One of the RadLV/VL3 integration site corresponded in the database to a single SL3-3 integration site (67), suggesting that this region could be a common integration site. Southern blot and PCR analyses revealed retroviral integration in this specific locus in 11% of the tumors analyzed (8 tumors out of 72). This new common integration site is situated on the X chromosome, more precisely in a region corresponding to bp 45091000 (XA4 region) according to the mouse genome sequence database (www.ensembl.org). We named this site Kis2 (Kaplan Integration Site 2). Five of the tumors are almost clonal since the rearranged allele appears slightly weaker than the normal one (Fig. 2A and not shown). Note that the normal allele is absent in tumor 2 (T2) because this mouse was a male. DNA rearrangement in the three other tumors (T4, T7, and T8) was not detectable in Southern blots, but retroviral integration was easily detectable by PCR.
|
Figure 2B shows the gene and expressed sequence tag (EST) map of the Kis2 region. We limited the region on one side by the glypican 3 gene (Gpc3), and by the Phf6 gene on the other side. Gpc3 encodes a heparan sulfate proteoglycan which is involved in developmental morphogenesis (13), while Phf6 encodes a protein implicated in a mental disorder called Borjeman-Forsmann-Lehman syndrome (BFLS) (43). A third gene, Rs17, is present and encodes a ribosomal protein. Three ESTs (AI464896 and unigene clusters Mm. 277876 and Mm. 23411) have also been reported in this region (Fig. 2B). All RadLV/VL3 integrations were found clustered in a 4-kbp region between the Rs17 gene and the Mm.277876 EST cluster. SL3-3 and Moloney murine leukemia virus integration sites are also present in this region.
Most of the time, retroviruses were integrated in an antisense orientation with respect to the transcription direction of the Mm.277876 EST cluster (Fig. 2B, tumors T1, T2, T3, T4, T7, and T8), as well as SL3-3. However, in the case of tumors T5 and T6, the retrovirus is integrated in the sense of transcription and closer to Mm.277876 (0.5 kbp). Interestingly, this region was also reported as a common integration site (called XPCL1) for Moloney nurine leukemia virus in p27/ transgenic mice (27). Moloney integrations are localized, although not oriented, to the same area as RadLV integrations (Fig. 2B), and overexpression of EST AI464896 was reported in the Moloney-rearranged tumors. This EST is located 4 kbp downstream of the RadLV and Moloney integrations, and 0.5 kbp downstream of the Mm.277876 cluster (Fig. 2B). However, EST AI464896 expression was not altered in our RadLV-induced tumors (not shown) suggesting that another EST or gene was targeted in our tumors.
Thus, we decided to analyze the expression of all the genes and ESTs present in the entire 450-kbp region. We did not find any change of expression for the GPC3 and Rs17 genes or for Mm.23411 EST. Interestingly, we found that expression of EST from the Mm.277876 cluster was drastically increased in the Kis2 rearranged tumors. Therefore, we believe that the RadLV/VL3 retrovirus, although integrated in the same region as the Moloney murine leukemia virus, is targeting another gene. Also, the expression of the Phf6 gene was affected (see below).
A novel gene is overexpressed in Kis2-rearranged tumors. Each gene and EST present in the region of 450 kbp was tested by Northern or RT-PCR analysis to detect altered expression in the Kis2-rearranged tumors. Mm.277876 expression was drastically altered in all Kis2-rearranged tumors. We named the gene corresponding to the Mm.277876 EST cluster Kis2.
Two clearly different patterns of expression were obtained depending upon the orientation of the provirus. First, five transcripts of different sizes were detected in tumors T1, T2, and T3 (Fig. 3A) and second, the third RNA species was largely overexpressed in tumors T5 and T6, where the retrovirus is integrated in the sense orientation. Overexpression of Kis2 transcripts was barely detectable in tumors T4, T7, and T8 due to the nonclonality of these tumors. In the nonrearranged tumors, the level of expression of Kis2 transcripts remained very low. Interestingly, the BL/VL3 cell line which is a lymphoid cell line established from a tumor induced by the passaged RadLV, showed low but significant quantities of the three longest transcripts. Nevertheless, no integration could be found in the immediate 6-kbp region of the Kis2 locus in this cell line.
|
The five RNA species are mainly nonpolyadenylated, as they are detectable only on Northern blot performed with total RNA. However it is possible to amplify them when performing RT-PCR primed with an oligo(dT), suggesting that at least a little part of these RNAs is polyadenylated. Furthermore, the ESTs from cluster Mm.277876 were obtained from an oligodT-primed library and they contained a 3' end identical to that of those RNAs identified in our 3' RACE study. The canonical AAUAAA polyadenylation signal could not be found in the 3' end of the transcripts. But some genes are known to have alternative signal sequence and we found, at the 3' end, the signal AAAGAA which has been involved in polyadenylation of some non-AAUAAA-containing mRNAs (72). This signal could represent a putative polyA signal of the Kis2 gene.
The longest Kis2 RNA species (t1), which initiates at TSS2 (Transcription Start Site 2), is 2 kb in length and gives rise to species t2, t4, and t5 by alternative splicing (Fig. 3A). Splicing sites are canonical and respect the GT/AG rules. The third RNA species (t3) is initiated from a different transcription start site (TSS3), located 0.6 kbp downstream of TSS2. These two TSSs are themselves different from that of EST BB658090 of the Mm.277876 cluster registered in the database, located 0.4 kbp upstream the TSS2 (TSS1) (Fig. 3A). Thus, The Kis2 gene contains at least three different start sites, each one being localized in a CpG island and sufficiently distant from the others suggesting three different promoters.
The coding potential of the five RNA species has been analyzed by computer and we could not find any open reading frame in their sequences. Thus, we concluded that these RNAs belong to the growing number of noncoding RNA group.
The Kis2 EST unigene cluster (Mm.277876) contains only three ESTs: BB658090 and BB470007, which correspond to the 5' and 3' ends, respectively, of the same cDNA clone isolated from a 12-day embryo eyeball cDNA library, and BB749744, identical to BB470007 and isolated from a pool of tissues. Thus, this gene is very poorly represented in the EST database. In Northern blots and by RT-PCR, the signal corresponding to BB658090 remained undetectable in the Kis2-rearranged tumors as in normal tissues. However, we were able to amplify the transcript by nested RT-PCR (data not shown), confirming the rarity of this transcript.
In normal mouse tissues, Kis2 RNAs were weakly detectable by Northern blot in the thymus only (data not shown), while detection for others tissues was only possible by nested RT-PCR (Fig. 3B), suggesting again a very weak transcription of those RNAs in a normal state. Transcripts 4 and 5 are detectable in almost all the tissues tested but with different intensity, the strongest signal being in thymus. Transcripts 1 and 2 are clearly seen in the thymus and spleen only. In general, in our RT-PCR analysis, we found that the intensity of transcripts 4 and 5 was always higher than that of transcripts 1 and 2. This observation was made in the normal tissue analysis (Fig. 3B) but also in the Kis2-rearranged tumors (result not shown). However, when the same tumors are analyzed by Northern blot for Kis2 expression, we observed a stronger signal for the 2 longest transcripts when compared to the 2 smallest (Fig. 3A). Thus, it appears that the differences observed in signal intensity between the Kis2 long and small transcripts in Fig. 3B is probably the result of a skewed PCR in favor of the smaller transcripts since the same pair of oligonucleotides was used for transcript 1, 2, 4, and 5 amplification. Specific detection of transcript 3 in normal tissues was not possible by RT-PCR because its sequence is entirely included in that of transcripts 1 and 2 (Fig. 3A).
Until now, no orthologues of the Kis2 gene have been found, as it is absent from EST database of other species. Interestingly, a comparison of genomic DNA between mouse, rat, and human revealed some highly conserved regions. In particular, striking sequence conservations are observed in the six regions corresponding to TSSs, splice sites, and a 250-bp sequence located in the last quarter of the Kis2 transcription unit (Fig. 4, region 5). This suggests that the Kis2 gene might be present in other species such as rat and human, even though ESTs have not been reported yet.
|
|
|
The second transcript in size (Phf6-t2) also has 11 exons, but lacks the long 3' untranslated region (Fig. 6). It is weakly expressed in our tumors, as only a very weak signal is observed on poly(A) RNA Northern blots (not shown). Furthermore, it represents only 2% of the ESTs present in the Mm.26870 cluster (see also the legend to Fig. 6). Both Phf6-t2 and Phf6-t3 encode an identical 365 amino acid protein. We could detect expression of t2 and t3 by nested RT-PCR (Fig. 5B) in almost all the normal tissues, with the weakest expression being in muscle. The primers used in this assay did not allow us to distinguish Phf6 transcript 2 from transcript 3. However, we obtained identical results with primers specific to transcript 3 only (reverse primer located at the 3'end of the 3' untranslated region; data not shown). Thus, the signal observed in Fig. 5B corresponds almost entirely to transcript 3.
Interestingly, we also found a third smaller transcript (Phf6-t7) in our cDNA library, lacking exons 2 to 5 inclusively (Fig. 6). This species has never been described before, and it gives rise to a protein lacking the first PHD domain. Its detection required nested RT-PCR and it seemed to be expressed mainly in normal thymus (Fig. 5B).
Analysis of all the ESTs from the Phf6 cluster revealed four other alternative transcripts in the mouse, although they are rarely represented in the database (Phf6-t1, Phf6-t4, Phf6-t5, and Phf6-t6, Fig. 6). Phf6-t1 is composed of exon 1 and a part of intron 1. It contains an open reading frame totally different from that of PHF6, since the normal ATG start codon is in exon 2. Phf6-t4 lacks exons 2 and 3 and codes for a protein that still contains the two plant homology domains but has lost the first 80 amino acids at the N-terminal end. This transcript was detected in thymus, lungs, ovaries, kidneys, liver, and spleen (Fig. 5B). Phf6-t5 has lost exons 6 to 10 and gives rise to a protein lacking the second PHD domain. Phf6-t6 lacks exons 9 and 10, resulting in a protein 41 amino acids shorter at the C-terminal end that has conserved the two PHD domains.
In humans, the PHF6 protein localizes to the nucleolus, and each of the four nuclear localization signals (NLSs) identified seems to be important (43) (Fig. 6). Using enhanced green fluorescent protein (EGFP) fusion protein constructs, we found the same localization in mouse NIH 3T3 cells. Protein from Phf6-t2 did localize to the nucleolus like the human homologue (Fig. 7). Interestingly, protein from transcript 7 of Phf6 (labeled PHF6 t7 on Fig. 7), which lost all four NLSs, still localized to the nucleus but no longer to the nucleolus (Fig. 7, middle panel). These results were the same whether the EGFP was in the N- or C-terminal position in the fusion protein constructs. As NLSs are absent from the PHF6 t7 protein, this suggests that other unidentified NLSs are present in the protein or that the protein interacts with other nuclear proteins to be imported.
|
| DISCUSSION |
|---|
|
|
|---|
First of all, our results show that genes situated quite far from the integration site could be activated as well. This has been previously described in the case of Moloney murine leukemia virus when integrated at the Mlvi4 locus, 300 kbp from the c-myc locus (37). This fact was rarely described in the literature until recently, mainly due to the lack of the appropriate genetic information. Recent high-throughput studies using the mouse genome sequence database have reported common retroviral integrations in regions located much further away from the target gene (11, 32, 68). It is clear that identification of target genes located further away from the common retroviral integration sites will now be possible.
How can retroviral integrations in regions located far away from a target gene act so efficiently on its expression? Specific chromatin structures could be involved so that distant regions are brought closer. Alternatively, key regulatory elements of gene expression could be present at the integration site and be affected by the presence of the retrovirus. So retroviruses could be tools not only to find oncogenes, but also to find their regulatory elements, as already mentioned by Kim et al. (32).
It is also interesting that not all the genes in the region are affected by the integration. This had already been reported in the case of SL3-3 when integrated in the Ras locus: only the expression of Ras is affected, whereas the expression of unr is not, despite its localization close to the integration site (48). This could result from the fact that some genes are located in an active area of the chromatin while others are not. It can also be illustrated by Moloney murine leukemia virus integrations at the XPCL1 locus (Fig. 2B) which affect the expression of EST AI464896 (27), whereas RadLV did not. A response of a given promoter to a specific enhancer could also be involved so that different enhancers would affect gene expression differently. Indeed, long terminal repeat sequences have a great influence on the insertional activation, as recently shown by Johnson et al. (29). In light of this, it would be interesting to know if Moloney murine leukemia virus or SL3-3 enhancers do affect the expression of Kis2 and Phf6 when integrated at the Kis2 locus. Also relevant to this question, it is interesting that antisense retroviral integrations affect transcription from TSS2 and TSS3 in a similar fashion, whereas integrations in the sense orientation affect expression from TSS3 to a much greater extent (Fig. 3A).
Kis2 is a novel gene that has never been described before. It is poorly represented in the mouse database. Indeed, we found that Kis2 is weakly expressed in all the normal tissues, the thymus being the highest expressor (Fig. 3B). To date, no ESTs specific to Kis2 have been described in other species. However, the Kis2 transcription unit, or at least some of its features, seems to be conserved in rat and human, as alignment of the genomic regions reveals highly conserved areas in the sequence overlapping TSSs and splice sites.
The main characteristic of Kis2 transcripts is that they contain only minute open reading frames, thus classifying them as noncoding RNAs. A few years ago, noncoding RNAs were considered noise in the transcriptional system. But recent studies from the human and mouse genome sequencing projects revealed that a real noncoding RNA dimension remains to be explored. The function of most of those noncoding RNAs is still unknown. However, a few of them have now been functionally characterized. Noncoding RNAs are implicated in mechanisms such as imprinting (H19) (57), dosage compensation transcription regulation (Xist [6], Tsix [38], and roX [49]), (SRA [36] and 7SK [33]), regulation of transcript translation by antisense effect involving a particular micro-RNA (lin-4 [54] and let-7 [62]), and finally oxidative stress (adapt33 [73] and gadd7 [25]).
The function of Kis2 remains to be determined. One hypothesis about the role of noncoding transcripts argues against a function of the RNA itself but rather stipulates that only the transcription across the locus would be important, as this is the case for the ß-globin locus control region (16). However, if this is the case for the Kis2 locus, it is difficult to understand the meaning of the three promoters and the six alternative transcripts. Such characteristics have been reported for others noncoding RNAs such as Xist and Tsix, for which no functions for the spliced species have been attributed yet (52, 53, 66). In the case of noncoding RNAs, the sequence could really be important either at the level of antisense functions or as a structure. Because of their sequence differences, the Kis2 RNA species very likely have distinct properties, which argues in favor of a function for the Kis2 RNA per se.
Another interesting particularity of Kis2 RNAs is that they apparently lack a poly(A) tail, as we could not detect them on Northern blot performed with poly(A) RNA. The poly(A) tail is of great importance for mRNA, both at the level of transcription regulation and for stability. Sequence elements promoting mRNA instability have been described, such as those named AU-rich elements, which are located in the 3' untranslated region of mRNA and control their deadenylation (75). Thus, some RNAs could be present as polyadenylated or deadenylated forms as a result of poly(A) tail length regulation, and are qualified as bimorphic RNAs. Alternatively, some mRNAs have sequences in their 3'untranslated region named poly(A)-limiting elements, which limit the length of the poly(A) tail (17, 19). Thus, those RNAs are generated with a poly(A) tail containing less than 20 A's, which leads eventually to transcript instability in response to some regulatory factors. The same situation could occur for the Kis2 mRNAs: They could have a poly(A) tail too short to be successfully purified by oligo(dT) affinity chromatography, but still long enough to be amplified by oligo(dT)-primed RT-PCR. Such poly(A)-limiting element consensus sequences are not present in the Kis2 3' region, although other sequences could be responsible for limiting the poly(A) length, as this is also the case for immunoglobulin M mRNA (58). Finally, the lack of a poly(A) tail of normal length in Kis2 RNAs could be in part responsible for their lower representation in the databases since most cDNA libraries are primed by oligo(dT).
The Phf6 gene was discovered recently in human and was implicated in the Börjeson-Forssman-Lehmann syndrome (43). BFLS is characterized by moderate to severe mental retardation, epilepsy, hypogonadism, hypometabolism, obesity with marked gynecomastia, swelling of subcutaneous tissue of the face, narrow palpebral fissures, and large but not deformed ears (5). The Phf6 gene encodes a 365-amino-acid protein, which has two plant homology domain fingers. Several PHF6 mutations associated with BFLS have been found (43): some are missense mutations, with one of them reported to affect a cysteine in the first plant homology domain, and others are truncation mutations. One of those truncation mutations occurs in the beginning of the protein, resulting in its complete loss, while another one, which appears to be recurrent in BFLS (44), occurs at the end of the protein (R342X).
The multiple symptoms which characterize this syndrome can be correlated with the expression of the gene in a broad variety of tissues, and with the existence of several alternatively spliced transcripts. Although Phf6 expression is detected in most human and mouse tissues, expression is low and only detected by nested RT-PCR. However, in the Kis2-rearranged tumors, its expression was easily detectable by Northern blot and RT-PCR analysis (not shown and Fig. 5). In consequence, this article is the first report suggesting an involvement of Phf6 in the development of leukemia.
PHD finger composition is close to the RING finger and LIM domain (1), which both function in association with two Zn2+ atoms. Although the PHD finger function is still unknown, it is mainly found in proteins involved in transcriptional regulation (1). Two groups can be distinguished: the first one including transcriptional activators, repressors and cofactors (28), and the second one including proteins of chromatin modulating complexes such as acetyltransferases or acetyltransferase-containing complexes such as p300 or CBP (4). To corroborate this, the PHD finger of the transcription factor SPBP is involved in chromatin-mediated transcriptional regulation as a protein-protein interaction domain (46). Furthermore, naturally occurring single amino acid substitutions in the PHD finger of the ATRX and AIRE1 proteins predispose individuals to
-thalassemia, developmental defects and autoimmune diseases (3, 14). Interestingly, ALL1, the human homologue of Drosophila trithorax, and other PHD finger-containing genes are also frequently disrupted by chromosomal rearrangements that occur in acute lymphoid and myeloid leukemias (8, 18, 51, 56, 69).
As in human, the mouse PHF6 protein was found to localize to the nucleolus. This organelle is mainly known as a site dedicated to ribosome biogenesis. Other events take place in the nucleolus, such as for example, the maturation of polymerase III transcripts (7). The nucleolus also seems to behave as a sequestering compartment for regulatory complexes. For example, the p19ARF tumor suppressor, which is implicated in p53 regulation, exclusively localizes to the nucleolus. Cytoplasmic p53 degradation is mediated by the oncoprotein Mdm2 and p19ARF acts by complexing and sequestering mdm2 in the nucleolus, thus preventing its nuclear export and p53 degradation (55). Nucleolus also hosts viral protein, such as HIV-Rev (26). According to these observations, the nucleolus still appears as a "dark box", now including Phf6.
However, it is likely that Phf6 serves fundamental functions in the cell since it is highly conserved among species. Several transcripts are present in the database, but the major one is transcript 3 (Fig. 6). It differs from the others by its long 3' untranslated region, which probably serves some regulatory or stability goals. Transcripts 4 and 6 give rise to a protein which retains the two PHD fingers, while the protein encoded by transcript 5 has lost the second one. Interestingly, we found another alternative transcript (transcript 7, Fig. 6) which has never been described before. It appears very weakly expressed, mainly in the thymus. This splicing eliminates exons 2 to 5, giving rise to a protein initiated at an ATG codon further downstream and thus lacks the first PHD domain and all the identified nuclear/nucleolar localization signals. Furthermore, the corresponding protein no longer localizes to the nucleolus, but still remains in the nucleus.
Finally, it is tempting to make a link between the Kis2 and Phf6 genes which are both expressed simultaneously in Kis2-rearranged tumors. Because of the noncoding nature of Kis2 RNAs, one could imagine a trans-action of these RNAs on the Phf6 locus, which would lead to its overexpression. Otherwise, because of the relative proximity of Kis2 and Phf6 on the chromatin, a cis-action of the transcription across the Kis2 locus on the Phf6 expression, mainly through a chromatin structure remodeling could be imagined.
There is no doubt that the Kis2 locus is involved in thymomas occurrence since it is targeted by three different retroviruses that induce T-cell leukemia: Moloney, SL3-3 and RadLV. Since there is some variability in tumor clonality, some being almost clonal and some not at all, it is tempting to speculate that the role of the Kis2 locus is more oriented towards tumor progression rather than tumor initiation. We must now further characterize Kis2 RNAs and PHF6 protein and determine if either the Kis2 or Phf6 gene or both have oncogenic potential.
| ACKNOWLEDGMENTS |
|---|
This work was supported by grant MOP-37994 from the Canadian Institutes of Health Research.
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|