Previous Article | Next Article ![]()
Journal of Virology, September 2002, p. 9046-9059, Vol. 76, No. 18
0022-538X/02/$04.00+0 DOI: 10.1128/JVI.76.18.9046-9059.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Laboratory of Molecular Biology, Clinical Research Institute of Montreal, Montreal, H2W 1R7 Quebéc,1 Département de Médecine and,2 Département de Microbiologie et d'Immunologie, Université de Montréal, Montréal, H3C 3J7 Quebéc,3 Department of Medicine, Division of Experimental Medicine, McGill University, Montreal, H3G 1A4 Quebéc, Canada4
Received 21 March 2002/ Accepted 13 June 2002
|
|
|---|
|
|
|---|
In view of these findings, we have hypothesized that a novel gene within the Ahi-1 locus is targeted by provirus insertion and that its dysregulation contributes to tumor development. We report here the identification of that gene, Ahi-1, which is the target of these Ahi-1 and Mis-2 provirus insertions and which encodes a protein containing an SH3 motif and a WD40-repeat domain.
|
|
|---|
DNA extraction and restriction endonuclease digestion. DNA extraction, digestion with restriction endonucleases, separation of DNA fragments by agarose gel electrophoresis, and hybridization with 32P-labeled probes by the method of Southern were performed as described previously (41).
Molecular cloning of mouse genomic DNA. The molecular cloning of the provirus with the adjacent Ahi-1 sequences has been described elsewhere (41). To initiate chromosome walking, cosmid and phage libraries were constructed by partial digestion of C57BL/6 mouse liver DNA with Sau3A and ligation into the BamHI site of the pWE15 and EMBL-3 vector, respectively, as described elsewhere (41, 48, 62). Molecular clones (Cos1, Cos2, Cos3, and pSW4) were isolated by screening these libraries with mouse Ahi-1 probes E1.9 and G (Fig. 1A). To analyze these clones in more details and to derive additional probes necessary for a finer mapping of this region, a restriction map of the Ahi-1 region was derived and additional single-copy fragments were isolated and subcloned (Fig. 1A). This map was confirmed by digesting normal mouse cellular DNA with various restriction endonucleases (P.L. Pharmacia, Montreal, Quebec, Canada) and hybridizing them with different probes derived from this region (data not shown), essentially as previously described (41, 48).
![]() View larger version (20K): [in a new window] |
FIG. 1. Mapping of the MuLV proviruses within the Ahi-1 locus. (A) Structural organization of the proviruses integrated at the 3' end of the Ahi-1 gene. The restriction endonuclease map of 80 kbp of the Ahi-1 region was determined with phage (pSW4) and cosmid (Cos1, Cos2, and Cos3) clones and confirmed by Southern blot analysis of mouse DNA. The localization of the proviruses relative to the exons is shown. The intron-exon boundaries of exons 25 and 26 have been confirmed by sequencing genomic fragments. The provirus integration within the rat Ahi-1 locus in tumor 1213-10 was detected with probe H but could not be mapped precisely because of the lack of restriction map in the rat. Symbols: open and strippled boxes, coding and noncoding exons, respectively; line, intronic sequences; closed horizontal boxes, single-copy fragments subcloned and used as probes; thin vertical arrows, sites of provirus integration in A-MuLV-induced pre-B-lymphomas; large vertical arrows, sites of provirus integration in T-cell thymomas of Mo-MuLV-infected MMTVD/myc Tg mice; horizontal thin arrows, orientation of provirus (5' to 3'); horizontal thick arrow, orientation of Ahi-1 transcription; asterisks, stop codons. Restriction sites: B, BamHI; Bg, BgII; E, EcoRI; K, KpnI; S, SmaI. Not all sites are indicated. (B) Site of the provirus integration within the Ahi-1 gene in tumor T3465. Sequences were obtained from a PCR product ( 2.0 kbp) generated with primer 256 localized in exon 26 and primer 112 within the Moloney MuLV LTR. The sequences of the intron-exon boundaries are shown, respectively, in small and capital letters and the viral sequences in boldface. Encoded amino acids are shown in the lower line and the numbers refer to the amino acid position in the sequence shown in Fig. 3.
|
Isolation of cDNA clones and sequencing.
A rat brain
gt11 cDNA library was screened by using the RT-PCR fragment A (Fig. 2B). Two other mouse testis and thymus cDNA (the Uni-Zap XR library [Stratagene] libraries and the 5' stretch library [Clontech]) were also screened with cDNA probe E1.3 in order to get a full-length cDNA. Furthermore, the missing 5' end of the cDNA was cloned by 5' RACE (rapid amplification of cDNA ends) as described previously (16) into a dT-tailed vector. In addition, mouse brain RNA was used to amplify mouse cDNA sequences by RT-PCR with specific primers (that is, 276 and 296) derived from the rat cDNA sequences (Fig. 2A). Several clones were isolated: seven overlapping fragments from the rat brain cDNA library, eight overlapping clones, and five full-length clones from the mouse testis and thymic cDNA libraries, respectively. These clones were subcloned into pGEM-3, analyzed with restriction endonucleases, and sequenced by the dideoxynucleotide Sanger's method (49) in both directions.
![]() View larger version (18K): [in a new window] |
FIG. 2. Structure of the Ahi-1 cDNA clones. (A) Two overlapping rat cDNA clones, obtained from a brain cDNA library, are shown with their positions from 5' to 3'. Specific primers used for generating mouse cDNAs are indicated. (B) The complete Ahi-1 cDNA sequence was derived with eight overlapping mouse cDNA clones from two testis cDNA libraries (E and EX) and by 5' RACE. The methionine initiation codon, the termination codon, and the consensus polyadenylation signal are indicated. Symbols: thin line, Ahi-1 untranslated region; closed box, Ahi-1 coding sequences; horizontal lines, Ahi-1 overlapping cDNA clones; open box, the location of the RT-PCR product-A. The numbers refer to the position of the nucleotides in Fig. 3. Numbers in parentheses represent the names of each clone (E or EX) with their respective lengths given in kilobase pairs.
|
RNA expression by Northern analysis. RNA was isolated from various tissues by the method of Chomczynski and Sacchi (6), separated on 1% formaldehyde-agarose gels, transferred onto nylon membranes, and hybridized as described above. Washings were performed for 15 min at room temperature in 2x SSC (1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate), for 1 h at 65°C in 0.1x SSC-0.5% sodium dodecyl sulfate, and in 0.1x SSC three times for 2 min each time at room temperature. The oligonucleotide used to detect the 18S rRNA was 5'-ACGGTATCTGATCGTCTTCGAACC.
PCR and RT-PCR analysis. Total RNAs from the mouse tumor cells, from Jurkat cells or from mouse thymus (5 µg) were transcribed with Moloney MuLV reverse transcriptase as described previously (16). PCR was performed on the cDNAs obtained by using the following primers. For analysis of the different splicing forms of the 5'-untranslated human Ahi-1 sequences, the primers used were primer 910 (5'-GGGAGTTGATTTGCACTGCTC) and primer 899 (5'-ATCTCCTTTGCCATTTCTTCAG), which are located on exons 1 and 4, respectively (see Fig. 5C). For analysis of the novel sequences of exon 24 present at 3' end of the AK000076 cDNA clone, we used primers 897 (5'-AGACCGACAGTCACTTTGCTG; exon 23/24) and 898 (5'-TCCCAATATTTTATGAGTTTCAAAG; exon 24). For analysis of the 3' end Ahi-1/viral fused transcripts, in Mis-2 and Ahi-1 (including L48) rearranged tumors, the primer 828 within the U5 LTR (5'-AGTGATTGACTACCCGTCAGC), and a primer in exon 16 (primer 276, 5'-GAGAACAATGGTTGCGTTTTG), exon 17 (primer 277, 5'-CGAGTCTCTGTTTCACAAGC), exon 19 (primer 264, 5'-CACCAGGCCCTGCAAAGAAG), or exon 22 (primer 265, 5'-GTTAAAGGAGGGGACGCTC) were used. The PCR products obtained were purified on agarose gels, cloned, and sequenced. To confirm the presence of the novel sequences at the C terminus in the mouse EST clones BB615071 (exon 20) and BG29736 (exon 24), a primer in either exon 20 (primer 1311, 5'-CTGTCACAAATTCTCTGGTCAGG) or exon 24 (primer 1312, 5'-CATTCCTTGGTCCAGCAGCAG) and the primer 264 (exon 19) were used.
![]() View larger version (33K): [in a new window] |
FIG. 5. Structure of the mouse and human Ahi-1 cDNA isoforms. (A) Schematic alignment of mouse and human Ahi-1 cDNA clones. The numbers refer to the positions of the nucleotides. The open reading frames are indicated by thick bars, boxes, and triangles, and the untranslated sequences are indicated by thin lines. In-frame stop codons are indicated by asterisks. The polyadenlyation signals are shown for some cDNAs. All strippled bars represent alternative exons: exons 24 and 20 in mouse and exons 24 and 28 in human Ahi-1 cDNA. Note that the 5' end (nt 1 to 402; ) and the 3' end (nt 3190 to the end) sequences of the full-length mouse Ahi-1 are more divergent from the corresponding human Ahi-1 sequences. Also note that the 5' end sequences (nt 271 to 604; aa 19 to 604; shaded bar) of human Ahi-1 are not present in mouse Ahi-1. The presence of exon 24 (containing a termination codon) in human Ahi-1 cDNA has been confirmed by RT-PCR analysis on a Jurkat cell RNA with primer 897 within the region spanning exons 23 and 24 and primer 898 in exon 24. Similarly, the presence of exon 20 (containing a termination codon) in the mouse Ahi-1 gene was confirmed by RT-PCR on brain RNA with 5'primer 264 in exon 19 and 3' primer 1312 in exon 24. Note that the 5'-untranslated sequences (nt 1 to 250) of the human Ahi-1 identified in our laboratory have been added to the nucleotides listed in the GenBank for each EST clone (AL136797, AK000076, and AK024085). (B) Nucleotide sequence of the 5'-untranslated human Ahi-1. RT-PCR from Jurkat cells was performed with the primers 910 and 899 located, respectively, in exons 1 and 4. The PCR products were cloned and sequenced. The putative ATG translation initiation codon is underlined. Vertical arrrows indicate intron-exon boundaries. Uppercase letters, coding sequences; lowercase letters, untranslated nucleotides. (C) Genomic organization of the 5'-untranslated region of human Ahi-1. Boxes indicate exons. The lines between the boxes represent introns. Translation initiation codon is indicated over exon 4.
|
2.0 and
2.2 kbp were amplified from T3465 and 1213-19 cell DNA, respectively, purified on agarose gels, cloned, and sequenced. Sequence analysis. Homology searches were carried out by computer-assisted nucleotide and protein blasts on the NCBI and EBI Web sites by using the human and mouse genome databases. Search for known domains were done by using the conserved domain database at NCBI with the latest versions of SMART v.3.3, Pfam v6.6, PSPOR II, and ScanProsite programs.
Nucleotide sequence accession numbers The nucleotide sequence accession number of the full-length mouse Ahi-1 cDNA (Fig. 3A) is AY133241; the accession number of clone I of mouse Ahi-1 cDNA (Fig. 5A) is AY133242; the accession number of the untranslated 5'-end of human Ahi-1 cDNA (Fig 5B) is AY133243; and the accession numbers of fragments 1, 2, 4, and 5 of the rat Ahi-1 genome (Fig. 6A) are AY133244, AY133245, AY133246, and AY133247, respectively.
![]() View larger version (103K): [in a new window] |
FIG. 3. Nucleotide sequence of mouse Ahi-1 cDNA. (A) The nucleotide sequence of the Ahi-1 cDNA and the deduced amino acid sequence of the Ahi-1 protein are shown. Amino acids are given in the single-letter code. The conserved SH3 domain is underlined with a dashed line. The WD40 repeats are underlined. (B) Structure of the putative Ahi-1 protein. A summary of structural motifs present in the Ahi-1 protein is shown, including: one SH3 domain (black box), seven WD40 repeats ( ), an acidic-rich domain ("C-rich") ( ), nine proline-rich motifs (P), and three PEST sequences ( ).
|
![]() View larger version (39K): [in a new window] |
FIG. 6. Genomic organization of the mouse Ahi-1 gene and mapping of Mis-2 integration sites. (A) Physical map and schematic representation of exons 1 to 27 of the mouse Ahi-1 gene identified from 11 nonoverlapping combined contigs (A to K) drawn from the preliminary assembly of the mouse WGS reads based on an 15 October 2001 freeze at the Sanger Center and Whitehead Institute genome projects data bank. The contigs were as follows: B, c094702989 (7,308 bp); C, c072403314 (15,675 bp); D, c013001924 (9,046 bp); E, c052402941 (16,394 bp); F, 203795 (5,800 bp); G, c038402794 (11,198 bp); H, c047902506 (15,808 bp); I, c030002862 (8,235 bp); J, c114704732 (4,962 bp); and K, c098905234 (1,959 bp). The 5' and 3' end of Ahi-1 and c-myb genes are indicated. The Ahi-1 and Mis-2 provirus integration sites are indicated. Black boxes represent coding exons, and the white box represents an untranslated exon. Strippled boxes are the rat DNA fragments (Fr 1 to 5) previously subcloned (62) around the Mis-2 provirus integrations. The exons coding for the SH3 and WD40 domains are also shown. Symbols: M, initiation codon; asterisks, termination codons. (B and C) Mapping and genomic organization of the proviruses integrated at the Mis-2 locus in tumors 1213-2 (B) and 1213-19 (C). Note the large deletion of the provirus up to the end of the env gene, with the remaining 3' end env and LTR in tumor 1213-2. Also note the perfect direct repeats (D.R.; 228 bp) flanking the deleted provirus. Black boxes represent exons 16 and 17, and the line represents the intron sequences. (D) Sequences of the intron-exon 14 and 15 junctions in rat DNA. Uppercase and lowercase letters: exon and intron sequences, respectively.
|
|
|
|---|
14% and to be integrated in the same orientation as those found in A-MuLV-induced lymphomas (Fig. 1A). Provirus integrations within Ahi-1 were also observed in 1 (i.e., 1213-10) of 20 distinct DNAs (5%) from Moloney MuLV-induced rat thymomas (series 1213, Fig. 1A). These insertions were mapped in the same clusters of provirus integrations as those found in Abelson virus-induced tumors (Fig. 1A). Therefore, rearrangements of the Ahi-1 locus appear to be involved in the development of lymphomas from different cell lineages.
![]() View larger version (39K): [in a new window] |
FIG. 7. Expression of the Ahi-1 gene in normal tissues and tumors: (A) RNAs (20 µg) from various organs (lanes 1 to 11, as indicated) of a C57BL/6 mouse were screened by Northern blot analysis with Ahi-1 cDNA clone-E1.3 as a probe. Exposure time (EXP) is indicated in hours (h) or days (d). (B) RNAs (20 µg) from Moloney MuLV-induced rat thymomas 1213-3, 1213-8, and 1213-10 (lanes 1 to 3) and rat thymus (lane 4) were hybridized with the 32P-labeled rat cDNA probe-B. (C and D) Northern blot analysis of RNAs (20 µg) from A-MuLV-induced pre-B cell tumors (lanes 1 to 11) (C) or from cell lines established from thymomas of MMTVD/myc Tg mice infected with Moloney MuLV (D). Hybridization was with the mouse Ahi-1 cDNA probe E1.3. (E) Analysis of truncated Ahi-1 transcripts. RT-PCR amplification was performed on tumor RNAs (5 µg) from various tumors showing Ahi-1 rearrangement. RNAs were reversed transcribed in the presence of random hexamers. The cDNAs were then amplified by PCR with the indicated primers specific for the Ahi-1 sequences and the viral LTR. The amplified fragments were separated on 1% agarose gels, purified, cloned, and sequenced. In panels A to D, the blots presented were rehybridized with a 18S ribosomal probe (bottom). In panels B to D, the presence (+) or absence (-) of proviral rearrangement (Rearr) in tumor DNAs is indicated.
|
Molecular cloning and sequencing of mouse and rat Ahi-1 cDNAs. To identify the Ahi-1 gene and to map the provirus integration sites relative to this gene, we cloned the Ahi-1 cDNA. Since the Ahi-1 gene was highly expressed in brain and testis, a rat brain cDNA library was first screened. Several positive clones ranging from 0.6 to 2.5 kbp were obtained (Fig. 2). Northern blot analysis performed with both rat cDNA clone A and clone B showed the same hybridization pattern previously observed with the RT-PCR product A, indicating that part of the Ahi-1 gene had been cloned. The full-length mouse Ahi-1 cDNA was obtained by RT-PCR and 5' RACE methods and by rescreening a mouse thymic cDNA library, as described in Materials and Methods. A schematic representation of these Ahi-1 cDNA clones is shown (Fig. 2).
The complete nucleotide and deduced amino acid sequence of the Ahi-1 cDNA is shown in Fig. 3A. An open reading frame begins with a potential initiation methionine codon at nucleotide (nt) 258, based on Kozak consensus rules (25). An in-frame stop codon is located 204 nt upstream of this site. The predicted encoded protein is
120 kDa long and contains 1,047 amino acids (aa). The in-frame termination codon (TAA) is followed by a long 3'-untranslated region (Fig. 2 and 3A). The sequence immediately upstream of the poly(A) tail contains a AATAAA polyadenylation signal (nt 4720).
The mouse Ahi-1 gene product contains an SH3 motif and seven WD40 repeats. Computer-assisted analysis of the predicted amino acid sequence of the Ahi-1 cDNA product revealed a number of significant structural features (Fig. 3B). In particular, the Ahi-1 cDNA encodes a region of ca. 50 aa residues (aa 905 to 961) with significant sequence homology to the Src homology 3 (SH3) domain (25 to 35% identity [40 to 50% similarity]).
Several regions of the predicted Ahi-1 protein are also proline-rich (Fig. 3B) and contain nine potential SH3-binding sites (PxxP) at positions 9, 98, 182, 352, 803, 858, 976, 985, and 1001; the numbers here refer to the first proline residue in the PxxP consensus sequences present in high-affinity SH3 ligands (7, 43, 44, 56, 64).
Interestingly, another region of the Ahi-1 protein upstream of the SH3 motif and spanning
320 aa residues (aa 448 to 769) exhibits significant sequence homology to the WD40 repeats (33). Seven WD40 repeats were identified in the predicted Ahi-1 polypeptide sequence at positions 448 to 490, 493 to 532, 537 to 576, 583 to 622, 641 to 678, 684 to 721, and 724 to 769. Each WD40 repeat consists of stretch of ca. 40 aa in which tryptophan and aspartic acid residues and certain other amino acids are highly conserved, a finding consistent with the range of 23 to 41 observed in other members of this class of proteins (33).
In addition, computer analysis with the ScanProsite tool revealed the presence of many potential phosphorylation sites: among them, 21 protein kinase C phosphorylation sites, 3 cyclic AMP (cAMP)- and -cGMP-dependent protein kinase phosphorylation sites (23), and 26 potential casein kinase II phosphorylation sites. Also, two potential tyrosine kinase phosphorylation sites are present (residue 71 [KLKEQLTY] and residue 941 [KDNEDWWY]) with sequences matching optimally the consensus sequence of tyrosine kinases [R/K-x-(2,3)D/E-x-(2,3)Y] (8) (Fig. 3A). Tyrosine phosphorylation has been shown to play an important role in several proteins involved in malignant transformation and signal transduction (57). The amino-terminal region of Ahi-1 contains an acid-rich domain (Fig. 3A). Such a domain is found in quite different proteins, such as nucleolines, calcium-binding proteins, and transcription factors (40). Finally, three potential PEST sequences (at residues 19 to 30, 100 to 118, and 417 to 432) (46), three potential nuclear localization signal (aa 62 to 94, 134 to 151, and 825 to 868) (10), and four potential glycosylation sites (at positions 310, 536, 821, and 851) were found in the Ahi-1 protein. A summary of the structural domains present in the Ahi-1 protein is depicted in Fig. 3B.
Conservation of the Ahi-1 gene in evolution. To determine whether the Ahi-1 gene has been conserved throughout evolution, we searched for homologous sequences in other species. Mouse, rat, hamster, cat, cow, and human DNA samples were digested with EcoRI and hybridized at high stringency with Ahi-1 cDNA clone-B probe. Specific DNA fragments were detected in all of these species (Fig. 4), indicating a significant conservation of these sequences among mammals. Comparison of the mouse, rat, and human sequences now available (see below) confirmed the high conservation of this gene. However, no gene coding for a protein containing WD40 repeats and a SH3 domain was found in the sequences of Drosophila melanogaster or Caenorhabditis elegans. This suggests that the Ahi-1 gene may have an important function in mammals.
![]() View larger version (46K): [in a new window] |
FIG. 4. Interspecies conservation of the Ahi-1 gene. DNAs (20 µg) from various species were digested with EcoRI and analyzed by Southern blot with the rat Ahi-1 cDNA fragment B as a probe. Hybridization and washing were done as described in Materials and Methods. Lanes:1, mouse; 2, rat; 3, human; 4, hamster; 5, cat; 6, cow; M, HindIII/EcoRI-digested DNA.
|
In addition, at the 5' end of the coding sequence of the human AL136797 cDNA, there are 420 additional nucleotides downstream of the ATG which showed no homology with the mouse Ahi-1 cDNA. Furthermore, we identified and cloned novel extra untranslated 5' sequences of human Ahi-1. Using RT-PCR on RNA from human Jurkat cells with the primers 910 and 899, we were able to amplify, clone, and sequence three PCR products (
0.8, 0.9, and 1.0 kb) corresponding to three transcribed exons, designated exons 1 (68 bp), 2 (62 bp), and 3 (86 bp) (Fig. 5C). These exonic sequences are not translated and are spliced differently, thus leading to three isoforms of human Ahi-1 5'-untranslated region (Fig. 5B and C). There is no homology between these human 5'-untranslated sequences and the mouse untranslated Ahi-1 sequences.
The amino acid sequences predicted from these cDNA sequences were also analyzed. The human Ahi-1 protein contains an additional 140 aa at its N terminus that are not present in the mouse Ahi-1 protein. This region harbors a potential coiled-coil domain, as revealed by the SMART program. The two other shorter human isoform proteins contain an initiation codon but are likely to be missing some sequences at the N terminus. Two of the human AHI-1 proteins (AL136797 and AK024085) contain the SH3 and WD40 domains found in the mouse homolog. Within this WD40/SH3 region, the amino acid sequences of human and mouse Ahi-1 are highly conserved (>82% identity and 90% similarity). However, the sixth WD40 repeat of the human proteins was more divergent from that of the mouse. Furthermore, the AK000076 isoform lacks the SH3 domain. In addition, this protein contains a novel amino sequence at the C terminus not present in either the mouse or in the other human clones. This novel sequence is encoded by a single exon (exon 24) present in the human genome (see below).
Together, these results show that Ahi-1 gene expression can be modulated by alternative splicing. Interestingly, the spliced variants lacking the SH3 domain (AK000076 and BB615071) would be expected to encode truncated forms of the protein with most likely a distinct function.
Mapping of the Ahi-1 provirus insertion sites at the 3' end of the gene. The orientation of the proviruses relative to that of the Ahi-1 gene and the localization of the coding exons of the gene were performed by Southern blot, PCR, and sequence analyses (Fig. 1B). Moreover, the locations and sequences of these mouse exons were confirmed by data obtained from the database (see below and Fig. 6). These analyses showed that most of the proviruses had integrated at the 3' end of the Ahi-1 gene in two clusters and in an inverse transcriptional orientation relative to that of the Ahi-1 gene. In the more proximal cluster, at least one provirus (T3465) was inserted within the last intron (Fig. 1B), whereas the others were integrated within the last noncoding exon or right downstream of this last exon. The second distal cluster of provirus insertion was ca. 16 kbp downstream of the last exon of the Ahi-1 gene (Fig. 1A). Only in one tumor cell line (L48) did integration occur outside the main clusters in intron 22 (Fig. 1A).
Genomic organization of the mouse and human Ahi-1 genes. Based on the BLAST N and BLAST X homology programs, the organization of the mouse and human Ahi-1 genes was deduced from the comparison of all of the mouse and human Ahi-1 cDNA isoform sequences available from the GenBank Databases. The results of these analyses are shown in Tables 1 and 2. This analysis revealed that the Ahi-1 gene spans more than 200 kb in a region of the human chromosome 6 and a minimum of 100 kbp (distributed on at least nine contigs) on the mouse chromosome 10 (Fig. 6 and Tables 1 and 2). The mouse or human Ahi-1 gene contains at least 27 and 33 exons, respectively. The introns separating exons are between 200 bp and 35 kbp long. The exons range in size from 26 to 560 bp, excluding exons 33 (human) and 27 (mouse), which constitute a large 3'-untranslated region. The exonic sequences are identical to the human and mouse Ahi-1 cDNA sequences with the exception of a few nucleotides. The seven WD40 repeats are distributed in seven exons: each of the putative WD40 repeats is encoded mainly by a single exon (exons 11 to 17 for mice and exons 14 to 20 for humans). Exon 26 (human) or 22 (mouse) encodes the SH3 domain.
|
View this table: [in a new window] |
TABLE 1. Exon-intron boundaries of the mouse Ahi-1 genea
|
|
View this table: [in a new window] |
TABLE 2. Exon-Intron boundaries of the human Ahi-1 genea
|
The full-length human Ahi-1 clone AL136797 (isoform I) consists of 29 exons (i.e., exons 1 to 33, excluding exons 24, 28, 29, and 32). The first three exons were cloned in our laboratory by RT-PCR from Jurkat cells (Fig. 5). The termination codon is located in exon 33. The clone AK000076 (isoform II) lacking the SH3 domain appears to be made of 24 exons (i.e., exons 1 to 24) and contain an in-frame stop codon in exon 24. The clone AK024085 (isoform III) consists of 32 exons (i.e., exons 1 to 33, excluding exon 24) with a termination codon present in exon 28.
Mapping of the Mis-2 provirus insertion sites within the 16th intron of Ahi-1. We have previously identified Mis-2 as a common provirus insertion site in Moloney MuLV-induced rat thymomas (62) and found that it maps to mouse chromosome 10. Long-range restriction enzyme mapping analysis revealed that Mis-2 mapped at a distance of ca. 160 kbp of the Ahi-1 insertion site (62). Considering the length of the human and mouse Ahi-1 genes, we predicted that the Mis-2 integration sites would map within the Ahi-1 gene itself. To test this hypothesis, we sequenced the rat genomic region adjacent to Mis-2 integration sites in tumor 1213-2 and 1213-19 (Fr 1 to 5, Fig. 6A) (62). This analysis confirmed the site of the integration and revealed a high degree of homology of the rat Ahi-1 sequences with those of the mouse Ahi-1 around exons 14, 15, and 16 (contig D, Fig. 6). These results, combined with our previous mapping of the Mis-2 provirus integration sites, indicated that the Mis-2 locus corresponds to insertions within the intron 16 of the Ahi-1 gene.
Expression of the Ahi-1 gene in rodent tissues. Northern blot analysis was performed on RNA extracted from various tissues of adult rat and mouse. This analysis revealed the presence of Ahi-1 transcripts hybridizing with a probe derived from the 5' end of mouse Ahi-1 cDNA (probe E1.3) (Fig. 2B) in several tissues (Fig. 7A). Ahi-1 expression was much higher in the brain and testis than in other organs but was especially low in the liver. In addition, various Ahi-1 RNA species were present in different tissues, suggesting different splicing variants: a major hybridizing species of 5 kb in several mouse tissues tested and a less intensely hybridizing RNA of 3.5 kb in brain, 4.2 and 3.5 kb in testis, and 2 kb in embryos (Fig. 7A). In the rat thymus (Fig. 7B, lane 4) and in the brain and testis (data not shown), two major species of 5 and 2 kb could be detected with a 3' end rat Ahi-1 cDNA clone A (Fig. 2A) as a probe. In rat spleen, shorter (1.2 and 0.7 kb) and more abundant RNA species were detected (data not shown). These results indicated that the Ahi-1 gene expresses several RNA species, a finding consistent with the various spliced variants cDNA identified (Fig. 5).
Expression of Ahi-1 in tumors with provirus insertions. To determine whether provirus integration at the Mis-2 or the Ahi-1 locus led to deregulation of the Ahi-1 gene expression, we first examined Ahi-1 expression in T- and B-cell tumors by Northern blot analysis with a 3' end rat cDNA probe (probe B) or 5' end mouse cDNA probe (probe E1.3) (Fig. 2), respectively. Many tumors showing rearrangement in a very small percentage of the tumor cells [as judged by the intensity of the rearranged Ahi-1 or Mis-2 fragments relative to the germ line fragment (41, 62)] were not suitable for this analysis. Therefore, only tumors harboring a significant proportion of cells showing rearrangement were tested.
In the RNA of Moloney MuLV-induced rat tumors 1213-2 and 1213-19 rearranged in Mis-2, four major Ahi-1 transcripts (5, 4.2, 2.5, and 1.2 kb) were detected. This pattern of expression was the same in rearranged and nonrearranged tumors, but it was distinct from that of the thymus where the 1.2- or 2.5-kb transcripts were, respectively, not detected or less abundant, whereas another 2-kb species was abundant (data not shown). In the rat T-cell thymoma (i.e., 1213-10) which had a provirus inserted in the Ahi-1 locus, the levels of several RNA species (especially the 1.2-kb form) were higher than those found in unrearranged tumors (Fig. 7B). Analysis of Ahi 1 expression in A-MuLV-induced pre-B cell tumors or in T-cell thymomas from Moloney MuLV-infected MMTVD/myc Tg mice, which had a rearranged Ahi-1 locus, showed a modest increase of Ahi-1 5-kb transcript in some, but not all, of these tumors, compared to levels detected in unrearranged tumors (data not shown). Interestingly, some of the tumors also showed higher levels of shorter Ahi-1 transcripts (
4.2, 4, and 3.5 kb) (Fig. 7C). The levels of the shorter 4.2-kb Ahi-1 transcripts were especially high in the L48 cell line (Fig. 7D, lane 3). Therefore, Northern analysis revealed that the integration of a provirus at the 3' end or in exon 16 of the Ahi-1 gene appears to affect only modestly the levels of its transcription in some tumors but significantly in two of the tested tumors.
Since the provirus integration within intron 16 of Ahi-1 (Mis-2 locus) would be expected to cause truncation of the gene, as we previously reported for similar intronic provirus integrations within the Notch-1 gene (16), RT-PCR analysis was also performed to detect such truncated transcripts. For identification of transcripts originating at the 5' end of the gene and terminating at the site of provirus integration, we used primer 276 within exon 16 with primer 828 (i.e., 1213-9) or primer 110 (i.e., 1213-2) within the LTR. This analysis indeed revealed the presence of truncated Ahi-1 transcripts deleted of their half 3' end and fused to viral LTR sequences, with the generation of a translation termination codon in tumor 1213-19 RNA (Fig. 7E) but not in tumor 1213-2 RNA (data not shown). In tumor 1213-19 RNA, exon 16 is properly spliced and the provirus insertion has forced the utilization of a cryptic splice acceptor site in the Ahi-1 intron, 22 nt just upstream of the LTR. In tumor 1213-2 DNA, the LTR is inserted in the same transcriptional orientation as that of Ahi-1 and may work as a promoter. RT-PCR amplification with a U5 LTR primer 878 and primers 277 and 265 in exon 17 or 22 did not reveal such 5' viral/Ahi-1 fused transcripts originating in the LTR (data not shown), suggesting that this promoter is silent.
Similar analysis with RNA from tumors having provirus integration at the 3' end of the gene (Ahi-1 locus) was also carried out. In tumors T3465 and T3481, this experiment showed Ahi-1/LTR fused transcripts with the proper utilization of the exon 26 splice donor site and use of a cryptic splice acceptor site in the viral U5 region (T3465) or in intronic sequences upstream of the LTR (T3481), thus splicing out exon 27 (Fig. 7E). These transcripts would be expected to encode a full-length protein. In the same tumor T3481, another interesting truncated Ahi-1/LTR splice variant transcript species was generated by splicing out exons 20, 22, 23, 24, and 27. These transcripts would be expected to encode truncated Ahi-1 proteins deleted of most of their C terminus, notably of their SH3 domain. In tumor L48, this analysis with primer pairs in exon 19 and in LTR detected Ahi-1/LTR fused transcripts deleted of exons 23 to 27, with the use of a cryptic splice acceptor site in the sequences of intron 22. These transcripts would be expected to encode truncated Ahi-1 proteins deleted of their C terminus but still harboring the SH3 domain. Together, these results show that insertion of provirus within the Ahi-1 gene leads to the generation of distinct species of truncated RNA.
|
|
|---|
16 kbp downstream of the last noncoding exon. Most of these Ahi-1 provirus insertions are located outside the coding region of Ahi-1. Surprisingly, all of the mapped proviruses were found to be in an inverse transcriptional orientation relative to that of the Ahi-1 gene itself. The second provirus insertion site, Mis-2, is intronic (within the 16th intron) and is located at a large distance (
120 kbp) from the Ahi-1 insertion site. At this Mis-2 site, the proviruses are integrated both in the sense and antisense transcriptional orientation relative to that of the Ahi-1 gene. Therefore, it appears that Ahi-1 is the gene targeted by these insertional mutational events. The structure of the predicted protein encoded by the Ahi-1 gene is particularly interesting, containing an SH3 domain, WD40 repeats and SH3-binding sites, all known to mediate specific protein-protein interactions. SH3 domains are known to bind to proline-rich motifs (24, 37, 52). Among the best-studied functionally important SH3 interactions is the complex formed between the Grb2/Sem5 SH3 domain and the GDP-GTP exchange protein Sos, which forms a link in a signal transduction pathway that functionally ties tyrosine kinase receptors to Ras (5, 11, 28, 30, 35, 54). Although the functional significance of the presence of a SH3 domain within Ahi-1 is not known at the present time, this protein clearly represents a novel member of the SH3-containing protein family.
The second most evident motif of the Ahi-1 protein is a WD40-repeat domain. Several proteins containing WD40 repeats have been identified (27, 33, 55, 58). These proteins are associated with multiple and diverse aspects of cellular metabolism, including signal transduction, regulation of cytoskeletal assembly, cell cycle regulation, RNA processing, programmed cell death, and gene regulation (33). In general, their functions appear to be regulatory in nature. Structural models suggest that the WD40-repeat-containing proteins are endowed with the potential for interactions with other proteins, a finding consistent with the fact that many such proteins are components of multiprotein complexes (33, 34). The Ahi-1 protein represents a novel member of this family. To our knowledge, Ahi-1 is the only protein reported to date harboring both WD40 repeats and a SH3 motif.
The third important series of motifs predicted in the Ahi-1 amino acid sequences are nine putative SH3-binding sites. Short (8 to 10 aa), proline-rich, and structurally conserved motifs were initially found to bind specifically to the SH3 domain of Abl (7, 43). These SH3-binding sites are present in a variety of proteins and appear to form two distinct classes: a class I motif containing a conserved N-terminal arginine and a class II motif containing a conserved C-terminal arginine (14, 29). The SH3-binding sites of Ahi-1 could mediate its interaction with other SH3-containing protein(s). One of these sites may even bind the SH3 domain of Ahi-1 itself, intramolecularly. Intramolecular interaction of the Src SH3 domain has been reported (12).
Finally, Ahi-1 harbors other motifs of interest, two potential tyrosine phosphorylation sites, one within the SH3 domain. Phosphotyrosine motifs are known to bind specifically to SH2 domains of proteins (24, 52). If the Ahi-1 protein can be phosphorylated at these sites by a tyrosine kinase, this phosphorylation may allow it to recruit and bind to other SH2-containing protein(s).
Therefore, Ahi-1 appears to be a modular protein containing several putative motifs that have been shown to be present in several signaling molecules and to mediate protein-protein interactions. It is tempting to suggest that Ahi-1 may play a role in signal transduction. Because of the high number of these putative motifs mediating protein-protein interactions, Ahi-1 may be a docking site or scaffold protein recruiting a number of other signaling molecules and modulating and integrating their action.
Ahi-1 provirus insertional mutations generate truncated forms of Ahi-1 RNA. The Ahi-1 gene mutations that were analyzed lead to the generation of various forms of transcripts: some fused to viral sequences and some deleted of their 3' end. This was expected with the provirus inserted within intron 16 (Mis-2 locus), and such molecular rearrangement represents a mechanism very similar to the one we described previously with proviruses inserted within the Notch1 gene (16, 20). However, the generation of similar truncated transcripts in tumors harboring provirus integrated at the 3' end of the gene was more surprising. Some of these 3'-end insertional mutations seem to alter the normal splicing machinery and to force an alternative splicing within viral or intronic sequences close to them, thus deleting exons coding for the C terminus of Ahi-1. The molecular requirements for such unusual consequences of provirus integrations at the 3' end of a gene remain obscure but may involve the opposite transcriptional orientation of the inserted proviruses relative to that of the Ahi-1 gene. Indeed, all of the 27 proviruses inserted within the Ahi-1 locus (also recently designated Epi1) are in the same direction (Fig. 1) (2, 41), and we show here that this is in opposite transcriptional orientation relative to that of Ahi-1. Future work on how such insertional mutants may affect the splicing machinery should be informative. Interestingly, some of the truncated transcripts generated in tumors harboring Mis-2 or Ahi-1 provirus insertions appear to have a similar coding capacity, both having the capacity to code for Ahi-1 proteins truncated of some C-terminal sequences, including of their SH3 domain.
How is the Ahi-1 gene mutation involved in tumor development?
Genetic evidence suggests that provirus insertional mutation of the Ahi-1 gene contributes to tumor formation in different cell lineages. The Ahi-1 locus was identified as a common provirus integration site initially in Abelson (v-abl-induced) pre-B lymphomas (41) and later in c-myc induced T-cell lymphomas of the MMTVD/myc Tg mice (Fig. 1), suggesting that it may collaborate with these oncogenes. This locus has also been found by others to be rearranged by provirus insertion in other types of tumors, namely, in pre B-cell tumors of Eµ/myc Tg mice (18, 61) and in acute myeloid leukemia in Nf1 heterozygous mice (2). Similarly, the Mis-2 locus was identified as a common provirus insertion site in Moloney MuLV-induced rat T-cell lymphomas (thymomas) (62). The fact that these provirus insertional mutations are not random and were identified in a relatively high proportion of tumors (
3% for Mis-2,
15% for Ahi-1 [Fig. 1] [41], and 44% for Epi1 [2]) suggests that they have been selected during the oncogenic process and that they may be involved in this process. This notion is reinforced by the several examples of common provirus insertional sites that have been found to activate protooncogenes in nondefective retrovirus-induced tumors (26, 39, 47, 60).
Molecular evidence also suggests that mutation of the Ahi-1 gene may be involved in tumor formation. Our analysis of RNA in tumors harboring mutations in the Ahi-1 gene suggests that the mechanism by which these insertional mutations may enhance the oncogenic potential of the Ahi-1 gene is through the generation of truncated forms of the Ahi-1 RNA with the capacity to code for truncated Ahi-1 proteins. The levels of these truncated RNA are not very high in the Ahi-1 rearranged tumors analyzed but are specific to these tumors and are not found in nonrearranged tumors. These are likely to be involved in the transformation process. The deletion of sequences coding for the C terminus, including for the SH3 domain in some truncated Ahi-1 RNA found in tumors is likely to significantly affect the interaction of Ahi-1 with other proteins, possibly converting this molecule to a dominant-negative mutant. Alternatively, truncated Ahi-1 proteins may represent gain-of-function mutants, if the SH3 domain binds intramolecularly, as reported for other SH3-containing molecules (12), or bind to an inhibitor. In nonreceptor tyrosine kinases, such as the proto-oncogene c-src or c-abl, deletion or mutation of the SH3 domain generally leads to the oncogenic activation of their tyrosine kinase, suggesting that the SH3 domain binds to an inhibitor of the kinase activity (9, 15, 21, 32, 38, 53, 59, 63). Thus, the SH3 deletion in Ahi-1 may uncover some domains and allow novel protein interactions or may prevent the binding of an inhibitor. The fact that Ahi-1 cDNA with the potential to encode very similar truncated proteins as the ones detected in tumors have been isolated in libraries from normal tissues, suggests that truncated Ahi-1 proteins may not be oncogenic in each tissue. Rather the inappropriate or higher expression of truncated Ahi-1 proteins in some specific cell types may contribute to the transformation process.
Because we were unable to detect truncated transcripts nor Ahi-1/viral fused transcripts in some Ahi-1 rearranged tumors, it is also quite possible that the Ahi-1 gene may be involved in oncogenesis by mechanism(s) not yet uncovered and distinct from the generation of truncated gene products. In addition, we cannot exclude the possibility that some proviruses inserted into the Ahi-1 locus may act by other, as-yet-uncharacterized processes or may affect the expression of other genes surrounding Ahi-1 at a distance. This latter mechanism has recently been proposed for other provirus insertion sites upstream of c-myb (19).
It will be interesting to investigate how the Ahi-1 gene, apparently targeted by these proviruses and coding for a protein that shows all of the characteristics of a modular signaling molecule involved in protein-protein interactions, participate in oncogenesis and what is its normal function.
This work was supported by grants from the Medical Research Council of Canada and the National Cancer Institute of Canada to P.J. X.J. was the recipient of a fellowship from McGill University and from Hydro-Quebec.
We thank Alan J. Buckler for providing the pSPL1 vector. We thank Benoît Laganière and Patrick Couture for excellent technical assistance. We are also grateful to Rita Gingras for preparing the manuscript.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»