Journal of Virology, November 2006, p. 10752-10762, Vol. 80, No. 21
0022-538X/06/$08.00+0 doi:10.1128/JVI.00871-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia
Received 28 April 2006/ Accepted 24 July 2006
0.001 to 3% of the housekeeping beta-actin gene transcript level. We demonstrated that the main factors affecting the LTR promoter activity were the LTR type (5'-proviral, 3' proviral, or solitary) and position with regard to genes. The averaged promoter strengths of solitary and 3'-proviral LTRs were almost identical in both tissues, whereas 5'-proviral LTRs displayed two- to fivefold higher promoter activities. The relative content of promoter-active LTRs in gene-rich regions was significantly higher than that in gene-poor loci. This content was maximal in those regions where LTRs "overlapped" readthrough transcripts. Although many promoter-active LTRs were mapped near known genes, no clear-cut correlation was observed between transcriptional activities of genes and neighboring LTRs. Our data also suggest a selective suppression of transcription for LTRs located in gene introns.
FIG. 1. Schematic representation of solitary (left) and proviral (right) LTR expression. The transcription driven from 5'-proviral LTRs results in mRNAs of viral genes, whereas the expression of either solitary or 3'-proviral LTRs results in the transcription of host nonrepetitive genomic sequences flanking the 3' ends of the retroelements.
86%) human-specific LTR sequences. The HS family members can be parts of full-sized HERV-K (HML-2) proviruses (12% of individual HS representatives), truncated proviruses (5%), or solitary LTRs (83%).
Recently, we developed a new technique, termed genomic repeat expression monitoring (GREM), for experimental genome-wide identification of promoter-active repetitive elements (7). The technique is based on hybridization of repeat 3'-flanking genomic DNA to pools of total cDNA 5'-terminal parts, followed by selective PCR amplification of the genomic DNA-cDNA heteroduplexes. The resulting library of cDNA/genomic DNA hybrids can be used as a source of tags for individual transcriptionally active repeats. GREM was shown to be adequate for tasks of both quantitative and qualitative analyses of promoter activity. In model experiments, we used GREM to create the first genome-wide map of HS elements that display promoter activity in the testis. Here we utilized GREM for the first comprehensive comparison of HS element promoter activities in healthy human tissue (testicular parenchyma) and in the corresponding cancer (seminoma) from the same patient. We found that at least 50% of HS LTRs were promoter active, and we mapped 20 new functional human-specific promoters. The transcription of many HS LTRs was up- or downregulated in the seminoma. The promoter strengths differed greatly among individual HS elements, and their transcript levels ranged from
3 to 0.001% of the marker beta-actin gene transcript level. We showed that the main factors affecting the LTR promoter activity were the LTR type (5' proviral, 3' proviral, or solitary) and location relative to genes.
Oligonucleotides. Oligonucleotides were synthesized using an ASM-102U DNA synthesizer (Biosan, Novosibirsk, Russia). Their structures can be found in Table 1.
TABLE 1. Genomic primer sets used for PCR amplification
Tissue sampling. A seminoma and normal testicular parenchyma were sampled from a surgical specimen containing a testicular germ cell tumor under non-neoplastic conditions. Representative samples were divided into two parts, with one being frozen immediately in liquid nitrogen and the other being formalin fixed and paraffin embedded for histological analysis.
RNA isolation and cDNA synthesis. Total RNA was isolated from frozen tissues pulverized in liquid nitrogen using an RNeasy Mini RNA purification kit (QIAGEN). All RNA samples were further treated with DNase I to remove residual DNA. Full-length cDNA samples were obtained according to a cap switch effect-based SMART cDNA synthesis protocol (Clontech, BD Biosciences), using an oligo(dT)-containing primer (CDS), PowerScript reverse transcriptase (Clontech, BD Biosciences), and a riboCS oligonucleotide. When PowerScript reverse transcriptase reaches the 5' end of an mRNA, the enzyme's terminal transferase activity adds a few additional deoxycytidine nucleotides to the 3' end of the cDNA. The riboCS oligonucleotide, which contains three guanine ribonucleotide residues at its 3' end, base pairs with the deoxycytidine stretch, creating an extended template. The reverse transcriptase then switches templates and continues replication to the end of the oligonucleotide. The resulting full-length single-stranded cDNA contains 5'-terminal sequences complementary to the riboCS oligonucleotide. An Advantage 2 polymerase mix (Clontech) and CS and CDS oligonucleotides were used to synthesize the second cDNA strands and to PCR amplify double-stranded cDNA. Prior to further hybridization in the GREM procedure, 1 µg of cDNA was digested with 10 units of AluI frequent-cutter restriction endonuclease (Fermentas) for 3 h at 37°C. This enzyme was used because the HS LTR consensus sequence lacks AluI recognition sites.
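The choice of AluI rests on the absence of its recognition site from the HS LTR consensus. A minimal sketch of how such a check could be scripted is given below. It relies on the well-known AluI specificity (AGCT, blunt cut between G and C); the function names and the toy sequences are hypothetical and are not taken from this study.

```python
# Illustrative sketch only: function names and test sequences are hypothetical.
ALUI_SITE = "AGCT"  # AluI recognition sequence (palindromic, so one strand suffices)
CUT_OFFSET = 2      # blunt cut between G and C: AG^CT


def find_alui_sites(seq: str) -> list[int]:
    """Return 0-based start positions of every AluI site in seq."""
    seq = seq.upper()
    sites = []
    start = seq.find(ALUI_SITE)
    while start != -1:
        sites.append(start)
        start = seq.find(ALUI_SITE, start + 1)
    return sites


def digest(seq: str) -> list[str]:
    """Return the fragments produced by complete AluI digestion of a linear sequence."""
    seq = seq.upper()
    cuts = [pos + CUT_OFFSET for pos in find_alui_sites(seq)]
    bounds = [0] + cuts + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    toy = "ttAGCTggagctaa"  # made-up sequence with two sites
    print(find_alui_sites(toy))
    print(digest(toy))
```

A sequence for which `find_alui_sites` returns an empty list would survive the digestion intact, which is the property required of the LTR portion of the hybrids.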
Selective amplification of genomic regions flanking HS LTRs. Selective amplification of LTR 3'-flanking regions was based on the PCR suppression effect, described in detail elsewhere (6, 29, 41). Human genomic DNA (1 µg) was digested with 10 units of AluI (Fermentas) restriction endonuclease, ethanol precipitated, and dissolved in 20 µl of sterile water. One hundred picomoles of annealed suppression adapters (A1A2/A3) was ligated overnight to 300 ng of the digested DNA, using 3 units of T4 DNA ligase (Promega) at 16°C. The ligated DNA was purified using a Qiaquick purification column (QIAGEN) and eluted with 50 µl of water. One microliter of the eluted DNA was PCR amplified with the HS LTR-specific primer LTRfor1 and the adapter-specific primer A1, using the following cycling program: 72°C for 1 min, 95°C for 1 min, and 20 cycles of 95°C for 15 s, 65°C for 15 s, and 72°C for 1 min. PCR products were diluted 500-fold and used as templates for nested PCR with the downstream HS LTR-specific primer LTRfor2 and the adapter-specific primer A2 under the same cycling conditions, but for 22 cycles. The amplified LTR flanking sequences were treated with ExoIII exonuclease (Promega) to generate 5'-protruding termini exactly as described previously (6, 8).
GREM technique. The GREM technique includes hybridization of PCR-amplified genomic sequences flanking repetitive elements (HS LTRs in our case) with cDNA, followed by selective amplification and cloning of hybrid DNA duplexes. For each tissue (seminoma and testicular parenchyma), 100 ng of ExoIII-treated LTR flanking sequences, obtained as described above, was mixed with 300 ng of cDNA in 4 µl of hybridization buffer (0.5 M NaCl, 50 mM HEPES, pH 8.3, 0.2 mM EDTA), overlaid with mineral oil, denatured at 95°C for 5 min, and hybridized at 68°C for 14 h. The final mixture was diluted with 36 µl of dilution buffer (50 mM NaCl, 5 mM HEPES, pH 8.3, 0.2 mM EDTA), and 1 ng of the obtained DNA equivalent was PCR-amplified with 0.2 µM adapter-specific primer A2 and 0.2 µM cDNA 5'-end-specific primer CS under the following conditions: 72°C for 5 min to fill in the ends of DNA duplexes, followed by eight cycles of 95°C for 15 s, 65°C for 15 s, and 72°C for 1 min 30 s. The PCR products were diluted 500-fold and reamplified by nested PCR for 20 cycles (95°C for 15 s, 65°C for 15 s, and 72°C for 1 min 30 s) with 0.2 µM nested adapter-specific primer A4 and 0.2 µM HS LTR 3'-end-specific primer LTRfor3. The final PCR products were cloned into Escherichia coli by using a pGEM-T vector system (Promega) and sequenced by the dye termination method using an Applied Biosystems 373 automatic DNA sequencer.
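The buffer arithmetic of the dilution step can be checked with a short calculation. The sketch below assumes that volumes are additive, that the DNA itself contributes no extra volume, and that "1 ng of the obtained DNA equivalent" refers to the total input DNA (100 ng of flanks plus 300 ng of cDNA); these are assumptions made for illustration, and the helper name is hypothetical.

```python
# Illustrative sketch only: assumes additive volumes and negligible DNA volume.
def mixed_concentration(vol_a_ul, conc_a, vol_b_ul, conc_b):
    """Concentration after combining two solutions (same units as the inputs)."""
    return (vol_a_ul * conc_a + vol_b_ul * conc_b) / (vol_a_ul + vol_b_ul)


HYB_UL = 4.0   # hybridization buffer volume, in microliters
DIL_UL = 36.0  # dilution buffer volume, in microliters

# Final concentrations in mM (0.5 M NaCl = 500 mM).
final_mm = {
    "NaCl": mixed_concentration(HYB_UL, 500.0, DIL_UL, 50.0),
    "HEPES": mixed_concentration(HYB_UL, 50.0, DIL_UL, 5.0),
    "EDTA": mixed_concentration(HYB_UL, 0.2, DIL_UL, 0.2),
}

# Input DNA: 100 ng of LTR flanking sequences plus 300 ng of cDNA.
total_dna_ng = 100.0 + 300.0
dna_ng_per_ul = total_dna_ng / (HYB_UL + DIL_UL)
volume_for_1_ng_ul = 1.0 / dna_ng_per_ul

if __name__ == "__main__":
    print(final_mm)
    print(dna_ng_per_ul, volume_for_1_ng_ul)
```

Under these assumptions the diluted mixture is about 95 mM NaCl and 9.5 mM HEPES, and a 1-ng DNA equivalent corresponds to roughly 0.1 µl of the 40-µl mixture.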
RT-PCR. All reverse transcription-PCR (RT-PCR) experiments were reproduced at least three times, using independent cDNA preparations. For RT-PCR control of the LTR transcriptional status, we used pairs of primers, one of which was specific for a 3'-terminal part of a particular HS LTR (see Table S4 in the supplemental material for sequences) and the other of which was specific for a unique sequence within the corresponding genomic LTR 3'-flanking region. Prior to RT-PCR analysis, the priming efficiencies of the primers were examined by genomic PCRs at various temperatures, depending on the primer combination used. These PCRs were done for 19, 22, 25, and 28 cycles, with 40 ng each of the human genomic DNA templates isolated from both tissues. RT-PCR was done with cDNA samples from a mature human seminoma and healthy testicular parenchyma, with an equivalent of 20 ng total RNA being used as a template in each PCR, performed in a final volume of 40 µl. Five-microliter aliquots of the reaction mixture after 21, 24, 27, 30, 33, 36, and 39 cycles of amplification were analyzed by electrophoresis on 1.5% agarose gels. To find out the transcriptional levels of selected known genes in both tissues under study, we performed another series of RT-PCR experiments with primers designed predominantly against neighboring constitutive exons in the middle part of the corresponding cDNA molecule. The cycling conditions of these reactions also varied depending on the particular primer combination used, and the PCRs were performed in a final volume of 40 µl as described above. In all cases, the transcriptional status was determined by the number of PCR cycles needed to detect a PCR product of the expected length, and the PCR product concentration was measured using a Photomat system and Gel Pro Analyzer software.
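For orientation, a cycle-number readout of this kind can be converted into a rough transcript level relative to a reference gene. The sketch below assumes ideal exponential amplification (product doubling at every cycle) and equal priming efficiencies for target and reference; the exact calibration used in this study is not given in the text, so the function and the example cycle numbers are purely illustrative.

```python
# Illustrative sketch only: assumes ideal doubling per cycle, which real PCRs
# only approximate; the example cycle numbers are hypothetical.
def relative_level_percent(cycles_target, cycles_reference, efficiency=2.0):
    """Target transcript level as a percentage of the reference transcript level.

    cycles_target / cycles_reference: number of cycles needed to detect the
    product for the target and for the reference (e.g., beta-actin).
    efficiency: fold amplification per cycle (2.0 means perfect doubling).
    """
    return 100.0 * efficiency ** (cycles_reference - cycles_target)


if __name__ == "__main__":
    # A product first visible 6 cycles later than the reference product.
    print(relative_level_percent(cycles_target=27, cycles_reference=21))
```

Because aliquots were taken every three cycles, a readout of this type resolves transcript levels only to within about eightfold (2^3) under the doubling assumption, which is one reason to complement it with band densitometry.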
Five hundred ELT clones were sequenced for each tissue. After the removal of rearranged plasmid and low-quality sequences, 395 and 419 ELTs, for normal testicular parenchyma and the seminoma, respectively, were taken for analysis. An ELT analysis allowed us to unambiguously map the corresponding promoter-active solitary and 3'-proviral LTRs. However, such mapping was impossible in the case of 5'-proviral LTRs because the adjoining proviral regions are repetitive and identical in sequence (Fig. 1). The detailed results of ELT mapping are shown in Table S1 in the supplemental material. Apart from the data on HS LTR promoter activities, the table contains a description of every individual HS element's genomic neighborhood and the results of previously performed functional tests, such as RT-PCR and differential methylation analyses.
To test the applicability of GREM to the task of quantitative analysis of LTR promoter activity, we addressed the issue of whether there is a correlation between the LTR-directed transcript level, as measured by RT-PCR, and the frequency of the corresponding ELT occurrence in the GREM library. RT-PCR amplification was done with pairs of primers, one of which was specific for the 3'-terminal part of a particular LTR and directed outwards from the LTR and the other of which was directed towards the LTR, designed against a unique genomic locus located at a distance of 70 to 300 bp from the LTR 3' end. Seminoma first-strand cDNAs were used as templates. Transcript levels were measured relative to the housekeeping beta-actin gene transcript level. For a sample of 20 HS LTRs, the frequencies of ELT occurrence in the seminoma correlated linearly with RT-PCR-measured transcript levels (Table 2), with a correlation coefficient of 0.92, as shown previously for a testicular parenchyma library (7). Such a correlation suggests that in this case, GREM was adequate for quantitative characterization of LTRs displaying promoter activity.
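The linear relationship described above is summarized by a Pearson correlation coefficient. A minimal sketch of that calculation follows; the paired values in the usage example are invented placeholders and not the measurements reported in Table 2.

```python
# Illustrative sketch only: the example data are hypothetical, not from Table 2.
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equally long sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


if __name__ == "__main__":
    # Hypothetical ELT counts and RT-PCR levels (% of beta-actin) for five LTRs.
    elt_counts = [1, 2, 5, 9, 20]
    rt_pcr_levels = [0.01, 0.03, 0.06, 0.1, 0.3]
    print(round(pearson_r(elt_counts, rt_pcr_levels), 3))
```

A coefficient close to 1 indicates that ELT frequency tracks the RT-PCR-measured transcript level across the sampled LTRs.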
TABLE 2. Relative LTR transcript levels and frequencies of occurrence of the corresponding ELTs in seminoma and normal testicular parenchyma libraries
TABLE 3. Relative ELT contents for promoter-active HS LTRs transcribed in normal testicular parenchyma and seminoma
D
35 kb (24 elements); C3, HS elements located within gene introns or with D values of <5 kb (40 representatives); and C4, HS elements within exons of known non-LTR-promoted human cDNAs, and thus partly or wholly readthrough transcribed (12 representatives). For the last group, GREM makes it possible to detect promoter-active LTRs. Detailed information about LTR localization, neighboring genes, and mapped cDNAs is given in Table S1 in the supplemental material.
FIG. 2. Proportions of promoter-active LTRs in four groups differing by the distance of the LTRs from known human genes or mapped cDNAs. (A) LTRs were grouped according to their distances from known human genes into four categories (C1 to C4) (see the text for details). The relative content of promoter-active LTRs in a group was calculated as the ratio of the number of LTRs in the group, for which the corresponding ELTs were obtained, to the total number of all LTRs in the group. (B) LTRs which were promoter active in at least one of the two tissues studied. (C) LTRs which were promoter active both in testicular parenchyma and in the seminoma. Averages and standard errors of the means (error bars) are presented.
1.8-fold greater (63%). For LTRs mapped within gene introns or in close proximity to genes (group C3), the ratio was slightly higher (68%). Finally, the largest proportion (75%) of promoter-active elements was observed for LTRs within exons (group C4). Figure 2C shows the group distribution for promoter-active LTRs functioning both in the seminoma and in normal parenchyma. The proportions of such "ubiquitously" transcribed LTRs in groups C1, C2, C3, and C4 were different, i.e., 10, 29, 20, and 67%, respectively.
It can therefore be concluded that (i) the relative content of promoter-active LTRs in gene-rich regions is significantly higher than that in gene-poor genomic loci, (ii) this content is maximal for the HS elements from those regions where promoter-active LTRs "overlap" readthrough transcripts (group C4), and (iii) LTRs of group C4 most frequently serve as promoters in both tissues. At present, we cannot explain the clearly enhanced promoter activity of the group C4 representatives. This effect might suggest that exon regions are more accessible to transcription factors than other genomic DNA.
Quantitative analysis shows that LTR promoter activities differ considerably, depending on the genomic neighborhood and the LTR status (solitary or proviral). Counting of ELTs can be used to estimate the promoter activities of individual HS family members. By definition, promoter strength is the number of transcription initiation events per given time period. The GREM approach was used here to quantify the polyadenylated RNAs produced due to LTR promoter activity. Apart from promoter strength, the content of this RNA may also depend on other factors, such as RNA transfer from the nucleus, RNA stability, or polyadenylation. Since we are unable to estimate the contributions of these factors, the terms "promoter strength" and "promoter activity" should be understood as operational definitions throughout this report.
ELT counts revealed quite different promoter activities (see below) for solitary and proviral LTRs, with the difference depending on the genomic neighborhood. The level of 5'-proviral LTR expression could not be measured properly for the reasons mentioned above, so we focused on quantitative analysis of 3'-proviral and solitary LTR promoter activities in the four groups of HS elements (C1 to C4). The relative promoter strength of a group of HS elements was calculated as the ratio of the relative content of the corresponding ELTs in the pool of all ELTs (except for those corresponding to 5'-proviral LTRs) to the relative content of the HS elements of this group (except for 5'-proviral LTRs) among HS elements of all groups (except for 5'-proviral LTRs).
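The ratio defined above can be written out as a short calculation. In the sketch below, the group labels follow the text (C1 to C4), but the function name and all counts are hypothetical and serve only to show the arithmetic; 5'-proviral LTRs are assumed to have been excluded from both inputs beforehand.

```python
# Illustrative sketch only: all counts are hypothetical placeholders.
def relative_promoter_strength(elt_counts, element_counts):
    """Relative promoter strength per group.

    elt_counts: number of ELTs assigned to each group.
    element_counts: number of HS elements belonging to each group.
    For each group, returns (group share of all ELTs) divided by
    (group share of all HS elements).
    """
    total_elts = sum(elt_counts.values())
    total_elements = sum(element_counts.values())
    strengths = {}
    for group, n_elements in element_counts.items():
        elt_share = elt_counts.get(group, 0) / total_elts
        element_share = n_elements / total_elements
        strengths[group] = elt_share / element_share
    return strengths


if __name__ == "__main__":
    # Hypothetical example with two groups of equal size.
    elts = {"C1": 10, "C2": 30}
    elements = {"C1": 50, "C2": 50}
    print(relative_promoter_strength(elts, elements))
```

A value above 1 means that a group contributes more ELTs than expected from its share of HS elements, and a value below 1 means that it contributes fewer.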
The diagrams in Fig. 3 show that 3'-proviral LTRs displayed similar transcriptional patterns in both tissues, with a low transcript level for group C1 members, a sharp 30- to 60-fold increase of this level for group C2, a relatively low level for group C3, and finally, an approximately 2.5- to 5-fold increase for group C4 promoter-active HS elements located within exons. Solitary LTRs displayed different profiles: their average promoter activity was low for group C1 (LTRs located far from genes), moderate for groups C2 and C3 (closer to genes, or intronic locations), and finally, increased four- to sixfold for group C4. The maximal promoter activity of solitary LTRs is characteristic of the group C4 elements located within exons.
FIG. 3. Relative promoter strengths of 3'-proviral (gray) and solitary (black) LTRs grouped according to their distances from genes (groups C1 to C4) (see the text for details). (A) Testicular parenchyma; (B) seminoma. Averages and standard errors of the means (error bars) are presented.
90% of intronically located HS LTRs are inserted in the reverse orientation relative to the gene transcription direction and that their transcription could therefore create a pool of regulatory interfering RNAs (11). One more conclusion is that group C4 HS elements, whose transcripts "overlap" human readthrough RNAs, are enriched in promoter-active elements, thus again suggesting an interplay of readthrough and LTR-directed transcription. We further tried to compare average promoter activities for the 5'-proviral, 3'-proviral, and solitary LTR types (Fig. 4). The relative average promoter strength of a group of HS elements was calculated as the ratio of the relative content of the corresponding ELTs in the pool of all ELTs to the relative content of the HS elements of this group among HS elements of all groups. The results (Fig. 4) demonstrated that average promoter strengths of solitary and 3'-proviral LTRs were almost equal in both tissues under study. The promoter strength of 5'-proviral LTRs was approximately twofold higher in testicular parenchyma and approximately fivefold higher in the seminoma, in accord with extensive previous data in favor of an upregulation of HERV-K (HML-2) proviral gene expression in germ cell line tumors. It can be assumed that the proviral sequences contain some so far uncharacterized downstream regulatory elements that provide significantly more 5'-LTR expression, especially in the seminoma.
FIG. 4. Comparison of relative promoter strengths of solitary, 5'-proviral, and 3'-proviral LTRs (per LTR). Gray and black bars represent the relative LTR promoter strengths in the testicular parenchyma and seminoma, respectively. For details of relative promoter strength calculation, see the text. Averages and standard errors of the means (error bars) are presented.
TABLE 4. Relative transcript levels of HS LTRs and closely located human genes in testicular parenchyma and seminoma
33% of all ELTs), which was greatly overexpressed in parenchyma and was transcribed at a sixfold lower (yet still rather high) level in the seminoma, revealed that provirus 99 was situated between two known human genes, i.e., 7 kb upstream of the LIPH1 gene, encoding a membrane-bound lipase precursor, and 12 kb upstream of the SENP2 gene, encoding a SUMO1-specific protease (Fig. 5). RT-PCR experiments demonstrated that SENP2 was transcribed in both tissues at a relatively high level of 0.4% of the beta-actin transcript level, whereas LIPH1 was upregulated in testicular parenchyma and significantly downregulated in the seminoma (0.2 and 0.02% of the beta-actin transcript level, respectively). Such a strong proviral 3'-LTR promoter activity might be due to the regulatory elements of both genes. SENP2 could provide a strong basal expression level, whereas LIPH1 could be responsible for the tissue specificity of the expression. Alternatively, HS element 99 and the LIPH1 gene could be colocalized within the same chromatin domain distinct from that containing SENP2. On the other hand, the observed SENP2 and LIPH1 transcription profiles could be significantly affected by numerous regulatory sequences of provirus 99 itself. We therefore concluded that multiple, sometimes contradictory, scenarios may take place in the transcriptional regulation of HS elements.
FIG. 5. Schematic representation of HS element 99 localization relative to its LIPH1 and SENP2 gene neighbors and their transcript levels in testicular parenchyma and seminoma.
1,000-fold (Fig. 6) among expressed individual HS family members, from hardly detectable to levels comparable to those of housekeeping gene transcription. The high expression levels of certain LTRs capable of driving the transcription of host nonrepetitive genomic sequences in human tissues clearly suggest the possibility of their involvement in the formation of new functional genes and/or antisense regulation of preexisting genes.
FIG. 6. Relative transcript levels of some human genes and LTRs. Relative transcript levels of randomly chosen HS LTRs and those of known human genes were measured using RT-PCR. (A) Testicular parenchyma; (B) seminoma.
0.001 to 3% of the housekeeping beta-actin gene transcript level. Although HS elements formed several subclusters on a phylogenetic tree (5), no clear correlation between LTR primary structure and transcriptional activity was found in this study. In contrast, the LTR status (solitary or 5' or 3' proviral) was an important factor affecting LTR activity, as the promoter strengths of solitary and 3'-proviral LTRs were almost identical in both tissues, whereas 5'-proviral LTRs displayed higher promoter activities (approximately twofold and fivefold greater in the testicular parenchyma and seminoma, respectively). These data suggest that a proviral sequence harbors some as yet unknown downstream regulatory elements that provide significantly more 5'-LTR expression, especially in seminomas. Another important factor affecting promoter activity was the LTR distance from genes: the relative content of promoter-active LTRs in gene-rich regions was significantly higher than that in gene-poor genomic loci. Interestingly, in both tissues, this content was maximal for HS elements from those regions where promoter-active LTRs "overlapped" with readthrough transcripts; this effect might suggest that exon regions are more accessible to transcription factors than other genomic loci. It should be mentioned that all HS elements overlapped with non-protein-coding regions of the corresponding transcripts. The observed preferential expression of "exonic" LTRs might be due to neighboring regulatory sequences, which are frequently present in untranslated exons. A detailed explanation of this phenomenon is a matter for further study. Our data also suggest a selective suppression of transcription in both tissues for proviral 3'-LTRs located in gene introns. Such a transcriptional suppression might be aimed at silencing the proviral gene expression in gene-rich regions. In testicular parenchyma, the promoter strengths of intronically located solitary LTRs were also significantly decreased.
This may suggest an as yet unknown mechanism(s) for selective suppression of "extra" promoters generated due to mutations or viral integrations and located within gene introns or very close to genes. Such a mechanism might minimize possible destructive effects of undesirable transcription. Many transcriptionally competent LTRs were mapped near known human genes, and as many as 86 to 90% of all genes located in close proximity to promoter-active LTRs are known to be transcribed in the testis. However, in general, no clear-cut correlation was observed between transcriptional activities of genes and closely located LTRs. The high expression levels of certain LTRs located in human gene introns might suggest the possibility of their involvement in antisense regulation of preexisting genes.
Finally, this is the first quantitative and qualitative comprehensive characterization of human promoters provided by a small particular group of endogenous retroviruses. An overwhelming majority of retroviral sequences, which occupy up to 8% of the human genome, still remain a subject of further investigations.
This work was supported by Russian Foundation for Basic Research grants 05-04-48682-a and 2006.20034, by grant MK-2833.2004.4 from the President of the Russian Federation, and by the Molecular and Cellular Biology Program of the Presidium of the Russian Academy of Sciences.
Supplemental material for this article may be found at http://jvi.asm.org/.