Previous Article | Next Article 
Journal of Virology, May 2006, p. 4640-4642, Vol. 80, No. 9
0022-538X/06/$08.00+0 doi:10.1128/JVI.80.9.4640-4642.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Newly Identified Families of Human Endogenous Retroviruses

LETTER
Human endogenous retroviruses (HERVs) make up approximately
8.3% of the human genome (
12). HERVs have previously been classified
into 31 distinct families based upon sequence alignment of reverse
transcriptase (RT) and envelope domains and subsequent phylogenetic
analyses (
1,
9,
16). Using the data mining program LTR_STRUC
(
13) in conjunction with conventional sequence homology techniques,
we recently completed an analysis of chimpanzee long terminal
repeat (LTR) retrotransposon families (unpublished data). Since
LTR_STRUC searches for LTR retrotransposons based on structure
(e.g., the presence of LTRs, target site duplications, tRNA
binding sites, etc.) rather than homology, elements can be identified
that go undetected in traditional BLAST searches. We identified
nine chimpanzee LTR retrotransposon families that are orthologous
to HERV families not previously identified. These nine newly
discovered HERV families are described and characterized in
this letter.
LTR retrotransposons and retroviruses are grouped into three major classes (14). Class I contains elements related to gammaretroviruses, class II elements are related to betaretroviruses, and class III elements are related to spumaviruses. The RT-based phylogeny indicates that all the newly identified HERVs described here are class I elements (Fig. 1). The detailed characteristics of each of the newly discovered HERV families are presented in Table 1 and Table 2. All are low-abundance families, being composed of only one to seven full-length members with low homology to previously identified HERVs. This may, in part, explain why they have not been previously identified. The newly discovered full-length elements are of standard HERV length (7,198 to 10,675 bp with 359- to 682-bp LTRs) and display typically sized target site duplications (4 or 5 bp). With the exception of a few mutated copies, the newly identified elements have the same canonical dinucleotides terminating the LTRs as previously characterized HERVs (TG/CA). Since LTR_STRUC can only identify elements having two LTRs, we conducted BLAST searches by using identified full-length elements as query sequences to identify solo LTRs and other fragmented elements. Consistent with what has been reported previously for other HERV families (15), we have found that each of the newly identified families is represented by significantly more solo LTRs and fragmented sequences than full-length elements (Table 1).
Because HERV LTRs are synthesized from the same RNA template
during reverse transcription, they are identical in sequence
at the time of integration (
2). Using a primate pseudogene nucleotide
substitution rate of 0.16% divergence/million years (
5,
8,
10),
the relative integration time or age of any full-length HERV
can be estimated from the level of sequence divergence existing
between the element's 5' and 3' LTRs. Using this method, the
estimated ages of the new families of HERVs described here range
from 18.0 to 49.5 million years, indicating that members of
these families have not been transpositionally active in the
primate lineage since well before chimpanzees and humans diverged
from a common ancestor (6 million years ago) (
4). Although caution
must be taken when using LTR divergence to estimate the ages
of individual elements because of confounding processes such
as recombination and conversion (e.g., see references
6 and
7), the method is able to provide useful age estimates, at least
to a first approximation (e.g., see reference
3). Our estimated
ages of the newly identified human elements fall within the
median range of previously described families of HERVs (
16).
The possible contribution of these newly identified LTR retrotransposons
to primate gene or genome evolution is currently under investigation.

ACKNOWLEDGMENTS
This research was supported by a grant from the Georgia Institute
of Technology Research Foundation.

REFERENCES
1 - Benit, L., P. Dessen, and T. Heidmann. 2001. Identification, phylogeny, and evolution of retroviral elements based on their envelope genes. J. Virol. 75:11709-11719.[Abstract/Free Full Text]
2 - Boeke, J. D., and J. P. Stoye. 1997. Retrotransposons, endogenous retroviruses, and the evolution of retroelements, p. 343-435. In J. M. Coffin, S. H. Hughes, and H. Varmus (ed.), Retroviruses. Cold Spring Harbor Laboratory Press, Plainview, N.Y.
3 - Bowen, N. J., and J. F. McDonald. 2001. Drosophila euchromatic LTR retrotransposons are much younger than the host species in which they reside. Genome Res. 11:1527-1540.[Abstract/Free Full Text]
4 - Chimpanzee Sequencing and Analysis Consortium. 2005. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437:69-87.[CrossRef][Medline]
5 - Costas, J., and H. Naveira. 2000. Evolutionary history of the human endogenous retrovirus family ERV9. Mol. Biol. Evol. 17:320-330.[Abstract/Free Full Text]
6 - Hughes, J. F., and J. M. Coffin. 2005. Human endogenous retroviral elements as indicators of ectopic recombination events in the primate genome. Genetics 171:1183-1194.[Abstract/Free Full Text]
7 - Johnson, W. E., and J. M. Coffin. 1999. Constructing primate phylogenies from ancient retrovirus sequences. Proc. Natl. Acad. Sci. USA 96:10254-10260.[Abstract/Free Full Text]
8 - Jordan, I. K., and J. F. McDonald. 2002. A biologically active family of human endogenous retroviruses evolved from an ancient inactive lineage. Genome Lett. 1:1-5.
9 - Jurka, J. 2000. Repbase update: a database and an electronic journal of repetitive elements. Trends Genet. 16:418-420.[CrossRef][Medline]
10 - Kapitonov, V., and J. Jurka. 1996. The age of Alu subfamilies. J. Mol. Evol. 42:59-65.[CrossRef][Medline]
11 - Kumar, S., K. Tamura, and M. Nei. 2004. MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Briefings Bioinformatics 5:150-163.[Abstract/Free Full Text]
12 - Lander, E. S., L. M. Linton, B. Birren, C. Nusbaum, M. C. Zody, J. Baldwin, K. Devon, K. Dewar, M. Doyle, W. FitzHugh, et al. 2001. Initial sequencing and analysis of the human genome. Nature 409:860-921.[CrossRef][Medline]
13 - McCarthy, E. M., and J. F. McDonald. 2003. LTR_STRUC: a novel search and identification program for LTR retrotransposons. Bioinformatics 19:362-367.[Abstract/Free Full Text]
14 - Smit, A. F. 1999. Interspersed repeats and other mementos of transposable elements in mammalian genomes. Curr. Opin. Genet. Dev. 9:657-663.[CrossRef][Medline]
15 - Stoye, J. P. 2001. Endogenous retroviruses: still active after all these years? Curr. Biol. 11:R914-R916.[CrossRef][Medline]
16 - Tristem, M. 2000. Identification and characterization of novel human endogenous retrovirus families by phylogenetic screening of the human genome mapping project database. J. Virol. 74:3715-3730.[Abstract/Free Full Text]
| | | | | |
Nalini Polavarapu
Nathan J. Bowen
John F. McDonald*
School of Biology Georgia Institute of Technology 310 Ferst Dr. Atlanta, GA 30332-0230
|
| | | | | |
* Phone: (404) 385-6631, Fax: (404) 894-0519, E-mail: john.mcdonald{at}biology.gatech.edu |
Journal of Virology, May 2006, p. 4640-4642, Vol. 80, No. 9
0022-538X/06/$08.00+0 doi:10.1128/JVI.80.9.4640-4642.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.