Previous Article | Next Article 
Journal of Virology, April 2000, p. 3715-3730, Vol. 74, No. 8
0022-538X/00/$04.00+0
Copyright © 2000, American Society for Microbiology. All rights reserved.
Identification and Characterization of Novel Human
Endogenous Retrovirus Families by Phylogenetic Screening of the Human
Genome Mapping Project Database
Michael
Tristem*
Department of Biology, Imperial College,
Silwood Park, Ascot, Berkshire SL5 7PY, United Kingdom
Received 19 October 1999/Accepted 25 January 2000
 |
ABSTRACT |
Human endogenous retroviruses (HERVs) were first identified almost
20 years ago, and since then numerous families have been described. It
has, however, been difficult to obtain a good estimate of both the
total number of independently derived families and their relationship
to each other as well as to other members of the family
Retroviridae. In this study, I used sequence data derived from over 150 novel HERVs, obtained from the Human Genome Mapping Project database, and a variety of recently identified nonhuman retroviruses to classify the HERVs into 22 independently acquired families. Of these, 17 families were loosely assigned to the class I
HERVs, 3 to the class II HERVs and 2 to the class III HERVs. Many of
these families have been identified previously, but six are described
here for the first time and another four, for which only partial
sequence information was previously available, were further
characterized. Members of each of the 10 families are defective, and
calculation of their integration dates suggested that most of them are
likely to have been present within the human lineage since it diverged
from the Old World monkeys more than 25 million years ago.
 |
INTRODUCTION |
Like those of other vertebrates, the
human genome contains evidence for past infection by many different
kinds of retroviruses (10, 27, 33, 61). Retroviral
integration usually occurs within somatic cells, but occasionally such
events take place within germ line cells, and when this occurs the
retroviral sequences are passed vertically from parent to offspring
(57). Such endogenous retroviruses generally remain
replication competent until inactivated either by recombinational
deletion between two repeat regions (termed long terminal repeats
[LTRs]) situated at the 5' and 3' ends of the virus or by random
mutation which occurs while the host genome is undergoing DNA
replication (10, 53). During the period between insertion
and inactivation, the viral copy number may increase via
retrotransposition to different locations within the genome
(53). The vertical transmission of these elements can occur
over long periods of time; several replication-competent porcine
retroviruses probably first infected their hosts more than 5 million
years ago, and a number of defective endogenous human retroviruses are
thought to have been present in the primate lineage for tens of
millions of years (2, 8, 31, 43, 50).
Many types of human endogenous retroviruses (HERVs) have been
characterized previously, and they have been classified into different
groups, or families, partly on the basis of their sequence identity and
partly according to the similarity of their primer binding sites (PBSs)
to host tRNAs. Thus, members of the HERV.H family contain a PBS with a
sequence similar to a region of tRNAHis, whereas the HERV.E
family is primed by tRNAGlu. Despite the large amount of
data available, the classification of the many different HERV families
within an overall phylogenetic framework has been hampered for several
reasons: (i) some highly divergent retroviruses are primed by the same
type of tRNA; (ii) many HERV families have not been fully
characterized, and the sequence information that has been reported is
often derived from different genomic regions, making interfamily
comparisons problematic; and (iii) the relative lack of sequence
information on other host taxa has made it difficult to distinguish
between genuinely monophyletic HERV families and polyphyletic families
that appear monophyletic only because similar viruses in other hosts
have not yet been described.
Recently these problems have been lessened both by the systematic
isolation of endogenous retroviruses from many different vertebrate
taxa and by the generation of large amounts of sequence data by the
Human Genome Mapping Project (HGMP) (7, 16, 32). As of
December 1998, sequence information was available for over 10,000 BACs,
or cosmids, representing approximately 235,000,000 bp, or 7% of the
human genome (5).
In this study, I investigated the relationships of the known HERV
families to each other and to other nonhuman retroviruses, described
and characterized six novel HERV families, and further characterized an
additional four families for which only partial sequence information
was previously available.
 |
MATERIALS AND METHODS |
Identification of HERVs within sequence data banks.
HERV
sequences were obtained from the EMBL/GenBank/DDBJ database at the
beginning of December 1998. Initially the computer data banks were
screened by BLAST search (1) with part of the reverse
transcriptase (RT) proteins (domains 1 to 7 described by Xiong and
Eickbush [62]) from a number of distinct retroviral groups. These included several endogenous human retroviruses, such as
HERV.L (12) (accession no. X89211), HERV.I (M92067) (29), HC-2 (Z70664) (19), HERV.H (K01891)
(30), and HERV.K (M14123) (42), as well as other
highly divergent retroviruses from nonhuman hosts, such as the walleye
dermal sarcoma virus (WDSV) (L41838) and the dart poison frog
Dendrobates ventimaculatus (Dev I; X95795) (18,
56). This search strategy was expected to result in the
identification of most of the endogenous human retroviral sequences
within the data bank which still encoded the appropriate region (in the
sense that there had been no large postintegration deletion event) of
the RT protein.
However, to confirm that this was indeed the case, a second method of
screening was also performed. The endogenous human sequences identified
during the first screening procedure were aligned, and a phylogeny was
then constructed by the neighbor-joining approach with the program
PAUP4d64 (written by D. L. Swofford). A representative from each
of the monophyletic groups of HERVs present in the resultant phylogeny
was then used in a BLAST search. The endogenous human viruses recovered
by these searches were then examined (and added to the data set if they
had not been identified previously) in descending order of similarity
until representatives of the phylogenetic sister group (according to
the neighbor-joining tree) of HERVs were encountered. An additional
nine sequences were identified during this second screening.
The last group of HERVs included in the data set were the prototype
members of each of the many previously identified HERV
families
(including those reviewed by Wilkinson et al. [
61]
and
by Boeke and Stoye [
10]). When this was not possible
(either
because the known members of the families were defective in the
region of
pol used or because the sequence of this region
was
not available), BLAST searches were performed, with an alternative
region of the incomplete HERV being used as the probe; the most
closely
related cosmid sequences were then identified and recorded.
Since these
best- or closest-match cosmid clones also encoded
the appropriate
region of polymerase (Pol), it was possible to
estimate the probable
phylogenetic location of each incomplete
HERV family in subsequent
analyses.
Occasionally, cosmids with different accession numbers contained
identical HERV sequences; however, in all cases, further
investigation
(using the chromosome and map information attached
to each data bank
submission) showed that the cosmids were originally
derived from the
same location within the human genome, and thus
one sequence was
excluded from subsequent
analyses.
Alignment and phylogenetic reconstruction.
The HERV-derived
RT sequences (162 in total, consisting of 152 sequences obtained from
the HGMP and 10 prototypic HERV family sequences) were then aligned
with a representative sample of nonhuman endogenous virus sequences (66 in total) as described by Xiong and Eickbush (62). The total
number of sequences included in this alignment was therefore 228. Neighbor-joining trees were generated with PAUP4d64, using the protpars
matrix (14) and all 228 elements in the data set. Using the
same matrix, bootstrap values were obtained from 1,000 replicates.
Because of the long computation time periods required, the construction
of maximum-parsimony trees by using the standard simple
or random
addition option was not practical. Instead, an alternative
search
strategy was employed. The data set was first reduced,
or pruned, by
using the output from the bootstrapped neighbor-joining
trees, which
indicated that in the phylogeny there were several
well-supported
terminal clusters of HERVs containing up to 42
members. When these
clusters were supported by at least 95% of
the bootstrapped
neighbor-joining replicates, they were pruned
by removing all but three
of the taxa. This pruning resulted in
the reduction of the number of
taxa in the data set from 228 to
134. All of the
maximum-parsimony-derived phylogenies were constructed
using this
reduced number of taxa and a search strategy described
by Quicke et al.
(D. Quicke, J. Taylor, and A. Purvis, submitted
for publication) as
follows. The data set was first subjected
to 7,500 random additions by
using an unordered matrix with tree
bisection and reconnection, holding
one tree in memory during
each replicate. The shortest tree was then
used in a heuristic
search, with all optimal trees being saved; this
resulted in the
identification of a further 1,200 trees of the same
length. This
pool of minimum trees was then employed to reweight the
data matrix,
using the rescaled consistency index. Searches for minimum
trees
then continued with the reweighted data matrix. The minimum
reweighted
tree was identified and used as an input tree for another
search
in which the characters were again weighted to unity. Several
rounds of reweighting followed by weighting to unity were performed;
there was no further reduction in tree
length.
Maximum-parsimony bootstrap replicates (100 in total) were subjected to
100 random additions by tree bisection and reconnection,
with one tree
being held in memory during each replicate. Although
shorter trees
would have been obtained if each bootstrap replicate
had been subjected
to a larger number of random-addition replicates
(instead of 100 random
additions per bootstrap replicate used),
the computation time would
have been excessive. However, it was
probable that each of these
bootstrapped trees was within a few
steps of the minimum possible
(unpublished
results).
Calculation of integration dates.
The percentages of
divergence between pairs of LTR sequences were calculated using their
entire length, excluding regions containing deletions. These divergence
figures were then corrected to account for the presence of multiple
mutations at the same site, back mutations, and convergent
substitutions, using the two-parameter model (21). Two
estimates of the rate of change of the host genome were calculated,
2.1 × 10
9 and 1.3 × 10
9 per
synonymous site per year, and hence two estimates of the integration
date are provided. The first rate was based on a comparison of the
percentage of divergence (11.6%) of synonymous sites in six genes in
Old World monkeys and humans (25), whereas the second (7.3%
divergence) used
-globin pseudogene and total single-copy DNA
cross-hybridization data (25, 37). Both assumed a divergence date of humans and Old World monkeys of 27.5 million years ago (47).
 |
RESULTS |
HERV identification and data set construction.
A total of 152 novel HERV sequences were identified by screening the EMBL/GenBank/DDBJ
nonredundant data banks. Two rounds of screening were performed; the
first was based on similarity searches using part of the RT proteins
derived from previously identified retroviruses, and the second was
done with several of the newly identified HERVs themselves as the
probes. The HERVs were named according to the cosmid or BAC in which
they were identified, and multiple HERVs situated within the same
vector were given an additional letter-based designation.
An alignment, based on part of the retroviral RT protein, of the novel
HERV sequences and a representative sample (66 in total)
of previously
identified nonhuman endogenous retroviruses was
performed. The
prototypic members of several HERV families were
also added to the data
set, namely HERV.L (
12), ERV-9 (
23),
HERV.I
(
29), HERV.H (
30), HERV.HML6 (
36),
HERV.K (
42),
HERV.W (
46), and HERV.E
(
48). Unfortunately, there is a lack
of sequence information
on some regions of the genomes of members
of other putative HERV
families (Table
1). It was therefore not
possible to include most of them (with the exceptions of HERV.ADP
[
28] and HERV.FRD [
49], which contain
the appropriate region
of
pol) directly in the analyses
presented here. Instead, the
available sequence information from each
of these partially characterized
HERV families (ERV.1
[
11], RRHERV.I [
20], HERV.P
[
22], Hs5
[
24], HERV.HML1-5
[
35], HERV.R [
40], HERV.FTD
[
49], NP-2
[
51], HERV.S71
[
59], and HERV.XA [
60]) was used to
identify
the novel HERV-containing cosmid with the highest level of
sequence
similarity included in the data set, as shown in Table
2.
Sequence identity between the best-match HGDB sequence and the
incomplete HERV sequences ranged from 82% (for cosmid
AC004609 and
ERV-1) to 99% (for cosmid
AC002069 and HERV.P). In two
cases, more
than one HERV family had a best match with the same
cosmid-derived HERV
sequence: ERV.1 and HERV.R with the same region
of
AC004069, and
HERV.HML1 and -2 with the same region of cosmid
Z70820 (Table
2).
Phylogenetic analysis and nomenclature.
The alignment was
subjected to phylogenetic analysis by both the neighbor-joining and
maximum-parsimony approaches. Neighbor-joining trees were constructed
using a data set consisting of 228 taxa (162 of which were of human
origin [Table 3]). Due
to the very long computation time periods necessitated,
maximum-parsimony analyses utilized a smaller data set, numbering 134 taxa, of which 68 were of human origin (Table 3 shows which subset of
HERV sequences were excluded from the maximum-parsimony analyses).
View this table:
[in this window]
[in a new window]
|
TABLE 3.
Location and classification of the HGMP-derived and
prototype HERVs used in the present phylogenetic analyses
|
|
HERV families arise via a single horizontal transmission event followed
by germ line integration and fixation within the host
population. The
copy number can then increase by retrotransposition
or reinfection, and
it is probable that most members of a particular
family are separated
by only a few rounds of viral replication
(although they may well be
separated by many rounds of host DNA
replication) (
10).
Phylogenetic reconstruction of retroviral
phylogenies containing
HERV-derived sequences would therefore
be expected to show
well-supported clusters of such elements,
with members of each HERV
cluster being derived from the same
family. Trees generated by both the
neighbor-joining and maximum-parsimony
approaches were broadly
consistent with this hypothesis, showing
numerous well-supported HERV
lineages scattered across the phylogeny
(Fig.
1).


View larger version (4035K):
[in this window]
[in a new window]
|
FIG. 1.
Phylogenetic analyses of a 159-residue region of
retroviral RT proteins. The trees were rooted on several
gypsy LTR-retrotransposon sequences. To save space, multiple
taxon names in some well-supported terminal clusters are not indicated.
Instead, the number of taxa that actually clustered in that position
are indicated on the taxon label. (a) Neighbor-joining tree with branch
lengths proportional to the degree of divergence between the sequences.
Figures on each branch represent percentage bootstrap support from
1,000 replicates; asterisks show support of at least 95%. HERVs are
indicated by black circles. Novel HERV families are boxed. Previously
described HERV families are shaded gray if the sequence of the
pol gene was not available, and they were classified
according to their closest cosmid matches (see Table 1). Elements in
parentheses are likely to cluster with the adjacent family. Primer
sites indicate the tRNA to which the viral PBS is most similar; these
are boxed when this similarity is first reported in this study. (b)
Strict consensus of 1,200 maximum-parsimony trees. The figure is
labeled as in panel a except that branch lengths are not proportional
to the divergence between the taxa. Also note that in contrast to panel
a, maximum-parsimony data sets were pruned prior to analysis to reduce
computation times. Thus, the first figure presented on some of the
taxon labels represents the actual number of elements included in the
analysis, whereas the number in parentheses represents the total
estimated number that would cluster with the particular family if the
additional elements had also been included in the data set (based on at
least 95% bootstrap support by the neighbor-joining method; see
Materials and Methods).
|
|
Under ideal circumstances, with the complete sampling of retroviral
diversity across all hosts in which they occur, the total
number of
HERV families could be calculated simply by counting
the number of HERV
lineages intermingled with the nonhuman retroviruses
in the phylogeny
(in effect, this counts the number of retroviral
host switches into the
human lineage). However, this is not the
case; there is in fact much
better sampling of endogenous retroviruses
from humans than
from other vertebrate taxa. These differences
in sampling
efficiency mean that some HERVs may appear to belong
to the same family
(i.e., they cluster together after phylogenetic
reconstruction) simply
because closely related viruses in nonhuman
hosts have yet to be
identified. Thus, in order to obtain a reasonable
estimate of the
actual number of HERV families, several assumptions
about their
evolution had to be
made.
First, it was assumed that HERVs with alternative PBS homologies (even
if they clustered together in the phylogenies) were
derived from
separate cross-species transmission events and were
therefore
independent families. For example, in Fig.
1, HERV.P,
HERV.W, and ERV-9
(primed by tRNA
Arg) all cluster together with robust
bootstrap support, but because
they all have different PBS homologies,
they are here regarded
as three separate families. This is consistent
with previous reports
(
46,
61).
A second assumption was that HERVs encoding the same PBS which were
polyphyletic with respect to viruses in nonhuman hosts
also represented
separate families. Thus, in Fig.
1a, HERV.K and
HERV.HML6 are
classified as separate families because they form
a polytomy with
RV-Bower bird (i.e., the presence of only a few
nonhuman viruses in
this region of the phylogeny was enough to
split up the HERV.K and
HERV.HML6 lineages in some trees, and
the addition of further nonhuman
viruses would be likely to split
them in all trees). The fourth member
of the polytomy, HERV.HML5,
has an alternative PBS and is therefore
already regarded as an
independent
family.
Last, it was assumed that HERV families which were paraphyletic with
respect to nonhuman viruses or to HERVs with alternative
PBS homologies
were also independently derived. For example, in
Fig.
1b, HERV.XA,
HERV.F, and HERV.F (type b), which are all primed
by
tRNA
Phe, are paraphyletic with respect to HERV.H and thus
represent separate
families. In this case, the presence of multiple
families was
also confirmed by investigating sequence similarity
between the
LTR and
gag regions of the four families as
described
below.
The above criteria suggested the presence of 22 endogenous retroviral
families within the 7% of the human genome that has
been sequenced to
date (both neighbor-joining and maximum-parsimony
analyses gave the
same figure). Of these, 12 contained prototypic
members which have been
well characterized in previous reports
(although the
pol
genes of 4 of these 12, HERV.S71, RRHERV.I,
HERV.R, and HERV.P, were
defective or incomplete, and so these
families are represented by
closest cosmid matches [Table
2]),
4 contained prototypic members
which had generally been less well
characterized (of these, HERV.HML5
and HERV.XA are based on closest
cosmid matches), and 6 represented
novel families. A variety of
other previously characterized human
endogenous elements are also
likely to be members of one of the 22 families (they are shown
in parentheses in Fig.
1). In contrast, BLAST
searches with the
element HRES-1, which has been reported to be related
to human
T-cell leukemia virus type 1 HTLV-1 (
45), failed to
show a match
with any HERV-containing cosmid (or indeed with any HGMP
sequence),
and therefore this element is not represented. The cosmids
containing
members of each of the families identified after
phylogenetic
reconstruction are shown in Table
3.
Because HERV families have often been classified according to the
similarity of their PBSs to specific types of host tRNA,
I attempted to
identify these regions in all six of the novel
HERV families, the four
partially characterized families, and
HERV.S71 (i.e., those families
for which PBS data were not previously
available). This analysis was
performed by first identifying the
LTR sequences upstream and
downstream of the
pol sequences employed
in the phylogenies
(using the program BLAST to BLAST [
54]) and
then
comparing the sequence immediately 3' to the 5' LTR against
a tRNA
sequence database (
52). From this approach it was apparent
that the partially characterized HERV families are primed by a
variety
of tRNAs, as follows: HERV.XA by tRNA
Phe, HERV.ADP
(probably) by tRNA
Thr, HERV.FRD by tRNA
His, and
HERV.HML5 by tRNA
Ile (Table
4). I was unable to find any 5' LTRs in
members of the
HERV.S71 family (the only other previously characterized
family
for which PBS data were not available). Despite the new data on
the PBS homologies of these families, their nomenclature has been
left
unchanged in this report.
Of the six novel HERV families, two were primed by tRNA
Phe
and two others were primed by tRNA
Ser and
tRNA
Arg and they have therefore been termed HERV.F, HERV.F
(type b),
HERV.S, and HERV.R (type b); an independently derived HERV.R
family
has been described previously by O'Connell and Cohen
(
41). There
were no obvious 5' LTR sequences in the two
remaining families,
and thus they have been designated HERV.Z69907 and
HERV.HS49C23
after the cosmids in which the prototypic members are
located.
HERV families have often been broadly divided into two classes, with
class I HERVs being related to the mammalian type C retrovires,
exemplified by feline leukemia virus (FeLV) and gibbon ape leukemia
virus (GaLV), and class II HERVs being most similar to the mammalian
type B and D retroviruses or avian leukosis viruses, such as mouse
mammary tumor virus (MMTV), simian retrovirus type 1 (SRV-1),
or Rous
sarcoma virus (RSV) (
10,
61). Recently, the presence
of a
third HERV class (class III) has been proposed (
3,
26)
based
on the similarity of HERV.L to spumaviruses such as the
human
spumavirus (HSV) (
12). Although the topologies of the
trees
shown in Fig.
1 differed according to the method of reconstruction
(largely due to the previously identified weak support across
the
backbone of the retroviral phylogeny [
16]), it is
clear
from this analysis that very few of the HERV families are
actually
closely related to these retroviral genera. In particular,
both
the HERV.L and HERV.S families appear to be only distantly related
to the spumaviruses, and the HERV.HS49C23 family clusters with
several groups of viruses isolated from nonmammalian vertebrates
rather
than with the type C retroviruses. Furthermore, no HERV
family appears
to be most closely related to either the previously
identified
mammalian type B and D retroviruses or the avian leukosis
viruses.
Instead, the HERVs in this region of the tree tended
to cluster with
endogenous retroviral fragments derived from birds
and nonplacental
mammals (although the exact relationships revealed
differed depending
to the method of phylogenetic
reconstruction).
Characterization of novel HERV families.
Prototypic members of
each of the novel HERV families were investigated further in order to
provide some background information on the general properties of these
elements. Although it was difficult to identify the exact 5' and 3'
ends of the gag, pol, and env genes
(due to small insertions or deletions and in-frame stop codons), their
presence or absence could still be established by the identification of
certain motifs conserved among different retroviruses (58).
Furthermore, by analyzing the distances between these motifs, it was
also possible to determine whether a large deletion had occurred in the
particular gene under study.
(i) HERV.S.
The prototype member of the HERV.S family is
located on cosmid AC004385, which is derived from the X chromosome
(Table 3). It is approximately 6.7 kb in length and has a typical
retroviral structure which appears relatively intact, with no
appreciable deletions in gag, pol, or
env (Table 4). There are, however, multiple in-frame stop
codons and small deletions, indicating that the element is unlikely to
be able to express any major gene products. The five members of the
HERV.S family described in this report were found via BLAST searches of
7% of the human genome, suggesting (assuming that these elements are
more or less randomly distributed) that the copy number of this family
is at least 70 per haploid genome.
The HERV.S provirus contains typical, though relatively short, proviral
LTR structures (a 5' LTR of 317 bp and a 3' LTR of
318 bp) bounded by
the inverted terminal repeats TG and CA (see
Fig.
3a). Potential
promoter (TATAAA) and polyadenylation (AATAAA)
sequences were also apparent. The PBS and polypurine tract (the
primer sites for minus- and plus-strand DNA synthesis) immediately
follow the 5' LTR and precede the 3' LTR, respectively, with the
PBS
showing 16 of 18 matches to the 3' end of the human serine
tRNA; the
PPT is 11 bp in length. The observed percentage of divergence
between
the two LTRs was 12.4%, corresponding to a corrected divergence
(taking into account back mutations and multiple substitutions
at the
same site) of 13.6%. Because the LTRs were presumably identical
when
the element first integrated into the genome (
55), the
approximate length of time the element has been vertically passaged
can
be estimated by using this figure and the rate of change within
the
primate lineage, which I calculated as being 0.13 or 0.21%
per million
years. This gives an integration date for the HERV.S
element within
cosmid
AC004385 of between 32 and 52 million
years ago. These figures
have a large range since there is some
uncertainty over the level of
divergence between Old World monkeys
and humans (
25,
37).
Furthermore, I have assumed rate constancy
within the primate lineage,
but there is evidence suggesting that
this is not the case; it is
probable that there has been some
degree of slowdown during hominid
evolution (
4,
25).
Most retroviral
gag genes encode a short conserved region,
termed the major homology region (MHR), in the capsid protein and
a
Cys-His motif in the nucleocapsid of the form
CX
2CX
4HX
4C, which
is thought to be
involved in binding to nucleic acids (
58).
Alignments of
these regions are shown for several of the HERV
families investigated
in this study (see Fig.
4a and b). A putative
MHR was identified in the
gag gene of HERV.S, but the Cys-His
motif appears to be
absent (several different members of the family
were investigated for
its presence). Both HERV.L and murine endogenous
retrovirus type L
(MuERV.L) also contain an MHR but lack the Cys-His
motif, whereas the
spumaviruses do not appear to encode either
region (
6,
12,
58). BLAST searches using the 3' end of
the Gag protein from
AC004385 as the probe (i.e., the region
in which the Cys-His motif is
situated within other retroviruses)
demonstrated a low level of
similarity to HERV.L and MuERV.L,
but no matches were obtained with
other retroviral isolates. The
HERV.S family
pol gene was
found to have a typical retroviral
organization, encoding motifs
associated with the protease (Pro),
RT and integrase (Int) proteins but
no other gene products (Fig.
2) (see also
4c and d). BLAST searches with the region between
the end of 3' Pol and
the 3' LTR generally showed few matches,
with the exception of two
short amino acid motifs related to part
of the transmembrane proteins
of other retroviruses (Fig.
4e).

View larger version (98K):
[in this window]
[in a new window]
|
FIG. 2.
Partial RT sequence alignment of representative members
of each of the 22 independent HERV families discussed in this report.
The sequence of this region is not available for the prototype members
of some of the HERV families. In these cases, the cosmid from which the
sequence was derived is also shown. The similarity of the cosmid
sequence to the prototypic family member and its location within the
cosmid are indicated in Tables 2 and 3, respectively.
|
|
It is interesting that two of the viruses most closely related to the
HERV.S family, namely HERV.L and MuERV.L (
6,
12),
show
significant differences in genomic organization. For example,
HERV.L
and MuERV.L both lack an
env gene, which appears to be
present in HERV.S, whereas HERV.L and MuERV.L encode a dUTPase
between
pol and their 3' LTRs (
6,
12), and this gene was
absent from all members of the HERV.S family (unpublished
results).
(ii) HERV.R (type b).
Only one member of the HERV.R (type b)
family was identified during this study, in cosmid AC004045(b), which
is located at q25 on chromosome 4 (Table 3). This element is 8.7 kb in
length, has the structure LTR-gag-pol-env-LTR (Table 4),
and, like the prototypic member of the HERV.S family, does not contain
any large deletions, but it is probably incapable of replication due to the presence of in-frame stop codons and frameshifts. Because only one
member of this family was identified, any estimate of copy number is
subject to considerable uncertainty, but it will probably be low. The
HERV.R (type b) 5' LTR is 643 bp in length, and its 3' LTR is 692 bp,
with the promoter and polyadenylation sequences situated toward the 3'
end (Fig. 3a). The two
LTRs have an observed divergence of 11.4% (12.4% corrected),
corresponding to an estimated integration date of 30 to 47 million
years ago. The element contains a 17-bp PPT and shows 17 of 18 matches
to the 3' end of the mouse arginine tRNA. Two other previously
identified HERV families, ERV-9 and HERV.R, also use an arginine tRNA
primer (23, 41). The gag, pol, and
env genes all contain the expected conserved motifs (Fig. 2
and 4).

View larger version (80K):
[in this window]
[in a new window]
|
FIG. 3.
LTR sequences of five novel HERV families (a) and
four partially characterized families (b). When both the 5' and 3' LTRs
could be identified, they were aligned against each other, with dashes
representing missing residues. The PPT (before the start of the 3' LTR)
and PBSs (after the end of the 5' LTR) are underlined, as are the
direct repeats flanking each end of the element and the inverted
repeats flanking each LTR. The promoter and polyadenylation signals are
boxed. In some cases, not all these features could be identified for
each element (or they differ slightly from the expected sequence). This
is probably due to postintegration mutation. Similarly, this is also
likely to account for the observed differences between the PBS and
closest-match tRNA sequence shown below each alignment. The estimated
integration dates of each HERV are also shown.
|
|

View larger version (79K):
[in this window]
[in a new window]
|
FIG. 4.
Alignment of conserved amino acid motifs present within
the HERV families identified in this report. The figures in parentheses
indicate the cosmids from which the sequences were derived. GaLV,
Jaagsiekte, HSV, and MuERV.L are provided for comparative purposes. (a)
Gag alignment spanning nucleotide positions 1660 to 1713 within the
GaLV sequence (13); (b) Gag alignment spanning positions
2074 to 2157; (c) Pro alignment spanning positions 2266 to 2490; (d)
Int alignment spanning positions 4969 to 5166; (e) Env alignment
spanning positions 7220 to 7504.
|
|
The observed relationship of the HERV.R (type b) family to other
retroviruses differed somewhat depending on the method used
for
phylogenetic reconstruction. In neighbor-joining trees, the
family
clustered with the HERV.FRD and HERV.Z69907 families (see
below) as
well as two partially characterized endogenous retroviruses
derived
from marsupials and monotremes (Fig.
1a). In contrast,
maximum-parsimony analyses placed the family basal to a large
clade of
viruses that included 11 other HERV families (Fig.
1b).
This
discrepancy is probably due to the poor resolution of this
region of
the retroviral tree; the backbone of the phylogeny from
which many of
these sequences emerge is not at all well supported,
as discussed above
(
16).
(iii) HERV.F.
Cosmid Z94277 contains an HERV sequence with a
PBS exhibiting 16 of 18 matches with the human phenylalanine tRNA, and
thus this family has been termed HERV.F (Fig. 3a). This HERV, located at position p11.3-11.4 on the X chromosome (Table 3), was the only
member of the family to be identified by BLAST searches and phylogenetic reconstruction, which indicated that these elements are
likely to be present at low copy numbers within the human genome. Both
neighbor-joining and maximum-parsimony analyses suggested a close
relationship with the HERV.H family (Fig. 1), which, in contrast with
HERV.F, is present at a very high copy number, with at least 600 pol-containing members (30). The HERV.F genome is
approximately 8.7 kb in length and encodes motifs indicative of the
presence of gag, pol, and env (Fig. 2
and 4); there was no evidence of other gene products. Although all
three genes appeared similar in size to those of other retroviruses,
suggesting that there have been no large deletions since integration of
the provirus, only small open reading frames ORFs were present within
them (Table 4). The two LTRs (a 470-bp 5' LTR and a 515-bp 3' LTR with
a 45-bp duplication adjacent to the PPT) were found exhibit 13.0% divergence (14.6% corrected), suggesting that the element first integrated into the primate lineage 35 to 56 million years ago.
(iv) HERV.F (type b).
A second HERV.F family was also apparent
from my phylogenetic analyses (17 of 18 matches with human
phenylalanine tRNA [Fig. 3a]). This family, termed HERV.F (type b),
has also been independently identified by Lindeskog (26).
The two families are closely related, both to each other and to the
previously characterized family HERV.XA (60), which is also
primed by phenylalanine (see below). Despite the similarity of their
PBSs and their close phylogenetic relationship, I think that they
should be regarded as separate families for two reasons. First,
elements from the three families were not placed into a single,
well-supported monophyletic clade in my analyses (and were paraphyletic
with respect to the HERV.H family), as is the case for many other
families, such as HERV.E, HERV.H, and HERV.I (Fig. 1). Second, although
the RT-based amino acid alignment demonstrated a high level of
similarity, this was not the case when other regions of the viral
genomes were compared. For example, there was no obvious nucleotide
sequence similarity between the LTRs of the different families, and
only low levels were observed in comparisons of gag and
env (unpublished results).
Only two members of the HERV.F (type b) family were identified during
this study, indicating a copy number of approximately
30. Both elements
are situated on the X chromosome. The prototypic
member (in cosmid
AC002416) is 6.8 kb in length and contains
a
pol gene with a
deletion 5' to the Int motif shown in Fig.
4d)
and an
env
gene with a large deletion upstream of the transmembrane
region (Table
4 and unpublished results). Although the
gag gene
in
AC002416 is full length and contains an MHR (Fig.
4a), some
of the
conserved residues in the Cys-His motif have probably been
altered by
postinsertion mutation, and thus the equivalent region
from the second
member of the family (in cosmid
Z95126) is
also shown in Fig.
4b. The
5' and 3' LTRs (387 and 388 bp, respectively)
have diverged by 7.5%
(7.9% corrected), with an estimated integration
date for the element
of 19 to 30 million years
ago.
(v) HERV.Z69907.
The prototypic member of the HERV.Z69907
family is located at q11.2 on chromosome 22. The only other member
recovered by BLAST searches is present in cosmid AC004474, and this
suggests a copy number of approximately 30. Both elements were highly
defective; neither appeared to possess a 5' LTR, and in only one
(Z69907) was there a suggestion of a PPT and, thus, a 3' LTR. The
putative LTR did not appear to contain obvious promoter or
polyadenylation signals. The Z69907 element encodes motifs that suggest
the presence of the gag, pol, and env
genes (Fig. 2 and 4 and Table 4). However, it contains a deletion in
Pro (the alternative element in this family is therefore shown in the
alignment in Fig. 4c), and the distance between the Int and
transmembrane motifs was shorter than that observed in other viruses;
thus, env is also likely to contain a large deletion
(unpublished data). Furthermore, no obvious MHR could be detected in
either virus. Like HERV.R (type b), the phylogenetic position of this
family was dependent on the method used in tree reconstruction, being
placed as a sister group to a marsupial-derived element in
neighbor-joining analysis and toward the base of a large, poorly
supported clade (which includes numerous HERV families) with maximum parsimony.
(vi) HERV.HS49C23.
The members of the HERV.HS49C23 family are
unusual in that they tend to cluster with viruses derived from
nonmammalian vertebrates, in contrast to many of the other HERV
families shown in Fig. 1. The prototypic member is located on
chromosome X, and its phylogenetic relationship has been briefly
described in a previous report (16). It now appears that
members of this family are present at approximately 70 copies in the
human genome and that they are probably all highly defective. No LTR
sequences could be identified for any member of the group, and BLAST
searches suggested that only three (those on cosmids HS49C23, AC002319,
and Z78021) contained anything more than small fragments with
similarity to other retroviruses; the first two elements could be
aligned across a 6-kb region of their genomes (unpublished results).
Despite the presence of multiple in-frame stop codons and frameshifts,
it was possible to derive some information on these elements because
HS49C23 itself appears to contain a complete pol gene (i.e.,
there are no large deletions evident) and the Cys-His motif in Gag.
There was no evidence of a transmembrane motif in env, but
one was present in Z78021 (Fig. 4e). The lack of LTRs precluded
attempts at estimating the integration dates of members of the family,
but their highly defective nature suggests that they may be among the
oldest of the HERVs.
Additional characterization of previously identified families. (i)
HERV.HML5.
HERV.HML5 was first identified from a 250-bp
PCR-amplified pol gene fragment by Medstrand and Blomberg
(35). Like the other HERV.HML family constituents, it a
member of the class II HERV superfamily, whose previously characterized
members have been found to be primed by lysine tRNA. The HERV.HML5
pol gene fragment appears closely related to elements within
three cosmid clones (Table 3), and one of them, in AC004536 (with which
there was 88% nucleotide identity across a 250-bp region), was used to
investigate the genomic organization of the family. The element in
cosmid AC004536, which was found to be 7.8 kb in length, inserted into
a 6-bp target site. The two 5' and 3' LTRs were 482 and 488 bp,
respectively, and contained the expected inverted repeats and control
sequences (Fig. 3b). The estimated integration date was 33 to 53 million years ago (based on a corrected LTR divergence figure of
13.7%). Unlike all other previously identified class II HERVs, members
of the HERV.HML5 family are primed by an isoleucine tRNA rather than by
tRNALys. The element within AC004536 showed 14 of 18 matches to the human Ile tRNA, and a second member of the family
(within cosmid Z95437) showed 17 of 18 (Fig. 3b). The structure of the
HERV.HML5 family is similar to those of the other well-characterized
viruses to which it is most closely related, such as HERV.K10, MMTV,
and jaagsiekte retrovirus (38, 42, 63). The Pro, RT, and Int motifs encoded within the pol gene and the MHR and Cys-His
motifs encoded by gag were all apparent, and as with
HERV.K10 and MMTV, these elements also appear to contain a dUTPase
between the protease and RT genes. An alignment of the HERV.HML5
dUTPase and a human dUTPase (34) sequence is shown in Fig.
5. Although there was no similarity to
the transmembrane motif within the envelope (Env) protein shown in Fig.
4e, a BLAST-derived match was obtained with the Env proteins of MMTV,
HERV.K10, and jaagsiekte retrovirus (unpublished results).
(ii) HERV.ADP.
HERV.ADP was originally described by Lyn et al.
(28). This element is defective, consisting of a solitary
pol gene of approximately 1.5 kb in length. Southern
hybridization analysis indicated that the members of this family are
present at high copy numbers, and sequence analysis suggested that they
are most closely related to HERV.I (28). Analyses presented
here show that members of this family are likely to have copy numbers
of around 60 and confirm their close relationship with HERV.I. The
neighbor-joining tree in Fig. 1a placed HERV.I and HERV.ADP in a
monophyletic group, although this was not the case for the
maximum-parsimony tree, which instead placed the HERV.ADP family next
to viruses derived from several different vertebrate classes. Analysis
of the PBS sequences of HGMP cosmid clones closely related to HERV.ADP
(AC005741 shows 83% nucleotide identity to HERV.ADP across 800 bp of
pol, for example) indicated that these elements are probably
not primed by isoleucine tRNA (the tRNA primer for the HERV.I family);
instead, the element within AC005741 showed 14 of 18 matches to the human threonine tRNA, as shown in Fig. 3b. It should be noted, however,
that the same PBS sequence also showed 13 of 18 matches to the murine
leukemia virus proline tRNA PBS, and thus the binding affinity of this
family cannot yet be considered definitive (unpublished results).
For the above-stated reasons, it is likely that HERV.ADP and HERV.I
represent separate families. Investigation of the
AC005741 HERV
sequence indicated that it has a length of 8.4 kb and that
full-length
(but defective)
gag,
pol, and
env
genes are present
(Fig.
2 and
4 and Table
4). The estimated integration
date of
this element, based on a corrected LTR divergence figure of
12.4%,
was 30 to 48 million years ago. This figure is consistent with
the reported insertion of the ADP-ribosyltransferase pseudogene
into
HERV.ADP, which is thought to have occurred at least 27 million
years
ago (
28).
(iii) HERV.FRD.
HERV.FRD was isolated by reverse
transcription-PCR from retrovirus-like particles released from the
human breast cancer cell line T47-D (49). Characterization
of a 2.8-kb region of the pol gene indicated that this
element was most closely related to class I HERVs such as ERV-9
(49). I identified a full-length member of this family
within cosmid AC004022 at q21-22 on chromosome 7 (Table 3). The two
elements exhibited 99% nucleotide identity across the 2.8 kb of
pol and clustered with robust bootstrap support in my
analyses. Further investigation of AC004022 revealed that the element
is longer, at 10.8 kb, than other endogenous retroviruses (Table 4).
The LTRs measured 715 bp (5') and 703 bp (3') in length and differed by
over 19% (Fig. 3b). The corrected divergence of 22.6% suggests an
integration date of 53 to 87 million years ago, implying that this HERV
family may be one of the oldest and that it is probably present in
most, if not all, extant primate species. The PBS sequences
indicate that these viruses, like the HERV.H family members
(30), are probably primed by tRNAHis;
there were 14 matches of 18 to the human histidine tRNA (Fig. 3b).
The
AC004022 provirus has remained reasonably intact and contains all
of the amino acid motifs associated with
gag,
pol,
and
env (Fig.
2 and
4), although only short
ORFS were apparent
in all three genes. The size of each appeared to be
roughly equivalent
to those of other retroviruses, and the unexpectedly
large size
of the
AC004022 element was due to the presence of a 2-kb
region
inserted between the 5' LTR and the
gag gene (the 5'
end of
gag in
AC004022 was identified by comparison with the
GaLV
gag gene [unpublished results]). There are two
possible explanations
for the presence of this region: (i) it
represents a later insertion
into a preexisting HERV sequence and was
not part of the original
provirus, or (ii) members of this family
encode an additional
gene upstream of
gag. If the second
scenario were the case, then
the FRD family would be unique among HERVs
in encoding a gene
in addition to
gag, dUTPase,
pol, and
env. It is difficult to
estimate the
probabilities of the alternative scenarios since
only one member of the
family has been characterized in this region.
Furthermore, BLAST
searches failed to reveal any obvious similarities
to repetitive
sequences (which would support scenario i) or to
other proteins (which
would support scenario ii) (unpublished
results). Determination of the
presence or absence of this putative
gene in other members of the
HERV.FRD family will therefore ultimately
resolve this
question.
The relationship of the HERV.FRD family to other viruses (in common
with several of the other HERV families described in this
report) was
difficult to resolve due to inconsistencies between
trees constructed
by different methods, with the HERV.FRD members
being placed in a
location similar to that of the HERV.R (type
b) and HERV.Z69907
families (Fig.
1).
(iv) HERV.XA.
Members of the HERV.XA family are present at low
copy numbers (16 per haploid genome) in humans and are related to
HERV.H. Similar viruses have been identified in the Old and New World monkeys, indicating that this family is at least 40 to 45 million years
old (60). To date, five members of this family have been partially characterized, with the most complete characterization (HERV.XA38) extending from the central region of the pol
gene to the end of a somewhat truncated env gene
(60). Part of the HERV.XA pol gene was found to
be 88% identical to a 500-bp portion of cosmid AC000378 (Table 2), and
further investigation suggested that this cosmid contains a 6.2-kb
element with a full-length gag gene and a large deletion
spanning the 3' end of the pol gene and the 5' region of
env (Table 4). Unlike previously characterized members of
this family, AC000378 contained intact LTRs (other members have one or
more Alu repeats at their 3' ends [60]). The 5' and 3'
LTRs were 433 and 438 bp in length and exhibited 9.2% divergence
(10.0% corrected), implying that this element first integrated into
the primate lineage 24 to 38 million years ago. Like members of the two
HERV.F families, AC000378 is primed by a phenylalanine tRNA, with 17 of
18 matches conserved between its PBS and the 3' end of the human
tRNAPhe.
The identification of only one member of this family by BLAST search
and the phylogenetic position of this member are in accord
with the
previously observed low copy number and relatively close
relationship
to HERV.H. The lack of PBS homology with HERV.H and
the low levels of
sequence similarity to HERV.F and HERV.F (type
b) (to which the element
within
AC000378 is most closely related)
outside of RT indicate that
HERV.XA probably constitutes a separate
family.
 |
DISCUSSION |
This report describes one of the first attempts to systematically
identify and characterize endogenous retroviruses identified by the
HGMP and to examine their relationship to other vertebrate retroviruses. Recently, Lindeskog (26) used a somewhat
different approach, based largely on pol gene sequence
similarity, to classify the HERVs into 13 groups, several of which
contained more than one family. My analyses, which were based on
phylogenetic criteria rather than sequence divergence, led to the
identification of 22 independent HERV families. Several of the families
described here have not been identified previously, but there are also
other differences between our analyses. For example, Lindeskog (as in this report) identified two families of HERV.I-related elements. However, in his case, they were HERV.I itself and HERV.FTD, whereas my
analyses suggest that HERV.FTD and HERV.I are very similar and that
both are closely related to the HERV.ADP family (which was not included
in the other data set [26]). However, in contrast, Lindeskog identified five ERV-9/HERV.W/HERV.P-related families, whereas
my analyses indicated that there are only three.
Of the 22 families shown in Fig. 1, 6 had not been characterized
previously (although HERV.F [type b] has been independently identified [26]), and the affinities of a further 6 families which had been previously described were based on closest
matches to HGMP-derived cosmid sequences. However, it is likely that
even the figure of 22 is a conservative one, for two reasons. First, some families are probably harbored at sufficiently low copy number that they are not present in the 7% of the human genome investigated here. Indeed, available sequence data for several families is currently
limited to a single element. Second, it is likely that a number of the
families which appear monophyletic in this study are (due to relatively
poor sampling of nonhuman viruses in some regions of the retroviral
phylogeny) actually polyphyletic, and these will eventually be split.
For example, the phylogenies shown in Fig. 1 suggest that there are
three families of HERV.K-like viruses within the human genome, namely
HERV.K itself, HERV.HML5, and HERV.HML6. However, previous reports
have suggested (on the basis of sequence divergence) the presence of up
to 10 HERV.K-like families (3, 15, 35), and it is possible
that the isolation of additional vertebrate retroviral sequences will
cause some of the class II HERVs to be broken up into additional
families. The relatively low support for the monophyly of many of the
HERVs in this region of the tree supports this notion; individual HERV families would typically be expected to cluster with robust bootstrap support. For the same reasons, it is possible that the HERV.HS49C23- and HERV.L-related elements are actually derived from more than one
family. The monophyly of two HERV.HS49C23 family lineages have less
than 66% bootstrap support in the two trees, and HERV.L could not be
phylogenetically distinguished from the rodent virus MuERV.L
(6). In contrast, several other previously reported elements
are unlikely to represent independent families since they appear to be
very closely related to other HERVs. Thus, Np2 and HS-5 are probably
members of the HERV.E family, ERV.1 is probably a member of the HERV.R
family, and HERV.FTD probably belongs to the HERV.I family. HRES-1 did
not have any obvious similarity to any retroviral sequence, and its
inclusion with the other HERVs must therefore be considered doubtful.
This report underscores the problems presently associated with HERV
nomenclature. The most widely used system classifies families according
to the tRNA used to prime DNA synthesis (10, 61). However,
this information has often not been available, and thus other HERV
families have been named according to a variety of criteria, such as a
nearby gene (e.g., HERV.ADP [28]), a clone number
(e.g., HERV.S71 [59]), or even an amino acid motif
present within the sequence (e.g., HERV.FRD [49]).
Furthermore, the term HERV family is itself problematic since the
Retroviridae as a whole has also been given family
designation (39). The next hierarchical level places related
families into classes (originally two but recently three), with class I
elements being related to the mammalian type C retroviruses (such as
FeLV and GaLV), class II to being related to the mammalian type B and D
retroviruses (MMTV and SRV-1) and avian leukosis viruses (RSV), and
class III being related to the spumaviruses (HSV) (3, 12, 26,
61).
The greatest drawback with the former system is that it is based on a
single character (the tRNA complementary to the viral PBS) which does
not correspond closely to the viral phylogeny, the most obvious example
being the three HERV families primed by tRNAIle. All three
are phylogenetically more closely related to other HERV families which
use alternative tRNA primers, and, furthermore, HERV.HML5 clusters with
the class II HERVs whereas both of the others are class I. The class
level designations work better because they are based on viral
relatedness. According to the phylogenies presented in Fig. 1, there
are currently 17 class I, 3 class II, and 2 class III families (the 2 class III families being HERV.L [12] and HERV.S).
However, it should be noted that some HERV families within the same
class are only very distantly related to each other. For example,
HERV.R and HERV.HS49C23 exhibit only 33% amino acid identity across
the most highly conserved region of RT, as shown in Fig. 2 (unpublished
data). Furthermore, in some retroviral phylogenies (such as that shown
in Fig. 1a), the spumaviruses appear more closely related to the class
I HERVs than to the class III HERVs. Despite these problems, it is my opinion that the current systems should remain in place, at least until
the human genome sequence has been completed and the full complement of
HERV families has been determined.
All of the families identified in this study encode in-frame stop
codons or frameshift mutations, and several also contain large
deletions. This is consistent with the characterization of previously
identified HERV families, virtually all of which are highly defective
(10, 61). The most-intact element identified so far is
HERV.K10, which has a single stop codon within both gag and
env and possibly a small deletion, also within
env (42). Another unusual feature of HERV.K10 is
the low level of divergence between its LTRs (of about 0.2%),
suggesting that it probably integrated into the primate lineage in the
recent past (42). This is in contrast to other HERV
families, many of which are thought to have been passaged vertically
since the divergence of the common ancestor of humans and Old World
monkeys some 25 to 30 million years ago (2, 31, 44, 47, 50).
LTR divergence data suggest that many of the families described here
have also been present since the human and Old World monkey lineages
diverged and that one family, HERV.FRD, may be exceptionally old:
the two integration dates estimated for this element were 53 and 87 million years. Although these figures should be treated with some
caution, since they assume a molecular clock within the primate lineage (4, 25), they still suggest that the HERV.FRD family may be
of similar age to the HERV.H family (which is thought to have entered
the primate lineage more than 50 million years ago [2, 31,
47]). Another potentially ancient family is HERV.HS49C23. No LTRs could be identified in any member of the family, but none contain ORFs of more than 200 amino acids (unpublished data), and they
have an unusual phylogenetic position, being only distantly related to
other mammalian viruses.
HERV genomic organization as described here is also consistent with
reports of previously identified endogenous elements in that
gag, pol, and env but are generally
encoded there are few, if any, other gene products. There were two
exceptions to this: the HERV.HML5 family (like with the other class II
HERVs) encodes a dUTPase between the Pro and RT genes, and the HERV.FRD
family may encode an additional (~2-kb) gene product between its 5'
LTR and gag gene.
Finally, the HGMP sequence library provides an excellent opportunity to
study the long-term evolutionary biology and retrotransposition dynamics of endogenous retroviral families, and we are currently using
these data to track the evolution of these elements within the primate lineage.
 |
ACKNOWLEDGMENTS |
This work was supported by the Royal Society.
I thank J. Martin for the unpublished caecilian (I. kohtaoensisIII and -IV) and RV Common possumII sequences and
P. Kabat for the unpublished RV EchidnaII, RV Jackdaw, and RV Thrush
sequences. Thanks also to J. Taylor and D. Quicke for permission to use
their tree searching strategy prior to publication and to J. Martin, C. Gosling, and R. Gifford for useful discussions.
 |
ADDENDUM IN PROOF |
The classification of endogenous human retroviruses is currently
being reviewed by the ICTV Retroviridae committee. Comments or suggestions regarding HERV nomenclature can be sent via e.mail to
Roswitha Löwer at: loero{at}pei.de (Paul Ehrlich Institut, Langen, Germany).
 |
FOOTNOTES |
*
Mailing address: Department of Biology, Imperial
College, Silwood Park, Ascot, Berkshire SL5 7PY, United Kingdom. Phone:
(1344) 294 373. Fax: (1344) 294 339. E-mail:
m.tristem{at}ic.ac.uk.
 |
REFERENCES |
| 1.
|
Altschul, S. F.,
T. L. Madden,
A. A. Schaffer,
J. Zhang,
Z. Zhang,
W. Miller, and D. L. Lipman.
1997.
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
Nucleic Acids Res.
25:3389-3402[Abstract/Free Full Text].
|
| 2.
|
Anderssen, S.,
E. Sjottem,
G. Svineng, and T. Johansen.
1997.
Comparative analyses of LTRs of the ERV-H family of primate-specific retrovirus-like elements isolated from marmoset, African green monkey, and man.
Virology
234:14-30[CrossRef][Medline].
|
| 3.
|
Andersson, M. L.,
M. Lindeskog,
P. Medstrand,
B. Westley,
F. May, and J. Blomberg.
1999.
Diversity of human endogenous retrovirus class II-like sequences.
J. Gen. Virol.
80:255-260[Abstract].
|
| 4.
|
Bailey, W. J.,
D. H. A. Fitch,
D. A. Tagle,
J. Czelusniak,
J. L. Slightom, and M. Goodman.
1991.
Molecular evolution of the  -globin gene locus: gibbon phylogeny and the hominid slowdown.
Mol. Bol. Evol.
8:155-184[Abstract].
|
| 5.
|
Beck, S., and P. Sterk.
1998.
Genome scale DNA sequencing: where are we?
Curr. Opin. Biotechnol.
9:116-120[CrossRef][Medline].
|
| 6.
|
Bénit, L.,
N. De Parseval,
J.-F. Casella,
I. Callebaut,
A. Cordonnier, and T. Heidmann.
1997.
Cloning of a new murine endogenous retrovirus, MuERV-L, with strong similarity to the human HERV-L element and with a gag coding sequence closely related to the Fv1 restriction gene.
J. Virol.
71:5652-5657[Abstract].
|
| 7.
|
Bénit, L.,
J.-B. Lallemand,
J.-F. Casella,
H. Philippe, and T. Heidmann.
1999.
ERV-L elements: a family of endogenous retrovirus-like elements active through the evolution of mammals.
J. Virol.
73:3301-3308[Abstract/Free Full Text].
|
| 8.
|
Benveniste, R., and G. Todaro.
1975.
Evolution of type C viral genes: preservation of ancestral murine type C viral sequences in pig cellular DNA.
Proc. Natl. Acad. Sci. USA
72:4090-4094[Abstract/Free Full Text].
|
| 9.
|
Blond, J.-L.,
F. Besème,
L. Duret,
O. Bouton,
F. Bedin,
H. Perron,
B. Mandrand, and F. Mallet.
1999.
Molecular characterization and placental expression of HERW-W, a new human endogenous retrovirus family.
J. Virol.
73:1175-1185[Abstract/Free Full Text].
|
| 10.
|
Boeke, J. D., and J. P. Stoye.
1997.
Retrotransposons, endogenous retroviruses, and the evolution of retroelements, p. 343-435.
In
J. M. Coffin, S. H. Hughes, and H. E. Varmus (ed.), Retroviruses. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
|
| 11.
|
Bonner, T. I.,
C. O'Connell, and M. Cohen.
1982.
Cloned endogenous retroviral sequences from human DNA.
Proc. Natl. Acad. Sci. USA
79:4709-4713[Abstract/Free Full Text].
|
| 12.
|
Cordonnier, A.,
J.-F. Casella, and T. Heidmann.
1995.
Isolation of novel human endogenous retroviral-like elements with foamy virus-related pol sequence.
J. Virol.
69:5890-5897[Abstract].
|
| 13.
|
Delassus, S.,
P. Sonigo, and S. Wain-Hobson.
1989.
Genetic organisation of gibbon ape leukaemia virus.
Virology
173:205-213[CrossRef][Medline].
|
| 14.
|
Felsenstein, J.
1993.
PHYLIP (phylogeny inference package), version 3.5c.
Department of Genetics, University of Washington, Seattle.
|
| 15.
|
Franklin, G. C.,
S. Chretien,
I. M. Hanson,
H. Rochefort,
F. E. B. May, and B. R. Westley.
1988.
Expression of human sequences related to those of mouse mammary tumor virus.
J. Virol.
62:1203-1210[Abstract/Free Full Text].
|
| 16.
|
Herniou, E.,
J. Martin,
K. Miller,
J. Cook,
M. Wilkinson, and M. Tristem.
1998.
Retroviral diversity and distribution in vertebrates.
J. Virol.
72:5955-5966[Abstract/Free Full Text].
|
| 17.
|
Hirose, Y.,
M. Takamatsu, and F. Harada.
1993.
Presence of env genes in members of the RTLVH family of human endogenous retrovirus-like elements.
Virology
192:52-61[CrossRef][Medline].
|
| 18.
|
Holzschu, D. L.,
D. Martineau,
S. K. Fodor,
V. M. Vogt,
P. R. Bowser, and J. W. Casey.
1995.
Nucleotide sequence and protein analysis of a complex piscine retrovirus, walleye dermal sarcoma virus.
J. Virol.
69:5320-5331[Abstract].
|
| 19.
|
Kabat, P.,
M. Tristem,
R. Opavsky, and J. Pastorek.
1996.
Human endogenous retrovirus HC2 is a new member of the S71 retroviral subgroup with a full-length pol gene.
Virology
226:83-94[CrossRef][Medline].
|
| 20.
|
Kannan, P.,
R. Buettner,
D. Pratt, and M. Tainsky.
1991.
Identification of a retinoic acid-inducible endogenous retroviral transcript in the human teratocarcinoma-derived cell line PA-1.
J. Virol.
65:6343-6348[Abstract/Free Full Text].
|
| 21.
|
Kimura, M.
1980.
A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.
J. Mol. Evol.
16:116-120.
|
| 22.
|
Kröger, B., and I. Horak.
1987.
Isolation of novel human retrovirus-related sequences by hybridization to synthetic oligonucleotides complementary to the tRNAPro primer-binding site.
J. Virol.
61:2071-2075[Abstract/Free Full Text].
|
| 23.
|
La Mantia, G.,
D. Maglione,
G. Pengue,
A. Di Cristofano,
A. Simeone,
L. Lanfrancone, and L. Lania.
1991.
Identification and characterization of novel human endogenous retroviral sequences preferentially expressed in undifferentiated embryonal carcinoma cells.
Nucleic Acids Res.
19:1513-1520[Abstract/Free Full Text].
|
| 24.
|
Levy, L.,
P. Lobelle-Rich,
J. Elder,
S. Payne, and R. Montelaro.
1990.
An unusual retrovirus-like sequence identified in human DNA.
J. Gen. Virol.
71:1613-1618[Abstract/Free Full Text].
|
| 25.
|
Li, W.-H., and M. Tanimura.
1987.
The molecular clock runs more slowly in man than in apes and monkeys.
Nature
336:93-96.
|
| 26.
|
Lindeskog, M.
1999.
Transcription, splicing and genetic structure within the human endogenous retroviral HERV.H family. Ph.D. thesis.
Department of Infectious Diseases and Medical Microbiology, Lund University, Lund, Sweden.
|
| 27.
|
Lower, R.,
J. Lower, and R. Kurth.
1996.
The viruses in all of us: characteristics and biological significance of human endogenous retrovirus sequences.
Proc. Natl. Acad. Sci. USA
93:5177-5184[Abstract/Free Full Text].
|
| 28.
|
Lyn, D.,
L. L. Deaven,
N. Istock, and M. Smulson.
1993.
The polymorphic ADP-ribosyltransferase (NAD+) pseudogene 1 in humans interrupts an endogenous pol-like element on 13q34.
Genomics
18:206-211[CrossRef][Medline].
|
| 29.
|
Maeda, N., and H. S. Kim.
1990.
Three independent insertions of retrovirus-like sequences in the haptoglobin gene cluster of primates.
Genomics
8:671-683[CrossRef][Medline].
|
| 30.
|
Mager, D. L., and P. S. Henthorn.
1984.
Identification of a retrovirus-like repetitive element in human DNA.
Proc. Natl. Acad. Sci. USA
81:7510-7514[Abstract/Free Full Text].
|
| 31.
|
Mager, D. L., and J. D. Freeman.
1995.
HERV.H endogenous retroviruses presence in the New-World branch but amplification in the Old-World primate lineage.
Virology
213:395-404[CrossRef][Medline].
|
| 32.
|
Martin, J.,
E. Herniou,
J. Cook,
R. Waugh O'Neill, and M. Tristem.
1999.
Interclass transmission and phyletic host tracking in the murine leukemia-related retroviruses.
J. Virol.
73:2442-2449[Abstract/Free Full Text].
|
| 33.
|
Martin, M.,
T. Bryan,
S. Rasheed, and A. Khan.
1981.
Identification and cloning of endogenous retroviral sequences present in human DNA.
Proc. Natl. Acad. Sci. USA
78:4892-4896[Abstract/Free Full Text].
|
| 34.
|
McIntosh, E. M.,
D. D. Ager,
M. H. Gadsden, and R. H. Haynes.
1992.
Human dUTP pyrophosphatase: cDNA sequence and potential biological importance of the enzyme.
Proc. Natl. Acad. Sci. USA
89:8020-8024[Abstract/Free Full Text].
|
| 35.
|
Medstrand, P., and J. Blomberg.
1993.
Characterization of novel reverse transcriptase encoding human endogenous retroviral sequences similar to type A and type B retroviruses: differential transcription in normal human tissues.
J. Virol.
67:6778-6787[Abstract/Free Full Text].
|
| 36.
|
Medstrand, P.,
D. L. Mager,
H. Yin,
U. Dietrich, and J. Blomberg.
1997.
Structure and genomic organisation of a novel human endogenous retrovirus family: HERV.K (HML-6).
J. Gen. Virol.
78:1731-1744[Abstract].
|
| 37.
|
Miyamoto, M. M., and M. Goodman.
1990.
DNA systematics and evolution of primates.
Annu. Rev. Ecol. Syst.
21:197-220[CrossRef].
|
| 38.
|
Moore, R.,
M. Dixon,
R. Smith,
G. Peters, and C. Dickson.
1987.
Complete nucleotide sequence of a milk-transmitted mouse mammary tumor virus: two frameshift suppression events are required for translation of gag and pol.
J. Virol.
61:480-490[Abstract/Free Full Text].
|
| 39.
|
Murphy, F. A.,
C. M. Fauquet,
D. H. L. Bishop,
S. A. Ghabrial,
A. W. Jarvis,
G. P. Martelli,
M. A. Mayo, and M. D. Summers.
1995.
Virus taxonomy: the classification and nomenclature of viruses. The Sixth Report of the International Committee on Taxonomy of Viruses.
Springer-Verlag, Vienna, Austria.
|
| 40.
|
O'Connell, C.,
S. O'Brien,
W. Nash, and N. Cohen.
1984.
ERV-3, a full-length human endogenous provirus: chromosomal localization and evolutionary relationships.
Virology
138:225-235[CrossRef][Medline].
|
| 41.
|
O'Connell, C., and M. Cohen.
1984.
The long terminal repeats of a novel human endogenous retrovirus.
Science
226:1204-1206[Abstract/Free Full Text].
|
| 42.
|
Ono, M.,
T. Yasunaga,
T. Miyata, and H. Ushikubo.
1986.
Nucleotide sequence of human endogenous retrovirus genome related to the mouse mammary tumor virus genome.
J. Virol.
60:589-598[Abstract/Free Full Text].
|
| 43.
|
Patience, C.,
Y. Takeuchi, and R. A. Weiss.
1997.
Infection of human cells by an endogenous retrovirus of pigs.
Nat. Med.
3:282-286[CrossRef][Medline].
|
| 44.
|
Patience, C.,
D. A. Wilkinson, and R. A. Weiss.
1997.
Our retroviral heritage.
Trends Genet.
13:116-120[CrossRef][Medline].
|
| 45.
|
Perl, A.,
J. Rosenblatt,
I. Chen,
J. Di Vincenzo,
R. Bever,
J. Poiesz, and G. Abraham.
1989.
Detection and cloning of new HTLV-related endogenous sequences in man.
Nucleic Acids Res.
17:6841-6854[Abstract/Free Full Text].
|
| 46.
|
Perron, H.,
J. A. Garson,
F. Bedin,
F. Beseme,
G. Paranhos-Baccala,
F. Komurian-Pradel,
F. Mallet,
P. W. Tuke,
C. Voisset,
J. L. Blond,
B. Lalande,
J. M. Seigneurin,
B. Mandrand, and the Collaborative Research Group on Multiple Sclerosis.
1997.
Molecular identification of a novel retrovirus repeatedly isolated from patients with multiple sclerosis.
Proc. Natl. Acad. Sci. USA
94:7583-7588[Abstract/Free Full Text].
|
| 47.
|
Purvis, A.
1995.
A composite estimate of primate phylogeny.
Philos. Trans. R. Soc. Lond. Ser. B
348:405-421[Medline].
|
| 48.
|
Repaske, R.,
P. E. Steele,
R. R. O'Neill,
A. B. Rabson, and M. A. Martin.
1985.
Nucleotide sequence of a full-length endogenous retroviral segment.
J. Virol.
54:764-772[Abstract/Free Full Text].
|
| 49.
|
Seifarth, W.,
H. Skladny,
F. Krieg-Schneider,
A. Reichert,
R. Hehlmann, and C. Leib-Mosch.
1995.
Retrovirus-like particles released from the human breast cancer cell line T47-D display type B- and C-related endogenous retroviral sequences.
J. Virol.
69:6408-6416[Abstract].
|
| 50.
|
Shih, A.,
E. E. Coutavas, and M. G. Rush.
1991.
Evolutionary implications of primate endogenous retroviruses.
Virology
182:495-502[CrossRef][Medline].
|
| 51.
|
Silver, J.,
A. Rabson,
T. Bryan,
R. Willey, and M. Martin.
1987.
Human retroviral sequences on the Y chromosome.
Mol. Cell. Biol.
7:1559-1562[Abstract/Free Full Text].
|
| 52.
|
Sprinzl, M.,
C. Horn,
M. Brown,
A. Ioudovitch, and S. Steinberg.
1998.
Compilation of tRNA sequences and sequences of tRNA genes.
Nucleic Acids Res.
26:148-153[Abstract/Free Full Text].
|
| 53.
|
Sverdlov, E. D.
1998.
Perpetually mobile footprints of ancient infections in human genome.
FEBS Lett.
428:1-6[CrossRef][Medline].
|
| 54.
|
Tatusova, T. A., and T. L. Madden.
1999.
Blast 2 sequences a new tool for comparing protein and nucleotide sequences.
FEMS Microbiol. Lett.
174:247-250[CrossRef][Medline].
|
| 55.
|
Telesnitsky, A., and S. P. Goff.
1997.
Reverse transcription and the generation of viral DNA, p. 121-160.
In
J. M. Coffin, S. H. Hughes, and H. E. Varmus (ed.), Retroviruses. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
|
| 56.
|
Tristem, M.,
E. Herniou,
K. Summers, and J. Cook.
1996.
Three retroviral sequences in amphibians are distinct from those in mammals and birds.
J. Virol.
70:4864-4870[Abstract].
|
| 57.
|
Vogt, P. K.
1997.
Historical introduction to the general properties of retroviruses, p. 1-25.
In
J. M. Coffin, S. H. Hughes, and H. E. Varmus (ed.), Retroviruses. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
|
| 58.
|
Vogt, V. M.
1997.
Retroviral virions and genomes, p. 27-71.
In
J. M. Coffin, S. H. Hughes, and H. E. Varmus (ed.), Retroviruses. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
|
| 59.
|
Werner, T.,
R. Brack-Werner,
C. Leib-Mosch,
H. Backhaus,
V. Erfle, and R. Hehlmann.
1990.
S71 is a phylogenetically distinct human retroviral element with structural and sequence homology to simian sarcoma virus (SSV).
Virology
174:225-238[CrossRef][Medline].
|
| 60.
|
Widegren, B.,
C. Kjellman,
S. Aminoff,
L. G. Sahlford, and H.-O. Sjogren.
1996.
The structure and phylogeny of a new family of human endogenous retroviruses.
J. Gen. Virol.
77:1631-1641[Abstract/Free Full Text].
|
| 61.
|
Wilkinson, D. A.,
D. L. Mager, and J.-A. C. Leong.
1994.
Endogenous human retroviruses, p. 465-535.
In
J. A. Levy (ed.), The Retroviridae, vol. III. Plenum Press, New York, N.Y.
|
| 62.
|
Xiong, Y., and T. H. Eickbush.
1990.
Origin and evolution of retroelements based on their reverse transcriptase sequences.
EMBO J.
9:3353-3362[Medline].
|
| 63.
|
York, D. F.,
R. Vigne,
D. W. Verwoerd, and G. Querat.
1992.
Nucleotide sequence of jaagsiekte retrovirus, an exogenous and endogenous type D and B retrovirus of sheep and goats.
J. Virol.
66:4930-4939[Abstract/Free Full Text].
|
Journal of Virology, April 2000, p. 3715-3730, Vol. 74, No. 8
0022-538X/00/$04.00+0
Copyright © 2000, American Society for Microbiology. All rights reserved.
This article has been cited by other articles:
-
Kulski, J. K., Shigenari, A., Shiina, T., Ota, M., Hosomichi, K., James, I., Inoko, H.
(2008). Human Endogenous Retrovirus (HERVK9) Structural Polymorphism With Haplotypic HLA-A Allelic Associations. Genetics
180: 445-457
[Abstract]
[Full Text]
-
Voisset, C., Weiss, R. A., Griffiths, D. J.
(2008). Human RNA "Rumor" Viruses: the Search for Novel Human Retroviruses in Chronic Disease. Microbiol. Mol. Biol. Rev.
72: 157-196
[Abstract]
[Full Text]
-
Frank, O., Verbeke, C., Schwarz, N., Mayer, J., Fabarius, A., Hehlmann, R., Leib-Mosch, C., Seifarth, W.
(2008). Variable Transcriptional Activity of Endogenous Retroviruses in Human Breast Cancer. J. Virol.
82: 1808-1818
[Abstract]
[Full Text]
-
Belshaw, R., Watson, J., Katzourakis, A., Howe, A., Woolven-Allen, J., Burt, A., Tristem, M.
(2007). Rate of Recombinational Deletion among Human Endogenous Retroviruses. J. Virol.
81: 9437-9442
[Abstract]
[Full Text]
-
Peng, X., Pan, J., Gong, R., Liu, Y., Kang, S., Feng, H., Qiu, G., Guo, D., Tien, P., Xiao, G.
(2007). Functional Characterization of Syncytin-A, a Newly Murine Endogenous Virus Envelope Protein: IMPLICATION FOR ITS FUSION MECHANISM. J. Biol. Chem.
282: 381-389
[Abstract]
[Full Text]
-
Dewannieux, M., Harper, F., Richaud, A., Letzelter, C., Ribet, D., Pierron, G., Heidmann, T.
(2006). Identification of an infectious progenitor for the multiple-copy HERV-K human endogenous retroelements. Genome Res
16: 1548-1556
[Abstract]
[Full Text]
-
Sotgiu, S, Arru, G, Mameli, G, Serra, C, Pugliatti, M, Rosati, G, Dolei, A
(2006). Multiple sclerosis-associated retrovirus in early multiple sclerosis: a six-year follow-up of a Sardinian cohort. Mult Scler
12: 698-703
[Abstract]
-
Si, Z., Vandegraaff, N., O'hUigin, C., Song, B., Yuan, W., Xu, C., Perron, M., Li, X., Marasco, W. A., Engelman, A., Dean, M., Sodroski, J.
(2006). Evolution of a cytoplasmic tripartite motif (TRIM) protein in cows that restricts retroviral infection. Proc. Natl. Acad. Sci. USA
103: 7454-7459
[Abstract]
[Full Text]
-
Polavarapu, N., Bowen, N. J., McDonald, J. F.
(2006). Newly Identified Families of Human Endogenous Retroviruses. J. Virol.
80: 4640-4642
[Full Text]
-
Klymiuk, N., Muller, M., Brem, G., Aigner, B.
(2006). Phylogeny, recombination and expression of porcine endogenous retrovirus {gamma}2 nucleotide sequences.. J. Gen. Virol.
87: 977-986
[Abstract]
[Full Text]
-
Pichon, J.-P., Bonnaud, B., Cleuziat, P., Mallet, F.
(2006). Multiplex degenerate PCR coupled with an oligo sorbent array for human endogenous retrovirus expression profiling. Nucleic Acids Res
34: e46-e46
[Abstract]
[Full Text]
-
Belshaw, R., Dawson, A. L. A., Woolven-Allen, J., Redding, J., Burt, A., Tristem, M.
(2005). Genomewide Screening Reveals High Levels of Insertional Polymorphism in the Human Endogenous Retrovirus Family HERV-K(HML2): Implications for Present-Day Activity. J. Virol.
79: 12507-12514
[Abstract]
[Full Text]
-
Lopez-Sanchez, P., Costas, J. C., Naveira, H. F.
(2005). Paleogenomic Record of the Extinction of Human Endogenous Retrovirus ERV9. J. Virol.
79: 6997-7004
[Abstract]
[Full Text]
-
Song, B., Gold, B., O'hUigin, C., Javanbakht, H., Li, X., Stremlau, M., Winkler, C., Dean, M., Sodroski, J.
(2005). The B30.2(SPRY) Domain of the Retroviral Restriction Factor TRIM5{alpha} Exhibits Lineage-Specific Length and Sequence Variation in Primates. J. Virol.
79: 6111-6121
[Abstract]
[Full Text]
-
Gifford, R., Kabat, P., Martin, J., Lynch, C., Tristem, M.
(2005). Evolution and Distribution of Class II-Related Endogenous Retroviruses. J. Virol.
79: 6478-6486
[Abstract]
[Full Text]
-
Song, B., Javanbakht, H., Perron, M., Park, D. H., Stremlau, M., Sodroski, J.
(2005). Retrovirus Restriction by TRIM5{alpha} Variants from Old World and New World Primates. J. Virol.
79: 3930-3937
[Abstract]
[Full Text]
-
Belshaw, R., Katzourakis, A., Paces, J., Burt, A., Tristem, M.
(2005). High Copy Number in Human Endogenous Retrovirus Families is Associated with Copying Mechanisms in Addition to Reinfection. Mol Biol Evol
22: 814-817
[Abstract]
[Full Text]
-
Seifarth, W., Frank, O., Zeilfelder, U., Spiess, B., Greenwood, A. D., Hehlmann, R., Leib-Mosch, C.
(2005). Comprehensive Analysis of Human Endogenous Retrovirus Transcriptional Activity in Human Tissues with a Retrovirus-Specific Microarray. J. Virol.
79: 341-352
[Abstract]
[Full Text]
-
Lavie, L., Medstrand, P., Schempp, W., Meese, E., Mayer, J.
(2004). Human Endogenous Retrovirus Family HERV-K(HML-5): Status, Evolution, and Reconstruction of an Ancient Betaretrovirus in the Human Genome. J. Virol.
78: 8788-8798
[Abstract]
[Full Text]
-
Perron, M. J., Stremlau, M., Song, B., Ulm, W., Mulligan, R. C., Sodroski, J.
(2004). TRIM5{alpha} mediates the postentry block to N-tropic murine leukemia viruses in human cells. Proc. Natl. Acad. Sci. USA
101: 11827-11832
[Abstract]
[Full Text]
-
Yi, J.-M., Kim, H.-M., Kim, H.-S.
(2004). Expression of the human endogenous retrovirus HERV-W family in various human tissues and cancer cells. J. Gen. Virol.
85: 1203-1210
[Abstract]
[Full Text]
-
Gorinsek, B., Gubensek, F., Kordis, D.
(2004). Evolutionary Genomics of Chromoviruses in Eukaryotes. Mol Biol Evol
21: 781-798
[Abstract]
[Full Text]
-
Belshaw, R., Pereira, V., Katzourakis, A., Talbot, G., Paces, J., Burt, A., Tristem, M.
(2004). Long-term reinfection of the human genome by endogenous retroviruses. Proc. Natl. Acad. Sci. USA
101: 4894-4899
[Abstract]
[Full Text]
-
Blaise, S., Ruggieri, A., Dewannieux, M., Cosset, F.-L., Heidmann, T.
(2004). Identification of an Envelope Protein from the FRD Family of Human Endogenous Retroviruses (HERV-FRD) Conferring Infectivity and Functional Conservation among Simians. J. Virol.
78: 1050-1054
[Abstract]
[Full Text]
-
Blaise, S., de Parseval, N., Benit, L., Heidmann, T.
(2003). Genomewide screening for fusogenic human endogenous retrovirus envelopes identifies syncytin 2, a gene conserved on primate evolution. Proc. Natl. Acad. Sci. USA
100: 13013-13018
[Abstract]
[Full Text]
-
de Parseval, N., Lazar, V., Casella, J.-F., Benit, L., Heidmann, T.
(2003). Survey of Human Genes of Retroviral Origin: Identification and Transcriptome of the Genes with Coding Capacity for Complete Envelope Proteins. J. Virol.
77: 10414-10422
[Abstract]
[Full Text]
-
Vitte, C., Panaud, O.
(2003). Formation of Solo-LTRs Through Unequal Homologous Recombination Counterbalances Amplifications of LTR Retrotransposons in Rice Oryza sativa L.. Mol Biol Evol
20: 528-540
[Abstract]
[Full Text]
-
Nelson, P N, Carnegie, P R, Martin, J, Davari Ejtehadi, H, Hooley, P, Roden, D, Rowland-Jones, S, Warren, P, Astley, J, Murray, P G
(2003). Demystified . . . Human endogenous retroviruses. Mol. Pathol.
56: 11-18
[Abstract]
[Full Text]
-
Landry, J.-R., Rouhi, A., Medstrand, P., Mager, D. L.
(2002). The Opitz Syndrome Gene Mid1 Is Transcribed from a Human Endogenous Retroviral Promoter. Mol Biol Evol
19: 1934-1942
[Abstract]
[Full Text]
-
Medstrand, P., van de Lagemaat, L. N., Mager, D. L.
(2002). Retroelement Distributions in the Human Genome: Variations Associated With Age and Proximity to Genes. Genome Res
12: 1483-1495
[Abstract]
[Full Text]
-
Stoye, J. P.
(2002). An intracellular block to primate lentivirus replication. Proc. Natl. Acad. Sci. USA
99: 11549-11551
[Full Text]
-
Lerat, E., Capy, P., Biemont, C.
(2002). The Relative Abundance of Dinucleotides in Transposable Elements in Five Species. Mol Biol Evol
19: 964-967
[Abstract]
[Full Text]
-
Costas, J.
(2002). Characterization of the Intragenomic Spread of the Human Endogenous Retrovirus Family HERV-W. Mol Biol Evol
19: 526-533
[Abstract]
[Full Text]
-
Martin, J., Kabat, P., Herniou, E., Tristem, M.
(2002). Characterization and Complete Nucleotide Sequence of an Unusual Reptilian Retrovirus Recovered from the Order Crocodylia. J. Virol.
76: 4651-4654
[Abstract]
[Full Text]
-
Dolei, A., Serra, C., Mameli, G., Pugliatti, M., Sechi, G., Cirotto, M. C., Rosati, G., Sotgiu, S.
(2002). Multiple sclerosis-associated retrovirus (MSRV) in Sardinian MS patients. Neurology
58: 471-473
[Abstract]
[Full Text]
-
Benit, L., Dessen, P., Heidmann, T.
(2001). Identification, Phylogeny, and Evolution of Retroviral Elements Based on Their Envelope Genes. J. Virol.
75: 11709-11719
[Abstract]
[Full Text]
-
Reus, K., Mayer, J., Sauter, M., Zischler, H., Muller-Lantzsch, N., Meese, E.
(2001). HERV-K(OLD): Ancestor Sequences of the Human Endogenous Retrovirus Family HERV-K(HML-2). J. Virol.
75: 8917-8926
[Abstract]
[Full Text]
-
Wang-Johanning, F., Frost, A. R., Johanning, G. L., Khazaeli, M. B., LoBuglio, A. F., Shaw, D. R., Strong, T. V.
(2001). Expression of Human Endogenous Retrovirus K Envelope Transcripts in Human Breast Cancer. Clin. Cancer Res.
7: 1553-1560
[Abstract]
[Full Text]
-
Bogerd, H. P., Wiegand, H. L., Yang, J., Cullen, B. R.
(2000). Mutational Definition of Functional Domains within the Rev Homolog Encoded by Human Endogenous Retrovirus K. J. Virol.
74: 9353-9361
[Abstract]
[Full Text]
-
Kamp, C., Hirschmann, P., Voss, H., Huellen, K., Vogt, P. H.
(2000). Two long homologous retroviral sequence blocks in proximal Yq11 cause AZFa microdeletions as a result of intrachromosomal recombination events. Hum Mol Genet
9: 2563-2572
[Abstract]
[Full Text]
-
Medstrand, P., Landry, J.-R., Mager, D. L.
(2001). Long Terminal Repeats Are Used as Alternative Promoters for the Endothelin B Receptor and Apolipoprotein C-I Genes in Humans. J. Biol. Chem.
276: 1896-1903
[Abstract]
[Full Text]
-
Kuhelj, R., Rizzo, C. J., Chang, C.-H., Jadhav, P. K., Towler, E. M., Korant, B. D.
(2001). Inhibition of Human Endogenous Retrovirus-K10 Protease in Cell-free and Cell-based Assays. J. Biol. Chem.
276: 16674-16682
[Abstract]
[Full Text]
-
Bowen, N. J., McDonald, J. F.
(2001). Drosophila Euchromatic LTR Retrotransposons are Much Younger Than the Host Species in Which They Reside. Genome Res
11: 1527-1540
[Abstract]
[Full Text]