Previous Article | Next Article ![]()
Journal of Virology, September 2003, p. 9578-9589, Vol. 77, No. 17
0022-538X/03/$08.00+0 DOI: 10.1128/JVI.77.17.9578-9589.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Charles R. Anderson Jr.,4 Howard A. Ross,2 and Mary Poss1,3*
Wildlife Biology Program,1 Division of Biological Sciences, University of Montana, Missoula, Montana,3 School of Biological Sciences, University of Auckland, Auckland, New Zealand,2 Wyoming Cooperative Fish and Wildlife Research Unit, University of Wyoming, Laramie, Wyoming4
Received 27 December 2002/ Accepted 9 June 2003
|
|
|---|
|
|
|---|
The clinical manifestation of FIV infection in domestic cats typically involves chronic immune dysfunction, opportunistic infections, and behavioral disorders. In contrast, the limited data available do not indicate that FIV in naturally infected wild cats is associated with overt disease (9, 43). The apparently low virulence coupled with a high genetic divergence among lentiviruses both within and among wild cat species have been interpreted as the outcome of a long history of coevolution between the virus and natural host in the Felidae (7, 9).
The dynamics and natural history of retroviral infections in natural populations remain poorly understood. At the same time, the recognition of human immunodeficiency virus (HIV) infection as a zoonotic disease, which originated from simian immunodeficiency virus (SIV) in wild primates, has highlighted the importance of wildlife species as sources of viral epidemics (18, 19, 22). More generally, wildlife populations are likely to hold many important clues to the maintenance, emergence, and evolution of viral pathogens (13, 64). Not surprisingly, an increasing number of studies in recent years have examined natural lentivirus infections in primate hosts at both the intrahost (4, 10, 12, 20, 36) and the population (23, 37, 49) levels.
Studies addressing the evolution and epidemiology of FIV have been limited to domestic- and feral-cat populations (1, 8, 11, 53, 62), where the virus is reported to have been introduced only within the last few decades (9). No studies have examined the epidemiology and evolution of endemic lentiviruses within a specific, free-ranging population of wild feline hosts. Such a study provides the opportunity to investigate how the virus is transmitted and how it evolves in a natural system where it has putatively persisted for extended periods of time. To this end, phylogenetic analysis of sequence data obtained from naturally infected individuals can be used first to identify routes and rates of virus transmission within a host population. Secondly, obtaining serial samples from the same individuals and from a large number of infected animals from the same population makes it possible to assess the rate and course of molecular evolution in the virus over time. Determining the evolutionary characteristics of endemic viruses is significant, given that strong selective regimes and high rates of nucleotide substitution are factors frequently considered a hallmark of epidemic retroviruses, such as HIV.
The number of analytic tools available to estimate rates of molecular evolution from sequence data has increased rapidly in recent years. Specifically, methods that incorporate time data in a phylogenetic framework and which are particularly suited for the analysis of quickly evolving organisms, such as RNA viruses, have been introduced (14, 15, 52). In these applications, the time of isolation for each virus sequence is taken into account in the reconstruction of phylogenetic relationships, so as to provide an estimate of the expected amount of molecular change occurring per time interval. Therefore, these methods offer a way of comparing the courses of virus evolution both within and among host individuals, and among different viral genes, from sequence data. Moreover, new methods for rate estimation, such as Bayesian inference approaches (16), have recently become available and represent a significant improvement over earlier approaches by not being reliant on a known genealogy for temporally spaced sequences.
Here, we describe the prevalence and molecular evolution of endemic FIVpco in a population of free-ranging cougars. Prevalence in this population was high (58% overall), particularly among older individuals. Using data from two viral genes, pol and env, we document the occurrence of two divergent types of FIVpco that were specific to the study area. In addition, we found that virus sequences were characterized by low intrahost genetic diversity and purifying selection and were evolving at a rate that was considerably lower than estimates for HIV type 1 (HIV-1) over the short term. These results stand in contrast to those observed in symptomatic lentiviral infections and give insights into possible courses and mechanisms of host-virus adaptation in natural host systems.
|
|
|---|
DNA isolation. Genomic DNA was extracted from lymphocytes of captured animals using a Super Quik-Gene isolation kit (AGTC, Denver, Colo.) at Wyoming State Veterinary Laboratory, Laramie. DNA and serum were then sent to the University of Montana, where all remaining work was conducted. For one infected individual (SR627) that died during the course of the study, a second DNA sample was obtained from brain rather than blood.
Serology. Evidence of FIV infection was determined by using a flow cytometry assay that is based on serological recognition of cells infected with a cougar FIV isolate (M. Poss, unpublished data). Cougar serum was incubated with either a feline thymic lymphoma cell line (3201) or 3201 cells infected with a cougar lentivirus. The cells were then washed twice in phosphate-buffered saline containing 2% fetal calf serum and incubated with cat anti-immunoglobulin G-fluorescein isothiocyanate. The cells were washed and analyzed on a FACSCalibur (Becton Dickinson, San Jose, Calif.), and data were statistically evaluated using Cell Quest software. Serum was considered positive for FIV antibody if there was a significant difference in fluorescence intensity between infected and uninfected 3201 cells that were incubated with the serum. All test results were confirmed by immunoblotting (data not shown).
PCR amplification of pol and env. Fragments of the proviral pol and env genes were amplified from serial dilutions of DNAs of infected individuals. First, a 462- to 527-bp fragment of pol was amplified by nested PCR. The primers used for the first round were 2479F (5' TAG AAG CAT TAA CAG AAA TAG TAG AGA 3') and 3028R (5' TTG TAA TTT ATC TTC AGG AGT TT 3') or 1259F (5' GAA GGA AAG GTA AAA AGA GCA GAT 3') and 1261R (5' ATC TTC AGG AGT TTC AAA TCC CCA 3') (5). If the first round was amplified with 2479F-3028R, the 1259F-1261R pair was used for the second round. In a few cases, instead of 1259F, a third forward primer, 2581F (5' AAA TCA GGA AAA TGG AGA A 3'), was used with primer 1261R. If the first round had been amplified using 1259F-1261R, the primers for the second round were 2506F (5' GGT AAA AAG AGC AGA TCC TA 3') and 3012R (5' AGT TTC AAA TCC CCA CCA TAG 3'). The conditions for the first round of PCR were as follows: 3 min at 94°C, followed by 30 cycles of 94°C for 30 s, 44°C for 40 s, and 71°C for 50 s, and a final extension for 5 min at 72°C. For the second round of amplification, the conditions were 3 min at 94°C, followed by 30 cycles of 94°C for 30 s, 48°C for 30 s (52°C for 2506F-3012R), and 71°C for 50 s, and a final extension for 5 min at 71°C.
In addition, a 719-bp fragment of env was amplified by nested PCR. The first-round primers were 7175F (5' ATT GCA TAT TGG GAT TTT A 3') and 8267R (5' TAT CTT AGA CAC TCG TTG G 3'), while the second round employed primers 7406dF (5' CCR TGG GGW GGR AGT AGR T 3') and 8115R (5'GTG CCA GTG GTT GCT CCT ATC A 3'). The conditions for the first round were 3 min at 94°C, followed by 10 cycles of 94°C for 30 s, 40°C for 30 s, and 72°C for 90 s, followed by 25 cycles of 94°C for 30 s, 52°C for 30 s, and 72°C for 70 s. The conditions for the second round were 3 min at 94°C, followed by 30 cycles of 94°C for 30 s, 50°C for 40 s, and 71°C for 50 s, and a final extension for 5 min at 72°C.
The PCR products from pol and env amplifications were cloned into a PCR4 vector (Invitrogen, Inc., San Diego, Calif.). One to 12 env clones were sequenced for each time point, with a maximum of four clones originating from the same PCR to reduce the chance of resampling. For pol, multiple clones were obtained only from the nine individuals for whom serial samples were available. All primer sites were removed from the sequences prior to analysis.
A number of precautionary measures were taken to minimize the probability of contamination of samples: DNA extractions took place in Wyoming at a facility where no FIV work is conducted. At the University of Montana, PCRs for FIV gene amplification were set up in a room that was kept free of plasmids and PCR products. Samples from all animals were first assessed by PCR for pol, regardless of the animals' serological status. None of the samples from seronegative animals yielded a PCR product. Samples that were FIV positive were then evaluated by PCR for env. At most, samples from two infected individuals, determined by pol PCR to be infected by unrelated viruses, were amplified at the same time. Sequential samples from the same individuals were analyzed as the samples were received and hence were never subjected to PCR at the same time. Finally, PCR were randomly checked by heteroduplex mobility assay for diversity within the amplification reaction and against the clone generated for an animal to ensure (i) that clones represented the original sample, (ii) that we were not missing diversity within the population of amplified sequences, and (iii) that no contamination from our reference strains or other cloned products had occurred.
To determine Taq polymerase error, a 719-bp cloned cougar lentivirus env sequence was amplified and cloned, and 10 clones were sequenced. A 681-bp section (after primer sequences were excluded) was compared to the original sequence, and the polymerase error rate per nested PCR procedure was defined as the number of nucleotide changes observed per site.
Phylogenetic analysis.
Alignments were conducted in Lasergene99 (version 4.06) from DNASTAR, Inc. (Madison, Wis.), using the CLUSTAL W algorithm, and were adjusted manually. For each of the two gene segments, the most appropriate model of nucleotide substitution was found with the program MODELTEST (50), using the full set of available sequences. In both cases, the best model, based on Akaike's information criterion, was a General Time Reversible + gamma (GTR +
) model (Table 1). Unless otherwise indicated, the estimated parameters were used in all further analyses.
|
View this table: [in a new window] |
TABLE 1. Parameters of molecular-evolution modelsa
|
To examine the ancestral relationships of FIVpco from the SR population, an ML tree was constructed from the entire set of env sequences. As for pol, the sequence from a Vancouver Island cougar was used as an outgroup (accession number AY307115). The large number of sequences and a high number of polytomies found in the tree prevented a full heuristic search, and the search was therefore terminated after 100,000 iterations or 168 h of computing time. While additional trees with higher likelihood probably exist, they are unlikely to differ significantly from the tree obtained. Support for individual clusters was assessed through a bootstrap analysis with 500 replicates, using a neighbor-joining (NJ) tree algorithm and the same likelihood settings used in obtaining the ML tree. Figures for both pol and env trees were generated with the help of the program TreeView (44).
Analysis of env sequences.
The complete alignment of all env sequences obtained from SR cougars (n = 292) was analyzed in several ways for evidence of positive or negative selection. First, the proportions of possible nucleotide substitutions for all sites, as well as for synonymous and nonsynonymous sites (Pamilo-Bianchi-Li method) were calculated in the program MEGA2 (27) based on pairwise comparisons. Second, sequencewide evidence of positive or negative selection was sought with a Z test using MEGA2 based on the number of synonymous substitutions per synonymous site (KS) and the number of nonsynonymous substitutions per nonsynonymous site (KA). This test evaluates whether the null hypothesis KS = KA can be rejected in favor of one of the alternative hypotheses (KS > KA or KS < KA). Third, a codon-based test for selection was also performed using ML to test for selection acting on only a fraction of the codon sites. Two models of codon selection were fitted to the sequence data (38). Model M1 assumes that nonsynonymous substitutions can be either fatally deleterious (dN /dS ratio;
0 = 0) or neutral (
1 = 1) to selection. Model M2 extends model M1 by including a third category of otherwise-selected codons, with the nature of the selection indicated by the value of
2 as positive (
2 > 1) or negative (
2 < 1). The parameters of models M1 and M2 were estimated using PAML software (67) from an NJ tree constructed under a GTR model, and selection models were compared using likelihood ratio testing. Different initial values of
2 were used, as advised by the user documentation, because the software is known to suffer from convergence problems (38). All codon sites were also classified as experiencing purifying, neutral, or some other type of selection using an empirical Bayesian approach also provided in PAML. The analyses were performed twice: (i) after removing those codon sites at which there were gaps or stop codons or sites having ambiguous nucleotides in some sequences (16 of 226 codons) and (ii) after removing those sequences containing gaps, stop codons, or ambiguous nucleotides (2, 35, and 2 of 292 sequences, respectively).
To determine whether nucleotide changes were evenly distributed across the region of env and pol amplified or whether they occurred at particular hot spots, the number of observed nucleotide states per site was calculated using the program MacClade version 3.0 (32a) and plotted as the average number of states per nucleotide found within a sliding window of 10 nucleotides.
Rate estimates of molecular evolution. Rate estimates for the evolution of env and pol were based on all sequences available for an individual or on a subset of sequences for the population as a whole. Two estimation techniques were employed. First, we used an ML approach implemented in the program TipDate (52). Because this technique requires a known tree topology, NJ trees were first constructed for the respective sets of sequences using PAUP* (62a). To facilitate adequate rooting of the within-individual trees, two or three intermediately distant sequences from other individuals were included as outgroups during tree construction. These taxa (and the branches leading to them) were subsequently removed prior to rate estimation. For the population as a whole, the data set was limited to one randomly selected sequence per individual and time point, to keep the estimation computationally feasible, and the root was found by the midpoint method. Once NJ trees had been obtained, they were entered into the program TipDate version 1.2 (52) together with the respective sequence data and the sampling date for each sequence (to the nearest decimal fraction of a year) to find the ML rate of substitution, as well as an estimated absolute age of the corresponding tree (i.e., time to the most recent common ancestor), including 95% CIs.
For the second estimation approach, we employed a recently developed Monte Carlo Markov chain (MCMC) framework for the Bayesian estimation of evolutionary rates and population parameters (16; http://www.cebl.auckland.ac.nz/mepi/index.html). This technique also uses sets of dated sequences but offers several specific advantages compared to the ML estimation. First, it does not assume that tree topology is known and it is not limited to estimation of evolutionary rates. Instead, it takes account of the uncertainty in any phylogenetic estimate and the error inherent in any sampling scheme by drawing representative samples from the space of plausible population sizes, evolutionary rates, and phylogenies (16). The result is an estimate of the posterior probability distribution of evolutionary rates (marginalized over phylogenies and population size). The method applied uses a coalescent prior on divergence times (26, 55). Under this prior, viral population size within and between hosts was assumed to remain constant over the time spanned by the genealogy. Previous applications have shown that rate estimation using this method is robust to different assumptions of population history (16, 28). An additional value of the Bayesian framework is that it allows specific prior information to be incorporated into the analysis. In our case, since the approximate ages of the cougar hosts were known, we made the assumption that the most recent common viral ancestor within an individual was unlikely to be much older than the infected animal itself or the time since infection. Thus, we used an exponential distribution with expectation equal to the age of the animal (as estimated in the field) or the known time since infection (based on seroconversion) as a prior for the age of the root of the tree (i.e., the age of the most recent common ancestor of all viral sequences in an infected individual). This had the effect of influencing the depth of the within-host viral phylogeny, since the depth of the phylogeny is necessarily constrained to be bounded by the time to infection because it is assumed that either a single virus or a homogenous group of viruses establishes infection in the host. No prior on the age of the root was used for the estimation of evolutionary rates for the SR population as a whole.
Nucleotide sequence accession numbers. One env sequence per SR individual has been submitted to GenBank under accession numbers AY120787 to AY120810 and AY120812 to AY120815, as well as one pol sequence per individual (AY120816 to AY120839 and AY120841 to AY120844 [see Fig. 2]). Nexus files of the complete set of pol and env sequences used in the analysis can be obtained from http://www.cebl.auckland.ac.nz/FIVpco or from the authors.
![]() View larger version (24K): [in a new window] |
FIG. 2. ML tree of partial FIVpco pol sequences from free-ranging cougars in the Rocky Mountains (United States and Canada). Taxa designated SR represent animals from the study area. For each SR individual, one sequence was randomly selected from a larger data set. Taxa designated Pco represent previously described FIV pol sequences from other cougar populations in western North America. The state or province where the animal was captured is indicated in boldface letters (YNP, Yellowstone National Park). Branch lengths of >0.02 are shown below the branches. The FIVpco sequence from a Vancouver Island cougar (PLV1695) was used as an outgroup. The values next to tree nodes represent bootstrap values of >70 based on 500 replicates using an NJ tree algorithm and the same model of substitution used for the ML tree.
|
|
|
|---|
![]() View larger version (26K): [in a new window] |
FIG. 1. Prevalence of FIVpco infection in a population of free-ranging cougars in southeast Wyoming by sex and age class. Prevalence was determined by serology using a flow cytometric assay and confirmed by PCR. The sample sizes are shown above the bars (n = 52 [total]). For individuals that were sampled repeatedly, only infection status for the most recent sample was considered.
|
Phylogenetic analysis showed that genetic divergence of FIVpco based on pol sequences was high in Rocky Mountain cougars, as documented previously (7). Within lineage I, the corrected divergence among sequences from different areas was 8 to 10%, while differences within lineage II ranged from 2 to 15%. The high diversity found in Rocky Mountain FIVpco contrasted with low degrees of sequence differentiation among pol sequences from within the SR population (Fig. 2). All samples originating from the SR population formed well-supported groups within their respective lineages, with only 0 to 3% sequence divergence within the groups. Together, these data indicate that SR cougars were infected with highly related FIV types that were clearly distinguishable from viruses found in other parts of the Rocky Mountains.
Phylogenetic analysis of env sequences. The relationship of virus sequences from animals within the SR population was evaluated based on env because env sequences were longer than pol sequences and because we made an initial assumption that the evolutionary rate in env would be higher than that for pol. The general topology of the ML tree of env sequences from the SR population (Fig. 3) was congruent with the clustering of sequences derived from SR animals in the pol tree (Fig. 2). Specifically, pol and env sequences from an individual had the same phylogenetic affiliation relative to those from other individuals. At the same time, the genealogical structure of the env tree was more defined and there was stronger support for substructure in each of the two lineages, possibly due to the larger amount of phylogenetic information contained in the longer env fragment.
![]() View larger version (32K): [in a new window] |
FIG. 3. ML tree of 292 partial FIV env sequences from 28 infected cougars in a population in southeast Wyoming. Mother-kitten groups are indicated by open boxes, and animals with related viruses and confirmed overlap in home ranges are indicated by shaded boxes. Where identical sequences were identified, their number is indicated in parentheses next to the taxon symbol. Branch lengths of .0.01 are shown below the branches. The ML search was terminated after 100,000 iterations (see the text). See Fig. 2 for further descriptions.
|
Nucleotide divergence in env. Based on all 292 available clones, uncorrected proportions of possible nucleotide substitutions (p distances) among env sequences derived from the same infected animal varied between 0 and 2% (Fig. 4). This range applied to comparisons both within and among different sampling time points. Different individuals carrying virus from the same phylogenetic lineage exhibited distances ranging from 0.5 to 5.5% but with a bimodal distribution centered around 1 and 4%. The uncorrected average difference among viruses from the two lineages was 14%. It is noteworthy that there were virtually no insertions or deletions among the env sequences.
![]() View larger version (14K): [in a new window] |
FIG. 4. Partitioning of genetic diversity in FIV env sequences from 28 naturally infected cougars based on p distances observed in pairwise comparisons. For each category of comparison (e.g., within individuals, same time point), frequencies sum up to 1.
|
Proportion of synonymous and nonsynonymous changes in env. Within individuals, the mean proportion of synonymous changes per synonymous site (KS) in env sequences was very similar to the proportion of nonsynonymous changes (KA) (Table 2). Among individuals infected with virus from the same viral lineage, KS was twice times as high as KA, and this increased to a 10-fold difference for comparisons between the two lineages. Concordant with the observation that synonymous changes were more common, a test for possible deviations from neutrality indicated that molecular evolution in env was dominated by purifying selection (Z = 10.41; P < 0.001).
|
View this table: [in a new window] |
TABLE 2. Average proportions of synonymous and nonsynonymous sites from pairwise comparisons of FIVpco env sequences
|
2 that matched the seed value (range, 0.7 to 10), suggesting failure to converge. The classification of sites using empirical Bayes' estimates identified 37 to 47% of sites as experiencing purifying selection but, as with the ML estimation, it was inconclusive in respect to the proportion of neutral sites and sites under positive selection (data not shown). Distribution of molecular changes across env andpol sequences. We plotted the average number of different character states observed per nucleotide in a sliding window of 10 nucleotides for both pol and env in order to assess whether the amplified fragments contained both variable and conserved regions (Fig. 5). Both gene fragments were found to contain segments in which nucleotide diversity was elevated, but overall, the distributions of substitutions in both viral genes were uniform.
![]() View larger version (26K): [in a new window] |
FIG. 5. Plots of nucleotide diversity for FIV env (A) and pol (B) gene fragments. Diversity is expressed as the average number of character states found per sliding window of 10 nucleotides.
|
|
View this table: [in a new window] |
TABLE 3. ML and MCMC estimates of evolutionary rates for FIVpcoa
|
1% per decade for the population as a whole. As before, there was no evidence that the evolutionary rates for the two genes were different. Some problems with uncertainty in the estimation were encountered in that most ML estimates were not significantly different from zero. This suggests that overall, the number of changes in the virus that had occurred within an infected individual was small relative to the time intervals between samples (8 to 49 months). Also, the fact that precision was generally higher for estimates for env, which was based on a fragment that was 220 bp longer than pol, points to the fact that longer sequences were more likely to contain evolutionary signal and thus enhanced the ability to estimate rates. While most estimated times to the most recent common viral ancestor fell within or close to the assumed maximum time since infection, estimated tree ages were much higher (>30 years) in a few cases. Such long time periods, which would span several cougar generations and thus transmission events, appear unrealistic given the small number of viruses presumed to be transmitted during natural retrovirus infections (59, 65). We suspect that unrealistically high tree ages and very low rates were related to difficulties in accurately estimating the tree topology when the signal in the data was weak. However, in light of these results, it remains possible that substantial viral sequence diversity occasionally survives transmission bottlenecks. |
View this table: [in a new window] |
TABLE 4. ML and MCMC estimates of evolutionary rates for two FIVpco genes within naturally infected cougarsa
|
Whereas evolutionary rates rarely exceeded 2% per decade in most individuals, data for three animals (SR620, SR623, and SR630) were suggestive of higher rates in pol and env in at least three of the four estimates obtained per cat. Notably, these three cougars must have become infected less than a year before the first serial virus sample, because they either were newborn kittens (SR620 and SR630) or had seroconverted since the last sample (SR623). Thus, higher substitution rates appeared to be associated with more recent infections.
|
|
|---|
The cooccurrence of several distinct FIV lineages within the same population of hosts has been described previously for free-ranging cougars and African lions (5, 7). While the emergence of regional FIV subtypes is certainly not surprising for an animal with a wide geographic distribution, the intermixing of these distinct types suggests extensive cougar movements, current and/or historic, from or to the SR population. At the same time, SR pol sequences formed monophyletic clusters within their respective lineages, indicating that the two virus strains found within the population had been able to rapidly and effectively spread on a local scale.
The presence of two viral lineages in the SR population also raises the possibility that individuals could become coinfected with both types. Indeed, coinfection with divergent FIVpco genotypes was previously documented in a cougar (7). However, we had no evidence for such coinfection in this population, suggesting that it occurs at a low frequency. Furthermore, coinfection with divergent virus types would have been a prerequisite for the generation of possible recombinant types carrying pol and env from different viral lineages. Such genetic reassortment of viral genotypes has been documented, for example, for HIV both in vivo and in vitro (35, 54). However, the same clusters of infected individuals were identified for both pol and env in our data set, thus providing no evidence for ancestral recombination among lineages. It should be noted, however, that coinfection and recombination involving highly related viruses remain possible. Because of the small sequence differences involved, such events would be very difficult to detect but at the same time would be expected to be inconsequential in respect to the general genealogical patterns.
Phylogenetic analysis of env identified a number of well-supported clusters containing closely related sequences (Fig. 3). Adult female cougars in particular shared highly related viruses with their offspring, supporting the premise of vertical transmission (7). Sequence divergence between viruses in the mother and offspring was usually very small but ranged up to >1%. This was likely due to the fact that samples from mother and kitten were typically obtained at different times, and the interval between sampling and infection was unknown for both. In addition to the close affiliation of FIVpco sequences obtained from females and their offspring, closely related virus sequences were also found in individuals with overlapping or neighboring home ranges (Fig. 3). For example, spatial movement data collected using radio telemetry showed that cougars that had contiguous or overlapping home ranges were infected with viruses that clustered together in the phylogenetic tree (600, 603, 607, and 608 in lineage I and 604, 605, 606, and 611 in lineage II). Adult females frequently share home ranges with offspring from a previous mating, and hence, some of these virus clusters may also be derived from related animals. The relationship of all animals in the population is currently under investigation and will illuminate mechanisms of viral transmission among cougars. However, the bimodal distribution of genetic distances among env sequences from different individuals within the two FIVpco lineages (Fig. 3) suggests that genetic distances of up to 2% among individuals reflect recent infection events. In contrast, distances of 4%, observed among viruses from different individuals, likely represent genotypes separated by more virus generations and potentially more than one transmission event.
In contrast to the common local spread of a particular genotype, the introduction of a new virus into the population appears to be relatively rare. The env sequence most distinguishable from all other SR genotypes was isolated from a yearling male, SR622, which was probably not born in the study area. Unlike female offspring, whose presence close to the mother's territory is often tolerated even after maturity, male yearlings are usually expelled from the natal area and tend to disperse further away from it (31). One plausible scenario is that SR622 had immigrated into the SR population from a neighboring or distant area, where it had become infected with a related, yet distinguishable, variant of FIVpco.
Because retroviral diversity increases with time since infection, we expected to observe larger divergence in env in some individuals. Instead, intrahost env diversity in cougars averaged <1%, which is similar to values for pol in cougars and domestic cats (5) but lower than the 1.1 to 2.5% reported for env in domestic cats (62). This low sequence diversity within hosts is remarkable given that cougars can reach ages of >10 years in the wild and that some of the individuals we examined were known to have harbored infections for several years. Also, we found that within individuals, sequences taken at time points that were 8 to 49 months apart were not more different from each other than sequences derived from the same time point. This may be due, in part, to the fact that we derived our sequence data from proviral DNA, and FIVpco may reside in long-lived cell populations. However, these data suggest that evolution of FIV env within cougar hosts may occur at a lower rate than in epidemically occurring retroviruses. It was also noteworthy that there was a virtual absence of insertions or deletions in env fragments from FIVpco, while such changes are commonly observed in domestic-cat FIV and primate lentiviruses.
Concordant with a low genetic diversity among intrahost isolates and the close relation of sequences from putative family groups, the substitution rates we estimated for cougar FIVpco were low compared to estimates for HIV or SIV. At the same time, it should be noted that evolutionary rates have been obtained using a wide range of techniques and that some of these are not equivalent to our approach, making a comparison with published estimates difficult. The only study we are aware of that estimated evolutionary rates in FIV was done with domestic cats and found no changes in the consensus sequence of pol or gag in a naturally infected cat over a 3-year period (21). For a part of the env gene that included a variable region, a rate of 3.4% per 10 years was estimated in the same study, similar to estimates obtained here. Although results were variable, our estimates suggest an overall rate of 1 to 3% per 10 years for both pol and env. In comparison, studies of HIV evolution within hosts or small groups of hosts reported higher rates of 0.3 to 1.0% per year (30, 36, 47, 60), while rates up to 3% per year have been calculated for SIV (36, 47). Interestingly, HIV-1 evolves considerably more slowly at the population level, where, using sequence data collected over a decade or more, rates of 1 to 2 and 4 to 6% per decade have been estimated for pol (based on all substitutions) and env (V3; based on synonymous changes alone), respectively (32). Whereas FIVpco evolution within some infected cougars in our study appeared to be faster than for the population as a whole, our data do not support rates that would exceed half a percent per year. Comparisons of our estimates to those for primate lentiviruses should be done cautiously, however, because most of the studies focused on particularly variable regions of env and, in the case of SIV, were almost all conducted with experimentally rather than naturally infected individuals (36, 47). More studies of naturally infected primate and feline hosts are needed to determine whether a lower rate of molecular evolution, relative to HIV, could indeed be a characteristic of natural host-virus systems or of feline lentiviruses.
Given the estimated substitution rate of <0.5% per year, the time intervals between our samplings (8 to 49 months) were at the low end of what would yield measurable changes in the virus. Clearly, extending the time frame of virus sampling should improve the precision of rate estimates, as would longer sequence data. For the data collected here, the use of a Bayesian MCMC approach that integrated over all possible tree topologies markedly improved our estimates for evolutionary rates from virus sequences compared to the ML approach. Specifically, incorporating prior information on the probable time since infection made it possible to obtain estimates for virus evolution within individuals where ML rates had not been distinct from zero (Table 4). In almost all cases, the probable time since infection fell within the obtained 95% CIs of both ML and MCMC estimates, indicating that this prior information was in agreement with the data.
No prior for tree age was used for the analysis of virus sequences from all SR individuals combined. The obtained estimates overall suggest that the split between the two FIVpco lineages in Rocky Mountain cougars occurred within the last 200 years (Table 3). This relatively recent time point is consistent with the fact that FIVpco from this region is distinct from virus found in cougars elsewhere in North America (7) and that cougar populations in the western United States are currently recovering from low population levels due to persecution during the 19th century and the first half of the 20th century (3).
Several factors may be involved in a lower rate of molecular evolution in FIVpco than in other lentiviruses, including the fidelity of replication and the length of virus generation time. Reported error rates for the replication enzymes of different retroviruses vary by a factor of 20 (51), raising the possibility that lower mutation rates are due to a higher fidelity of the viral reverse transcriptase. In addition, low evolutionary rates may be related to longer virus generation times. Although in HIV, the generation time is as short as 1 to 3 days, it is thought to be substantially longer for a small percentage of viruses that establish latent infections in long-lived cell types, such as macrophages (48). Therefore, it may be that cell types facilitating slower virus replication comprise the majority of FIVpco-infected cells in cougars, resulting in slow virus population turnover. Although the question of cell tissue tropism in FIVpco remains speculative, the observation that HIV variants targeting long-lived cells outlive fast-replicating variants (42, 63) suggests a potential selective regime that could favor slowly replicating viruses in a coevolutionary process.
Evasion of the host immune system is frequently considered a partial explanation for the high rate of molecular evolution observed in many lentiviruses, especially for the envelope glycoprotein. At the same time, evidence for positive selection has been more or less limited to studies of HIV-1 in humans (33, 34, 58, 66) and is controversial even there (25, 57). Higher proportions of nonsynonymous changes also characterize SIV infection of macaques, which are not a natural host of primate lentiviruses. In African green monkeys, a natural host for SIV, the proportion of synonymous substitutions was higher than that of nonsynonymous changes (12, 61). In the domestic cat, a strong role of positive selection in FIV lacks support based on reported KS/KA ratios that rarely exceed 1 (1, 53, 62). Consistent with the latter results, we found evidence for selection largely against rather than in favor of nonsynonymous changes in cougar FIVpco env, supporting the idea that molecular evolution of this gene occurs under some functional constraint. Results for the identification of sites under positive selection were ambiguous but suggest that the number of such sites is likely to be low. In addition, the low rate of nonsynonymous substitutions relative to synonymous substitutions between hosts argues against any process of host-specific adaptation as the virus moves from one host to the next during virus transmission. This lends some weight to the argument that forms of FIV maintained in wild cats are well adapted to their host species. Still, compared to nonpathogenic lentiviruses in other natural hosts, which are often characterized by rapid accumulation of evolutionary changes resulting from high rates of replication (4, 36), intrahost genetic diversity and substitution rates were low in the cougar FIVpco system. This shows that despite overall similarities, life history strategies and evolutionary characteristics may vary considerably among different lentiviruses in their natural hosts.
The cell line infected with a cougar lentivirus was generously donated by S. VandeWoude (Colorado State University, Fort Collins). D. Wroe conducted captures and sampling of cougars, and S. Painter, E. Villegas, and N. Akamine provided technical assistance in the laboratory. We thank three anonymous reviewers for their comments.
Present address: Department of Statistics and Department of Zoology, University of Oxford, Oxford, United Kingdom. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»