| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Laboratory for Clinical Microbiology, Department of Microbiology, Tumor and Cell Biology, Karolinska University Hospital, Karolinska Institutet, SE-17176 Stockholm, Sweden,1 Cancer Center Karolinska, Department of Oncology and Pathology, Karolinska University Hospital, Karolinska Institutet, SE-17176 Stockholm, Sweden,2 Center for Molecular Medicine, Department of Medicine, Karolinska University Hospital, Karolinska Institutet, SE-17176 Stockholm, Sweden,3 Department of Cell and Molecular Biology, Karolinska Institutet, SE-17177 Stockholm, Sweden4
Received 5 January 2007/ Accepted 30 January 2007
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Polyomaviruses are small DNA viruses capable of persistent infection and having oncogenic potential. They have been found in many mammals and birds worldwide. Two polyomaviruses are known to normally infect humans, JC virus (JCV) and BK virus (BKV), both discovered in 1971 (13, 30). They are genetically closely related to each other, and both viruses show 70 to 80% seroprevalence in adults (23). The routes of acquisition and sites of primary infection are largely unknown, but both viruses can establish a latent infection in the kidneys and, in the case of JCV, also in the central nervous system (31). Persistent replication in the kidneys is evidenced by the fact that JCV, and occasionally also BKV, can be detected in the urine of healthy adults (23). BKV has also been detected in the feces of children (35). JCV and BKV are highly oncogenic in experimental animals, but a role in the development of human tumors has not been established (25). Disease caused by human polyomaviruses has been observed in immunosuppressed individuals. JCV is the causative agent of progressive multifocal leukoencephalopathy, a demyelinating disease of the brain and a feared complication of AIDS (21). This disorder has recently received renewed attention after the occurrence of fatal cases among patients treated with natalizumab for multiple sclerosis (22, 24). BKV has been associated with posttransplantation nephropathy and hemorrhagic cystitis in hematopoietic stem cell transplant (HSCT) recipients (7, 17). In addition to JCV and BKV, there are reports on the presence of the primate polyomavirus simian virus 40 (SV40) in humans, possibly introduced by contaminated poliovirus vaccine produced in monkey cells (4), although other ways of transmission have also been suggested (10, 27). SV40 genomic sequences have been detected in human malignant mesothelioma tumors, but its role in human tumor development remains debated (25).
We have developed a system for large-scale molecular screening of human diagnostic samples for unknown viruses (2). With this technology, we have initiated a systematic search for previously unrecognized viruses infecting humans in order to identify agents that are potentially involved in human disease. We describe here the identification and molecular characterization of a hitherto unknown human polyomavirus, which is only distantly related to the other known primate polyomaviruses. In analogy with the nomenclature of the other human polyomaviruses, we propose the name KI polyomavirus, KIPyV, for the newly discovered virus.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Genomic analysis of the KIPyV genome. A 4,808-bp-long PCR product reaching around the circular DNA genome was generated by primers directed "outward" from the first cloned fragment (Pol-82R [TTGACTTCTTGGCCTTGTTAG] and Pol-315F [AGATGCTGACACAACTGTATG]) and by using a long-range enzyme mixture (Platinum Taq High Fidelity; Invitrogen). A second PCR product of 500 bp overlapping both ends of the long product and closing the circle was generated by primers PolconF (GGATTTTGTATGTGCTAGAAC) and PolconR (TTAACTAGAGGTACAACAAGC). Both PCR products were directly sequenced in order to obtain a consensus sequence for the complete genome. The same procedure was applied for determining the full-length sequences of three isolates. Putative open reading frames (ORFs) were identified, and sequences were aligned with Clone Manager Suite 6 (version 6.00) and Align Plus (version 4.10) (Scientific and Educational Software, Durham, NC). Prediction of putative binding sites for transcription factors was performed by comparison with consensus sequences and with the help of the AliBaba software (version 2.1) (16).
Phylogenetic analysis. All sequences were downloaded from GenBank, except those of murine pneumotropic virus, which were based on a corrected sequence (T. Ramqvist, unpublished data). Accession numbers are available upon request. The complete genomes and the amino acid sequences of the early and late proteins, respectively, were aligned and neighbor-joining trees generated with ClustalX version 1.83. The data were bootstrapped with 1,000 replicates, and trees were viewed with NJplot. For whole-genome analysis, the noncoding control regions were removed in accordance with established conventions and the first nucleotide in the T antigens was designated nucleotide 1.
PCR for detection of KIPyV. PCR experiments for detection of KIPyV were performed in a diagnostic laboratory setting, ensuring that the necessary precautions to avoid contamination were taken. Positive and negative controls were included in each experiment. DNA was extracted by commercially available kits as described under the respective sample type. Five microliters of extracted DNA was used as the template for the nested PCR. The 50-µl reaction mixtures used for the first and second PCRs consisted of 1x GeneAmp PCR buffer II (10 mM Tris-HCl [pH 8.3], 50 mM KCl; Applied Biosystems), 2.5 mM MgCl2, 0.2 mM each deoxynucleoside triphosphate, 2.5 U of AmpliTaq Gold DNA polymerase (Applied Biosystems), and 20 pmol of each of the primers. The first-PCR primers were POLVP1-39F (AAG GCC AAG AAG TCA AGT TC) and POLVP1-363R (ACA CTC ACT AAC TTG ATT TGG). The second-PCR primers were POLVP1-118F (GTA CCA CTG TCA GAA GAA AC) and POLVP1-324R (TTC TGC CAG GCT GTA ACA TAC). The cycling conditions for the first and second PCRs were 10 min at 94°C, followed by 35 cycles of amplification (94°C for 1 min, 54°C for 1 min, and 72°C for 2 min). Products were visualized on an agarose gel. The product size after the second PCR was 207 bp. All PCR products were sequenced in order to confirm that they were specific for KIPyV.
Prevalence study populations. (i) Nasopharyngeal aspirates. Six hundred thirty-seven stored nasopharyngeal aspirates submitted to the Karolinska University Laboratory for diagnosis of respiratory virus infections from July 2004 to June 2005 were studied. Sampling month, patient's age and sex, and routine diagnostic (immunofluorescence and virus culture) findings were recorded before samples were made anonymous. The median age of the sampled patients was 7 years (range, 0 months to 90 years). Two hundred seventy-one samples came from children <2 years old. Total nucleic acids were extracted from 200-µl samples by the MagAttract Virus Mini M48 kit (QIAGEN), and nucleic acids were eluted in 100 µl. Eluted nucleic acids were initially analyzed in pools of 10 samples, and 5 µl of the pool was used as the template for the PCR. Single samples from PCR-positive pools were analyzed.
(ii) Feces. One hundred ninety-two fecal samples submitted to the Karolinska University Laboratory for diagnosis of virus infections from 1 July 2005 to 30 November 2005 were studied. Samples were mainly submitted for diagnosis of gastroenteritis. Basic sampling data were recorded before samples were made anonymous. The median age of the sampled patients was 1 year (range, 0 months to 17 years). One hundred nineteen samples came from children <2 years old. Nucleic acids were extracted from 400 µl of a frozen 20% feces suspension by MagAttract Virus Mini M48 kit and the Biorobot M48 instrument (QIAGEN) and eluted in 100 µl, and 5-µl samples were used for subsequent individual PCR assays.
(iii) Urine of HSCT recipients. One hundred fifty urine samples collected from HSCT recipients for the study of BKV and JCV were analyzed (14). Fifty of the samples were selected on the basis of previous analysis results; 20 were previously shown to be positive for BKV, 8 were positive for JCV, 2 were positive for both BKV and JCV, and 20 were negative for both viruses. JCV and BKV status was unknown for the remaining 100 samples. As described previously, samples were analyzed by PCR without preceding DNA extraction (6).
(iv) Serum of HSCT recipients. Thirty-three serum samples drawn from 17 HSCT recipients 2 to 6 weeks after transplantation were studied. Total nucleic acids were extracted from 200 µl of serum by QIAamp Virus Spin Kit (QIAGEN) and eluted in 50 µl.
(v) Whole blood. Whole EDTA blood from 192 healthy volunteer blood donors in Stockholm was analyzed. DNA was extracted from 200-µl samples with the MagAttract DNA Mini M48 kit and the Biorobot M48 instrument (QIAGEN) and eluted in 50 µl.
(vi) Leukocytes. Ninety-six frozen preparations of Ficoll-separated leukocytes were studied. Samples were originally sent to the laboratory for diagnosis of cytomegalovirus by PCR and virus culture and therefore mainly originated from immunosuppressed patients. DNA was extracted from 105 cells with the MagAttract DNA Mini M48 kit and the Biorobot M48 instrument (QIAGEN) and eluted in 100 µl.
Nucleotide sequence accession numbers. The sequences reported in this paper have been deposited in the GenBank database under accession no. EF127906 (KIPyV isolate 60), EF127907 (KIPyV isolate 350), and EF127908 (KIPyV isolate 380).
| RESULTS |
|---|
|
|
|---|
Genome analysis of KIPyV. The source nasopharyngeal aspirate sample containing the SV40-like sequence was identified by PCR analysis of aliquots saved before pooling. The positive sample was named Stockholm 60. A second PCR product reaching around the circular DNA genome was used as a template for determining the complete consensus viral genome sequence. The genome was confirmed to be circular and 5,040 nucleotides in length (accession number EF127906). Two additional isolates that were identified during the subsequent prevalence study (see below) were sequenced by the same approach. (Stockholm 350, accession number EF127907; Stockholm 380, accession number EF127908). The three genomes were highly similar. Both isolates Stockholm 350 and Stockholm 380 differed from the prototype isolate by 10 nucleotide substitutions, and they differed from each other by seven single bases. The variable positions showed some clustering in the regulatory region, but there were also a few isolate-specific amino acid substitutions in the putative proteins.
Overall genome organization. The genomic organization of KIPyV is typical for a member of the family Polyomaviridae, with an early region encoding regulatory proteins (small t [ST] and large T [LT] antigens) and a late region coding for structural proteins separated by a noncoding regulatory region (Fig. 1). The genome size is within the range of polyomaviruses. Properties of the deduced proteins and their similarities to those of JCV, BKV, and SV40 are shown in Table 1. While the nonstructural proteins have substantial amino acid sequence similarity to those of the other primate polyomaviruses, the structural proteins have a very low degree of similarity to those of other known polyomaviruses.
|
|
|
|
The early proteins show similarities to other members of the polyomavirus family, primarily BKV, JCV, SV40, and simian agent 12 (SA12), and alignment with the LT antigens of other polyomaviruses shows that most regions characteristic of LT antigen are present also in KIPyV. The N-terminal 82 amino acids (aa) of the ST antigen are common to the LT antigen. This region encompasses the J domain carrying the conserved region 1 sequence and the HPDKGG box. In the C-terminal part that is unique to the ST antigen, there is a cysteine- rich domain typical of polyomaviruses. In the LT antigen, the HPDKGG box is followed by a putative Rb binding domain (LRCNE), a nuclear localization signal, a DNA binding domain, a Zn finger region including the zinc finger motif (C-312, C-315, H-327, H-331), and finally an ATPase-p53 binding domain containing the highly conserved GPXXXGKT sequence (aa 434 to 441). Unlike BKV, JCV, SV40, and SA12, the host range domain seems to be missing.
Late region. In the late region of the genome, there are putative ORFs for capsid proteins VP1, VP2, and VP3 (Fig. 1). As in all polyomaviruses, VP3 is encoded by the same ORF as VP2 by the use of an internal start codon. There is an overlap between the C terminus of VP2/3 and the N terminus of VP1, as is the case in other polyomaviruses. It can be noted that both VP2 and VP3 of KIPyV are large in comparison with those of other members of the polyomavirus family (400 and 257 aa, respectively).
For VP1, there is only one possible start codon, in contrast to the VP1 proteins of BKV, JCV, and SV40. The degree of homology with other VP1 proteins is remarkably low (Table 1). VP1 has only 30% identity with its closest counterparts (those of JCV and MPyV). In KIPyV VP1, the only region that shows a relatively high degree of similarity to those of other polyomaviruses is the sequence that in MPyV VP1 has been shown to bind calcium, corresponding to approximately aa 237 to 248 in VP1 of KIPyV. Otherwise, VP1 of KIPyV has very limited homology to those of other polyomaviruses.
The VP2/VP3 gene showed even lower similarity to its counterparts in other polyomavirus species (Table 1). In fact, neither a nucleotide nor a translated BLAST search with this gene sequence generated any significant matches in the public databases. Thus, the identity of this ORF is only indicated by its position in the genome. VP2 and VP3 of all other polyomaviruses contain a conserved VP1 binding domain (located at approximately aa 281 to 295 in MPyVP2). No corresponding sequence is found in KIPyV.
Several polyomaviruses such as BKV, JCV, and SV40 express an Agno protein from the late mRNA. In KIPyV, the region between the start codons of VP2 and ST/LT, respectively, is large (513 bp) and this could possibly indicate the presence of an agno gene. However, there is no corresponding ORF present in this region.
Phylogenetic analysis. Phylogenetic trees were constructed on the basis of alignments of the first isolate, Stockholm 60, with known viruses of the Polyomaviridae family. Analysis of early protein genes consistently clustered KIPyV with JCV, BKV, SV40, and SA12 but as an outlier in this clade (Fig. 4). Analysis of the complete genome yielded highly similar results (data not shown). In contrast, analysis of the late protein genes consistently placed KIPyV outside the tree as the most distant group member (Fig. 4).
|
|
| DISCUSSION |
|---|
|
|
|---|
Phylogenetic analysis of the complete genome revealed that KIPyV is clearly separate from all other known polyomaviruses. When the early and late genes were analyzed separately, disparate results were obtained. While the early genes group with JCV, BKV, SV40, and SA12, the late genes form an outlier to the entire polyomavirus family. A possible explanation for this could be that the virus once emerged by recombination of two phylogenetically distant viruses, each contributing half of the genome. Alternatively, the early region may simply be more conserved because of more-rigid functional constraints while the late genes have diverged more rapidly and become very distant from those of its relatives. It is possible that future discoveries of additional polyomavirus species, e.g., in other primates, could make the phylogenetic tree more complete and provide additional clues to the evolution of KIPyV. Several new members of the polyomavirus family besides KIPyV have been discovered in the last few years (18, 19). The unique late region of KIPyV indicates that it may be the first discovered member of a new subfamily of polyomaviruses.
For assignment of nucleotide numbers, two different systems are in use for polyomaviruses. Either the nucleotide adjacent to the start codon of the T antigens, i.e., the first codon in the regulatory region, or a nucleotide in the origin of replication is considered nucleotide 1. The numbering we selected for KIPyV begins within the presumed origin and proceeds clockwise through the late region, as has been done for most primate polyomaviruses, such as JCV, SV40, SA12, and some strains of BKV.
On the basis of the ORF analysis, KIPyV is expected to express VP1 to VP3 and the ST and LT antigens, while the MT antigen and the Agno protein are both missing. The absence of the MT antigen is not surprising, since most polyomaviruses, the primate polyomaviruses included, lack expression of this particular protein. The lack of an ORF for an Agno protein is more interesting, since this protein is expressed by JCV, BKV, SV40, and SA12. The functional implications of this are unclear, since the function of the Agno protein remains to be fully elucidated. However, definitive conclusions about protein expression require further experimental evidence, e.g., in the form of mRNA analysis data.
The previously known primate polyomaviruses are generally not considered to be agents of respiratory tract disease. JCV and BKV have nevertheless been detected in human tonsil tissue, and a respiratory route of transmission of polyomaviruses has been hypothesized (9, 15, 28). BKV has also been found in the feces of children (35). The finding of KIPyV in nasopharyngeal aspirates and feces is consistent with these observations. However, the findings provide few clues to replication or latency sites or to possible disease caused by the virus. The screening of 1,300 clinical samples still provided important data. First, the cloning of a virus infecting humans was confirmed, since KIPyV genomes could be recovered from multiple individuals and since the isolates showed sequence variation. Second, the virus was detected in different age groups. Third, a concomitant finding of a recognized respiratory tract pathogen in most positive persons indicated that KIPyV was likely not the virus responsible for the respiratory tract symptoms.
The prevalence of KIPyV in humans remains unknown. Development of an antibody assay and/or finding relevant material for detecting latent virus is necessary for improved estimates. The findings obtained with nasopharyngeal aspirates suggest that the KIPyV prevalence is at least 1% in our study population. The absence of KIPyV in urine samples suggests that the biology and/or prevalence of KIPyV in kidneys differ significantly from those of JCV and BKV.
The cell type tropism and host range of polyomaviruses stem from both their regulatory regions and the receptor binding characteristics. The existence of multiple predicted c-Ets-1 transcription factor binding sites prompted us to investigate whether KIPyV may possibly replicate in lymphocytes in accordance with lymphotropic papovavirus, which harbors three putative binding sites for this transcription factor (38). This hypothesis is consistent with KIPyV being detected in the nasopharynx during an inflammatory process due to infection by a respiratory virus. However, studies of whole blood of healthy subjects or purified peripheral blood leukocytes of immunosuppressed patients did not support this hypothesis. On the other hand, given the 1% recovery rate in the respiratory tract samples, sample numbers may still be too small for definite conclusions. It must also be recognized that KIPyV is as yet only known by its genome sequence as detected by PCR. The virus has not been replicated in vitro, and no assay for detection of antibodies is available. Such experiments will be important in the further characterization of this virus.
A few newly discovered viruses have still not been associated with disease (20, 29). Nonetheless, the majority of known human viruses are pathogenic in one situation or another and any newly discovered virus must therefore be considered a likely pathogen. A problem with persisting viruses is that they are often discovered out of their symptomatic context, so that establishing their association with a particular disease may require extensive investigation. Historically, this has been the case for hepatitis B virus, Epstein-Barr virus, and parvovirus B19 (5, 8, 11). JCV and BKV are also very prevalent viruses that cause disease only under rare circumstances. Searching for a disease associated with KIPyV will be challenging but may have important medical implications. Primary candidate diseases could include unclear infectious complications in immunocompromised individuals and different types of cancer. Whether JCV, BKV, and SV40 can contribute to tumor development in humans is still a matter of debate, and one could assume that KIPyV will be subject to the same discussion. There are putative binding sites for p53, as well as the Rb family of tumor suppressor proteins, in the LT antigen of KIPyV, which indicates that a role for this virus in tumorigenesis cannot be excluded.
This study reinforces the notion that many human viruses have eluded detection despite more than 100 years of research in virology. Since viruses are likely pathogens, their identification remains an urgent scientific task. This study further illustrates how molecular virus screening of respiratory tract samples can be applied for discovering unknown viruses of different types, and not only agents of respiratory tract disease, thus making it a suitable approach for a "human virome project".
| ACKNOWLEDGMENTS |
|---|
This study was supported by the Torsten and Ragnar Söderberg Foundation, the Swedish Cancer Foundation, Nanna Svartz' Fund, the Gustav Vth Jubilee Society, and the Swedish Society for Clinical Microbiology. T.A. is a fellow of the Swedish Research Council.
| FOOTNOTES |
|---|
Published ahead of print on 7 February 2007. ![]()
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| J. Bacteriol. | Mol. Cell. Biol. | Microbiol. Mol. Biol. Rev. |
|---|
| Clin. Vaccine Immunol. | ALL ASM JOURNALS |
|---|