Previous Article | Next Article ![]()
Journal of Virology, April 2006, p. 4179-4182, Vol. 80, No. 8
0022-538X/06/$08.00+0 doi:10.1128/JVI.80.8.4179-4182.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts 02115,1 Division of Immunology, New England Primate Research Center, Harvard Medical School, Southborough, Massachusetts 01772,2 Partners AIDS Research Center and Massachusetts General Hospital, Harvard Medical School, Charlestown, Massachusetts 021293
Received 27 October 2005/ Accepted 31 January 2006
|
|
|---|
|
|
|---|
rhCMV 180.92 was isolated from diseased lung tissue of a simian immunodeficiency virus-infected macaque at the New England Primate Research Center and passaged in vitro six times on human fibroblasts (MRC-5) and seven times on primary rhesus macaque fibroblasts (10). The genome was cloned as seven overlapping cosmids and fully sequenced as previously described (18). The genome sequence was determined using 3,277 contiguous sequences (average of 500 bp) with an average 8.8-fold coverage. The first nucleotide of the genome was identified by homology to the rhCMV 68.1 genome (9).
The rhCMV 180.92 genome is 215,678 nucleotides long with an overall 49% GC content. No perfect repeat regions were identified at the left end, middle, or right end of the genome. The rhCMV 180.92 genome contains 258 ORFs of 21 to 2,178 aa, with an average of 284 aa (Fig. 1) (see Table S1 in the supplemental material). This includes all ORFs beginning with an ATG and encoding a protein of at least 100 aa. When an ORF completely overlaps a larger ORF (49 ORFs), it is included only if it is encoded on the opposite cDNA strand (31 ORFs) or in a different reading frame on the same strand (18 ORFs). Ten ORFs of less than 100 aa are also included since there is significant homology (>20% at the aa level) to a previously annotated CMV ORF. Finally, 11 ORFs have homology with CMV genes that have been previously reported to be spliced.
![]() View larger version (38K): [in a new window] |
FIG. 1. Genome organization of rhCMV 180.92. The scale in kilobase pairs is shown at the top of each line. Protein-coding regions are indicated by leftward or rightward arrows according to the DNA strand on which they are encoded. Introns are shown as narrow white bars. Names of leftward ORFs are shown above the genome, and names of rightward ORFs are shown below the genome. rhCMV 180.92 ORFs with sequence homologues in hCMV (>20% aa similarity) are shown in pink and labeled with numbers in gray boxes, minus the rhUL prefix for ORFs rhUL6 to rhUL148 located between 15 and 166 kbp and the rhUS prefix for ORFs rhUS1 to rh US32 located between 178 and 212 kbp. rhORF6 is a homologue to the previously reported hCMV ORF6 (14). rhCMV 180.92 ORFs with no homologues in hCMV are shown in blue and labeled with plain numbers, minus the rh prefix. rhCMV 180.92 ORFs not found in rhCMV68.1 are shown in yellow, with the other half of the box coded pink or blue for the presence or absence of sequence homology to hCMV ORFs. The arrowhead at nt 166333 indicates the comparable position where 10 ORFs found in 68.1 but not in 180.92 are located. A fusion between rhUL148 and rh167 in rhCMV 180.92 is indicated by the interruption in the ORF.
|
rhCMV isolate 68.1 ORFs were named numerically in order from left to right in the genome, beginning with an Rh (initial letter capitalized) if the ORF had a sequence homologue in hCMV (9). This convention failed to identify the hCMV homologues and made it difficult to incorporate newly identified rhCMV ORFs. In an effort to maximize information and maintain consistency, all rhCMV 180.92 ORFs are identified with the prefix rh. rhCMV 180.92 ORFs with significant homology to hCMV ORFs are identified using the rh prefix followed by the hCMV name (Fig. 1, ORFs shown in pink), e.g., the homologue of hCMV UL11 is identified as rhUL11. rhCMV 180.92 ORFs with significant homology to rhCMV 68.1 ORFs but with no homology to hCMV ORFs are identified using the 68.1 numbering (Fig. 1, blue ORFs), e.g., the homologue of rhCMV 68.1 rh2 is identified as rh2 in the rhCMV180.92 genome. When a new rhCMV ORF is identified and has no homology to hCMV, the rhCMV ORF is identified using the number of the immediately upstream rhCMV 68.1 ORF, followed by a decimal point and additional digit, e.g., a new rhCMV 180.92 ORF located immediately downstream to rhCMV 68.1 ORF rh13 is identified as rh13.1. Exons are identified using the name of the ORF they are part of followed by "ex1," "ex2," etc.; e.g., the two exons encoding the homologue of UL36 are identified as rhUL36ex1 and rhUL36ex2.
The overall amino acid homology between isolate 180.92 and 68.1 ORFs is 98%. The most significant differences between 180.92 and 68.1 are (i) 10 ORFs encoded in 68.1 but not 180.92 (Fig. 1, the arrowhead at nt 166333 indicates the comparable location of the 10 ORFs in 68.1) and (ii) 8 ORFs encoded in 180.92 but not found in 68.1 (Fig. 1, yellow ORFs). Most of these differences are encompassed within a divergent region between approximately nt 160000 and 175000 (Fig. 2). The region of the 180.92 genome, including ORFs from rhUL122 to rh170, is approximately 5 kb smaller than the comparable region in 68.1. Starting from the left end of this region, rhUL122 through rh157.2 are present in both genomes. However, in 180.92, there are two ORFs, rhUL128 and rh157.5, immediately adjacent to rh157.2 that are not present in 68.1 (region A in Fig. 2). UL128 is a protein with sequence characteristics of a CC-chemokine that confers endothelial cell tropism (1, 8). The gene is frequently disrupted in CMV isolates passaged through fibroblasts in vitro; for example, chCMV has a frame shift in the UL128 homologue, and in Toledo UL128, exon 3 is missing due to an inversion of the region between UL128 and UL133 (1, 6, 17).
![]() View larger version (12K): [in a new window] |
FIG. 2. Major region of divergence between rhCMV 68.1 and 180.92 genomes. ORFs present in both rhCMV 68.1 and 180.92 are represented by black arrows, whereas ORFs found in only one rhCMV genome are represented by white arrows. A consensus wild-type rhCMV genome organization for this region is shown at the bottom. Regions A, B, C, and D described in the text are designated by the lines. Inversion of regions B and C is demonstrated by the reversed letters.
|
Interestingly, the regions immediately to the left and right of region B in isolate 68.1, regions C and D in Fig. 2, have both been deleted from 180.92. Furthermore, 180.92 has a fusion between the 5' end of rh167 and the 3' end of rhUL148. In 68.1, rhUL147 and rhUL148 from region C are rightward, whereas they are leftward genes in Towne hCMV and chCMV (6, 17). This combination of events can be most directly explained if both region B and region C have been inverted in 68.1. In this case, the defect in 180.92 could be explained by a single deletion of region C and D from the putative wild-type consensus sequence where the breakpoints were located in rhUL148 and rh167, resulting in a fusion of the two ORFs (Fig. 2).
This region accounts for all 10 ORFs found in isolate 68.1 but not found in 180.92, and 2 of the 8 ORFs found in 180.92 but not in 68.1. The other six ORFs unique to 180.92 have significant nucleotide homology to comparable regions in 68.1, but complete ORFs are not present in 68.1 due to minor nucleotide sequence differences (see Table S3 in the supplemental material). It is not clear whether these represent natural strain variants, alterations during in vitro passage, or technical issues related to cloning or sequencing.
These studies suggest that a consensus wild-type rhCMV genome may be best constructed by comparing multiple genome sequences, similar to the use of chCMV and multiple isolates of hCMV for the description of the hCMV gene repertoire (6, 7, 14, 15). In this case, a consensus wild-type rhCMV would encode 268 ORFs, with 112 of these ORFs having recognized homology to hCMV genes. By contrast, 163 of 168 chCMV genes have homologues in hCMV (6, 14). This analysis is complicated by the use of different criteria for identifying and annotating CMV ORFs, but the results still reflect significant divergence of rhCMV from hCMV. The reason for the dramatic difference between rhCMV and hCMV is not clear. Perhaps some of the "unique" rhCMV ORFs have functional, but not sequence, homology with known or yet to be identified hCMV genes (16).
These studies demonstrate that the rhCMV genome is more complex than previously appreciated. Comparisons with a second rhCMV isolate highlight genomic instability in a common region for both isolates that shares a striking similarity to the genetic changes that arise in passaged hCMV isolates. We cannot exclude the possibility that some portions of the rhCMV genome may have been deleted from both isolates. Recognition that some of the rhCMV genome may be lost or rearranged during in vitro passage suggests that primate CMVs are not free of the genomic instability that plagues hCMV. An approach similar to the use of chCMV and multiple hCMV sequences to develop a consensus hCMV ORF repertoire may be necessary to define the complete ORF repertoire for rhCMV.
Nucleotide sequence accession number. The complete annotation of the rhCMV 180.92 genome sequence was submitted to GenBank under the accession number DQ120516.
We thank M. D. Daniel for the original isolation of the 180.92 rhCMV.
Supplemental material for this article may be found at http://jvi.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»