This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplemental material
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrowReprints and Permissions
Right arrow Copyright Information
Right arrow Books from ASM Press
Right arrow MicrobeWorld
Google Scholar
Right arrow Articles by Rancurel, C.
Right arrow Articles by Karlin, D.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Rancurel, C.
Right arrow Articles by Karlin, D.

 Previous Article  |  Next Article 

Journal of Virology, October 2009, p. 10719-10736, Vol. 83, No. 20
0022-538X/09/$08.00+0     doi:10.1128/JVI.00595-09
Copyright © 2009, American Society for Microbiology. All Rights Reserved.

Overlapping Genes Produce Proteins with Unusual Sequence Properties and Offer Insight into De Novo Protein Creation{triangledown} ,{dagger}

Corinne Rancurel,1 Mahvash Khosravi,2 A. Keith Dunker,2 Pedro R. Romero,2* and David Karlin3*

Architecture et Fonction des Macromolécules Biologiques, Case 932, Campus de Luminy, 13288 Marseille Cedex 9, France,1 Center for Computational Biology and Bioinformatics, 410 West 10th Street, Suite 5000, Indiana University-Purdue University, Indianapolis, Indiana 46202-5122,2 19, rue Ornano, 69001 Lyon, France3

Received 23 March 2009/ Accepted 23 July 2009

It is widely assumed that new proteins are created by duplication, fusion, or fission of existing coding sequences. Another mechanism of protein birth is provided by overlapping genes. They are created de novo by mutations within a coding sequence that lead to the expression of a novel protein in another reading frame, a process called "overprinting." To investigate this mechanism, we have analyzed the sequences of the protein products of manually curated overlapping genes from 43 genera of unspliced RNA viruses infecting eukaryotes. Overlapping proteins have a sequence composition globally biased toward disorder-promoting amino acids and are predicted to contain significantly more structural disorder than nonoverlapping proteins. By analyzing the phylogenetic distribution of overlapping proteins, we were able to confirm that 17 of these had been created de novo and to study them individually. Most proteins created de novo are orphans (i.e., restricted to one species or genus). Almost all are accessory proteins that play a role in viral pathogenicity or spread, rather than proteins central to viral replication or structure. Most proteins created de novo are predicted to be fully disordered and have a highly unusual sequence composition. This suggests that some viral overlapping reading frames encoding hypothetical proteins with highly biased composition, often discarded as noncoding, might in fact encode proteins. Some proteins created de novo are predicted to be ordered, however, and whenever a three-dimensional structure of such a protein has been solved, it corresponds to a fold previously unobserved, suggesting that the study of these proteins could enhance our knowledge of protein space.


* Corresponding author. Mailing address for Pedro R. Romero: Center for Computational Biology and Bioinformatics, 410 West 10th Street, Suite 5000, Indiana University-Purdue University, Indianapolis, IN 46202-5122. Phone: (317) 278-4101. Fax: (317) 278-9201. E-mail: promero{at}compbio.iupui.edu. Mailing address for David Karlin: 19, rue Ornano, 69001 Lyon, France. Phone: 44 (0) 755 194 5984. Fax: 44 (0) 207 611 8254. E-mail: karlin.david{at}gmail.com

{triangledown} Published ahead of print on 29 July 2009.

{dagger} Supplemental material for this article may be found at http://jvi.asm.org/.


Journal of Virology, October 2009, p. 10719-10736, Vol. 83, No. 20
0022-538X/09/$08.00+0     doi:10.1128/JVI.00595-09
Copyright © 2009, American Society for Microbiology. All Rights Reserved.