Large-scale, multi-genome analysis of alternate open reading frames in bacteria and archaea

Felipe Veloso, Gonzalo Riadi, Daniela Aliaga, Ryan Lieph, David S. Holmes

Producción científica: Contribución a una revistaArtículo de revisiónrevisión exhaustiva

18 Citas (Scopus)

Resumen

Analysis of over 300,000 annotated genes in 105 bacterial and archaeal genomes reveals an unexpectedly high frequency of large (>300 nucleotides) alternate open reading frames (ORFs). Especially notable is the very high frequency of alternate ORFs in frames +3 and -1 (where the annotated gene is defined as frame + 1). The occurrence of alternate ORFs is correlated with genomic G+C content and is strongly influenced by synonymous codon usage bias. The frequency of alternate ORFs in frame -1 is also influenced by the occurrence of codons encoding leucine and serine in frame +1. Although some alternate ORFs have been shown to encode proteins, many others are probably not expressed because they lack appropriate signals for transcription and translation. These latter can be mis-annotated by automatic gene finding programs leading to errors in public databases. Especially prone to mis-annotation is frame -1, because it exhibits a potential codon usage and theoretical capacity to encode proteins with an amino acid composition most similar to real genes. Some alternate ORFs are conserved across bacterial or archaeal species, and can give rise to mis-annotated "conserved hypothetical" genes, while others are unique to a genome and are misidentified as "hypothetical orphan" genes, contributing significantly to the orphan gene paradox.

Idioma originalInglés
Páginas (desde-hasta)91-105
Número de páginas15
PublicaciónOMICS A Journal of Integrative Biology
Volumen9
N.º1
DOI
EstadoPublicada - 2005

Áreas temáticas de ASJC Scopus

  • Biotecnología
  • Bioquímica
  • Medicina molecular
  • Biología molecular
  • Genética

Huella

Profundice en los temas de investigación de 'Large-scale, multi-genome analysis of alternate open reading frames in bacteria and archaea'. En conjunto forman una huella única.

Citar esto