| Literature DB >> 18384676 |
Michaël Bekaert1, Ivaylo P Ivanov, John F Atkins, Pavel V Baranov.
Abstract
BACKGROUND: Ornithine decarboxylase antizymes are proteins which negatively regulate cellular polyamine levels via their affects on polyamine synthesis and cellular uptake. In virtually all organisms from yeast to mammals, antizymes are encoded by two partially overlapping open reading frames (ORFs). A +1 frameshift between frames is required for the synthesis of antizyme. Ribosomes change translation phase at the end of the first ORF in response to stimulatory signals embedded in mRNA. Since standard sequence analysis pipelines are currently unable to recognise sites of programmed ribosomal frameshifting, proper detection of full length antizyme coding sequences (CDS) requires conscientious manual evaluation by a human expert. The rapid growth of sequence information demands less laborious and more cost efficient solutions for this problem. This manuscript describes a rapid and accurate computer tool for antizyme CDS detection that requires minimal human involvement.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18384676 PMCID: PMC2375905 DOI: 10.1186/1471-2105-9-178
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Scheme of negative regulatory feedback in regulation of antizyme synthesis and conservation of the frameshift site. A. The competition between ribosomal frameshifting and standard translation (termination) is sensitive to polyamine levels. An increase in polyamine concentrations shifts the competition toward frameshifting resulting in elevation of antizyme synthesis and consequent inhibition of polyamine synthesis and uptake. A decrease in polyamine concentrations shifts the competition towards standard translation and produces the opposite effect on the synthesis of antizyme and polyamines. B. WebLogo representing the alignment of 153 OAZ sequences. The last codon in the zero frame and the first codon -in +1 frame are indicated by red and blue bars respectively. It can be seen that the only universal nucleotide of the frameshift site is T (U in mRNA) corresponding to the first position of the stop codon at the end of the first ORF.
Figure 2Scheme of analyses performed by OAF. OAF pipeline. Each step performed by OAF is shown as a box, grey boxes represent external modules utilized by OAF.
Figure 3OAZ clustering. A circular tree of OAZ sequences representing clustering that has been used to design profile-HMMs used by OAF.
OAZ sequences detected in completed genomes
| 1 | 1 | |
| 0 | 0 | |
| 1 | 1 | |
| 1 | 1 | |
| ? | 1 | |
| 0 | 0 | |
| 1 | 1 | |
| ? | 0 | |
| 1 | 1 | |
| 3 | 3 | |
| 3 | 3 | |
| 3 | 3 | |
| 3 | 3 | |
| 0 | 0 | |
| 0 | 0 | |
| 0 | 0 | |
| 0 | 0 | |
| 0 | 0 | |
| 0 | 0 | |
| 0 | 0 |
OAZ genes in completed genomes. Organism names are given in the first column. The second column shows the number of OAZ genes that expected in the corresponding genome prior to the analysis. The third column shows a number of OAZ genes found by OAF.
New OAZ sequences detected in dbEST
| OAZ (A2) | EC778732 EC779052 EC779122 | |
| OAZ1 | CN034176 CN035835 CN036385 CN036992 CN037270 CN037900 CN038182 CN039093 CN039394 CN041555 CN043386 CN045230 CN045468 CN045566 CN045877 | |
| OAZ1 | CN034176 CN035835 CN036385 CN036992 CN037270 CN037900 CN038182 CN039093 CN039394 CN041555 CN043386 CN045230 CN045468 CN045566 CN045877 | |
| OAZ (M) | ES491521 | |
| OAZ (N) | CJ977611 CJ978020 CJ978332 | |
| OAZ (N) | CJ979174 CJ980047 CJ980437 CJ981142 CJ981263 CJ981361 CJ981516 CJ982573 CJ984323 CJ985769 CJ985981 CJ986817 CJ987169 CJ987483 CJ987938 CJ989363 CJ990870 | |
| OAZ (M) | DT716986 | |
| OAZ1L | ES475944 | |
| OAZ1S | ES479153 | |
| OAZ (A2) | EH276450 | |
| OAZ (M) | DW986242 DY402940 | |
| OAZ (N) | ES408646 ES409780 ES410567 ES411071 ES411545 ES412181 ES413997 ES414281 | |
| OAZ (F2) | EH628788 | |
| OAZ (M) | DB912841 DB914374 DB915016 DB916854 DB918091 DB918471 DB918744 DB919717 DB920015 | |
| OAZ1L | DB859357 DB860568 DB862111 DB862372 DB864801 DB870209 DB873104 | |
| OAZ (A1) | EH012818 EH014124 EH015343 EH016344 EH016850 | |
| OAZ2 | EH286700 EH293671 | |
| OAZ (N) | BI747416 | |
| OAZ (A2) | ES581004 | |
| OAZ2 | EL518444 | |
| OAZ1L | EL539625 | |
| OAZ1L | ES373335 ES373822 ES376344 ES380163 | |
| OAZ1S | ES384943 | |
| OAZ (A1) | EH114285 |
OAZ sequences found in dbEST which were not present in the original dataset of OAZ protein sequences. Organism names are given in the first column, the second column shows OAF clustering of OAZ sequences. The third column lists EST accession numbers.
OAZ sequence variants in dbEST
| Species | OAZ type | EST | Complete | Partial | Origin |
| OAZ (M) | CF197657 | 1 | 2 | Allele | |
| OAZ (M) | CK484346 | 1 | 2 | Allele | |
| OAZ1S | CA787369 | 27 | 3+ | Allele | |
| OAZ1S | CD760042 | 57 | 3+ | Allele | |
| OAZ1S | DN717815 | 15 | 3+ | Allele | |
| OAZ1S | DN722004 | 8 | 3+ | Allele | |
| OAZ (M) | BP515897 | 3 | 3+ | Allele | |
| OAZ (M) | DT613005 | 5 | 3+ | Allele | |
| OAZ1 | DR772378 | 2 | 3+ | Allele | |
| OAZ1 | EB520932 | 2 | 3+ | Allele | |
| OAZ2 | BQ713320 | 7 | 3+ | Allele | |
| OAZ2 | CB196396 | 6 | 3+ | Allele | |
| OAZ1L | BX856931 | 1 | 1 | Duplication | |
| OAZ1L | CA376808 | 0 | 2 | Duplication | |
| OAZ1S | CA348102 | 4 | 3+ | Allele | |
| OAZ1S | CA357487 | 1 | 3+ | Allele | |
| OAZ1S | CA345360 | 0 | 3+ | Duplication | |
| OAZ1S | CX352386 | 1 | 2 | Duplication | |
| OAZ1L | DT246851 | 7 | 3+ | Allele | |
| OAZ1L | DT260869 | 4 | 3+ | Allele | |
| OAZ1L | DT262889 | 2 | 3+ | Allele | |
| OAZ1L | DT346263 | 3 | 3+ | Allele | |
| OAZ1L | DT359488 | 2 | 3+ | Allele | |
| OAZ2 | DT117815 | 6 | 3+ | Allele | |
| OAZ2 | DT135205 | 4 | 3+ | Allele | |
| OAZ2 | CK366765 | 6 | 3 | Allele | |
| OAZ2 | CV105321 | 2 | 3 | Allele | |
| OAZ1S | DW536698 | 7 | 3+ | Duplication | |
| OAZ1S | DY726570 | 3 | 3+ | Duplication | |
| OAZ2 | DV947965 | 1 | 2 | Allele | |
| OAZ2 | DV961695 | 1 | 2 | Allele |
Summary of OAZ sequence variants. Names of organisms are shown in the first column. The second column shows OAZ classification according to OAF clustering. The third column lists accession numbers for representative ESTs. The fourth column shows a number of EST sequences containing complete OAZ CDS. The fifth column shows a number of ESTs containing a partial OAZ CDS. Likely reasons for the existence of OAZ variants are given in the last column. Asterisks indicate no complete genome sequence for the corresponding organism, meaning that our assessment of variant origin is presumptive.
Contaminant OAZ sequences
| - | OAZ1 | From cell host (pig) | DV107889 DV107991 | |||
| - | OAZ (A1) | From pest (sharpshooter?) | DY278292 | |||
| - | OAZ (A1) | From pest (aphid?) | BU826038 DN500296 | |||
| - | OAZ (A1) | From pest (aphid?) | DW084698 | |||
| - | OAZ (F1) | From symbiont fungus | CF389943 | |||
| OAZ1L/OAZ1S/OAZ2 | OAZ (A1) | From food (Brian shrimp?) | CK868153 |
OAZ corresponding to sequences from contaminating organisms are shown. Names of organisms where contaminants were found are given in the first column. The second column represents the corresponding taxonomic family. The third column shows an OAZ cluster to which an OAZ is expected to belong based on the taxonomy of the source organism. The third column shows a cluster to which the sequence belongs. The fourth column lists the likely source of contamination. The fifth contains EST accession numbers.