| Literature DB >> 21653513 |
Bas E Dutilh1, Rasa Jurgelenaite, Radek Szklarczyk, Sacha A F T van Hijum, Harry R Harhangi, Markus Schmid, Bart de Wild, Kees-Jan Françoijs, Hendrik G Stunnenberg, Marc Strous, Mike S M Jetten, Huub J M Op den Camp, Martijn A Huynen.
Abstract
MOTIVATION: The intensification of DNA sequencing will increasingly unveil uncharacterized species with potential alternative genetic codes. A total of 0.65% of the DNA sequences currently in Genbank encode their proteins with a variant genetic code, and these exceptions occur in many unrelated taxa.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21653513 PMCID: PMC3129529 DOI: 10.1093/bioinformatics/btr316
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Outline of the FACIL algorithm, see text for details. The Genetic Code Logo visualizes the results, including the reliability of alternative genetic code predictions. The example shows the predicted code for G.pseudospinescens mitochondrial fragments, generated by entering the ‘example’ input data on the FACIL web server. The logo shows the 64 codons from left to right (predicted alternatives in red), each with a stack of AAs. The stack height indicates the percentage of RF3 trees supporting the predicted translation, the letter sizes indicate the scaled AA alignment scores and the red line is the percentage of RF1 or RF2 trees that predict a stop codon.
Fig. 2.F1-score for predicting coding (AA) and stop codons by homology alone and after RF filtering. Values are based on the predictions for all codons from the random fragments of bacterial, archaeal and organellar genomes (see Supplementary Table 2).
Variables used in each RF and their normalized importance (calculated as MeanDecreaseGini/max MeanDecreaseGini; see Supplementary Table 1)
NA, not applicable.