
Virtual Ribosome--a comprehensive DNA translation tool with support for integration of sequence feature annotation.

Rasmus Wernersson

Abstract

Virtual Ribosome is a DNA translation tool with two areas of focus: (i) providing a strong translation tool in its own right, with an integrated ORF finder, full support for the IUPAC degenerate DNA alphabet and all translation tables defined by the NCBI taxonomy group, including the use of alternative start codons; and (ii) integrating sequence feature annotation, in particular native support for working with files containing intron/exon structure annotation. The software is available for both download and online use at http://www.cbs.dtu.dk/services/VirtualRibosome/.

Year:  2006        PMID: 16845033      PMCID: PMC1538826          DOI: 10.1093/nar/gkl252

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

A large number of software packages for translating DNA sequences already exist, as services on the World Wide Web (e.g. the Expasy Translate Tool), as command-line tools [e.g. the GCG package (1)] and as user-friendly graphical applications [e.g. DNA Strider (a personal favorite) (2) and ApE]. However, many of these fine tools do not support translating sequences containing degenerate nucleotides, have no or limited support for alternative translation tables (including alternative initiation codons) and in general have problems handling special-case situations. The software described here aims at addressing these issues and providing a comprehensive solution for translation. The software is built on the experience gained from writing and maintaining the RevTrans server (3).

Another part of the rationale for creating Virtual Ribosome is to provide an easy and consistent way to map the underlying intron/exon structure of a gene onto its protein product. This makes it easy to build datasets that can be used for analyzing how the underlying exon structure is reflected in the protein [e.g. how exon modules map onto the 3D structure of the protein, see the FeatureMap3D server (4) elsewhere in this issue].

SOFTWARE FEATURES

Support for the degenerate nucleotide alphabet

The software has full support for the IUPAC alphabet (Table 1) for degenerate nucleotides. For example, the codon TCN correctly translates to S (serine) and not X (unknown) as often seen in other translators.

Table 1

IUPAC alphabet of degenerate nucleotides

Letter   Description   Bases represented
A        Adenine       A
T        Thymine       T
G        Guanine       G
C        Cytosine      C
Y        pYrimidine    C T
R        puRine        A G
S        Strong        G C
W        Weak          A T
K        Keto          T G
M        aMino         A C
B        Not A         C G T
D        Not C         A G T
H        Not G         A C T
V        Not T/U       A C G
N        aNy           A C G T
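
The behaviour described above for TCN can be sketched in a few lines of Python. This is only an illustration, not the Virtual Ribosome source code: the function name `translate_codon` is hypothetical, and the rule used (expand the degenerate codon into every concrete codon it represents, and report a residue only when all of them agree) is an assumption about how such a translator can work.

```python
from itertools import product

# IUPAC degenerate nucleotide alphabet (Table 1).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "Y": "CT", "R": "AG", "S": "GC", "W": "AT", "K": "TG", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Standard genetic code (NCBI table 1); codons are ordered T, C, A, G
# at each of the three positions, as in the NCBI table definitions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD_CODE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_codon(codon, code=STANDARD_CODE):
    """Translate one codon (hypothetical helper, not the tool's API).

    A degenerate codon is expanded into all concrete codons it can
    represent; if they all encode the same residue, that residue is
    returned, otherwise 'X' (unknown).
    """
    codon = codon.upper().replace("U", "T")
    expansions = product(*(IUPAC[base] for base in codon))
    residues = {code["".join(c)] for c in expansions}
    return residues.pop() if len(residues) == 1 else "X"


print(translate_codon("TCN"))  # all four TCx codons encode serine
print(translate_codon("NNN"))  # ambiguous, reported as unknown
```

With this rule, a codon such as MGR (AGA, AGG, CGA, CGG) still resolves to arginine, while a fully ambiguous codon falls back to X.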

Support for a wide range of translation tables

Full support for all translation tables defined by the NCBI taxonomy group (5) (see the list below). The command-line version of the software also has support for reading an arbitrary translation table defined by the user.

[1] Standard Genetic Code
[2] Vertebrate (Mitochondrial)
[3] Yeast (Mitochondrial)
[4] Mold, Protozoan, Coelenterate (Mitochondrial) and Mycoplasma/Spiroplasma
[5] Invertebrate Mitochondrial
[6] Ciliate, Dasycladacean and Hexamita (Nuclear)
[9] Echinoderm and Flatworm (Mitochondrial)
[10] Euplotid (Nuclear)
[11] Bacterial and Plant Plastid
[12] Alternative Yeast (Nuclear)
[13] Ascidian Mitochondrial
[14] Alternative Flatworm (Mitochondrial)
[15] Blepharisma (Nuclear)
[16] Chlorophycean (Mitochondrial)
[21] Trematode (Mitochondrial)
[22] Scenedesmus obliquus (Mitochondrial)
[23] Thraustochytrium (Mitochondrial)
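
To illustrate why the choice of table matters, the following minimal sketch builds two of the tables listed above from the 64-character amino acid strings used in the NCBI definitions and translates the same sequence with each. The helper names `build_code` and `translate` are hypothetical, and degenerate nucleotides and start codons are deliberately left out here.

```python
from itertools import product

BASES = "TCAG"  # NCBI ordering of bases at each codon position

# Amino acid strings as given in the NCBI translation table definitions.
NCBI_TABLES = {
    1: "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG",
    2: "FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIMMTTTTNNKKSS**VVVVAAAADDEEGGGG",
}


def build_code(amino_acids):
    """Map each of the 64 codons to the residue at the same position."""
    if len(amino_acids) != 64:
        raise ValueError("a translation table needs exactly 64 residues")
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, amino_acids))


def translate(dna, table=1):
    """Translate frame 1 of a non-degenerate DNA string, codon by codon."""
    code = build_code(NCBI_TABLES[table])
    dna = dna.upper()
    return "".join(code[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


print(translate("ATATGAAGG", table=1))  # standard code
print(translate("ATATGAAGG", table=2))  # vertebrate mitochondrial code
```

In the vertebrate mitochondrial code ATA encodes methionine, TGA encodes tryptophan and AGA/AGG are stop codons, so the two translations of this short sequence share no residues. A user-defined table could be supplied to `build_code` in the same 64-character form.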

Start and Stop codons

Virtual Ribosome also uses the table of alternative translation initiation codons defined in the translation tables mentioned above. Figure 1 shows the definition of translation table 11 (the Bacterial and Plant Plastid code).
Figure 1
In this case, the codons TTG, CTG, ATT, ATC, ATA, ATG and GTG are all allowed as start codons, and all of them will translate to methionine when used as a start codon [for a recent report on the use of GTG as a methionine-coding start codon, see (6)]. The use of alternative methionine codons at the first position can be disabled using the ‘all internal’ option (useful for working with sequence fragments). In addition, the software can either terminate the translation at the first Stop codon encountered, or read through the entire sequence, annotating Stop codons with ‘*’.
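
The start and stop handling described above can be sketched as follows. This is a simplified illustration under stated assumptions: the function and parameter names (`translate`, `all_internal`, `read_through`) are hypothetical and do not correspond to the actual command-line options, only frame 1 of a non-degenerate sequence is handled, and the Stop codon is assumed to be left out of the output when translation terminates at the first Stop.

```python
from itertools import product

BASES = "TCAG"
# Translation table 11 uses the same amino acid assignments as table 1.
TABLE11_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
TABLE11 = dict(zip(("".join(c) for c in product(BASES, repeat=3)), TABLE11_AAS))

# Start codons of table 11, as listed in the text.
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, code=TABLE11, starts=TABLE11_STARTS,
              all_internal=False, read_through=True):
    """Translate frame 1, honouring alternative start codons.

    all_internal: treat the first codon like any internal codon.
    read_through: keep translating past Stop codons, marking them '*';
                  otherwise stop at the first Stop (Stop not emitted).
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and not all_internal and codon in starts:
            residue = "M"  # any allowed start codon initiates with methionine
        else:
            residue = code[codon]
        if residue == "*" and not read_through:
            break
        protein.append(residue)
    return "".join(protein)


seq = "GTGAAATAGGTG"
print(translate(seq))                      # GTG read as a start codon
print(translate(seq, all_internal=True))   # GTG read as valine
print(translate(seq, read_through=False))  # terminate at the first Stop
```

Note that the second GTG in the example is internal and is therefore always translated as valine.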

Reading frames and ORF finder

The reading frame used for translation can be selected by the user, as a single reading frame (1, 2, 3, −1, −2, −3) or as a set of reading frames (all, positive, negative). Following translation, the protein sequences are available for download, and a visualization in which all possible Start and Stop codons are highlighted is presented to the user. In this visualization, the ‘strict’ Start codons (always coding for methionine) are annotated with ‘>>>’, the ‘alternative’ Start codons (only coding for methionine at the start position) are annotated with ‘)))’ and Stop codons are annotated with ‘***’. If multiple reading frames are selected, the results are stacked, and the Start codon ‘arrows’ are reversed on the minus strand to indicate the direction of translation.

Virtual Ribosome also has the option of working as an ORF (open reading frame) finder. When this option is used, all specified reading frames are scanned for ORFs and the longest ORF is reported. The rules for defining an ORF can be adjusted to (i) only open an ORF at ‘strict’ Start codons, (ii) open an ORF at any Start codon or (iii) open an ORF at any codon except Stop (useful for working with small DNA fragments). The position of the ORF within the DNA sequence is also visualized.
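
A minimal sketch of a six-frame longest-ORF scan is given below. It is not the Virtual Ribosome implementation: the function names are hypothetical, frame −1 is assumed to start at the first base of the reverse complement, only ORFs closed by a Stop codon are reported, and the Stop codons are those of the standard code. Passing only ATG as `starts` corresponds to rule (i) for the standard code; passing a larger set of Start codons corresponds to rule (ii). Rule (iii) is not covered.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a non-degenerate DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def frame_sequence(dna, frame):
    """Return the sequence as read in frame 1, 2, 3, -1, -2 or -3."""
    strand = dna if frame > 0 else reverse_complement(dna)
    return strand[abs(frame) - 1:]


def longest_orf(dna, frames=(1, 2, 3, -1, -2, -3),
                starts=("ATG",), stops=("TAA", "TAG", "TGA")):
    """Return (frame, orf) for the longest start-to-stop ORF, or (None, '')."""
    dna = dna.upper()
    best_frame, best_orf = None, ""
    for frame in frames:
        seq = frame_sequence(dna, frame)
        open_at = None  # index of the codon that opened the current ORF
        for i in range(0, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if open_at is None and codon in starts:
                open_at = i
            elif open_at is not None and codon in stops:
                orf = seq[open_at:i + 3]  # include the Stop codon
                if len(orf) > len(best_orf):
                    best_frame, best_orf = frame, orf
                open_at = None
    return best_frame, best_orf


print(longest_orf("CCATGAAATGACC"))  # ORF found in frame 3
print(longest_orf("TCATTTCAT"))      # same ORF, on the minus strand
```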

Intron/exon annotation

Besides working on standard FASTA format files (sequence only), Virtual Ribosome natively understands the TAB file format, described in (7), which holds both sequence and sequence feature annotation. Briefly, each line in a TAB format file describes one sequence (DNA or peptide) in four fields, separated by tabs: Name, Sequence, Annotation and Comment. The Annotation field is a string of exactly the same length as the Sequence field. Each position in the annotation string describes the nature of the corresponding position in the sequence string using a single-letter code.

TAB files containing intron/exon structure can easily be generated by the FeatureExtract server (7), or by submitting a GenBank file directly to Virtual Ribosome. If a GenBank file is submitted, CDS sections (including information about intron/exon structure) are extracted to the TAB format before translation, by running the FeatureExtract software in the background with default parameters.

If a GenBank or TAB file is supplied as input, only the exonic parts of the DNA sequences are used for the translation. Furthermore, the underlying exon structure will be reflected in the translated sequence (also in the TAB format). By default, each amino acid will be annotated with a number indicating the exon that encoded this particular amino acid. Alternatively, the positions and the phase of the introns can be indicated:

Phase 0: an intron exists right before the codon encoding the amino acid.
Phase 1: an intron exists in between positions 1 and 2 of the codon.
Phase 2: an intron exists in between positions 2 and 3 of the codon.
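
The mapping from exon structure to amino acids can be sketched as follows. Several details are assumptions made only for this illustration and should not be read as the actual file specification or implementation: the annotation codes 'E' (exon) and 'I' (intron), the function names, and the choice to label an amino acid whose codon is split by an intron with the exon of its first base.

```python
def parse_tab_line(line):
    """Split one TAB-format line into Name, Sequence, Annotation, Comment."""
    name, seq, ann, comment = line.rstrip("\n").split("\t")
    if len(seq) != len(ann):
        raise ValueError("Annotation must be as long as Sequence")
    return name, seq, ann, comment


def exon_map(seq, ann):
    """Return (coding DNA, exon number per amino acid, intron phases).

    Assumes 'E' marks exonic positions; every other code is skipped.
    intron_phases maps an amino acid index (0-based) to the phase
    (0, 1 or 2) of the intron that interrupts or precedes its codon.
    """
    exonic, exon_numbers, intron_phases = [], [], {}
    exon, previous = 0, None
    for base, code in zip(seq, ann):
        if code == "E":
            if previous != "E":
                exon += 1
                if exonic:  # an intron ended just before this base
                    intron_phases[len(exonic) // 3] = len(exonic) % 3
            exonic.append(base)
            exon_numbers.append(exon)
        previous = code
    coding = "".join(exonic)
    # Label each complete codon with the exon of its first base (assumption).
    aa_exons = [exon_numbers[3 * i] for i in range(len(coding) // 3)]
    return coding, aa_exons, intron_phases


line = "gene1\tATGGCGTAAGCAAATAG\tEEEEEIIIIIEEEEEEE\texample record"
name, seq, ann, comment = parse_tab_line(line)
coding, aa_exons, phases = exon_map(seq, ann)
print(coding)    # exonic parts only
print(aa_exons)  # exon number for each codon
print(phases)    # amino acid index -> intron phase
```

In this made-up record the intron falls after the second base of the second codon (GC|C), so it is a phase 2 intron attached to the amino acid at index 1.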

Easy to use interface

The interface to the Virtual Ribosome server has been designed to be intuitive and easy to use. Figure 2 shows the basic part of the interface. Notice that it is possible to submit a sequence for translation using the default parameters, without having to scroll through a page of obscure options. The options are grouped into logical sections further down the web page. For each option a short explanation is provided together with a link to a detailed description.
Figure 2

Screenshot of the basic part of the Virtual Ribosome interface.

REFERENCES

1.  GCG: translation of DNA sequence.

Authors:  R Dölz
Journal:  Methods Mol Biol       Date:  1994

2.  DNA Strider. A Macintosh program for handling protein and nucleic acid sequences.

Authors:  S E Douglas
Journal:  Methods Mol Biol       Date:  1994

3.  RevTrans: Multiple alignment of coding DNA from aligned amino acid sequences.

Authors:  Rasmus Wernersson; Anders Gorm Pedersen
Journal:  Nucleic Acids Res       Date:  2003-07-01

4.  FeatureMap3D--a tool to map protein features and sequence conservation onto homologous structures in the PDB.

Authors:  Rasmus Wernersson; Kristoffer Rapacki; Hans-Henrik Staerfeldt; Peter Wad Sackett; Anne Mølgaard
Journal:  Nucleic Acids Res       Date:  2006-07-01

5.  Database resources of the National Center for Biotechnology Information.

Authors:  D L Wheeler; C Chappey; A E Lash; D D Leipe; T L Madden; G D Schuler; T A Tatusova; B A Rapp
Journal:  Nucleic Acids Res       Date:  2000-01-01

6.  Non-AUG translation initiation of mRNA encoding acidic ribosomal P2A protein in Candida albicans.

Authors:  Dariusz Abramczyk; Marek Tchórzewski; Nikodem Grankowski
Journal:  Yeast       Date:  2003-09

7.  FeatureExtract--extraction of sequence annotation made easy.

Authors:  Rasmus Wernersson
Journal:  Nucleic Acids Res       Date:  2005-07-01
