| Literature DB >> 24040122 |
Marlena Siwiak1, Piotr Zielenkiewicz.
Abstract
Although translation is the key step during gene expression, it remains poorly characterized at the level of individual genes. For this reason, we developed Transimulation - a web service measuring translational activity of genes in three model organisms: Escherichia coli, Saccharomyces cerevisiae and Homo sapiens. The calculations are based on our previous computational model of translation and experimental data sets. Transimulation quantifies mean translation initiation and elongation time (expressed in SI units), and the number of proteins produced per transcript. It also approximates the number of ribosomes that typically occupy a transcript during translation, and simulates their propagation. The simulation of ribosomes' movement is interactive and allows modifying the coding sequence on the fly. It also enables uploading any coding sequence and simulating its translation in one of three model organisms. In such a case, ribosomes propagate according to mean codon elongation times of the host organism, which may prove useful for heterologous expression. Transimulation was used to examine evolutionary conservation of translational parameters of orthologous genes. Transimulation may be accessed at http://nexus.ibb.waw.pl/Transimulation (requires Java version 1.7 or higher). Its manual and source code, distributed under the GPL-2.0 license, is freely available at the website.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24040122 PMCID: PMC3764131 DOI: 10.1371/journal.pone.0073943
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The summary of translational parameters calculated in the model.
| organism |
|
|
|
|
|
|
|
|
| |
|
| mean | 335 | 3.6 | 47 | 1.3 | 4.0 | 62 | 40 | 119 | 7.5 |
| median | 298 | 1.7 | 28 | 0.8 | 2.3 | 15 | 35 | 119 | 6.8 | |
| sd | 203 | 5.6 | 60 | 1.3 | 5.0 | 206 | 24 | 9 | 4.0 | |
| min | 15 | 0.1 | 0 | 0 | 0 | 2 | 2 | 87 | 2.0 | |
| max | 1487 | 54.0 | 940 | 6.6 | 41.2 | 5091 | 178 | 177 | 42.3 | |
|
| mean | 513 | 7.8 | 116 | 1.1 | 5.6 | 54 | 116 | 224 | 33.2 |
| median | 431 | 2.7 | 58 | 0.8 | 3.1 | 28 | 96 | 229 | 27.4 | |
| sd | 365 | 29.0 | 188 | 0.9 | 7.3 | 186 | 84 | 31 | 26.8 | |
| min | 37 | 0.1 | 1 | 0.0 | 0.0 | 2 | 4 | 98 | 4.3 | |
| max | 4911 | 591.3 | 2543 | 6.6 | 142.1 | 6714 | 1074 | 360 | 677 | |
|
| mean | 676 | 85.9 | 9171 | 2.3 | 11.5 | 7 | 59 | 87 | 6.5 |
| median | 506 | 42.6 | 5616 | 2.1 | 10.1 | 4 | 44 | 87 | 9.2 | |
| sd | 620 | 171.9 | 9739 | 1.4 | 7.22 | 23 | 54 | 4 | 6.3 | |
| min | 38 | 0.9 | 14 | 0.0 | 0.0 | 1 | 3 | 75 | 3.0 | |
| max | 14508 | 4e3 | 83e3 | 7.5 | 131.6 | 1372 | 1232 | 108 | 34.6 |
Column description: () transcript length; () number of gene transcripts; () number of proteins produced from one transcript; () ribosome density in number of ribosomes per 100 codons; () number of ribosomes on a transcript; () initiation time in s; () elongation time in s; () mean elongation time of one transcript codon in ms; and () mean transcript lifetime in min (bacteria, yeast), or in h (humans). For all parameters, except and , the rows 1–15 were calculated for 1738, 4470, and 7494 genes for bacteria, yeast, and humans, respectively. For parameter and , the rows were calculated for 1574, 3425, and 6205 genes, respectively.
Figure 1Translation speed plot generated by Transimulation.
An example plot of translation speed (in aa/sec) in relation to the coding sequence of one of the E.coli genes. To facilitate analysis, the plot was smoothed by calculating translation speed over a 10-codon sliding window. Similar plots for window sizes of 1, 2, 5, 10, 20, 30, and 50 codons are generated for all analyzed genes and sequences uploaded by the user.
Figure 2Calculated protein abundance vs experimental studies.
Correlations between protein abundances calculated in our model (as times ) and those obtained in experimental studies [8], [11]–[13]; n – sample size, – Spearman correlation coefficient and its 95% confidence interval.
Summary of data sets and variables used as an input of the model.
| Input data |
|
|
|
| Cell line | K12 MG1655 | BY4741 | HeLa |
| Temperature | 37°C | 30°C | 37°C |
| Medium | MOPS | YEPD | – |
|
| |||
| Transcriptome size | 1,500 | 36,000 | 700,000* |
| Ribosomes/cell | 20,000 | 200,000 | 9,500,000 |
| Average cell volume | 1e-18 | 42e-18 | 2425e-18 |
|
| |||
| tRNA decoding |
|
|
|
| tRNA abundances |
|
|
|
| tRNAs/cell | 71,000 | 2,800,000 | 60,000,000* |
|
| |||
| Coding sequences | NCBI | SGD | UCSC |
| mRNA abundances |
|
|
|
| mRNA lifetime |
|
|
|
| Ribosome footprints |
|
|
|
Details on data parsing and calculations may be found in the main text. Cell lines and growth conditions (temperature and medium) denote those used in the ribosome profiling experiments. The numbers marked by an asterix were taken from the RNA Tools and Calculators section at the Invitrogen Website (www.invitrogen.com, accessed April 2013). The coding sequences were downloaded from the following databases: NCBI (www.ncbi.nlm.nih.gov.ftp, accessed May 2012), SGD (www.yeastgenome.org, accessed June 2009), and UCSC (http://genome.ucsc.edu, accessed July 2012).