Literature DB >> 25433697

Exploiting hidden information interleaved in the redundancy of the genetic code without prior knowledge.

Hadas Zur1, Tamir Tuller1.   

Abstract

MOTIVATION: Dozens of studies in recent years have demonstrated that codon usage encodes various aspects related to all stages of gene expression regulation. When relevant high-quality large-scale gene expression data are available, it is possible to statistically infer and model these signals, enabling analysing and engineering gene expression. However, when these data are not available, it is impossible to infer and validate such models.
RESULTS: In this current study, we suggest Chimera-an unsupervised computationally efficient approach for exploiting hidden high-dimensional information related to the way gene expression is encoded in the open reading frame (ORF), based solely on the genome of the analysed organism. One version of the approach, named Chimera Average Repetitive Substring (ChimeraARS), estimates the adaptability of an ORF to the intracellular gene expression machinery of a genome (host), by computing its tendency to include long substrings that appear in its coding sequences; the second version, named ChimeraMap, engineers the codons of a protein such that it will include long substrings of codons that appear in the host coding sequences, improving its adaptation to a new host's gene expression machinery. We demonstrate the applicability of the new approach for analysing and engineering heterologous genes and for analysing endogenous genes. Specifically, focusing on Escherichia coli, we show that it can exploit information that cannot be detected by conventional approaches (e.g. the CAI-Codon Adaptation Index), which only consider single codon distributions; for example, we report correlations of up to 0.67 for the ChimeraARS measure with heterologous gene expression, when the CAI yielded no correlation.
AVAILABILITY AND IMPLEMENTATION: For non-commercial purposes, the code of the Chimera approach can be downloaded from http://www.cs.tau.ac.il/∼tamirtul/Chimera/download.htm. CONTACT: tamirtul@post.tau.ac.il SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25433697     DOI: 10.1093/bioinformatics/btu797

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  Prokaryotic rRNA-mRNA interactions are involved in all translation steps and shape bacterial transcripts.

Authors:  Shir Bahiri Elitzur; Rachel Cohen-Kupiec; Dana Yacobi; Larissa Fine; Boaz Apt; Alon Diament; Tamir Tuller
Journal:  RNA Biol       Date:  2021-09-29       Impact factor: 4.766

2.  Evidence of translation efficiency adaptation of the coding regions of the bacteriophage lambda.

Authors:  Eli Goz; Oriah Mioduser; Alon Diament; Tamir Tuller
Journal:  DNA Res       Date:  2017-08-01       Impact factor: 4.458

3.  The Landscape of the Emergence of Life.

Authors:  Sohan Jheeta
Journal:  Life (Basel)       Date:  2017-06-16

4.  Unsupervised detection of regulatory gene expression information in different genomic regions enables gene expression ranking.

Authors:  Zohar Zafrir; Tamir Tuller
Journal:  BMC Bioinformatics       Date:  2017-02-01       Impact factor: 3.169

5.  Optimizing the dynamics of protein expression.

Authors:  Jan-Hendrik Trösemeier; Sophia Rudorf; Holger Loessner; Benjamin Hofner; Andreas Reuter; Thomas Schulenborg; Ina Koch; Isabelle Bekeredjian-Ding; Reinhard Lipowsky; Christel Kamp
Journal:  Sci Rep       Date:  2019-05-17       Impact factor: 4.379

6.  Universal evolutionary selection for high dimensional silent patterns of information hidden in the redundancy of viral genetic code.

Authors:  Eli Goz; Zohar Zafrir; Tamir Tuller
Journal:  Bioinformatics       Date:  2018-10-01       Impact factor: 6.937

7.  CSN: unsupervised approach for inferring biological networks based on the genome alone.

Authors:  Maya Galili; Tamir Tuller
Journal:  BMC Bioinformatics       Date:  2020-05-15       Impact factor: 3.169

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.