Literature DB >> 33970899

Predicting mean ribosome load for 5'UTR of any length using deep learning.

Alexander Karollus1, Žiga Avsec1,2, Julien Gagneur1,3,4.   

Abstract

The 5' untranslated region plays a key role in regulating mRNA translation and consequently protein abundance. Therefore, accurate modeling of 5'UTR regulatory sequences shall provide insights into translational control mechanisms and help interpret genetic variants. Recently, a model was trained on a massively parallel reporter assay to predict mean ribosome load (MRL)-a proxy for translation rate-directly from 5'UTR sequence with a high degree of accuracy. However, this model is restricted to sequence lengths investigated in the reporter assay and therefore cannot be applied to the majority of human sequences without a substantial loss of information. Here, we introduced frame pooling, a novel neural network operation that enabled the development of an MRL prediction model for 5'UTRs of any length. Our model shows state-of-the-art performance on fixed length randomized sequences, while offering better generalization performance on longer sequences and on a variety of translation-related genome-wide datasets. Variant interpretation is demonstrated on a 5'UTR variant of the gene HBB associated with beta-thalassemia. Frame pooling could find applications in other bioinformatics predictive tasks. Moreover, our model, released open source, could help pinpoint pathogenic genetic variants.

Entities:  

Year:  2021        PMID: 33970899     DOI: 10.1371/journal.pcbi.1008982

Source DB:  PubMed          Journal:  PLoS Comput Biol        ISSN: 1553-734X            Impact factor:   4.475


  2 in total

1.  Functional characterization of 5' UTR cis-acting sequence elements that modulate translational efficiency in Plasmodium falciparum and humans.

Authors:  Valentina E Garcia; Rebekah Dial; Joseph L DeRisi
Journal:  Malar J       Date:  2022-01-06       Impact factor: 2.979

Review 2.  How Machine Learning and Statistical Models Advance Molecular Diagnostics of Rare Disorders Via Analysis of RNA Sequencing Data.

Authors:  Lea D Schlieben; Holger Prokisch; Vicente A Yépez
Journal:  Front Mol Biosci       Date:  2021-06-01
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.