Literature DB >> 31931434

Discovering protein-binding RNA motifs with a generative model of RNA sequences.

Byungkyu Park1, Kyungsook Han2.   

Abstract

Recent advances in high-throughput experimental technologies have generated a huge amount of data on interactions between proteins and nucleic acids. Motivated by the big experimental data, several computational methods have been developed either to predict binding sites in a sequence or to determine if an interaction exists between protein and nucleic acid sequences. However, most of the methods cannot be used to discover new nucleic acid sequences that bind to a target protein because they are classifiers rather than generators. In this paper we propose a generative model for constructing protein-binding RNA sequences and motifs using a long short-term memory (LSTM) neural network. Testing the model for several target proteins showed that RNA sequences generated by the model have high binding affinity and specificity for their target proteins and that the protein-binding motifs derived from the generated RNA sequences are comparable to the motifs from experimentally validated protein-binding RNA sequences. The results are promising and we believe this approach will help design more efficient in vitro or in vivo experiments by suggesting potential RNA aptamers for a target protein.
Copyright © 2019 The Authors. Published by Elsevier Ltd.. All rights reserved.

Keywords:  Binding motif; Generator; Long short-term memory network; Protein-RNA interaction

Mesh:

Substances:

Year:  2020        PMID: 31931434     DOI: 10.1016/j.compbiolchem.2019.107171

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  1 in total

1.  ENNGene: an Easy Neural Network model building tool for Genomics.

Authors:  Eliška Chalupová; Ondřej Vaculík; Jakub Poláček; Filip Jozefov; Tomáš Majtner; Panagiotis Alexiou
Journal:  BMC Genomics       Date:  2022-03-31       Impact factor: 3.969

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.