Literature DB >> 25748534

A semi-supervised learning approach for RNA secondary structure prediction.

Haruka Yonemoto1, Kiyoshi Asai2, Michiaki Hamada3.   

Abstract

RNA secondary structure prediction is a key technology in RNA bioinformatics. Most algorithms for RNA secondary structure prediction use probabilistic models, in which the model parameters are trained with reliable RNA secondary structures. Because of the difficulty of determining RNA secondary structures by experimental procedures, such as NMR or X-ray crystal structural analyses, there are still many RNA sequences that could be useful for training whose secondary structures have not been experimentally determined. In this paper, we introduce a novel semi-supervised learning approach for training parameters in a probabilistic model of RNA secondary structures in which we employ not only RNA sequences with annotated secondary structures but also ones with unknown secondary structures. Our model is based on a hybrid of generative (stochastic context-free grammars) and discriminative models (conditional random fields) that has been successfully applied to natural language processing. Computational experiments indicate that the accuracy of secondary structure prediction is improved by incorporating RNA sequences with unknown secondary structures into training. To our knowledge, this is the first study of a semi-supervised learning approach for RNA secondary structure prediction. This technique will be useful when the number of reliable structures is limited.
Copyright © 2015 Elsevier Ltd. All rights reserved.

Keywords:  Parameter learning; RNA secondary structure; Semi-supervised learning

Mesh:

Substances:

Year:  2015        PMID: 25748534     DOI: 10.1016/j.compbiolchem.2015.02.002

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  3 in total

1.  Research on folding diversity in statistical learning methods for RNA secondary structure prediction.

Authors:  Yu Zhu; ZhaoYang Xie; YiZhou Li; Min Zhu; Yi-Ping Phoebe Chen
Journal:  Int J Biol Sci       Date:  2018-05-22       Impact factor: 6.580

2.  ATTfold: RNA Secondary Structure Prediction With Pseudoknots Based on Attention Mechanism.

Authors:  Yili Wang; Yuanning Liu; Shuo Wang; Zhen Liu; Yubing Gao; Hao Zhang; Liyan Dong
Journal:  Front Genet       Date:  2020-12-15       Impact factor: 4.599

3.  Empirical evaluation of a virtual laboratory approach to teach lactate dehydrogenase enzyme kinetics.

Authors:  Christine Booth; Rajkumar Cheluvappa; Zack Bellinson; Danni Maguire; Craig Zimitat; Joyce Abraham; Rajaraman Eri
Journal:  Ann Med Surg (Lond)       Date:  2016-04-28
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.