Literature DB >> 17182698

Robust prediction of consensus secondary structures using averaged base pairing probability matrices.

Hisanori Kiryu1, Taishin Kin, Kiyoshi Asai.   

Abstract

MOTIVATION: Recent transcriptomic studies have revealed the existence of a considerable number of non-protein-coding RNA transcripts in higher eukaryotic cells. To investigate the functional roles of these transcripts, it is of great interest to find conserved secondary structures from multiple alignments on a genomic scale. Since multiple alignments are often created using alignment programs that neglect the special conservation patterns of RNA secondary structures for computational efficiency, alignment failures can cause potential risks of overlooking conserved stem structures.
RESULTS: We investigated the dependence of the accuracy of secondary structure prediction on the quality of alignments. We compared three algorithms that maximize the expected accuracy of secondary structures as well as other frequently used algorithms. We found that one of our algorithms, called McCaskill-MEA, was more robust against alignment failures than others. The McCaskill-MEA method first computes the base pairing probability matrices for all the sequences in the alignment and then obtains the base pairing probability matrix of the alignment by averaging over these matrices. The consensus secondary structure is predicted from this matrix such that the expected accuracy of the prediction is maximized. We show that the McCaskill-MEA method performs better than other methods, particularly when the alignment quality is low and when the alignment consists of many sequences. Our model has a parameter that controls the sensitivity and specificity of predictions. We discussed the uses of that parameter for multi-step screening procedures to search for conserved secondary structures and for assigning confidence values to the predicted base pairs. AVAILABILITY: The C++ source code that implements the McCaskill-MEA algorithm and the test dataset used in this paper are available at http://www.ncrna.org/papers/McCaskillMEA/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 17182698     DOI: 10.1093/bioinformatics/btl636

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  26 in total

Review 1.  A classification of bioinformatics algorithms from the viewpoint of maximizing expected accuracy (MEA).

Authors:  Michiaki Hamada; Kiyoshi Asai
Journal:  J Comput Biol       Date:  2012-02-07       Impact factor: 1.479

Review 2.  Folding and finding RNA secondary structure.

Authors:  David H Mathews; Walter N Moss; Douglas H Turner
Journal:  Cold Spring Harb Perspect Biol       Date:  2010-08-04       Impact factor: 10.005

3.  Improved RNA secondary structure prediction by maximizing expected pair accuracy.

Authors:  Zhi John Lu; Jason W Gloor; David H Mathews
Journal:  RNA       Date:  2009-08-24       Impact factor: 4.942

4.  Finding consensus stable local optimal structures for aligned RNA sequences and its application to discovering riboswitch elements.

Authors:  Yuan Li; Cuncong Zhong; Shaojie Zhang
Journal:  Int J Bioinform Res Appl       Date:  2014

5.  Robust and accurate prediction of noncoding RNAs from aligned sequences.

Authors:  Yutaka Saito; Kengo Sato; Yasubumi Sakakibara
Journal:  BMC Bioinformatics       Date:  2010-10-15       Impact factor: 3.169

6.  Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments.

Authors:  Stefan E Seemann; Jan Gorodkin; Rolf Backofen
Journal:  Nucleic Acids Res       Date:  2008-10-04       Impact factor: 16.971

Review 7.  Informatic resources for identifying and annotating structural RNA motifs.

Authors:  Ajish D George; Scott A Tenenbaum
Journal:  Mol Biotechnol       Date:  2008-11-01       Impact factor: 2.695

8.  Prediction of RNA secondary structure including pseudoknots for long sequences.

Authors:  Kengo Sato; Yuki Kato
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

9.  CONS-COCOMAPS: a novel tool to measure and visualize the conservation of inter-residue contacts in multiple docking solutions.

Authors:  Anna Vangone; Romina Oliva; Luigi Cavallo
Journal:  BMC Bioinformatics       Date:  2012-03-28       Impact factor: 3.169

10.  When naked became armored: an eight-gene phylogeny reveals monophyletic origin of theca in dinoflagellates.

Authors:  Russell J S Orr; Shauna A Murray; Anke Stüken; Lesley Rhodes; Kjetill S Jakobsen
Journal:  PLoS One       Date:  2012-11-19       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.