Literature DB >> 19808876

CentroidAlign: fast and accurate aligner for structured RNAs by maximizing expected sum-of-pairs score.

Michiaki Hamada1, Kengo Sato, Hisanori Kiryu, Toutai Mituyama, Kiyoshi Asai.   

Abstract

MOTIVATION: The importance of accurate and fast predictions of multiple alignments for RNA sequences has increased due to recent findings about functional non-coding RNAs. Recent studies suggest that maximizing the expected accuracy of predictions will be useful for many problems in bioinformatics.
RESULTS: We designed a novel estimator for multiple alignments of structured RNAs, based on maximizing the expected accuracy of predictions. First, we define the maximum expected accuracy (MEA) estimator for pairwise alignment of RNA sequences. This maximizes the expected sum-of-pairs score (SPS) of a predicted alignment under a probability distribution of alignments given by marginalizing the Sankoff model. Then, by approximating the MEA estimator, we obtain an estimator whose time complexity is O(L(3)+c(2)dL(2)) where L is the length of input sequences and both c and d are constants independent of L. The proposed estimator can handle uncertainty of secondary structures and alignments that are obstacles in Bioinformatics because it considers all the secondary structures and all the pairwise alignments as input sequences. Moreover, we integrate the probabilistic consistency transformation (PCT) on alignments into the proposed estimator. Computational experiments using six benchmark datasets indicate that the proposed method achieved a favorable SPS and was the fastest of many state-of-the-art tools for multiple alignments of structured RNAs. AVAILABILITY: The software called CentroidAlign, which is an implementation of the algorithm in this article, is freely available on our website: http://www.ncrna.org/software/centroidalign/. CONTACT: hamada-michiaki@aist.go.jp SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19808876     DOI: 10.1093/bioinformatics/btp580

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  14 in total

Review 1.  A classification of bioinformatics algorithms from the viewpoint of maximizing expected accuracy (MEA).

Authors:  Michiaki Hamada; Kiyoshi Asai
Journal:  J Comput Biol       Date:  2012-02-07       Impact factor: 1.479

2.  Direct updating of an RNA base-pairing probability matrix with marginal probability constraints.

Authors:  Michiaki Hamada
Journal:  J Comput Biol       Date:  2012-12       Impact factor: 1.479

3.  Studying RNA Homology and Conservation with Infernal: From Single Sequences to RNA Families.

Authors:  Lars Barquist; Sarah W Burge; Paul P Gardner
Journal:  Curr Protoc Bioinformatics       Date:  2016-06-20

4.  Simultaneous prediction of RNA secondary structure and helix coaxial stacking.

Authors:  Pooya Shareghi; Yingfeng Wang; Russell Malmberg; Liming Cai
Journal:  BMC Genomics       Date:  2012-06-11       Impact factor: 3.969

5.  PicXAA-R: efficient structural alignment of multiple RNA sequences using a greedy approach.

Authors:  Sayed Mohammad Ebrahim Sahraeian; Byung-Jun Yoon
Journal:  BMC Bioinformatics       Date:  2011-02-15       Impact factor: 3.169

6.  Prediction of RNA secondary structure by maximizing pseudo-expected accuracy.

Authors:  Michiaki Hamada; Kengo Sato; Kiyoshi Asai
Journal:  BMC Bioinformatics       Date:  2010-11-30       Impact factor: 3.169

7.  Generalized centroid estimators in bioinformatics.

Authors:  Michiaki Hamada; Hisanori Kiryu; Wataru Iwasaki; Kiyoshi Asai
Journal:  PLoS One       Date:  2011-02-18       Impact factor: 3.240

8.  Improving the accuracy of predicting secondary structure for aligned RNA sequences.

Authors:  Michiaki Hamada; Kengo Sato; Kiyoshi Asai
Journal:  Nucleic Acids Res       Date:  2010-09-15       Impact factor: 16.971

9.  PicXAA-Web: a web-based platform for non-progressive maximum expected accuracy alignment of multiple biological sequences.

Authors:  Sayed Mohammad Ebrahim Sahraeian; Byung-Jun Yoon
Journal:  Nucleic Acids Res       Date:  2011-04-22       Impact factor: 16.971

10.  CentroidAlign-Web: A Fast and Accurate Multiple Aligner for Long Non-Coding RNAs.

Authors:  Haruka Yonemoto; Kiyoshi Asai; Michiaki Hamada
Journal:  Int J Mol Sci       Date:  2013-03-18       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.