Literature DB >> 34991448

LaRA 2: parallel and vectorized program for sequence-structure alignment of RNA sequences.

Jörg Winkler1,2, Gianvito Urgese3, Elisa Ficarra4, Knut Reinert5,6.   

Abstract

BACKGROUND: The function of non-coding RNA sequences is largely determined by their spatial conformation, namely the secondary structure of the molecule, formed by Watson-Crick interactions between nucleotides. Hence, modern RNA alignment algorithms routinely take structural information into account. In order to discover yet unknown RNA families and infer their possible functions, the structural alignment of RNAs is an essential task. This task demands a lot of computational resources, especially for aligning many long sequences, and it therefore requires efficient algorithms that utilize modern hardware when available. A subset of the secondary structures contains overlapping interactions (called pseudoknots), which add additional complexity to the problem and are often ignored in available software.
RESULTS: We present the SeqAn-based software LaRA 2 that is significantly faster than comparable software for accurate pairwise and multiple alignments of structured RNA sequences. In contrast to other programs our approach can handle arbitrary pseudoknots. As an improved re-implementation of the LaRA tool for structural alignments, LaRA 2 uses multi-threading and vectorization for parallel execution and a new heuristic for computing a lower boundary of the solution. Our algorithmic improvements yield a program that is up to 130 times faster than the previous version.
CONCLUSIONS: With LaRA 2 we provide a tool to analyse large sets of RNA secondary structures in relatively short time, based on structural alignment. The produced alignments can be used to derive structural motifs for the search in genomic databases.
© 2021. The Author(s).

Entities:  

Keywords:  Algorithms; Bioinformatics; Integer linear program; Parallel computing; RNA; RNA secondary structure; Structural alignment

Mesh:

Substances:

Year:  2022        PMID: 34991448      PMCID: PMC8734264          DOI: 10.1186/s12859-021-04532-7

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  46 in total

1.  Analysis of elements involved in pseudoknot-dependent expression and regulation of the repA gene of an IncL/M plasmid.

Authors:  V Athanasopoulos; J Praszkier; A J Pittard
Journal:  J Bacteriol       Date:  1999-03       Impact factor: 3.490

2.  Identifying constraints on the higher-order structure of RNA: continued development and application of comparative sequence analysis methods.

Authors:  R R Gutell; A Power; G Z Hertz; E J Putz; G D Stormo
Journal:  Nucleic Acids Res       Date:  1992-11-11       Impact factor: 16.971

3.  Alignment of RNA base pairing probability matrices.

Authors:  Ivo L Hofacker; Stephan H F Bernhart; Peter F Stadler
Journal:  Bioinformatics       Date:  2004-04-08       Impact factor: 6.937

4.  Multiple structural alignment and clustering of RNA sequences.

Authors:  Elfar Torarinsson; Jakob H Havgaard; Jan Gorodkin
Journal:  Bioinformatics       Date:  2007-02-25       Impact factor: 6.937

5.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

6.  A benchmark of multiple sequence alignment programs upon structural RNAs.

Authors:  Paul P Gardner; Andreas Wilm; Stefan Washietl
Journal:  Nucleic Acids Res       Date:  2005-04-28       Impact factor: 16.971

7.  The PETfold and PETcofold web servers for intra- and intermolecular structures of multiple RNA sequences.

Authors:  Stefan E Seemann; Peter Menzel; Rolf Backofen; Jan Gorodkin
Journal:  Nucleic Acids Res       Date:  2011-05-23       Impact factor: 16.971

8.  Parasail: SIMD C library for global, semi-global, and local pairwise sequence alignments.

Authors:  Jeff Daily
Journal:  BMC Bioinformatics       Date:  2016-02-10       Impact factor: 3.169

9.  SimulFold: simultaneously inferring RNA structures including pseudoknots, alignments, and trees using a Bayesian MCMC framework.

Authors:  Irmtraud M Meyer; István Miklós
Journal:  PLoS Comput Biol       Date:  2007-08       Impact factor: 4.475

10.  RNAmountAlign: Efficient software for local, global, semiglobal pairwise and multiple RNA sequence/structure alignment.

Authors:  Amir H Bayegan; Peter Clote
Journal:  PLoS One       Date:  2020-01-24       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.