| Literature DB >> 18483080 |
Sébastien Moretti1, Andreas Wilm, Desmond G Higgins, Ioannis Xenarios, Cédric Notredame.
Abstract
The R-Coffee web server produces highly accurate multiple alignments of noncoding RNA (ncRNA) sequences, taking into account predicted secondary structures. R-Coffee uses a novel algorithm recently incorporated in the T-Coffee package. R-Coffee works along the same lines as T-Coffee: it uses pairwise or multiple sequence alignment (MSA) methods to compute a primary library of input alignments. The program then computes an MSA highly consistent with both the alignments contained in the library and the secondary structures associated with the sequences. The secondary structures are predicted using RNAplfold. The server provides two modes. The slow/accurate mode is restricted to small datasets (less than 5 sequences less than 150 nucleotides) and combines R-Coffee with Consan, a very accurate pairwise RNA alignment method. For larger datasets a fast method can be used (RM-Coffee mode), that uses R-Coffee to combine the output of the three packages which combines the outputs from programs found to perform best on RNA (MUSCLE, MAFFT and ProbConsRNA). Our BRAliBase benchmarks indicate that the R-Coffee/Consan combination is one of the best ncRNA alignment methods for short sequences, while the RM-Coffee gives comparable results on longer sequences. The R-Coffee web server is available at http://www.tcoffee.org.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18483080 PMCID: PMC2447777 DOI: 10.1093/nar/gkn278
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.R-Coffee workflow. Secondary structures for each single sequence are predicted by means of RNAplfold. The primary alignment library of pairwise residue scores is computed either using Consan (R-Coffee mode: slow/accurate) or by using a combination of MSA programs (R-Coffee mode: fast/approximate). This library is then extended by the R-Coffee extension, which adds base pairs of aligned residues to the library. The resulting extended library and the so-called R-Score, which again takes the given base pairs into account, are then used to compute a progressive alignment in the same way as default T-Coffee.
Figure 2.Example alignment output. This colored output was generated by aligning tRNA-aln4 from the BRAliBase benchmark using R-Coffee slow/accurate mode. Colors indicate the consistency of aligned residues with the primary library alignments and the predicted structures: blue to green means low consistency; yellow to red means good consistency. The dot bracket notation below the alignment indicates the consensus structure (predicted with RNAalifold) and was added afterwards.