Literature DB >> 16267089

Sequence-based heuristics for faster annotation of non-coding RNA families.

Zasha Weinberg1, Walter L Ruzzo.   

Abstract

MOTIVATION: Non-coding RNAs (ncRNAs) are functional RNA molecules that do not code for proteins. Covariance Models (CMs) are a useful statistical tool to find new members of an ncRNA gene family in a large genome database, using both sequence and, importantly, RNA secondary structure information. Unfortunately, CM searches are extremely slow. Previously, we created rigorous filters, which provably sacrifice none of a CM's accuracy, while making searches significantly faster for virtually all ncRNA families. However, these rigorous filters make searches slower than heuristics could be.
RESULTS: In this paper we introduce profile HMM-based heuristic filters. We show that their accuracy is usually superior to heuristics based on BLAST. Moreover, we compared our heuristics with those used in tRNAscan-SE, whose heuristics incorporate a significant amount of work specific to tRNAs, where our heuristics are generic to any ncRNA. Performance was roughly comparable, so we expect that our heuristics provide a high-quality solution that--unlike family-specific solutions--can scale to hundreds of ncRNA families. AVAILABILITY: The source code is available under GNU Public License at the supplementary web site.

Mesh:

Substances:

Year:  2005        PMID: 16267089     DOI: 10.1093/bioinformatics/bti743

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  41 in total

1.  Identification of a large noncoding RNA in extremophilic eubacteria.

Authors:  Elena Puerta-Fernandez; Jeffrey E Barrick; Adam Roth; Ronald R Breaker
Journal:  Proc Natl Acad Sci U S A       Date:  2006-12-12       Impact factor: 11.205

2.  Exploring genomic dark matter: a critical assessment of the performance of homology search methods on noncoding RNA.

Authors:  Eva K Freyhult; Jonathan P Bollback; Paul P Gardner
Journal:  Genome Res       Date:  2006-12-06       Impact factor: 9.043

3.  Comparative genomics beyond sequence-based alignments: RNA structures in the ENCODE regions.

Authors:  Elfar Torarinsson; Zizhen Yao; Eric D Wiklund; Jesper B Bramsen; Claus Hansen; Jørgen Kjems; Niels Tommerup; Walter L Ruzzo; Jan Gorodkin
Journal:  Genome Res       Date:  2007-12-20       Impact factor: 9.043

4.  Guanine riboswitch variants from Mesoplasma florum selectively recognize 2'-deoxyguanosine.

Authors:  Jane N Kim; Adam Roth; Ronald R Breaker
Journal:  Proc Natl Acad Sci U S A       Date:  2007-10-02       Impact factor: 11.205

5.  Efficient alignment of RNAs with pseudoknots using sequence alignment constraints.

Authors:  Byung-Jun Yoon
Journal:  EURASIP J Bioinform Syst Biol       Date:  2009-04-14

6.  Fast and accurate search for non-coding RNA pseudoknot structures in genomes.

Authors:  Zhibin Huang; Yong Wu; Joseph Robertson; Liang Feng; Russell L Malmberg; Liming Cai
Journal:  Bioinformatics       Date:  2008-08-07       Impact factor: 6.937

7.  RNATOPS-W: a web server for RNA structure searches of genomes.

Authors:  Yingfeng Wang; Zhibin Huang; Yong Wu; Russell L Malmberg; Liming Cai
Journal:  Bioinformatics       Date:  2009-03-05       Impact factor: 6.937

8.  Infernal 1.0: inference of RNA alignments.

Authors:  Eric P Nawrocki; Diana L Kolbe; Sean R Eddy
Journal:  Bioinformatics       Date:  2009-03-23       Impact factor: 6.937

Review 9.  Computational analysis of riboswitch-based regulation.

Authors:  Eric I Sun; Dmitry A Rodionov
Journal:  Biochim Biophys Acta       Date:  2014-02-28

10.  Identification of non-coding RNAs with a new composite feature in the Hybrid Random Forest Ensemble algorithm.

Authors:  Supatcha Lertampaiporn; Chinae Thammarongtham; Chakarida Nukoolkit; Boonserm Kaewkamnerdpong; Marasri Ruengjitchatchawalya
Journal:  Nucleic Acids Res       Date:  2014-04-25       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.