Literature DB >> 19642276

Designing secondary structure profiles for fast ncRNA identification.

Yanni Sun1, Jeremy Buhler.   

Abstract

Detecting non-coding RNAs (ncRNAs) in genomic DNA is an important part of annotation. However, the most widely used tool for modeling ncRNA families, the covariance model (CM), incurs a high computational cost when used for search. This cost can be reduced by using a filter to exclude sequence that is unlikely to contain the ncRNA of interest, applying the CM only where it is likely to match strongly. Despite recent advances, designing an efficient filter that can detect nearly all ncRNA instances while excluding most irrelevant sequences remains challenging. This work proposes a systematic procedure to convert a CM for an ncRNA family to a secondary structure profile (SSP), which augments a conservation profile with secondary structure information but can still be efficiently scanned against long sequences. We use dynamic programming to estimate an SSP's sensitivity and FP rate, yielding an efficient, fully automated filter design algorithm. Our experiments demonstrate that designed SSP filters can achieve significant speedup over unfiltered CM search while maintaining high sensitivity for various ncRNA families, including those with and without strong sequence conservation. For highly structured ncRNA families, including secondary structure conservation yields better performance than using primary sequence conservation alone.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 19642276

Source DB:  PubMed          Journal:  Comput Syst Bioinformatics Conf        ISSN: 1752-7791


  3 in total

1.  Genome-wide transcriptome analysis shows extensive alternative RNA splicing in the zoonotic parasite Schistosoma japonicum.

Authors:  Xianyu Piao; Nan Hou; Pengfei Cai; Shuai Liu; Chuang Wu; Qijun Chen
Journal:  BMC Genomics       Date:  2014-08-26       Impact factor: 3.969

2.  Rfam: Wikipedia, clans and the "decimal" release.

Authors:  Paul P Gardner; Jennifer Daub; John Tate; Benjamin L Moore; Isabelle H Osuch; Sam Griffiths-Jones; Robert D Finn; Eric P Nawrocki; Diana L Kolbe; Sean R Eddy; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2010-11-09       Impact factor: 16.971

3.  Fast filtering for RNA homology search.

Authors:  Diana L Kolbe; Sean R Eddy
Journal:  Bioinformatics       Date:  2011-09-28       Impact factor: 6.937

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.