Literature DB >> 15262817

Exploiting conserved structure for faster annotation of non-coding RNAs without loss of accuracy.

Zasha Weinberg1, Walter L Ruzzo.   

Abstract

MOTIVATION: Non-coding RNAs (ncRNAs)-functional RNA molecules not coding for proteins-are grouped into hundreds of families of homologs. To find new members of an ncRNA gene family in a large genome database, covariance models (CMs) are a useful statistical tool, as they use both sequence and RNA secondary structure information. Unfortunately, CM searches are slow. Previously, we introduced 'rigorous filters', which provably sacrifice none of CMs' accuracy, although often scanning much faster. A rigorous filter, using a profile hidden Markov model (HMM), is built based on the CM, and filters the genome database, eliminating sequences that provably could not be annotated as homologs. The CM is run only on the remainder. Some biologically important ncRNA families could not be scanned efficiently with this technique, largely due to the significance of conserved secondary structure relative to primary sequence in identifying these families. Current heuristic filters are also expected to perform poorly on such families.
RESULTS: By augmenting profile HMMs with limited secondary structure information, we obtain rigorous filters that accelerate CM searches for virtually all known ncRNA families from the Rfam Database and tRNA models in tRNAscan-SE. These filters scan an 8 gigabase database in weeks instead of years, and uncover homologs missed by heuristic techniques to speed CM searches. AVAILABILITY: Software in development; contact the authors.

Mesh:

Substances:

Year:  2004        PMID: 15262817     DOI: 10.1093/bioinformatics/bth925

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  26 in total

1.  Thousands of corresponding human and mouse genomic regions unalignable in primary sequence contain common RNA structure.

Authors:  Elfar Torarinsson; Milena Sawera; Jakob H Havgaard; Merete Fredholm; Jan Gorodkin
Journal:  Genome Res       Date:  2006-06-02       Impact factor: 9.043

2.  Efficient alignment of RNAs with pseudoknots using sequence alignment constraints.

Authors:  Byung-Jun Yoon
Journal:  EURASIP J Bioinform Syst Biol       Date:  2009-04-14

3.  Fast and accurate search for non-coding RNA pseudoknot structures in genomes.

Authors:  Zhibin Huang; Yong Wu; Joseph Robertson; Liang Feng; Russell L Malmberg; Liming Cai
Journal:  Bioinformatics       Date:  2008-08-07       Impact factor: 6.937

4.  RNATOPS-W: a web server for RNA structure searches of genomes.

Authors:  Yingfeng Wang; Zhibin Huang; Yong Wu; Russell L Malmberg; Liming Cai
Journal:  Bioinformatics       Date:  2009-03-05       Impact factor: 6.937

5.  6S RNA is a widespread regulator of eubacterial RNA polymerase that resembles an open promoter.

Authors:  Jeffrey E Barrick; Narasimhan Sudarsan; Zasha Weinberg; Walter L Ruzzo; Ronald R Breaker
Journal:  RNA       Date:  2005-04-05       Impact factor: 4.942

Review 6.  Computational methods in noncoding RNA research.

Authors:  Ariane Machado-Lima; Hernando A del Portillo; Alan Mitchell Durham
Journal:  J Math Biol       Date:  2007-09-04       Impact factor: 2.259

7.  Efficient known ncRNA search including pseudoknots.

Authors:  Cheng Yuan; Yanni Sun
Journal:  BMC Bioinformatics       Date:  2013-01-21       Impact factor: 3.169

Review 8.  Computational analysis of riboswitch-based regulation.

Authors:  Eric I Sun; Dmitry A Rodionov
Journal:  Biochim Biophys Acta       Date:  2014-02-28

9.  RScan: fast searching structural similarities for structured RNAs in large databases.

Authors:  Chenghai Xue; Guo-Ping Liu
Journal:  BMC Genomics       Date:  2007-07-31       Impact factor: 3.969

10.  Riboswitch detection using profile hidden Markov models.

Authors:  Payal Singh; Pradipta Bandyopadhyay; Sudha Bhattacharya; A Krishnamachari; Supratim Sengupta
Journal:  BMC Bioinformatics       Date:  2009-10-08       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.