Literature DB >> 21233528

Semantics and ambiguity of stochastic RNA family models.

Robert Giegerich1, Christian Höner zu Siederdissen.   

Abstract

Stochastic models, such as hidden Markov models or stochastic context-free grammars (SCFGs) can fail to return the correct, maximum likelihood solution in the case of semantic ambiguity. This problem arises when the algorithm implementing the model inspects the same solution in different guises. It is a difficult problem in the sense that proving semantic nonambiguity has been shown to be algorithmically undecidable, while compensating for it (by coalescing scores of equivalent solutions) has been shown to be NP-hard. For stochastic context-free grammars modeling RNA secondary structure, it has been shown that the distortion of results can be quite severe. Much less is known about the case when stochastic context-free grammars model the matching of a query sequence to an implicit consensus structure for an RNA family. We find that three different, meaningful semantics can be associated with the matching of a query against the model--a structural, an alignment, and a trace semantics. Rfam models correctly implement the alignment semantics, and are ambiguous with respect to the other two semantics, which are more abstract. We show how provably correct models can be generated for the trace semantics. For approaches, where such a proof is not possible, we present an automated pipeline to check post factum for ambiguity of the generated models. We propose that both the structure and the trace semantics are worth-while concepts for further study, possibly better suited to capture remotely related family members.

Mesh:

Substances:

Year:  2011        PMID: 21233528     DOI: 10.1109/TCBB.2010.12

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  4 in total

1.  Evaluation of a sophisticated SCFG design for RNA secondary structure prediction.

Authors:  Markus E Nebel; Anika Scheid
Journal:  Theory Biosci       Date:  2011-12-02       Impact factor: 1.919

2.  A genome-wide survey of sRNAs in the symbiotic nitrogen-fixing alpha-proteobacterium Sinorhizobium meliloti.

Authors:  Jan-Philip Schlüter; Jan Reinkensmeier; Svenja Daschkey; Elena Evguenieva-Hackenberg; Stefan Janssen; Sebastian Jänicke; Jörg D Becker; Robert Giegerich; Anke Becker
Journal:  BMC Genomics       Date:  2010-04-17       Impact factor: 3.969

3.  Discriminatory power of RNA family models.

Authors:  Christian Höner zu Siederdissen; Ivo L Hofacker
Journal:  Bioinformatics       Date:  2010-09-15       Impact factor: 6.937

4.  Ambivalent covariance models.

Authors:  Stefan Janssen; Robert Giegerich
Journal:  BMC Bioinformatics       Date:  2015-05-28       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.