| Literature DB >> 29554223 |
Florian Eggenhofer1,2, Ivo L Hofacker2,3, Rolf Backofen1,4, Christian Höner Zu Siederdissen2,5,6.
Abstract
Summary: A standard method for the identification of novel RNAs or proteins is homology search via probabilistic models. One approach relies on the definition of families, which can be encoded as covariance models (CMs) or Hidden Markov Models (HMMs). While being powerful tools, their complexity makes it tedious to investigate them in their (default) tabulated form. This specifically applies to the interpretation of comparisons between multiple models as in family clans. The Covariance model visualization tools (CMV) visualize CMs or HMMs to: I) Obtain an easily interpretable representation of HMMs and CMs; II) Put them in context with the structural sequence alignments they have been created from; III) Investigate results of model comparisons and highlight regions of interest. Availability and implementation: Source code (http://www.github.com/eggzilla/cmv), web-service (http://rna.informatik.uni-freiburg.de/CMVS). Supplementary information: Supplementary data are available at Bioinformatics online.Mesh:
Substances:
Year: 2018 PMID: 29554223 PMCID: PMC6061798 DOI: 10.1093/bioinformatics/bty158
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Visualization of HMM (B, C, D) and CM (E, F, G) consensus secondary structure (H, I) and Stockholm Alignment (J) for the Hammerhead RNA_HH9 in comparison with families from the Hammerhead RNA family clan (A). Color labels indicate to which other model an alignment column or node has been linked via CMCompare (Complete figures are shown in Supplementary Material). A: Color Legend for the compared models; B: minimal HMM details show nodes with indices, C: simple HMM details show emission probabilities as well; D: detailed HMM view shows states with emission and transition probabilities; E: minimal CM details show nodes with indices; F: simple CM details add node type information; G: detailed CM view shows nodes with states and emission and transition probabilities; H and I show secondary structure visualization via R2R and forna; J shows a slice of input alignment, each line corresponds to one family member. Numbers on top of the columns represent the column index stored in the corresponding CM node