| Literature DB >> 23640335 |
Florian Eggenhofer1, Ivo L Hofacker, Christian Höner Zu Siederdissen.
Abstract
A standard method for the identification of novel non-coding RNAs is homology search by covariance models. Covariance models are constructed for specific RNA families with common sequence and structure (e.g. transfer RNAs). Currently, there are models for 2208 families available from Rfam. Before being included into a database, a proposed family should be tested for specificity (finding only true homolog sequences), sensitivity (finding remote homologs) and uniqueness. The CMCompare webserver (CMCws) compares Infernal RNA family models to (i) identify models with poor specificity and (ii) explore the relationship between models. The CMCws provides options to compare new models against all existing models in the current Rfam database to avoid the construction of duplicate models for the same non-coding RNA family. In addition, the user can explore the relationship between two or more models, including whole sets of user-created family models. Visualization of family relationships provides help in evaluating candidates for clusters of biologically related families, called clans. The CMCws is freely available, without any login requirements, at http://rna.tbi.univie.ac.at/cmcws, and the underlying software is available under the GPL-3 license.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23640335 PMCID: PMC3692125 DOI: 10.1093/nar/gkt329
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.List of results: contains comparison results corresponding to the current filtering options. The list is sortable by all column names. The magnifying glass links to a detailed view of each comparison. The checkboxes on the right allow to select the models for a comparison with each other. CMCompare computes a score for the Input model (Input score) and for the Rfam model (Rfam score). The lower one is the Link score.
Figure 2.Visualizations: simplifying identification of relevant similarities between models by giving different representations of the pairwise result Link scores. (a) Link score matrix containing the similarity between all provided models and highlighting them by color. Clicking the Link score shows a detailed view of the comparison. (b) Weighted graph representation of linked models. The nodes indicate the models and contain their identifier. In contrast to the matrix representation, the shown edges correspond with the applied filtering options and redirect to a detailed view of the comparison on clicking. The comparisons against Rfam only show edges between the input and Rfam models. The shown input models 1, 2 and 4, 5 are members of the tRNA-clan, whereas ManA is presumably a false link.