| Literature DB >> 25332714 |
Udayakumar Mani1, Vaidhyanathan Mahaganapathy1, Sadhana Ravisankar1, Sai Mukund Ramakrishnan1.
Abstract
The functionality of a gene or a protein depends on codon repeats occurring in it. As a consequence of their vitality in protein function and apparent involvement in causing diseases, an interest in these repeats has developed in recent years. The analysis of genomic and proteomic sequences to identify such repeats requires some algorithmic support from informatics level. Here, we proposed an offline stand-alone toolkit Repeat Searcher and Motif Detector (RSMD), which uncovers and employs few novel approaches in identification of sequence repeats and motifs to understand their functionality in sequence level and their disease causing tendency. The tool offers various features such as identifying motifs, repeats and identification of disease causing repeats. RSMD was designed to provide an easily understandable graphical user interface (GUI), for the tool will be predominantly accessed by biologists and various researchers in all platforms of life science. GUI was developed using the scripting language Perl and its graphical module PerlTK. RSMD covers algorithmic foundations of computational biology by combining theory with practice.Entities:
Keywords: combination algorithm; computational biology; genomic sequence; motif; proteomic sequence; repeats
Year: 2014 PMID: 25332714 PMCID: PMC4197393 DOI: 10.7555/JBR.28.20130065
Source DB: PubMed Journal: J Biomed Res ISSN: 1674-8301
Fig. 1Frames of RSMD.
4 row frames and 2 GUI frames. RSMD, Repeat Searcher and Motif Detector.
The usage of various modules in RSMD.
| No | Module name | Use |
| 1 | Repeat searcher | Identification of repeats in single/multiple sequence (.fasta) |
| 2 | Motif detector | Identification of motifs in a single sequence (.fasta) |
| 3 | Similarity search | Identification of repeats from an ClustalW alignment file (.aln) |
| 4 | Prosite pattern hunter | Identification of motifs from a single sequence using prosite patterns (.fasta) |
Fig. 2Workflow of RSMD
Fig. 4Comparison of the graphical output between single sequence and multiple sequences input.
A: Single sequence (format: repeat(frequency)). B: Multiple sequences (format: repeat (frequency)[sequence_num]).
Fig. 3
Fig. 5Case study depicting the RSMDs capability in identifying disease causing repeats.
Comparison of RSMD with various other existing tools.
| Name | Algorithm used | Flanking region | Repeat finder | Motif identification | Multi sequence testing | User interface |
| Repeat finder | Repbase | NO | YES | YES | NO | Console/WEB |
| Find pattern | Perfect imperfect | NO | YES | YES | NO | GUI |
| TAMO | Combination of above tools | YES | YES | NO | NO | GUI |
| STAR | Minimum description length | NO | NO | NO | NO | Console |
| RSMD | Disambigous combination algorithm | YES | YES | YES | YES | GUI |
Fig. 6Comparison of various tools with RSMD based upon computational speed in seconds.