| Literature DB >> 23667459 |
Massimiliano Orsini1, Simone Carcangiu, Gianmauro Cuccuru, Paolo Uva, Anna Tramontano.
Abstract
BLAST-based similarity searches are commonly used in several applications involving both nucleotide and protein sequences. These applications span from simple tasks such as mapping sequences over a database to more complex procedures as clustering or annotation processes. When the amount of analysed data increases, manual inspection of BLAST results become a tedious procedure. Tools for parsing or filtering BLAST results for different purposes are then required. We describe here PARIGA (http://resources.bioinformatica.crs4.it/pariga/), a server that enables users to perform all-against-all BLAST searches on two sets of sequences selected by the user. Moreover, since it stores the two BLAST output in a python-serialized-objects database, results can be filtered according to several parameters in real-time fashion, without re-running the process and avoiding additional programming efforts. Results can be interrogated by the user using logical operations, for example to retrieve cases where two queries match same targets, or when sequences from the two datasets are reciprocal best hits, or when a query matches a target in multiple regions. The Pariga web server is designed to be a helpful tool for managing the results of sequence similarity searches. The design and implementation of the server renders all operations very fast and easy to use.Entities:
Mesh:
Year: 2013 PMID: 23667459 PMCID: PMC3646873 DOI: 10.1371/journal.pone.0062224
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Real Time Filtering and Logical Operations. (Left) Real Time Filtering.
Differently from other available tools, filtering of the returned results can be done in real time with PARIGA. By clicking the filter button, a form will appear where the user can insert the desired values (or ranges) and only filtered results will be shown. Four icons on the table header will show, from left to right, a graphical summary of the hits distribution on the query sequence, the results table, a Blast summary table and the Blast statistics. (Right) Logical Operations. The main result page will show three buttons that allow the user to perform logical operations between the two groups of results as described in the text.
Figure 2Pariga logical schema.
Central columns represent the original input files, while results are indicated in the columns on the side. Boxes indicate logical operations that can be performed on the results. As an example: COMMON: which sequence(s) of the dataset B is(are) shared in BLAST results of sequence A2 and A3 of the dataset A? CROSS: once sequence A1 is selected from dataset A, in which results of the dataset B does it appear? MULTIPLE: which sequence of dataset A appears more than once (i.e. matches more than one region) in the results of sequence B2 of dataset B?