| Literature DB >> 27129717 |
Pablo Mier1,2, Miguel A Andrade-Navarro3,4.
Abstract
BACKGROUND: Basic local alignment search tool (BLAST) searches are frequently used to look for homologous sequences and to annotate a query protein, but the increasing size of protein databases makes it difficult to review all results from a similarity search.Entities:
Keywords: BLAST search; Clustering; Computational biology; Web tool
Mesh:
Year: 2016 PMID: 27129717 PMCID: PMC4851773 DOI: 10.1186/s13104-016-2062-y
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Fig. 1CABRA pipeline. Steps followed by CABRA. See text for details
Fig. 2Illustrative explanation of the result’s layout after the execution of the CABRA web tool. Top section overview of the search; bottom right section, list of proteins from the BLAST report; and bottom left section, reviewed information per cluster. Only the information for one cluster is shown
Fig. 3Comparison of results from CABRA and UniRef50. The N-terminal region (positions 1–60) of the human ANR17 protein (UniProt:O75179) was used as query in both searches. a Results obtained after the search in CABRA; two ANR17 proteins in one cluster. b The best result from a search in UniRef50 with SwissProt proteins, the UniRef50_O75179 cluster; two ANR17 and one ANKH1 proteins are part of the same cluster. c Multiple sequence alignment of the three proteins from the UniRef50_O75179 cluster plus the query sequence
Fig. 4Overview of the hits obtained from a BLAST search clustered in CABRA using different identity cutoffs. Query: Q969S2, default parameters. BLAST hits clustered using three identity cutoffs: 20, 53 (default) and 90 %. Distinct colors to label the clusters are used up to the twentieth cluster