| Literature DB >> 21715387 |
Lukasz Jaroszewski1, Zhanwen Li, Xiao-hui Cai, Christoph Weber, Adam Godzik.
Abstract
The Fold and Function Assignment System (FFAS) server [Jaroszewski et al. (2005) FFAS03: a server for profile-profile sequence alignments. Nucleic Acids Research, 33, W284-W288] implements the algorithm for protein profile-profile alignment introduced originally in [Rychlewski et al. (2000) Comparison of sequence profiles. Strategies for structural predictions using sequence information. Protein Science: a Publication of the Protein Society, 9, 232-241]. Here, we present updates, changes and novel functionality added to the server since 2005 and discuss its new applications. The sequence database used to calculate sequence profiles was enriched by adding sets of publicly available metagenomic sequences. The profile of a user's protein can now be compared with ∼20 additional profile databases, including several complete proteomes, human proteins involved in genetic diseases and a database of microbial virulence factors. A newly developed interface uses a system of tabs, allowing the user to navigate multiple results pages, and also includes novel functionality, such as a dotplot graph viewer, modeling tools, an improved 3D alignment viewer and links to the database of structural similarities. The FFAS server was also optimized for speed: running times were reduced by an order of magnitude. The FFAS server, http://ffas.godziklab.org, has no log-in requirement, albeit there is an option to register and store results in individual, password-protected directories. Source code and Linux executables for the FFAS program are available for download from the FFAS server.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21715387 PMCID: PMC3125803 DOI: 10.1093/nar/gkr441
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Databases used by the FFAS server that were added or significantly modified since 2005 [databases of profiles such as PDB, PfamA, SCOP and COG, added before 2005, are regularly updated; for details, see (1)]
| Database | Sources and preparation of the data |
|---|---|
| NR85S (sequences) | The NR database from National Center for Biotechnology Information (NCBI) and the following sets of metagenomic sequences: Global Ocean Sampling (GOS) data from the JCVI and CAMERA consortia ( |
| VFDB (profiles) | VFDB: Virulence Factors Database (VFDB) ( |
| HUMSAVAR (profiles) | Human polymorphisms and disease mutations (HUMSAVAR) ( |
| Complete human proteome (profiles) | The set of sequences of canonical isoforms of human proteins have been downloaded from the Uniprot database page of Complete Proteomes ( |
| Selected microbial proteomes (pathogens and members of human microbiome) and two eukaryotic proteomes (profiles) | The proteomes of |
Figure 1.Examples of novel features of the FFAS server. (A) Dotplot graphs generated with the new FFAS tool. Left panel: the dotplot graph of a leucine-rich repeat region of the human NACHT protein compared to itself. Right panel: the dotplot graph visualizing similarity between C-terminal parts of SusE and SusF proteins from Bacteroides thethaiotaomicron. Arrows indicate the estimated lengths of repeats in NACHT LRRs and the lengths of repeated (homologous) domains in the alignment of SusE with SusF. (B) FFAS results are now linked to a database of structural similarities calculated with FATCAT. These links can be used to evaluate structural consistency of FFAS results. In this example, the fact that two different folds are aligned with the same query (Prophage tail fibre N-terminal domain) is explained by a list of structural neighbors that shows that a Prealbumin-like fold (b.3 code in SCOP) and an Immunoglobulin-like beta-sandwich (b.1 code in SCOP) are structurally similar despite being classified as separate folds. (C) 3D alignment viewer allows quick inspection of the alignment as ‘projected’ on a template structure (labeling of residues in a Jmol viewer is synchronized with alignment labeling).