| Literature DB >> 32313959 |
Dmitry Suplatov1, Yana Sharapova1, Elizaveta Geraseva1, Vytas Švedas1.
Abstract
Zebra2 is a highly automated web-tool to search for subfamily-specific and conserved positions (i.e. the determinants of functional diversity as well as the key catalytic and structural residues) in protein superfamilies. The bioinformatic analysis is facilitated by Mustguseal-a companion web-server to automatically collect and superimpose a large representative set of functionally diverse homologs with high structure similarity but low sequence identity to the selected query protein. The results are automatically prioritized and provided at four information levels to facilitate the knowledge-driven expert selection of the most promising positions on-line: as a sequence similarity network; interfaces to sequence-based and 3D-structure-based analysis of conservation and variability; and accompanied by the detailed annotation of proteins accumulated from the integrated databases with links to the external resources. The integration of Zebra2 and Mustguseal web-tools provides the first of its kind out-of-the-box open-access solution to conduct a systematic analysis of evolutionarily related proteins implementing different functions within a shared 3D-structure of the superfamily, determine common and specific patterns of function-associated local structural elements, assist to select hot-spots for rational design and to prepare focused libraries for directed evolution. The web-servers are free and open to all users at https://biokinet.belozersky.msu.ru/zebra2, no login required.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32313959 PMCID: PMC7319439 DOI: 10.1093/nar/gkaa276
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The on-line interactive analysis toolkit is the hallmark of Zebra2. The results are provided at four information levels: (A) as a sequence similarity network; interfaces to (B) 3D-structure-based and (C) sequence-based analysis of conservation and variability; and (D) accompanied by the detailed annotation of proteins accumulated from the integrated databases with links to the external resources (i.e. PDB, UniProt, BacDive, BRENDA). In (B), the gradient paint indicates the statistical significance of the subfamily-specific (red-to-cyan) and conserved (yellow-to-grey) positions. To operate this example on-line use the ‘Demo mode (PLP)’.