| Literature DB >> 26578555 |
Julien Mariethoz1, Khaled Khatib2, Davide Alocci3, Matthew P Campbell4, Niclas G Karlsson5, Nicolle H Packer4, Elaine H Mullen6, Frederique Lisacek7.
Abstract
The SugarBind Database (SugarBindDB) covers knowledge of glycan binding of human pathogen lectins and adhesins. It is a curated database; each glycan-protein binding pair is associated with at least one published reference. The core data element of SugarBindDB is a set of three inseparable components: the pathogenic agent, a lectin/adhesin and a glycan ligand. Each entity (agent, lectin or ligand) is described by a range of properties that are summarized in an entity-dedicated page. Several search, navigation and visualisation tools are implemented to investigate the functional role of glycans in pathogen binding. The database is cross-linked to protein and glycan-relaled resources such as UniProtKB and UniCarbKB. It is tightly bound to the latter via a substructure search tool that maps each ligand to full structures where it occurs. Thus, a glycan-lectin binding pair of SugarBindDB can lead to the identification of a glycan-mediated protein-protein interaction, that is, a lectin-glycoprotein interaction, via substructure search and the knowledge of site-specific glycosylation stored in UniCarbKB. SugarBindDB is accessible at: http://sugarbind.expasy.org.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26578555 PMCID: PMC4702881 DOI: 10.1093/nar/gkv1247
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.(A) Result page prompted by querying SugarBindDB with the pathogen/agent name ‘Escherichia coli R45’. The page is organized in blocks as a function of the query. In this case, since the query is an agent, the header is the name of the agent. Then, all bindings reported in the database for this agent are shown. The glycan molecule is presented linearly for the sake of simplicity but mousing over the formula prompts a two-dimensional cartoon presenting the Essentials of Glycobiology 2nd Edition/CFG nomenclature symbols. The association boxes are displayed on the right side with a colour code corresponding to the entity type, orange for lectins, red for agents, green for affected area/tissue, pink for disease. Clicking on any of these boxes prompts the list of entities sharing the same binding. The screen capture shows the effect of clicking on disease. (B) Result page prompted by querying SugarBindDB with the ligand sequence: Fuc(a1–2)Gal(b1–3)[Fuc(a1–4)]GlcNAc(b1–3)Gal. In this case, the header is the name of the ligand and two answers are returned. It is shown here that two unrelated pathogens a bacterium (H. pylori) and a virus (Norovirus Norwalk) recognize a similar glycan ligand.
List of current, soon available and planned cross-references in SugarBindDB, categorized by the type of information provided by the added link
| Database name | URL | Annotation type | Ref |
|---|---|---|---|
| PubMed | Supporting evidence of binding | - | |
| Taxonomy | Pathogen taxonomy | - | |
| HAMAP | Summary description of bacteria with link to possible genome sequence | ( | |
| GOLD | Summary description of bacteria with link to possible genome sequence | ( | |
| ViralZone | Summary description of viruses with link to corresponding viral proteins | ( | |
| UniProtKB | Lectin functional annotation | ( | |
| Glyco3D/lectin | Lectin three-dimensional structure | ( | |
| CFG arrays | Known binding patterns of lectins (not curated data) | ( | |
| UniCarbKB | Full structures containing ligands | ( | |
| Pfam | Lectin domain classification | ( | |
| UniRef | Lectin amino acid sequence classification | ( | |
| GlycanBuilder | Interface for building and searching glycan structure | ( | |
| BCSDB | Known bacterial glycans possibly recognized by human lectins | ( | |
| PACDB | Pathogen–ligand binding data | Unpub | |
| Glycosciences lab arrays | Known binding patterns (curated data) | ( | |
| Glyco3D | 3D models of ligands | ( | |
| Glycam | 3D models of ligands | ( | |
| GlycoEpitope | Glycan epitope characterisation | ( | |
| GlycomeAtlas | Tissue expression | ( |
Figure 2.This Sankey graph highlights the specificity of the two lectins of Helicobacter pylori strain J99 in binding a variety of glycan determinants as stored in the database. Clicking on these lectins SabA and BabA in the graph changes the path colour from grey to orange. Then the specificity of each lectin is brought out by the absence of common path leading from the lectin to the glycan ligand. Moreover, the glycans that are recognized by each lectin make two distinct structural groups.
Figure 3.This figure shows how protein interacting partners can be found by following SugarBindDB cross-links. VP1, a viral lectin of the Norovirus Norwalk strain binds the B antigen (tri-) glycan determinant. The VP1 protein is linked to UniProtKB Q83884. The substructure search associates the B antigen (tri-) glycan determinant with 34 full glycan structure entries of UniCarbKB. One of these 34 matches is shown as an example. This structure is linked to UniProtKB Q99102 describing the host glycoprotein (MUC-4) on which the glycan is attached. The dashed line suggests a possible interaction between VP1 and MUC-4.