| Literature DB >> 31848260 |
Brian B Haab1, Zachary Klamer2.
Abstract
Proteins that bind carbohydrate structures can serve as tools to quantify or localize specific glycans in biological specimens. Such proteins, including lectins and glycan-binding antibodies, are particularly valuable if accurate information is available about the glycans that a protein binds. Glycan arrays have been transformational for uncovering rich information about the nuances and complexities of glycan-binding specificity. A challenge, however, has been the analysis of the data. Because protein-glycan interactions are so complex, simplistic modes of analyzing the data and describing glycan-binding specificities have proven inadequate in many cases. This review surveys the methods for handling high-content data on protein-glycan interactions. We contrast the approaches that have been demonstrated and provide an overview of the resources that are available. We also give an outlook on the promising experimental technologies for generating new insights into protein-glycan interactions, as well as a perspective on the limitations that currently face the field.Entities:
Keywords: Glycoproteins; bioinformatics; glycan arrays; glycomics; glycosylation; lectins; micro arrays
Mesh:
Substances:
Year: 2019 PMID: 31848260 PMCID: PMC7000120 DOI: 10.1074/mcp.R119.001836
Source DB: PubMed Journal: Mol Cell Proteomics ISSN: 1535-9476 Impact factor: 5.911
Fig. 1.The acquisition of glycan-array data. The typical experiment involves incubating a lectin or glycan-binding antibody on a microarray of diverse glycans, followed by quantifying the amount of binding to each glycan. The protein usually is labeled with a fluorescent tag or another tag that allows fluorescence detection by a secondary agent.
Experimental and data resources for glycan arrays. The criterion for inclusion was any array that advertised itself as a service or resource and that had appropriate web-accessible information and request forms
| Source Type | Source | Content Type | Number of Glycans | Data Available |
|---|---|---|---|---|
| Academic | National Center for Functional Glycomics (NCFG) | Previous Versions of the CFG | 250–600+ | Yes |
| Current Version of the CFG | 600 | Yes | ||
| Mannose-6P Array | 26 | No | ||
| Modified Sialic Acid Array | 80 | No | ||
| NCFG General Glycan Array | 100 | No | ||
| NCFG SBA Array | 106 | No | ||
| Asparagine-Linked Array | 38 | Yes | ||
| Imperial College | Custom Arrays | 796 | Yes | |
| Commercial | Z Biotech | General Array | 100 | No |
| N-Glycan Array | 100 | No | ||
| O-Glycan Array | 94 | No | ||
| Heparan Sulfate | 24 | No | ||
| Neu5Gc/Neu5Ac | 80 | No | ||
| Human Milk Oligosaccharide | 46 | No | ||
| Glycosphingolipid | 58 | No | ||
| Glycosaminoglycan | 34 | No | ||
| Chemily | Blood Group Antigen | 21 | No | |
| General Array | 100 | No | ||
| General Array | 300 | No | ||
| Glycosphingolipid | 58 | No | ||
| Human Milk Oligosaccharide | 46 | No | ||
| N-Glycan Array | 100 | No | ||
| Neu5Gc/Neu5Ac | 80 | No | ||
| RayBiotech | General Array | 100 | No | |
| General Array | 300 | No |
Fig. 2.Defining motifs and families of motifs. A, Motif types. Fixed substructures are continuous units of defined monosaccharides. Intolerant definitions require the unit to be unsubstituted, and tolerant definitions allow substitutions. Explicit definitions define the locations where substitutions are optional, which gives the highest level of precision in the definition. Variable substructures allow for options in the monosaccharides, providing another level of flexibility in the definition. Non-contiguous substructures allow the components of a motif to be physically separated. This feature is useful when a lectin contacts separate branches of a glycan. B, Motif families. The tree shows the relationships between the groups of glycans with the indicated motifs, using a simulated analysis. The first split represents primary motifs (A and B) to which a protein binds. Motif B can be split into sub-motifs that represent fine specificities. C, The simulated data show the ranges of lectin binding to the glycans in each of the motif groups. For example, the glycans in group B contain motif B but not motif A. The B1-B4 sub-motifs define fine-specificities with differing ranges of binding, potentially explaining the broad range of the parent motif B.
Fig. 3.Using the results from glycan-array analyses to interpret experimental data. A lectin (or a glycan-binding antibody) can be applied to glycan arrays and experimental samples in separate experiments. Analysis software is applied to the glycan-array data to determine the lectin specificities, and the amount of lectin binding to the sample with unknown glycans is quantified. The output from the glycan-array analysis is combined with the data from the sample to produce an estimation of the glycans that are present in the sample. This scheme also could be used with integrated data from multiple lectins, and with data acquired after treatments with glycosidases.
Software resources for glycan array analyses
| Software | Method | Summary | Reference |
|---|---|---|---|
| MotifFinder | Motif Binding Association | Uses multiple statistics to test the associations of motifs with lectin binding, using an advanced motif syntax that allows flexible motif definitions. Recent versions include automated motif optimization and modeling families of motifs. | Klamer 2017 ( |
| GlycoPattern | Frequent Subtree Mining | Uses a graph theory approach to mine new glycan substructures that are frequent in the bound glycans and infrequent in the unbound glycans. | Cholleti 2012 ( |
| Multiple Carbohydrate Alignment with Weights (MCAW) | Weighted Structure Alignment | Adapts traditional sequence alignment algorithms to align the strongest glycan binders for a lectin. Offers a database of analyzed CFG datasets. | Hosoda 2017 ( |
| GLycan Array Dashboard (GLAD) | Graphical Visualization | Enables researchers to explore trends in glycan array data through graphic visualization and the manual exploration of simple motifs. | Mehta 2019[1] |