| Literature DB >> 29058413 |
Zachary Klamer1, Ben Staal1, Anthony R Prudden2, Lin Liu2, David F Smith3, Geert-Jan Boons2, Brian Haab1.
Abstract
Knowledge of lectin and glycosidase specificities is fundamental to the study of glycobiology. The primary specificities of such molecules can be uncovered using well-established tools, but the complex details of their specificities are difficult to determine and describe. Here we present a language and algorithm for the analysis and description of glycan motifs with high complexity. The language uses human-readable notation and wildcards, modifiers, and logical operators to define motifs of nearly any complexity. By applying the syntax to the analysis of glycan-array data, we found that the lectin AAL had higher binding where fucose groups are displayed on separate branches. The lectin SNA showed gradations in binding based on the length of the extension displaying sialic acid and on characteristics of the opposing branches. A new algorithm to evaluate changes in lectin binding upon treatment with exoglycosidases identified the primary specificities and potential fine specificities of an α1-2-fucosidase and an α2-3,6,8-neuraminidase. The fucosidase had significantly lower action where sialic acid neighbors the fucose, and the neuraminidase showed statistically lower action where α1-2 fucose neighbors the sialic acid or is on the opposing branch. The complex features identified here would have been inaccessible to analysis using previous methods. The new language and algorithms promise to facilitate the precise determination and description of lectin and glycosidase specificities.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29058413 PMCID: PMC5700451 DOI: 10.1021/acs.analchem.7b04293
Source DB: PubMed Journal: Anal Chem ISSN: 0003-2700 Impact factor: 6.986
Syntax of the Motif Language
| rule | examples | comments |
|---|---|---|
| standard glycan terms are accepted | Galb1–4(Fuca1–3)GlcNAcb | indicates the Lewis X motif |
| Fuca1–3(Galb1–4)GlcNAcb | indicates same motif; either branch may be enclosed | |
| standard abbreviations for monosaccharides | ||
| “a” and “b” for α and β, linkages separated by dash or comma | ||
| case insensitive | ||
| angled brackets specify details about the linkage or monosaccharide | Neu5AcA2-<3or6>GalB1–4<3f6f>GlcNAcB1–2<4f;6f>ManA | indicates Neu5Ac either 3′- or 6′-linked to an unbranched LacNAc |
| <T>Gal | indicates terminal galactose | |
| the “f” modifier specifies a location as unoccupied (free) | ||
| an unspecified location indicates occupied or unoccupied | ||
| the OR operator can be used to designate multiple options of linkage or monosaccharide | ||
| the XOR operator can be used to indicate multiple options of linkage, where all other options of linkage are unoccupied (free) | ||
| commas or semicolons can be included for clarity so that <3f6f>, <3f,6f>, and <3f;6f> are equivalent | ||
| the “T” modifier is shorthand for all nonreducing locations being unoccupied (<2f3f4f6f> for hexose and <8f9f> for neuraminic acids) | ||
| special monosaccharide terms are available | Any??-<3OR6>Galb1–4<3f;6f>GlcNAcb1–2<4f;6f>Mana | indicates a single monosaccharide extension linked either 3′ or 6′ to an unbranched LacNAc |
| Any??-<3OR6>Galb1–4<3f;6f>GlcNAcb1-?Mana | indicates the same but with any linkage to mannose: | |
| Fuca1-<2or3>Any?1-?Skip??-3(Fuca1-<2or3>Any?1-?Skip??-6)Galb | indicates a fucose anywhere on both a 3′ and a 6′ extension of Gal | |
| the “Any” wildcard is for unspecified monosaccharides | ||
| the “?” wildcard is for unspecified linkages | ||
| the “Skip” modifier indicates a gap of any kind | ||
| the “Skip” modifier encompasses lengths of zero, or no gap | ||
| double-angled brackets link complete terms with logical operators | ≪ <2f;3f;4f;6f>Glca ≫ OR ≪ Mana ≫ | indicates either terminal α-glucose or α-mannose |
| ≪ Fuca ≫ 2 | indicates two or more fucose | |
| ≪ Fuca1-<2or3or4>Any ≫ AND NOT ≪ Fuca ≫ 2 AND ≪ Siaa ≫ | indicates a single fucose with at least one sialic acid | |
| ≪Fuca≫ AND NOT ≪Fuca≫ 2 AND NOT k OR ≪ Fuca1–2Galb1–3GalNac ≫ ≫ | indicates a single fucose and excluding two other motifs | |
| ≪Motif1≫ AND NOT ≪Motif1≫ 2 AND NOT ≪Motif 2≫ | uses Motif IDs as shorthand | |
| embedded double-angled brackets are supported | ||
| the AND, OR, and NOT operators are supported |
Figure 1Examples of motifs and their presence or absence in glycans. In each example, the text definition of a motif is given along with the graphical representation of representative glycans. A green circle indicates the presence of the motif, and a red circle indicates a feature that precludes the motif.
Figure 2Using motifs in the analysis of glycan-array data. (A) The initial steps in the analysis of glycan-array data are to quantify the binding of a lectin to each glycan (top) and to determine the presence or absence of each motif in the glycans (bottom). (B) Adding features to a base motif defines subsets of glycans within the larger sets (top). One may compare the lectin binding between sets of glycans (bottom), such as between glycans containing a motif and all other glycans, or between subsets of glycans.
Figure 3Uncovering complex motifs in the HMO array. Each plot shows the fluorescence intensities of glycans containing the indicated motifs. Glycan arrays containing human-milk oligosaccharides were incubated with the lectins AAL (A) and SNA (B). The depicted glycans are representatives from each group (see Figure S-1 for all glycans on the array).
Figure 4Using motifs to uncover details of exoglycosidase specificities. (A) Method of detecting changes in lectin binding upon glycosidase treatment. (B) The removal motif is the part of the glycan removed by the glycosidase, and the recognition motif is the part of the glycan that affects enzyme activity. (C) The program generates in-silico-modified glycans. If a glycan contains a recognition motif that includes the removal motif, the program eliminates the removal motif from the glycan. The program then determines, for each lectin, whether the lectin’s primary motif is present or absent in the original and modified glycans. (D) For each removal motif, the program groups the glycans according the presence or absence of the recognition motif and by the expected change in lectin binding. (E) For the fucosidase, motifs 1, 12, and 7 are examples of moderate, poor, and good recognition motifs, respectively. The graphs show the percent changes in lectin binding in each of the groups defined in panel D. Each graph contains pooled data from AAL and UEA. Fine-specificity analysis revealed statistically higher percent changes where the galactose neighboring the fucose had no sialic acid relative to where it had sialic acid at the 6′ location. (F) For the neuraminidase, motif 56 showed a very good fit to the expected changes. Fine-specificity analysis showed that α1–2-linked fucose could have an inhibitory effect.