Literature DB >> 24755681

Community-based network study of protein-carbohydrate interactions in plant lectins using glycan array data.

Adeel Malik1, Juyong Lee1, Jooyoung Lee1.   

Abstract

Lectins play major roles in biological processes such as immune recognition and regulation, inflammatory responses, cytokine signaling, and cell adhesion. Recently, glycan microarrays have shown to play key roles in understanding glycobiology, allowing us to study the relationship between the specificities of glycan binding proteins and their natural ligands at the omics scale. However, one of the drawbacks in utilizing glycan microarray data is the lack of systematic analysis tools to extract information. In this work, we attempt to group various lectins and their interacting carbohydrates by using community-based analysis of a lectin-carbohydrate network. The network consists of 1119 nodes and 16769 edges and we have identified 3 lectins having large degrees of connectivity playing the roles of hubs. The community based network analysis provides an easy way to obtain a general picture of the lectin-glycan interaction and many statistically significant functional groups.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24755681      PMCID: PMC3995809          DOI: 10.1371/journal.pone.0095480

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Glycans play important roles inside eukaryotic cells by binding to proteins and lipids, and they are also found in the extracellular space between cells [1]. Glycans can be grouped into two classes; linear sugars and polysaccharides. The polysaccharides consist of repeating pyranose monosaccharide rings and branched sugars, which are formed by linking various monosaccharide units [2]. Through non-covalent interactions with lectins, glycans control biochemical reactions by engaging in various biological processes such as development [3], [4], coagulation [5] and response to infection by bacterial and viral agents [6]. The size of the cellular glycome is believed to be in range of 100000–500000 glycans [7]. This large size of glycomic contents could be attributed to the combinatorial aspect that oligosaccharide chains come in either linear or branched form, monosaccharide building blocks are either in α or in β anomeric configurations and monosaccharides can be linked via various carbon atoms in their sugar rings [8]. Using the complexity of the glycome, cells adopt to encode a massive amount of biological information, and it is a great challenge to decode this hidden information to understand the biology of lectins and their interactions with carbohydrates. Protein-carbohydrate interactions are involved in a variety of biological and biochemical processes, and, recently, attempts to understand the molecular basis of such interactions have appeared [9]. Traditional methods to probe glycan–protein recognition events include X-ray crystallography, NMR spectroscopy, the hemagglutination inhibition assay [10], enzyme-linked lectin assay [11], surface plasmon resonance [12] and isothermal titration calorimetry [13]. Although these methods have been successfully applied to elucidate the details of carbohydrate–protein interactions, they are rather labor intensive and require large amounts of carbohydrate samples. These shortcomings make the aforementioned traditional approaches unsuitable as high-throughput analytic methods [14]. On the other hand, recently, many computational methods have been suggested to study protein carbohydrate interactions [15]–[21]. Conventional methods for carbohydrate ligand detection are often cumbersome and we need sensitive and high-throughput technologies that can analyze carbohydrate-protein interactions in order to discover and differentiate oligosaccharide sequences interacting with carbohydrate binding proteins [8]. Carbohydrate micro-array based technology can serve as an appropriate method [22]–[25]. However, at present, one of the biggest limiting factors in utilizing the complete potential of the glycan microarray data is the lack of efficient analysis tools to extract relevant information. For complete utilization of a glycan microarray data, we need a systematic computational method [26]. Large quantities of data are generated from the analysis of the Consortium for Functional Glycomics (CFG) glycan microarray [27]. Also, predicting the glycan-binding specificity or binding motif can be a time consuming step of scrutinizing and evaluating the linear sequences of monosaccharides in glycans [27]. The CFG offers glycan microarray data for various lectins (both plant and animal origin) and glycan binding antibodies. Recently computational methods have been developed for analyzing the glycan-binding specificity from glycan array data such as the motif-segregation method [26] and the outlier motif analysis (OMA) method [28]. In this work, we have developed a method to group various plant lectins and their interacting carbohydrates by the community detection analysis of a lectin-glycan network generated by the glycan microarray data from CFG. The lectin-glycan network consists of 1119 nodes (lectins and glycans) and 16769 edges (interactions). From this network, we have identified 3 lectins having large degrees of connectivity playing the roles of hubs. Additionally, we compared the results of our community detection method with other well known clustering algorithms. We show that our method outperforms existing clustering methods in terms of both modularity score as well as the number of statistically significant (p-value ≤0.05) glycan specific lectin groups. We propose that this study can reveal a global organization of lectin-glycan interactions, and help to identify strongly correlated lectin and glycan clusters.

Methodology

Data Generation

A total of 786 glycan array files for plant lectins were downloaded using a custom made script from Consortium for Functional Glycomics (CFG) as of Dec 2013. CFG provides extensive glycomics resources so that one can explore functions of glycans and glycan-binding proteins that play important roles in human health and disease [http://www.functionalglycomics.org/static/consortium/consortium.shtml]. All of these 786 files were further processed into a single input file, which consists of rows of protein-carbohydrate pairs. Three datasets were generated by filtering the protein-carbohydrate pairs using the cutoff values of relative fluorescence units (RFU) 5000, 10000 and 20000. These three datasets were used for network construction and their community detection. shows the histogram of the RFU values collected from 786 glycan array files. The data corresponding to RFU larger than 5000 constitutes only about 3.5% of the whole data. All the data is available to researchers upon request.
Figure 1

Histogram of the RFU values collected from 786 glycan array files is shown.

It should be noted that the y-axis is shown in the log scale and the data corresponding to RFU larger than 5000 constitutes only about 3.5% of the whole data.

Histogram of the RFU values collected from 786 glycan array files is shown.

It should be noted that the y-axis is shown in the log scale and the data corresponding to RFU larger than 5000 constitutes only about 3.5% of the whole data.

Network Construction

To perform a systematic analysis of protein-carbohydrate interaction, we have constructed a bipartite network, where unweighted edges are assigned between proteins and carbohydrates. Each node represents a lectin or a glycan and its identity is indicated by its array ID or glycan ID at a given condition. A glycan array ID represents a specific protein under a specific condition. Therefore, two different nodes in the network may represent two different concentrations of a protein in the glycan array experiment. The strength of a lectin-glycan interaction is represented by its RFU value and three networks are generated using three cutoff values of RFU of 5000, 10000 and 20000.

Community Detection of a Network

We have identified the community structure of the lectin-glycan network by using the Mod-CSA method, which is a highly effective modularity optimization method [29], [30], [31]. The modularity is a widely used measure to determine the community structures of various networks. From a given community structure it measures the difference between the number of inter-community edges and its expected value from a randomly re-wired counterpart preserving the degrees of nodes. Modularity (Q) is defined as:where M is the total number of edges in the network, is the number of communities, is the number of edges within community i and is the sum of degrees of nodes in community i. The value of Q ranges between −1 and 1 and it becomes close to 1 for a highly modular community structure and 0 for a random community structure [32].

Network Visualization and Comparison with other Clustering Methods

Three lectin glycan array networks constructed in this study were exported to the Cytoscape 2.8.2, a bioinformatics package for biological network visualization and data integration [33]. To compare our clustering method with other widely used network clustering algorithms such as MCL [34], [35], MCODE [36] and greedy algorithm [32], we have used clusterMaker [37] and GLay plugins [38], a multi-algorithm clustering plugins for Cytoscape.

Enrichment of Glycan-specific Proteins

Enriched glycan-specific lectins within each cluster were investigated by annotating each lectin with a predetermined glycan binding specificity. Reported specificities of various lectins were extracted from literature [39], [40] and Uniprot database [41] as summarized in . The full list of all 513 protein nodes used in this study with annotations (wherever possible) are listed in .
Table 1

List of glycan binding specificities of lectins investigated in this study is shown. Specificities are collected from literature and uniprot database.

S. No.Protein NameReported Specificity
1.Pokeweed Agglutinin(GlcNAcb1-4)n
2.Datura Stramonium Lectin(GlcNAcb1-4)n, Galb1-4GlcNAc
3.Soybean Agglutinina- or b-linked terminal GalNAc, GalNAca1-3Gal
4.LBA Lima Bean Agglutinin/LBLa-D-GalN.Ac
5.Griffonia Simplicifolia Lectin I, Isolectin B4/GSI-B4 isolectina-Linked Gal
6.Agglutinina-Linked terminal GalNAc
7.Psophocarpus tetragonolobus Agglutinin/Basic agglutinina-Linked terminal GalNAc
8.Psophocarpus Tetragonolobus Lectin Ia-Linked terminal GalNAc
9.Vicia Villosa Lectin (VVL)a-Linked terminal GalNAc, GalNAca1-3Gal
10.Griffonia simplicifolia II/InsecticidalN-acetylglucosamine-specific lectinAgalactosylated tri/tetra antennary glycans, GlcNAc
11.Phaseolus vulgaris Erythroagglutinin/Erythroagglutinating phytohemagglutininBi-antennary complex-type N-glycan with outer Galand bisecting GlcNAc
12.Wheat Germ Agglutinin (WGA)Chitin oligomers, Sia
13.Laburnum alpinum Agglutinin/Lectin 1/Seed lectin anti-H(O)Di-N-acetylchitobiose specific lectin.
14.Ulex europaeus AgglutininII/UEA-II OR Anti-H(O) lectin 2Di-N-acetylchitobiose specific lectin.
15.Trichosanthes japonica Agglutinin IIFuca1-2Galb1 -> or GalNAcb1 -> groups attheir nonreducing terminals
16.Cholera Toxin BFuca1-2Galb1-3GalNAcb1-4(Neu5Aca2-3)Galb1-4Glcb ORGalb1-3GalNAcb1-4(Neu5Aca2-3)Galb1-4Glcb
17.Ulex Europaeus Agglutinin ORAnti-H(O) lectin 1Fuca1-2Galb1-4GlcNAc
18.Lotus Tetragonolobus Lectin/Anti-H(O) lectinFuca1-3(Galb1-4)GlcNAc, Fuca1-2Galb1-4GlcNAc
19.Aspergillus oryzae LectinFuca1-6GlcNAc (core fucose)
20.Lens Culinaris AgglutininFuca1-6GlcNAc, a-D-Glc, a-D-Man
21.Pisum Sativum AgglutininFuca1-6GlcNAc, a-D-Glc, a-D-Man
22.Aleuria Aurantia Lectin AALFuca1-6GlcNAc, Fuca1-3(Galb1-4)GlcNAc
23.Pseudomonas aeruginosa lectin/PA-I galactophilic lectinFucose Anywhere
24.Psophocarpus Tetragonolobus Lectin IIFucose binding lectin
25.Fucose-binding lectin proteinFucose binding lectin
26.Euonymus europaeus AgglutininGala1-3Gal, blood group B antigen
27.Cytisus sscoparius AgglutininGalactose binding lectin
28.Discoidin-2Galactose binding lectin
29.Polyporus Squamosus LectinGalactose binding lectin
30.Discoidin-1 subunit B/CGalactose- and N-acetylgalactosamine-binding
31.SRL- strong binding to di-saccharideGalb1!3GalNAc-a- similar to Agaricus bisporuslectinGalb1->3GalNAc-a-
32.Agaricus bisporus AgglutininGalb1-3GalNAc
33.Amaranthus Caudatus LectinGalb1-3GalNAc
34.Galactose-binding lectin (Agglutinin PNA)Galb1-3GalNAc
35.Jacalin/Agglutinin alpha chainGalb1-3GalNAc, GalNAc
36.Bauhinia Purpurea LectinGalb1-3GalNAc, GalNAc
37.Maclura Pomifera Lectin/Agglutinin alpha chain/MPAGalb1-3GalNAc, GalNAc
38.Erythrina crista-galli LectinGalb1-4GlcNAc
39.Ricinus Communis Agglutinin IGalb1-4GlcNAc
40.Dolichos biflorus Agglutinin/Seed lectin subunit IGalNAca1-3GalNAc, blood group A antigen
41.Wisteria floribunda AgglutininGalNAcb1-4GlcNAc, Galb1-3(-6)GalNAc
42.Marasmium oreades agglutininGalα(1,3)Gal
43.Solanum Tuberosum (Potato) Lectin (STL)GlcNAc oligomers, oligosaccharide containingGlcNAc and LacNAc
44.Lycopersicon Esculentum LectinGlcNAc trimers/tetramers
45.Urtica dioica Agglutinin/Lectin/endochitinase 1GlcNAcb1-4GlcNAc, Mixture of Man5–Man9
46.Coprinopsis cinerea lectin 2GlcNAcβ1,4[Fucα1,3]GlcNAc
47.Vicia faba AgglutininGlucose binding lectin
48.Galanthus nivalis agglutinin orMannose-specific lectinHigh-mannose, Mana1-3Man
49.Hippeastrum hybrid AgglutininHigh-mannose, Mana1-3Man, Mana1-6Man
50.Canavalia A (Con A)High-mannose, Mana1-6(Mana1-3)Man
51.Canavalia ensiformis (Con A)High-mannose, Mana1-6(Mana1-3)Man
52.Narcissus pseudonarcissusAgglutininHigh-mannose, Mana1-6Man
53.Tulip LectinMana1-3(Mana1-6)Man, bi- and tri-antennarycomplex-type N-glycan, GalNAc
54.Sauromatum gutattumManb Anywhere
55.Mannose specific lectinMannose binding lectin
56.ASA, Allium sativum agglutinin(ASAI and ASAII)Mannose binding lectin
57.LectinMannose binding lectin
58.Concanavalin-AMannose binding lectin
59.Colocasia esculenta LectinMannose binding lectin
60.Lectin alpha chainMannose binding lectin
61.Mannose-binding lectinMannose binding lectin
62.Banana lectinMannose binding lectin
63.Cyanovirin-NMannose binding lectin
64.Salt stress-induced proteinMannose binding lectin
65.Lectin-like proteinMannose binding lectin
66.Hessian fly response gene 1 proteinMannose binding lectin
67.Nessun dorma, isoform A;Nessun dorma, isoform BN-acetylglucosamine
68.Nicotiana tabacum agglutininN-acetylglucosamine
69.Psathyrella velutina lectinN-acetylglucosamine and N-acetylneuraminic acid
70.Ricin B-like lectinN,N'-diacetyllactosediamine(GalNAcβ1-4GlcNAc, LacdiNAc)
71.Maackia Amurensis Lectin IISiaa2-3Galb1-
72.Maackia Amurensis Lectin ISiaa2-3Galb1-
73.Maackia amurensis AgglutininSiaa2-3Galb1-3(Siaa2-6)GalNAc
74.Sambucus nigra AgglutininSiaa2-6Gal/GalNAc
75.Trichosanthes japonica Agglutinin ISiaa2-6Gal/GalNAc
76.Limax flavus Agglutinin/Sialic acid-binding lectin 1Sialic acid-binding lectin
77.Platypodium elegans legume lectinSubterminal Mannose
78.Sclerotinia sclerotiorumagglutininterminal N-acetylgalactosamine (GalNAc)
79.Phaseolus vulgaris Leucoagglutinin/Leucoagglutinating phytohemagglutininTri/tetra-antennary complex-type N-glycan
The enrichment of glycan-specificities of lectins in each cluster was assessed by calculating the hypergeometric p-value. The p-value corresponds to the probability that a given lectin cluster sharing the same glycan-specificity can be obtained by chances. The p-value was calculated as follows:where N is the total number of lectins in the network, K is the number of all lectins having a particular glycan-specificity, and k is the number of lectins having the particular glycan-specificity in a cluster with the size of n. Enrichment analysis was also attempted by using DAVID functional annotation cluster tool [http://david.abcc.ncifcrf.gov/home.jsp], which did not yield any statistical significant clusters. We then manually searched each lectin in InterPro database [42] but only 8 unique GO terms such as chitin-binding, carbohydrate-binding, protein binding, endopeptidase inhibitor activity, etc, were retrieved. However, these GO terms are too general to signify any detailed glycan binding specificities of corresponding lectins. Therefore, in this study, the enrichment analysis for each cluster was performed based on the annotations listed in . Only those clusters with at least 10 protein nodes were analyzed for statistical significance.

Identification of Hub Proteins

In general, biological networks possess the scale-free property [43] in which only a few nodes in the network have many connections serving as hubs in the network. Hub proteins were identified by calculating the node degree distribution [44] by using the NetworkAnalyzer plugin of Cytoscape. Top three highest degree protein nodes were assigned as hubs (see ).
Figure 2

The node degree distribution of the lectin-glycan network is shown.

We observe a large gap between 3 hub nodes and the other nodes. The degree distribution was plotted using plotly [https://plot.ly/plot].

The node degree distribution of the lectin-glycan network is shown.

We observe a large gap between 3 hub nodes and the other nodes. The degree distribution was plotted using plotly [https://plot.ly/plot].

Results and Discussion

We constructed three lectin-glycan interaction networks by using the plant lectin-glycan micro array data filtered by three RFU cut-offs. The network where the interactions were filtered by RFU <5000 consists of 1119 nodes (513 proteins and 606 carbohydrates) and 16769 edges. Similarly, the second network filtered by RFU <10000 has 1035 nodes and 12169 edges, and the third one (filtered by RFU <20000) consists of 901 nodes and 8042 edges. Since the first network has the maximum number of nodes and edges, and shows more statistically significant glycan specific groups (discussed later) than the other two networks, the results specified henceforth represent the first network if not specifically indicated. The first network is shown in , where proteins are represented as diamonds and glycans as circles and the interactions between them are represented as edges.
Figure 3

The lectin-glycan network generated using the RFU cut-off of 5000 is shown.

Circles represent glycan nodes and diamonds represent lectin nodes. The nodes are color coded according to their communities. Three hub nodes (shown in green diamonds) are PP2A1, WGA1 and RCA.

The lectin-glycan network generated using the RFU cut-off of 5000 is shown.

Circles represent glycan nodes and diamonds represent lectin nodes. The nodes are color coded according to their communities. Three hub nodes (shown in green diamonds) are PP2A1, WGA1 and RCA. The network representation enables a quick visual inspection of the glycans bound to a lectin of interest. Additionally, in order to identify hub lectins from the lectin-glycan array, the node degree distribution of the network was calculated and is shown in . In an interaction network, proteins that interact with a large number of partners are considered as hubs [45], and are essential components of biological networks [46]. The definition of the hub node is rather subjective, but based on the observation of the biggest gap between the 3rd and 4th largest degree nodes in , we assigned hub proteins as those three with degree larger than 220. The 3 hubs are Phloem Protein2 (PP2A1) from Arabidopsis thaliana, wheat germ agglutinin (WGA) from Triticum vulgaris (wheat), and Ricinus communis agglutinin (RCA) from Ricinus communis (castor bean). By using the Mod-CSA method, the lectin-glycan network is clustered into 4 modules (communities), which are represented by separate colors in . The largest module consists of 168 protein nodes and 215 glycan nodes, and the smallest community contains 98 protein nodes and 133 glycan nodes. To validate the lectin-glycan interaction network and its detected community-structure, we investigated the binding specificities of the first neighbors of two plant lectins, Sambucus nigra agglutinin (SNA) and concanavalin A (ConA) whose glycan binding specificities are well known. The first lectin is a well-characterized plant lectin, elderberry bark agglutinin from Sambucus nigra, which is known to recognize the Neu5Aca2-6Gal linkage [47]. The second one is concanavalin A (ConA), which is known to have specificity for mannose sugars [48], [49], [50]. Proper categorization of the specificities of glycan-binding proteins plays a significant role in understanding protein-glycan interactions and utilizing glycan-binding proteins as analytical reagents.

Binding Specificities of SNA

It is well known that some plants contain more than one lectin with different sugar binding specificities [51]. The bark of the elderberry (Sambucus nigra) has two lectins SNA-I and SNA-II with different glycan binding specificities. Sambucus nigra agglutinin I (SNA-I), is the first lectin identified from the elderberry bark which has been conventionally employed to recognize Neu5Acα2-6Gal [47] or Neu5Acα2-6Galβ1-4GlcNAc sequence [27]. SNA-I is composed of two polypeptides, namely chain A of 33 kDa with enzymatic activity, and chain B of 35 kDa with carbohydrate-binding activity [52]. Molecular modeling studies have indicated that the overall structure of SNA-I is quite similar to that of Ricin [53] and SNA-I belongs to the group of type 2 ribosome-inactivating proteins [52]. SNA-II is the second lectin isolated from the elderberry bark tissue, and it exhibits high affinity for glycoconjugates and Type 14 pneumococcal polysaccharides having multiple terminal D-Gal groups [51]. SNA-II consists of two identical carbohydrate-binding B-chains [51], [52]. In the current lectin glycan array network, nineteen nodes represent both SNA-I and SNA-II lectins. Out of these nineteen SNA nodes, fifteen SNA-I nodes are from community 1 (1000180, 1000181, 1000183, 1000184 and 1000725), and community 3 (1002793, 1004421, 1004422, 1004701, 1004702, 1004703, 1004704, 1004705, 1004706 and 1004780). Similarly, SNA-II is represented by four nodes (1004707, 1004708, 1004709 and 1004710) enriched in community 3. The 10 SNA-I nodes in community 3 show specificity for complex-type biantennary N-glycans ( ). From this table we observe that almost all of the interacting glycans possess the determinant Neu5Acα2-6Gal or Neu5Acα2-6Galβ1-4GlcNAc (shown by bold text in the table). Another interesting point to notice is that the glycans 527 and 479 exhibit low RFU values in . This could be due to the fact that these glycans contain Neu5Acα2-3 sequence, which is known to decrease the binding of SNA [27]. On the other hand, 316 (Neu5Acα2-3Galβ1-4GlcNAcβ1-2Manα1-3(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp12) contains two sequences, one (Neu5Acα2-6Galβ1-4GlcNAc) increasing the binding and the other (Neu5Acα2-3) decreasing the binding.
Table 2

Three types of complex glycans for SNA proteins are listed.

A)
Glycan No.Glycan NameAvg. RFU
268Neu5Acα2-6Galβ1-4(6S)GlcNAcβ-Sp851134
467 Neu5Acα2-6Galβ1-4GlcNAcβ1-6(Neu5Acα2-6Galβ1-4GlcNAcβ1-2)Manα1-6(GlcNAcβ1-4)(Neu5Acα2-6Galβ1-4GlcNAcβ1-4(Neu5Acα2-6Galβ1-4GlcNAcβ1-2)Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp2148246
465 Neu5Acα2-6Galβ1-4GlcNAcβ1-4Manα1-6(GlcNAcβ1-4)(Neu5Acα2-6Galβ1-4GlcNAcβ1-4(Neu5Acα2-6Galβ1-4GlcNAcβ1-2)Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp2143978
346 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6(Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAc-Sp1243812
327 Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ-Sp041668
320Galβ1-4GlcNAcβ1-2Manα1-6(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1241588
302 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6(Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1241500
483 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4(Fucα1-6)GlcNAcβ-Sp2441106
55 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1240488
348 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6Manβ1-4GlcNAcβ1-4GlcNAc-Sp1239574
606 Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ1-6(Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ1-3)GalNAca-Sp1439290
482Neu5Acα2-6Galβ1-4 GlcNAcβ1-6(Neu5Acα2-6Galβ1-4GlcNAcβ1-3)GalNAca-Sp1439202
57 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Man-a1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp2138592
56 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1337417
609 Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ1-2Manα1-6(Neu5Acα2-6Galβ1- 4GlcNAcβ1-3Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1236652
457 Neu5Acα2-6Galβ1-4GlcNAcβ1-6(Fucα1-2Galβ1-3GlcNAcβ1-3)Galβ1-4Glc-Sp2136616
325 Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-3GlcNAcβ-Sp036221
314 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3(Galβ1-4GlcNAcβ1-2Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1235848
503Neu5Acα2-6GalNAcβ1-4(6S)GlcNAcβ-Sp833405
298(6S)Galβ1-4(6S)GlcNAcβ-Sp032632
287Neu5Gcα2-6Galβ1-4GlcNAcβ-Sp031718
354Galβ1-4GlcNAcβ1-2Manα1-6(Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4(Fucα1-6)GlcNAcβ-Sp2230544
557Neu5Gcα2-8Neu5Gcα2-6Galβ1-4GlcNAc-Sp028273
366Fucα1-4(Galβ1-3)GlcNAcβ1-2Manα1-6(Fucα1-4(Galβ1-3)GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4(Fucα1-6)GlcNAcβ-Sp2227993
319 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6(Neu5Aca2-3Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1227611
54 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-N(LT)AVL27447
321 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3(Neu5Aca2-3Galβ1-4GlcNAcβ1-2Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1226481
274Neu5Acα2-6Galβ1-4Glcb-Sp825380
53 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1225345
48[9NAc]Neu5Acα2-6Galβ1-4GlcNAcβ-Sp821953
488 Neu5Acα2-6Galβ1-4GlcNAcβ1-6(Fucα1-2Galβ1-4(Fucα1-3)GlcNAcβ1-3)Galβ1-4Glc-Sp2121783
328 Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-3GlcNAcβ-Sp021014
324 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3(Neu5Acα2-3Galβ1-4GlcNAcβ1-2Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1219830
58 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp2418639
347Manα1-6(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAc-Sp1216329
464 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6(GlcNAcβ1-4)(Neu5Acα2-6Galβ1- 4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp2116237
466 Neu5Acα2-6Galβ1-4GlcNAcβ1-6(Neu5Acα2-6Galβ1-4GlcNAcβ1-2)Manα1-6(GlcNAcβ1-4)(Neu5Acα2-6Galβ1- 4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp2113858
409Neu5Acα2-6Galβ1-3GlcNAcβ1-3(Galβ1-4GlcNAcβ1-6)Galβ1-4Glc-Sp2112386
270 Neu5Acα2-6Galβ1-4GlcNAcβ-Sp811415
317 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3(Galβ1-4GlcNAcβ1-2Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1211124
360KDNa2-3Galβ1-3GalNAca-Sp1411019
485Manα1-6(Manα1-3)Manβ1-4GlcNAcβ1-4(Fucα1-6)GlcNAcβ-Sp1910968
427Fucα1-2Galβ1-3GlcNAcβ1-2Manα1-6(Fucα1-2Galβ1-3GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4(Fucα1-6)GlcNAcβ-Sp2210833
458 Neu5Acα2-6Galβ1-4GlcNAcβ1-6(Fucα1-2Galβ1-3GlcNAcβ1-3)Galβ-4Glc-Sp2110202
52 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp89467
309 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6(GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp129381
376 Neu5Acα2-6Galβ1-4GlcNAcβ1-3GalNAc-Sp148974
521 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Man-Sp08470
313Neu5Acα2-3Galβ1-4GlcNAcβ1-2Manα1-3(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp128322
316 Neu5Acα2-3Galβ1-4GlcNAcβ1-2Manα1-3(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp128189
353GlcNAcβ1-2Manα1-6(GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4(Fucα1-6)GlcNAcβ-Sp227768
527 Neu5Acα2-3Galβ1-3GlcNAcβ1-2Manα-Sp06941
478 Neu5Acα2-6Galβ1-4GlcNAcβ1-6(Galβ1-3GlcNAcβ1-3)Galβ1-4Glcb-Sp216901
315 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3(GlcNAcβ1-2Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp126606
358KDNa2-6Galβ1-4GlcNAc-Sp06532
333 Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ-Sp06339
349 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3Manβ1-4GlcNAcβ1-4GlcNAc-Sp126178
607 Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ1-2Manα1-6(Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp126161
479 Neu5Aca2-3Galβ1-4GlcNAcβ1-2Manα-Sp06154
51 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-N(LT)AVL5582
49Neu5,9Ac2a2-6Galβ1-4GlcNAcβ-Sp85207
B)
Glycan No. Glycan Name Avg. RFU
2AGP-A (AGP ConA flowthrough)52286.06
246 Neu5Acα2-6Galβ1-4GlcNAcβ–Sp849625.09
263 Neu5Gcα2-6Galβ1-4GlcNAcβ–Sp048932.24
250 Neu5Acα2-6Galβ1-4Glcβ–Sp848814.18
6Transferrin47533.26
248 Neu5Acα2-6Galβ1-4GlcNAcb1-3Galb1-4GlcNAcb-Sp047165.41
42[6OSO3]Galβ1-4Glcβ–Sp034505.6
44[6OSO3]Galβ1-4GlcNAcβ–Sp832444.86
247 Neu5Acα2-6Galβ1-4GlcNAcb1-3Galb1-4(Fuca1-3)GlcNAcb1-3Galb1-4(Fuca1-3)GlcNAcb-Sp030612.85
45[6OSO3]Galb1-4[6OSO3]Glcb-Sp827055.41
245 Neu5Acα2-6Galβ1-4GlcNAcβ–Sp026857.74
1Alpha1-acid glycoprotein (AGP)25869.33
43[6OSO3]Galβ1-4Glcβ–Sp822740.63
86GalNAcα1-3Galb–Sp821300.14
20β-GalNAc–Sp820559.23
3AGP-B (AGP ConA bound)17780.79
72Fucα1-2Galβ1-4GlcNAcβ–Sp815937.51
70Fuca1-2Galb1-4GlcNAcb1-3Galb1-4GlcNAcb1-3Galb1-4GlcNAcb-Sp014916.26
69Fucα1-2Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAc–Sp013165.86
87GalNAca1-4(Fuca1-2)Galb1-4GlcNAcb-Sp812866.52
242 Neu5Acα2-6GalNAcα–Sp812071.82
90GalNAcb1-3Gala1-4Galb1-4GlcNAcb-Sp011384.24
60Fucα1-2Galβ1-3GalNAcβ1-4(Neu5Acα2-3)Galβ1-4Glcβ-Sp910906.21
120Galβ1-3(Galβ1-4GlcNAcβ1-6)GalNAcα-Sp810546.55
150Galβ1-4GlcNAcβ1-6(Galβ1-3)GalNAcα–Sp89937.06
251 Neu5Acα2-6Galβ–Sp89853.36
73Fucα1-2Galβ1-4Glcβ–Sp09224.52
26[3OSO3][6OSO3]Galb1-4[6OSO3]GlcNAcb-Sp09118.55
59Fuca1-2Galb1-3GalNAcb1-4(Neu5Aca2-3)Galb1-4Glcb-Sp08069.79
74Fucα1-2Galβ–Sp87769.91
122Galb1-3(Neu5Aca2-6)GalNAca-Sp87693.01
10α-GalNAc–Sp86840.72
40[4OSO3]Galb1-4GlcNAcb-Sp86574.62
39[4OSO3][6OSO3]Galb1-4GlcNAcb-Sp06514.83
241 Neu5Acα2-6(Galβ1-3)GalNAcα–Sp86184.61
87GalNAca1-4(Fuca1-2)Galb1-4GlcNAcb-Sp85469.91
416-H2PO3Manα–Sp85447.01
249 Neu5Acα2-6Galβ1-4Glcβ–Sp05434.93
C)
Glycan No. Glycan Name Avg. RFU
51Manα1-6(Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1338866
352Manα1-6(Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1237659
216Manα1-6(Manα1-3)Manα1-6(Manα1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1236933
347Manα1-6(Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAc-Sp1235539
212Manα1-2Manα1-6(Manα1-3)Manα1-6(Manα1-2Manα1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1235267
213Manα1-2Manα1-6(Manα1-2Manα1-3)Manα1-6(Manα1-2Manα1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1218208
217Manα1-6(Manα1-3)Manα1-6(Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1215856
485Manα1-6(Manα1-3)Manβ1-4GlcNAcβ1-4(Fuca1-6)GlcNAcβ-Sp1912002
211Manα1-6(Manα1-2Manα1-3)Manα1-6(Manα1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp1210800
417Fuca1-2Galβ1-4(Fuca1-3)GlcNAcβ1-3GalNAca-Sp1410154
477Galβ1-3GlcNAcβ1-2Manα1-6(GlcNAcβ1-4)(Galβ1-3GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp217265
50Manα1-6(Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp126298
349 Neu5Acα2-6Galβ1-4GlcNAcβ1-2Manα1-3Manβ1-4GlcNAcβ1-4GlcNAc-Sp126173
561Gala1-3Galβ1-4GlcNAcβ1-2Manα1-6(Gala1-3Galβ1-4GlcNAcβ1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAc-Sp245565

A) Complex N-glycans for 10 SNA nodes (1002793, 1004421, 1004422, 1004780, 1004701, 1004702, 1004703, 1004704, 1004705 and 1004706) belonging to community 3 are listed. Majority of glycan nodes contain either show Neu5Acα2-6Gal or Neu5Acα2-6Galβ1-4GlcNAc, B) Four SNA (SNA-II) nodes (1004707, 1004708, 1004709 and 1004710) in the community 3 show preference for mainly mannose glycans. Only two glycans (glycan 347 and 349) possess the determinant Neu5Acα2-6Galβ1-4GlcNAc, C) less complex glycans for protein nodes 1000180, 1000181, 1000183, 1000184 and 1000725 in community 1. Few glycan show the determinant (bold and italicized) which is known to inhibit glycan binding.

A) Complex N-glycans for 10 SNA nodes (1002793, 1004421, 1004422, 1004780, 1004701, 1004702, 1004703, 1004704, 1004705 and 1004706) belonging to community 3 are listed. Majority of glycan nodes contain either show Neu5Acα2-6Gal or Neu5Acα2-6Galβ1-4GlcNAc, B) Four SNA (SNA-II) nodes (1004707, 1004708, 1004709 and 1004710) in the community 3 show preference for mainly mannose glycans. Only two glycans (glycan 347 and 349) possess the determinant Neu5Acα2-6Galβ1-4GlcNAc, C) less complex glycans for protein nodes 1000180, 1000181, 1000183, 1000184 and 1000725 in community 1. Few glycan show the determinant (bold and italicized) which is known to inhibit glycan binding. Compared to SNA-I nodes in community 3, five SNA-I nodes in community 1 (1000180, 1000181, 1000183, 1000184 and 1000725) interact with a smaller number of complex glycans (see ). Top 3 glycans possess either Neu5Acα2-6Gal or Neu5Acα2-6Galβ1-4GlcNAc and show RFU values greater than 40000. Two glycans from the second half of the table (glycans 60 and 59) show lower values of RFU because of the presence of the Neu5Acα2-3Gal sequence, which is known to decrease glycan binding. All these results are consistent with existing studies on the SNA specificity [27]. The 4 SNA-II nodes (1004707, 1004708, 1004709 and 1004710) in community 3 show preference for mainly mannose glycans or terminal GlcNAcb1-4GlcNAcb. Only two glycans (347 and 349) possess the determinant of Neu5Acα2-6Galβ1-4GlcNAc ( ). In general, SNA-II is known to be Gal/GalNAc specific and is precipitated by glycoproteins, which consist of terminal GalNAc oligosaccharide chains [51]. Specifically, it shows higher affinity for D-GalNAc- and terminal N-acetyl-D-galactosaminyl disaccharides as compared to D-Gal. Conversely, the affinity exhibited by SNA-I for D-Gal and D-GalNAc- is identical [51]. However, SNA-I recognizes Neu5Acα2-6Gal [47] or Neu5Acα2-6Galβ1-4GlcNAc glycan sequence [27] with high specificity. Despite the differences in their glycan binding specificities, SNA-I and SNA-II share some similarities. For example, both lectins contain similar amino acid composition, while SNA-II contains more asparagine/aspartic acid, glycine and methionine residues [51]. Additionally, the carbohydrate-binding B-chains of both lectins show caspase-dependent apoptosis in different insect cell lines [52]. Considering their characteristic glycan binding specificities, SNA-I and SNA-II may play different functional roles in plants.

Binding Specificities of ConA

Concanavalin A (ConA) binds to a variety of eukaryotic cells through specific interactions with saccharide-containing cellular receptors, and has been widely used as a molecular probe in studies of cell membrane dynamics and cell division [54]. ConA typically binds to glucosyl and mannosyl residues at the non-reducing termini of oligo- or polysaccharides [48], [49] and it can also bind to non-terminal mannosyl residues [50]. The current network contains sixteen nodes of ConA (1000158 and 1000165 in community 1; 1000356 and 1000699 in community 2; and 1004459, 1004460, 1004461, 1004462, 1004464, 1004465, 1004466, 1004467, 1004468, 1002791, 1004412 and 1004413 in community 3) which mainly interacts with mannose containing glycans. All ConA interacting glycan nodes from community 1, 2 and 3 are shown in , respectively. ConA interacting glycan nodes in community 1 are either mannose sugars or biantennary complex glycans such as transferrin and AGP-B. On the other hand, the ConA nodes in community 2 show preference for terminal glucose glycans.
Table 3

The table shows all types of glycans interacting with ConA protein nodes.

A)
Glycan No.Glycan NameAvg. RFU
144Manα1-2Manα1-6(Manα1-3)Manα1-6(Manα2Manα2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-N31059
139Manα1-3(Manα1-6)Manα–Sp323784
136Mana1-2Mana1-3(Mana1-2Mana1-6)Mana-Sp923161
138Mana1-3(Mana1-2Mana1-2Mana1-6)Mana-Sp917347
137Mana1-2Mana1-3Mana-Sp914700
135Mana1-2Mana1-2Mana1-3Mana-Sp914334
143Mana1-6(Mana1-2Mana1-3)Mana1-6(Manα2Manα1-3)Manb1-4GlcNAcb1-4GlcNAcb-N12786
145Manα1-2Manα1-2Manα1-3(Manα1-2Manα1-3(Manα1-2Manα1-6)Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-N12581
112α-D-Glc–Sp810407
75Galβ1-4GlcNAcβ1-3Galβ1-4Glcβ–Sp88329
151Neu5Gca2-3Galb1-4(Fuca1-3)GlcNAcb-Sp08141
59Galβ1-3GalNAcβ1-4Galβ1-4Glcβ–Sp87380
113mixed glycans: Man5-9-N–Sp16646
114Manα1-6Manα1-3(Manα1-6Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-N–Sp16600
146Manα1-3(Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ–Sp26209
6Transferrin5981
130Manα1-2Manα1-6(Manα1-3)Manα1-6(Manα2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-N–Sp15406
129Manα1-6(Manα1-3)Manα1-6(Manα2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-N–Sp15259
3AGP-B5076
102Fucα1-2Galβ1-4(Fucα1-3)GlcNAcβ–Sp85014
B)
Glycan No. Glycan Name Avg. RFU
193Manα1-2Manα1-6(Manα1-3)Manα1-6(Manα2Manα2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-N52832
194Manα1-2Manα1-2Manα1-3(Manα1-2Manα1-3(Manα1-2Manα1-6)Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-Asn52705
199Man5-9mix-Asn52238
198Mana1-6(Mana1-3)Mana1-6(Mana1-3)Manb1-4GlcNAcb1-4 GlcNAcb-Asn51705
196Mana1-3(Mana1-2Mana1-2Mana1-6)Mana-Sp949576
192Mana1-6(Mana1-2Mana1-3)Mana1-6(Manα2Manα1-3)Manb1-4GlcNAcb1-4GlcNAcb-Asn48888
190Mana1-2Mana1-3(Mana1-2Mana1-6)Mana-Sp944830
189Mana1-2Mana1-2Mana1-3Mana-Sp943717
197Manα1-6(Manα1-3)Manα1-6(Manα2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-N40190
195Manα1-3(Manα1-6)Manα–Sp938636
191Mana1-2Mana1-3Mana-Sp935442
177Glcα1-4Glcβ–Sp818139
179Glcα1-6Glcα1-6Glcβ-Sp813465
178Glcα1-4Glca–Sp812700
180Glcb1-4Glcb-Sp86825
186GlcAb1-6Galb-Sp86057
C)
Glycan No. Glycan Name Avg. RFU
609Neu5Aca2-6Galb1-4GlcNAcb1-3Galb1-4GlcNAcb1-2Mana1-6(Neu5Aca2-6Galb1-4GlcNAcb1-3Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1211536
607Neu5Aca2-6Galb1-4GlcNAcb1-3Galb1-4GlcNAcb1-3Galb1-4GlcNAcb1-2Mana1-6(Neu5Aca2-6Galb1-4GlcNAcb1-3Galb1-4GlcNAcb1-3Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1226362
577GlcNAcb1-3Galb1-4GlcNAcb1-3Galb1-4GlcNAcb1-2Mana1-6(GlcNAcb1-3Galb1-4GlcNAcb1-3Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2415181
576Galb1-4GlcNAcb1-3Galb1-4GlcNAcb1-2Mana1-6(Galb1-4GlcNAcb1-3Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2437709
575GlcNAcb1-3Galb1-4GlcNAcb1-2Mana1-6(GlcNAcb1-3Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp245650
561Gala1-3Galb1-4GlcNAcb1-2Mana1-6(Gala1-3Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAc-Sp247563
541GlcNAcb1-3Galb1-4GlcNAcb1-2Mana1-6(GlcNAcb1-3Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp2540427
528Gala1-3Galb1-3GlcNAcb1-2Mana-Sp05564
527Neu5Aca2-3Galb1-3GlcNAcb1-2Mana-Sp019479
485Mana1-6(Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp1917039
484Neu5Aca2-3Galb1-4GlcNAcb1-2Mana1-6(Neu5Aca2-3Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2434863
483Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-6(Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp246282
477Galb1-3GlcNAcb1-2Mana1-6(GlcNAcb1-4)(Galb1-3GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp2135401
476GlcNAcb1-6(GlcNAcb1-2)Mana1-6(GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2410251
474Fuca1-2Galb1-3(Fuca1-4)GlcNAcb1-2Mana1-6(Fuca1-2Galb1-3(Fuca1-4)GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp11-4(Fuca1-6)GlcNAcb-Sp198363
458GalNAca1-3(Fuca1-2)Galb1-3GlcNAcb1-2Mana1-6(GalNAca1-3(Fuca1-2)Galb1-3GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp227281
456Gala1-3(Fuca1-2)Galb1-3GlcNAcb1-2Mana1-6(Gala1-3(Fuca1-2)Galb1-3GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2231036
455GalNAca1-3(Fuca1-2)Galb1-4GlcNAcb1-2Mana1-6(GalNAca1-3(Fuca1-2)Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp229449
428Gala1-3(Fuca1-2)Galb1-4GlcNAcb1-2Mana1-6(Gala1-3(Fuca1-2)Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2219106
427Fuca1-2Galb1-3GlcNAcb1-2Mana1-6(Fuca1-2Galb1-3GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp225130
425Galb1-3GlcNAcb1-2Mana1-3(Galb1-3GlcNAcb1-2(Galb1-3GlcNAcb1-6)Mana1-6)Manb1-4GlcNAcb1-4GlcNAcb-Sp1921266
424Gala1-3(Fuca1-2)Galb1-4GlcNAcb1-2Mana1-3(Gala1-3(Fuca1-2)Galb1-4GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2232937
422GlcNAcb1-2(GlcNAcb1-6)Mana1-6(GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1910863
421Fuca1-2Galb1-4GlcNAcb1-2Mana1-6(Fuca1-2Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp227369
418GlcNAcb1-2Mana1-3(GlcNAcb1-2(GlcNAcb1-6)Mana1-6)Manb1-4GlcNAcb1-4GlcNAcb-Sp1915686
405Gala1-4Galb1-4GlcNacb1-2Mana1-6(Gala1-4Galb1-4GlcNacb1-2Mana1-3)Manb1-4GlcNacb1-4GlcNacb-Sp2436858
404Gala1-4Galb1-3GlcNacb1-2Mana1-6(Gala1-4Galb1-3GlcNacb1-2Mana1-3)Manb1-4GlcNacb1-4GlcNacb-Sp198992
399Galb1-4GlcNAcb1-2Mana1-6(GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAc-Sp1237441
398GlcNAcb1-2Mana1-6(Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAc-Sp125577
396Gala1-3Galb1-3(Fuca1-4)GlcNAcb1-2Mana1-6(Gala1-3Galb1-3(Fuca1-4)GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAc-Sp1936771
395Gala1-3Galb1-3GlcNAcb1-2Mana1-6(Gala1-3Galb1-3GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAc-Sp196293
394GlcNAcb1-2Mana1-3(Galb1-4GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4GlcNAc-Sp1228946
389GlcNacb1-2Mana1-6(GlcNacb1-4(GlcNacb1-2)Mana1-3)Manb1-4GlcNacb1-4GlcNac-Sp216161
375Gala1-3(Fuca1-2)Galb1-3GlcNAcb1-2Mana1-6(Gala1-3(Fuca1-2)Galb1-3GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp2015799
372Gala1-3(Fuca1-2)Galb1-4GlcNAcb1-2Mana1-6(Gala1-3(Fuca1-2)Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp208469
368Gala1-3Galb1-4(Fuca1-3)GlcNAcb1-2Mana1-3(Gala1-3Galb1-4(Fuca1-3)GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4GlcNAcb-Sp2021683
366Fuca1-4(Galb1-3)GlcNAcb1-2Mana1-6(Fuca1-4(Galb1-3)GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2241588
365Galb1-4GlcNAcb1-2(Galb1-4GlcNAcb1-4)Mana1-3(Galb1-4GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4GlcNAc-Sp2112174
364Gala1-3Galb1-4GlcNAcb1-2Mana1-6(Gala1-3Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp2042514
362Fuca1-2Galb1-4GlcNAcb1-2Mana1-6(Fuca1-2Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp205414
361Fuca1-2Galb1-3GlcNAcb1-2Mana1-6(Fuca1-2Galb1-3GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp2021541
360Mana1-3(Galb1-4GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4GlcNAcb-Sp1233065
357Fuca1-2Galb1-4GlcNAcb1-2Mana1-3(Fuca1-2Galb1-4GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4GlcNAcb-Sp2027555
355Galb1-3GlcNAcb1-2Mana1-6(Galb1-3GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2240353
354Galb1-4GlcNAcb1-2Mana1-6(Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2210006
353GlcNAcb1-2Mana1-6(GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2242532
352Mana1-6(Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1214973
349Galb1-3GlcNAcb1-2Mana1-3(Galb1-3GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2238876
348Galb1-4GlcNAcb1-2Mana1-3(Galb1-4GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2248582
347GlcNAcb1-2Mana1-3(GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4(Fuca1-6)GlcNAcb-Sp2241416
346Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-6(Mana1-3)Manb1-4GlcNAcb1-4GlcNAc-Sp1258408
328Galb1-4(Fuca1-3)GlcNAcb1-2Mana1-6(Galb1-4(Fuca1-3)GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp207604
327Neu5Aca2-3Galb1-4GlcNAcb1-2Mana1-6(Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1248590
322Fuca1-3(Galb1-4)GlcNAcb1-2Mana1-3(Fuca1-3(Galb1-4)GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4GlcNAcb-Sp2019979
321GlcNAcb1-2Mana1-6(Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1242652
320Neu5Aca2-3Galb1-4GlcNAcb1-2Mana1-3(Neu5Aca2-3Galb1-4GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4GlcNAcb-Sp1229652
319Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-6(Neu5Aca2-3Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1245597
317Mana1-2Mana1-6(Mana1-2Mana1-3)Mana1-6(Mana1-2Mana1-2Mana1-3)Mana-Sp925444
316Mana1-2Mana1-6(Mana1-3)Mana1-6(Mana1-2Mana1-2Mana1-3)Mana-Sp936802
315Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-3(GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4GlcNAcb-Sp1237868
314Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-3(Galb1-4GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4GlcNAcb-Sp1237828
313Manα1-2Manα1-2Manα1-3(Manα1-2Manα1-6(Manα1-3)Manα1-6)Manα-Sp916574
312Manα1-6(Manα1-3)Manα1-6(Manα1-3)Manβ-Sp106909
309Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-6(GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1241526
302Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-6(Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp126413
217Mana1-6(Mana1-3)Mana1-6(Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1238826
216Mana1-6(Mana1-3)Mana1-6(Mana1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1228738
215Mana1-2Mana1-2Mana1-6(Mana1-3)Mana-Sp934121
215Mana1-2Mana1-2Mana1-6(Mana1-3)Mana-Sp934119
214Mana1-6(Mana1-3)Mana-Sp939736
213Manα1-6(Manα1-3)Manα1-6(Manα1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp128657
212Mana1-2Mana1-6(Mana1-3)Mana1-6(Mana1-2Mana1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1255882
211Manα1-3(Manα1-6)Manα-Sp918045
210Mana1-2Mana1-3Mana-Sp935979
209Manα1-2Manα1-6(Manα1-3)Manα1-6(Manα1-2Manα1-2Manα1-3)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp127614
208Mana1-2Mana1-2Mana1-3Mana-Sp937383
207Manα1-2Manα1-3Manα-Sp96620
205Mana1-3(Mana1-2Mana1-2Mana1-6)Mana-Sp924990
58Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-6(Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp248188
57Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-6(Neu5Aca2-6Galb1-4GlcNAcb1-2Man-a1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp2138425
56Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-6(Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp135338
55Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-6(Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1261880
54Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-3(Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4GlcNAcb-Sp1340848
53GlcNAcb1-2Mana1-6(GlcNAcb1-2Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1342923
52Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-3(Neu5Aca2-6Galb1-4GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4GlcNAcb-Sp854802
51Mana1-6(Mana1-3)Manb1-4GlcNAcb1-4GlcNAcb-Sp1342505
50Manα1-3(Manα1-6)Manβ1-4GlcNAcβ1-4GlcNAcβ-Sp138976
49GlcNAcb1-2Mana1-3(GlcNAcb1-2Mana1-6)Manb1-4GlcNAcb1-4GlcNAcb-Sp135961
48Mana1-3(Mana1-6)Manb1-4GlcNAcb1-4GlcNAcb-Sp1337798

A) ConA interacting glycan nodes from community 1 are shown. These ConAs interact either with mannose nodes or biantennary complex glycans such as Transferrin and AGP-B, B) ConA interacting glycan nodes from community 2 are shown. They show preference for terminal glucose glycans, C) ConA nodes in community 3 show high preference for “N-glycan, high mannose” sugars.

A) ConA interacting glycan nodes from community 1 are shown. These ConAs interact either with mannose nodes or biantennary complex glycans such as Transferrin and AGP-B, B) ConA interacting glycan nodes from community 2 are shown. They show preference for terminal glucose glycans, C) ConA nodes in community 3 show high preference for “N-glycan, high mannosesugars. In comparison to communities 1 and 2, the ConA nodes in community 3 show high preference for mannose containing sugars especially “N-glycan, high mannose” ( ). These results agree with existing reports on ConA’s binding structure and specificity for mannose containing structures [55]-[57], in addition to the recognition of biantennary glycans, complex N-glycans [58] and terminal glucose [57]. Existing studies on SNA-I [47] and ConA [55]-[57] demonstrate the validity of the lectin-glycan interaction network and its detected community structure. Once a network is constructed, it is fairly easy to identify a lectin that explicitly binds to a certain glycan sequence by just selecting the lectin node of interest and its first neighbors in the network. The lectins in different communities show a dramatic difference in their glycan binding specificities. The current network-based approach should provide quick overall analysis and the use of glycan microarray data on the lectin-glycan interaction without time-consuming calculations.

Community Detection of the Lectin-glycan Interaction

We performed community detection of the lectin-glycan interaction network by using Mod-CSA [28], and compared the results with existing methods such as MCL [34], [35], MCODE [36] and greedy algorithm [32], [38]. The number of identified communities and the modularity values obtained by various community detection algorithms are shown in , and .
Table 4

A summary of various clustering methods tested in this work.

MethodNo. of ClustersModularityDescription
Mod-CSA40.37The conformational space annealing based modularityoptimization method.
Greedy60.30Fast greedy community detection algorithm.
MCODE23−0.04Bader and Hogue algorithm for findingmodules in networks.
MCL33−0.81Markov clustering algorithm from van Dongenthat uses random walks to simulate flow.

Mod-CSA outperforms the other popular clustering methods in terms of the modularity score.

Figure 4

Graphical representation of four communities identified by Mod-CSA is shown.

The figure provides an overall picture of the whole network with four main functional categories based on the p-value analysis.

Figure 5

Communities generated by four methods are shown.

(a) Mod-CSA generated communities are shown. In each community, glycans nodes are represented by circles whereas the protein nodes are shown as diamonds. From the figure it can be seen that all the nodes in a network have been assigned to a community. Community 1 has PP2A1 as hub node where as Community 4 has two hub nodes, WGA1 and RCA. (b) Greedy algorithm generated communities are shown. The nodes are color coded as per the Mod-CSA result. Each of the first three communities (community 1 to 3) contain a hub node where as communities 4-6 have only a few nodes.(c) MCODE generated communities are shown. Many nodes are not clustered, and the three hubs are grouped into one community. (d) MCL generated communities are shown many nodes are not clustered at all. Hub nodes are not clustered with any other nodes.

Graphical representation of four communities identified by Mod-CSA is shown.

The figure provides an overall picture of the whole network with four main functional categories based on the p-value analysis.

Communities generated by four methods are shown.

(a) Mod-CSA generated communities are shown. In each community, glycans nodes are represented by circles whereas the protein nodes are shown as diamonds. From the figure it can be seen that all the nodes in a network have been assigned to a community. Community 1 has PP2A1 as hub node where as Community 4 has two hub nodes, WGA1 and RCA. (b) Greedy algorithm generated communities are shown. The nodes are color coded as per the Mod-CSA result. Each of the first three communities (community 1 to 3) contain a hub node where as communities 4-6 have only a few nodes.(c) MCODE generated communities are shown. Many nodes are not clustered, and the three hubs are grouped into one community. (d) MCL generated communities are shown many nodes are not clustered at all. Hub nodes are not clustered with any other nodes. Mod-CSA outperforms the other popular clustering methods in terms of the modularity score. From , & , it is clear that Mod-CSA [29] outperforms the other clustering methods in terms of the modularity score as well as the number of nodes left unclassified. The only method comparable to our modularity score of 0.37 obtained by Mod-CSA was the fast greedy algorithm [32], [38] with a modularity score of 0.30. The algorithm recognizes clusters by repetitively eliminating edges from the network and then checks again which nodes are still connected [59]. The method detected 6 communities with the largest community containing 223 protein nodes and 298 glycan nodes (community 1) whereas the three smallest communities consist of either 4 nodes (community 4) or 3 nodes (community 5 & 6) only (see ). To compare the biological significance of modules (communities) obtained by Mod-CSA and by the greedy algorithm, we calculated the numbers of statistically meaningful enriched clusters of lectins that bind to the same specific glycan. The glycan binding specificity of each protein node was identified either from the literature or from Uniprot database as described in the methods section, and the significance of each glycan specific clusters was assessed by calculating its p-value (p≤0.05). From , we observe that 44 statistically meaningful enriched clusters of lectins are identified with p-values ≤0.05. Whereas only 33 enriched clusters are identified by the greedy algorithm. This result suggests that many additional functionally related lectin clusters are identified by Mod-CSA, than detected by greedy algorithm.
Table 5

Lists of statistically meaningful enriched clusters (p≤0.05) of lectins binding to the identical glycan are shown.

Mod-CSA (Q = 0.366)Community (Glay) (Q = 0.3)
Cluster No.No. of membersReported SpecificityP-valueClusterNo.No. ofmembersReported SpecificityP-value
1168a-Linked terminal GalNAc0.00551223a-Linked terminal GalNAc0.0028
Chitin oligomers, Sia0.0109Chitin oligomers, Sia0.0006
Fuca1-2Galb1 -> or GalNAcb1 -> groups at their nonreducing terminals 0.0347Fuca1-2Galb1-3GalNAcb1-4(Neu5Aca2-3)Galb1-4Glcb OR Galb1-3GalNAcb1-4(Neu5Aca2-3)Galb1-4Glcb0.0352
Fuca1-2Galb1-3GalNAcb1-4(Neu5Aca2-3)Galb1-4Glcb OR Galb1-3GalNAcb1-4(Neu5Aca2-3)Galb1-4Glcb0.0112Fuca1-2Galb1-4GlcNAc0.0001
Fuca1-2Galb1-4GlcNAc3.49E-06Fuca1-6GlcNAc, Fuca1-3(Galb1-4)GlcNAc0.0028
Fuca1-6GlcNAc (core fucose) 0.0347Fucose binding lectin0.0028
Fuca1-6GlcNAc, Fuca1-3(Galb1-4)GlcNAc0.0004 Galactose binding lectin 0.0117
Fucose binding lectin0.0004Galactose- and N-acetylgalactosamine-binding0.0065
Galactose- and N-acetylgalactosamine-binding0.0147Galb1-3GalNAc0.0052
Mannose binding lectin0.0001Galb1-3GalNAc, GalNAc0.009
terminal N-acetylgalactosamine (GalNAc) 0.0347 Galα(1,3)Gal 0.0352
298Agalactosylated tri/tetra antennary glycans, GlcNAc0.0011High-mannose, Mana1-3Man0.0403
Chitin oligomers, Sia5.27E-10Mannose binding lectin3.47E-06
Galb1->3GalNAc-a-4.40E-08N,N'-diacetyllactosediamine(GalNAcβ1-4GlcNAc, LacdiNAc)0.0369
Galb1-3GalNAc0.0007 Siaa2-3Galb1- 0.0151
Galb1-3GalNAc, GalNAc0.00212190(GlcNAcb1-4)n, Galb1-4GlcNAc0.0257
Mannose binding lectin0.0207Agalactosylated tri/tetra antennary glycans, GlcNAc0.0257
N-acetylglucosamine and N-acetylneuraminic acid 0.0068Chitin oligomers, Sia3.16E-07
3147Fuca1-6GlcNAc, a-D-Glc, a-D-Man3.36E-05Fuca1-2Galb1-4GlcNAc0.0119
Galb1-3GalNAc0.0114Fuca1-6GlcNAc, a-D-Glc, a-D-Man0.0005
High-mannose, Mana1-3Man0.0223Galactose binding lectin0.0177
High-mannose, Mana1-3Man, Mana1-6Man0.0162Galb1->3GalNAc-a-4.17E-05
High-mannose, Mana1-6(Mana1-3)Man8.12E-07High-mannose, Mana1-6(Mana1-3)Man0.0485
High-mannose, Mana1-6Man0.0026Mannose binding lectin0.0196
Mana1-3(Mana1-6)Man, bi- and tri-antennary complex-type N-glycan, GalNAc 0.0232Siaa2-6Gal/GalNAc0.0469
Manb Anywhere 0.0232Tri/tetra-antennary complex-type N-glycan0.0184
Mannose binding lectin5.86E-06393High-mannose, Mana1-3Man0.0092
N-acetylglucosamine0.0162High-mannose, Mana1-3Man, Mana1-6Man0.0105
Siaa2-6Gal/GalNAc0.0044High-mannose, Mana1-6Man5.32E-06
Subterminal Mannose0.0232Mannose binding lectin7.02E-14
4100(GlcNAcb1-4)n, Galb1-4GlcNAc0.0013N-acetylglucosamine0.0105
a- or b-linked terminal GalNAc, GalNAca1-3Gal 0.0003Siaa2-6Gal/GalNAc0.0049
Agalactosylated tri/tetra antennary glycans, GlcNAc0.0013Subterminal Mannose0.0058
Bi-antennary complex-type N-glycan with outer Gal and bisecting GlcNAc 0.005644NANA
Galb1-4GlcNAc 0.000453NANA
GalNAca1-3GalNAc, blood group A antigen 0.005663NANA
GalNAcb1-4GlcNAc, Galb1-3(-6)GalNAc 0.0056
GlcNAc oligomers, oligosaccharide containing GlcNAc and LacNAc 0.0474
GlcNAc trimers/tetramers 0.0056
Mannose binding lectin0.0003
N,N'-diacetyllactosediamine(GalNAcβ1-4GlcNAc, LacdiNAc)0
Siaa2-3Galb1-3(Siaa2-6)GalNAc 0.0036
Siaa2-3Galb1-4GlcNAc 0.0377
Tri/tetra-antennary complex-type N-glycan0.0234

Communities generated by Mod-CSA and greedy algorithm are used. The statistical significance of each reported glycan binding lectin was calculated by hypergeometric distribution using p≤0.05. For each glycan listed in , interacting lectin nodes were identified to calculate the significance of the community structure determined in this study. The number of statistically significant glycan-specific groups according to Mod-CSA partitioning is 44 (p-value <0.05) while greedy algorithm provides only 33 groups. 15 glycan-specific groups generated by Mod-CSA but not by greedy algorithm are shown in bold, whereas 3 groups generated by greedy algorithm but not by Mod-CSA are shown in italic bold.

Communities generated by Mod-CSA and greedy algorithm are used. The statistical significance of each reported glycan binding lectin was calculated by hypergeometric distribution using p≤0.05. For each glycan listed in , interacting lectin nodes were identified to calculate the significance of the community structure determined in this study. The number of statistically significant glycan-specific groups according to Mod-CSA partitioning is 44 (p-value <0.05) while greedy algorithm provides only 33 groups. 15 glycan-specific groups generated by Mod-CSA but not by greedy algorithm are shown in bold, whereas 3 groups generated by greedy algorithm but not by Mod-CSA are shown in italic bold. For example, the greedy algorithm failed to identify 15 glycan specific lectin clusters (shown in bold in ) that were identified by Mod-CSA. On the contrary, 3 glycan specific clusters (shown in italic bold in ) were not detected by Mod-CSA, which are found by the greedy algorithm result. Specifically, the greedy algorithm failed to identify all fucose specific lectins, while Mod-CSA [29] successfully detected almost all fucose specific lectins and grouped them in community 1. Similarly, the greedy algorithm identified only five mannose related specificities in community 3, which is the major mannose binding community detected by greedy algorithm. However, Mod-CSA recognized eight mannose related specificities in community 1. We compared our method with other popular clustering algorithms such as MCODE [36] and MCL [34], [35]. MCODE method divided the network into a total of 23 clusters with the modularity score of −0.036. The largest cluster consists of 56 nodes whereas the smallest cluster contains only 4 nodes. However, only 3 clusters contain more than 10 protein nodes and they were further analyzed for enrichment of glycan specific lectin groups. The statistical analysis of these 3 clusters resulted in only 4 statistically meaningful lectin groups. From , we observe that a large number of single nodes (791) are not clustered into any groups. This is because MCODE identifies clusters of tightly connected nodes and does not intend to assign every node in the network to a cluster [59]. The main reason for this could be the fact that the MCODE algorithm is sensitive to noise in the network, particularly to false positive interactions [60]. Consequently, only a small number of strongly connected clusters are identified by MCODE and the rest of the nodes remain unclustered, which makes it hard to extract information from the network. Among all four methods tested, the MCL algorithm performed worst in terms of its modularity value of −0.815. MCL detected 33 clusters with the largest cluster consisting of 340 nodes while the smallest cluster has 2 nodes ( ). Similar to MCODE, the MCL method detected only 3 clusters containing more than 10 protein nodes and many nodes (689) in the network were not assigned to any group, again making it difficult to interpret these unassigned nodes. Therefore, these unassigned nodes were left out for further analysis. The MCL method resulted in only 12 statistically significant glycan specific groups. If the performances of MCL and MCODE are hindered by false positive interactions, MCL and MCODE may perform better with networks generated using only reliable data. To find out if the Mod-CSA method outperforms the other methods regardless of the amount of potentially false information, we performed the enriched cluster analysis on two additional networks generated using more stringent RFU criteria, RFU ≥10000 and RFU ≥20000 (see ). The results remain same regardless of the RFU cutoff values used to generate the network. For example, the numbers of statistically significant glycan specific groups identified by Mod-CSA are 41 and 35 using RFU cutoff values of 10000 and 20000, respectively. However, the greedy algorithm provides 23 and 20 statistically significant glycan specific groups. Similarly, with the MCL method, 20 and 14 statistically significant glycan specific groups were identified (see . Surprisingly, MCODE detected no statistically significant glycan specific lectin groups from more stringent networks. Finally? we compared the clusters obtained by Mod-CSA with random clusters. We divided the nodes into four random clusters, which have the same number of nodes with those detected by Mod-CSA. This process was iterated 20 times and the average number of statistically enriched glycan-specific groups detected by random clustering was compared with that by Mod-CSA. The maximum and minimum number of significantly enriched lectin groups was 11 and 1, respectively. On average, these 20 random permutations of clusters resulted in about 7 glycan-specific lectin groups having p-value ≤0.05 (see ). A comparison of the number of significantly enriched lectin groups detected by the different clustering methods is shown in . All these results demonstrate that Mod-CSA extracts more information than the other widely used clustering methods, and it can serve as a powerful tool for investigating the lectin-glycan interaction.
Figure 6

The number of statistically significant glycan-specific groups are shown for three networks generated with RFU cutoff values of 5000 (blue), 10000 (red), 20000 (green).

The random clusterings are generated using the four community results of Mod-CSA, and the average and the standard deviation is calculated from 20 runs.

The number of statistically significant glycan-specific groups are shown for three networks generated with RFU cutoff values of 5000 (blue), 10000 (red), 20000 (green).

The random clusterings are generated using the four community results of Mod-CSA, and the average and the standard deviation is calculated from 20 runs.

The Optimal Community Structure of the Lectin-glycan Interaction Network

It has been shown that Mod-CSA can provide globally optimal modularity partitioning of a network containing up to 2000 nodes [31]. Since our lectin-glycan network has 1119 nodes, we believe that the Mod-CSA result corresponds to the optimal grouping of the network in terms of its modularity. The optimal modularity grouping of lectins and glycans results in 4 communities with the modularity score of 0.37. We attempted to explore the relationship between all nodes within the same community on the basis of structure and function of each lectin and the type of glycan binding specificity. Each lectin node was assigned with its known glycan binding specificity, and the statistical significance of their grouping was assessed by calculating its p-value (p≤0.05) (see and ). A brief description of each community is given below:

Community 1 (Fucose specific)

This is the largest community of the lectin-glycan network detected by Mod-CSA analysis and contains 168 protein nodes and 215 glycan nodes, respectively. This community is dominated by protein nodes with fucose specific lectins, such as ulex europaeus agglutinin I (UEA-I), aleuria aurantia lectin (AAL), ralstonia solanacearum lectin (RSL), etc. The fucose binding sites of RSL are very similar to those of previously reported five fucose-binding sites of AAL [61]. Fucose-containing xyloglucans are known to promote signaling consequences on plant tissues [62]. The other types of overrepresented lectins in this community have specificity for Galactose- and N-acetylgalactosamine binding with cell adhesion as their main function. The most common protein domains correspond to these galactose specific lectins are H_lectin (PFAM ID: PF09458) domain, which is involved in self/non-self recognition of cells through binding with carbohydrates [63], and Galactose-binding domain-like domain known as Discoidin domain (PFAM ID: PF00754), which is found in many blood coagulation factors. The galactose specific lectins in this community include agglutinin from Helix pomatia, Discoidin I and Discoidin II from Dictyostelium discoideum (Slime mold). Additionally, the unannotated lectins in this cluster such as 6RG, Tap1, Mubin1 show specificity for galactose or fucose sugars (see Table S5), which strongly indicates that these proteins are related to cell adhesion. This community contains the top hub PP2A1 (1001943) with the largest node degree of 257. The other three PP2A1 nodes (1002090, 1002091 and 1002092) belong to community 2. The list of unique glycans that interact with these PP2A1 nodes are summarized in Table S6. From this table it can been seen that PP2A1 nodes show specificity for a diverse range of glycans such as GlcNAc, high-mannose N-glycans and sialic acid containing glycans. Recently, Beneteau et al., (2010) [64] in their glycan array experiments have shown that PP2A1 binds to different types of carbohydrates. This indicates the possibility that the phloem PP2 lectin plays roles in numerous functions, recognizing either endogenous glycoproteins or glycosylated receptors of pathogens. This diversity in glycan binding by PP2A1 could be attributed to the presence of several carbohydrate-binding sites in PP2A1 [64].

Community 2 (Galb1-3GalNAc specific)

This is the smallest community with 98 protein nodes and 133 glycan nodes. Community 2 is rich in N-acetylglucosamine and N-acetylgalactosamine binding lectins such as Wheat Germ Agglutinin (WGA), Griffonia simplicifolia II (GS-II), and Sclerotium rolfsii lectin (SRL). WGA belongs to a highly conserved family of chitin-binding lectins from cereals (Gramineae), such as rye, barley, rice and wheat [65]. Chitin, a polymer of β-1,4-N-acetylglucosamine is present in the cell wall of many fungi, in the exoskeleton and digestive tract of some insects, and in some nematodes [66]. Similarly, GS-II, also an N-acetylglucosamine-specific legume lectin, has insecticidal activity against cowpea weevil [67]. In contrast to WGA and GS-II, SRL displays strong binding to O-linked galactose-beta-1,3-N-acetylgalactosamine, disaccharide (Thomsen Friedenreich antigen) similar to Agaricus bisporus lectin [68]. Similarly, the other N-acetylgalactosamine specific lectins in this group are involved in the binding of T-antigen structure Gal-beta1,3-GalNAc e.g. Agglutinin alpha chain (Jacalin alpha chain) from Artocarpus integer (Jack fruit) and Agglutinin alpha chain (MPA) from Maclura pomifera (Osage orange). Unannotated protein nodes are represented by lectins such as Protein PHLOEM PROTEIN 2-LIKE A1 (PP2A1) from Arabidopsis thaliana and Codium fragile lectin (CFT) from Codium fragile [(Dead man's fingers) (Green alga)]. PP2A1 is known to interact with diverse types of carbohydrates and may be involved in numerous recognition functions [64]. On the other hand, CFT shows preference for the a-anomer of GalNAc and recognizes GalNAca1 sequences as well as high affinity for the Forssman pentasaccharide and for Galb1->3GalNAc-a- [69], which is one of the overrepresented (p-value <0.05) glycan specific group in this community. Lists of unique glycans for PP2A1 and CFT nodes are summarized in .

Community 3 (Mannose specific)

Protein nodes in this group are dominantly mannose binding lectins and nine out of twelve statistically significant glycan groups are mannose specific. Many members of these mannose specific lectins have B_lectin (PFAM ID: PF01453) structural domain. The members of this family are mannose specific and belong to Bulb lectin super-family (Amaryllidaceae, Orchidaceae and Aliaceae).For example, Galanthus nivalis agglutinin (GNA), a mannose-specific lectin from snowdrop bulbs, is a tetrameric member of the family of Amaryllidaceae lectins that exhibit antiviral activity towards HIV [70]. Other mannose binding lectins in this group have Lectin_legB (PFAM ID: PF00139) structural domain and require metal ions like Ca and Mn ions for carbohydrate binding and cell-agglutinating activities. Examples include ConA and Garden pea lectin. The group also includes various high mannose binding lectins such as Hippeastrum hybrid lectin (HHL), Narcissus psuedo-narcissus agglutinin (NPA), Salt stress-induced protein, Allium sativum agglutinin (ASA), etc. Another mannose binding lectin in this group which has an antiviral activity is Cyanovirin-N (CV-N). The antiviral activity of CV-N is mediated through specific interactions with the viral surface envelope glycoproteins gp120 and gp41, as well as to high-mannose oligosaccharides found on the HIV envelope [71]. Other lectins that were grouped in this community for which we could not find the reported glycan specificity include Arum maculatun agglutinin (AMA), Caragana arborescens agglutinin (CAA), Colchicum autumnale lectin (CA), and Arisaema helleborifolium schott lectin (AHL). All these lectins also show high specificity for mannose sugars (). Overall the community consists of 147 protein nodes and 124 glycan nodes.

Community 4 (GalNAc specific)

From it can be observed that this community is enriched in GalNAc specific lectins such as Datura stramonium agglutinin (DSA), Soybean agglutinin (SBA), Vicia villosa agglutinin (VVA), Bauhinia purpurea lectin (BPL), etc. These galactose specific lectins may play a significant role in cell-agglutinating activities e.g. VVA (Lectin B4) from Vicia villosa (Hairy vetch). Another galactose-specific lectin in this group is a legume lectin known as Erythrina cristagalli lectin (ECL) [72]. Although its function in the legume is unknown, it has been shown that ECL possesses hemagglutinating activity and it is believed to be mitogenic for human T lymphocytes [73]. A large number of plant and fungal proteins (e.g. solanaceous lectins of tomato and potato, plant endochitinases, the wound-induced proteins: hevein, win1 and win2, and the Kluyveromyces lactis killer toxin alpha subunit) that bind N-acetylglucosamine contain chitin-binding domain (PFAM ID: PF00187). These proteins might function as a defence against chitin containing pathogens, e.g. Chitin-binding lectin 1 of Solanum tuberosum (Potato). This community also includes lectins such as Macrolepiota procera agglutinin (MPA) and Laccaria bicolor lectin both of which show high specificity for complex GalNAc glycans (). This community consists of 100 protein and 134 glycan nodes. Additionally, this community includes 2 out of three hub nodes identified in the lectin-glycan array network. One of the hubs represent protein node (1004763) for wheat germ agglutinin (WGA) from Triticum vulgaris (wheat), whereas the second node (1004668) represents Ricinus communis agglutinin (RCA) from Ricinus communis (castor bean). WGA is a stable homodimer protein and exhibits specificity for N-acetylneuraminic acid and N-acetylglucosamine (GlcNAc) sugars. The glycans for WGA hub node are summarized in and it can be observed that almost all these glycans have GlcNAc group, while few others contain N-acetylneuraminic acid. Each monomeric unit of WGA consists of four domains (A–D) which can be further classified into “primary” (B and C domains) and “secondary” (A and D domains) binding sites showing dissimilar affinities for GlcNAc containing moieties [74]. These structural characteristics and the closeness of binding sites make WGA a worthy candidate to explore multivalent protein-carbohydrate interactions and to assess the impact of structural modifications of glycoclusters [75]. These multivalent interactions are favorable as compared to monomeric ones and are frequently employed by nature to control an array of diverse biological processes [76]. RCA as well as ECL recognize carbohydrate chains with non-reducing terminal β-d-galactose (Galβ) and show preference to Galβ1-4GlcNAc instead of Galβ1-3GlcNAc sequence [77], [78]. The diverse types of glycans including Galβ1-4GlcNAc that interact with RCA hub node are listed in . The table also shows many Neu5Aca2-6Galb1 sugars having large RFU values.?RCA is a glycoprotein from seeds of castor plants and one of the most important applied lectins that have been widely used as a tool to study cell surfaces and to purify glycans [79]. RCA promotes binding and agglutination of polysaccharides and glycoproteins in addition to liposomes and micelles containing glycolipids with galactosyl residues [80], [81]. Furthermore, the specificities of interactions of RCA with neutral and sialylated oligosaccharides have been well established and is consistent with our results as summarized in [82]. The current community-based network study of the lectin-glycan microarray data provides not only a quick and systematic analysis of lectin specificities, but also global organization and grouping of biologically related lectins along with their binding partners (glycans). Such information will be vital to identify lectins that bind to particular glycan structures or to catalogue lectins according to the similarity in specificities. Another important significance of the community-based network analysis is the identification of a novel lectin and the initial guess about its specificity. For this, a sequence database should be constructed for each community identified and a target lectin under investigation should be fed into the databases to get an idea about the structural/functional role of the query lectin and the type of glycans it might bind to. This approach will be more practical when the communities have a large number of different lectins and might help in determining the glycan binding nature of a given lectin. There are many network-based protein function prediction methods along with approaches utilizing structural or sequence information of proteins. Recently, when dealing with a protein-protein-interaction network, it has been shown that more accurate protein function prediction results were obtained by modularity based community detection of the network. The current study provides the first attempt to study lectin-carbohydrate interactions via community detection of a network.

Conclusion

We have constructed a bipartite lectin-glycan interaction network from the collection of glycan microarray data. The network itself provides a quick and global view of the lectin-glycan interaction from which hub proteins are identified. We find that the hub proteins match well with the characteristics of known biological relevance. Using Mod-CSA, a recently developed efficient community detection method, 4 modules are identified. The clustering results are shown to be biologically more meaningful than those obtained by other widely used methods. Most significantly, 44 statistically significant glycan specific groups are identified including fucose and mannose binding ones, some of which could not be detected by alternative methods. Even with more strict RFU cut-offs, clusters generated by Mod-CSA provide consistently better results as compared to other methods. We provide overall analysis of 4 communities identified in the lectin-glycan microarray network. We also show how multiple lectins from the same plant, such as Sambugus nigra (SNA-I and SNA-II) are grouped into different communities based on their glycan binding specificities. The network study provides a framework to get a broad picture of data containing many interacting components. These capabilities of a community-based network analysis allow researchers to explore, analyze and compare a variety of proteins and glycans within the context of modules/communities identified in the network. We expect that this will trigger interest in the prediction of protein-carbohydrate interactions using biological networks and will have wider applications as additional glycan binding proteins are identified. The method can also be applied to study other types of lectins as well as other interaction networks. List of all protein nodes, their clusters and reported specificity in the lectin-glycan network. (XLS) Click here for additional data file. The list of meaningful glycan-specific groups and their P-values detected by Mod-CSA and greedy algorithm (GLAY) at RFU ≥10000 and RFU ≥20000. (XLS) Click here for additional data file. The list of meaningful glycan-specific groups and their P-values detected by MCL and MCODE at RFU ≥5000, RFU ≥10000 and RFU ≥20000. (XLS) Click here for additional data file. List of randomly identified statistically significant glycan-specific groups. (DOCX) Click here for additional data file. List of unique galactose and fucose sugars that interact with unannotated 6RG, Tap1, and Mubin at RFU ≥5000 in the lectin-glycan network. (XLS) Click here for additional data file. List of diverse glycans that interact with the hub PP2A1 at RFU ≥5000 in the lectin-glycan network. (XLS) Click here for additional data file. Lists of unique glycans for PP2A1 and CFT at RFU ≥5000 in the lectin-glycan network. (XLS) Click here for additional data file. List of unique glycans for unannotated lectins Arum maculatun agglutinin (AMA), Caragana arborescens agglutinin (CAA), Colchicum autumnale lectin (CA), and Arisaema helleborifolium schott lectin (AHL) that show high specificity for mannose sugars at RFU ≥5000 in the lectin-glycan network. (XLS) Click here for additional data file. List of complex glycans that show high specificity for lectins such as Macrolepiota procera agglutinin (MPA) and Laccaria bicolor lectin. (XLS) Click here for additional data file. List of diverse glycans that interact with the hub WGA at RFU ≥5000 in the lectin-glycan network. (XLS) Click here for additional data file. List of diverse glycans that interact with the hub RCA at RFU ≥5000 in the lectin-glycan network. (XLS) Click here for additional data file.
  77 in total

1.  Analysis and prediction of carbohydrate binding sites.

Authors:  C Taroni; S Jones; J M Thornton
Journal:  Protein Eng       Date:  2000-02

Review 2.  Cyanovirin-N: a sugar-binding antiviral protein with a new twist.

Authors:  I Botos; A Wlodawer
Journal:  Cell Mol Life Sci       Date:  2003-02       Impact factor: 9.261

3.  X-ray sequence ambiguities of Sclerotium rolfsii lectin resolved by mass spectrometry.

Authors:  G J Sathisha; Y K Subrahmanya Prakash; V B Chachadi; N N Nagaraja; S R Inamdar; D D Leonidas; H S Savithri; B M Swamy
Journal:  Amino Acids       Date:  2007-12-28       Impact factor: 3.520

Review 4.  Use of glycan microarrays to explore specificity of glycan-binding proteins.

Authors:  David F Smith; Xuezheng Song; Richard D Cummings
Journal:  Methods Enzymol       Date:  2010       Impact factor: 1.600

5.  Evaluation of clustering algorithms for protein-protein interaction networks.

Authors:  Sylvain Brohée; Jacques van Helden
Journal:  BMC Bioinformatics       Date:  2006-11-06       Impact factor: 3.169

6.  A motif-based analysis of glycan array data to determine the specificities of glycan-binding proteins.

Authors:  Andrew Porter; Tingting Yue; Lee Heeringa; Steven Day; Edward Suh; Brian B Haab
Journal:  Glycobiology       Date:  2009-11-29       Impact factor: 4.313

7.  The gene for stinging nettle lectin (Urtica dioica agglutinin) encodes both a lectin and a chitinase.

Authors:  D R Lerner; N V Raikhel
Journal:  J Biol Chem       Date:  1992-06-05       Impact factor: 5.157

8.  Prediction of protein-glucose binding sites using support vector machines.

Authors:  Houssam Nassif; Hassan Al-Ali; Sawsan Khuri; Walid Keirouz
Journal:  Proteins       Date:  2009-10

9.  clusterMaker: a multi-algorithm clustering plugin for Cytoscape.

Authors:  John H Morris; Leonard Apeltsin; Aaron M Newman; Jan Baumbach; Tobias Wittkop; Gang Su; Gary D Bader; Thomas E Ferrin
Journal:  BMC Bioinformatics       Date:  2011-11-09       Impact factor: 3.307

10.  Identification of mannose interacting residues using local composition.

Authors:  Sandhya Agarwal; Nitish Kumar Mishra; Harinder Singh; Gajendra P S Raghava
Journal:  PLoS One       Date:  2011-09-13       Impact factor: 3.240

View more
  6 in total

1.  GlyNet: a multi-task neural network for predicting protein-glycan interactions.

Authors:  Eric J Carpenter; Shaurya Seth; Noel Yue; Russell Greiner; Ratmir Derda
Journal:  Chem Sci       Date:  2022-05-16       Impact factor: 9.969

2.  CHARMM-GUI Glycan Modeler for modeling and simulation of carbohydrates and glycoconjugates.

Authors:  Sang-Jun Park; Jumin Lee; Yifei Qi; Nathan R Kern; Hui Sun Lee; Sunhwan Jo; InSuk Joung; Keehyung Joo; Jooyoung Lee; Wonpil Im
Journal:  Glycobiology       Date:  2019-04-01       Impact factor: 4.313

3.  Network Analysis for the Identification of Differentially Expressed Hub Genes Using Myogenin Knock-down Muscle Satellite Cells.

Authors:  Adeel Malik; Eun Ju Lee; Arif Tasleem Jan; Sarafraz Ahmad; Kyung-Hyun Cho; Jihoe Kim; Inho Choi
Journal:  PLoS One       Date:  2015-07-22       Impact factor: 3.240

4.  Effect of algae and plant lectins on planktonic growth and biofilm formation in clinically relevant bacteria and yeasts.

Authors:  Mayron Alves Vasconcelos; Francisco Vassiliepe Sousa Arruda; Victor Alves Carneiro; Helton Colares Silva; Kyria Santiago Nascimento; Alexandre Holanda Sampaio; Benildo Cavada; Edson Holanda Teixeira; Mariana Henriques; Maria Olivia Pereira
Journal:  Biomed Res Int       Date:  2014-05-28       Impact factor: 3.411

5.  Role of Transportome in the Gills of Chinese Mitten Crabs in Response to Salinity Change: A Meta-Analysis of RNA-Seq Datasets.

Authors:  Adeel Malik; Chang-Bae Kim
Journal:  Biology (Basel)       Date:  2021-01-08

6.  Inverse Resolution Limit of Partition Density and Detecting Overlapping Communities by Link-Surprise.

Authors:  Juyong Lee; Zhong-Yuan Zhang; Jooyoung Lee; Bernard R Brooks; Yong-Yeol Ahn
Journal:  Sci Rep       Date:  2017-09-29       Impact factor: 4.379

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.