| Literature DB >> 31247951 |
Karsten Suhre1, Irena Trbojević-Akmačić2, Ivo Ugrina2, Dennis O Mook-Kanamori3, Tim Spector4, Johannes Graumann5, Gordan Lauc2, Mario Falchi4.
Abstract
Most human proteins are glycosylated. Attachment of complex oligosaccharides to the polypeptide part of these proteins is an integral part of their structure and function and plays a central role in many complex disorders. One approach towards deciphering this human glycan code is to study natural variation in experimentally well characterized samples and cohorts. High-throughput capable large-scale methods that allow for the comprehensive determination of blood circulating proteins and their glycans have been recently developed, but so far, no study has investigated the link between both traits. Here we map for the first time the blood plasma proteome to its matching N-glycome by correlating the levels of 1116 blood circulating proteins with 113 N-glycan traits, determined in 344 samples from individuals of Arab, South-Asian, and Filipino descent, and then replicate our findings in 46 subjects of European ancestry. We report protein-specific N-glycosylation patterns, including a correlation of core fucosylated structures with immunoglobulin G (IgG) levels, and of trisialylated, trigalactosylated, and triantennary structures with heparin cofactor 2 (SERPIND2). Our study reveals a detailed picture of protein N-glycosylation and suggests new avenues for the investigation of its role and function in the associated complex disorders.Entities:
Keywords: HILIC-UPLC; N-glycosylation; SOMAscan; aptamers; glycomics; population study; proteomics
Year: 2019 PMID: 31247951 PMCID: PMC6681129 DOI: 10.3390/metabo9070122
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Figure 1Replicated protein–glycan associations. (a) Correlation (r2) between 62 glycan (rows) and 43 protein traits (columns); positive associations are in blue, negative in red; darker values indicate stronger associations; (b) limited to proteins and glycans that have at least one association with r2 > 0.4. A fully annotated and filterable matrix for all associations is available in Excel format as Tables S4 and S5.
Selected protein–glycan associations. Pearson correlations between inverse-normal scaled traits and corresponding p-values are listed.
| Protein | Glycan Trait | R QMDiab | R TwinsUK | ||
|---|---|---|---|---|---|
| IgG | PGP69: M9 in total neutral plasma glycans (GPn) | −0.69 | −0.36 | 8.0 × 10−50 | 1.5 × 10−2 |
| IgG | PGP93: core fucosylated structures | 0.68 | 0.31 | 3.7 × 10−47 | 3.7 × 10−2 |
| IgG | PGP65: A2G2 in total neutral plasma glycans (GPn) | −0.51 | −0.35 | 3.8 × 10−24 | 1.6 × 10−2 |
| IgG | PGP75: fucosylation of digalactosylated structures in total neutral plasma glycans | 0.44 | 0.35 | 1.3 × 10−17 | 1.9 × 10−2 |
| IgG | PGP14: A2G2S(6)1 + A2G2S(3)1 | −0.44 | −0.30 | 1.4 × 10−17 | 4.3 × 10−2 |
| IgM | PGP16: FA2BG2S(3)1 + FA2BG2S(6)1 | 0.59 | 0.61 | 3.6 × 10−34 | 6.0 × 10−6 |
| IgM | PGP44: monosialylation of core-fucosylated digalactosylated structures with bisecting GlcNAc | 0.51 | 0.48 | 2.5 × 10−24 | 8.0 × 10−4 |
| IgM | PGP11: FA2BG2 | 0.48 | 0.32 | 1.3 × 10−21 | 2.9 × 10−2 |
| IgM | PGP42: monosialylation of core-fucosylated digalactosylated structures without bisecting GlcNAc | 0.46 | 0.41 | 4.6 × 10−19 | 4.1 × 10−3 |
| IgM | PGP48: ratio of fucosylated monosialylated and disialylated structures (with bisecting GlcNAc) | 0.45 | 0.38 | 1.7 × 10−18 | 8.3 × 10−3 |
| IgM | PGP54: ratio of fucosylated monosialylated structures with and without bisecting GlcNAc | 0.40 | 0.56 | 1.1 × 10−14 | 4.4 × 10−5 |
| IgM | PGP55: the incidence of bisecting GlcNAc in all fucosylated monosialylated structures | 0.40 | 0.56 | 1.1 × 10−14 | 4.4 × 10−5 |
| SERPIND1 | PGP97: trisialylated structures | 0.53 | 0.29 | 8.0 × 10−27 | 4.9 × 10−2 |
| SERPIND1 | PGP102: trigalactosylated structures | 0.53 | 0.35 | 8.7 × 10−26 | 1.7 × 10−2 |
| SERPIND1 | PGP105: triantennary structures | 0.52 | 0.34 | 7.0 × 10−25 | 2.0 × 10−2 |
| FUT5 | PGP24: A3G3S(3,3,3)3 | 0.53 | 0.39 | 3.6 × 10−26 | 7.9 × 10−3 |
| FUT5 | PGP30: A4G4S(3,3,3)3 | 0.51 | 0.50 | 3.8 × 10−24 | 3.8 × 10−4 |
| FUT5 | PGP32: A4F1G3S(3,3,3)3 + A4F1G3S(3,3,6)3 + A4F1G3S(3,6,6)3 | 0.51 | 0.43 | 4.4 × 10−24 | 3.1 × 10−3 |
| FUT5 | PGP106: tetraantennary structures | 0.48 | 0.46 | 1.9 × 10−21 | 1.4 × 10−3 |
| FUT5 | PGP110: ratio of trisialylated and tetrasialylated tetragalactosylated structures | 0.47 | 0.48 | 9.4 × 10−21 | 8.3 × 10−4 |
| FUT5 | PGP103: tetragalactosylated structures | 0.47 | 0.45 | 2.4 × 10−20 | 1.5 × 10−3 |
| FUT5 | PGP36: A4F1G4S(3,3,3,6)4 | 0.45 | 0.30 | 6.3 × 10−19 | 4.0 × 10−2 |
| FUT5 | PGP92: antennary fucosylated structures | 0.40 | 0.37 | 6.6 × 10−15 | 1.1 × 10−2 |
| CD5L | PGP16: FA2BG2S(3)1 + FA2BG2S(6)1 | 0.51 | 0.40 | 6.3 × 10−24 | 5.7 × 10−3 |
| CD5L | PGP108: glycan structures with bisecting GlcNAc | 0.40 | 0.41 | 6.7 × 10−15 | 4.9 × 10−3 |
| CRP | PGP97: trisialylated structures | 0.45 | 0.38 | 3.3 × 10−18 | 8.8 × 10−3 |
| CRP | PGP29: A3F1G3S(3,3,3)3 + A3F1G3S(3,3,6)3 | 0.44 | 0.39 | 7.1 × 10−18 | 7.1 × 10−3 |
| F9 (IX) | PGP29: A3F1G3S(3,3,3)3 + A3F1G3S(3,3,6)3 | 0.44 | 0.38 | 6.9 × 10−18 | 9.7 × 10−3 |
| F9 (IXab) | PGP29: A3F1G3S(3,3,3)3 + A3F1G3S(3,3,6)3 | 0.44 | 0.38 | 7.1 × 10−18 | 9.3 × 10−3 |
| F9 (IX) | PGP92: antennary fucosylated structures | 0.41 | 0.31 | 4.5 × 10−15 | 3.5 × 10−2 |
| FCGR3B | PGP3: A2(6)BG1 | 0.44 | 0.30 | 1.3 × 10−17 | 4.2 × 10−2 |
Figure 2Scatterplots of selected protein–glycan associations. QMDiab (black circles), TwinsUK (red dots); (a) IgG with percentage of M9 in total neutral plasma glycans, (b) IgM with FA2BG2 glycans, (c) FUT5 with A3G3S(3,3,3)3 glycans, and (d) FCGR3B with A2(6)BG1 glycans.
Figure 3Consistency of the association between glycan traits and C-reactive protein (CRP) levels in TwinsUK and QMDiab. Correlation coefficients for TwinsUK and QMDiab are shown.
Figure 4Representative chromatogram of a total plasma N-glycome. Fluorescently labelled plasma N-glycans were separated by HILIC-UPLC into 36 peaks (GP1–GP36). The glycan content in each peak was assigned as determined previously [38]. The amount of glycan species in each peak was expressed as % of total integrated area.