Ziyuan Gao1,2, Zhen Wu3, Ying Han4, Xumin Zhang3, Piliang Hao4, Mingming Xu1, Shan Huang1, Shuwei Li5, Jun Xia6, Junhong Jiang2,7, Shuang Yang1. 1. Center for Clinical Mass Spectrometry, School of Pharmaceutical Sciences, Soochow University, Suzhou, Jiangsu 215123, China. 2. Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Soochow University, Pinghai Road No. 899, Suzhou 215000, China. 3. State Key Laboratory of Genetic Engineering, Department of Biochemistry, School of Life Sciences, Fudan University, Shanghai 200438, China. 4. School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China. 5. Nanjing Apollomics Biotech, Inc., Nanjing, Jiangsu 210033, China. 6. Department of Clinical Laboratory Center, Zhejiang Provincial People's Hospital, People's Hospital of Hangzhou Medical College, Hangzhou, Zhejiang 310014, China. 7. Department of Pulmonary and Critical Care Medicine, Dushu Lake Hospital, Affiliated to Soochow University, Chongwen Road No. 9, Suzhou 215000, China.
Abstract
Aberrant glycosylation is a hallmark of cancer found during tumorigenesis and tumor progression. Lung cancer (LC) induced by oncogene mutations has been detected in the patient's saliva, and saliva glycosylation has been altered. Saliva contains highly glycosylated glycoproteins, the characteristics of which may be related to various diseases. Therefore, elucidating cancer-specific glycosylation in the saliva of healthy, non-cancer, and cancer patients can reveal whether tumor glycosylation has unique characteristics for early diagnosis. In this work, we used a solid-phase chemoenzymatic method to study the glycosylation of saliva glycoproteins in clinical specimens. The results showed that the α1,6-core fucosylation of glycoproteins was increased in cancer patients, whereas α1,2 or α1,3 fucosylation was significantly increased. We further analyzed the expression of fucosyltransferases responsible for α1,2, α1,3, and α1,6 fucosylation. The fucosylation of the saliva of cancer patients is drastically different from that of non-cancer or health controls. These results indicate that the glycoform of saliva fucosylation distinguishes LC from other diseases, and this feature has the potential to diagnose lung adenocarcinoma.
Aberrant glycosylation is a hallmark of cancer found during tumorigenesis and tumor progression. Lung cancer (LC) induced by oncogene mutations has been detected in the patient's saliva, and saliva glycosylation has been altered. Saliva contains highly glycosylated glycoproteins, the characteristics of which may be related to various diseases. Therefore, elucidating cancer-specific glycosylation in the saliva of healthy, non-cancer, and cancer patients can reveal whether tumor glycosylation has unique characteristics for early diagnosis. In this work, we used a solid-phase chemoenzymatic method to study the glycosylation of saliva glycoproteins in clinical specimens. The results showed that the α1,6-core fucosylation of glycoproteins was increased in cancer patients, whereas α1,2 or α1,3 fucosylation was significantly increased. We further analyzed the expression of fucosyltransferases responsible for α1,2, α1,3, and α1,6 fucosylation. The fucosylation of the saliva of cancer patients is drastically different from that of non-cancer or health controls. These results indicate that the glycoform of saliva fucosylation distinguishes LC from other diseases, and this feature has the potential to diagnose lung adenocarcinoma.
As one of the common
post-translationally modifications, glycosylation
is associated with many diseases, and its abnormal changes can affect
the pathophysiology of cells or organisms.[1,2] Changes
in glycosylation play a vital role in diseases such as increased fucosylation
in prostate cancers,[3,4] dysregulated glycoforms in influenza
virus,[5,6] varied glycosites of spike glycoprotein
in COVID-19,[7,8] upregulated sialylation in cardiovascular
disease,[9] and elevated O-GlcNAcylation in neurodegenerative disease.[10,11] In particular, protein glycosylation changes during tumorigenesis
and cancer progression.[12,13] Therefore, disease-specific
glycosylation is often used as a diagnostic and/or prognostic biomarker.
For instance, the core fucosylation of α-fetoprotein (AFP) is
a clinical molecule for liver cancer diagnosis; using AFP core fucosylation
instead of total AFP can improve sensitivity and specificity.[14] Since most tumor markers approved by Food and
Drug Administration are glycoproteins, such as cancer antigen 125
(CA 125), AFP, immunoglobulins, neuron-specific enolase, and prostate-specific
antigen (PSA), potential cancer biomarkers are likely to be glycoproteins
in human biofluids.[15−17] Glycoenzymes [glycosyltransferases (GTFs) and glycosidases]
may be intrinsically regulated in the tumor microenvironments.[18,19] Dysregulated glycoenzymes and their protein expression can alter
protein glycosylation, leading to changes in the function of the protein
cascade in the cell. Thus, analysis of tumor-specific glycosylation
and upstream glycoenzymes is important to identify potential biomarkers
for diagnosis and prognosis.For non-invasive detection of body
fluids, liquid biopsy has become
a very popular focus in recent years, such as blood, circulating tumor
cells, and circulating tumor DNA (ct DNA). Studies have shown that
the early diagnosis of different cancers can be achieved by detecting
ctDNA methylation in longitudinal studies in patient plasma.[20] Tumor markers can also be proteins or other
substances that are present or produced in cancer or other cells of
the body in response to the tumor microenvironment. Glycosylation
is also used as a detection in various cancer liquid biopsies as tumor-associated
glycans or glycoproteins may be secreted into the circulation and
present in different body fluids as potential biomarkers. Therefore,
human plasma, urine, and saliva can all be used to discover disease-specific
glycosylation markers. Plasma markers such as PSA, CA-125, AFP, or
amyloid-beta precursor protein have been clinically used for the early
detection of prostate cancer, ovarian cancer, liver cancers, and Alzheimer’s
disease, respectively.[21,22] Recent studies have found that
the expression of serum proteins CEA (carcinoembryonic antigen), RBP (retinol-binding protein), and α1 antitrypsin
in the diagnosis of lung cancer (LC) has a sensitivity of 89.3% and
a specificity of 84.7%.[23] The results are
based on the analysis of the serum proteins of several patients diagnosed
with non-small-cell LC (NSCLC). However, more clinical studies are
needed to confirm whether these results are applicable to different
subtypes of NSCLC.In addition to serum or plasma, which is
widely used for biomarker
discovery, saliva has become one of the essential biofluids in diagnosis
due to non-invasive sample preparation. It can avoid the pain, anxiety,
or risk of infection, and it is easy to store and collect multiple
subsequent specimens. Saliva has been used to diagnose oral diseases
and monitor disease progression, such as periodontal pathogen[24] or patients suspected COVID-19.[25,26] Proteomic analysis of human saliva found that 48 out of the 500
proteins were differentially expressed between healthy controls (HCs)
and gastric cancer patients. Among them, STAT2 (signal transducer
and activator of transcription 2) was upregulated, and the tumor suppressor
of DMBT1 (deleted in malignant brain tumors 1 protein) was downregulated.[27] STAT family members such as STAT2 play an important
role in the regulation of cell proliferation, differentiation, apoptosis,
and angiogenesis.[28] For example, upregulation
of TLR2 driven by STAT3 can promote gastric tumorigenesis, and inhibition
of STAT3 signaling can prevent gastric cancer proliferation and metastasis.[29,30] A meta-analysis of 29 articles from more than 10,000 subjects showed
that the diagnostic accuracy of saliva biomarkers for LC remote from
the mouth is up to 88%.[31] Therefore, saliva
is a promising non-invasive biofluid for discovering novel biomarkers
for LC.In addition to urea, ammonia, and electrolytes, saliva
also contains
many proteins. The most abundant saliva proteins are mucins, amylases,
defensins, cystatins, histatins, proline-rich proteins, statherin,
lactoperoxidase, lysozyme, lactoferrin, and immunoglobulins. These
proteins can come from the salivary gland, stomach, and lung.[32,33] Mass spectrometry (MS) analysis of exosomes and macrovesicles in
the saliva of LC patients revealed that approximately 4% of the identified
proteins belonged to distal lung cells. Among them, BPIFA1 (BPI fold-containing
family A member 1), CRNN (cornulin), MUC5B (mucin-5B), and IQGAP (Ras
GTPase-activating-like protein) are dysregulated in LC, and most of
which are also glycosylated.[34] The changes
in glycosylation may be attributed to the differential expression
of glycoenzymes and their substrates in the tumor environment. GTFs,
such as glucosyltransferase B (GtfB),[35] α1,3-fucosyltransferase (FUT5),[36] α1,3-mannosyltransferase (ALG3), N-acetylgalactosaminide
α2,6-sialyltransferase 1 (ST6GALNAC1), and α-N-acetyl-neuraminide α2,8-sialyltranserase 2 or 5 (ST8SIA2 or
ST8SIA5) (the Human Protein Atlas), are highly abundant in saliva.
Glycosylation of saliva-containing microbe, phagocyte, mucin, or agglutinin
is regulated by these GTFs.[37] Saliva glycoproteins,
MUC5B, MUC7 (mucin-7),[38] salivary agglutinin
(SAG),[39] β-2-micoglobulin,[40] and proline-rich glycoprotein,[38] can change when tumor initializes and progresses further
through dysregulated glycoenzymes. Consequently, the identification
of tumor-specific glycosylation and its dependent regulators is crucial
for the discovery of biomarkers of interest.We hypothesized
that tumor-associated glycosylation exists in saliva
that can be used to differentiate lung adenocarcinoma patients from
healthy individuals. To decipher protein glycosylation, structural
analysis of glycans, glycosites, site occupancy, and occupied glycans
of glycosites is required. Glycan analysis can be performed by glycosidases
or alkaline β-elimination,[41,42] while N-glycosites are determined by tandem MS (MS/MS) against
the intact N-glycopeptides enriched by hydrophilic
interaction liquid chromatography.[43−45] Complex O-glycosylation
has been successfully studied by O-protease (OpeRATOR or StcE), which
cleaves the N-terminus of O-glycosylated serine or
threonine; O-glycopeptides are usually analyzed by
electron-transfer and higher-energy collision dissociation (EThcD)
fragmentation.[46−48] Conversely, the linkages of labile sialic acids are
differentially derivatized by ethyl esterification and reductive amination
using amine-containing compounds.[49,50] The derivatization
of sialic acid on the solid phase not only stabilizes the α-2,3
and α-2,6 linkages sequentially but also facilitates the removal
of reagents after the reaction.[50] By combining
these analytical platforms and advanced MS technology, we can extensively
deconvolute disease-specific glycopatterns by comparing protein glycosylation
between HCs and non-cancer and cancer patients.In this study,
we used a solid-phase chemoenzymatic method to compare
saliva glycosylation in HCs and non-cancer and cancer patients. To
determine the linkage of fucosylation, glycoproteins are conjugated
to a solid support, and their fucoses are sequentially digested by
specific α-fucosidases. Unstable sialic acids are modified by
two-step chemical derivatization, and the linkages between α2,6
and α2,3 are distinguished by carrying a distinct mass tag after
derivatization. Fucosylated glycoproteins are studied by bottom-up
proteomics and matrix-assisted laser desorption/ionization (MALDI)-MS.
Fucosyltransferases are quantitatively analyzed by qPCR. The biosynthesis
of fucosylated high-mannose or complex N-glycans
and their potential application for diagnosis of LC are also discussed.
Methods
Participants
and Study Design
The workflow of clinical
samples is shown in Figure . In this study, saliva samples were collected from 51 individuals,
including 20 patients with LC, 21 patients with other diseases, and
10 healthy volunteers. Saliva samples were divided into HC, other
non-cancer disease (OD), and lung adenocarcinomas (LC) (see Supporting Information Table S1). All patients
in the LC group were histopathologically confirmed as lung adenocarcinoma,
has no history of inflammatory disease or other malignant tumors,
and had not received chemotherapy or radiotherapy. In this study,
there was statistically no significant difference between LC and OD/HC
in terms of gender, smoking history, and other factors. All patient
samples were collected according to protocols approved by the Institutional
Review Board (IRB) of the First Affiliated Hospital of Soochow University,
and written informed consent was provided to patients in advance.
Figure 1
MS workflow
for analysis of saliva proteins, glycoproteins, and
glycans. Three groups have been used for comparison, including HC,
OD, and LC. First, proteins are extracted from saliva and used for
glycosylation analysis, bottom-up (or shotgun) proteomics, and fucosylation
linkage determination. Shotgun proteomics can identify GTFs responsible
for specific glycosylation.
MS workflow
for analysis of saliva proteins, glycoproteins, and
glycans. Three groups have been used for comparison, including HC,
OD, and LC. First, proteins are extracted from saliva and used for
glycosylation analysis, bottom-up (or shotgun) proteomics, and fucosylation
linkage determination. Shotgun proteomics can identify GTFs responsible
for specific glycosylation.
Standard Procedure for Saliva Collection
All saliva
samples were collected in the morning (9–11 am). Patients and
healthy individuals were asked not to eat, drink, smoke, or use any
oral cleaning products for at least 1 h before collecting saliva.
This minimizes the effect of smoking, food, alcohol consumption, or
beverages on the final results of the experiment. Subjects rinsed
their mouth 2–3 times with drinking water 5 min before collection
to ensure oral hygiene. In the absence of stimulation, naturally secreted
whole saliva was collected in a 50 mL centrifuge tube. Saliva collection
(∼5 mL) must be completed within 10 min. During the collection
process, the saliva collected in the centrifuge tube must be kept
on ice. Saliva samples were centrifuged at 12,000 rpm for 25 min at
4 °C. After discarding the pellet and adding 100× protease
inhibitor to the supernatant, the saliva samples are stored at −80
°C.
SDS-PAGE and Glycosidase Treatment of Saliva Proteins
The concentration of saliva proteins was measured by bicinchoninic
acid (BCA) assay and nanodrop. Three sets of saliva samples were diluted
to a concentration of ∼1 mg/μL. 20 μg of protein
was taken from each group and reacted with PNGase F, fucosidase, and
a mixture of the two enzymes at 37 °C for 4 h. The 5× protein
loading buffer was added to the saliva with and without these enzyme
digestion, and samples were incubated at 100 °C for 5 min. Electrophoresis
was performed on SDS-PAGE using 10% SDS-PAGE gel kit (Beyotime). Running
buffer consists of 0.025 M Tris, 0.192 M glycine, and 0.1% SDS. 20
μL mixture of the sample (20 μg) and loading buffer was
added to the gel well. After electrophoresis, the gel was stained
in the staining solution (containing 0.25% Coomassie Bright Blue R250,
45% methanol, and 10% acetic acid) for 3 h and then eluted in the
eluting buffer (methanol/glacial acetic acid/water = 2:2:9, v/v) until
protein bands were clear. The gel bands were then imaged using the
ChemiDoc MIP imaging system (Bio-Rad).
Saliva Protein Extraction
The 500 μL of solution
consists of trichloroacetic acid (20% w/v), acetone (90% v/v), and
dithiothreitol (DTT; 20 mM) and was mixed with 500 μL saliva.
The mixture was vortexed and precipitated overnight at −20
°C. The sample was then centrifugated at 15,000 rpm for 30 min
at 4 °C. The supernatant was discarded, and the pellet was collected,
then washed with 200 μL of cold acetone (90%) and 20 mM DTT,
and finally washed with cold acetone (80%) and 10 mM DTT. To suspend
the pellet in the solution, the sample was sonicated for at least
5 min prior to acetone-DTT wash. The pellet was placed at −20
°C for 20 min, then centrifugated at 15,000 rpm for 5 min at
4 °C. Finally, the pellet was collected and dried in a Speed-Vac
(5 min) and stored at −80 °C before further analysis.
Enzymatic Release of N-Glycans
PNGase
F is used to release glycans from glycoproteins after derivatization
of sialic acids on a solid phase.[42,51] Briefly, protein
(500 μg) was heated at 90–100 °C for 10 min and
mixed with 200 μL of AminoLink plus resin, which was pre-conditioned
with 500 μL of 1× binding buffer (2×). 1× binding
buffer contains 10 mM sodium citrate and 5 mM sodium carbonate. The
protein was conjugated to the resin in 1× binding buffer [4 h
at room temperature (RT)], followed by adding 50 mM NaCNBH3. After washing the resin with 1× PBS (500 μL, 3×),
the sample was further incubated in 1× PBS for 4 h in the presence
of 50 mM NaCNBH3. The unreacted aldehydes remaining on
the resin were blocked with 1 M Tris·HCl (pH 7.4). The 2,6-linked
sialic acids were then derivatized with 0.25 M EDC (200 μL)
and 0.25 M HBot (200 μL) in ethanol at 37 °C/1 h. After
removing reagents and washing the resin with DI water, the 2,3-linked
sialic acids were further modified with 1 M p-toluidine
(pT) (500 μL).[50] After multiple washing
steps as previously described,[42] the resin
was treated with glycosidases to analyze fucose linkages or glycan
compositions by MS.
Determination of the Fucosylation Linkage
The fucosylation
linkage of glycoproteins conjugated to the resin can be further determined
by fucosidase and MS (Figure ). The linkage is resolved by α1-2 fucosidase, α1-2,3,4
fucosidase, or α1-2,4,6 fucosidase. The conjugated glycoprotein
was aliquoted into three equal amounts and treated with three fucosidases.
An aliquot was incubated in 50 unit of α1-2,3,4 fucosidase in
20 mM Tris·HCl (pH 6.8), 37 °C/30 min. N-Glycans were released by 0.2 μL of PNGase F in 200 μL
of 20 mM NH4HCO3, 37 °C/overnight (Figure step ①).
The second aliquot was treated with 10 units of α1-2 fucosidase
in 50 mM sodium acetate and 5 mM CaCl2 (pH 5.5), 37 °C/1
h and then by PNGase F to release N-glycans (Figure step ②).
The third aliquot was treated with 10 units of α1-2,4,6 fucosidase
under the same condition, and its N-glycans were
released by PNGase F (Figure step ③). The linkage of α1,2, α1,3, and
α1,6 is thus determined.
Figure 2
Schematic diagram of determination of
fucosylation linkage using
specific fucosidase and MS analysis. ① Removal of all fucose
linkages except for core α1,6 fucosylation by α1-2,3,4
fucosidase. This scheme led to the determination of core α1,6
linkage of fucosylated glycan; ② removal of α1,2 linkage
of fucosylated glycan by α1-2 fucosidase. The remaining linkages
of fucosylation can be α1,3 or α1,6. The α1,3 is
then determined by comparing fucosylated glycans with scheme 1; and
③ removal of all linkages except for α1,3Fuc-GlcNAc through
α1-2,4,6 fucosidase. This scheme confirms whether there is α1,4
linkage.
Schematic diagram of determination of
fucosylation linkage using
specific fucosidase and MS analysis. ① Removal of all fucose
linkages except for core α1,6 fucosylation by α1-2,3,4
fucosidase. This scheme led to the determination of core α1,6
linkage of fucosylated glycan; ② removal of α1,2 linkage
of fucosylated glycan by α1-2 fucosidase. The remaining linkages
of fucosylation can be α1,3 or α1,6. The α1,3 is
then determined by comparing fucosylated glycans with scheme 1; and
③ removal of all linkages except for α1,3Fuc-GlcNAc through
α1-2,4,6 fucosidase. This scheme confirms whether there is α1,4
linkage.
Comparison of Fucosylation
of Saliva Glycoproteins
Proteins are extracted from saliva
according to the saliva protein
extraction protocol. The proteins (1 mg) were used to determine the
fucosylation linkage using a solid-phase chemoenzymatic method (Figure ). The aliquot proteins
(500 μg) were also digested with trypsin for the quantitative
analysis of GTFs. The structure of glycans in HC, OD, and LC was compared
for features that are specific to cancer.
Mass Spectrometry Analysis
of Glycans and Glycopeptides
Following PNGase F incubation,
glycans were eluted by centrifugation
and further washed with 100 μL of HPLC water (twice). The total
volume is approximately 400 μL, of which 2–4 μL
is used for glycan analysis by Bruker AutoFlex MALDI-TOF/TOF-MS. Each
sample is tested in 3–4 technical duplicates, with an average
of 10,000 shots per measurement. Global proteins are analyzed by shotgun
proteomics. Briefly, protein (500 μg) was dissolved in 8 M urea
and treated with 12 mM Tris (2-carboxyethyl) phosphine hydrochloride
(37 °C/1 h) and 16 mM iodoacetamide (RT/1 h in the dark). 10
μg of trypsin (Promega, Madison, WI, USA) was added to the protein
after dilution (<1.5 M urea). Protein digestion was conducted overnight
at 37 °C, and the peptides were further purified by C18 SPE (solid-phase
extraction). N-Glycosite analysis was performed by
solid-phase extraction of glycopeptide enrichment (SPEG)[52] as follows: the purified peptides were oxidized
by 10 mM sodium periodate to couple glycopeptides to hydrazide beads.
Glycan-containing glycopeptides are released by PNGase F. The deglycopeptides
were analyzed by Thermo Scientific Orbitrap Fusion LC–MS, using
the same parameters described in our previous work.[47]
qPCR Quantification of Fucosyltransferases
in Lung Tissues
The fucosyltransferases of interest were
quantitatively analyzed
by q-PCR using an ABI 7500 Real-Time PCR instrument. The TRIzol method
was used to extract total RNA from LC tissues and matched adjacent
non-tumor tissues. The RNA concentration was measured using Nanodrop.
The extracted RNA was reversed into cDNA using the RevertAid First
Strand cDNA Synthesis kit (Thermo). The primer sequences for qPCR
are shown in Supporting Information Table S2. We use human GAPDH as the reference gene. The reaction system is
10 μL 2× ChamQ Universal SYBR qPCR Master Mix, 0.4 μL
of 10 μM upstream and downstream primers, 1 μL of cDNA
template, and 20 μL of water for the final system. The reaction
procedure of the qPCR system is as follows: pre-deformation at 95
°C for 30 s; 40 cycles of amplification (95 °C for 10 s
and 60 °C for 30 s); and melting curve (60 °C for 60 s and
95 °C for 15 s). After the reaction, relative gene expression
was calculated quantitatively by 2–ΔΔCt.
Results
Protein Glycosylation Differs between Cancerous
and Noncancerous
Saliva
To show whether the glycosylation in the saliva of
LC patients has changed, we performed SDS-PAGE on the saliva proteins
of HC, other diseases (non-cancer, OD), and LC (adenocarcinomas, LC)
with and without glycosidase treatment. PNGase F (NEB BioLabs) is
an N-glycosidase that can cleave all N-glycans from glycoproteins but cannot cleave the innermost GlcNAc N-glycans with α1-3 fucose residue (e.g., plant or
insect glycoproteins).[53] SDS-PAGE showed
that HC has more protein bands between 66 and 95 kDa, OD has fewer
intensity bands between 66 and 70 kDa, and LC has high intensity proteins
between 52 and 66 kDa (Figure ). The protein pattern between 52 and 30 kDa is also different.
After PNGase F digestion, the protein bands of all samples shifted
to lower molecular weight (MW), indicating the presence of N-glycosylation in HC, OD, and LC.
Figure 3
Different glycosylation
present in saliva glycoproteins in HC,
non-cancer, and LC. Proteins of HC, OD, and LC were treated with PNGase
F, α1-2,3,4 fucosidase. PNGase F removes N-glycans
from glycoproteins, thereby reducing the MW of N-glycoproteins.
α1-2,3,4 Fucosidase hydrolyzes fucose with linkages of α1-2,
α1-3, or α1-4. A decrease in the MW of glycoproteins suggests
one or more of these fucosylated linkages but not α1-6. The
MW of PNGase F is about 36 kDa, while α1-2,3,4 Fucosidase consists
of two fucosidases modified with His-tags, with the MW of 87 and 64
kDa.
Different glycosylation
present in saliva glycoproteins in HC,
non-cancer, and LC. Proteins of HC, OD, and LC were treated with PNGase
F, α1-2,3,4 fucosidase. PNGase F removes N-glycans
from glycoproteins, thereby reducing the MW of N-glycoproteins.
α1-2,3,4 Fucosidase hydrolyzes fucose with linkages of α1-2,
α1-3, or α1-4. A decrease in the MW of glycoproteins suggests
one or more of these fucosylated linkages but not α1-6. The
MW of PNGase F is about 36 kDa, while α1-2,3,4 Fucosidase consists
of two fucosidases modified with His-tags, with the MW of 87 and 64
kDa.Fucosidase treatment can detect
the presence of fucosylation on N-glycans or O-glycans,
while fucosidase after PNGase F
digestion can reveal whether there are fucoses on N-glycans. There was only a slight change when only fucosidase was
used, but after treatment with both glycosidases, more pronounced
protein bands appeared in OD and LC (Figure ). These results indicate that fucosylation
mainly occurs on N-glycosylation of LC, and O-glycosylation containing fucosylation is negligible. The
specific linkage of fucosylation can be further determined by fucosidase.
Different Linkages of Fucosylated N-Glycans
are Elevated in Lung Cancer
To determine the linkage of fucosylation,
we used three fucosidases to process the glycoproteins on the solid
support (resin) before PNGase F digestion. Because α1-2 fucosidase
removes α1-2 fucose, the remaining linkage can be α1-3,
α1-4, or α1-6. Similarly, α1-3 fucosidase can determine
linkage α1-3, and the remaining linkage can be α1-2, α1-4,
or α1-6. After α1-2,3,4 fucosidase digestion, any remaining
fucose can be α1-6 core fucose. We used this strategy to elucidate
the linkage of fucosylated N-glycans of saliva glycoproteins.The glycan abundance after fucosidase treatment includes one that
already exists in the sample and the other after corresponding fucose
is removed. To explain our strategy, we used glycans with the same
core structure H5N4 for quantitative analysis. As shown in Figure , the fucosylation
linkage was determined by examining H5N4F4, H5N4F3, H5N4F2, and H5N4F1.
The glycan profile of saliva glycoproteins indicates that these glycans
are present in LC, so removing any fucose will alter the relative
abundance of the related glycans. For example, α1-2 fucosidase
(F2) digests H5N4F4 to H5N4F2 (corresponding to two α1-2 linkages),
or H5N4F3 to H5N4F2 and H5N4F1 (Figure a). Similarly, α1-2,3,4 fucosidase (F234) alters
H5N4F4 to H5N4F3, H5N4F2, and H5N4F1 (Figure b), resulting in the overall abundance of
the final glycan profile (Figure c). The reduction of fucosylated glycans after F2 or
F234 treatment indicates the presence of one or more of these fucosylation
linkages in the sample. The core-fucosylated glycans were also identified
because none of these fucosidases can digest α1-6 core fucosylation.
Figure 4
Description
of the general workflow for the determination of linkage
of fucosylated glycans containing multiple fucoses using different
fucosidases. Saliva glycoproteins are extracted with lysis buffer
and conjugated to the AminoLink Plus Resin. (a) Four fucosylated N-glycans (H5N4F4) contain α1-2 fucose on Gal after
α1-2 fucosidase (F2) digestion, and the remaining fucose is
either core α1-6 or antenna GlcNAc α1-3 linkage. F2 digestion
can increase the abundance of H5N4F2, whereas H5N4F4 is reduced due
to the loss of α1-2 fucose to form H5N4F2. Similarly, H5N4F3
loses two α1-2 fucose on Gal, and its abundance decreases accordingly.
α1-2 fucosidase digestion eventually forms H5N4F1, whereas core
α1-6 fucose still exists after F2 digestion. (b) α1-2,3,4
fucosidase (F234) determines the core-fucosylated glycan after removing
α1-2, α1-3, and α1-4. H5N4F4 is trimmed to N5N4F3,
N5H4F2, and H5N4F1. The abundance of these fucosylated glycans changes
and is characterized by MALDI-MS. The remaining fucose of the glycan
is a core fucose linkage. (c) Change in each glycan, H5N4F4, H5N4F3,
H5N4F2, and H5N4F1, is the sum of these glycans and fucosidase digestion
products in saliva. The reduction of H5N4F4 by F2 or F234 shows the
presence of α1-2, α1-3, and α1-4. The ratio of LC
vs OD. HC was a value measured from saliva glycoproteins without any
fucosidase treatment. The arrow ↑ denotes the increase in abundance
after fucosidase treatment compared with the untreated sample, and
the arrow ↓ indicates decrease in abundance.
Description
of the general workflow for the determination of linkage
of fucosylated glycans containing multiple fucoses using different
fucosidases. Saliva glycoproteins are extracted with lysis buffer
and conjugated to the AminoLink Plus Resin. (a) Four fucosylated N-glycans (H5N4F4) contain α1-2 fucose on Gal after
α1-2 fucosidase (F2) digestion, and the remaining fucose is
either core α1-6 or antenna GlcNAc α1-3 linkage. F2 digestion
can increase the abundance of H5N4F2, whereas H5N4F4 is reduced due
to the loss of α1-2 fucose to form H5N4F2. Similarly, H5N4F3
loses two α1-2 fucose on Gal, and its abundance decreases accordingly.
α1-2 fucosidase digestion eventually forms H5N4F1, whereas core
α1-6 fucose still exists after F2 digestion. (b) α1-2,3,4
fucosidase (F234) determines the core-fucosylated glycan after removing
α1-2, α1-3, and α1-4. H5N4F4 is trimmed to N5N4F3,
N5H4F2, and H5N4F1. The abundance of these fucosylated glycans changes
and is characterized by MALDI-MS. The remaining fucose of the glycan
is a core fucose linkage. (c) Change in each glycan, H5N4F4, H5N4F3,
H5N4F2, and H5N4F1, is the sum of these glycans and fucosidase digestion
products in saliva. The reduction of H5N4F4 by F2 or F234 shows the
presence of α1-2, α1-3, and α1-4. The ratio of LC
vs OD. HC was a value measured from saliva glycoproteins without any
fucosidase treatment. The arrow ↑ denotes the increase in abundance
after fucosidase treatment compared with the untreated sample, and
the arrow ↓ indicates decrease in abundance.
Elevated Fucosylation is Unique to Lung Cancer
To confirm
whether the glycan profile can be used to distinguish between LC and
OD/HC, we analyzed the overall profile of glycans of the saliva glycoproteins
from patients (Supporting Information Table S1). The sample is processed on a solid support, and its sialic acids
are stabilized by the ethyl esterification of α2,6-sialic acid
and p-toluidine carbodiimide coupling of α2,3-sialic
acid.[50] The N-glycans
are characterized by MALDI-TOF/TOF-MS and analyzed by GlycoWorkBench.[54]Figure shows the glycan profiles of saliva glycoproteins in HC,
LC, and OD. Several conclusions can be drawn: (1) compared with LC,
HC has a lower glycan abundance because the same amount of protein
is conjugated to the resin. The highest peak observed in HC is H3N5F1
(1668 Da), and most glycans in LC are significantly higher than those
in HC; (2) most glycans are core-fucosylated, that is, α1,6
fucosylation to the innermost GlcNAc. These glycans have significantly
higher intensity, such as H3N3F1, H3N4F1, and H8N7F1; and (3) compared
with HC or OD, the bisecting glycans with multiple fucosylation are
obviously abundant in LC. These glycans include H4N3F3, H4N5F2, H4N5F3,
H5N5F3, and H5N5F4; (4) there are multiple fucoses in core-GlcNAc
(α1,6), antenna-GlcNAc (α1,3), and antenna-Gal (α1-2);
and (5) the glycan profile of OD is also different from that of HC.
For example, the highest peak in HC is H3N5F1, and in OD, it is H4N5F3.
Generally, there are several fucosylation of glycans in OD compared
to HC. These results indicate that the characteristics of fucosylated
glycans can be used as markers to detect whether a patient has LC
or other diseases. The increase in fucosylated N-glycans
is also shown in Supporting Information Figure S1. In general, the intensity of fucosylated N-glycans in LC is significantly higher than that in HC; interestingly,
all of these glycans are core-fucosylated.
Figure 5
Different MALDI-MS glycan
profiles of saliva glycoproteins in cancer
compared with the HC and other disease. (a) Glycan from HC. The glycan
abundance of protein in HC is significantly lower than that of LC.
(b) Glycan profile of saliva glycoproteins of LC. The abundance of
glycans with multiple fucosylation is significantly higher in cancer
patients. These glycans also have core fucosylation, fucose on Gal
or antenna GlcNAc. (c) Glycan profile of OD. The profile of OD is
very different from that of LC. The characteristics of glycans can
differentiate whether the patient is LC or OD.
Different MALDI-MS glycan
profiles of saliva glycoproteins in cancer
compared with the HC and other disease. (a) Glycan from HC. The glycan
abundance of protein in HC is significantly lower than that of LC.
(b) Glycan profile of saliva glycoproteins of LC. The abundance of
glycans with multiple fucosylation is significantly higher in cancer
patients. These glycans also have core fucosylation, fucose on Gal
or antenna GlcNAc. (c) Glycan profile of OD. The profile of OD is
very different from that of LC. The characteristics of glycans can
differentiate whether the patient is LC or OD.
Changes in Different Types of N-Glycans
According to the branching of the N-glycan side
chain, we divided fucosylated N-glycans into three
different subtypes, namely, core-fucosylated high-mannose, fucosylated
hybrid glycans, and fucosylated complex glycans. Supporting Information Figure S1 shows quantification of three
subtypes of N-glycans. We found that changes in fucosylation
occurred in high-mannose, hybrid, and complex glycans. Table shows the quantitative analysis
of major fucosylated N-glycans in saliva samples
from HC, OD, and LC, including glycan type, core structure of its
fucosylated glycan, mass (MW), fucosidase digestion (F234 and F2),
glycan structure, and abundance of fucosylated glycan of saliva without
fucosidase treatment. The core-fucosylation of Man3 (H3N2F1) decreased
slightly in LC, but the changes in H4N2F1 and H5N2F1 were negligible.
The core-fucosylated high-mannose (H3N2F1) was found in human saliva,[55] and their possible biosynthetic pathways may
involve FUT8 and α-mannosidase I.[56] Compared with those in HC, the OD of fucosylated high-mannoses is
significantly reduced. Therefore, understanding the biosynthetic pathway
of core-fucosylated high-mannose may be helpful for the diagnosis
of non-cancer diseases using saliva.
Table 1
Regulation of Fucosylated Glycans
of Saliva Glycoproteins in Lung Cancer Compared with Other Diseases
and Healthy Controlsa
The linkage of
fucosylation is determined
by fucosidases. The abundance between LC, OD, and HC is measured by
MALDI-TOF/TOF-MS without any fucosidase treatment. The measurement
is conducted in triplicate. H = Hex, N = HexNAc, F = Fucose, F234
= a1-2,3,4 Fucosidase, and F2 = a1-2 Fucosidase. The arrow ↑
and ↓ stand for increase or decrease in glycan after fucosidase
treatment (10–2). The intensity of each glycan is
listed in Supporting Information Table S3 (* indicates statistical significance, *P <
0.05; **P < 0.01; ***P < 0.001;
and ****P < 0.0001).
The linkage of
fucosylation is determined
by fucosidases. The abundance between LC, OD, and HC is measured by
MALDI-TOF/TOF-MS without any fucosidase treatment. The measurement
is conducted in triplicate. H = Hex, N = HexNAc, F = Fucose, F234
= a1-2,3,4 Fucosidase, and F2 = a1-2 Fucosidase. The arrow ↑
and ↓ stand for increase or decrease in glycan after fucosidase
treatment (10–2). The intensity of each glycan is
listed in Supporting Information Table S3 (* indicates statistical significance, *P <
0.05; **P < 0.01; ***P < 0.001;
and ****P < 0.0001).Hybrid glycans were detected in human saliva, and
they were greatly
reduced in LC (Table ). The four fucosylated hybrid glycans H5N3F1, H5N3F2, H6N3F1, and
H6N3F2 have much lower intensity in LC saliva. These glycans feature
a core fucose, two of which have α1-2 fucose on Gal or α1-3
on GlcNAc. A similar trend was observed in gastric cancer serum, where
the hybrid glycan H6N4F1 was downregulated in cancer serum.[57] How these hybrid glycans are regulated in tumorigenesis
and cancer progression remain to be discovered, but they lack N-acetylglucosaminyltransferase I (GnT1), an enzyme responsible
for the synthesis of hybrid and complex glycans, which can lead to
delayed embryonic development.[58]The most striking changes were observed in complex glycans with
at least one fucose. We list 25 complex glycans that are significantly
upregulated in LC (Table ). Except for H3N5F1, H4N5F1, and H5N5F1, most of the glycans
in LC are higher than those in OD or HC. The dominant increase in
complex glycans is those with two or more fucoses, such as H4N3F2,
H4N3F3, H4N4F2, H5N4F3, H5N4F4, H5N5F3, and H5N5F4. Because these
complex glycans have core fucose, this suggests that the core fucosylation
enzyme FUT8 is actively regulated in cancer. Studies have shown that
the expression of FUT8 in tumor lesion is upregulated in NSCLC and
is associated with tumor metastasis or malignancy.[59] According to the Human Protein Atlas, FUT8 protein is highly
abundant in lung and digestive tract tissues, and its mRNA is highly
expressed in salivary gland, tongue, and lung; however, proteomic
analysis of saliva proteins did not detect FUT8 in HC, OD, or LC.
These results suggest that the core-fucosylated proteins should come
from the lungs or other organs. Additionally, the formation of α1,2-linked
fucose on Gal or the formation of α1,3-linked fucose on GlcNAc
was observed in saliva glycoproteins of LC patients. The increase
in fucosylation specific for LC should be attributed to the increase
in the expression of the corresponding fucosyltransferases (FUTs),
which have been further characterized by qPCR.
Upregulated Fucosyltransferases
Lead to Aberrant Fucosylation
in Lung Cancer
To identify FUTs, we used clinical specimens
form lung tumor tissue and matched adjacent non-tumor tissues to quantify
the mRNA level of each FUT. To determine whether the saliva contains
FUTs for synthesizing linkages of fucose, we used shotgun proteomics
to analyze FUT expression in HC and LC. LC–MS/MS data showed
that FUT6 and FUT11 were present in the saliva proteins of LC patients,
but no other FUTs were identified from saliva; FUT4 and FUT6 were
identified by SPEG and found in OD (Supporting Information Tables S4 and S5). In contrast, the abundance of
mRNA extracted from saliva is extremely low. As a result, we did not
observe any FUT mRNA expression using saliva samples.Proteomic
analysis of human saliva shows that there is an inherent correlation
between the protein components of lung tissue and saliva. Literature
studies have shown that when people suffer from LC, protein signature
appears in human saliva.[60] The presence
of specific glycosylation can be traced back to lung tissue. To this
end, we use qPCR to quantitatively characterize FUTs in lung tissues.
As shown in Figure a, eight FUTs were found in adjacent non-tumor tissues (non-cancer)
and tumor tissues (LC). Among these FUT genes, FUT4, FUT6, FUT7, and
FUT9 are highly expressed in LC, while FUT1 and FUT3 have limited
increase. Interestingly, the change in FUT8 mRNA levels between LC
and HC is negligible, although the core-fucosylation in LC is significantly
higher than that in HC.
Figure 6
Biosynthetic pathway of formation of different
fucosylation linkages
by fucosyltransferase (FUT) enzymes through qPCR quantification. (a)
qPCR quantification of FUT genes showed a substantial increase in
FUT1 (2.17-fold), FUT3 (1.68), FUT4 (6.56), FUT6 (8.49), FUT7 (7.93),
and FUT9 (11.09). The mRNA is extracted from lung tissues of adjacent
non-tumors (control) and tumors; (b) FUT8 enzyme transfers GDP-β-l-fucose to the innermost GlcNAc, forming core-fucosylation
(1.16-fold). The α1,2 linked fucosylation is catalyzed by FUT2,
and the α1,3 linked fucosylation is catalyzed by the combination
of FUT4, FUT6, and FUT9 enzymes.
Biosynthetic pathway of formation of different
fucosylation linkages
by fucosyltransferase (FUT) enzymes through qPCR quantification. (a)
qPCR quantification of FUT genes showed a substantial increase in
FUT1 (2.17-fold), FUT3 (1.68), FUT4 (6.56), FUT6 (8.49), FUT7 (7.93),
and FUT9 (11.09). The mRNA is extracted from lung tissues of adjacent
non-tumors (control) and tumors; (b) FUT8 enzyme transfers GDP-β-l-fucose to the innermost GlcNAc, forming core-fucosylation
(1.16-fold). The α1,2 linked fucosylation is catalyzed by FUT2,
and the α1,3 linked fucosylation is catalyzed by the combination
of FUT4, FUT6, and FUT9 enzymes.The different linkages of fucosylation are regulated by specific
FUT enzymes. Figure b schematically shows the biosynthetic pathway for fucosylation via
various FUTs. Theoretically, FUT1 or FUT2 catalyzes GDP-β-l-fucose to Gal, forming α1,2-linked fucosylation.[61,62] mRNA expression indicates that FUT1 is an enzyme that synthesizes
α1,2-linked fucosylation in lung tissues. FUT8 is responsible
for the synthesis of α1,6-linked fucosylation and exists in
LC and HC. FUT8 is associated with unfavorable clinical outcomes and
may be a prognostic marker of LC.[63] FUT4,
FUT6, and FUT9 are the three main isoforms that catalyze the α1,3-linked
fucosylation in LC. Several N-glycans (Figure ) have α1,2, α1,3,
and α1,6-linked fucosylation, indicating that these FUTs are
highly expressed in LC.
Discussion
Our study shows that
aberrant fucosylation is manifested in saliva
glycoproteins of LC. The characteristics of fucosylated glycans are
quite different from those of HCs or other diseases. Most glycans
have increased core and antenna fucosylation in LC. Although many
studies have reported the upregulation of α2,6-linked sialic
acids in LC serum,[64,65] the increase in sialylation of
LC saliva glycoproteins is negligible. The saliva glycoproteins, such
as Mucin-5B, IgA, lactotransferrin, zinic-a2-glycoprotein, and so
forth,[37] do possess sialic acid residues,
but in our research, we found that dominant change is fucosylation.
Our data show that the characteristics of fucosylation can distinguish
whether a patient has LC or other diseases (Figure ).Because the tumor microenvironment
alters the expression of glycoenzymes,
abnormal fucosylation has been reported in various cancers. Importantly,
fucosylation plays a vital role in cancer biology by regulating tumor
signal transduction and cell–cell adhesion pathways and performs
tumor immune surveillance through necrosis factor-related apoptosis-inducing
ligand signaling.[66] Fucosylation analysis
of prostate cancer cell lines showed that FUT1 is highly elevated
compared to normal prostate cells and is regulated in LNCaP, so glycans
carrying α1,3-linked fucose are elevated in prostate cancer.[67] Changes in the expression of fucosyltransferases
(FUTs), FUT1, FUT3, FUT6, and FUT8, are associated with poor diagnosis
and tumor metastasis in NSCLC.[68] Therefore,
it is significant to identify FUTs in saliva and how this altered
expression affects fucosylation.The fucosylation is formed
by transferring a GDP-β-l-fucose to the substrate catalyzed
by a specific fucosyltransferase.
As shown in Figure b, three different fucose linkages are catalyzed by the respective
enzymes. It is known that FUT8 can synthesize α1,6 Fuc-GlcNAc,
which is the core-fucosylated N-glycans. However,
more than one FUT enzyme can catalyze the transfer of GDP-β-l-fucose to Gal or antenna GlcNAc. For instance, FUT1 is responsible
for the synthesis of α1,2 Fuc-Gal, and any one of FUT4, FUT6,
FUT7, and FUT9 can synthesize α1,3 Fuc-GlcNAc. Studies have
shown that knocking down the FUT1 gene can attenuate tumor cell proliferation
in HER2-overexpressed NCI-N87 cells.[69] Similarly,
the upregulation of FUT1 in LC may lead to an increase in α1,2
fucosylation in LC saliva. In summary, our study shows that (1) FUT8
in LC leads to an increase in the level of core fucosylation, (2)
FUT1 upregulation is the main driving factor for the significant increase
in α1,2 linkage fucosylation, and (3) FUT4, FUT6, FUT7, and
FUT9 are highly upregulated to elevate expression of α1,3 linked
fucosylation.Due to the unique characteristics of fucosylation
in LC, the different
glycan profiles between LC and HC/other disease can be used for the
diagnosis of LC. Since each fucosylated glycoform can be recognized
by a different lectin, a microarray or lectin-based enzyme-linked
immunosorbent assay can be used to quantify and determine the fucosylated
linkage. Our future work includes the use of lectins, such as lens
culinaris agglutinin (LCA) (α1,6), ulex europaeus agglutinin
I (UEAI) (α1,2), or aleuria aurantia lectin (AAL) (α1,2,
α1,3, α1,4, and α1,6), to study linkage-specific
glycoproteins. Additionally, collecting saliva from early patients
may help determine the characteristics of fucosylation for early diagnosis.
Conclusions
Our study shows that aberrant fucosylation of saliva glycoproteins
defines LC malignancy. Since the proteins in human biofluids are highly
glycosylated, attempts are made to identify disease-specific markers
through changes in protein glycosylation in biofluids. Abnormal glycosylation
is usually produced by dysregulated glycoenzymes, which are responsible
for adding or removing monosaccharides to or from glycans. The tumor
microenvironment can cause glycoenzyme dysregulation that is very
different from the normal pathophysiological state. Lung cancer tends
to have higher FUT expression, leading to the upregulation of fucosylation.
Glycoproteomics and glycomic analysis of saliva indicate that aberrant
fucosylation is unique to LC, while other diseases (such as lung inflammatory)
or HCs show a distinct fucosylation than LC. Our results confirmed
that the increase in FUT1 expression enhanced α1,2-linked fucosylation,
while FUT4,6,7,9 catalyzed the upregulation of α1,3-linked fucosylation.
In contrast, FUT8 mRNA expression is also present in LC and adjacent
non-tumor tissues, which indicates that FUT8 mRNA alone is not sufficient
as a marker of LC, rather than using fucosylation patterns for tumor
diagnosis.
Authors: Hazel Tye; Catherine L Kennedy; Meri Najdovska; Louise McLeod; William McCormack; Norman Hughes; Anouk Dev; William Sievert; Chia Huey Ooi; Tomo-o Ishikawa; Hiroko Oshima; Prithi S Bhathal; Andrew E Parker; Masanobu Oshima; Patrick Tan; Brendan J Jenkins Journal: Cancer Cell Date: 2012-10-16 Impact factor: 31.743
Authors: Shuang Yang; Yan Wang; Matthew Mann; Qiong Wang; E Tian; Liping Zhang; John F Cipollo; Kelly G Ten Hagen; Lawrence A Tabak Journal: Glycoconj J Date: 2020-10-17 Impact factor: 2.916