| Literature DB >> 29232830 |
Shibu Krishnan1, Harry J Whitwell2, Joy Cuenco3, Aleksandra Gentry-Maharaj4, Usha Menon5, Stephen P Pereira6, Marco Gaspari7, John F Timms8.
Abstract
Biomarkers for the early detection of pancreatic cancer are urgently needed. The aim of this pilot study was to evaluate changes in serum N-glycoproteins and their glycosylation status prior to clinical presentation of pancreatic cancer that may be potential biomarkers. Prediagnosis serum samples pooled according to five time-to-diagnosis groups and a non-cancer control pool were digested with trypsin, labelled with mass tags, and subjected to titanium dioxide capture, deglycosylation, and 2D-LC-MS/MS profiling. Unbound peptides were profiled in parallel. Across the sample groups, 703 proteins were quantified and 426 putative sites of N-glycosylation were identified with evidence of several novel sites. Altered proteins with biomarker potential were predominantly abundant inflammatory response, coagulation, and immune-related proteins. Whilst glycopeptide profiles largely paralleled those of their parent proteins, there was evidence of altered N-glycosylation site occupancy or sialic acid content prior to diagnosis for some proteins, most notably of immunoglobulin gamma chains. α-1-Antitrypsin was tested as a biomarker, but found not to complement carbohydrate antigen 19-9 (CA19-9) in early detection of cancer. In conclusion, we provide preliminary evidence of altered glycosylation of several serum proteins prior to pancreatic cancer diagnosis, warranting further investigation of these proteins as early biomarkers. These changes may be largely driven by inflammatory processes that occur in response to tumour formation and progression.Entities:
Keywords: N-glycosylation; UKCTOCS; biomarkers; pancreatic cancer; serum; titanium dioxide
Mesh:
Substances:
Year: 2017 PMID: 29232830 PMCID: PMC5751272 DOI: 10.3390/ijms18122670
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Top scoring glycoproteins. The 20 highest-scoring glycoproteins from the analysis are shown with number of unique peptides, number of peptide spectrum matches, score, and ratios of protein expression in cases versus controls for each time group.
| Accession | Protein Name | Unique Peptides | † PSMs | Score | Ratio (0–0.5 years vs. † Ctrl) | Ratio (0.5–1 years vs. Ctrl) | Ratio (1–2 years vs. Ctrl) | Ratio (2–3 years vs. Ctrl) | Ratio (3+ years vs. Ctrl) |
|---|---|---|---|---|---|---|---|---|---|
| P01009 | α-1-Antitrypsin | 25 | 1183 | 27.0 | 3.294 | 2.207 | 2.000 | 2.230 | 1.930 |
| P00738 | Haptoglobin | 10 | 951 | 25.8 | 2.720 | 2.077 | 1.551 | 2.076 | 1.636 |
| P00739 | Haptoglobin-related protein | 4 | 354 | 22.8 | 2.832 | 2.053 | 1.592 | 1.959 | 1.660 |
| P02763 | α-1-Acid glycoprotein 1 | 7 | 239 | 19.0 | 1.909 | 1.554 | 1.294 | 1.662 | 1.319 |
| P01857 | Igγ-1 chain C region | 7 | 738 | 18.8 | 1.714 | 3.338 | 1.613 | 1.202 | 1.535 |
| P01860 | Igγ-3 chain C region | 37 | 385 | 18.6 | 1.726 | 3.315 | 1.653 | 1.264 | 1.538 |
| P01871 | Igμ chain C region | 18 | 139 | 18.4 | 1.558 | 1.637 | 1.306 | 1.304 | 1.273 |
| P02787 | Serotransferrin | 53 | 1610 | 17.8 | 1.723 | 2.017 | 1.466 | 1.200 | 1.416 |
| P19652 | α-1-acid glycoprotein 2 | 20 | 196 | 17.8 | 1.819 | 1.552 | 1.326 | 1.608 | 1.376 |
| P01023 | α-2-macroglobulin | 5 | 142 | 17.8 | 1.707 | 1.605 | 1.337 | 1.232 | 1.340 |
| P08603 | Complement factor H | 5 | 101 | 17.0 | 1.503 | 1.373 | 1.234 | 1.330 | 1.378 |
| P01880 | Igδ chain C region | 16 | 49 | 17.0 | 1.892 | 2.748 | 1.942 | 1.557 | 1.405 |
| P01861 | Igγ-4 chain C region | 32 | 471 | 17.0 | 1.558 | 2.878 | 1.595 | 1.246 | 1.478 |
| P00748 | Coagulation factor XII | 15 | 657 | 17.0 | 1.489 | 1.359 | 1.241 | 1.351 | 1.125 |
| P05543 | Thyroxine-binding globulin | 13 | 83 | 16.8 | 0.828 | 0.765 | 0.769 | 0.837 | 0.884 |
| P10643 | Complement component 7 | 16 | 146 | 16.8 | 1.432 | 1.341 | 1.508 | 1.350 | 1.260 |
| P43652 | Afamin | 31 | 676 | 16.4 | 0.811 | 0.790 | 0.822 | 0.910 | 0.998 |
| P02765 | α-2-HS-glycoprotein | 13 | 18 | 16.2 | 0.970 | 0.975 | 0.977 | 1.025 | 1.118 |
| P43251 | Biotinidase | 12 | 64 | 16.2 | 0.957 | 0.931 | 0.986 | 1.013 | 1.039 |
| P04114 | Apolipoprotein B100 | 20 | 73 | 16.2 | 1.152 | 1.333 | 1.510 | 1.498 | 1.146 |
† PSMs, peptide spectrum matches; Ctrl, non-cancer control.
Top-scoring glycopeptides. The 20 highest-scoring glycopeptides are shown with number of peptide spectrum matches, score, and ratios of expression in cases versus controls for each time group.
| Sequence | Accession | Protein Name | † PSMs | Score | Ratio (0–0.5 years vs. Ctrl) | Ratio (0.5–1 years vs. Ctrl) | Ratio (1–2 years vs. Ctrl) | Ratio (2–3 years vs. Ctrl) | Ratio (3+ years vs. Ctrl) |
|---|---|---|---|---|---|---|---|---|---|
| yLGnATAIFFLPDEGk | P01009 | α-1-Antitrypsin | 27 | 31 | 4.06 | 2.69 | 2.39 | 2.63 | 2.53 |
| nLFLnHSEnATAk | P00738 | Haptoglobin | 42 | 28.8 | 3.28 | 2.38 | 1.67 | 2.36 | 1.79 |
| nLFLnHSEnATAk | P00738 | Haptoglobin | 43 | 24 | 2.66 | 1.90 | 1.58 | 1.91 | 1.55 |
| vVLHPnYSQVDIGLIk | P00738 | Haptoglobin | 16 | 20.2 | 2.60 | 1.84 | 1.31 | 1.92 | 1.44 |
| eEqFnSTFR | P01859 | Igγ-2 chain C region | 16 | 19.6 | 0.46 | 0.50 | 0.53 | 0.60 | 0.58 |
| lGAcnDTLQQLmEVFk | P01008 | Antithrombin-III | 7 | 19 | 0.81 | 0.70 | 0.71 | 0.73 | 0.75 |
| eTFFnLSk | Q9UK55 | Protein Z-dependent protease inhibitor | 3 | 19 | 1.91 | 2.05 | 2.53 | 2.19 | 1.98 |
| enGTISR | P02763 | α-1-Acid glycoprotein 1 | 57 | 19 | 2.25 | 1.74 | 1.48 | 1.99 | 1.45 |
| qDqcIYnTTYLNVQR | P02763 | α-1-Acid glycoprotein 1 | 149 | 18.6 | 1.98 | 1.61 | 1.36 | 1.72 | 1.33 |
| lGHcPDPVLVnGEFSSSGPVnVSDk | P20851 | C4b-binding protein β chain | 8 | 18.4 | 1.66 | 1.76 | 1.37 | 1.55 | 1.44 |
| qNQcFYnSSYLNVQR | P19652 | α-1-Acid glycoprotein 2 | 24 | 18 | 1.93 | 1.57 | 1.20 | 1.53 | 1.26 |
| aAIPSALDTnSSk | Q08380 | Galectin-3-binding protein | 24 | 17.6 | 0.90 | 0.94 | 0.99 | 1.00 | 1.09 |
| enGTVSR | P19652 | α-1-Acid glycoprotein 2 | 22 | 17.2 | 1.93 | 1.64 | 1.33 | 1.67 | 1.45 |
| eDAAVncTDISVQk | Q86VB7 | Scavenger receptor cysteine-rich type 1 | 3 | 17 | 0.61 | 0.63 | 0.64 | 0.65 | 0.72 |
| vcQDcPLLAPLnDTR | P02765 | α-2-HS-glycoprotein | 49 | 17 | 0.93 | 1.09 | 0.91 | 1.12 | 1.24 |
| dIEnFnSTQk | P43652 | Afamin | 56 | 17 | 0.80 | 0.74 | 0.83 | 0.87 | 0.94 |
| nYTLTGR | P10643 | Complement component C7 | 3 | 17 | 2.15 | 1.94 | 2.07 | 1.75 | 1.55 |
| aDGTVNQIEGEATPVnLTEPAk | P05090 | Apolipoprotein D | 66 | 16.8 | 1.24 | 1.15 | 1.12 | 1.00 | 1.18 |
| enLTAPGSDSAVFFEQGTTR | P00450 | Ceruloplasmin | 131 | 16.8 | 0.88 | 0.85 | 0.85 | 0.89 | 0.98 |
| nVTAEQAR | P00748 | Coagulation factor XII | 10 | 16.8 | 1.67 | 1.64 | 1.31 | 1.49 | 1.16 |
† PSMs, peptide spectrum matches; Ctrl, non-cancer control.
Figure 1Global comparison of glycoprotein and matched glycopeptide profiles across the time groups. Case versus control (Ctrl) ratios for glycoproteins were indexed in order of increasing ratio for each prediagnosis time group and plotted with ratios of matching glycopeptides. Protein ratios are repeated for proteins with multiple matched glycopeptides.
Figure 2(a) Examples of altered glycoproteins whose constituent glycopeptides match the pattern of expression of their parent protein; (b) Examples of glycoproteins whose constituent glycopeptides deviate from the pattern of expression of their parent protein. Graphs show the median natural log (ln) ratio of case versus control for protein (open circles) at each time group prior to diagnosis of pancreatic cancer. Protein ratios were derived from TMT reporter ion intensity ratios for all peptides matching that protein. Closed symbols show ln ratio for case versus control across time groups for specific glycopeptides matching those proteins. Some examples show the same peptide sequence with different deamidation (N and Q) modifications.
Figure 3Expression of immunoglobulin G heavy chains and constituent glycopeptides. Graphs show the median natural log (ln) ratio of case versus control for protein (open circles) at each time group prior to diagnosis of pancreatic cancer. Protein ratios were derived from TMT reporter ion intensity ratios for all peptides matching that protein from analysis of the TiO2 non-bound fractions (Table S6) to mitigate the significant effect that glycopeptide count had on the total protein ratio; IGHG2 glycopeptide counts represented more than half of total peptide counts. Closed symbols show the ln ratio across time groups for specific matched glycopeptides using reporter ion intensity ratios from the combined analysis.
Figure 4Testing α-1-antitrypsin (A1AT) as an early biomarker. Box and whisker plots showing A1AT ELISA data for individual case samples in different time groups prior to PDAC diagnosis and matched controls. Whiskers represent the 5th and 95th percentiles. The p values are shown for significant (p < 0.05) differences.
Performance of logistic regression models comparing carbohydrate antigen 19-9 (CA19-9) alone and CA19-9 and A1AT in combination. The number of case samples is shown for each group. Data from all control samples (n = 120) was used. The area under the receiver operating characteristics curve (AUC) and sensitivities are shown for both models for each time group at a fixed specificity of 0.95.
| Time Group | No. Case Samples | Marker Model | AUC | Sensitivity at 0.95 Specificity |
|---|---|---|---|---|
| 0–1 years | 34 | CA9-9 | 0.799 | 0.647 |
| CA19-9, A1AT | 0.88 | 0.647 | ||
| 1–2 years | 17 | CA9-9 | 0.579 | 0.176 |
| CA19-9, A1AT | 0.664 | 0.235 | ||
| 2+ years | 41 | CA9-9 | 0.618 | 0.073 |
| CA19-9, A1AT | 0.644 | 0.146 | ||
| 0–2 years | 51 | CA9-9 | 0.726 | 0.49 |
| CA19-9, A1AT | 0.76 | 0.471 | ||
| 0–3 years | 64 | CA9-9 | 0.7 | 0.422 |
| CA19-9, A1AT | 0.738 | 0.438 | ||
| 0–4+ years (All) | 92 | CA9-9 | 0.678 | 0.304 |
| CA19-9, A1AT | 0.709 | 0.315 |
Study set characteristics.
| Variable | Cases | Controls | |
|---|---|---|---|
| No. individuals | 154 | 304 | - |
| No. samples | 261 | 304 | - |
| Tumour site | |||
| Tail | 8 | na | - |
| Body | 10 | na | - |
| Head | 65 | na | - |
| Unspecified | 71 | na | - |
| Mean time to spin (h) (range) | 21.8 (0.5–47) | 22.0 (6.9–47) | 0.62 |
| Mean age at sample draw (years) (range) | 64.64 (51.6–74.4) | 62.67 (50.6–77.5) | 0.049 |
| Mean BMI (kg/m2) (range) | 27.1 (18.6–42.7) | 26.7 (17.9–44.4) | 0.446 |
| Mean time from sample collection to diagnosis (months) (range) | 25.5 (0–79) | na | - |
BMI, Body Mass Index; na, not applicable.