| Literature DB >> 28098776 |
Yarrow J McConnell1,2, Farshad Farshidfar3, Aalim M Weljie4,5, Karen A Kopciuk6,7, Elijah Dixon8,9, Chad G Ball10, Francis R Sutherland11, Hans J Vogel12, Oliver F Bathe13,14.
Abstract
Previous work demonstrated that serum metabolomics can distinguish pancreatic cancer from benign disease. However, in the clinic, non-pancreatic periampullary cancers are difficult to distinguish from pancreatic cancer. Therefore, to test the clinical utility of this technology, we determined whether any pancreatic and periampullary adenocarcinoma could be distinguished from benign masses and biliary strictures. Sera from 157 patients with malignant and benign pancreatic and periampullary lesions were analyzed using proton nuclear magnetic resonance (¹H-NMR) spectroscopy and gas chromatography-mass spectrometry (GC-MS). Multivariate projection modeling using SIMCA-P+ software in training datasets (n = 80) was used to generate the best models to differentiate disease states. Models were validated in test datasets (n = 77). The final ¹H-NMR spectroscopy and GC-MS metabolomic profiles consisted of 14 and 18 compounds, with AUROC values of 0.74 (SE 0.06) and 0.62 (SE 0.08), respectively. The combination of ¹H-NMR spectroscopy and GC-MS metabolites did not substantially improve this performance (AUROC 0.66, SE 0.08). In patients with adenocarcinoma, glutamate levels were consistently higher, while glutamine and alanine levels were consistently lower. Pancreatic and periampullary adenocarcinomas can be distinguished from benign lesions. To further enhance the discriminatory power of metabolomics in this setting, it will be important to identify the metabolomic changes that characterize each of the subclasses of this heterogeneous group of cancers.Entities:
Keywords: biomarkers; metabolomics; pancreatic cancer; periampullary adenocarcinoma
Year: 2017 PMID: 28098776 PMCID: PMC5372206 DOI: 10.3390/metabo7010003
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Clinical and technical variables for each allocation of training and test sets.
| Allocation A | Allocation B | Allocation C | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Training | Test | Training | Test | Training | Test | |||||
| Age | <60 yrs | 24 | 24 | 0.87 | 27 | 21 | 0.38 | 22 | 26 | 0.40 |
| ≥60 yrs | 56 | 53 | 53 | 56 | 58 | 51 | ||||
| Gender | Male | 45 | 37 | 0.31 | 46 | 36 | 0.18 | 40 | 42 | 0.57 |
| Female | 35 | 40 | 34 | 41 | 40 | 35 | ||||
| Lesion Location | Head/Uncinate | 52 | 58 | 0.31 | 54 | 56 | 0.59 | 55 | 55 | 0.46 |
| Body/Tail | 20 | 15 | 19 | 16 | 20 | 15 | ||||
| Lesion Type | Mass | 58 | 53 | 0.37 | 59 | 52 | 0.57 | 54 | 57 | 0.12 |
| Stricture | 8 | 11 | 7 | 12 | 13 | 6 | ||||
| Cyst | 9 | 11 | 11 | 9 | 12 | 8 | ||||
| Stage (for Malignant Lesions Only) | I | 11 | 8 | 0.39 | 5 | 14 | 0.24 | 12 | 7 | 0.25 |
| II | 27 | 28 | 30 | 25 | 27 | 28 | ||||
| III | 16 | 13 | 18 | 11 | 14 | 15 | ||||
| IV | 7 | 12 | 8 | 11 | 8 | 11 | ||||
| Surgically Resected | Yes | 48 | 44 | 0.72 | 43 | 49 | 0.21 | 49 | 43 | 0.49 |
| No | 32 | 33 | 37 | 28 | 31 | 34 | ||||
| Jaundice | Yes | 13 | 18 | 0.26 | 14 | 17 | 0.47 | 15 | 16 | 0.75 |
| No | 67 | 59 | 66 | 60 | 65 | 61 | ||||
| Diabetes Mellitus | Yes | 20 | 13 | 0.21 | 17 | 16 | 0.94 | 18 | 15 | 0.64 |
| No | 60 | 64 | 63 | 61 | 62 | 62 | ||||
| Bowel Cleansing | Yes | 43 | 43 | 1.0 | 42 | 44 | 0.30 | 47 | 39 | 0.46 |
| No | 25 | 25 | 29 | 21 | 24 | 26 | ||||
| Sampling Location | Laboratory | 12 | 17 | 0.25 | 17 | 12 | 0.36 | 11 | 18 | 0.12 |
| OR | 68 | 60 | 63 | 65 | 69 | 59 | ||||
* p values are for Mann-U-Witney testing between subgroups. Italicized variables were used as stratification factors in the randomized allocation process.
Figure 1Principal components analysis (PCA) results. Scatter plots showing scores (t) in first two components of PCA models for one training dataset ((A) 1H-NMR; (B) GC-MS; (C) Combined). Results from other training sets were similar. Plots coded for patient diagnosis: malignant: ▲ vs. benign: ♢.
Results of orthogonal partial least squares discriminant analysis (OPLS-DA).
| Dataset | Mean of Training Sets ( | Mean of Test Sets ( | ||||
|---|---|---|---|---|---|---|
| X | R2 | Q2 | AUROC | SE | ||
| 1H-NMR | 14 | 0.308 | 0.184 | 1.80 × 10−3 | 0.74 | 0.06 |
| GC-MS | 18 | 0.312 | 0.188 | 8.40 × 10−4 | 0.62 | 0.08 |
| Combined * | 20 | 0.478 | 0.324 | 6.14 × 10−6 | 0.66 | 0.08 |
* The combined dataset includes metabolite features from both 1H-NMR and GC-MS data, with averaged values for metabolites detected by both platforms. X: Mean number of unique metabolites/features in the focused metabolite lists across three randomized allocations of training/test set assignment; R2: goodness of fit; Q2: predictive ability of model (7-fold internal cross validation); p: p-value for CVANOVA testing; AUROC: area under the receiver operating curve; SE: standard error.
Figure 2Orthogonal partial least squares discriminant analysis (OPLS-DA) results. Scatter plots showing scores (t) in first (t[1]) and orthogonal (to[1]) components of final OPLS-DA models for one training dataset (A) 1H-NMR; (B) GC-MS; (C) Combined). Results from other training sets were similar. Plots coded for patient diagnosis: malignant: ▲ vs. benign: ♢.
Summary list of metabolite features included in final focused models.
| Metabolite | Datasets | Mean Coeff | Mean SE (Coeff) | Mean VIP | Mean SE (VIP) | |||
|---|---|---|---|---|---|---|---|---|
| Galactose | G, C | 0.121 | 0.069 | 1.123 | 0.683 | - | 0.001 | |
| Unmatched RI:1007.82 QI: 67, 82, 83 | G, C | 0.120 | 0.074 | 1.337 | 0.708 | - | 0.11 | |
| Isopropanol | N, C | 0.114 | 0.042 | 1.001 | 0.382 | 0.01 | - | |
| Phenylalanine | N, G, C | 0.109 | 0.057 | 1.052 | 0.621 | 0.004 | 0.15 | |
| Glutamate | N, G, C | 0.105 | 0.064 | 1.127 | 0.616 | 0.01 | 0.01 | |
| Mannose | N, C | 0.102 | 0.069 | 1.220 | 0.410 | 0.01 | - | |
| Trimethylamine-N-oxide | N | 0.092 | 0.061 | 0.867 | 0.503 | 0.08 | - | |
| Arabitol | G, C | 0.090 | 0.047 | 0.967 | 0.409 | - | 0.16 | |
| Threitol | G, C | 0.088 | 0.080 | 0.889 | 0.816 | - | 0.14 | |
| Succinate | N, C | 0.086 | 0.115 | 0.743 | 0.777 | - | - | |
| Urea | N, G, C | 0.074 | 0.058 | 0.965 | 0.604 | 0.08 | 0.19 | |
| Myo-Inositol | N, G, C | 0.070 | 0.061 | 0.991 | 0.582 | 0.04 | 0.16 | |
| Trehalose-alpha | G, C | 0.059 | 0.053 | 0.624 | 0.572 | - | 0.21 | |
| Match RI:2018.25 QI: 191, 217, 305, 318, 507 | G, C | −0.029 | 0.055 | 0.568 | 0.680 | - | 0.79 | |
| Tridecanol | G | −0.060 | 0.051 | 0.738 | 0.613 | - | 0.28 | |
| Azelaic acid | G | −0.061 | 0.038 | 0.814 | 0.526 | - | 0.04 | |
| Unmatched RI:2475.33 QI: 73, 375, 376 | G, C | −0.066 | 0.048 | 0.791 | 0.475 | - | 0.01 | |
| Pyroglutamate | N | −0.068 | 0.036 | 0.696 | 0.306 | 0.18 | - | |
| Isoleucine | G | −0.069 | 0.091 | 0.778 | 1.069 | - | 0.05 | |
| Tyrosine | N, G | −0.074 | 0.058 | 0.862 | 0.669 | 0.21 | 0.08 | |
| Arginine | N, C | −0.080 | 0.055 | 0.721 | 0.500 | 0.38 | - | |
| Unmatched RI:1913.88 QI: 156, 174, 317 | G, C | −0.090 | 0.067 | 1.092 | 0.863 | - | 0.01 | |
| Proline | N, G, C | −0.096 | 0.063 | 1.009 | 0.547 | 0.03 | 0.10 | |
| Alanine | N, C | −0.098 | 0.041 | 0.853 | 0.311 | 0.01 | - | |
| Ornithine | N, G, C | −0.104 | 0.068 | 0.997 | 0.687 | 0.06 | 0.07 | |
| Creatine | N, C | −0.107 | 0.041 | 0.952 | 0.267 | 0.06 | - | |
| Glutamine | N, G, C | −0.115 | 0.072 | 1.107 | 0.686 | 0.0002 | 0.0001 | |
| Lysine | N, C | −0.117 | 0.037 | 1.289 | 0.345 | 0.01 | - | |
| Threonine | N, G, C | −0.137 | 0.065 | 1.360 | 0.538 | 0.04 | 0.001 | |
| Unmatched RI:1971.99 QI: 185, 247, 275 | G, C | –0.138 | 0.069 | 1.346 | 0.640 | - | 0.03 |
N: 1H-nuclear magnetic resonance spectroscopy, G: gas chromatography mass spectrometry, C: combined dataset, Coeff: regression coefficient for given X variable (metabolite) in the modeled Y variable (malignant versus benign), positive values associated with malignancy and negative values associated with benign disease; SE: standard error; RI: retention index, QI: quantification ions; VIP: variable importance to projection expresses overall contribution to the model. Metabolite features in italics were found in the focused lists for all three datasets.
Topological metabolic pathway analysis.
| Metabolic Pathway | Total Compounds in Pathway | Hits in Current Dataset | Impact Factor | |
|---|---|---|---|---|
| Arginine and proline metabolism | 77 | 7 | 8.49 × 10−5 | 0.456 |
| Alanine, aspartate, and glutamate metabolism | 24 | 4 | 2.60 × 10−4 | 0.441 |
| Galactose metabolism | 41 | 3 | 8.63 × 10−5 | 0.224 |
| Lysine degradation | 47 | 1 | 4.09 × 10−3 | 0.147 |
| D-Glutamine and D-glutamate metabolism | 11 | 2 | 1.37 × 10−3 | 0.139 |
| Inositol phosphate metabolism | 39 | 1 | 3.00 × 10−2 | 0.137 |
| Phenylalanine metabolism | 45 | 3 | 6.60 × 10−3 | 0.119 |
| Aminoacyl-tRNA biosynthesis | 75 | 10 | 8.90 × 10−7 | 0.113 |
| Lysine biosynthesis | 32 | 1 | 4.09 × 10−3 | 0.100 |
| Glycine, serine and threonine metabolism | 48 | 2 | 7.07 × 10−4 | 0.097 |
| Tyrosine metabolism | 76 | 2 | 2.77 × 10−2 | 0.047 |
| Taurine and hypotaurine metabolism | 20 | 1 | 8.27 × 10−3 | 0.032 |
| Fructose and mannose metabolism | 48 | 1 | 1.56 × 10−3 | 0.029 |
| Butanoate metabolism | 40 | 2 | 6.28 × 10−3 | 0.018 |
| Valine, leucine, and isoleucine biosynthesis | 27 | 2 | 9.74 × 10−4 | 0.013 |
| Glutathione metabolism | 38 | 3 | 3.35 × 10−3 | 0.013 |
| Phenylalanine, tyrosine, and tryptophan biosynthesis | 27 | 2 | 1.05 × 10−2 | 0.008 |
| Purine metabolism | 92 | 2 | 5.70 × 10−4 | 0.008 |
Produced using MetaboAnalyst software. For each pathway, the total number of known metabolites, along with the number of those found in the current dataset (“hits”) are reported. The p value is reported for the statistical comparison of metabolite feature levels between malignant and benign samples. The impact factor expresses the degree of centrality of the identified changes to the pathway functioning overall.