| Literature DB >> 20300169 |
Masahiro Sugimoto, David T Wong, Akiyoshi Hirayama, Tomoyoshi Soga, Masaru Tomita.
Abstract
Saliva is a readily accessible and informative biofluid, making it ideal for the early detection of a wide range of diseases including cardiovascular, renal, and autoimmune diseases, viral and bacterial infections and, importantly, cancers. Saliva-based diagnostics, particularly those based on metabolomics technology, are emerging and offer a promising clinical strategy, characterizing the association between salivary analytes and a particular disease. Here, we conducted a comprehensive metabolite analysis of saliva samples obtained from 215 individuals (69 oral, 18 pancreatic and 30 breast cancer patients, 11 periodontal disease patients and 87 healthy controls) using capillary electrophoresis time-of-flight mass spectrometry (CE-TOF-MS). We identified 57 principal metabolites that can be used to accurately predict the probability of being affected by each individual disease. Although small but significant correlations were found between the known patient characteristics and the quantified metabolites, the profiles manifested relatively higher concentrations of most of the metabolites detected in all three cancers in comparison with those in people with periodontal disease and control subjects. This suggests that cancer-specific signatures are embedded in saliva metabolites. Multiple logistic regression models yielded high area under the receiver-operating characteristic curves (AUCs) to discriminate healthy controls from each disease. The AUCs were 0.865 for oral cancer, 0.973 for breast cancer, 0.993 for pancreatic cancer, and 0.969 for periodontal diseases. The accuracy of the models was also high, with cross-validation AUCs of 0.810, 0.881, 0.994, and 0.954, respectively. Quantitative information for these 57 metabolites and their combinations enable us to predict disease susceptibility. These metabolites are promising biomarkers for medical screening. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-009-0178-y) contains supplementary material, which is available to authorized users.Entities:
Year: 2009 PMID: 20300169 PMCID: PMC2818837 DOI: 10.1007/s11306-009-0178-y
Source DB: PubMed Journal: Metabolomics ISSN: 1573-3882 Impact factor: 4.290
Subject characteristics
| Groups | Control | Oral cancer | Breast cancer | Pancreatic cancer | Periodontal diseases |
|---|---|---|---|---|---|
|
| |||||
| Min–Max (median) | 20–75 (43) | 34–87 (59.5) | 29–77 (57) | 11–87 (67) | 23–76 (60) |
| Missing | 2 | 5 | 10 | 2 | 2 |
|
| |||||
| Male | 42 | 41 | N/A | ||
| Female | 27 | 23 | |||
| Missing | 18 | 5 | 30 | 18 | 11 |
|
| |||||
| Total | 87 | 69 | 30 | 18 | 11 |
| Caucasian | 37 | 41 | N/A | ||
| Asia | 15 | 5 | |||
| African-American | 12 | 4 | |||
| Hispanic | 5 | 5 | |||
| Missing | 18 | 14 | 30 | 18 | 11 |
N/A not available
Fig. 1A summary of the different metabolome profiles of cations obtained from CE-TOF-MS analyses of salivary metabolites from control (n = 87) and oral cancer samples (n = 69). The X and Y axes represent the migration time and the m/z value, respectively. The color density reflects the difference in intensity between the averaged control and oral cancer samples. Black circles indicate peaks that are significantly different between healthy control and oral cancer samples (P < 0.05; Steel–Dwass test). The small linked figures include overlaid electropherograms of control (blue) and oral cancer samples (red)
Salivary metabolite marker candidates (P < 0.05 Steel–Dwass test) and the ratio of the relative area of oral, breast and pancreatic cancers, and periodontal diseases to controls (n = 215)
| Metabolite |
| Identification (*1) | HMDB (*2) |
| ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Theoretical | Measured | Error (ppm) | Control vs oral cancer | Control vs breast cancer | Control vs pancreatic cancer | Control vs periodontal desease | Oral cancer vs breast cancer | |||
| C2H6N2 | 59.0604 | 59.0616 | 21 | 0.260 | 4.17 × 10−6*** | 2.67 × 10−4*** | 3.46 × 10−4*** | 1.51 × 10−8*** | ||
| C32H48O13 (*4) | 214.4438 | 214.4440 | 1.0 | 0.834 | 0.00385** | 0.271 | 0.907 | 4.75 × 10−4*** | ||
| C3H7NO2 | 90.0550 | 90.0553 | 3.7 | 0.955 | 0.167 | 1.40 × 10−6*** | 0.0242* | 0.154 | ||
| C4H12N5 | 131.1165 | 131.1174 | 6.8 | 0.686 | 0.00290** | 3.37 × 10−6*** | 0.0251* | 0.323 | ||
| C4H9NO2 | 104.0706 | 104.0706 | −0.35 | 0.620 | 0.913 | 5.78 × 10−6*** | 0.779 | 1.00 | ||
| Cadaverine | 103.1230 | 103.1231 | 1.4 | Standard | HMDB02322 | 0.0422* | 0.00889** | 0.00104** | 0.00100** | 0.993 |
| C5H11NO2 (10.05 min.) | 118.0863 | 118.0864 | 0.86 | 0.910 | 0.408 | 0.192 | 0.0308* | 0.416 | ||
| C30H62N19O2S3 (*3) | 409.2321 | 409.2312 | −2.2 | 0.367 | 2.39 × 10−5*** | 4.64 × 10−4*** | 0.0141* | 0.0354* | ||
| C30H55N27O3S (*3) | 437.7424 | 437.7442 | 4.2 | 0.428 | 0.0305* | 0.0114* | 0.672 | 0.532 | ||
| C18H32N6O6 (*3) | 215.1264 | 215.1269 | 2.2 | 0.891 | 0.386 | 0.0387* | 0.997 | 0.143 | ||
| Alpha-Aminobutyric acid | 104.0706 | 104.0705 | −1.0 | Standard | HMDB00650 | 0.00256** | 0.0269* | 4.37 × 10−6*** | 0.00796** | 1.00 |
| Alanine | 90.0550 | 90.0551 | 1.1 | Standard | HMDB00161 | 2.45 × 10−4*** | 0.0277* | 4.57 × 10−6*** | 0.00647** | 0.956 |
| Putrescine | 89.1073 | 89.1075 | 1.5 | Standard | HMDB01414 | 0.890 | 0.00108** | 1.91 × 10−4*** | 0.0227* | 0.138 |
| Methylimidazoleacetic acid (*5) | 141.0659 | 141.0660 | 0.02 | Predicted | HMDB02820 | 0.183 | 0.390 | 5.32 × 10−4*** | 0.213 | 1.00 |
| Trimethylamine (*5) | 60.0808 | 60.0814 | 11 | Predicted | HMDB00906 | 0.609 | 0.921 | 4.97 × 10−4*** | 1.99 × 10−4*** | 1.00 |
| C5H14N5 | 145.1322 | 145.1332 | 6.9 | 0.0212* | 0.0700 | 2.28 × 10−4*** | 0.0825 | 0.998 | ||
| Piperidine | 86.0964 | 86.0966 | 2.4 | Standard | 0.00119** | 0.934 | 0.0347* | 0.137 | 0.165 | |
| Taurine | 126.0219 | 126.0221 | 1.0 | Standard | HMDB00251 | 3.57 × 10−5*** | 5.94 × 10−6*** | 2.36 × 10−4*** | 6.70 × 10−5*** | 6.82 × 10−10*** |
| Piperideine (*5) | 84.0808 | 84.0807 | −0.41 | Predicted | 2.83 × 10−4*** | 0.0128* | 0.569 | 3.19 × 10−4*** | 2.79 × 10−6*** | |
| Pipecolic acid | 130.0863 | 130.0861 | −0.93 | Standard | HMDB00070 | 1.87 × 10−4*** | 0.264 | 0.943 | 0.00948** | 3.47 × 10−4*** |
| C4H9N | 72.0808 | 72.0813 | 7.8 | 2.02 × 10−7*** | 0.729 | 3.37 × 10−6*** | 0.949 | 5.62 × 10−4*** | ||
| C8H9N | 120.0808 | 120.0801 | −5.9 | 2.64 × 10−5*** | 4.60 × 10−4*** | 1.00 | 0.0385* | 1.02 × 10−8*** | ||
| Pyrroline hydroxycarboxylic acid (*5) | 130.0499 | 130.0498 | −0.49 | Predicted | HMDB01369 | 1.28 × 10−5*** | 0.141 | 0.228 | 0.176 | 1.36 × 10−5*** |
| Betaine | 118.0863 | 118.0864 | 0.86 | Standard | HMDB00043 | 0.0162* | 0.719 | 0.987 | 0.576 | 0.0183* |
| C4H7N | 70.0651 | 70.0655 | 5.8 | 0.101 | 0.637 | 0.997 | 0.502 | 0.0349* | ||
| C6H6N2O2 | 139.0502 | 139.0500 | −1.8 | 0.00270** | 0.265 | 1.65 × 10−10−4*** | 0.127 | 0.948 | ||
| Leucine + Isoleucine | 132.1019 | 132.1019 | −0.30 | Standard | HMDB00687 + HMDB00172 | 1.56 × 10−5*** | 0.00136** | 7.47 × 10−9*** | 0.00150** | 1.00 |
| Phenylalanine | 166.0863 | 166.0863 | 0.46 | Standard | HMDB00159 | 0.00333** | 0.0149* | 2.95 × 10−7*** | 0.0198* | 1.00 |
| Tyrosine | 182.0812 | 182.0812 | 0.16 | Standard | HMDB00158 | 0.0253* | 0.00908** | 1.00 × 10−5*** | 0.0279* | 0.926 |
| Histidine | 156.0768 | 156.0766 | −1.2 | Standard | HMDB00177 | 6.87 × 10−4*** | 0.350 | 0.00141** | 0.246 | 0.805 |
| Proline | 116.0705 | 116.0705 | −0.19 | Standard | HMDB00162 | 0.968 | 0.00165** | 0.00115** | 0.0598 | 0.0299* |
| Lysine | 147.1128 | 147.1127 | −1.0 | Standard | HMDB00182 | 0.0779 | 9.56 × 10−4*** | 5.22 × 10−6*** | 0.459 | 0.426 |
| Glycine | 76.0393 | 76.0397 | 4.8 | Standard | HMDB00123 | 1.00 | 0.0140* | 0.00216** | 0.193 | 0.0753 |
| Ornithine | 133.0972 | 133.0971 | −0.22 | Standard | HMDB00214 | 0.293 | 0.0114* | 0.0220* | 0.0801 | 0.652 |
| C17H26N4O5 | 367.1976 | 367.1977 | 0.21 | 0.981 | 0.0990 | 0.0309* | 0.0561 | 0.430 | ||
| Pro-Gly-Pro or Pro-Pro-Gly (*6) | 270.1449 | 270.1452 | 1.1 | Predicted | 0.143 | 0.635 | 0.00165** | 0.425 | 0.990 | |
| C7H12N2O3 | 173.0921 | 173.0919 | −1.1 | 0.892 | 0.854 | 0.00289** | 0.902 | 0.999 | ||
| Burimamide (*5) | 213.1168 | 213.1233 | 30 | Predicted | 0.999 | 0.266 | 0.0439* | 0.950 | 0.224 | |
| Ethanolamine | 62.0600 | 62.0601 | 1.1 | Predicted | HMDB00149 | 0.684 | 0.0483* | 2.43 × 10−7*** | 0.00187** | 0.597 |
| Gamma-Aminobutyric acid | 104.0706 | 104.0706 | −0.35 | Standard | HMDB00112 | 0.833 | 0.139 | 2.10 × 10−7*** | 0.0133 | 0.897 |
| Aspartic acid | 134.0448 | 134.0447 | −0.34 | Standard | HMDB00191 | 0.287 | 0.00564** | 1.56 × 10−7*** | 0.403 | 0.416 |
| Valine | 118.0863 | 118.0862 | −0.72 | Standard | HMDB00883 | 7.31 × 10−5*** | 0.0204* | 1.94 × 10−8*** | 0.0138 | 0.990 |
| Tryptophan | 205.0972 | 205.0972 | −0.091 | Standard | HMDB00929 | 6.13 × 10−5*** | 0.402 | 2.99 × 10−8*** | 0.0113* | 0.229 |
| Beta-Alanine | 90.0550 | 90.0551 | 1.1 | Standard | HMDB00056 | 0.0407* | 0.722 | 3.56 × 10−6*** | 0.268 | 0.842 |
| Citrulline | 176.1030 | 176.1028 | −0.80 | Standard | HMDB00904 | 0.148 | 0.0509 | 0.00420** | 0.130 | 0.960 |
| Glutamic acid | 148.0604 | 148.0603 | −1.1 | Standard | HMDB00148 | 4.95 × 10−4*** | 0.00528** | 1.49 × 10−8*** | 0.0757 | 1.00 |
| Threonine | 120.0655 | 120.0654 | −0.64 | Standard | HMDB00167 | 1.18 × 10−4*** | 0.00790** | 5.72 × 10−8*** | 1.80 × 10−4*** | 1.00 |
| Serine | 106.0499 | 106.0499 | 0.33 | Standard | HMDB00187 | 0.0197* | 0.0119* | 3.42 × 10−7*** | 0.00699** | 0.846 |
| Glutamine | 147.0764 | 147.0764 | −0.43 | Standard | HMDB00641 | 0.0327* | 0.368 | 1.07 × 10−6*** | 0.111 | 0.975 |
| Hypoxanthine | 137.0458 | 137.0457 | −0.96 | Standard | HMDB00157 | 0.207 | 0.616 | 4.93 × 10−6*** | 0.00279** | 0.107 |
| Choline (*5) | 104.1070 | 104.1070 | 0.22 | Predicted | HMDB00097 | 2.30 × 10−5*** | 1.00 | 1.91 × 10−4*** | 0.0580 | 0.0115* |
| Carnitine | 162.1125 | 162.1124 | −0.38 | Standard | HMDB00062 | 7.60 × 10−4*** | 0.956 | 0.00341** | 0.996 | 0.247 |
| C5H11NO2 (13.42 min.) | 118.0863 | 118.0863 | 0.33 | 0.0606 | 0.958 | 3.73 × 10−4*** | 0.915 | 0.754 | ||
| Glycerophosphocholine | 258.1107 | 258.1121 | 5.5 | Standard | HMDB00086 | 0.287 | 4.53 × 10−5*** | 0.00263** | 0.0322* | 7.05 × 10−6*** |
| C7H8O3S | 173.0267 | 173.0285 | 10 | 0.962 | 1.88 × 10−5*** | 0.00553** | 0.0154* | 2.71 × 10−4*** | ||
| C4H5N2O11P | 288.9704 | 288.9691 | −4.3 | 0.0421* | 0.0815 | 0.0776 | 0.256 | 5.17 × 10−6*** | ||
* P < 0.05; ** P < 0.01; *** P < 0.001 (Steel–Dwass test)
(*1) The peaks marked “standard” were identified based on the matched m/z values and normalized migration times of the corresponding standard compounds. The peaks marked “predicted” were obtained from Kyoto Encyclopedia of Gene and Genomics (KEGG) database (http://www.genome.jp/kegg/) and HMDB (HMDB; http://www.hmdb.ca/) by matching the composition formula and migration times that were calculated from the isotope distribution and molecular structure, respectively
(*2) HMDB: accession numbers for the Human Metabolome Database
(*3) The peaks were detected as bivalent ions
(*4) The peaks were detected as trivalent ions
(*5) The normalized measured and predicted migration times were 8.16 min and 7.78 min (0.38 min) for ethanolamine, 11.61 min and 10.42 min (1.19 min) for burimamide, 8.75 min and 7.78 min (0.97 min) for choline, 8.16 min and 7.78 min (0.38 min) for ethanolamine, 10.012 min and 10.10 min (0.98 min) for methylimidazoleacetic acid, 8.78 min and 7.84 min (0.94 min) for piperidine, 12.90 min and 13.09 min (0.19 min) for pyrroline hydroxycarboxylic acid, and 7.61 min and 8.26 min (0.65 min) for trimethylamine. The parenthetic values are differences between the predicted and the measured time
(*6) Because we did not find any candidate compounds for the 270.1454 m/z and 12.23 min peaks in the databases, we considered Pro-Gly-Pro, Pro-Pro-Gly or Gly-Pro-Pro as possible candidates ([M + H]+ = 270.3044 m/z in all cases). The predicted migration time of Pro-Gly-Pro and Pro-Pro-Gly was 11.33 min and that of Gly-Pro-Pro was 10.98 min; therefore, Gly-Pro-Pro was excluded as a candidate
Fig. 2Representative dot plots for the relative area of detected metabolites in samples from all groups. The colored dots denote healthy controls (blue), oral (red), breast (pink), pancreatic cancer (green), and periodontal disease (purple). The Y- and X-axes denote the relative peak area (no units) and the group name, respectively. The horizontal, center long bars and the short top/bottom bars indicate the means and standard deviations, respectively. The stars indicates * P < 0.05, ** P < 0.01, and *** P < 0.001 (Steel–Dwass test). Only metabolites showing a significant difference between oral cancer and controls at P < 0.001 and matched with standard library are displayed. The dot plots of the other metabolites are shown in Supplementary Fig. S1
Fig. 3Score plots of principal components (PC) analyses. The subjects in all groups are shown in 3-dimensional (a) and 2-dimensional (b) plots without outliers. The cumulative proportions of the first, second and third PCs (PC1, PC2, and PC3) were 44.8, 57.6 and 67.0%. The same analyses presented for all datasets are shown in Supplementary Fig. S2
Fig. 4ROC curve analysis of the ability of salivary metabolites to discriminate between samples from patients with a oral (n = 69), b breast (n = 30) or c pancreatic cancer (n = 18), and d samples from patients with periodontal diseases (n = 11) and the controls (n = 87). The solid (red) and dotted (blue) ROC curves were obtained using the complete data as a training set and with a tenfold cross-validation, respectively. Using a cut-off probability of 50%, the calculated area under the ROC curves were 0.865 (0.810) for oral, 0.973 (0.881) for breast and 0.993 (0.944) for pancreatic cancer, and 0.969 (0.954) for periodontal diseases. The non-parenthetic values were obtained with the full-training data and parenthetic values by tenfold cross-validation
Fig. 5Heat map of 57 peaks showing significantly different levels (P < 0.05; Steel–Dwass test) between control samples (n = 87) and samples from patients with at least one disease (n = 128). Each row shows data for a specific metabolite and each column shows an individual. The colors correspond to the relative metabolite areas that were converted to Z-scores