| Literature DB >> 22792336 |
Shin Nishiumi1, Takashi Kobayashi, Atsuki Ikeda, Tomoo Yoshie, Megumi Kibi, Yoshihiro Izumi, Tatsuya Okuno, Nobuhide Hayashi, Seiji Kawano, Tadaomi Takenawa, Takeshi Azuma, Masaru Yoshida.
Abstract
BACKGROUND: To improve the quality of life of colorectal cancer patients, it is important to establish new screening methods for early diagnosis of colorectal cancer. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2012 PMID: 22792336 PMCID: PMC3394708 DOI: 10.1371/journal.pone.0040459
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Subject information for the training and validation sets.
| Training set | Validation set | ||||||
| Colorectal cancer patients | Healthyvolunteers | Significance | Colorectal cancer patients | Healthyvolunteers | Significance | ||
|
| 60 | 60 | 59 | 63 | |||
|
| 39 | 39 | 30 | 32 | |||
|
| 21 | 21 | 29 | 31 | |||
|
|
| ||||||
|
| 67.7 | 64.5 | N.S. | 64.8 | 62.8 | N.S. | |
|
| 70 | 68 | 66 | 63 | |||
|
| 36–88 | 39–88 | 31–84 | 47–73 | |||
|
|
| 21.9 | 22.1 | N.S. | 22.5 | 22.2 | N.S. |
|
|
| 12 | − | 15 | − | ||
|
| 12 | − | 11 | − | |||
|
| 12 | − | 3 | − | |||
|
| 12 | − | 11 | − | |||
|
| 12 | − | 19 | − | |||
|
|
| 9 | − | 8 | − | ||
|
| 9 | − | 4 | − | |||
|
| 2 | − | 3 | − | |||
|
| 14 | − | 18 | − | |||
|
| 5 | − | 9 | − | |||
|
| 21 | − | 17 | − | |||
The differences in the mean age and BMI value (%) between the colorectal cancer patients and healthy volunteers were evaluated using the Mann-Whitney U test. N.S., not significant.
Figure 1Comparison of serum metabolite levels between colorectal cancer patients and healthy volunteers.
The mean fold changes in the levels of 107 metabolites observed in a comparison between colorectal cancer patients (N = 60) and healthy volunteers (N = 60) (training set) are shown in Figure 1. In GC/MS analysis, multiple peaks are sometimes detected for a particular metabolite due to TMS-derivatization, isomeric form, etc. In such cases, the peak that most reflected the level of the metabolite was adopted for the subsequent evaluation. In addition, each metabolite had the term ‘_1’, ‘_2’, or ‘(-TMS)’ added to the end of its name, as described in a previous report [7]. This figure does not include background metabolites or minor peak-derived metabolites.
The sensitivity and specificity of the metabolites whose levels differed significantly between the colorectal cancer patients and healthy volunteers.
| Training set | Validation set | |||||||
| Compounds | Sensitivity | Specificity | Accuracy | AUC (95% CI) | Cut-off | Sensitivity | Specificity | Accuracy |
| Pyruvate | 75.0 | 61.7 | 68.3 | 0.6888 (0.5866–0.7755) | 0.31200 | 72.9 | 69.8 | 71.3 |
| 2-hydroxy-butyrate | 56.7 | 83.3 | 70.0 | 0.7418 (0.6448–0.8199) | 0.00965 | 55.9 | 69.8 | 63.1 |
| Phosphate | 60.0 | 68.3 | 64.2 | 0.6303 (0.5256–0.7240) | 0.45960 | 49.2 | 61.9 | 55.7 |
| Isoleucine | 76.7 | 53.3 | 65.0 | 0.6719 (0.5695–0.7603) | 0.02162 | 69.5 | 52.4 | 60.7 |
| Nonanoic acid(C9) | 86.7 | 46.7 | 66.7 | 0.6568 (0.5515–0.7489) | 0.00483 | 25.4 | 61.9 | 44.3 |
| β-alanine | 75.0 | 46.7 | 60.8 | 0.6153 (0.5106–0.7102) | 0.00192 | 74.6 | 38.1 | 55.7 |
| meso-erythritol | 73.3 | 56.7 | 65.0 | 0.6571 (0.5537–0.7477) | 0.01524 | 66.1 | 57.1 | 61.5 |
| Aspartic acid | 65.0 | 80.0 | 72.5 | 0.7001 (0.5957–0.7870) | 0.03750 | 69.5 | 71.4 | 70.5 |
| Pyroglutamic acid | 61.7 | 76.7 | 69.2 | 0.7333 (0.6348–0.8127) | 0.53590 | 59.3 | 79.4 | 69.7 |
| Creatinine | 71.7 | 63.3 | 67.5 | 0.6517 (0.5460–0.7442) | 0.00094 | 44.1 | 54.0 | 49.2 |
| Glutamic acid | 76.7 | 66.7 | 71.7 | 0.7547 (0.6556–0.8326) | 0.10640 | 78.0 | 65.1 | 71.3 |
| p-hydroxybenzoic acid | 90.0 | 65.0 | 77.5 | 0.7660 (0.6630–0.8446) | 0.00084 | 81.4 | 57.1 | 68.9 |
| Arabinose | 76.7 | 70.0 | 73.3 | 0.7860 (0.6922–0.8573) | 0.00206 | 69.5 | 68.3 | 68.9 |
| Ribulose | 51.7 | 73.3 | 62.5 | 0.6093 (0.5041–0.7055) | 0.00253 | 49.2 | 36.5 | 42.6 |
| Asparagine | 66.7 | 55.0 | 60.8 | 0.6054 (0.5001–0.7016) | 0.01345 | 61.0 | 50.8 | 55.7 |
| Xylitol | 56.7 | 78.3 | 67.5 | 0.6749 (0.5716–0.7642) | 0.00252 | 61.0 | 69.8 | 65.6 |
| O-phosphoethanolamine | 76.7 | 51.7 | 64.2 | 0.6276 (0.5224–0.7217) | 0.00209 | 33.9 | 65.1 | 50.0 |
| Ornithine | 38.3 | 86.7 | 62.5 | 0.6450 (0.5413–0.7367) | 0.04155 | 42.4 | 85.7 | 64.8 |
| Citrulline | 41.7 | 85.0 | 63.3 | 0.6058 (0.4967–0.7054) | 0.00459 | 49.2 | 76.2 | 63.1 |
| Glucuronate_1 | 71.7 | 58.3 | 65.0 | 0.6575 (0.5532–0.7485) | 0.00470 | 47.5 | 84.1 | 66.4 |
| Glucosamine_2 | 70.0 | 70.0 | 70.0 | 0.6931 (0.5868–0.7817) | 0.00308 | 80.0 | 69.8 | 74.6 |
| Palmitoleate | 40.0 | 85.0 | 62.5 | 0.6071 (0.5022–0.7032) | 0.00420 | 35.6 | 81.0 | 59.0 |
| Inositol | 55.0 | 70.0 | 62.5 | 0.6260 (0.5209–0.7196) | 0.14210 | 44.1 | 73.0 | 59.0 |
| Kynurenine | 70.0 | 80.0 | 75.0 | 0.8018 (0.7126–0.8686) | 0.00109 | 67.8 | 73.0 | 70.5 |
| Cystamine | 55.0 | 90.0 | 72.5 | 0.7497 0.6522–0.8267) | 0.03885 | 49.2 | 79.4 | 64.8 |
| Cystine | 46.7 | 90.0 | 68.3 | 0.7192 (0.6196–0.8011) | 0.04477 | 55.9 | 93.7 | 75.4 |
| Lactitol | 48.3 | 78.3 | 63.3 | 0.6139 (0.5088–0.7093) | 0.00036 | 40.7 | 77.8 | 59.8 |
The sensitivity, specificity, accuracy, AUC (95% CI), and cut-off values were calculated from the training set data by ROC analysis, as shown in Figure S1. When the validation set was used, sensitivity, specificity, and accuracy were evaluated using the cut-off value obtained from the training set.
Biomarker candidates subjected to multivariate analysis using the stepwise variable selection method.
| Stage 0–4 | Stage 0–2 | Stage 3–4 | |||||
| Compounds | Fold induction | p value | Fold induction | p value | Fold induction | p value | Chemical class |
|
| |||||||
| Cystamine | 1.43 | <0.0001 | 1.57 | <0.0001 | 1.21 | 0.076 | Aliphatic and aryl amine |
| Cystine | 1.55 | <0.0001 | 1.81 | <0.0001 | 1.16 | 0.23 | Amino acid |
| Aspartic acid | 1.59 | 0.0002 | 1.52 | 0.0032 | 1.69 | 0.0011 | Amino acid |
| Arabinose | 1.65 | <0.0001 | 1.61 | <0.0001 | 1.71 | <0.0001 | Carbohydrate |
| p-hydroxybenzoic acid | 1.77 | <0.0001 | 1.89 | <0.0001 | 1.58 | 0.0004 | Aromatic acid |
| Glutamic acid | 1.82 | <0.0001 | 1.54 | <0.0001 | 2.24 | <0.0001 | Amino acid |
| 2-hydroxy-butyrate | 1.83 | <0.0001 | 1.59 | <0.0001 | 2.18 | 0.00050 | Hydroxy acid |
| Kynurenine | 1.83 | <0.0001 | 1.74 | <0.0001 | 1.96 | <0.0001 | Amino acid |
| meso-erythritol | 2.83 | 0.0030 | 3.01 | 0.0065 | 2.57 | 0.042 | Alcohol and polyol |
| Lactitol | 7.44 | 0.0316 | 1.16 | 0.53 | 16.9 | 0.0013 | Alcohol and polyol |
The 7 metabolites subjected to multivariate analysis using the stepwise variable selection method are listed in Table 3. Among these metabolites, arabinose, meso-erythritol, and lactitol are excluded, because they are frequently consumed in the diet. The concentration of each metabolite in the colorectal cancer patients with stage 0–4, stage 0–2, or stage 3–4 disease at the training set was compared with that detected in the healthy volunteers, and the fold induction was calculated. P values were evaluated using the Mann-Whitney U test, and p values of less than 0.05 were considered to indicate a significant difference.
Biomarkers for detecting colorectal cancer selected by the multiple logistic regression model.
| Biomarkers | Coefficient | Standard error | p value | Lower 95% CI | Upper 95% CI |
| (Intercept) | −8.32 | 1.539 | <0.0001 | −11.71 | −5.621 |
| 2-hydroxy-butyrate | 286.59 | 71.90 | <0.0001 | 155.0 | 440.1 |
| Aspartic acid | 33.87 | 14.29 | 0.0178 | 7.390 | 63.85 |
| Kynurenine | 1634.96 | 569.3 | 0.0041 | 559.1 | 2.830E+03 |
| Cystamine | 78.78 | 26.82 | 0.0033 | 31.53 | 137.3 |
The 4 metabolites selected by multiple logistic regression analysis using the stepwise variable selection method. The results of the analysis are shown in Table 4. The 95% confidence interval (95% CI) for the AUC (0.9097) obtained from ROC analysis ranged from 0.8438 to 0.9495.
The sensitivity and specificity of tumor biomarkers and the prediction model.
| Training set | |||||||||
| CEA | CA19-9 | Prediction model | |||||||
| Stage 0–4 | Stage 0–2 | Stage 3–4 | Stage 0–4 | Stage 0–2 | Stage 3–4 | Stage 0–4 | Stage 0–2 | Stage 3–4 | |
|
| 35.0% | 30.6% | 37.5% | 16.7% | 5.6% | 29.2% | 85.0% | 83.3% | 87.5% |
|
| 96.7% | − | − | 100% | − | − | 85.0% | − | − |
|
| 65.8% | − | − | 58.3% | − | − | 85.0% | − | − |
|
| |||||||||
|
|
|
| |||||||
|
|
|
|
|
|
|
|
|
| |
|
| 33.9% | 6.9% | 60.0% | 13.6% | 0% | 26.7% | 83.1% | 82.8% | 83.3% |
|
| 96.8% | − | − | 100% | − | − | 81.0% | − | − |
|
| 66.4% | − | − | 58.2% | − | − | 82.0% | − | − |
The sensitivity, specificity, and accuracy of CEA, CA19-9, and the prediction model were calculated using the cut-off value obtained from the ROC analysis. The cut-off values of CEA, CA19-9, and the prediction model were 5 ng/ml, 37 U/ml, and 0.4945, respectively.
Figure 2ROC curve of the established prediction model.
The solid curve is the ROC curve for the prediction model established on the basis of the training set. The AUC and cut-off values were 0.9097 {95% confidence interval (95% CI): 0.8438–0.9495} and 0.4945, respectively. The sensitivity, specificity, and accuracy of this model are summarized in Table 5.