| Literature DB >> 30340609 |
Kelsi Perttula1, Courtney Schiffman1, William M B Edmands1, Lauren Petrick1,2, Hasmik Grigoryan1, Xiaoming Cai1, Marc J Gunter3, Alessio Naccarati4, Silvia Polidoro4, Sandrine Dudoit1,5,6, Paolo Vineis4,7, Stephen M Rappaport8.
Abstract
BACKGROUND: Epidemiologists are beginning to employ metabolomics and lipidomics with archived blood from incident cases and controls to discover causes of cancer. Although several such studies have focused on colorectal cancer (CRC), they all followed targeted or semi-targeted designs that limited their ability to find discriminating molecules and pathways related to the causes of CRC.Entities:
Keywords: Biomarkers; Colorectal cancer; EPIC; Lipidomics; Metabolomics; Untargeted
Mesh:
Substances:
Year: 2018 PMID: 30340609 PMCID: PMC6194742 DOI: 10.1186/s12885-018-4894-4
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Studies that investigated associations of colorectal cancer with small molecules in plasma or serum from prospective cohorts
| Cohort | Cases/Controls | Follow-upa (y) | Analytical method | Design | Exposure variable | Likely associations | Ref. |
|---|---|---|---|---|---|---|---|
| WHI-OS | 835/835 | 5.2 | LC-MS | Targeted | Choline and its metabolites | TMAO (+); betaine/choline ratio (−) | [ |
| EPIC [ | 1367/2323 | 3.7 | LC-MS | Targeted | Methionine and choline metabolites | Methionine, choline, and betaine (−) | [ |
| EPIC [ | 95/95 | 14.7 | LC-MS | Targeted | 8 Ultra-long-chain hydroxylated fatty acids | All associations (−) diminished with time to diagnosis (reverse causality) | [ |
| EPIC [ | 1238/1238 | 3.8 | Colorimetry and turbidimetry | Targeted | Triglycerides, cholesterol, and lipoproteins | HDL (−) | [ |
| PLCO | 254/254 | 7.8 | LC-MS and GC-MS | Semi-targeted | 278 Annotated metabolites detected in > 80% of specimens | Glycochenodeoxycholate (+) in women but not men | [ |
WHI-OS Women’s Health Initiative Observational Study, EPIC European Prospective Investigation into Cancer, GC-MS gas chromatography-mass spectrometry, HDL high-density lipoprotein cholesterol, LC-MS liquid chromatography-mass spectrometry, PLCO Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial, TMAO trimethylamine-N-oxide, WHI-OS Women’s Health Initiative-Observational Study, (+) positively associated with CRC, (−) negatively associated with CRC
aMean period of follow-up
Descriptive statistics of human subjects matched by age, study enrollment, gender, and selected covariates
| Total | CRC cases | Controls | ||
|---|---|---|---|---|
| Gender | Male | 51 | 51 | |
| Female | 15 | 15 | ||
| Age at enrollment (y) | median | 56 | 56 | |
| min | 35 | 35 | ||
| max | 65 | 65 | ||
| Years to diagnosis | median | 7.52 | – | |
| min | 0.10 | – | ||
| max | 14.40 | – | ||
| BMI | median | 26.9 | 25.3 | 0.0246 |
| min | 19.7 | 18.7 | ||
| max | 36.7 | 33.6 | ||
| Waist circumference (cm) | median | 97 | 90 | 0.0016 |
| min | 68 | 66 | ||
| max | 115 | 119 | ||
| Diabetes | yes | 2 | 2 | |
| no | 64 | 64 | ||
| Smoking Status | current | 15 | 16 | 0.8423 |
| former | 27 | 23 | ||
| never | 21 | 22 | ||
| NA | 3 | 5 | ||
| Alcohol consumption (ml/day) | median | 23.0 | 22.6 | 0.4038 |
| min | 0.0 | 0.1 | ||
| max | 79.8 | 113 | ||
| Physical Activity (min/day) | high | 13 | 13 | 0.6102 |
| medium | 15 | 20 | ||
| low | 25 | 18 | ||
| none | 10 | 10 | ||
| NA | 3 | 5 | ||
| Total meat consumption (g/day) | median | 75.6 | 67.6 | 0.4640 |
| min | 5.9 | 8.8 | ||
| max | 248.3 | 201.3 |
NA not available
*Nominal p-values calculated from two-sided Wilcoxon rank sum tests (dichotomous variables) or chi-square tests (categorical or continuous variables)
Fig. 1Volcano plot of analyzed features; the nine selected features are highlighted in red with the ID labels from Table 3. An arbitrary p-value = 0.05 threshold line is present for reference. p-values and fold-changes are calculated based on the regression model in Eq. (2)
Untargeted features selected as predictors of case-control status
| Feature ID | Observed | Ret. time (sec) | Fold change | Feature type c | |
|---|---|---|---|---|---|
| 235 | 391.2832 | 596.4 | 0.702 | 0.000261 | R |
| 4250 | 453.3592 | 605.5 | 0.753 | 0.001082 | R |
| 4294 | 467.3744 | 605.6 | 0.741 | 0.000569 | R |
| 5080 | 519.1965 | 595.9 | 1.308 | 0.000432 | C |
| 3207 | 531.1558 | 563.9 | 0.661 | 0.000468 | C |
| 6054 | 551.1781 | 563.9 | 0.404 | 0.001750 | C |
| 839 | 577.2698 | 620.6 | 1.370 | 0.000359 | C |
| 5749 | 882.6393 | 718.2 | 0.880 | 0.000168 | I |
| 14,963 | 907.4806 | 617.6 | 1.373 | 0.000133 | R |
m/z mass-to-charge ratio, p-value from the regression model (Eq. 2), C potentially causal feature, R potentially reactive feature, I indeterminate
aObserved m/z values correspond to singly-charged negative ions
bFeature selected by bootstrap LASSO and by being in the top 1% of features ranked by both the p-values from the case-control regression (Eq. 2) and the random forest variable importance measure
cBased on regression of case-control difference on time to diagnosis (ttd, Fig. 2)
Fig. 2Scatterplots of case-control log fold-change vs. time to diagnosis (ttd) for the 9 selected features. The blue line is the linear regression fit and the gray band represents a 95% confidence intervals, calculated with the ‘lm’ method of the R function ‘geom_smooth’ in the package ‘ggplot2’
Results of tandem MS/MS analyses of features associated with case-control status
| Feature ID | Observed | Ret. time (s) | Prominent MS2 fragments with possible fragment IDs | Putative ID | Species | Molecular formula | Δ ppm |
|---|---|---|---|---|---|---|---|
| 235 | 391.2832 | 596.4 | 347.2961(loss of CO2), 197.0725 ([C10H13O4]-) | possible fatty acid | [M-H]- | C24H40O4 | 4.22 |
| 4250 | 453.3592 | 605.5 | 59.0131, 409.3687 (loss of CO2), 391.3568 (loss of CO2 and H2O), 279.2336 ([C18H31O2]-), 435.3462 (loss of H2O) | possible ULCFA | [M-H]- | C27H50O5 | −2.76 |
| 4294 | 467.3744 | 605.6 | 449.3639 (loss of H2O), 263.2368 ([C18H31O]-), 423.3842 (loss of CO2), 162.8392, 405.3724 (loss of CO2 and H2O) | ULCFA 468 | [M-H]- | C28H52O5 | −1.56 |
| 5080 | 519.1965 | 595.9 | No MS2 spectra | unknown | |||
| 3207 | 531.1558 | 563.9 | 481.3110, 256.2357 | unknown | |||
| 6054 | 551.1781 | 612.3 | 478.2903, 515.1326 (loss of 2 H2O;), 253.2165 ([C16H29O2]-) | possible ceramide | |||
| 839 | 577.2698 | 596.8 | No MS2 spectra | unknown | |||
| 5749 | 882.6393 | 718.2 | 124.0075, 822.6453 (loss of acetate) | unknown | [M + HAc-H]- | ||
| 14,963 | 907.4806 | 617.6 | No MS2 spectra | unknown |