| Literature DB >> 35629952 |
Patricia A Da Cunha1, Diana Nitusca2,3, Luisa Matos Do Canto1, Rency S Varghese1, Habtom W Ressom1, Shawna Willey1,4, Catalin Marian2,3, Bassem R Haddad1.
Abstract
Breast cancer (BC) is one of the leading causes of cancer mortality in women worldwide, and therefore, novel biomarkers for early disease detection are critically needed. We performed herein an untargeted plasma metabolomic profiling of 55 BC patients and 55 healthy controls (HC) using ultra-high performance liquid chromatography coupled with quadrupole time-of-flight mass spectrometry (UHPLC/Q-TOF-MS). Pre-processed data revealed 2494 ions in total. Data matrices' paired t-tests revealed 792 ions (both positive and negative) which presented statistically significant changes (FDR < 0.05) in intensity levels between cases versus controls. Metabolites identified with putative names via MetaboQuest using MS/MS and mass-based approaches included amino acid esters (i.e., N-stearoyl tryptophan, L-arginine ethyl ester), dipeptides (ile-ser, met-his), nitrogenous bases (i.e., uracil derivatives), lipid metabolism-derived molecules (caproleic acid), and exogenous compounds from plants, drugs, or dietary supplements. LASSO regression selected 16 metabolites after several variables (TNM Stage, Grade, smoking status, menopausal status, and race) were adjusted. A predictive conditional logistic regression model on the 16 LASSO selected ions provided a high diagnostic performance with an area-under-the-curve (AUC) value of 0.9729 (95% CI 0.96-0.98) on all 55 samples. This study proves that BC possesses a specific metabolic signature that could be exploited as a novel metabolomics-based approach for BC detection and characterization. Future studies of large-scale cohorts are needed to validate these findings.Entities:
Keywords: UHPLC/Q-TOF-MS; biomarkers; breast cancer; diagnostic; metabolomics
Year: 2022 PMID: 35629952 PMCID: PMC9147455 DOI: 10.3390/metabo12050447
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Characteristics of the BC patients (N = 55) included in our study.
| Characteristics | N = 55 | % * |
|---|---|---|
|
| 53.35 (±12.26) | |
|
| ||
| Pre-menopause | 18 | 32.73% |
| Peri-menopause | 2 | 3.64% |
| Post-menopause | 35 | 63.64% |
|
| ||
| Asian | 5 | 9.09% |
| African American | 15 | 27.27% |
| Caucasian | 33 | 60.00% |
| Hispanic | 2 | 3.64% |
|
| ||
| Current smoker | 7 | 12.73% |
| Former smoker | 5 | 9.09% |
| Never smoked | 43 | 78.18% |
|
| ||
| Ductal carcinoma in situ (DCIS) | 10 | 18.18% |
| Invasive ductal carcinoma (IDC) | 35 | 63.64% |
| Invasive lobular carcinoma (ILC) | 5 | 9.09% |
| Mixed | 2 | 3.63% |
|
| ||
| 0 | 9 | 16.36% |
| I | 21 | 38.18% |
| II | 15 | 27.27% |
| III | 4 | 7.27% |
|
| ||
| Low | 7 | 12.72% |
| Intermediate | 21 | 38.18% |
| High | 25 | 45.45% |
|
| ||
| Yes | 14 | 25.45% |
| No | 40 | 72.72% |
|
| ||
| Positive | 45 | 81.82% |
| Negative | 9 | 16.36% |
|
| ||
| Positive | 33 | 60.00% |
| Negative | 21 | 38.18% |
|
| ||
| Positive | 11 | 20.00% |
| Negative | 39 | 70.90% |
|
| ||
| Bilateral mastectomy (BM) | 18 | 32.73% |
| Preventive mastectomy (PM) | 14 | 25.45% |
| Mastectomy | 22 | 40.00% |
| Endoscopy-assisted breast surgery (EABS) | 1 | 1.82% |
|
| ||
| Yes | 25 | 45.45% |
| No | 30 | 54.55% |
* Might not add to 100% due to missing data.
The total number of ions detected and ions selected by statistical analysis.
| Ion Mode | Number of Ions Detected | Number of Ions with Adjusted |
|---|---|---|
| Positive (POS) | 1930 | 603 |
| Negative (NEG) | 564 | 189 |
Figure 1PCA 2D scores plot showing the first two principal components depicting the variance (shown in parenthesis) among the 55 samples.
Figure 2Important features selected by volcano plot with false discovery rate (FDR) < 0.05 and |FC| > 2 from univariate analysis. Red-colored dots denote up-regulated, blue-colored dots denote down-regulated, and grey-colored dots denote metabolites with non-significant change.
Metabolites selected with putative identities.
| Metabolite ID | Formula | Exact Mass | Ion Mode | Retention Time (RT) | BC vs. HC | Fold Change (FC) | Adjusted | |
|---|---|---|---|---|---|---|---|---|
|
| C10H18O2 | 171.139 | 170.131 (M + H) | + | 314.469 | ↓ | −8.62 | 1.00 × 10−24 |
|
| C8H18N4O2 | 235.176 | 202.143 (M + CH3OH + H) | + | 150.534 | ↑ | 98.43 | 1.21 × 10−22 |
|
| C29H46N2O3 | 236.184 | 470.354 (M + 2H) | + | 135.323 | ↑ | 50.15 | 5.23 × 10−29 |
|
| C9H18N2O4 | 236.184 | 426.386 (M + 2Na) | + | 135.323 | ↑ | 50.15 | 5.23 × 10−29 |
|
| C15H20ClN3O2 | 310.129 | 309.124 (M + H) | + | 55.740 | ↓ | −17.82 | 6.85 × 10−24 |
|
| C16H18N4O6S | 395.103 | 394.095 (M + H) | + | 361.000 | ↑ | 2.06 | 1.64 × 10−11 |
|
| C15H10N2O7 | 329.046 | 330.049 (M − H) | − | 323.114 | ↑ | 5.84 | 2.72 × 10−13 |
* These metabolites have been identified using matched MS/MS identifications (Supplementary Figures S1–S3). The other identifications were done via mass-based putative identifications from public databases.
Multivariate analysis (LASSO & SVM-RFE) on training, test, and all samples.
| Metabolite | Train Samples (40 vs. 40) | Test Samples (15 vs. 15) | All Samples (55 vs. 55) | ||||
|---|---|---|---|---|---|---|---|
| Ion Mode | AUC | 95% CI AUC | AUC | 95% CI AUC | AUC | 95% CI AUC | |
| 171.139 | + | 0.971 | 0.922–1 | 0.916 | 0.787–1 | 0.969 | 0.931–1 |
| 203.107 | + | 0.925 | 0.864–0.985 | 0.809 | 0.620–0.997 | 0.904 | 0.843–0.964 |
| 221.118 | + | 0.970 | 0.939–1 | 0.844 | 0.666–1 | 0.940 | 0.886–0.992 |
| 223.064 | + | 0.884 | 0.801–0.967 | 0.920 | 0.788–1 | 0.911 | 0.850–0.971 |
| 235.176 | + | 0.968 | 0.925–1 | 1.000 | 1 | 0.976 | 0.944–1 |
| 236.184 | + | 0.974 | 0.941–1 | 1.000 | 1 | 0.980 | 0.954–1 |
| 256.942 | + | 0.746 | 0.638–0.853 | 0.987 | 0.957–1 | 0.681 | 0.581–0.780 |
| 302.122 | + | 0.940 | 0.881–0.998 | 0.893 | 0.774–1 | 0.929 | 0.877–0.981 |
| 306.977 | + | 0.913 | 0.842–0.983 | 0.711 | 0.501–0.920 | 0.875 | 0.803–0.947 |
| 310.129 | + | 0.951 | 0.883–1 | 0.858 | 0.720–0.995 | 0.937 | 0.882–0.990 |
| 395.103 | + | 0.855 | 0.764–0.945 | 0.769 | 0.572–0.965 | 0.842 | 0.761–0.922 |
| 451.165 | + | 0.898 | 0.831–0.963 | 0.764 | 0.580–0.948 | 0.876 | 0.810–0.940 |
| 223.028 | − | 0.897 | 0.817–0.975 | 0.907 | 0.772–1 | 0.912 | 0.851–0.972 |
| 317.948 | − | 0.949 | 0.893–1 | 0.667 | 0.451–0.881 | 0.889 | 0.819–0.959 |
| 329.046 | − | 0.988 | 0.971–1 | 0.836 | 0.657–1 | 0.953 | 0.907–0.998 |
Figure 3Receiver operating characteristics (ROC) curve with the area under the curve (AUC) showing the prediction performance of the 16 selected ions for all 55 samples.
Figure 4Receiver operating characteristics (ROC) curve with the area under the curve (AUC) showing the prediction performance of the 16 selected ions for the 15 test samples.
Figure 5Dot plots of four selected metabolites: L-Arginine (ester), m/z = 235.176 (A); N-stearoyl tryptophan, m/z = 236.184 (B); Caproleic acid, m/z = 171.139 (C); 5-[(4-Nitrobenzoyl)amino]-isophthalic acid, m/z = 329.046 (D).
Figure 6Summarized steps of the Materials and Methods sections.