| Literature DB >> 26573008 |
David A Gaul1,2, Roman Mezencev2, Tran Q Long3, Christina M Jones1, Benedict B Benigno4, Alexander Gray3, Facundo M Fernández1,5, John F McDonald2,4,5.
Abstract
High performance mass spectrometry was employed to interrogate the serum metabolome of early-stage ovarian cancer (OC) patients and age-matched control women. The resulting spectral features were used to establish a linear support vector machine (SVM) model of sixteen diagnostic metabolites that are able to identify early-stage OC with 100% accuracy in our patient cohort. The results provide evidence for the importance of lipid and fatty acid metabolism in OC and serve as the foundation of a clinically significant diagnostic test.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26573008 PMCID: PMC4647115 DOI: 10.1038/srep16351
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Recursive feature elimination (RFE) selects 16 metabolic features that distinguish early stage serous epithelial ovarian cancer (EOC) serum samples with high accuracy.
(a) Evolution of accuracy using support vector machine (SVM)-RFE feature selection for metabolic classifiers. The initial 255 metabolic features identified by ultra-performance liquid chromatography-mass spectrometry (UPLC-MS) provided only moderate predictive accuracy (accuracy 62%; specificity 57%; sensitivity 67%) in distinguishing EOC from control samples. SVM-RFE selected a minimum of 16 metabolic features that provided 100% accuracy (100% sensitivity; 100% specificity) in distinguishing between EOC and control samples. (b) Visualization of the optimal separation between EOC and control samples by the SVM model. The X-axis is the optimal weight vector of the SVM model; the Y-axis is the age of donors (EOC patients or normal control women at the time of sample collection). The vertical line is the projection of the separating hyperplane generated by the SVM model. The discriminant linear SVM model was evaluated by leave-one-out cross-validation (LOOCV).
Chemical identification of 16 features that distinguish early-staged ovarian cancer sera from the sera of normal healthy controls with high accuracy (m/z = mass-to-charge ratio, min = minutes, ppm = part per million, CAS = chemical abstract service, USPTO = United States Patent and Trademark Office, HMDB = the human metabolome database, SVM = support vector machine, NA = not available).
| Feature | Average m/ | Average Retention Time (min) | Ion Type | Ion Theoretical m/ | Mass Error (ppm) | Neutral Elemental Formula | Tentative Metabolite Identification [Database: #] | SVM Model Weight |
|---|---|---|---|---|---|---|---|---|
| 279 | 552.2327 | 0.70 | [M-H]2− | 552.2335 | −1.4 | C42H80N3O30 | [NA] | 1.1521 |
| 552.2342 | −2.7 | C43H76N7O26 | ||||||
| 552.2398 | −12.9 | C42H76N9O25 | ||||||
| 571 | 329.1733 | 4.81 | [M-CHO-H]− | 329.1753 | −6.1 | C21H28O5 | cortisone [CAS: 53-06-5] | 1.0810 |
| 286 | 597.3029 | 10.89 | [M-H]− | 597.3040 | −1.8 | C27H51O12P | lysophatidylinositol(18:1) [CAS: 1246298-13-4] | 0.4762 |
| 683 | 261.0727 | 1.01 | [M-H]− | 261.0723 | 1.5 | C9H13N2O7 | aspartyl-glutamic acid [CAS: 6157-06-8] | 0.3967 |
| 226 | 539.4301 | 14.86 | [M-H]− | 539.4312 | −2.0 | C32H60O6 | 16-(6-butoxy-3-hydroxy-4,5-dimethylcyclohex-1-en-1-yl)-6,10-dihydroxy-2,6,10,14-tetramethyl hexadecanoic acid [USPTO: document #20100086960] | 0.1748 |
| 45 | 536.5042 | 18.56 | [M-H]− | 536.5043 | −0.2 | C34H67NO3 | ceramide(d18:1/16:0) [CAS: 24696-26-2] | 0.1076 |
| 64 | 365.3413 | 16.97 | [M-H]− | 365.3425 | 1.2 | C24H46O2 | [NA] | 0.0847 |
| 28 | 524.2778 | 12.79 | [M-H]− | 524.2777 | 0.2 | C27H44NO7P | lysophosphatidylethanolamine(22:6) [PUBCHEM: 52925132] | 0.0445 |
| 105 | 195.1016 | 1.01 | [M+Na-2H]− | 195.0997 | 9.7 | C9H18O3 | 2-hydroxyl nonanoic acid [CAS: 617-31-2] | 0.0195 |
| 14 | 307.2633 | 14.80 | [M+Na-2H]− | 307.2613 | 6.5 | C18H36O2 | iso-1,2-octadecanediol [PUBCHEM: 42607317] | −0.0200 |
| 79 | 245.1378 | 0.99 | [M-H]− | 245.1389 | −4.5 | C12H22O5 | 3-hydroxyl dodecanedioic acid [CAS: 34574-69-1] | −0.0205 |
| 80 | 883.5358 | 15.40 | [M-H]− | 883.5337 | 2.4 | C47H81O13P | phosphatidylinositol(20:4/18:1) [HMDB: 09901] | −0.0308 |
| 123 | 467.3727 | 14.61 | [M-H]− | 467.3737 | −2.1 | C28H52O5 | 7,9,13-trihydroxyoctacosa-16,22-dienoic acid [USPTO: document #20120136057] | −0.0803 |
| 231 | 429.2997 | 11.49 | [M-H]− | 429.3010 | 1.3 | C27H42O4 | [NA] | −0.3420 |
| 261 | 451.2275 | 1.58 | [M-H]2− | 451.2242 | 7.3 | C40H68N6O17 | [NA] | −0.4834 |
| 451.2249 | 5.8 | C41H64N10O13 | ||||||
| 451.2305 | −6.6 | C40H64N12O12 | ||||||
| 620 | 129.0909 | 1.14 | [M-H]− | 129.0916 | −2.1 | C7H14O2 | [NA] | −0.9938 |
aAll possible metabolite isomers are not listed. Where indicated, the species in the Table are those for which MS/MS data was available in the literature.
Figure 2Fold-change of average peak areas of each discriminant feature.
Positive values indicate higher levels of metabolite observed on average for EOC patients compared to control patients, while negative values indicate inverse relationship (*p < 0.05; **p < 0.10, Mann Whitney U test).