| Literature DB >> 35804820 |
Alexandra Sala1,2, James M Cameron2, Cerys A Jenkins3, Hugh Barr4, Loren Christie1,2, Justin J A Conn2, Thomas R Jeffry Evans5, Dean A Harris6, David S Palmer1,2, Christopher Rinaldi7, Ashton G Theakstone7, Matthew J Baker2.
Abstract
Pancreatic cancer claims over 460,000 victims per year. The carbohydrate antigen (CA) 19-9 test is the blood test used for pancreatic cancer's detection; however, its levels can be raised in symptomatic patients with other non-malignant diseases, or with other tumors in the surrounding area. Attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy has demonstrated exceptional potential in cancer diagnostics, and its clinical implementation could represent a significant step towards early detection. This proof-of-concept study, investigating the use of ATR-FTIR spectroscopy on dried blood serum, focused on the discrimination of both cancer versus healthy control samples, and cancer versus symptomatic non-malignant control samples, as a novel liquid biopsy approach for pancreatic cancer diagnosis. Machine learning algorithms were applied, achieving results of up to 92% sensitivity and 88% specificity when discriminating between cancers (n = 100) and healthy controls (n = 100). An area under the curve (AUC) of 0.95 was obtained through receiver operating characteristic (ROC) analysis. Balanced sensitivity and specificity over 75%, with an AUC of 0.83, were achieved with cancers (n = 35) versus symptomatic controls (n = 35). Herein, we present these results as demonstration that our liquid biopsy approach could become a simple, minimally invasive, and reliable diagnostic test for pancreatic cancer detection.Entities:
Keywords: ATR-FTIR; PDAC; adenocarcinoma; infrared spectroscopy; pancreatic cancer; serum
Year: 2022 PMID: 35804820 PMCID: PMC9264892 DOI: 10.3390/cancers14133048
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Samples and patient details of the cohorts analyzed. (F, females; M, males).
| Cohort | Samples | Average Age | Sex |
|---|---|---|---|
| A | 100 cancers | 63 | F = 42; M = 58 |
| 100 healthy controls | 65 | F = 43; M = 57 | |
| B | 35 cancers † | 63 | F = 46; M = 54 |
| 35 symptomatic controls | 63 | F = 46; M = 54 | |
| C | 100 cancers | 63 | F = 42; M = 58 |
| 35 symptomatic controls | 63 | F = 46; M = 54 |
† The 35 cancer samples in cohort B were selected from the 100 cancer samples in cohort A by age- and sex-matching the 35 symptomatic controls included in cohort B.
Figure 1Receiver operating characteristic (ROC) curve analysis calculated from a partial least squares discriminant analysis (PLS-DA) model performed on Cohort A. The colored box indicates the region with sensitivity and specificity greater than 60%. The orange dots indicate the points within that region that have maximum sensitivity (A), maximum specificity (B), and balanced sensitivity and specificity (C) (Sens, sensitivity; Spec, specificity; AUC, area under the ROC curve).
Statistical performances of random forest (RF), partial least squares discriminant analysis (PLS-DA), support-vector machine (SVM), and receiver operating characteristic (ROC) analysis performed on Cohort A (AUC, area under the ROC curve; SD, standard deviation).
| Cohort A | RF | PLS-DA | SVM | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Sensitivity ± SD (%) | 84.4 | ± | 7.2 | 91.7 | ± | 5.4 | 89.3 | ± | 6.0 |
| Specificity ± SD (%) | 86.3 | ± | 7.1 | 87.7 | ± | 4.8 | 87.8 | ± | 6.2 |
| Accuracy ± SD (%) | 85.4 | ± | 4.5 | 89.7 | ± | 2.9 | 88.5 | ± | 4.3 |
| ROC (AUC) | 0.905 | 0.954 | 0.946 | ||||||
Figure 2Gini importance plot of random forest (RF) analysis performed on Cohort A; C (black) represents the pancreatic cancer spectrum and NC (red) represents the healthy control spectrum.
Figure 3Permutation tests’ output plot; null (red) and observed (blue) distribution classification rated for Cohort A with a partial least squares discriminant analysis (PLS-DA) classification model after 1000 bootstraps, using pre-processed spectra.
Figure 4Receiver operating characteristic (ROC) curve analysis calculated from a partial least squares discriminant analysis (PLS-DA) model performed on Cohort B. Point A shows the point of maximum sensitivity and point E shows the point of maximum specificity when the model is tuned to give a specificity or sensitivity of 45%, respectively, as highlighted by the light blue square. Point B shows the point of maximum sensitivity and point D shows the point of maximum specificity when the model is tuned to give a specificity or sensitivity of 60%, respectively, as highlighted by the darker light blue square. Point C represents the balance point between sensitivity and specificity, whilst remaining in the target regions (Sens, sensitivity; Spec, specificity; AUC, area under the ROC curve).
Statistical performances of random forest (RF), partial least squares discriminant analysis (PLS-DA), support-vector machine (SVM), and receiver operating characteristic (ROC) analysis performed on Cohort B (AUC, area under the ROC curve; SD, standard deviation).
| Cohort B | RF | PLS-DA | SVM | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Sensitivity ± SD (%) | 72.0 | ± | 13.1 | 69.8 | ± | 14.3 | 71.6 | ± | 15.9 | |
| Specificity ± SD (%) | 72.7 | ± | 16.0 | 85.1 | ± | 12.9 | 83.3 | ± | 14.9 | |
| Accuracy ± SD (%) | 72.3 | ± | 8.9 | 76.5 | ± | 7.8 | 77.5 | ± | 10.1 | |
| ROC (AUC) | 0.809 | 0.829 | 0.793 | |||||||
Figure 5Gini importance plot of RF analysis performed on Cohort B; C (black) represents the pancreatic cancer spectrum and SC (red) represents the symptomatic control spectrum.
Figure 6Permutation tests’ output plot; null (red) and observed (blue) distribution classification rated for Cohort B with a partial least squares discriminant analysis (PLS-DA) classification model after 1000 bootstraps, using pre-processed spectra.