| Literature DB >> 35406529 |
Ulf Gyllensten1,2, Julia Hedlund-Lindberg1, Johanna Svensson3, Johanna Manninen3, Torbjörn Öst3, Jon Ramsell3, Matilda Åslin3, Emma Ivansson1, Marta Lomnytska4, Maria Lycke5, Tomas Axelsson3, Ulrika Liljedahl3, Jessica Nordlund3, Per-Henrik Edqvist1, Tobias Sjöblom1, Mathias Uhlén6, Karin Stålberg4, Karin Sundfeldt5, Mikael Åberg3, Stefan Enroth1,7.
Abstract
BACKGROUND: Ovarian cancer is the eighth most common cancer among women and has a 5-year survival of only 30-50%. The survival is close to 90% for patients in stage I but only 20% for patients in stage IV. The presently available biomarkers have insufficient sensitivity and specificity for early detection and there is an urgent need to identify novel biomarkers.Entities:
Keywords: early detection; ovarian cancer; protein biomarkers
Year: 2022 PMID: 35406529 PMCID: PMC8997113 DOI: 10.3390/cancers14071757
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Cohort characteristics.
| Cohort | All | Benign | Ovarian Cancer | ||||
|---|---|---|---|---|---|---|---|
| I | II | III | IV | ||||
| No. of samples | Discovery | 111 | 37 | 10 | 9 | 35 | 20 |
| Replication | 37 | 14 | 4 | 0 | 10 | 9 | |
| Age at diag. a | Discovery | 60.1 (13.2) | 56.2 (15.1) | 62.0 (12.2) | 66.4 (6.2) | 60.6 (12.5) | 62.4 (12.4) |
| Replication | 57.4 (14.4) | 49.9 (14.9) | 70.4 (15.6) | 60.2 (6.8) | 60.3 (14.9) | ||
| Age diff | 0.21 | 0.11 | 0.36 | 0.52 | 0.83 | ||
| CA125 (U/mL) c | Discovery | 263 | 41.5 | 67 | 240 | 594 | 1358 |
| Replication | 189 | 28.5 | 189 | 640 | 340 | ||
| CA125 diff | 0.64 | 0.46 | 0.48 | 0.76 | 0.31 | ||
a Reported as mean (standard deviation) age in the group. b Two-sided Wilcoxon ranked test of difference between the two cohorts. c Reported as median (median absolute deviation) CA125 in the group.
Figure 1Univariate results in the discovery cohort. (A) Mean differences in NPX (controls–cases) are shown on the x-axis and p-values (−log10, two-sided Wilcoxon ranked test) on the y-axis. Proteins plotted in red were significantly different in the discovery data (q < 0.05, Bonferroni adjusted, and a foldchange of at least 1 NPX). Light grey dashed lines represent cut-offs for p-value and foldchange. Proteins marked with a ‘+’ were also found to be significant (q < 0.05, Bonferroni adjusted, foldchange of at least 1 NPX) in the replication data. The five proteins with the lowest p-values in the discovery data are labelled. (B) Beeswarm plots of individual protein measurements for WFDC2 in the discovery cohort. The top and bottom of the overlayed boxplots represents the 25th and 75th percentile and the band inside the box the median value. Outliers were omitted from the boxplots. The samples are divided by diagnoses: B—benign (coloured grey), I, II, III and IV—ovarian cancer FIGO stage (coloured yellow to red). (C) As (B), but for KRT19. (D) As (B), but for FOLR1.
Figure 2Results from multivariate modelling using the univariate significant biomarkers. (A) ROC curve from validation (solid line) and training (dotted line) for separating benign controls from ovarian cancer stages I–IV. The combined performance of clinically measured MUCIN-16 (CA125) in the same cohorts is shown in dashed blue and the performance of the previously developed 11-biomarker panel [15] is shown as a long-dashed black line. In each ROC curve, the point estimate of sensitivity and specificity at a cut-off (‘best-point’ for multivariate models, 35 U/mL for clinical MUCIN-16) is indicated by a red dot. (B,C) As (A), but for separating benign controls from stages I–II (B) and stages III–IV (C). (D) As (A), but for separating ovarian cancer stages I–II from stages III–IV. (E) Graphical illustration of the proteins (rows) included in each of the models in A–D (columns). A green box indicates inclusion.
Results of multivariate modelling based on proteins with nominal univariate significance.
| AUC a | Sens b | Spec b | ||
|---|---|---|---|---|
| B vs. I–IV | Discovery | 0.98 (0.95–1.00) | 0.91 (0.83–0.98) | 0.96 (0.87–1.00) |
| Replication | 1.00 (0.99–1.00) | 0.75 (0.55–0.95) | 1.00 (1.00–1.00) | |
| 0.17 | 0.11 | 1.00 | ||
| B vs. I–II | Discovery | 0.96 (0.90–1.00) | 1.00 (1.00–1.00) | 0.92 (0.79–1.00) |
| Replication | 1.00 (1.00–1.00) | 1.00 (1.00–1.00) | 1.00 (1.00–1.00) | |
| 0.22 | 1.00 | 0.53 | ||
| B vs. III–IV | Discovery | 0.96 (0.92–1.00) | 0.90 (0.81–0.98) | 0.96 (0.89–1.00) |
| Replication | 0.96 (0.90–1.00) | 0.72 (0.50–0.89) | 0.92 (0.77–1.00) | |
| 0.92 | 0.11 | 1.00 | ||
| I–II vs. III–IV | Discovery | 0.78 (0.61–0.95) | 0.98 (0.93–1.00) | 0.60 (0.33–0.80) |
| Replication | 0.81 (0.51–1.00) | 0.81 (0.62–1.00) | 0.75 (0.25–1.00) | |
| 0.85 | 0.054 | 1.00 |
a Numbers for discovery and replication are given as a point estimate and 95% confidence intervals. b The point estimate and 95% confidence interval is given at a cut-off defined in the discovery cohort at the point on the ROC (receiver operating characteristics) curve closest to perfect classification. c p-values for difference in AUCs were calculated using the DeLong’s method. For differences of sensitivity and specificity, a Fisher’s exact test was used. B—Benign. Roman numerals specify ovarian cancer stage.
Figure 3Comparison of two disjunct multivariate models based on all proteins. (A) ROC curve for the first model (‘m1′, including ALPP, GFOD2, IFNG, IL6, KIR3DL1, KRT19, MEP1B, PAEP, SIGLEC5, and WFDC2) in the validation (solid line) and training (dashed line) cohorts for separating benign controls from ovarian cancer stages I–IV. The combined performance of clinically measured MUCIN-16 (CA125) in the same cohorts is shown as a dashed blue line and the performance of the previously developed eleven-biomarker panel [15] is shown as a long-dashed black line. In each ROC curve, the point estimate of sensitivity and specificity at a cut-off (‘best-point’ for multivariate models, 35 U/mL for clinical MUCIN-16) is indicated by a red dot. (B) As (A), but for the second model (‘m2′, including AGR2, BRK1, CES3, DPY30, FOLR1, KLK1, KLK10, MUCIN-16, SCGB3A2, and VTCN1). (C) Comparison of risk scores for each individual in the training cohort (circles) and validation cohort (crosses) for the two models. Benign samples are illustrated in blue and malignant samples are illustrated in red. The dashed light-grey lines correspond to the cut-off for malignancy in the two models: 0.599 for the first model (y-axis) and 0.606 for the second (x-axis). (D) Univariate results for the proteins in the first model. The x-axis indicates mean differences between cases and controls in the discovery cohort and the y-axis represents the statistical significance of this difference (in −log10). The two horizontal lines indicate nominal significance levels and those adjusted for multiple hypothesis testing and is also indicated by the colours of the bars. (E) As (D), but for the second model.
Results from the multivariate modelling using all proteins.
| AUC a | Sens b | Spec b | ||
|---|---|---|---|---|
| Model 1 | Discovery | 0.98 (0.96–1.00) | 0.93 (0.82–1.00) | 0.95 (0.85–1.00) |
| Replication | 1.00 (1.00–1.00) | 0.78 (0.56–0.94) | 1.00 (1.00–1.00) | |
| 0.16 | 0.19 | 1.00 | ||
| Model 2 | Discovery | 0.94 (0.89–0.99) | 0.87 (0.76–0.96) | 0.96 (0.87–1.00) |
| Replication | 0.95 (0.89–1.00) | 0.53 (0.32–0.74) | 1.00 (1.00–1.00) | |
| 0.78 | 0.0073 | 1.00 |
a Numbers for discovery and replication are given as point estimates and 95% confidence intervals. b The point estimate and 95% confidence interval is given at a cut-off defined in the discovery cohort at the point on the ROC (receiver operating characteristics) curve closest to perfect classification. c p-values for difference in AUCs was calculated using the DeLong’s method. For the differences of sensitivity and specificity, a Fisher’s exact test was used.