| Literature DB >> 31240259 |
Stefan Enroth1, Malin Berggrund1, Maria Lycke2, John Broberg3, Martin Lundberg3, Erika Assarsson3, Matts Olovsson4, Karin Stålberg4, Karin Sundfeldt2, Ulf Gyllensten1.
Abstract
Ovarian cancer is usually detected at a late stage and the overall 5-year survival is only 30-40%. Additional means for early detection and improved diagnosis are acutely needed. To search for novel biomarkers, we compared circulating plasma levels of 593 proteins in three cohorts of patients with ovarian cancer and benign tumors, using the proximity extension assay (PEA). A combinatorial strategy was developed for identification of different multivariate biomarker signatures. A final model consisting of 11 biomarkers plus age was developed into a multiplex PEA test reporting in absolute concentrations. The final model was evaluated in a fourth independent cohort and has an AUC = 0.94, PPV = 0.92, sensitivity = 0.85 and specificity = 0.93 for detection of ovarian cancer stages I-IV. The novel plasma protein signature could be used to improve the diagnosis of women with adnexal ovarian mass or in screening to identify women that should be referred to specialized examination.Entities:
Keywords: Diagnostic markers; Machine learning; Ovarian cancer
Mesh:
Substances:
Year: 2019 PMID: 31240259 PMCID: PMC6586828 DOI: 10.1038/s42003-019-0464-9
Source DB: PubMed Journal: Commun Biol ISSN: 2399-3642
Cohort statistics
| Cohort | Origina | Types | No of Women | Age, mean (SD) | CA-125b |
|---|---|---|---|---|---|
| Discovery | Gbg | Benign tumors | 90 | 60.0 (16.8) | 16.8 (9.9) |
| Stage I–II | 42 | 60.7 (12.4) | 67.6 (72.0) | ||
| Stage III–IV | 37 | 63.8 (14.1) | 327.4 (284.5) | ||
| 1st Replication | Gbg | Benign tumors | 71 | 60.2 (14.5) | NA |
| Stage I–II | 44 | 62.4 (13.7) | NA | ||
| Stage III–IV | 56 | 61.6 (11.3) | NA | ||
| 2nd Replication | UCAN | Stage I–II | 13 | 55.9 (15.0) | NA |
| Stage III–IV | 64 | 59.4 (12.0) | NA | ||
| 3rd Replication | Gbg | Benign tumors | 106 | 57.9 (16.1) | 31.5 (29.7) |
| Borderline | 28 | 49.4 (19.6) | 58.0 (50.4) | ||
| Stage I–II | 25 | 65.2 (10.0) | 96.5 (116.4) | ||
| Stage III–IV | 65 | 61.4 (12.2) | 739.0 (812.5) |
aUCAN: collection at Uppsala Biobank, Uppsala University, Sweden. Gbg: Gynaecology tumor biobank at Sahlgrenska University Hospital, Göteborg, Sweden
bMeasured at clinic [U/L], median (median absolute deviation). NA indicates ‘not available’
Fig. 1Model Generation. (a) Repeated model generation over random splits of the data. Proteins present in a sufficient fraction of the models are included into the core. b Generation of mutually exclusive cores. Proteins present in the first core (top node) are sequentially withheld from the second round of core discovery, as indicated by the sets to the left of the nodes. Each core of size N generates N new search-branches. c The final models are built by adding proteins to each core. The added proteins are chosen with respect to the proteins excluded in the core-discovery. Proteins are added in a stepwise forward selection choosing the protein that explains the highest proportion of remaining variance in the decision. See Methods for details
Fig. 2Top 50 model characteristics. a Variance explained in the decision (Benign tumor or ovarian cancer stage III–IV by the cores (as indicated in blue) and by the additional proteins (gray) in the test set of the Discovery Data. Sensitivity and 1-Specifity of the cores (hollow markers) and the full models (filled markers) are shown (right axis) in red. (b) Protein inclusion into cores. Top 50 cores are indicated with C1, …, C50 and proteins are labeled with their short name. A connector represents inclusion of that protein in a core. c Same as (b) but for additional proteins (not including core-proteins). Top 50 additional protein-sets are indicated by A1, …, A50
Performance ranges of all models
| Stagea | MUC16b | No. | Size | Cohort | AUC | PPV | NPV | BPsensc | BPspecc | FSEsed | FSEspd | FSPsed | FSPspd |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| I–II | Yes | 36 | 8–17 | Discovery | 0.80–0.94 | 0.71–0.89 | 0.89–0.97 | 0.77–0.95 | 0.84–0.93 | 0.99–1.00 | 0.04–0.14 | 0.58–0.90 | 0.95–0.96 |
| 1st Repl. | 0.58–0.71 | 0.55–0.69 | 0.75–0.81 | 0.63–0.74 | 0.68–0.80 | NA | NA | 0.16–0.45 | 0.94–0.95 | ||||
| 2st Repl. | 0.49–0.83 | 0.30–0.59 | 0.92–0.98 | 0.74–0.92 | 0.68–0.83 | 1.00–1.00 | 0.06–0.06 | 0.12–0.51 | 0.94–0.95 | ||||
| No | 448 | 9–20 | Discovery | 0.54–0.91 | 0.44–0.84 | 0.76–0.94 | 0.60–0.89 | 0.61–0.91 | 1.00–1.00 | 0.04–0.07 | 0.13–0.77 | 0.95–0.96 | |
| 1st Repl. | 0.46–0.82 | 0.50–0.77 | 0.69–0.89 | 0.53–0.83 | 0.64–0.84 | 0.99–1.00 | 0.06–0.09 | 0.16–0.59 | 0.94–0.96 | ||||
| 2st Repl. | 0.41–0.93 | 0.27–0.78 | 0.89–1.00 | 0.71–0.98 | 0.63–0.92 | 1.00–1.00 | 0.05–0.06 | 0.08–0.81 | 0.94–0.95 | ||||
| III–IV | Yes | 36 | 8–17 | Discovery | 0.95–0.96 | 0.94–1.00 | 0.98–1.00 | 0.95–1.00 | 0.97–1.00 | 1.00–1.00 | 0.04–0.10 | 0.93–1.00 | 0.95–0.96 |
| 1st Repl. | 0.85–0.92 | 0.82–0.93 | 0.88–0.93 | 0.84–0.91 | 0.86–0.95 | 0.97–0.98 | 0.11–0.31 | 0.68–0.86 | 0.95–0.96 | ||||
| 2st Repl. | 0.75–0.91 | 0.76–0.92 | 0.77–0.92 | 0.74–0.90 | 0.79–0.93 | 0.95–0.96 | 0.15–0.50 | 0.50–0.82 | 0.94–0.96 | ||||
| No | 448 | 9–20 | Discovery | 0.94–0.96 | 0.89–1.00 | 0.97–1.00 | 0.93–1.00 | 0.95–1.00 | 0.99–1.00 | 0.04–0.12 | 0.90–1.00 | 0.95–0.96 | |
| 1st Repl. | 0.78–0.90 | 0.78–0.95 | 0.82–0.92 | 0.76–0.91 | 0.80–0.96 | 0.96–0.99 | 0.07–0.34 | 0.54–0.87 | 0.94–0.96 | ||||
| 2st Repl. | 0.77–0.94 | 0.77–0.96 | 0.77–0.97 | 0.74–0.97 | 0.78–0.97 | 0.95–0.97 | 0.19–0.69 | 0.42–0.92 | 0.94–0.96 | ||||
| I–IV | Yes | 36 | 8–17 | Discovery | 0.88–0.94 | 0.88–0.95 | 0.86–0.96 | 0.85–0.95 | 0.89–0.96 | 0.95–0.96 | 0.32–0.74 | 0.76–0.93 | 0.95–0.96 |
| 1st Repl. | 0.75–0.83 | 0.83–0.89 | 0.69–0.75 | 0.73–0.80 | 0.77–0.87 | 0.95–0.96 | 0.09–0.24 | 0.47–0.65 | 0.95–0.96 | ||||
| 2st Repl. | 0.70–0.87 | 0.75–0.89 | 0.70–0.87 | 0.71–0.87 | 0.73–0.89 | 0.95–0.95 | 0.14–0.59 | 0.39–0.73 | 0.95–0.96 | ||||
| No | 448 | 9–20 | Discovery | 0.74–0.92 | 0.76–0.93 | 0.76-0.90 | 0.70–0.88 | 0.79–0.93 | 0.95–0.96 | 0.04–0.55 | 0.49–0.84 | 0.95–0.96 | |
| 1st Repl. | 0.67–0.84 | 0.78–0.92 | 0.60–0.80 | 0.62–0.83 | 0.73–0.90 | 0.95–0.96 | 0.04–0.35 | 0.35–0.72 | 0.95–0.96 | ||||
| 2st Repl. | 0.75–0.93 | 0.77–0.95 | 0.73–0.96 | 0.74–0.96 | 0.75–0.95 | 0.95–0.96 | 0.16–0.83 | 0.41–0.91 | 0.94–0.96 |
All ranges indicate lowest and highest values for all models on that row
‘NA’ means that not such point exists
aPerformances are for benign tumors vs this stage of ovarian cancers
bIndicates whether or not Mucin-16 was included in the model
cPerformances when cut-off is chosen at the best point (BP, closest point on ROC-curve to perfect classification)
dPerformances at a point on the ROC-curves with at least 0.93 sensitivity (FSEse and FSEsp) or specificity (FSPse and FSPsp)
Fig. 3Top-ranking model performance in 1st replication cohort. a Distribution of protein abundance levels in NPX for the three proteins in the core in patients with Benign tumors (indicated with a ‘B’) and ovarian cancer stage III–IV (indicated with ‘OC’). Horizontal black lines indicate mean of the protein abundance levels. b PCA plot of the first two components using the proteins in the core. Figures show Benign tumors in black and ovarian cancer stages III–IV in red. c As (a) but for the six first additional proteins in the model. d As (b) but for the complete model with 14 proteins. e–g Receiver operating characteristic (ROC) curves of the performance of the complete model in the 1st replication cohort. From top to bottom, the ROC-curves represent Benign tumors vs. Ovarian cancer stages I–II, III–IV, and I–IV, respectively
Performance of the top-ranking and the proof-of-concept model
| Stagea | Cohort | AUC | PPV | NPV | BPseb | BPspb | FSEsec | FSEspc | FSPsec | FSPspc |
|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||
| I–II | Discovery | 0.82 (0.07) | 0.68 (0.14) | 0.92 (0.04) | 0.85 (0.09) | 0.82 (0.08) | 1.00 (0.01) | 0.06 (0.06) | 0.60 (0.16) | 0.96 (0.01) |
| 1st Repl. | 0.51 (0.1) | 0.62 (0.13) | 0.79 (0.09) | 0.71 (0.13) | 0.71 (0.12) | 1.00 (0.01) | 0.20 (0.07) | 0.29 (0.15) | 0.94 (0.01) | |
| 2nd Repl. | 0.27 (0.15) | 0.25 (0.16) | 0.87 (0.09) | 0.65 (0.23) | 0.51 (0.22) | 1.00 (0) | 0.15 (0.09) | 0.06 (0.12) | 0.96 (0.03) | |
| I–IV | Discovery | 0.86 (0.04) | 0.88 (0.08) | 0.87 (0.06) | 0.86 (0.06) | 0.89 (0.07) | 0.95 (0.01) | 0.31 (0.26) | 0.75 (0.11) | 0.96 (0.01) |
| 1st Repl. | 0.65 (0.08) | 0.83 (0.06) | 0.73 (0.10) | 0.79 (0.09) | 0.78 (0.08) | 0.96 (0.01) | 0.26 (0.12) | 0.52 (0.14) | 0.96 (0.02) | |
| 2nd Repl. | 0.57 (0.09) | 0.78 (0.08) | 0.70 (0.09) | 0.69 (0.09) | 0.78 (0.10) | 0.95 (0.01) | 0.27 (0.12) | 0.45 (0.16) | 0.95 (0.02) | |
| III–IV | Discovery | 0.91 (0.06) | 0.95 (0.11) | 0.95 (0.11) | 0.96 (0.06) | 0.98 (0.05) | 1.00 (0) | 0.06 (0.03) | 0.94 (0.08) | 0.96 (0.01) |
| 1st Repl. | 0.70 (0.08) | 0.81 (0.09) | 0.81 (0.09) | 0.86 (0.07) | 0.85 (0.08) | 0.98 (0.03) | 0.24 (0.14) | 0.68 (0.16) | 0.95 (0.01) | |
| 2nd Repl. | 0.60 (0.08) | 0.79 (0.10) | 0.79 (0.10) | 0.75 (0.09) | 0.81 (0.07) | 0.96 (0.03) | 0.31 (0.16) | 0.49 (0.14) | 0.95 (0.02) | |
|
| ||||||||||
| I–II | Discovery | 0.83 (0.06) | 0.74 (0.15) | 0.91 (0.05) | 0.81 (0.09) | 0.86 (0.09) | 1.00 (0.01) | 0.06 (0.08) | 0.60 (0.18) | 0.96 (0.01) |
| 1st Repl. | 0.61 (0.09) | 0.60 (0.13) | 0.75 (0.10) | 0.64 (0.13) | 0.70 (0.12) | 0.99 (0.03) | 0.04 (0.02) | 0.26 (0.15) | 0.95 (0.02) | |
| 2nd Repl. | 0.65 (0.18) | 0.42 (0.22) | 0.95 (0.05) | 0.80 (0.20) | 0.74 (0.17) | 1.00 (0) | 0.06 (0.01) | 0.30 (0.27) | 0.95 (0.01) | |
| I–IV | Discovery | 0.88 (0.04) | 0.91 (0.06) | 0.86 (0.06) | 0.85 (0.06) | 0.91 (0.06) | 0.95 (0.01) | 0.38 (0.18) | 0.78 (0.09) | 0.96 (0.01) |
| 1st Repl. | 0.79 (0.06) | 0.85 (0.07) | 0.71 (0.09) | 0.74 (0.08) | 0.83 (0.09) | 0.96 (0.01) | 0.09 (0.14) | 0.58 (0.13) | 0.95 (0.02) | |
| 2nd Repl. | 0.85 (0.05) | 0.88 (0.06) | 0.84 (0.08) | 0.86 (0.07) | 0.87 (0.06) | 0.95 (0.01) | 0.35 (0.29) | 0.73 (0.12) | 0.96 (0.02) | |
| III–IV | Discovery | 0.95 (0.01) | 1.00 (0.02) | 1.00 (0.02) | 0.99 (0.03) | 1.00 (0.01) | 1.00 (0) | 0.04 (0) | 0.99 (0.03) | 0.96 (0.01) |
| 1st Repl. | 0.89 (0.04) | 0.93 (0.07) | 0.93 (0.07) | 0.89 (0.06) | 0.95 (0.05) | 0.97 (0.03) | 0.27 (0.31) | 0.86 (0.10) | 0.95 (0.01) | |
| 2nd Repl. | 0.87 (0.05) | 0.89 (0.09) | 0.89 (0.09) | 0.88 (0.06) | 0.90 (0.08) | 0.95 (0.02) | 0.28 (0.31) | 0.80 (0.13) | 0.94 (0.01) | |
|
| ||||||||||
| I–II | Discovery | 0.83 (0.06) | 0.72 (0.13) | 0.91 (0.05) | 0.83 (0.08) | 0.84 (0.08) | 1.00 (0.01) | 0.05 (0.06) | 0.60 (0.19) | 0.96 (0.01) |
| 1st Repl. | 0.69 (0.10) | 0.63 (0.11) | 0.82 (0.11) | 0.77 (0.13) | 0.69 (0.11) | 0.99 (0.02) | 0.05 (0.02) | 0.37 (0.15) | 0.95 (0.02) | |
| 2nd Repl. | 0.70 (0.20) | 0.58 (0.27) | 0.95 (0.05) | 0.80 (0.18) | 0.82 (0.2) | 1.00 (0) | 0.06 (0) | 0.54 (0.31) | 0.94 (0.01) | |
| I–IV | Discovery | 0.88 (0.04) | 0.88 (0.06) | 0.89 (0.06) | 0.87 (0.07) | 0.90 (0.06) | 0.95 (0.01) | 0.40 (0.22) | 0.79 (0.09) | 0.96 (0.01) |
| 1st Repl. | 0.82 (0.05) | 0.87 (0.08) | 0.75 (0.08) | 0.79 (0.07) | 0.85 (0.09) | 0.96 (0.01) | 0.20 (0.18) | 0.66 (0.12) | 0.95 (0.01) | |
| 2nd Repl. | 0.83 (0.04) | 0.87 (0.07) | 0.84 (0.07) | 0.83 (0.08) | 0.87 (0.07) | 0.95 (0.01) | 0.36 (0.23) | 0.68 (0.11) | 0.95 (0.01) | |
| III–IV | Discovery | 0.95 (0.02) | 0.99 (0.03) | 0.99 (0.03) | 0.98 (0.04) | 1.00 (0.01) | 1.00 (0) | 0.04 (0) | 0.98 (0.04) | 0.96 (0) |
| 1st Repl. | 0.90 (0.04) | 0.94 (0.06) | 0.94 (0.06) | 0.91 (0.07) | 0.95 (0.05) | 0.97 (0.03) | 0.27 (0.31) | 0.88 (0.10) | 0.95 (0.02) | |
| 2nd Repl. | 0.84 (0.06) | 0.88 (0.07) | 0.88 (0.07) | 0.85 (0.08) | 0.89 (0.07) | 0.95 (0.02) | 0.32 (0.30) | 0.73 (0.14) | 0.95 (0.02) | |
aPerformances are for benign tumors vs this stage of ovarian cancers
bPerformances when cut-off is chosen at the best point (BP, closest point on ROC-curve to perfect classification)
cPerformances at a point on the ROC-curves with at least 0.93 sensitivity (FSEse and FSEsp) or specificity (FSPse and FSPsp)
Fig. 4Final models’ performance in the 3rd replication cohort. a ROC-curves for the test/training (gray), validation (black) and final model (red) for each of the 4 models. The AUC is taken from the performance in the validation partition. All models were trained on benign vs malign (stages I–IV) samples. b ROC-curves for the 4 final models when evaluated on subsets of stages. c Distribution of outcomes in ranges of prediction scores (left) for the ‘11-plex + Age’ model and distribution of prediction scores for each outcome (right). In the right panel, the three cut-offs for ‘best-point (BP)’, ‘focus on sensitivity (FSE, sensitivity ≥0.98)’ and ‘focus on specificity (FSP, specificity ≥0.98)’ are illustrated by horizontal dashed lines. The solid black lines indicate the mean prediction score in each outcome group. d As (c) but for the ‘2-plex + Age’ model
Performance of the custom assay in the independent replication cohort, combined analysis
| Full modela | MUCIN-16 and WFDC2 and Age | |||||||
|---|---|---|---|---|---|---|---|---|
| Benign vs Malign | Benign vs Stage I–II | Benign vs Stage III–IV | Stage I–II vs Stage III–IV | Benign vs Malign | Benign vs Stage I–II | Benign vs Stage III–IV | Stage I–II vs Stage III–IV | |
| AUC | 0.94 (0.91-0.98) | 0.88 (0.81–0.96) | 0.98 (0.96-1) | 0.74 (0.63–0.86) | 0.90 (0.85–0.95) | 0.79 (0.67–0.91) | 0.95 (0.92–0.99) | 0.77 (0.65–0.88) |
| PPVb | 0.92 | 0.71 | 0.90 | 0.78 | 0.97 | 0.88 | 0.97 | 0.80 |
| NPVb | 0.88 | 0.93 | 0.95 | 0.62 | 0.83 | 0.90 | 0.92 | 0.55 |
| BPcutc | 0.3937 | 0.3937 | 0.3937 | 0.3937 | 0.5117 | 0.5117 | 0.5117 | 0.5117 |
| BPse | 0.85 (0.76–0.91) | 0.68 (0.48–0.84) | 0.92 (0.85–0.98) | 0.92 (0.86–0.98) | 0.76 (0.67–0.85) | 0.56 (0.36–0.76) | 0.86 (0.77–0.94) | 0.86 (0.77–0.94) |
| BPsp | 0.93 (0.88–0.98) | 0.93 (0.88–0.98) | 0.93 (0.89–0.97) | 0.32 (0.16–0.52) | 0.98 (0.95–1.00) | 0.98 (0.95–1.00) | 0.98 (0.95–1.00) | 0.44 (0.24–0.64) |
| FSEcutc | 0.1976 | 0.1976 | 0.1976 | 0.1976 | 0.2047 | 0.2047 | 0.2047 | 0.2047 |
| FSEse | 0.99 (0.96–1.00) | 1.00 (1.00–1.00) | 1.00 (1.00–1.00) | 1.00 (1.00–1.00) | 0.99 (0.97–1.00) | 0.96 (0.88–1.00) | 1.00 (1.00–1.00) | 1.00 (1.00–1.00) |
| FSEsp | 0.31 (0.23–0.40) | 0.31 (0.23–0.41) | 0.31 (0.23–0.40) | 0 (0.00–0.00) | 0.19 (0.11–0.26) | 0.19 (0.12–0.26) | 0.19 (0.11–0.27) | 0.04 (0.00–0.12) |
| FSPcutc | 0.4908 | 0.4908 | 0.4908 | 0.4908 | 0.5117 | 0.5117 | 0.5117 | 0.5117 |
| FSPse | 0.77 (0.69–0.86) | 0.56 (0.36–0.76) | 0.88 (0.80–0.95) | 0.88 (0.78–0.95) | 0.76 (0.68–0.85) | 0.56 (0.36–0.76) | 0.86 (0.77–0.94) | 0.86 (0.77–0.94) |
| FSPsp | 0.98 (0.95–1.00) | 0.98 (0.95–1.00) | 0.9 (0.95–1.00) | 0.44 (0.24–0.64) | 0.98 (0.95–1.00) | 0.98 (0.95–1.00) | 0.98 (0.95–1.00) | 0.44 (0.24–0.64) |
aProof-of-concept model plus age
bPerformances when cut-off is chosen at the best point (BP, closest point on ROC-curve to perfect classification)
cCut-off thresholds calculated in the Benign vs. Malign models and applied to difference subgroups. The BPcut is taken at the point on the ROC-curve closest to perfect performance. The FSEcut is taken from point with highest specificity when requiring at least 0.98 sensitivity. The FSPcut is taken from point with highest sensitivity when requiring at least 0.98 specificity. All cells: numbers in parentheses represent 95% confidence intervals