| Literature DB >> 30657384 |
Fatemeh Seyednasrollah1, Devin C Koestler1, Tao Wang1, Stephen R Piccolo1, Roberto Vega1, Russell Greiner1, Christiane Fuchs1, Eyal Gofer1, Luke Kumar1, Russell D Wolfinger1, Kimberly Kanigel Winner1, Chris Bare1, Elias Chaibub Neto1, Thomas Yu1, Liji Shen1, Kald Abdallah1, Thea Norman1, Gustavo Stolovitzky1, Howard R Soule1, Christopher J Sweeney1, Charles J Ryan1, Howard I Scher1, Oliver Sartor1, Laura L Elo1, Fang Liz Zhou1, Justin Guinney1, James C Costello1.
Abstract
PURPOSE: Docetaxel has a demonstrated survival benefit for patients with metastatic castration-resistant prostate cancer (mCRPC); however, 10% to 20% of patients discontinue docetaxel prematurely because of toxicity-induced adverse events, and the management of risk factors for toxicity remains a challenge. PATIENTS AND METHODS: The comparator arms of four phase III clinical trials in first-line mCRPC were collected, annotated, and compiled, with a total of 2,070 patients. Early discontinuation was defined as treatment stoppage within 3 months as a result of adverse treatment effects; 10% of patients discontinued treatment. We designed an open-data, crowd-sourced DREAM Challenge for developing models with which to predict early discontinuation of docetaxel treatment. Clinical features for all four trials and outcomes for three of the four trials were made publicly available, with the outcomes of the fourth trial held back for unbiased model evaluation. Challenge participants from around the world trained models and submitted their predictions. Area under the precision-recall curve was the primary metric used for performance assessment.Entities:
Mesh:
Substances:
Year: 2017 PMID: 30657384 PMCID: PMC6874023 DOI: 10.1200/CCI.17.00018
Source DB: PubMed Journal: JCO Clin Cancer Inform ISSN: 2473-4276
Fig 1.Study design. Data were acquired from Project Data Sphere Cancer Research Platform and centrally curated by the organizing team to create a standardized data set across the four studies. Three of the studies (ASCENT2, VENICE, and MAINSAIL) were selected as training sets, and a fourth data set (ENTHUSE 33) was withheld as a validation set. Teams submitted risk scores for evaluation in the validation set, which were scored and ranked by using the area under the precision-recall curve (AUPRC). The vertical dashed line in the rightmost panel represents the bootstrap estimate of the fraction of discontinuation events in the ENTHUSE33 dataset. (NOTE. Precision = positive predictive value, recall = sensitivity.) ALB, albumin; ALP, alkaline phosphatase; BMI, body mass index; HB, hemoglobin; mCRPC, metastatic castration-resistant prostate cancer; MI, myocardial infarction; TURP, transurethral resection of the prostate.
Summary of Selected Baseline Clinical Characteristics Across Trials
Fig 2.Rate and frequency of treatment discontinuation across trials. (A) Trial-specific cumulative incidence functions for treatment discontinuation as a result of adverse events or possible adverse events (solid lines) and death (dotted lines). (B) Fraction of patients with metastatic castration-resistant prostate cancer who discontinued treatment ≤ 3 months after initiation because of adverse events or possible adverse events.
Fig 3.Most frequent clinical features used across all prediction models. Abbreviated terms are provided in the Data Supplement.
Fig 4.Meta-analysis of risk scores computed by the seven top-performing teams. (A) Hierarchical clustering heatmap of patients in the ENTHUSE 33 validation data set (n = 470) on the basis of their normalized ranked risk score, computed across the seven top-performing teams. (B) Kaplan-Meier curves, stratified by event type—that is, death or treatment discontinuation—across the three identified patient subgroups. (C) Distribution of baseline laboratory variables found to be significantly different between the three patient subgroups. (D) Distribution of baseline prior medical and medication variables found to be significantly different between the three patient subgroups. ACE, angiotensin-converting enzyme; ALB, albumin; ALP, alkaline phosphatase; Ca, calcium; CREACL, creatinine clearance; HB, hemoglobin; LDH, lactate dehydrogenase; MHGEN, medical history: general disorders and administration site conditions; Na, sodium; PSA, prostate-specific antigen; TPRO, total protein.
Fig 5.Performance of the postchallenge ensemble-based prediction model. (A) Area under the precision-recall curve (AUPRC) computed within the ENTHUSE 33 data set for the ensemble-based prediction model, along with the models developed by the seven top-performing teams. Diamonds represent the observed AUPRCs, and horizontal boxplots reflect the empirical distribution of a model’s AUPRC on the basis of 5,000 bootstrap samples generated from each models’ predictions. The vertical dotted line represents the mean AUPRC computed from 5,000 bootstrap samples generated from a random prediction model. (B) Lift ratio (LR) curve for the ensemble-based prediction model, with gray lines representing the LR curves generated for 100 random prediction models. (C) Distribution of the area under the LR curve at 20% on the basis of random prediction models (gold), all challenge submissions teams (blue), the top-performing teams (gray points), and the postchallenge ensemble-based classifier (red).