| Literature DB >> 35626474 |
Takeshi Emura1,2, Hirofumi Michimae3, Shigeyuki Matsui2,4.
Abstract
Clinical risk prediction formulas for cancer patients can be improved by dynamically updating the formulas by intermediate events, such as tumor progression. The increased accessibility of individual patient data (IPD) from multiple studies has motivated the development of dynamic prediction formulas accounting for between-study heterogeneity. A joint frailty-copula model for overall survival and time to tumor progression has the potential to develop a dynamic prediction formula of death from heterogenous studies. However, the process of developing, validating, and publishing the prediction formula is complex, which has not been sufficiently described in the literature. In this article, we provide a tutorial in order to build a web-based application for dynamic risk prediction for cancer patients on the basis of the R packages joint.Cox and Shiny. We demonstrate the proposed methods using a dataset of breast cancer patients from multiple clinical studies. Following this tutorial, we demonstrate how one can publish web applications available online, which can be manipulated by any user through a smartphone or personal computer. After learning this tutorial, developers acquire the ability to build an online web application using their own datasets.Entities:
Keywords: Shiny; clustered data; copula; frailty model; meta-analysis; risk prediction; survival analysis
Year: 2022 PMID: 35626474 PMCID: PMC9140593 DOI: 10.3390/e24050589
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.738
Figure 1A risk prediction scheme in the framework of dynamic prediction. The measure of prediction is the conditional probability of death between t and t + w given the observed status of a patient at time t. The expressions for F are defined in Section 2.3.
Figure 2The web application for a clinical prediction tool made by applying the proposed methods to breast cancer data. The interactive version is available at https://takeshi.shinyapps.io/Breast-2022-0218/ (accessed on 1 April 2022).
The breast cancer dataset of Haibe-Kains et al. [4].
| Maximum Follow-Up Days | Dataset a |
| The Number of Observed Events (Event Rates) | ||
|---|---|---|---|---|---|
| Metastasis | Death | Censoring | |||
| 5165 | CAL | 109 | 24 (22%) | 75 (69%) | 34 (31%) |
| 6694 | NKI | 295 | 101 (34%) | 79 (27%) | 216 (73%) |
| 9108 | TRANSBIG | 196 | 62 (32%) | 56 (29%) | 140 (71%) |
| 8267 | UCSF | 120 | 19 (16%) | 39 (32%) | 81 (68%) |
| 9108 | Total | 720 | 206 (29%) | 249(35%) | 471 (65%) |
Notes: The R code for obtaining the data is available in the Supplementary Materials. The data are a subset from the file “jnci-JNCI-11-0924-s02.csv” available in the Supplementary Data of Haibe-Kains et al. [4]; the file is available on the journal webpage. a Datasets are signified as acronyms: CAL = dataset of breast cancer patients from the University of California, San Francisco, and the California Pacific Medical Center (United States); NKI = National Kanker Institute (the Netherlands); TRANSBIG = dataset collected by the TransBIG consortium (Europe); UCSF = University of California, San Francisco (United States). The extracted data are the subset having complete values of “t.dmfs: time for distant metastasis-free survival (days)”, “e.dmfs: event for distant metastasis-free survival”, “t.os: time for overall survival (days)”, and “e.os: event for overall survival”, as well as covariates (ER, Size, Node, Age, MammaPrint, and GGI). The median follow-up time was calculated from the Kaplan–Meier estimator for the time to censoring for each study. The event rates were calculated separately for each study.
Figure 3The predicted probability of death when the prediction time is set at t = 1000 days. The 95% CIs are indicated by the dotted lines (……), and their widths are shown by the vertical lines (and the number below the lines) at three time points.
Figure 4The calibration plots comparing the observed and predicted survival rates, consisting of with equally spaced prediction horizons, (days). If the plots are placed on the diagonal line, the ideal performance of the prediction formula is achieved. Left panel: the joint frailty-copula model; right panel: the dynamic KM estimator.
Figure 5(Left panel): prediction errors (Brier score) based on the breast cancer data at the prediction time at 1000 days. (Right panel): the c-index for discrimination ability with the 95% CI based on the same setting.
Figure 6The web application for a clinical prediction tool made by applying the proposed methods to ovarian cancer data. The interactive version is available at https://takeshi.shinyapps.io/Ovarian-2022-0218/ (accessed on 1 April 2022).