| Literature DB >> 35641659 |
Carlos Baeza-Delgado1, Leonor Cerdá Alberich1, José Miguel Carot-Sierra2, Diana Veiga-Canuto3, Blanca Martínez de Las Heras4, Ben Raza1,4, Luis Martí-Bonmatí5.
Abstract
BACKGROUND: Estimating the required sample size is crucial when developing and validating clinical prediction models. However, there is no consensus about how to determine the sample size in such a setting. Here, the goal was to compare available methods to define a practical solution to sample size estimation for clinical predictive models, as applied to Horizon 2020 PRIMAGE as a case study.Entities:
Keywords: Clinical predictive models; PRIMAGE; Paediatric oncology; Radiology; Sample size calculation
Mesh:
Year: 2022 PMID: 35641659 PMCID: PMC9156610 DOI: 10.1186/s41747-022-00276-y
Source DB: PubMed Journal: Eur Radiol Exp ISSN: 2509-9280
Fig. 1Factors involved in sample size estimation and model development. Example of the clinical outcomes, events, predictors and predictions applied to neuroblastoma
Clinical endpoints and model type
| sTumor type | Description | Type of outcome |
|---|---|---|
| Neuroblastoma | 5-year mortality risk | Binary |
| Neuroblastoma | Relapse/progression risk | Binary |
| Neuroblastoma | Time-to-death | Time-to-event |
| HR-Neuroblastoma | HR 5-year mortality risk | Binary |
| HR-Neuroblastoma | HR 5-year relapse/progression risk | Binary |
| HR-Neuroblastoma | HR-time to relapse/progression | Time-to-event |
| DIPG | 1-year mortality risk | Binary |
| DIPG | 2-year mortality risk | Binary |
| DIPG | 1-year progression risk | Binary |
| DIPG | Time-to-death | Time-to-event |
| DIPG | Time-to-progression | Time-to-event |
List of the clinical endpoints for neuroblastoma, HR neuroblastoma and DIPG tumors for which the sample size was determined, and the type of outcome for each of these. DIPG diffuse intrinsic pontine glioma, HR high-risk
Required data for sample size calculations
| Model | Prevalence | Median | Rate | Time point | Follow-up | |
|---|---|---|---|---|---|---|
| NB 5-year mortality risk | 0.307 | – | – | – | – | – |
| NB relapse/progression risk | 0.258 | – | – | – | – | – |
| NB time to death | 0.307 | 24.2 | 60 | 0.0063 | 24 | 24 |
| HR-NB 5-year mortality risk | 0.500 | – | – | – | – | – |
| HR-NB 5-year relapse/progression risk | 0.408 | - | – | – | – | – |
| HR-NB time to relapse/progression | 0.561 | 19.08 | 72.12 | 0.0132 | 24 | 24 |
| DIPG 1-year mortality risk | 0.450 | – | – | – | – | – |
| DIPG 2-year mortality risk | 0.169 | – | – | – | – | – |
| DIPG 1-year progression risk | 0.235 | – | – | – | – | – |
| DIPG time to death | 0.169 | 11.4 | 24 | 0.0077 | 24 | 24 |
| DIPG time to progression | 0.235 | 7.7 | 12 | 0.0214 | 12 | 12 |
Data related to each clinical endpoint for which the sample size has been determined. The number of parameters and the value of R is the same for all 11 scenarios (30 and 0.3, respectively) and thus, only the prevalence is required for the binary models
Results of sample size calculations
| Model | Riley’s sample size | 10 EPP | 5 EPP | Riley’s EPP |
|---|---|---|---|---|
| NB 5-year mortality risk | 1111 | 978 | 490 | 11.37 |
| NB relapse/progression risk | 1168 | 1166 | 584 | 10.03 |
| NB time to death | 1397 | 978 | 490 | 7.00 |
| HR-NB 5-year mortality risk | 1043 | 600 | 300 | 17.38 |
| HR-NB 5-year relapse/progression risk | 1060 | 736 | 369 | 14.42 |
| HR-NB time to relapse/progression | 1060 | 536 | 268 | 11.23 |
| DIPG 1-year mortality risk | 1043 | 668 | 334 | 15.65 |
| DIPG 2-year mortality risk | 1345 | 1776 | 889 | 7.58 |
| DIPG 1-year progression risk | 1208 | 1278 | 639 | 9.46 |
| DIPG time to death | 1273 | 1776 | 889 | 7.87 |
| DIPG time to progression | 1130 | 1278 | 639 | 9.67 |
Sample size estimated by Riley’s methodology, the 10 EPP “rule of thumb” and 5 EPP. The number of events per predictor (EPP) variable derived from the sample size obtained with Riley’s methodology was also calculated. EPP event per predictor parameter
Fig. 2Impact of the number of predictor variables on the sample size. Variability in the sample size relative to the number of predictor variables included in model development for neuroblastoma (A), HR neuroblastoma (B) and DIPG (C). DIPG diffuse intrinsic pontine glioma, HR high-risk, NB neuroblastoma
Variability of sample size with R2Nagelkerke
| Model | ||||||
|---|---|---|---|---|---|---|
| Sample size | EPP | Sample size | EPP | Sample size | EPP | |
| NB 5-year mortality risk | 2383 | 24.39 | 666 | 6.82 | 550 | 5.63 |
| NB relapse/progression risk | 2495 | 21.42 | 704 | 6.04 | 590 | 5.06 |
| NB time to death | 2951 | 14.79 | 858 | 4.30 | 749 | 3.75 |
| HR-NB 5-year mortality risk | 2247 | 37.45 | 620 | 10.33 | 501 | 8.35 |
| HR-NB 5-year relapse/progression risk | 2280 | 31.01 | 631 | 8.58 | 513 | 6.98 |
| HR-NB time to relapse/progression | 2280 | 24.15 | 631 | 6.68 | 513 | 5.43 |
| DIPG 1-year mortality risk | 2247 | 33.70 | 620 | 9.30 | 501 | 7.52 |
| DIPG 2-year mortality risk | 2848 | 16.04 | 824 | 4.64 | 713 | 4.02 |
| DIPG 1-year progression risk | 2575 | 20.17 | 731 | 5.73 | 618 | 4.84 |
| DIPG time to death | 2705 | 16.72 | 775 | 4.79 | 663 | 4.10 |
| DIPG time to progression | 2419 | 20.69 | 679 | 5.81 | 563 | 4.82 |
Sample size calculations with different values of R2Nagelkerke (0.15, 0.5, and 0.8). The number of events per predictor variable was obtained with the pmsampsize package
Fig. 3How the follow-up/timepoint ratio affects the sample size. Analysis of the variation in sample size for the time-to-event models relative to the ratio between the expected average follow-up of the dataset and the time points of interest for the predictions. DIPG diffuse intrinsic pontine glioma, HR high-risk, NB neuroblastoma
Sample sizes for external validation
| Model | 100 events | 200 events |
|---|---|---|
| NB 5-year mortality risk | 326 | 652 |
| NB relapse/progression risk | 288 | 776 |
| NB time to death | 326 | 652 |
| HR-NB 5-year mortality risk | 200 | 400 |
| HR-NB 5-year relapse/progression risk | 246 | 491 |
| HR-NB time to relapse/progression | 179 | 357 |
| DIPG 1-year mortality risk | 223 | 445 |
| DIPG 2-year mortality risk | 592 | 1184 |
| DIPG 1-year progression risk | 426 | 852 |
| DIPG time to death | 592 | 1184 |
| DIPG time to progression | 426 | 852 |
The sample size for the external validation of the models has been calculated considering a minimum effective sample size of 100 events and a desirable situation of 200 events