Milena Falcaro1, Ula Nur, Bernard Rachet, James R Carpenter. 1. From the aUniversity College London, London, United Kingdom; bLondon School of Hygiene and Tropical Medicine, London, United Kingdom; and cMRC Clinical Trials Unit, London, United Kingdom.
Abstract
BACKGROUND: Net survival is the survival probability we would observe if the disease under study were the only cause of death. When estimated from routinely collected population-based cancer registry data, this indicator is a key metric for cancer control. Unfortunately, such data typically contain a non-negligible proportion of missing values on important prognostic factors (eg, tumor stage). METHODS: We carried out an empirical study to compare the performance of complete records analysis and several multiple imputation strategies when net survival is estimated via a flexible parametric proportional hazards model that includes stage, a partially observed categorical covariate. Starting from fully observed cancer registry data, we induced missingness on stage under three scenarios. For each of these scenarios, we simulated 100 incomplete datasets and evaluated the performance of the different strategies. RESULTS: Ordinal logistic models are not suitable for the imputation of tumor stage. Complete records analysis may lead to grossly misleading estimates of net survival, even when the missing data mechanism is conditionally independent of survival time given the covariates and the bias on the excess hazard ratios estimates is negligible. CONCLUSIONS: As key covariates are unlikely missing completely at random, studies estimating net survival should not use complete records. When the missingness can be inferred from available data, appropriate multiple imputation should be performed. In the context of flexible parametric proportional hazards models with a partially observed stage covariate, a multinomial logistic imputation model for stage should be used and should include the Nelson-Aalen cumulative hazard estimate and the event indicator.
BACKGROUND: Net survival is the survival probability we would observe if the disease under study were the only cause of death. When estimated from routinely collected population-based cancer registry data, this indicator is a key metric for cancer control. Unfortunately, such data typically contain a non-negligible proportion of missing values on important prognostic factors (eg, tumor stage). METHODS: We carried out an empirical study to compare the performance of complete records analysis and several multiple imputation strategies when net survival is estimated via a flexible parametric proportional hazards model that includes stage, a partially observed categorical covariate. Starting from fully observed cancer registry data, we induced missingness on stage under three scenarios. For each of these scenarios, we simulated 100 incomplete datasets and evaluated the performance of the different strategies. RESULTS: Ordinal logistic models are not suitable for the imputation of tumor stage. Complete records analysis may lead to grossly misleading estimates of net survival, even when the missing data mechanism is conditionally independent of survival time given the covariates and the bias on the excess hazard ratios estimates is negligible. CONCLUSIONS: As key covariates are unlikely missing completely at random, studies estimating net survival should not use complete records. When the missingness can be inferred from available data, appropriate multiple imputation should be performed. In the context of flexible parametric proportional hazards models with a partially observed stage covariate, a multinomial logistic imputation model for stage should be used and should include the Nelson-Aalen cumulative hazard estimate and the event indicator.
Authors: Edouard F Bonneville; Matthieu Resche-Rigon; Johannes Schetelig; Hein Putter; Liesbeth C de Wreede Journal: Stat Methods Med Res Date: 2022-06-05 Impact factor: 2.494
Authors: Chiara Di Girolamo; Sarah Walters; Sara Benitez Majano; Bernard Rachet; Michel P Coleman; Edmund Njeru Njagi; Melanie Morris Journal: BMC Cancer Date: 2018-05-02 Impact factor: 4.430
Authors: H Fowler; A Belot; E N Njagi; M A Luque-Fernandez; C Maringe; M Quaresma; M Kajiwara; B Rachet Journal: Br J Cancer Date: 2017-08-31 Impact factor: 7.640
Authors: Deanna P Jannat-Khah; Michelle Unterbrink; Margaret McNairy; Samuel Pierre; Dan W Fitzgerald; Jean Pape; Arthur Evans Journal: BMC Public Health Date: 2018-11-19 Impact factor: 3.295