Literature DB >> 31953948

Marginal measures and causal effects using the relative survival framework.

Elisavet Syriopoulou¹, Mark J Rutherford¹, Paul C Lambert^1,2.

Abstract

BACKGROUND: In population-based cancer survival studies, the event of interest is usually death due to cancer. However, other competing events may be present. Relative survival is a commonly used measure in cancer studies that circumvents problems caused by the inaccuracy of the cause of death information. A summary of the prognosis of the cancer population and potential differences between subgroups can be obtained using marginal estimates of relative survival.
METHODS: We utilize regression standardization to obtain marginal estimates of interest in a relative survival framework. Such measures include the standardized relative survival, standardized all-cause survival and standardized crude probabilities of death. Contrasts of these can be formed to explore differences between exposure groups and under certain assumptions are interpreted as causal effects. The difference in standardized all-cause survival can also provide an estimate for the impact of eliminating cancer-related differences between exposure groups. The potential avoidable deaths after such hypothetical scenarios can also be estimated. To illustrate the methods we use the example of survival differences across socio-economic groups for colon cancer.
RESULTS: Using relative survival, a range of marginal measures and contrasts were estimated. For these measures we either focused on cancer-related differences only or chose to incorporate both cancer and other cause differences. The impact of eliminating differences between groups was also estimated. Another useful way for quantifying that impact is the avoidable deaths under hypothetical scenarios.
CONCLUSIONS: Marginal estimates within the relative survival framework provide useful summary measures and can be applied to better understand differences across exposure groups.

Entities: Chemical Disease Species

Keywords: Relative survival; avoidable deaths; causal effects; marginal measures

Mesh：

Year: 2020 PMID： 31953948 PMCID： PMC7266533 DOI： 10.1093/ije/dyz268

Source DB: PubMed Journal: Int J Epidemiol ISSN： 0300-5771 Impact factor: 7.196

Marginal measures provide population estimates in a relative survival framework with a simple interpretation. Such measures are marginal relative survival, marginal all-cause survival and marginal crude probabilities of death. Under certain assumptions, the difference in marginal estimates between subgroups can be interpreted as the average causal effect. Using the relative survival framework enables us to focus on cancer-related differences instead of all-cause differences that are more difficult to explore. The avoidable deaths under hypothetical scenarios can also be estimated.

Introduction

In population-based cancer survival studies the event of interest is usually death due to a specific cancer. However, death from other causes may prevent the event of interest occurring, i.e. there are so-called competing risks. To quantify cancer survival while accounting for differential other cause probability of death, net survival can be estimated. Net survival is the survival in a hypothetical world where the only possible cause of death is death due to cancer. It can be estimated using either the cause-specific or relative survival (excess mortality) approach., The cause-specific approach estimates cancer survival by censoring patients who died from other causes at the time of death. Information on cause of death is often inaccurate, particularly for the elderly. Relative survival is an alternative method to estimate net survival, but does not require information on cause of death through incorporating the expected mortality rates using population life tables. Information on appropriate expected mortality rates is essential for the interpretation of relative survival as net survival. Within the relative survival framework, two probabilities of interest are the net probability of death, which is 1 minus relative survival, and the crude probability of death. Each approach yields a different estimand and the choice is based on the research question of interest. Net probabilities of death focus on a hypothetical world where the cancer of interest is the only possible cause of death. They provide a useful measure when comparing populations, such as countries or socio-economic groups, with differential other cause mortality rates as they focus on differences that are only due to the cancer of interest. Crude probabilities of death are more appropriate when making clinical decisions for a specific patient as they quantify survival in the presence of other possible causes of death. In the competing risks literature, the terminology is generally different where crude probabilities are often referred to as cause-specific cumulative incidence functions. To improve understanding of the mechanisms that drive associations, causal inference methods can be applied., The mathematical framework used for formulating statistical models and assumptions for causal inference is that of potential outcomes: the outcomes that would be observed if the patient received a specific level of the exposure. Causal effects are defined as contrasts of marginal effects of the potential outcomes and enable quantification of differences in prognosis of subgroups. In order to make causal statements certain assumptions need to hold and these are explored later in this paper in the context of the relative survival setting—with one main issue being the level of stratification in the lifetables that is adequate to achieve conditional exchangeability for other cause mortality. Marginal effects can be estimated using approaches such as inverse probability weights, but here our focus is on using regression standardization to obtain standardized survival and related functions., Their simple interpretation as a single measure for each time-point circumvents problems of communication of results from complex statistical models. Contrasts of standardized measures can also be utilized to investigate the impact of eliminating the observed differences between groups. In this paper, we define various marginal measures using the relative survival framework and we define causal effects to compare population subgroups. To illustrate the measures we use the example of survival differences across socio-economic groups for colon cancer. Moreover, we estimate the potential avoidable deaths under a hypothetical scenario of eliminating survival differences between groups. The remainder of the paper is organized as follows. First, we introduce the data to illustrate the methods and describe relative survival and excess mortality. Then, we define marginal measures of interest as well as contrasts between subgroups. Following, we describe contrasts within subsets of the population, including the avoidable deaths. Finally, we provide a summary of the methods.

Introducing the illustrative example

Data, made available by Public Health England, includes patients diagnosed with colon cancer in 2008 in England, with follow-up to the end of 2013 and information on gender, age at diagnosis and deprivation status. There were five deprivation groups, derived from national quintiles of the income domain of the area of patients’ residence at diagnosis., For simplicity we only included the least and most deprived groups, resulting in 7346 patients in total, 55% of whom are in the least deprived group. More details on the study population are available in Table 1.

Table 1.

Number of colon cancer patients (with proportions) for gender and age-groups by deprivation group

	Deprivation group
	Least deprived	Most deprived
Gender
Male	2136 (52.37%)	1772 (52.24%)
Female	1943 (47.63%)	1495 (45.76%)
Age group, years
18–44	102 (2.50%)	116 (3.55%)
45–54	210 (5.15%)	209 (6.40%)
55–64	769 (18.85%)	521 (15.95%)
65–74	1149 (28.17%)	972 (29.75%)
75–84	1332 (32.66%)	1045 (31.99%)
85+	517 (12.67%)	404 (12.37%)

Number of colon cancer patients (with proportions) for gender and age-groups by deprivation group

Excess mortality and relative survival

The underlying all-cause mortality rate of an individual , with covariate pattern , is given as the summation of the expected mortality rates if they did not have the cancer, and their excess mortality due to the cancer, : with denoting the set of all covariates. and denote the covariates for expected and excess mortalities respectively. The expected mortality rate is obtained from available life tables on a comparable population in the general population matched by characteristics such as age, sex, calendar year and deprivation status. The survival analogue of excess mortality is relative survival. The relative survival of an individual, , is defined as their all-cause survival,, divided by their expected survival, . The all-cause survival is thus given as Relative survival accounts for different background mortality rates in patients with different characteristics, without having to rely on the cause of death information and under assumptions is interpreted as net survival., Net survival has the interpretation of the probability of survival in a hypothetical world where the only possible cause of death is the cancer of interest. In order for this interpretation to be valid certain assumptions need to hold. These are (i) appropriate expected mortality rate that represents mortality due to other causes for the cancer population and (ii) the potential times to death from cancer and other causes are conditionally independent. When important variables that affect both cancer and other causes of deaths are not included in the available life tables for the expected mortality rates, then comparability between populations is lost and relative survival cannot be interpreted as net survival. There are several models for relative survival and the following sections in theory can be applied to any of these. We choose flexible parametric survival models, which use splines to model the effect of time and are preferable in our setting as they incorporate time-dependent effects (non-proportional hazards) easily., For the illustrative example, we fitted a flexible parametric model with 5 degrees of freedom for the baseline excess hazard. The model included deprivation status, gender and age. Age was included as a continuous non-linear variable using restricted cubic splines with 3 degrees of freedom. We also included time-dependent effects for age and deprivation. We then derived various predictions based on the fitted model. More details on Stata code are available in Supplementary Appendix A, available as Supplementary data at IJE online.

Marginal measures

Marginal measures provide population summaries with a simple interpretation. In the following subsections, we define the marginal relative survival function and describe how to obtain the marginal all-cause survival and marginal crude probabilities within the relative survival framework. Contrasts between subgroups are described in the section headed ‘Forming contrasts between population groups’.

Marginal relative survival

Let denote the conditional relative survival given covariates . The marginal relative survival is: with the expectation over the marginal distribution of . After fitting a survival model, can be estimated by obtaining predictions of relative survival for each individual in the study population and taking an average of these predictions, where is the number of patients in the population. If interest is on the mortality scale, the standardized net probability of death can be obtained instead by . Standardizing to an external population might also be applied and is particularly common when comparing relative survival across different countries. For instance, the externally age-standardized relative survival is calculated as where is the ratio of the proportion within an age group in the reference population to the corresponding group in the study population. Weights higher than 1 are applied to groups that are underrepresented in the study population compared with the standard population.

Marginal all-cause survival

To quantify survival in the presence of both cancer and other causes, the marginal all-cause survival function can be obtained. The estimand of interest in now defined as: and is estimated by the standardized all-cause survival The standardized all-cause probability of death can also be estimated as .

Marginal crude probabilities of death

Let the crude probability of dying from the cancer of interest by time in the presence of a competing risk of death due to other causes be and the probability of dying of causes other than the cancer of interest in the presence of cancer be The marginal crude probability of dying from cancer is defined as and is estimated by the standardized crude-probability of death due to cancer Similarly, the marginal crude probability of dying of causes other than the cancer of interest is estimated by

Example

The marginal 5-year expected probability of death for a population without colon cancer is 20%. For the colon cancer population, the 5-year standardized net and all-cause probability of death was 46 and 55% respectively (Fig. 1). The net probability of death is lower as it is estimated in a hypothetical world where it is not possible to die from other causes. The all-cause probability of death can also be partitioned to that due to cancer and that due to other causes. The 5-year crude probability of death due to cancer in the presence of other risks was 44% and the crude probability of death due to other causes in the presence of cancer was 11% (Fig. 1).

Figure 1.

(A) Standardized all-cause and net probability of death, with 95% confidence intervals, as well as expected probability of death, and (B) stacked plot for the standardized crude probabilities for cancer and other causes, for the whole study population.

Forming contrasts between population groups

Let’s assume that we want to estimate the effect of exposure, , on the time-to-event outcome, while allowing for confounding . For simplicity, X will be a binary variable with for the exposed patients and for the unexposed. Let be the counterfactual survival function that we would have observed, had everybody in the population been exposed to level . To form contrasts between the exposure groups the difference can be estimated and it has the advantage of being collapsible., The difference between levels and is defined as . The first term is the counterfactual survival function if everyone in our population had and the second term is the counterfactual survival function if everyone had . Contrasts between the counterfactual outcomes can be estimated using the observed outcomes, under some assumptions., These are (i) conditional exchangeability meaning that the outcome and the exposure are independent given covariates, (ii) consistency i.e. an individual’s potential outcome under a specific exposure corresponds to the actual outcome of this person under this exposure level and (iii) positivity so that the probability of being in every level of the exposure group is positive for all levels of Z. Notice that the assumptions are now extended to both outcomes (death due to cancer and death due to other causes). The conditional exchangeability assumption for the other cause mortality can only be achieved by adjusting the available life tables of the general population for sufficient variables (see Discussion for further details).

Relative survival differences

The difference in marginal relative survival functions, comparing and , is defined as and gives the difference in the hypothetical situation where the cancer of interest is the only possible cause of death. It is estimated by Here everyone is first forced to be exposed () and then unexposed (). A key point is that the average over confounders, , is the same when estimating both marginal effects.

All-cause survival differences

The difference in marginal all-cause survival can also be defined by incorporating the expected survival in equation (4): and is estimated by All-cause survival differences move away from the hypothetical world of relative survival and take into account both cancer-related and other-causes-related survival. Equation (5) has also the interpretation of the potential impact of removing all-cause differences between exposed and unexposed. Figure 2 shows the standardized net and all-cause probabilities of death by deprivation. These are standardized over the combined age and sex distribution. The 5-year standardized net probability of death of the least and most deprived group was 43 and 50% respectively. The 5-year standardized all-cause probability of death was 51 and 60% for the least and most deprived. However, such a comparison does not distinguish whether the difference is due to cancer mortality, other cause mortality or both.

Figure 2.

(A) Standardized net probability of death, and (B) standardized all-cause probability of death, for the least and most deprived patients with 95% confidence intervals.

Forming contrasts within subsets of the population

It might also be of interest to estimate the measures and contrasts described earlier, within subsets of the whole population. For instance, the all-cause survival difference in the whole population, defined in equation (5), could also be defined among the exposed: with and denoting the covariates for the exposed, for the expected and relative survival respectively. It can be estimated by standardizing only to patients of the exposed group, , Forming contrasts within subsets of the population is useful when estimating the potential impact of removing differences for groups with worse survival, under hypothetical scenarios.

Eliminating cancer-related differences

In practice, it might be difficult to remove all-cause survival differences as they are the result of complex mechanisms that involve both cancer-related and other cause mortality. A hypothetical scenario under which we eliminate cancer-related differences only may be easier to define. Contrasts of all-cause survival in which we only eliminate cancer-related survival differences between groups can be obtained using relative survival. For example, instead of equation (6), we could vary only relative survival between the two terms: which is estimated as In equation (7), we assume that under the hypothetical scenario, the other cause mortality rate remains unchanged.

Avoidable deaths

The impact of the hypothetical scenario described in the section ‘Eliminating cancer-related differences’ can also be estimated using avoidable deaths. Firstly, we need the predicted number of deaths for the exposed, which is given by multiplying the number of exposed patients diagnosed in a typical calendar year, , with the probability of death: Secondly, we need the expected number of deaths under the hypothetical scenario that is derived by replacing the relative survival of the exposed with that of the unexposed: The avoidable deaths are then estimated by: Each of the terms is estimated by The number of exposed patients in a typical year, may be different from the patients we standardize over, . For instance, can be calculated by the number of exposed patients diagnosed in the most recent year or by the total number of exposed patients divided by the number of years. Equation (8) yields the all-cause avoidable deaths among the exposed and can be partitioned to cancer or other causes deaths. This can be estimated by multiplying the marginal crude probabilities of death by the number of patients, . We estimated the impact on the standardized all-cause probability of death of the most deprived group under a hypothetical scenario of removing differences in relative survival between deprivation groups. To do so, we applied the relative survival of the least deprived, i.e. the most advantaged group, to the most deprived, but kept their expected survival unchanged. In such a scenario the 5-year standardized all-cause probability of death of the most deprived would decrease from 60 to 55% (Fig. 3).

Figure 3.

Standardized all-cause probabilities of death for the most deprived patients before and after the hypothetical scenario of removing differences in relative survival, with 95% confidence intervals.

Standardized all-cause probabilities of death for the most deprived patients before and after the hypothetical scenario of removing differences in relative survival, with 95% confidence intervals. We also estimated the avoidable deaths for the most deprived patients under the same scenario. Five years after diagnosis 168 deaths could be avoided out of 3267 patients from the most deprived group diagnosed in 2008 (Fig. 4). In this example, and of equation (9) coincide (3267 patients) but this will not always be the case. Fig. 5 breaks down the all-cause avoidable deaths to cancer and other causes deaths. Even though the cancer avoidable deaths increase and finally stay constant with time, the all-cause avoidable deaths will decrease after the initial increase, as some patients that would die from the cancer will now die of other causes. That is why we observe an increase in other cause deaths.

Figure 4.

All-cause avoidable deaths under the hypothetical scenario of removing differences in relative survival between deprivation groups, with 95% confidence intervals.

Figure 5.

All-cause avoidable deaths partitioned to avoidable deaths due to cancer and increase in deaths due to other causes.

All-cause avoidable deaths under the hypothetical scenario of removing differences in relative survival between deprivation groups, with 95% confidence intervals. All-cause avoidable deaths partitioned to avoidable deaths due to cancer and increase in deaths due to other causes.

Discussion

We outlined marginal measures in a relative survival framework that can summarize the prognosis of a population. We also defined contrasts of these measures between subgroups of the population that under assumptions can be interpreted as causal effects. Most of these methods are used in practice, however in this paper we are formalizing them into a causal framework and providing software for their estimation. We also defined marginal crude probabilities as an additional useful measure. Marginal estimates were estimated using regression standardization. An advantage of these measures is that even after fitting a complex statistical model with interactions and time-dependent effects, a single number can be used to summarize the exposure effect at a given time point. In order to relate the counterfactual and the observed outcomes, some assumptions need to hold:, conditional exchangeability, consistency and positivity. These have similar interpretation to that of an all-cause setting but here they are extended for both competing outcomes. Another assumption is that of well-defined interventions that would allow us to compute the causal effect in an ideal randomized experiment. As we evaluated the impact of an intervention that aims to eliminate deprivation differences one could argue that this is an ambiguous causal question. However, as others have argued, understanding the magnitude of disparities across deprivation groups in a formalized causal framework gives a firm basis to further unpick the reasons for the differences, even if the ideal randomized experiment would be difficult to precisely define., Our approaches can be extended to a mediation analysis setting to quantify measurable aspects that drive these inequalities, which will form part of future work. In addition to standard causal inference assumptions, assumptions relevant to the relative survival need to hold. Information on expected mortality rates should be appropriate for the cancer population. Previous studies have assessed potential bias from including cancer patients in the general population. Bias was found to be negligible for individual cancers. Another assumption requires that the competing risks are conditionally independent. This means that there are no other factors to affect both competing events than the factors we have adjusted for. An example would be a strong effect of comorbidity that is likely to affect both cancer and other cause mortality rates. As the ability to adjust for confounders for other cause mortality depends on the available life tables, methods have been developed to do so when that information is not available. We appreciate that the estimates can only be interpreted as causal if this assumption is valid but, in principle, life tables can be constructed for any number of risk factors if there is available data to do so. The interpretation of net survival in a hypothetical world where the only possible cause of death is the cancer of interest has received some criticism. Some proponents have argued that one should always ‘stick to this world’ for quantities of interest. Historically, relative survival has been used to account for differential mortality of competing events in population-based cancer survival., A hypothetical scenario of eliminating all-cause survival differences between exposures may not be straightforward in practice as many factors, which relate both to cancer and other causes, account for the differences. Relative survival allows to focus on cancer-related differences that may be easier to identify. Furthermore, we develop approaches for equalizing the excess mortality across population groups and then convert to ‘real’ world probabilities in the avoidable deaths measures we propose. Avoidable deaths can be obtained to estimate the impact of eliminating survival differences between subgroups., The avoidable deaths depend on both the survival differences and the number of patients diagnosed with cancer and has the interpretation of postponable deaths as eventually all deaths will be observed. However, it helps to quantify the impact of removing survival inequalities for public health stakeholders. We have demonstrated how to obtain marginal measures for the whole population or specific subsets as well as causal effects between exposure groups using the relative survival framework. Future work will focus on methods that will allow further investigation of observed differences. The relative survival framework could also be applied alongside mediation analysis to investigate the impact of potential mediators on the relationship between exposure and outcome. Click here for additional data file.

35 in total

1. Non-parametric comparison of relative versus cause-specific survival in Surveillance, Epidemiology and End Results (SEER) programme breast cancer patients.

Authors: J W Gamel; R L Vogel
Journal: Stat Methods Med Res Date: 2001-10 Impact factor: 3.021

2. A definition of causal effect for epidemiological research.

Authors: M A Hernán
Journal: J Epidemiol Community Health Date: 2004-04 Impact factor: 3.710

3. Estimating the crude probability of death due to cancer and other causes using relative survival models.

Authors: P C Lambert; P W Dickman; C P Nelson; P Royston
Journal: Stat Med Date: 2010-03-30 Impact factor: 2.373

4. Using pseudo-observations for estimation in relative survival.

Authors: Klemen Pavlič; Maja Pohar Perme
Journal: Biostatistics Date: 2019-07-01 Impact factor: 5.899

5. Estimating expected survival probabilities for relative survival analysis--exploring the impact of including cancer patient mortality in the calculations.

Authors: Mats Talbäck; Paul W Dickman
Journal: Eur J Cancer Date: 2011-09-15 Impact factor: 9.162

6. Cumulative cause-specific mortality for cancer patients in the presence of other causes: a crude analogue of relative survival.

Authors: K A Cronin; E J Feuer
Journal: Stat Med Date: 2000-07-15 Impact factor: 2.373

7. On crude and age-adjusted relative survival rates.

Authors: Hermann Brenner; Timo Hakulinen
Journal: J Clin Epidemiol Date: 2003-12 Impact factor: 6.437

8. The hazards of hazard ratios.

Authors: Miguel A Hernán
Journal: Epidemiology Date: 2010-01 Impact factor: 4.822

9. Standard cancer patient population for age standardising survival ratios.

Authors: Isabella Corazziari; Mike Quinn; Riccardo Capocaccia
Journal: Eur J Cancer Date: 2004-10 Impact factor: 9.162

10. The impact of life tables adjusted for smoking on the socio-economic difference in net survival for laryngeal and lung cancer.

Authors: L Ellis; M P Coleman; B Rachet
Journal: Br J Cancer Date: 2014-05-22 Impact factor: 7.640

7 in total

1. Understanding disparities in cancer prognosis: An extension of mediation analysis to the relative survival framework.

Authors: Elisavet Syriopoulou; Mark J Rutherford; Paul C Lambert
Journal: Biom J Date: 2020-12-14 Impact factor: 2.207

2. Direct modelling of age standardized marginal relative survival through incorporation of time-dependent weights.

Authors: Paul C Lambert; Elisavet Syriopoulou; Mark R Rutherford
Journal: BMC Med Res Methodol Date: 2021-04-24 Impact factor: 4.615

3. Assessing lead time bias due to mammography screening on estimates of loss in life expectancy.

Authors: Elisavet Syriopoulou; Alessandro Gasparini; Keith Humphreys; Therese M-L Andersson
Journal: Breast Cancer Res Date: 2022-02-23 Impact factor: 6.466

4. Bayesian variable selection and survival modeling: assessing the Most important comorbidities that impact lung and colorectal cancer survival in Spain.

Authors: Francisco Javier Rubio; Danilo Alvares; Daniel Redondo-Sanchez; Rafael Marcos-Gragera; María-José Sánchez; Miguel Angel Luque-Fernandez
Journal: BMC Med Res Methodol Date: 2022-04-03 Impact factor: 4.615

5. Assessing the impact of including variation in general population mortality on standard errors of relative survival and loss in life expectancy.

Authors: Yuliya Leontyeva; Hannah Bower; Oskar Gauffin; Paul C Lambert; Therese M-L Andersson
Journal: BMC Med Res Methodol Date: 2022-05-02 Impact factor: 4.615

6. Estimating causal effects in the presence of competing events using regression standardisation with the Stata command standsurv.

Authors: Elisavet Syriopoulou; Sarwar I Mozumder; Mark J Rutherford; Paul C Lambert
Journal: BMC Med Res Methodol Date: 2022-08-13 Impact factor: 4.612

7. Quantifying the number of deaths among Aboriginal and Torres Strait Islander cancer patients that could be avoided by removing survival inequalities, Australia 2005-2016.

Authors: Paramita Dasgupta; Gail Garvey; Peter D Baade
Journal: PLoS One Date: 2022-08-26 Impact factor: 3.752

7 in total