Literature DB >> 34796588

Multivariate risk preferences in the quality-adjusted life year model.

Arthur E Attema¹, Jona J Frasch¹, Olivier L'Haridon².

Abstract

The interest in multivariate and higher-order risk preferences has increased. A growing body of literature has demonstrated the relevance and impact of these preferences, but for health the evidence is lacking. We measure multivariate and higher-order risk preferences for quality of life (QoL) and longevity, the two attributes of the Quality-Adjusted Life Year (QALY) model. We observe preferences for a positive correlation between these attributes and for pooling together a fixed loss in one of the attributes and a mean-zero risk in the other, and for pooling together mean-zero risks in QoL and longevity. The findings indicate that higher-order risk preferences are stronger for health than for money. Furthermore, we test if preferences for a risky treatment for a disease affecting only QoL, depend on life expectancy. We find no such a relation, but there is a positive relation between riskiness of a comorbidity affecting life expectancy and risk aversion for a QoL treatment. We therefore observe no definitive deviation from the QALY model, although the model is more robust when expected longevity is high. Our findings suggest that the current practice of cost-effectiveness analysis should be generalized to account for risk aversion in QoL and longevity, and higher-order preferences.

Entities: Chemical

Keywords: QALYs; comorbidities; correlation attitude; prudence; risk apportionment; risk aversion; temperance; treatment intensity

Mesh：

Year: 2021 PMID： 34796588 PMCID： PMC9299505 DOI： 10.1002/hec.4456

Source DB: PubMed Journal: Health Econ ISSN： 1057-9230 Impact factor: 2.395

INTRODUCTION

Health and health care are surrounded by a lot of risk, implying that risk aversion plays a central role in health economics. Recently, several studies have convincingly shown that also some concepts beyond risk aversion, such as prudence (i.e., downside risk aversion or a preference for separating a mean‐zero risk from a fixed loss, equivalent under expected utility to a positive sign of the third derivative of the utility function), are much more important than previously thought (Eeckhoudt & Schlesinger, 2006; Trautmann & van de Kuilen, 2018). These concepts relate to higher moments of a distribution than just variance, such as skewness and kurtosis, and are therefore coined higher‐order risk attitudes. Hence, the necessity to look beyond second‐order risk attitudes has become clear, also in the health care field. This knowledge is important for several reasons. First, it allows to test if the quality‐adjusted life year (QALY) model represents individual health preferences, and hence if QALYs are a proper metric to value health improvements. Related to this, the current conduct of cost‐effectiveness analysis (CEA) is usually to assume the QALY model without allowing for risk aversion for quality of life (QoL) or longevity, or third‐ and fourth‐order risk attitudes. The same holds for the value of a statistical life (VSL) literature, where risk neutrality is typically assumed and the marginal value of a change in survival at a point in time is independent of the baseline survival level (Rosen, 1988). If individuals are instead risk averse in QoL, the cost‐effectiveness threshold and the willingness to pay for marginal gains in QoL would vary with baseline health status (Lakdawalla & Phelps, 2020). Likewise, risk aversion for longevity increases the willingness to pay to avoid early death (Bommier & Villeneuve, 2012), while it can explain the sizable private healthcare expenditures at the end of life (Córdoba & Ripoll, 2017). Second, higher‐order risk attitudes are relevant to many everyday health care decisions, such as risky treatment choices to combat a disease in the face of comorbidities. It is well known that many people suffer from two or more diseases at the same time (MacMahon, 2018), which may influence their preferences for treating their primary disease. Courbage and Rey (2006) pointed out that the level of prudence is a main determinant of the optimal level of prevention for health risks, and Pauker (2014) advocated higher‐order risk attitudes as a research topic that should receive priority on the research agenda in the domain of medical decision making. Moreover, Bleichrodt, Crainich, and Eeckhoudt (2003) have shown the importance of higher‐order risk attitudes in treatment decisions in the presence of comorbidities influencing life expectancy. They demonstrated that economic evaluations and medical decision analyses that ignore comorbidities will lead to recommendations that are biased in the direction of too much treatment if aversion to health status risks increases with life expectancy. They also derived several predictions regarding treatment decisions under particular assumptions, but so far these predictions had not yet been tested empirically. In addition, Eeckhoudt et al. (2007) showed how investment in tertiary preventive care (i.e., the treatment of an established or chronic disease in order to minimize the negative health consequences of the disease) depends on cross‐prudence of health and income, that is it depends on whether an individual prefers to disaggregate a zero‐mean income risk and a fixed health reduction, or equivalently has a positive third cross‐derivative of income with respect to health. Krieger and Mayrhofer (2012) have explored higher‐order risk attitudes in a health context empirically and observed both risk aversion and prudence. However, they only studied univariate risk attitudes and no multivariate risk attitudes, whereas in many settings a decision maker actually faces more than one attribute (Keeney & Raiffa, 1993). Eeckhoudt et al. (2007) and Ebert and van de Kuilen (2015) have stressed the importance of multi‐attribute decision making, given the high prevalence of decisions where more than one attribute is involved. In the health domain, for instance, the widely used QALY model, which is the recommended metric to be used in health economic evaluations (Sanders et al., 2016), involves the attributes longevity and QoL. In case of two attributes, correlation aversion means that an individual prefers a 50% chance of a loss in one attribute and a 50% chance of a loss in the other attribute over a 50–50 gamble offering a loss in neither attribute or a loss in both (Eeckhoudt et al., 2007). An example of correlation aversion in health is when a patient prefers a lottery where he will get either a lower QoL (50% chance) or a shorter life expectancy (50% chance) over a lottery where he has a 50% chance to get both a health deterioration and a lower life expectancy at the same time, and 50% chance to get no health losses at all. Bleichrodt, Crainich, and Eeckhoudt (2003) showed that various consequences of the QALY model can be tested by obtaining knowledge about higher‐order (cross‐) derivatives of the utility function for longevity and QoL. One of their predictions was that people are risk averse for both longevity and QoL, which are both established theoretical predictions (Lakdawalla & Phelps, 2020; Miyamoto & Eraker, 1988) that have been empirically confirmed in several studies (Attema et al., 2012, 2013, 2016; Bleichrodt & Pinto, 2005; Rouyard et al., 2018; Schosser et al., 2016; Wakker & Deneffe, 1996). Another prediction they made is correlation seeking for the combination of these two attributes. That is, people would prefer to combine a bad [good] health state with a short [long] life duration over mixing these two. The risk apportionment technique allows us to test these predictions. Bleichrodt, Crainich, and Eeckhoudt (2003) also showed that, according to the QALY model, risk aversion for QoL should not depend on having a comorbidity that only affects longevity. In addition, they predicted that decreases in the riskiness of longevity caused by this comorbidity will generally lead to more treatment‐prone behavior (i.e., people get less risk averse for QoL). Finally, Bleichrodt, Crainich, and Eeckhoudt (2003) derived how risk aversion, and hence treatment intensity, depend on higher‐order multivariate risk preferences (i.e., risk aversion, correlation aversion, cross‐prudence, and cross‐temperance – a preference for disaggregating a zero‐mean longevity risk and a zero‐mean QoL risk). Attema et al. (2019) recently applied the risk apportionment technique to the health field, when they measured multivariate risk preferences, up to the fourth order, for longevity and wealth. They reported substantial risk aversion and correlation aversion for gains, but the opposite was found for losses. Furthermore, they observed less substantial amounts of prudence and temperance, but still significantly more than 50%. However, that study only investigated the duration component of the QALY model and hence could not test all the propositions from Bleichrodt, Crainich, and Eeckhoudt (2003). In fact, to the best of our knowledge, no assessments of (cross‐)prudence and (cross‐)temperance are available yet for QoL. In this paper we are the first to empirically study several higher‐order properties of the QALY model. This design enables us to test the theoretical predictions put forward by Bleichrodt, Crainich, and Eeckhoudt (2003). In a nutshell, we combine an implementation of the risk apportionment technique with a treatment intensity task in a lab experiment, in which we measure risk aversion for QoL for different life durations. First, we obtain evidence on individuals' correlation attitude between longevity and QoL. Second, we elicit their third‐ and fourth‐order multivariate risk attitudes, that is, cross‐prudence and cross‐temperance. Finally, we measure preferred treatment intensity for treating a disease affecting only QoL for patients also suffering from a comorbidity which affects longevity. Here, a higher treatment intensity increases the spread in the potential QoL outcomes. The latter measure enables us to test several theoretical predictions based on the QALY model as suggested by Bleichrodt, Crainich, and Eeckhoudt (2003). Our results show that subjects have marked risk preferences for longevity and QoL. First, we find a lot of risk aversion for both attributes, confirming most theoretical models. Second, we confirm Bleichrodt, Crainich, and Eeckhoudt's (2003) prediction of correlation seeking, with an overwhelming majority of subjects showing this preference. Furthermore, in contrast to most studies using monetary outcomes, we also find highly significant evidence for cross‐imprudence and cross‐intemperance. However, we observe no systematic correlation between treatment intensity and duration. Finally, we observe a marginally significant relation between treatment intensity and riskiness of life duration, in agreement with the intuition of Bleichrodt, Crainich, and Eeckhoudt (2003).

METHOD

We assume preferences satisfy a weak‐order, that is they are complete and transitive. Individuals care about QoL (q) and longevity (t). According to the QALY model, preferences for chronic health states are evaluated by: If expected utility holds, a subject is risk averse for QoL if and risk averse for longevity if . Prudence for QoL holds if , prudence for longevity implies and temperance holds if for QoL and for longevity. Concerning multivariate risk preferences, a subject is correlation averse if , cross‐prudent for longevity if , cross‐prudent for QoL if , and cross‐temperate if . Opposite signs define correlation seeking, cross‐imprudence and cross‐intemperance, respectively. Throughout this paper, we only consider health states better than dead, that is, we assume utility is increasing in life duration: . The general QALY model of Eq. (1) does not give any prediction about univariate ( , . , , ) or multivariate () risk preferences For instance, in addition to (i.e., correlation seeking), we have (i.e., cross‐imprudence for longevity) in case of risk aversion for QoL, and (i.e., cross‐imprudence for QoL) in case of risk aversion for longevity and finally, if the decision maker is risk averse in both longevity and QoL. The linear QALY model, , which is often applied in economic evaluations, provides more specific predictions. The linear QALY model implies that = 0, that is, people are risk neutral with regard to longevity. From this it follows that , , , and are also 0 for the linear QALY model, while if , and in case of risk aversion for QoL. Eeckhoudt and Schlesinger (2006) were the first to operationalize (higher‐order) risk preferences in terms of choices between two binary lotteries with equally likely outcomes that distribute harms and benefits differently, as illustrated below. An example of an item revealing risk aversion for QoL is the following (Table 1):

TABLE 1

Question to test for risk aversion for QoL

What is your most preferred alternative?
Option A	Option B
50%: Live with 40% of full health for 40 years	50%: Live with 30% of full health for 40 years
50%: Live with 50% of full health for 40 years	50%: Live with 60% of full health for 40 years

Note: Bold text shows the answer revealing risk aversion for QoL.

Abbreviation: QoL, quality of life.

Question to test for risk aversion for QoL Note: Bold text shows the answer revealing risk aversion for QoL. Abbreviation: QoL, quality of life. Here, the risk averse individual would choose Option A, because it offers the same expected QoL as Option B (i.e., 45%), but with a lower spread. In fact, Option B is a mean‐preserving spread of Option A. the general idea of the risk apportionment method is to have these kinds of choices between two‐outcome gambles, with one resulting from the other from a mean‐preserving spread. Similarly, risk aversion for longevity could be determined by gambles such as the following (Table 2):

TABLE 2

Question to test for risk aversion for longevity

What is your most preferred alternative?
Option A	Option B
50%: Live with 60% of full health for 40 years	50%: Live with 60% of full health for 30 years
50%: Live with 60% of full health for 40 years	50%: Live with 60% of full health for 50 years

Note: Bold text shows the answer revealing risk aversion for longevity.

Question to test for risk aversion for longevity Note: Bold text shows the answer revealing risk aversion for longevity. In this example, Option A is riskless and Option B involves a mean‐preserving spread of the same longevity. The risk apportionment method also allows for eliciting higher‐order risk attitudes by adding different sources of uncertainty. For example, prudence for longevity can be elicited by the following choice (Table 3):

TABLE 3

Question to test for prudence for longevity

What is your most preferred alternative?
Option A	Option B
50%: Live with 60% of full health for 40 years	50%: Live with 60% of full health for 30 years OR 50 years
50%: Live with 60% of full health for 10 OR 30 years	50%: Live with 60% of full health for 20 years

Note: Bold text shows the answer revealing prudence for longevity.

Question to test for prudence for longevity Note: Bold text shows the answer revealing prudence for longevity. In this case, QoL is always 60% and longevity is either 40 or 20 years. The choice involves distributing a zero‐mean longevity risk of ±10 years to the bad longevity outcome (20 years, Option A) or the good longevity outcome (40 years, Option B). The former choice reflects imprudence and the latter choice reflects prudence. Similarly, temperance can be elicited by including two independent longevity or QoL risks and determining if the respondent prefers to aggregate (intemperance) or disaggregate (temperance) these risks. An example is shown in the Appendix. Eeckhoudt et al. (2007) have demonstrated that the risk apportionment method can also be extended to elicit (higher‐order) cross‐risk attitudes when risk in both attributes is involved. For example, consider the following gamble (Table 4):

TABLE 4

Question to test for correlation aversion

What is your most preferred alternative?
Option A	Option B
50%: Live with 60% of full health for 40 years	50%: Live with 60% of full health for 20 years
50%: Live with 30% of full health for 20 years	50%: Live with 30% of full health for 40 years

Note: Bold text shows the answer revealing correlation aversion.

Question to test for correlation aversion Note: Bold text shows the answer revealing correlation aversion. This gamble involves risk in both QoL (30% or 60%) and longevity (20 or 40 years). The essential choice is if one prefers to combine the good outcome for QoL with the good outcome for longevity, while at the same time combining the bad outcomes for both (Option A), or if one prefers to spread the risks and combine the good outcome for the one attribute with the bad outcome for the other attribute (Option B). The former is deemed correlation seeking and the latter correlation aversion. Tests of cross‐prudence and cross‐temperance can be conducted in a similar fashion. The below question could for instance be used for cross‐prudence for longevity (Table 5).

TABLE 5

Question to test for cross‐prudence for longevity

What is your most preferred alternative?
Option A	Option B
50%: Live with 60% of full health for 30 years	50%: Live with 40% OR 80% of full health for 30 years
50%: Live with 40% OR 80% of full health for 40 years	50%: Live with 60% of full health for 40 years

Note: Bold text shows the answer revealing cross‐prudence for longevity.

Question to test for cross‐prudence for longevity Note: Bold text shows the answer revealing cross‐prudence for longevity. Looking closely, we can see that one lives either 30 or 40 more years in both gambles. Furthermore, QoL may be 60% or it may be another gamble, resulting in either 40% or 80%. In effect, a zero‐mean risk on QoL ( ∼ ±20%) has to be apportioned to either the good outcome of the gamble (i.e., t = 40 years, Option A) or the bad outcome of the gamble (i.e., t = 30 years, Option B). Someone who prefers to combine the zero‐mean risk with the good longevity outcome is said to be cross‐prudent for longevity, whilst someone who prefers combining the zero‐mean risk with the bad longevity outcome is called cross‐imprudent for longevity. Tests for cross‐prudence for QoL and (cross‐)temperance can be done similarly, as shown in the Appendix. Embedded in our study is the assumption that, generally, individuals prefer both higher levels of longevity and higher levels of QoL. While this method relies on the assumption that individuals aim to maximize their utility, it does not require assumptions about the functional form of the utility function (Attema et al., 2019). The risk apportionment technique can also be applied to elicit the other traits mentioned above. In order to test the other predictions of Bleichrodt, Crainich, and Eeckhoudt (2003), as described in the introduction, we elicit the sign of several (higher‐order) risk traits. Table 6 gives an overview of all traits we elicited and the associated implications for the utility function in case of EU.

TABLE 6

Overview of elicited traits and their implied EU condition

Trait if prospect 1 is chosen	Prospect 1	Prospect 2	EU condition prospect 1 is chosen
Risk aversion for QoL (q>q2>q1)	(0.5,q−q1;q−q2)	(0.5,q−q1−q2;q)	Uqq≤0
Risk aversion for longevity (t >t2>t1)	(0.5,t−t1;t−t2)	(0.5, t−t1−t2;t)	Utt≤0
Correlation aversion (q>q1,t>t1)	(0.5,t−t1,q−q1;q,t)	(0.5,t,q−q1;q,t−t1)	Uqt≤0
Cross‐prudence for QoL (q>q1,E(t˜)=0)	(0.5,t,q−q1;q,t+t˜)	(0.5,t+t˜,q−q1;t,q)	Uqtt≥0
Cross‐prudence for longevity (t>t1,E(q˜)=0)	(0.5,t−t1,q;q+q˜,t)	(0.5,t−t1,q+q˜;q,t)	Uqqt≥0
Cross‐temperance (E(t˜)=0,E(q˜)=0)	(0.5,t+t˜,q;q+q˜,t)	(0.5,t+t˜,q+q˜;q,t)	Uqqtt≤0

Abbreviation: QoL, quality of life.

Overview of elicited traits and their implied EU condition Abbreviation: QoL, quality of life. In Table 6, Prospect 1 of the first row denotes a prospect where the subject has 50% probability to live with a QoL of for T years, and 50% to live in QoL of for T years. The other prospect of this first row is riskier, since it involves a lower minimum () and a higher maximum (q). The other prospects can be interpreted similarly. For cross‐prudence and cross‐temperance, and , denote zero‐mean risks on longevity and QoL, respectively. In the model of Bleichrodt, Crainich, and Eeckhoudt (2003), patients can choose the intensity n of a treatment combatting a disease. This only affects their QoL q and is risky, since it can either be effective, improving the patient's health by b*n, or it can be detrimental due to side effects, in which case the patient's health will deteriorate by c*n. Hence, the amount of upside and downside potential depends on the treatment intensity chosen by the patient; the higher the intensity, the more extreme the outcomes will be. In this study we test the predictions of the (linear) QALY model, as shown by Bleichrodt, Crainich, and Eeckhoudt (2003), by asking subjects to choose the amount n in this decision context, for different life durations t. For instance, in one of the questions the subject had to choose n such that they would live 20 more years with q , with n measured in percentages, and b = 0.4, c = −0.1; for example, n = 50% would correspond to . Repeating this for several durations t, we could test the correlation with the risk traits from Table 6.

EXPERIMENT

Subjects

Participants were recruited randomly through a faculty internal recruitment system available to all undergraduate business students at the Rotterdam School of Management. As an incentive for taking part, participants were awarded with course credits. On arrival at the laboratory, a maximum of four students completed the procedure in the same room. A total of 124 students took part in the study. For two subjects, a program failure occurred during data collection. One student re‐contacted us, asking to be excluded from the study because he had not answered faithfully. Therefore, a total of three cases were excluded from the study. The final sample size was N = 121 (51.2% female). The average age of participants was 20.1 years (SD = 1.44). n = 19 participants reported a physical health condition (16.0%), and n = 7 a mental health condition (5.8%), and the average self‐reported QoL on the visual analog scale ranging from 0 (death) to 100 (best possible health) was 83.48 (SD = 9.57). The average BMI was 21.52 (SD = 2.26), and n = 13 participants were considered underweight (10.7%), while n = 9 were considered overweight (7.4%).

Procedure

Subjects were first asked to provide their informed consent and signed a form of solemn commitment. Signing such a solemn commitment has been shown to increase diligent responding (Jacquemet et al., 2018, 2019). Subsequently, subjects received instructions to complete a part eliciting their risk attitudes and treatment proneness and completed 5 practice questions (1 for risk aversion with respect to QoL, 1 for correlation attitude, 1 for cross‐prudence, 1 for cross‐temperance, and 1 for treatment intensity). The order of the tasks was randomized. Within each trait, questions were not interspersed to avoid subjects having to switch between tasks continuously. Within each part, the questions were randomized. At the end of this part, four questions were repeated in order to test consistency (one for question on correlation attitude, one one cross‐prudence for longevity, one on risk aversion for longevity and one for treatment intensity). The experiment was programmed in Matlab. A researcher was in the room with the participants during all sessions.

Stimuli

For all tasks, we took a QoL level of q = 60% of full health to be the base QoL. For longevity, this base was t = 40 life years. As a result, risk aversion for QoL was elicited by fixing longevity at 40 years while varying the variance of QoL. Likewise, risk aversion for longevity was assessed by fixing QoL at 60% while varying the variance of longevity between the options. A similar procedure was used for the other traits. Table 7 shows the stimuli for all traits.

TABLE 7

Stimuli for the risk apportionment tasks

Task ^a	Trait	Prospect A	Prospect B
1	Risk aversion for QoL	[(60%−10%,40y); (60%−40%,40y)]	[(60%,40y); (60%−50%,40y)]
2		[(60%−10%,40y); (60%−20%,40y)]	[(60%,40y); (60%−30%,40y)]
3		[(60%−20%,40y); (60%−20%,40y)]	[(60%,40y); (60%−40%,40y)]
4	Risk aversion for longevity	[(60%,40y−10y); (60%,40y−20y)]	[(60%,40y−30y); (60%,40y)]
5		[(60%,40y−10y); (60%,40y−10y)]	[(60%,40y−20y); (60%,40y)]
6*		[(60%,40y−5y); (60%,40y−10y)]	[(60%,40y−15y); (60%,40y)]
7	Correlation attitude	[(60%−40%,40y); (60%,40y−10y)]	[(60%−40%,40y−10y); (60%,40y)]
8		[(60%−20%,40y); (60%,40y−20y)]	[(60%−20%,40y−20y); (60%,40y)]
9*		[(60%−20%,40y); (60%,40y−10y)]	[(60%−20%,40y−10y); (60%,40y)]
10	Cross‐prudence for longevity	[(60%,40y−20y); (60%±20%,40y)]	[(60%±20%,40y−20y); (60%,40y)]
11		[(60%,40y−10y); (60%±40%,40y)]	[(60%±40%,40y−10y); (60%,40y)]
12		[(60%,40y−10y); (60%±20%,40y)]	[(60%±20%,40y−10y); (60%,40y)]
13	Cross‐prudence for QoL	[(60%−20%,40y); (60%,40y±20y)]	[(60%−20%,40y±20y);(60%,40y)]
14*		[(60%−20%,40y); (60%,40y±10y)]	[(60%−20%,40y±10y);(60%,40y)]
15		[(60%−40%,40y); (60%,40y±10y)]	[(60%−40%,40y±10y);(60%,40y)]
16	Cross‐temperance	[(60%±20%,40y); (60%,40y±20y)]	[(60%±20%,40y±20y); (60%,40y)]
17		[(60%±40%,40y); (60%,40y±10y)]	[(60%±40%,40y±10y); (60%,40y)]
18		[(60%±20%,40y); (60%,40y±10y)]	[(60%±20%,40y±10y); (60%,40y)]

Abbreviation: QoL, quality of life.

An asterisk indicates that the choice task was repeated once as a consistency check.

Stimuli for the risk apportionment tasks Abbreviation: QoL, quality of life. An asterisk indicates that the choice task was repeated once as a consistency check.

Treatment intensity

Treatment proneness was operationalized as the preferred treatment intensity. Here, participants were presented with a singular 50–50 lottery, in which each outcome represented a QoL index q for a given duration of life t. At baseline (intensity of 0%, i.e., no treatment taken), the two lotteries were identical. The life duration was always exogenous; that is, the subject could not influence the life duration. The life duration was equal for both lottery outcomes, and it was either certain or associated with uncertainty. The former case represents the situation in which the comorbidity caused a known reduction in life duration, whilst in the second case the comorbidity caused a riskier life duration. The subject could, however, influence the expected QoL by choosing a preferred treatment intensity n, represented as a percentage ranging from 0 to 100, which subjects could choose from in steps of 2%. The treatment is associated with either benefits b (associated with one lottery outcome) or costs c (associated with the other lottery outcome). The size of the benefits and costs depends on the treatment intensity n. The higher the treatment intensity, the higher the potential benefits as well as the potential costs. We picked a ratio of b/c = 4, and used three questions with a fixed duration (20, 30 and 40 years), and one question with a random duration of either t = 10 or t = 30 years, equally likely. An overview of the stimuli is provided in Table 8. Screenshots of this task are shown in the Web Appendix.

TABLE 8

Stimuli for the treatment intensity task

	Task 1	Task 2	Task 3	Task 4 ^a	Task 5
Prospect in case of intensity 0%	(60%,20y±10y)	(60%,20y)	(60%,30y)	(60%,40y)	(60%,40y±10y)
Prospect in case of intensity 100%	(50%or100%,20y±10y)	(50%or100%,20y)	(50%or100%,30y)	(50%or100%,40y)	(50%or100%,40y±10y)

Repeated at the end.

Stimuli for the treatment intensity task Repeated at the end.

Analysis

Data analysis was performed in R. We used the number of choices (out of 3) that are compatible with a given risk trait as our measurement of the strength of multi‐ and univariate risk preferences. In our analysis, a subject is classified according to a risk trait if the majority of her choices is consistent with that particular trait. Thus, for example, an individual is classified as being risk averse (seeking) if most of her choices are compatible with risk aversion (seeking). For each of these traits, we investigated whether people show a given risk preference or behave at random based on a chi‐square test. At the aggregate level, we report the average percentage of choices over tasks compatible with each trait. We use Fisher exact tests to compare the classifications obtained for each trait. To assess the relation between the higher‐order risk preferences and treatment intensity, we used repeated‐measure ANOVAs and Friedman tests. We also used Wilcoxon and Student t‐tests for complementary analysis. Bleichrodt, Crainich, and Eeckhoudt (2003) show under which conditions of the higher‐order derivations, treatment intensity varies with duration. They show that an increase (decrease) in treatment intensity with duration is predicted by a decrease (increase) in risk aversion to health status with duration. In addition, when treatment intensity increases (decreases) with duration the sign of the following ratio is positive (negative): Bleichrodt, Crainich, and Eeckhoudt (2003) show that r corresponds to the responsiveness of (normalized) correlation aversion to changes in health status. Because the denominator of the fraction in Equation (2) is always positive, its sign depends on the sign of the numerator, which gives an unambiguous sign only if particular combinations of higher‐order risk traits are satisfied. For example, if a participant is cross‐prudent for longevity, risk averse for QoL and correlation seeking we know that the fraction is positive, whilst it is negative for a participant who is risk averse for QoL, cross‐imprudent for longevity and correlation averse. Instead, in case of cross‐prudence for longevity, risk aversion and correlation aversion, we cannot make a prediction for the sign of the fraction without knowing the degrees of the higher‐order derivatives (the degrees of correlation aversion, cross‐prudence and risk aversion for QoL). We test if our data generate an unambiguous sign by computing Equation (2) using the signs of the median traits, as well as computing the sign of Equation (2) for each participant separately.

RESULTS

Consistency checks

To assess whether participants were consistent in their answers, four items were included twice in the experiment, measuring risk aversion for duration, correlation aversion, cross‐prudence for QoL, and treatment intensity. For binary choices, subjects made the same choice in 75.38 percent of the repeated choices. This rate is consistent with the usually observed consistency rates in experiments (Attema et al., 2019; Stott, 2006). We also found some variability in consistency between the different tasks. For the treatment intensity choices, subjects made the same choice in 41.32 percent of the repeated choices. This percentage can be considered relatively low. Allowing for an error margin of 5 percentage points, the consistency rate increases to 53.72 percent. For an error margin of 10 percentage points, it raises to 67.77 percent.

Risk preferences

Table 9 shows the results on risk preferences. The first two columns show the aggregate results: the mean proportion of the three choices compatible with each trait and the associated standard deviation. The last two columns show the individual results. The third column corresponds to the classification of individuals, based on their risk preferences, and the fourth shows the p‐value of a one‐sided binomial test for comparison between the percentage of individuals and 50 percent. Table A4 in the Appendix gives the percentages of choices compatible with risk apportionment for each binary choice task.

TABLE 9

Risk preferences: aggregate results and individual classification

	Aggregate results		Individual classification
	Mean	Standard deviation	Proportion	p‐value
Risk aversion, quality of life	66.39	9.10	67.77	<0.01
Risk aversion, longevity	74.38	5.03	79.34	<0.01
Correlation aversion	10.19	2.66	4.13	<0.01
Cross‐prudence for quality of life	36.64	3.73	32.23	<0.01
Cross‐prudence for longevity	27.27	7.06	23.14	<0.01
Cross‐temperance	39.94	5.63	33.06	<0.01

Risk preferences: aggregate results and individual classification At the aggregate level, we performed a series of chi‐squared tests to check whether the observed distribution of preferences deviated from the distribution that would be observed if subjects choose randomly. All tests show that choices were not made at random. We found risk aversion to be the predominant pattern for both longevity and QoL, with a large majority of the choices compatible with risk aversion in both cases. Figure 1 illustrates this point and shows the distribution of the number of risk averse choices for QoL and for longevity. Figure 1 also shows the expected number of risk averse choices if participants chose at random.

FIGURE 1

Distribution of the number of risk averse choices for quality of life and for longevity

Distribution of the number of risk averse choices for quality of life and for longevity Overall, 58.68 percent of individuals were classified as both risk averse for longevity and for QoL. The association between risk attitudes for longevity and QoL was highly significant (Fisher test, p‐value 0.007). We found a clear choice pattern indicative of a preference for correlation seeking for longevity and QoL with more than 90 percent of the choices compatible with correlation seeking. Figure 2 shows the distribution of correlation averse choices for QoL and for longevity together with the distribution of risk averse individuals for QoL (panel (a)) and for longevity (panel (b)). Under expected utility, this pattern of preference suggests that the cross‐derivative of the utility function is positive for most individuals. Using classifications at the individual level, we found no evidence for an association between correlation attitudes and risk attitudes for neither QoL nor longevity. Due to the large majority of individuals being classified as correlation seeking (95.87 percent), this result is hardly surprising.

FIGURE 2

Correlation aversion and risk aversion for quality of life and longevity, distribution of the number of correlation averse choices

Correlation aversion and risk aversion for quality of life and longevity, distribution of the number of correlation averse choices Table 9 also shows the classification of individuals depending on their cross‐prudent choices. A majority of individuals were classified as cross‐imprudent for QoL (67.77 percent of individuals) and for longevity (76.86 percent of individuals). At the individual level, 9.09 percent of individuals were classified as cross‐prudent in both attributes and 53.72 percent as cross‐imprudent for both attributes. We found no significant association between those risk preferences, and risk and correlation aversion. Under expected utility, the pattern of preferences revealed in Table 9 suggests that the cross‐derivatives of the utility function and were negative for most individuals. Last, we found evidence for cross‐intemperance with a majority of subjects choosing compatible with this trait. We found an association between cross‐temperance and cross‐prudence for QoL (Fisher test, p‐value 0.014) but not for longevity (Fisher test, p‐value 0.82). The combination of correlation seeking, cross‐imprudence and cross‐intemperance corresponded to the modal multivariate risk preference when both cross‐prudence for QoL and longevity were considered (for 48.76 percent and 52.07 percent of subjects, respectively).

Choice of treatment intensity with certain longevity

Table 10 shows the descriptive statistics on the choice of treatment intensity. On average, subjects chose a treatment intensity of 60%. The values for the third quartile show that a significant number of individuals chose the maximum treatment intensity in any treatment.

TABLE 10

Descriptive statistics on the choice of treatment intensity

	Certain longevity			Risky longevity
	T = 20	T = 30	T = 40	T = 20 + 10/−10	T = 40 + 10/−10
Median	74.00	61.00	60.00	60.00	58.00
Q1	36.00	39.00	32.00	30.00	38.00
Q3	99.50	99.50	99.50	99.50	99.50
Mean	64.41	63.12	62.14	58.32	59.74
SD	35.42	32.07	34.46	36.13	33.64

Descriptive statistics on the choice of treatment intensity The median values reported in the first three columns of Table 10 suggest that treatment intensity decreases with longevity, while the means suggest a flat pattern instead. In order to test the association between treatment intensity and longevity, we ran a repeated‐measure ANOVA with longevity as the within‐subject factor. In accordance with the mean values from Table 10, the results show that treatment intensity does not differ between the three tasks (p‐value 0.72). A Friedman test shows however a marginally significant difference between the median values (p‐value 0.08). Pairwise comparisons based on Wilcoxon or Student t‐test support the results from the ANOVA. Table 11 shows a classification of individuals based on the relation between longevity and treatment intensity. We used two rules to classify subjects. The strict rule classifies individuals as having a constant (increasing, decreasing) profile if they reported the same exact (increasing, decreasing) treatment intensity for the three longevities . We also used a more lenient rule allowing for a deviation of 5 percentage points in first‐order differences. Subjects who were classified as neither constant nor increasing or decreasing were classified as exhibiting a non‐monotone profile. Individual analysis from Table 11 shows that between 1/4 and 1/3 of the subjects chose constant treatment intensities for different longevities, a majority of them choosing extreme (0 and 100 percent) treatment intensities. For around 1/4 of the subjects, treatment intensity decreases with longevity and for around 1/6 of the subjects, treatment intensity increases with longevity.

TABLE 11

Classification of individuals depending on the relationship between treatment intensity and longevity

	Strict rule	With 5 pp. error
Constant	32	40
Constant with extreme choices	27	27
Decreasing	32	33
Increasing	21	23
Non‐monotone	36	25

Classification of individuals depending on the relationship between treatment intensity and longevity We now use the classification of individuals based on their choice apportionment in binary choice to evaluate the prediction of the expected utility model. More specifically, we use the individual classification of risk aversion for QoL, correlation aversion and cross‐prudence for longevity to infer the sign of r defined in Equation (2). Remember that r measures the responsiveness of (normalized) correlation aversion to change in health status and is the key behavioral parameter governing the response of treatment intensity to duration. For 60.33 percent of the individuals, the information gathered from binary choices did not allow to have a clear prediction on the sign of . 25.62 percent were classified as revealing a negative . The remaining 14.05 percent were classified as revealing a positive . Because the test of expected utility is based on the sign of , the risk apportionment technique does not allow to make firm predictions for a majority of subjects in our experiment. Figure 3 shows the distribution of treatment intensities at different longevities, based on the revealed sign of . A visual inspection of Figure 3 shows that treatment intensities tend to decrease for participants revealing a positive and a non‐monotone pattern for those revealing a negative . An ANOVA with repeated‐measures for subjects with a revealed negative cannot reject constancy of treatment intensity (p‐value 0.45). The same applies for the mixed case but also for positive . In accordance with Figure 3, for the two latter classifications, a Friedman test nevertheless shows marginally significant differences (p‐value of the Friedman test 0.08 and 0.05).

FIGURE 3

Relations between treatment intensity and the sign of responsiveness of normalized correlation attitude to changes in health status r

Choice of treatment intensity with risky longevity

The values reported in Table 10 suggest that treatment intensity decreases when a risk on longevity is introduced. In order to test for the impact of risky longevity on treatment intensity, we ran a repeated‐measure ANOVA, with two within‐subject factors (certain vs. risky longevity and expected longevity equal to either or ). The results from the ANOVA show that treatment intensity does not vary with longevity (p‐value 0.89) and that riskiness of longevity has only a marginal impact on treatment intensity (p‐value 0.06). Pairwise comparisons based on Wilcoxon or Student t‐tests support the results from the ANOVA: the differences between treatment intensities at certain and risky longevity are not significantly different at expected longevity equal to years, but are marginally different at expected longevity equal to years (Wilcoxon two‐sided test, p‐value 0.07, Student two‐sided t‐test, p‐value 0.07). One‐sided interpretations of pairwise comparisons therefore show evidence for decreasing treatment intensity with riskiness of longevity, at least when expected longevity is low. Last, the base value of treatment intensity (t = 20 or t = 40) does not impact treatment intensity when duration is risky (Wilcoxon two‐sided test, p‐value 0.49, Student two‐sided t‐test, p‐value 0.61). Bleichrodt, Crainich, and Eeckhoudt (2003) show that risk aversion alone is not sufficient to predict the reaction of the introduction of a risky longevity in the choice of treatment intensity. In particular, they show that it is far from obvious that the riskiness of longevity leads, through risk aversion, to a decrease in treatment intensity. We tested this hypothesis by comparing the differences between risky and certain longevity (for expected longevity equal to either t = 20 or t = 40) for risk averse and risk seeking subjects. Results are shown in Figure 4, which makes it clear that risk aversion, per se, is not clearly associated with a systematic drop in treatment intensity when longevity is risky. Figure 4 also shows that risk seeking does not translate in a systematic increase in treatment intensity when risky longevity is introduced. A repeated‐measure ANOVA, with one within‐subject factor (expected longevity equal to either t = 20 or t = 40) and one between‐subject factor (risk attitudes), shows that the former has no significant effect on the difference between treatment intensity for risky or certain longevity (p = 0.156). Together, these results confirm Bleichrodt, Crainich, and Eeckhoudt (2003) that there is not a one‐to‐one link between risk aversion for longevity and choice of treatment intensity when longevity becomes risky.

FIGURE 4

Relations between variation in treatment intensity and risk attitudes

DISCUSSION

The study set out with two objectives. First, we aimed to describe people's multivariate and higher‐order risk attitudes for longevity and QoL. Second, we conducted a test of the QALY model by assessing how people's higher‐order risk attitudes were related to their preference for treatment intensity. Our findings for the risk apportionment task confirm the intuitive predictions of Bleichrodt, Crainich, and Eeckhoudt (2003) that people are risk averse and correlation seeking for duration and QoL. Concerning risk aversion, this is a reassuring finding, in accordance with previous evidence (Attema et al., 2013, 2016; Delprat et al., 2016). The finding of correlation seeking on the other hand is particularly interesting given the widespread evidence of correlation aversion for other outcomes (Ebert & van de Kuilen, 2015), although it has been found for the QALY model before (McNeil et al., 1981; Pliskin et al., 1980; Sutherland et al., 1982). Under expected utility, correlation seeking reveals that increasing longevity reinforces the marginal utility of variations (positive or negative) in QoL insofar as individuals benefit from it or experience it longer. This study's results also indicate a clear majority of cross‐imprudent choices, albeit less deviant from neutrality than for the second‐order traits. Lastly, the evidence is, as usual, least pronounced for intemperance, but still we found a significant deviation from 50%. The model of Bleichrodt, Crainich, and Eeckhoudt (2003) neither provides any predictions for the signs of these higher‐order preferences, nor does it give intuitive predictions. Hence, our study provides the first evidence of these higher‐order, multivariate, risk preferences. These findings can have large implications for several health‐related behaviors and open up a new research area. The findings on risk apportionment are also somehow supportive of the QALY model. As shown in Section 2, the QALY model implies correlation seeking and, if risk aversion for QoL and longevity holds, cross‐imprudence and cross‐intemperance. These traits were all found for a majority of our sample. At the same, these results violate the linear QALY model, suggesting that the assumption of risk neutrality with respect to longevity, and the resulting implications for higher‐order multivariate risk preferences are too simplistic. We found no correlation between longevity and treatment intensity (i.e., health status risks). This result is in contrast with the prediction of Bleichrodt, Crainich, and Eeckhoudt (2003), who argued that it would be plausible for aversion to health status risk to increase with life expectancy. The absence of such a relation suggests that (this part of) the QALY model is valid, because it implies comorbidities that only affect life duration indeed have no impact on treatment decisions that only affect QoL. However, we admit that there are several caveats to this conclusion. First, the treatment intensity task may not have been the best way to elicit treatment preferences, which could explain the high amount of noise and the multimodal preferences observed in this task. This may be due to the questions being difficult to answer. Given that our sample encompassed highly educated people, this raises the concern that the task might generate an even higher error margin among a sample representative of the general public. Therefore, we call for future research to explore this issue, and to look for alternative tasks that are easier to comprehend while still capturing these risk preferences. Second, since the theoretical analysis by Bleichrodt, Crainich, and Eeckhoudt (2003) assumes expected utility, our test of the QALY model based on their framework is only valid to the extent that expected utility holds. Otherwise, it may be the case that the observed findings are due to a falsification of expected utility, and that the QALY model would not be valid in a non‐EU framework. It is left to future research to test these properties of the QALY model without the restriction to EU. Still, our results regarding the sign of Eq (2) do not contradict the lack of a correlation between treatment intensity and duration and duration risk, lending some credibility to the test. We report a differential impact of the introduction of a background risk on longevity for different amounts of longevity. For high expected longevity, the background risk did not impact the choice of treatment intensity while it significantly decreased it for lower expected longevity. According to this result, the QALY model, which imposes a neutral impact of the background risk on treatment decisions, appears to be more robust for long durations than for short durations. This result confirms the empirical results from Bleichrodt, Pinto, and Abellán‐Perpinán (2003) and Attema and Brouwer (2008), who showed in a very different experimental setting that standard elicitations of the QALY model (standard gamble or time trade‐off) are more likely to be biased for short durations than for long durations. The findings reported in this paper have profound implications for the conduct of CEA and VSL, two fundamental measurements supporting Health Technology Assessment (HTA). First, most CEA's neglect risk aversion in QoL and assume risk neutrality here. Recently, Lakdawalla and Phelps (2020) showed that assuming risk neutrality put CEA at risk of misrepresenting true individual preferences and misguiding HTA. They demonstrate that risk aversion in QoL implies that the value of improving QoL rises with illness severity or with disability. On the opposite, assuming neutrality for QoL overvalues treatments for minor illnesses. For example, Lakdawalla and Phelps (2020) by calibration show that risk neutrality overvalues treatment for mild illnesses by a factor of 2 to 3. Another implication of introducing risk aversion for QoL is to break the equivalence between health gains in life years and QoL. This has important normative consequences for HTA, since the equivalence between health gains in life years and QoL is a fundamental condition for the cost per QALY rule to allocate medical technology efficiently. Lakdawalla and Phelps (2020) show that incorporating risk aversion and higher‐order risk preferences, such as prudence or temperance, into a generalized risk‐adjusted QALY index allows restoring this equivalence. In light of these analyses, our results, together with previous studies (Attema et al., 2016; Rouyard et al., 2018; Schosser et al., 2016), indicate the urgency of moving beyond risk neutrality over QoL in CEA's. Lakdawalla and Phelps (2020) call for similar theoretical research that incorporates risk aversion over longevity into the microeconomic foundations of cost‐effectiveness. This holds for the VSL literature, where risk neutrality for longevity is typically assumed based on the assumption of additive separability of preferences (Bommier, 2006). Introducing risk aversion for longevity has major implications for modeling VSL (Bommier & Villeneuve, 2012). First, at the individual level, risk aversion increases the willingness to pay to avoid early death. Ignoring risk aversion toward longevity may underestimate the decline of VSL with age computed with standard methods à la Moore and Viscusi (1988). Second, at the social level, assuming risk neutrality for longevity generates a pro‐old bias in the welfare evaluations of mortality risk reductions. Indeed, Bommier and Villeneuve (2012) show that introducing a non‐additive representation of preferences (with a recursive model based on mortality risk aversion) increases the discount rate above the rate of time preference traditionally used in consumption‐smoothing evaluations. In a different setting, based on Epstein‐Zin preferences , Córdoba and Ripoll (2017) show that relaxing expected utility and the additive separability of preferences directly links the marginal valuation of survival to the coefficient of mortality risk aversion, over and above the intertemporal rate of substitution. In sum, if CEA's and VSL's keep on sticking to the assumption of risk neutrality over longevity, incorrect inferences will be made. For example, health state utilities obtained by the time tradeoff technique will be underestimated (Attema & Brouwer, 2010). A related critical issue is the importance of the assumption of risk neutrality for the normative status of cost per QALY decision making rules. The QALY model can be justified on the basis of a life‐cycle model in which the individual maximizes a lifetime utility, expressed as the expected present value of the sum of future utilities derived from consumption and QoL (Bleichrodt & Quiggin, 1999). By connecting the cost per QALY measures and individual preferences when lifetime utility depends on both health status and consumption, the life‐cycle model is key for HTA to have a foundation in welfare economics. , The use of risk apportionment techniques to identify higher‐order risk attitudes, and therefore infer the properties of the utility function has its own limitations. Under expected utility, risk apportionment techniques allow to obtain clear measurements of the signs of successive derivatives of the utility function from behavioral traits. The method is easy to handle for experimenters and the elements of choices are rather easy to understand for participants to an experiment. However, risk apportionment techniques perform poorly if one needs to obtain precise knowledge on the shape of the utility function. Such knowledge is required if one wants a precise elicitation of the effect of a risky comorbidity on the optimal treatment decision. For such comparative statics results, elicitation of risk aversion and other higher‐order risk preferences by risk apportionment techniques are too coarse to elicit all the determinants of marginal benefits and marginal costs of treatment. A precise empirical assessment of those comparative statics would require an elicitation of more complex objects such as prudence premia for longevity and health status. Another limitation is that we used a student sample for our lab study. Although this sample is not representative of the general public, it was useful for a first test application of risk apportionment techniques to the QALY model. Nevertheless, a clear drawback of our young sample is that they are unlikely to have much experience with illness. Hence, our conclusions, even if firm (especially for correlation seeking), should be interpreted with caution and future research should test if our first results can be generalized to the general public's preferences and, perhaps, patient preferences. The QALY model has been largely challenged as a descriptive model for health decisions, mainly because of violations of expected utility (Bleichrodt & Pinto, 2005). One of the reasons why the QALY model would fail to represent risk preferences is therefore largely due to biases and heuristics in elicitation methods, such as the certainty effect (Bleichrodt et al., 2007) or loss aversion (Bleichrodt, Pinto, & Abellán‐Perpinán (2003)). In this paper we used a different methodology, based on risk apportionments, to assess the descriptive ability of the QALY model. One advantage of this methodology rests on its use of paired gambles, for which expected utility is less likely to be violated. Our results show that, at least within expected utility, the QALY model could not be easily rejected, and that it is important to allow for risk aversion and higher‐order risk attitudes for longevity and QoL in cost‐effectiveness and cost‐benefit analysis.

CONFLICT OF INTEREST

The authors have nothing to disclose.

ETHICS STATEMENT

Ethical approval for this study was obtained from the ethical review board of Erasmus University Rotterdam. Supporting Information S1 Click here for additional data file.

19 in total

1. A consistency test of the time trade-off.

Authors: Han Bleichrodt; Jose Luis Pinto; Jose Maria Abellan-Perpiñan
Journal: J Health Econ Date: 2003-11 Impact factor: 3.883

2. A direct method for measuring discounting and QALYs more easily and reliably.

Authors: Arthur E Attema; Han Bleichrodt; Peter P Wakker
Journal: Med Decis Making Date: 2012-06-15 Impact factor: 2.583

3. The value of correcting values: influence and importance of correcting TTO scores for time preference.

Authors: Arthur E Attema; Werner B F Brouwer
Journal: Value Health Date: 2010-12 Impact factor: 5.725

4. Moments when utilities are functional.

Authors: Stephen G Pauker
Journal: Med Decis Making Date: 2014-01 Impact factor: 2.583

5. Prospect theory in the health domain: a quantitative assessment.

Authors: Arthur E Attema; Werner B F Brouwer; Olivier I'Haridon
Journal: J Health Econ Date: 2013-09-03 Impact factor: 3.883

6. Measuring multivariate risk preferences in the health domain.

Authors: Arthur E Attema; Olivier l'Haridon; Gijs van de Kuilen
Journal: J Health Econ Date: 2018-12-27 Impact factor: 3.883

7. The treatment decision under uncertainty: The effects of health, wealth and the probability of death.

Authors: Stefan Felder
Journal: J Health Econ Date: 2019-11-16 Impact factor: 3.883

8. Attitudes toward quality of survival. The concept of "maximal endurable time".

Authors: H J Sutherland; H Llewellyn-Thomas; N F Boyd; J E Till
Journal: Med Decis Making Date: 1982 Impact factor: 2.583

9. Recommendations for Conduct, Methodological Practices, and Reporting of Cost-effectiveness Analyses: Second Panel on Cost-Effectiveness in Health and Medicine.

Authors: Gillian D Sanders; Peter J Neumann; Anirban Basu; Dan W Brock; David Feeny; Murray Krahn; Karen M Kuntz; David O Meltzer; Douglas K Owens; Lisa A Prosser; Joshua A Salomon; Mark J Sculpher; Thomas A Trikalinos; Louise B Russell; Joanna E Siegel; Theodore G Ganiats
Journal: JAMA Date: 2016-09-13 Impact factor: 56.272

10. Multivariate risk preferences in the quality-adjusted life year model.

Authors: Arthur E Attema; Jona J Frasch; Olivier L'Haridon
Journal: Health Econ Date: 2021-11-18 Impact factor: 2.395

1 in total

1. Multivariate risk preferences in the quality-adjusted life year model.

Authors: Arthur E Attema; Jona J Frasch; Olivier L'Haridon
Journal: Health Econ Date: 2021-11-18 Impact factor: 2.395

1 in total