Literature DB >> 28769547

Win Ratio -An Intuitive and Easy-To-Interpret Composite Outcome in Medical Studies.

Hongyue Wang1, Jing Peng1, Juila Z Zheng2, Bokai Wang1, Xiang Lu1, Chongshu Chen1, Xin M Tu3, Changyong Feng1,4.   

Abstract

In medical studies with multiple outcomes, researchers always need to make choices as to whether to use a composite outcome (after combining multiple outcomes) as their primary outcome. In this paper we review a new measurement of the treatment effect - win ratio, which can be easily used in studies with prioritized multiple outcomes. We also propose some research topics to be done in this area.

Entities:  

Keywords:  proportional hazards model; survival functions; time-to-event data

Year:  2017        PMID: 28769547      PMCID: PMC5518256          DOI: 10.11919/j.issn.1002-0829.217011

Source DB:  PubMed          Journal:  Shanghai Arch Psychiatry        ISSN: 1002-0829


1. Introduction

Multiple outcomes are quite common in clinical and observational studies as it is difficult to use a single outcome to characterize treatment (exposure) effects. Another reason is that the complexity of a disease may be not adequately characterized by a single outcome. For example, in the study of depression in psychiatric studies, the outcomes may include (i) depression severity such as the Hamilton Depression Rating Scale[; (ii) instrumental role functioning such as the Social Adjustment Rating Scale[; (iii) social functioning such as the Short Form 36-Item Health Survey[ and (iv) depression remission status based on clinical assessment. In heart research[, outcomes of primary interest may include the time to the first heart failure (from study baseline), to death due to heart disease, and to death due to other reasons. Most research studies attempt to collect as much information as possible in order to study the disease of interest from multiple aspects to find the optimal treatment. Thus, multiple outcomes are usually considered to study treatment effects. However, multiple outcomes impose challenges in data analysis as well as interpretations of study results: (i) the multiple outcomes in the same study may have different scales. For example, in psychiatric studies, depression status is categorical, while depression severity and social functioning may be continuous (or treated as such if the scale has a wide range). It is difficult to use a multivariate distribution for this kind of mixed outcomes; and (ii) directions of treatment effects may be different for different outcomes. For example, treatment for depression may reduce depression severity and improve daily living, but may also reduce sexual desires for depressed patients. A composite outcome is a single outcome based on combinations of multiple study outcomes and is widely used to summarize information from the multiple outcomes. The Intelligence Quotient (IQ) test is an example of a composite outcome, as the test result is a weighted score from four subscales: verbal compression, perceptual reasoning, working memory, and processing speed.[ Composite outcome is also widely used in clinical and observational studies. Overweight or obese people are generally at higher risks for many diseases such as diabetes, hypertension, and heart disease. One widely used measure for obesity is the Body Mass Index (BMI)[ which is defined as The validity of this index has been widely discussed in literature.[ In heart research, patients may experience both heart failures and death, which are two different types of events.[ A treatment may decrease the risks of both types of events or may decrease the risk of one, and increase the risk of the other. How to measure treatment effect is a challenging problem for both physicians and statisticians. One popular practice in heart research is to define a composite outcome, for example,[ the time to heart failure or death, whichever occurs first. Some well-established methods, such as the Kaplan-Meir estimator[ and the Cox proportional hazards model[, can be readily used to analyze such composite outcomes. The advantages and the disadvantages of composite outcomes have been discussed extensively in the medical literature.[ Heddle and Cook[ summarize the advantages and disadvantages of using composite outcomes in clinical studies. They also give some guidelines on how to select composite outcomes. The decision to use composite outcome may be debated forever. In this report note, we review a relatively new method – win ratio- to analyze some types of multiple outcomes in clinical studies. In Section 2 we give a brief introduction to win ratio and statistical properties. Section 3 focuses on the interpretation of this parameter, the population win ratio. Section 4 reports simulation results of estimating win ratio if both the marginal survival functions of the fatal event and the conditional survival functions of the nonfatal event satisfies being in the Lehmman family,[ followed by conclusion and discussion in Section 5.

2. A brief introduction to win ratio method

Finkelstein and Schoenfeld[ first develop a nonparametric test, which will be referred to as the FS rank test below, to combine time-to-event and longitudinal outcomes in clinical studies. In this method, each pair chosen from the treatment and control group are compared based on these two types of outcomes. First, subjects are compared based on the time-to-event outcome. If the individual in the treatment group has a longer (shorter) time to event, the treatment group is said to win (lose) in this pair. If for some reason (for example, censoring), their times to event cannot be compared, the longitudinal outcomes will be compared and the win or loss will be determined similarly. The FS rank test is the difference between the numbers of wins and losses in the treatment group. The FR rank test is a generalization of the Wilcoxon rank sum test. The idea of FS rank test was later expanded by Pocock and colleagues[ in a clinical trial with two types of events: fatal event (cardiovascular death) and non-fatal event (heart failure hospitalization). Fatal event is assumed to have a higher priority than the non-fatal event when evaluating treatment effects. For each pair of patients, we first determine which patient lives longer than the other. If this is unknown, we then determine who has a better non-fatal event. This is the same idea as the FS rank test. The ratio of the wins over losses in the treatment group is called the win ratio. The treatment is beneficial compared to the control if the win ratio is greater than 1. Pocock and colleagues[ offered two approaches to calculating the sample win ratio: (1) Match pair approach: This is a three-step method to calculating the win ratio statistic. Use a risk score or risk stratification to select matched pairs of patients from treatment and control group. For each matched pair, one first compares the priority event (e.g., fatal event). If the comparison cannot be made (e.g., which patient dies first), then check the second priority event (e.g., time to hospitalization). There are 5 possible results based on these two comparisons: patient in the treatment group dies first; patient in the control group dies first; if not (a) or (b), patient in the treatment group is hospitalized first; if not (a) or (b), patient in the control group is hospitalized first; none of the above. The comparison results are summarized by Na, Nb, Nc, Nd, and Ne, corresponding to the numbers of matched pairs in (a), (b), (c), (d), and (e). The numbers of known wins and losses in the treatment group are Nw=Nb + Nd and Nl=Na + Nc. The sample win ratio is Pocock and colleagues[ constructed 95% confidence interval (CI) and p-value for the win ratio based on the matched pairs approach. Let pw =Nw /(Nw +N1) and p the 95% CI of the win ratio is [pL/(1-pL), pU/(1-pU)]. For a significance test, (pw-0.5)/ [pw(1-pw)/(Nw+Nl)]1/2 has an asymptotic standardized normal distribution under the null hypothesis of no difference between the treatment and control group. (2) Unmatched pair approach: Analogous to the matched pairs approach, the unmatched pairs approach classifies the comparison results into five categories. The difference is that we compare every subject in the treatment group with every subject in the control group. Let Nn and Ns be the number of patients in the treatment and control group, we need to make Nn×Ns comparisons. In the unmatched pair approach, calculating the 95% CI and p-value for Rware quite complex. Finkelstein & Schoenfeld[ described the general idea of significance test. Two-sample U- statistic theory provides expressions for the asymptotic variance of the win-ratio statistic for unmatched pairs. Luo and colleagues[ derive an alternative standard error estimate using counting process methods.

3. The mathematical meaning of win ratio

Recently, Oakes[ extended the win ratio statistic to define the probability win ratio, the ratio of the win and loss probabilities, for general survival models when follow-up of all patients is over a specified time interval [0, c]. This extension not only avoids the complex subjecting matching process (or comparisons of all subjects between the treatment and control group), but also gives rise to nice statistical properties. Let (T, X) denote the pair of time to death (T) and to first non-fatal (X) event. For j=0, 1 with 0 (1) denoting the control (treatment) group, let S where Sj(t)=Pr[Tj>t] (Gj(x|c)=Pr[Xj>x|Tj>c]) is the marginal survival function of Tj (conditional survival function of Xjgiven Tj>c). The win and loss probabilities are Thus the probability win ratio is the ratio of W(c) over L(c), Note that unlike the original sample win ratio in (1.1), the probability win ratio is an unknown constant or a parameter, rather than a statistic as in (1.1). The probability win ratio of greater than 1 indicates evidence of treatment effect in favor of the treatment group. For example, PRw = 1.5 means that there is a 50% increase in number of deaths (if observed) or non-fatal events (if death is censored) for a patient in a treatment group than in a control group. It is clear that the win ratio defined above depends on the specified observation window [0, c]. This means that for the same study, if we stop the study at different follow-up times, we will be estimating different win ratio parameters, PRW (C). In the extreme case, if c is sufficiently large, deaths censored by the observation window will be negligibly small and the second priority event may become irrelevant. In general, PRW (C) is a function of c. However, as shown in Oakes,[ the win ratio will be independent of the observation time c if the marginal survival function and conditional survival function are both in the Lehmann family sharing the same parameters. Moreover, this shared parameter has the interpretation of both the hazard ratio and loss ratio (the reciprocal of the win ratio). This is quite a nice statistical property as the hazard ratio is widely used to measure treatment effects under the popular Cox proportional hazards model.

4. A small simulation study

We now discuss how to estimate the probability win ratio under the assumption that both the marginal and conditional survival functions are in the Lehmann family with the same parameters. We also show how to use the Wei-Lin-Weissfeld[ method to obtain a more efficient estimator by combining the estimators obtained from fatal and non-fatal events separately. In the control group, we assume the joint survival function of (T, X) is For the treatment group we assume the joint survival function of (T, X) is The correlation of T and X is described by α; T and X become highly if α is close to 0 and independent when α=1. Under the above specifications, the win ratio parameter in the absence of censoring is PRW = 1/θα. We also assume that both the fatal and non-fatal events are subject to independent censoring with the censoring time C exponentially distributed with a rate q. We first used the Cox proportional hazards model to estimate the win ratio based on (1) the non-fatal event, where the non-fatal event time may be censored by either the censoring time C or the fatal event time; (2) fatal event, where the time may be censored by the censoring time C. The Wei-Lin-Weissfeld[ method was used to combine these two estimators. Table 1 reports the results after 1,000 Monte Carlo simulations based on sample size n=2,000(λ1=0.1, λ2=0.6, θ=0.5, and q=0.5). With such a large sample size, both estimates from fatal events and non-fatal events should be close to the true win ratio parameter. The estimates obtained from the Wei-Lin-Weissfeld[48] combination method is always more efficient than the other two methods, especially when the correlation between the fatal and non-fatal event is weak (large α).
Table 1.

Estimate of win ratio

αWin ratioFatal eventNon-fatal eventCombined
EstimateSDEstimateSDEstimateSD
0.11.0721.0730.0481.0730.0471.0730.045
0.21.1491.1490.0541.1490.0511.1490.049
0.31.2311.2320.0651.2330.0571.2320.055
0.41.3201.3210.0761.3190.0631.3190.059
0.51.4141.4170.0841.4160.0661.4160.062
0.61.5161.5200.1041.5190.0721.5180.069
0.71.6251.6270.1171.6210.0781.6220.071
0.81.7411.74101.341.7430.0831.7410.075
0.91.8661.8740.1611.8680.0921.8670.083

5. Conclusion and Future work

In medical studies with multiple outcomes, it is often difficult to construct appropriate composite outcomes to evaluate treatment effects that reflect the multi-faceted nature of interventions. In this paper we review the concept of win ratio and associated methods to facilitate such a difficult process. The win ratio is an intuitive and easy-to-interpret composite outcome and is readily implemented within the context of survival analysis, especially with the probability win ratio. Since win ratio is new, more work is needed to study its properties and applications. We propose two future research areas in win ratio analysis: In most clinical or observational studies, we want to estimate the treatment effects after adjusting the confounding effects of other covariates. How to incorporate the estimation of win ratio in semiparametric regression analysis is a new area to be explored. Power analysis based on win ratio. In the clinical study design, the sample size calculation is usually based on the proposed treatment effects. How to calculate the sample size given the proposed win ratio is another research topic.
  36 in total

1.  Composite outcomes in randomized trials: greater precision but with greater uncertainty?

Authors:  Nick Freemantle; Melanie Calvert; John Wood; Joanne Eastaugh; Carl Griffin
Journal:  JAMA       Date:  2003-05-21       Impact factor: 56.272

2.  Pitfalls of using composite primary end points in the presence of competing risks.

Authors:  Loren K Mell; Jong-Hyeon Jeong
Journal:  J Clin Oncol       Date:  2010-08-16       Impact factor: 44.544

3.  Composite outcomes in clinical trials: what are they and when should they be used?

Authors:  Nancy M Heddle; Richard J Cook
Journal:  Transfusion       Date:  2010-11-23       Impact factor: 3.157

Review 4.  Validity of composite end points in clinical trials.

Authors:  Victor M Montori; Gaietà Permanyer-Miralda; Ignacio Ferreira-González; Jason W Busse; Valeria Pacheco-Huergo; Dianne Bryant; Jordi Alonso; Elie A Akl; Antònia Domingo-Salvany; Edward Mills; Ping Wu; Holger J Schünemann; Roman Jaeschke; Gordon H Guyatt
Journal:  BMJ       Date:  2005-03-12

5.  Key issues in end point selection for heart failure trials: composite end points.

Authors:  James D Neaton; Gerry Gray; Bram D Zuckerman; Marvin A Konstam
Journal:  J Card Fail       Date:  2005-10       Impact factor: 5.712

6.  The win ratio: a new approach to the analysis of composite endpoints in clinical trials based on clinical priorities.

Authors:  Stuart J Pocock; Cono A Ariti; Timothy J Collier; Duolao Wang
Journal:  Eur Heart J       Date:  2011-09-06       Impact factor: 29.983

7.  Composite outcomes in cardiovascular research: a survey of randomized trials.

Authors:  Eric Lim; Adam Brown; Adel Helmy; Shafi Mussa; Douglas G Altman
Journal:  Ann Intern Med       Date:  2008-11-04       Impact factor: 25.391

8.  Statistical Approaches to Modeling Multiple Outcomes In Psychiatric Studies.

Authors:  Armando Teixeira-Pinto; Juned Siddique; Robert Gibbons; Sharon-Lise Normand
Journal:  Psychiatr Ann       Date:  2009-07-01

Review 9.  Analysing recurrent hospitalizations in heart failure: a review of statistical methodology, with application to CHARM-Preserved.

Authors:  Jennifer K Rogers; Stuart J Pocock; John J V McMurray; Christopher B Granger; Eric L Michelson; Jan Östergren; Marc A Pfeffer; Scott D Solomon; Karl Swedberg; Salim Yusuf
Journal:  Eur J Heart Fail       Date:  2013-12-18       Impact factor: 15.534

10.  Prognostic impact of moderate or severe mitral regurgitation (MR) irrespective of concomitant comorbidities: a retrospective matched cohort study.

Authors:  Roshan Prakash; Matthew Horsfall; Andrew Markwick; Marsus Pumar; Leong Lee; Ajay Sinhal; Majo X Joseph; Derek P Chew
Journal:  BMJ Open       Date:  2014-07-17       Impact factor: 2.692

View more
  3 in total

1.  Statistical models for composite endpoints of death and non-fatal events: a review.

Authors:  Lu Mao; KyungMann Kim
Journal:  Stat Biopharm Res       Date:  2021-07-06       Impact factor: 1.586

2.  On recurrent-event win ratio.

Authors:  Lu Mao; KyungMann Kim; Yi Li
Journal:  Stat Methods Med Res       Date:  2022-03-29       Impact factor: 2.494

3.  A class of proportional win-fractions regression models for composite outcomes.

Authors:  Lu Mao; Tuo Wang
Journal:  Biometrics       Date:  2020-10-10       Impact factor: 1.701

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.