Ann A Lazar1, Marco Bonetti2, Bernard F Cole3, Wai-Ki Yip4, Richard D Gelber4. 1. Division of Oral Epidemiology, Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, San Francisco, CA, USA Division of Biostatistics, Department of Epidemiology & Biostatistics, University of California, San Francisco, San Francisco, CA, USA ann.lazar@ucsf.edu. 2. Carlo F. Dondena Centre for Research on Social Dynamics and Public Policies, Bocconi University, Milan, Italy. 3. Department of Mathematics and Statistics, University of Vermont, Burlington, VT, USA. 4. Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA.
Abstract
BACKGROUND: Investigators conducting randomized clinical trials often explore treatment effect heterogeneity to assess whether treatment efficacy varies according to patient characteristics. Identifying heterogeneity is central to making informed personalized healthcare decisions. Treatment effect heterogeneity can be investigated using subpopulation treatment effect pattern plot (STEPP), a non-parametric graphical approach that constructs overlapping patient subpopulations with varying values of a characteristic. Procedures for statistical testing using subpopulation treatment effect pattern plot when the endpoint of interest is survival remain an area of active investigation. METHODS: A STEPP analysis was used to explore patterns of absolute and relative treatment effects for varying levels of a breast cancer biomarker, Ki-67, in the phase III Breast International Group 1-98 randomized clinical trial, comparing letrozole to tamoxifen as adjuvant therapy for postmenopausal women with hormone receptor-positive breast cancer. Absolute treatment effects were measured by differences in 4-year cumulative incidence of breast cancer recurrence, while relative effects were measured by the subdistribution hazard ratio in the presence of competing risks using O-E (observed-minus-expected) methodology, an intuitive non-parametric method. While estimation of hazard ratio values based on O-E methodology has been shown, a similar development for the subdistribution hazard ratio has not. Furthermore, we observed that the subpopulation treatment effect pattern plot analysis may not produce results, even with 100 patients within each subpopulation. After further investigation through simulation studies, we observed inflation of the type I error rate of the traditional test statistic and sometimes singular variance-covariance matrix estimates that may lead to results not being produced. This is due to the lack of sufficient number of events within the subpopulations, which we refer to as instability of the subpopulation treatment effect pattern plot analysis. We introduce methodology designed to improve stability of the subpopulation treatment effect pattern plot analysis and generalize O-E methodology to the competing risks setting. Simulation studies were designed to assess the type I error rate of the tests for a variety of treatment effect measures, including subdistribution hazard ratio based on O-E estimation. This subpopulation treatment effect pattern plot methodology and standard regression modeling were used to evaluate heterogeneity of Ki-67 in the Breast International Group 1-98 randomized clinical trial. RESULTS: We introduce methodology that generalizes O-E methodology to the competing risks setting and that improves stability of the STEPP analysis by pre-specifying the number of events across subpopulations while controlling the type I error rate. The subpopulation treatment effect pattern plot analysis of the Breast International Group 1-98 randomized clinical trial showed that patients with high Ki-67 percentages may benefit most from letrozole, while heterogeneity was not detected using standard regression modeling. CONCLUSION: The STEPP methodology can be used to study complex patterns of treatment effect heterogeneity, as illustrated in the Breast International Group 1-98 randomized clinical trial. For the subpopulation treatment effect pattern plot analysis, we recommend a minimum of 20 events within each subpopulation.
RCT Entities:
BACKGROUND: Investigators conducting randomized clinical trials often explore treatment effect heterogeneity to assess whether treatment efficacy varies according to patient characteristics. Identifying heterogeneity is central to making informed personalized healthcare decisions. Treatment effect heterogeneity can be investigated using subpopulation treatment effect pattern plot (STEPP), a non-parametric graphical approach that constructs overlapping patient subpopulations with varying values of a characteristic. Procedures for statistical testing using subpopulation treatment effect pattern plot when the endpoint of interest is survival remain an area of active investigation. METHODS: A STEPP analysis was used to explore patterns of absolute and relative treatment effects for varying levels of a breast cancer biomarker, Ki-67, in the phase III Breast International Group 1-98 randomized clinical trial, comparing letrozole to tamoxifen as adjuvant therapy for postmenopausal women with hormone receptor-positive breast cancer. Absolute treatment effects were measured by differences in 4-year cumulative incidence of breast cancer recurrence, while relative effects were measured by the subdistribution hazard ratio in the presence of competing risks using O-E (observed-minus-expected) methodology, an intuitive non-parametric method. While estimation of hazard ratio values based on O-E methodology has been shown, a similar development for the subdistribution hazard ratio has not. Furthermore, we observed that the subpopulation treatment effect pattern plot analysis may not produce results, even with 100 patients within each subpopulation. After further investigation through simulation studies, we observed inflation of the type I error rate of the traditional test statistic and sometimes singular variance-covariance matrix estimates that may lead to results not being produced. This is due to the lack of sufficient number of events within the subpopulations, which we refer to as instability of the subpopulation treatment effect pattern plot analysis. We introduce methodology designed to improve stability of the subpopulation treatment effect pattern plot analysis and generalize O-E methodology to the competing risks setting. Simulation studies were designed to assess the type I error rate of the tests for a variety of treatment effect measures, including subdistribution hazard ratio based on O-E estimation. This subpopulation treatment effect pattern plot methodology and standard regression modeling were used to evaluate heterogeneity of Ki-67 in the Breast International Group 1-98 randomized clinical trial. RESULTS: We introduce methodology that generalizes O-E methodology to the competing risks setting and that improves stability of the STEPP analysis by pre-specifying the number of events across subpopulations while controlling the type I error rate. The subpopulation treatment effect pattern plot analysis of the Breast International Group 1-98 randomized clinical trial showed that patients with high Ki-67 percentages may benefit most from letrozole, while heterogeneity was not detected using standard regression modeling. CONCLUSION: The STEPP methodology can be used to study complex patterns of treatment effect heterogeneity, as illustrated in the Breast International Group 1-98 randomized clinical trial. For the subpopulation treatment effect pattern plot analysis, we recommend a minimum of 20 events within each subpopulation.
Authors: P C Clahsen; C J van de Velde; C Duval; C Pallud; A M Mandard; A Delobelle-Deroide; L van den Broek; M J van de Vijver Journal: Eur J Surg Oncol Date: 1999-08 Impact factor: 4.424
Authors: Alan S Coates; Aparna Keshaviah; Beat Thürlimann; Henning Mouridsen; Louis Mauriac; John F Forbes; Robert Paridaens; Monica Castiglione-Gertsch; Richard D Gelber; Marco Colleoni; István Láng; Lucia Del Mastro; Ian Smith; Jacquie Chirgwin; Jean-Marie Nogaret; Tadeusz Pienkowski; Andrew Wardley; Erik H Jakobsen; Karen N Price; Aron Goldhirsch Journal: J Clin Oncol Date: 2007-01-02 Impact factor: 44.544
Authors: Beat Thürlimann; Aparna Keshaviah; Alan S Coates; Henning Mouridsen; Louis Mauriac; John F Forbes; Robert Paridaens; Monica Castiglione-Gertsch; Richard D Gelber; Manuela Rabaglio; Ian Smith; Andrew Wardley; Andrew Wardly; Karen N Price; Aron Goldhirsch Journal: N Engl J Med Date: 2005-12-29 Impact factor: 91.245
Authors: R Peto; M C Pike; P Armitage; N E Breslow; D R Cox; S V Howard; N Mantel; K McPherson; J Peto; P G Smith Journal: Br J Cancer Date: 1977-01 Impact factor: 7.640
Authors: Wai-Ki Yip; Marco Bonetti; Bernard F Cole; William Barcella; Xin Victoria Wang; Ann Lazar; Richard D Gelber Journal: Clin Trials Date: 2016-04-19 Impact factor: 2.486
Authors: N A Danhof; M van Wely; C A M Koks; J Gianotten; J P de Bruin; B J Cohlen; D P van der Ham; N F Klijn; M H A van Hooff; F J M Broekmans; K Fleischer; C A H Janssen; J M Rijn van Weert; J van Disseldorp; M Twisk; M Traas; M F G Verberg; M J Pelinck; J Visser; D A M Perquin; D E S Boks; H R Verhoeve; C F van Heteren; B W J Mol; S Repping; F van der Veen; M H Mochtar Journal: BMJ Open Date: 2017-05-25 Impact factor: 2.692
Authors: Jozef Bartunek; Andre Terzic; Beth A Davison; Gerasimos S Filippatos; Slavica Radovanovic; Branko Beleslin; Bela Merkely; Piotr Musialek; Wojciech Wojakowski; Peter Andreka; Ivan G Horvath; Amos Katz; Dariouch Dolatabadi; Badih El Nakadi; Aleksandra Arandjelovic; Istvan Edes; Petar M Seferovic; Slobodan Obradovic; Marc Vanderheyden; Nikola Jagic; Ivo Petrov; Shaul Atar; Majdi Halabi; Valeri L Gelev; Michael K Shochat; Jaroslaw D Kasprzak; Ricardo Sanz-Ruiz; Guy R Heyndrickx; Noémi Nyolczas; Victor Legrand; Antoine Guédès; Alex Heyse; Tiziano Moccetti; Francisco Fernandez-Aviles; Pilar Jimenez-Quevedo; Antoni Bayes-Genis; Jose Maria Hernandez-Garcia; Flavio Ribichini; Marcin Gruchala; Scott A Waldman; John R Teerlink; Bernard J Gersh; Thomas J Povsic; Timothy D Henry; Marco Metra; Roger J Hajjar; Michal Tendera; Atta Behfar; Bertrand Alexandre; Aymeric Seron; Wendy Gattis Stough; Warren Sherman; Gad Cotter; William Wijns Journal: Eur Heart J Date: 2017-03-01 Impact factor: 29.983