Pratik Sinha (1), Kevin L Delucchi (2), Daniel F McAuley (3), Cecilia M O'Kane (4), Michael A Matthay (5), Carolyn S Calfee (5).
1. Department of Medicine, Division of Pulmonary, Critical Care, Allergy and Sleep Medicine, University of California, San Francisco, San Francisco, CA, USA; Department of Anesthesia, University of California, San Francisco, San Francisco, CA, USA. Electronic address: pratik.sinha@ucsf.edu.
2. Department of Psychiatry, University of California, San Francisco, San Francisco, CA, USA.
3. The Wellcome-Wolfson Institute for Experimental Medicine, Queen's University, Belfast, UK; Regional Intensive Care Unit, The Royal Hospitals, Belfast, UK.
4. The Wellcome-Wolfson Institute for Experimental Medicine, Queen's University, Belfast, UK.
5. Department of Medicine, Division of Pulmonary, Critical Care, Allergy and Sleep Medicine, University of California, San Francisco, San Francisco, CA, USA; Department of Anesthesia, University of California, San Francisco, San Francisco, CA, USA.
Abstract
BACKGROUND: Latent class analysis (LCA) in five randomised controlled trial (RCT) cohorts has identified two distinct phenotypes of acute respiratory distress syndrome (ARDS): hypoinflammatory and hyperinflammatory. The phenotypes are associated with differential outcomes and treatment response. The objective of this study was to develop parsimonious models for phenotype identification that would be accurate and feasible to use in the clinical setting.
METHODS: In this retrospective study, three RCT cohorts from the National Heart, Lung, and Blood Institute ARDS Network (ARMA, ALVEOLI, and FACTT) were used as the derivation dataset (n=2022), from which the machine-learning and logistic regression classifier models were derived, and a fourth cohort from the same network (SAILS; n=715) was used as the validation test set. LCA-derived phenotypes in all of these cohorts served as the reference standard. Machine-learning algorithms (random forest, bootstrap aggregating, and least absolute shrinkage and selection operator) were used to select a maximum of six important classifier variables, which were then used to develop nested logistic regression models. Only cases with complete biomarker data in the derivation dataset were used for variable selection. The best logistic regression models, judged on parsimony and predictive accuracy, were then evaluated in the validation test set. Finally, the models' prognostic validity was tested in two external ARDS clinical trial datasets (START and HARP-2) by assessing mortality at days 28, 60, and 90 and ventilator-free days to day 28.
FINDINGS: The six most important classifier variables were interleukin (IL)-8, IL-6, protein C, soluble tumour necrosis factor receptor 1, bicarbonate, and vasopressor use. From the nested models, the three-variable model (IL-8, bicarbonate, and protein C) and the four-variable model (the three-variable model plus vasopressor use) were adjudicated to be the best performing. In the validation test set, both models showed good accuracy against LCA classifications (AUC 0·94 [95% CI 0·92–0·95] for the three-variable model and 0·95 [95% CI 0·93–0·96] for the four-variable model). As with LCA-derived phenotypes, the hyperinflammatory phenotype as identified by the classifier model was associated with higher mortality at day 90 (87 [39%] of 223 patients vs 112 [23%] of 492 patients; p<0·0001) and fewer ventilator-free days (median 14 days [IQR 0–22] vs 22 days [0–25]; p<0·0001). In the external validation datasets, three-variable models developed in the derivation dataset identified two phenotypes with distinct clinical features and outcomes consistent with previous findings, including differential survival with simvastatin versus placebo in HARP-2 (p=0·023 for survival at 28 days).
INTERPRETATION: ARDS phenotypes can be accurately identified with parsimonious classifier models using three or four variables. Pending the development of real-time testing for key biomarkers and prospective validation, these models could facilitate identification of ARDS phenotypes to enable their application in clinical trials and practice.
FUNDING: National Institutes of Health.
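The day-90 mortality comparison reported in the findings can be checked from the counts given (87 of 223 vs 112 of 492). The following is a minimal sketch, assuming an uncorrected Pearson chi-square test on the 2x2 table; the abstract does not state which test the authors used, so the function name and the choice of test are illustrative assumptions rather than the study's method.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Uncorrected Pearson chi-square (1 df) for the 2x2 table [[a, b], [c, d]].

    Assumed test: the abstract does not specify how its p value was obtained.
    """
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # With 1 degree of freedom, P(X > chi2) = erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Day-90 deaths and group sizes as reported for the validation test set.
hyper_deaths, hyper_n = 87, 223
hypo_deaths, hypo_n = 112, 492

chi2, p_value = chi_square_2x2(
    hyper_deaths, hyper_n - hyper_deaths,  # hyperinflammatory: died, survived
    hypo_deaths, hypo_n - hypo_deaths,     # hypoinflammatory: died, survived
)

hyper_pct = 100 * hyper_deaths / hyper_n
hypo_pct = 100 * hypo_deaths / hypo_n
print(f"hyperinflammatory {hyper_pct:.0f}% vs hypoinflammatory {hypo_pct:.0f}%")
print(f"chi-square = {chi2:.2f}, p = {p_value:.2g}")
```

Under this assumed test, the rounded percentages match the reported 39% and 23%, and the p value falls below 0·0001, in line with the abstract. The two group sizes also sum to 715, the size of the SAILS validation set.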