Wei Wu1, Eugene Bleecker2, Wendy Moore2, William W Busse3, Mario Castro4, Kian Fan Chung5, William J Calhoun6, Serpil Erzurum7, Benjamin Gaston8, Elliot Israel9, Douglas Curran-Everett10, Sally E Wenzel11. 1. Lane Center for Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pa. Electronic address: weiwu2@cs.cmu.edu. 2. Center for Human Genomics, School of Medicine, Wake Forest University, Winston-Salem, NC. 3. Division of Allergy and Immunology, University of Wisconsin, Madison, Wis. 4. Division of Pulmonary & Critical Care Medicine, Washington University School of Medicine, St Louis, Mo. 5. National Heart & Lung Institute, Imperial College, London, United Kingdom. 6. Department of Internal Medicine, University of Texas Medical Branch, Galveston, Tex. 7. Department of Pulmonary, Allergy and Critical Care Medicine, Cleveland Clinic, Cleveland, Ohio. 8. Division of Pediatric Pulmonology, and Allergy/Immunology, Department of Pediatrics, School of Medicine, Case Western Reserve University, Cleveland, Ohio. 9. Pulmonary Division, Brigham and Women's Hospital, Boston, Mass. 10. National Jewish Medical and Research Center, University of Colorado Health Sciences Center, Denver, Colo. 11. National Jewish Medical and Research Center, University of Colorado Health Sciences Center, Denver, Colo; Asthma Institute, Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pa. Electronic address: wenzelse@upmc.edu.
Abstract
BACKGROUND: Previous studies have identified asthma phenotypes based on small numbers of clinical, physiologic, or inflammatory characteristics. However, no studies have used a wide range of variables using machine learning approaches. OBJECTIVES: We sought to identify subphenotypes of asthma by using blood, bronchoscopic, exhaled nitric oxide, and clinical data from the Severe Asthma Research Program with unsupervised clustering and then characterize them by using supervised learning approaches. METHODS: Unsupervised clustering approaches were applied to 112 clinical, physiologic, and inflammatory variables from 378 subjects. Variable selection and supervised learning techniques were used to select relevant and nonredundant variables and address their predictive values, as well as the predictive value of the full variable set. RESULTS: Ten variable clusters and 6 subject clusters were identified, which differed and overlapped with previous clusters. Patients with traditionally defined severe asthma were distributed through subject clusters 3 to 6. Cluster 4 identified patients with early-onset allergic asthma with low lung function and eosinophilic inflammation. Patients with later-onset, mostly severe asthma with nasal polyps and eosinophilia characterized cluster 5. Cluster 6 asthmatic patients manifested persistent inflammation in blood and bronchoalveolar lavage fluid and exacerbations despite high systemic corticosteroid use and side effects. Age of asthma onset, quality of life, symptoms, medications, and health care use were some of the 51 nonredundant variables distinguishing subject clusters. These 51 variables classified test cases with 88% accuracy compared with 93% accuracy with all 112 variables. CONCLUSION: The unsupervised machine learning approaches used here provide unique insights into disease, confirming other approaches while revealing novel additional phenotypes.
BACKGROUND: Previous studies have identified asthma phenotypes based on small numbers of clinical, physiologic, or inflammatory characteristics. However, no studies have used a wide range of variables using machine learning approaches. OBJECTIVES: We sought to identify subphenotypes of asthma by using blood, bronchoscopic, exhaled nitric oxide, and clinical data from the Severe Asthma Research Program with unsupervised clustering and then characterize them by using supervised learning approaches. METHODS: Unsupervised clustering approaches were applied to 112 clinical, physiologic, and inflammatory variables from 378 subjects. Variable selection and supervised learning techniques were used to select relevant and nonredundant variables and address their predictive values, as well as the predictive value of the full variable set. RESULTS: Ten variable clusters and 6 subject clusters were identified, which differed and overlapped with previous clusters. Patients with traditionally defined severe asthma were distributed through subject clusters 3 to 6. Cluster 4 identified patients with early-onset allergic asthma with low lung function and eosinophilic inflammation. Patients with later-onset, mostly severe asthma with nasal polyps and eosinophilia characterized cluster 5. Cluster 6 asthmatic patients manifested persistent inflammation in blood and bronchoalveolar lavage fluid and exacerbations despite high systemic corticosteroid use and side effects. Age of asthma onset, quality of life, symptoms, medications, and health care use were some of the 51 nonredundant variables distinguishing subject clusters. These 51 variables classified test cases with 88% accuracy compared with 93% accuracy with all 112 variables. CONCLUSION: The unsupervised machine learning approaches used here provide unique insights into disease, confirming other approaches while revealing novel additional phenotypes.
Authors: Wendy C Moore; Deborah A Meyers; Sally E Wenzel; W Gerald Teague; Huashi Li; Xingnan Li; Ralph D'Agostino; Mario Castro; Douglas Curran-Everett; Anne M Fitzpatrick; Benjamin Gaston; Nizar N Jarjour; Ronald Sorkness; William J Calhoun; Kian Fan Chung; Suzy A A Comhair; Raed A Dweik; Elliot Israel; Stephen P Peters; William W Busse; Serpil C Erzurum; Eugene R Bleecker Journal: Am J Respir Crit Care Med Date: 2009-11-05 Impact factor: 21.405
Authors: Pranab Haldar; Ian D Pavord; Ruth H Green; Dominic E Shaw; Michael A Berry; Michael Thomas; Christopher E Brightling; Andrew J Wardlaw Journal: Am J Respir Crit Care Med Date: 2008-05-14 Impact factor: 21.405
Authors: Miriam F Moffatt; Ivo G Gut; Florence Demenais; David P Strachan; Emmanuelle Bouzigon; Simon Heath; Erika von Mutius; Martin Farrall; Mark Lathrop; William O C M Cookson Journal: N Engl J Med Date: 2010-09-23 Impact factor: 91.245
Authors: Christopher John Staniorski; Caroline P E Price; Ava R Weibman; Kevin C Welch; David B Conley; Stephanie Shintani-Smith; Whitney W Stevens; Anju T Peters; Leslie Grammer; Alcina K Lidder; Robert P Schleimer; Robert C Kern; Bruce K Tan Journal: Int Forum Allergy Rhinol Date: 2018-01-05 Impact factor: 3.858
Authors: Prescott G Woodruff; Maarten van den Berge; Richard C Boucher; Christopher Brightling; Esteban G Burchard; Stephanie A Christenson; MeiLan K Han; Michael J Holtzman; Monica Kraft; David A Lynch; Fernando D Martinez; Helen K Reddel; Don D Sin; George R Washko; Sally E Wenzel; Antonello Punturieri; Michelle M Freemer; Robert A Wise Journal: Am J Respir Crit Care Med Date: 2017-08-01 Impact factor: 21.405
Authors: Manuel A R Ferreira; Riddhima Mathur; Judith M Vonk; Agnieszka Szwajda; Ben Brumpton; Raquel Granell; Bronwyn K Brew; Vilhelmina Ullemar; Yi Lu; Yunxuan Jiang; Patrik K E Magnusson; Robert Karlsson; David A Hinds; Lavinia Paternoster; Gerard H Koppelman; Catarina Almqvist Journal: Am J Hum Genet Date: 2019-03-28 Impact factor: 11.025
Authors: Min Xie; Anthony T Mustovich; Yi Jiang; John B Trudeau; Anuradha Ray; Prabir Ray; Haizhen Hu; Fernando Holguin; Bruce Freeman; Sally E Wenzel Journal: J Allergy Clin Immunol Date: 2014-10-11 Impact factor: 10.793
Authors: Sara L Hall; Theresa Baker; Stephane Lajoie; Phoebe K Richgels; Yanfen Yang; Jaclyn W McAlees; Adelaide van Lier; Marsha Wills-Karp; Umasundari Sivaprasad; Thomas H Acciani; Timothy D LeCras; Jocelyn Biagini Myers; Melinda Butsch Kovacic; Ian P Lewkowich Journal: J Allergy Clin Immunol Date: 2016-06-11 Impact factor: 10.793
Authors: Tao Huang; Meredith Hazen; Yonglei Shang; Meijuan Zhou; Xiumin Wu; Donghong Yan; Zhonghua Lin; Margaret Solon; Elizabeth Luis; Hai Ngu; Yongchang Shi; Arna Katewa; David F Choy; Nandhini Ramamoorthi; Erick R Castellanos; Mercedesz Balazs; Min Xu; Wyne P Lee; Marissa L Matsumoto; Jian Payandeh; Joseph R Arron; Jo-Anne Hongo; Jianyong Wang; Isidro Hötzel; Cary D Austin; Karin Reif Journal: JCI Insight Date: 2016-05-19