RATIONALE: Unsupervised statistical learning techniques, such as exploratory factor analysis (EFA) and hierarchical clustering (HC), have been used to identify asthma phenotypes, with partly consistent results. Some of the inconsistency is caused by the variable selection and demographic and clinical differences among study populations. OBJECTIVES: To investigate the effects of the choice of statistical method and different preparations of data on the clustering results; and to relate these to disease severity. METHODS: Several variants of EFA and HC were applied and compared using various sets of variables and different encodings and transformations within a dataset of 383 children with asthma. Variables included lung function, inflammatory and allergy markers, family history, environmental exposures, and medications. Clusters and original variables were related to asthma severity (logistic regression and Bayesian network analysis). MEASUREMENTS AND MAIN RESULTS: EFA identified five components (eigenvalues ≥ 1) explaining 35% of the overall variance. Variations of the HC (as linkage-distance functions) did not affect the cluster inference; however, using different variable encodings and transformations did. The derived clusters predicted asthma severity less than the original variables. Prognostic factors of severity were medication usage, current symptoms, lung function, paternal asthma, body mass index, and age of asthma onset. Bayesian networks indicated conditional dependence among variables. CONCLUSIONS: The use of different unsupervised statistical learning methods and different variable sets and encodings can lead to multiple and inconsistent subgroupings of asthma, not necessarily correlated with severity. The search for asthma phenotypes needs more careful selection of markers, consistent across different study populations, and more cautious interpretation of results from unsupervised learning.
RATIONALE: Unsupervised statistical learning techniques, such as exploratory factor analysis (EFA) and hierarchical clustering (HC), have been used to identify asthma phenotypes, with partly consistent results. Some of the inconsistency is caused by the variable selection and demographic and clinical differences among study populations. OBJECTIVES: To investigate the effects of the choice of statistical method and different preparations of data on the clustering results; and to relate these to disease severity. METHODS: Several variants of EFA and HC were applied and compared using various sets of variables and different encodings and transformations within a dataset of 383 children with asthma. Variables included lung function, inflammatory and allergy markers, family history, environmental exposures, and medications. Clusters and original variables were related to asthma severity (logistic regression and Bayesian network analysis). MEASUREMENTS AND MAIN RESULTS: EFA identified five components (eigenvalues ≥ 1) explaining 35% of the overall variance. Variations of the HC (as linkage-distance functions) did not affect the cluster inference; however, using different variable encodings and transformations did. The derived clusters predicted asthma severity less than the original variables. Prognostic factors of severity were medication usage, current symptoms, lung function, paternal asthma, body mass index, and age of asthma onset. Bayesian networks indicated conditional dependence among variables. CONCLUSIONS: The use of different unsupervised statistical learning methods and different variable sets and encodings can lead to multiple and inconsistent subgroupings of asthma, not necessarily correlated with severity. The search for asthma phenotypes needs more careful selection of markers, consistent across different study populations, and more cautious interpretation of results from unsupervised learning.
Authors: R O Crapo; R Casaburi; A L Coates; P L Enright; J L Hankinson; C G Irvin; N R MacIntyre; R T McKay; J S Wanger; S D Anderson; D W Cockcroft; J E Fish; P J Sterk Journal: Am J Respir Crit Care Med Date: 2000-01 Impact factor: 21.405
Authors: Anne M Fitzpatrick; W Gerald Teague; Deborah A Meyers; Stephen P Peters; Xingnan Li; Huashi Li; Sally E Wenzel; Shean Aujla; Mario Castro; Leonard B Bacharier; Benjamin M Gaston; Eugene R Bleecker; Wendy C Moore Journal: J Allergy Clin Immunol Date: 2010-12-31 Impact factor: 10.793
Authors: V Siroux; X Basagaña; A Boudier; I Pin; J Garcia-Aymerich; A Vesin; R Slama; D Jarvis; J M Anto; F Kauffmann; J Sunyer Journal: Eur Respir J Date: 2011-01-13 Impact factor: 16.671
Authors: Wendy C Moore; Deborah A Meyers; Sally E Wenzel; W Gerald Teague; Huashi Li; Xingnan Li; Ralph D'Agostino; Mario Castro; Douglas Curran-Everett; Anne M Fitzpatrick; Benjamin Gaston; Nizar N Jarjour; Ronald Sorkness; William J Calhoun; Kian Fan Chung; Suzy A A Comhair; Raed A Dweik; Elliot Israel; Stephen P Peters; William W Busse; Serpil C Erzurum; Eugene R Bleecker Journal: Am J Respir Crit Care Med Date: 2009-11-05 Impact factor: 21.405
Authors: Pranab Haldar; Ian D Pavord; Ruth H Green; Dominic E Shaw; Michael A Berry; Michael Thomas; Christopher E Brightling; Andrew J Wardlaw Journal: Am J Respir Crit Care Med Date: 2008-05-14 Impact factor: 21.405
Authors: Cindy Thamrin; Urs Frey; David A Kaminsky; Helen K Reddel; Andrew J E Seely; Béla Suki; Peter J Sterk Journal: Am J Respir Crit Care Med Date: 2016-11-01 Impact factor: 21.405
Authors: Edward M Zoratti; Rebecca Z Krouse; Denise C Babineau; Jacqueline A Pongracic; George T O'Connor; Robert A Wood; Gurjit K Khurana Hershey; Carolyn M Kercsmar; Rebecca S Gruchalla; Meyer Kattan; Stephen J Teach; Steven M Sigelman; Peter J Gergen; Alkis Togias; Cynthia M Visness; William W Busse; Andrew H Liu Journal: J Allergy Clin Immunol Date: 2016-10 Impact factor: 10.793
Authors: Danielle C M Belgrave; Raquel Granell; Angela Simpson; John Guiver; Christopher Bishop; Iain Buchan; A John Henderson; Adnan Custovic Journal: PLoS Med Date: 2014-10-21 Impact factor: 11.069
Authors: T Zaihra; C J Walsh; S Ahmed; C Fugère; Q A Hamid; R Olivenstein; J G Martin; A Benedetti Journal: BMC Pulm Med Date: 2016-05-10 Impact factor: 3.317
Authors: Anne M Fitzpatrick; Leonard B Bacharier; Daniel J Jackson; Stanley J Szefler; Avraham Beigelman; Michael Cabana; Ronina Covar; Theresa Guilbert; Fernando Holguin; Robert F Lemanske; Fernando D Martinez; Wayne Morgan; Wanda Phipatanakul; Jacqueline A Pongracic; Hengameh H Raissy; Robert S Zeiger; David T Mauger Journal: J Allergy Clin Immunol Pract Date: 2020-03-07