Literature DB >> 35080316

Machine learning: A modern approach to pediatric asthma.

Giovanna Cilluffo1, Salvatore Fasola1, Giuliana Ferrante2, Amelia Licari3,4, Giuseppe Roberto Marseglia5, Andrea Albarelli5, Gian Luigi Marseglia3,4, Stefania La Grutta1.   

Abstract

Among modern methods of statistical and computational analysis, the application of machine learning (ML) to healthcare data has been gaining recognition in helping us understand the heterogeneity of asthma and predicting its progression. In pediatric research, ML approaches may provide rapid advances in uncovering asthma phenotypes with potential translational impact in clinical practice. Also, several accurate models to predict asthma and its progression have been developed using ML. Here, we provide a brief overview of ML approaches recently proposed to characterize pediatric asthma.
© 2022 The Authors. Pediatric Allergy and Immunology published by European Academy of Allergy and Clinical Immunology and John Wiley & Sons Ltd.

Entities:  

Keywords:  asthma; children; machine learning; phenotypes

Mesh:

Year:  2022        PMID: 35080316      PMCID: PMC9303472          DOI: 10.1111/pai.13624

Source DB:  PubMed          Journal:  Pediatr Allergy Immunol        ISSN: 0905-6157            Impact factor:   5.464


ML is a modern comprehensive approach to characterize pediatric asthma phenotypes effectively and may be a promising tool to predict asthma and its progression.

INTRODUCTION

Modern methods of statistical and computational analysis offer a powerful tool to mine knowledge from various sources of healthcare data. In this context, machine learning (ML) has increasingly provided a new perspective for characterizing the heterogeneity of asthma among children and predicting its outcomes (Figure 1). Here, we provide a brief overview of ML approaches recently proposed to characterize pediatric asthma.
FIGURE 1

Word cloud analysis using the title of articles published in the last five years. The list of publications was obtained using the following search strategy (PUBMED): machine learning AND asthma AND children. The pre‐processing procedures applied were as follows: (1) removing words in the search strategy, non‐English words, or common words that do not provide information; (2) changing words into lower case, and (3) removing punctuation and white spaces. The size of the words is proportional to the observed frequency

Word cloud analysis using the title of articles published in the last five years. The list of publications was obtained using the following search strategy (PUBMED): machine learning AND asthma AND children. The pre‐processing procedures applied were as follows: (1) removing words in the search strategy, non‐English words, or common words that do not provide information; (2) changing words into lower case, and (3) removing punctuation and white spaces. The size of the words is proportional to the observed frequency

MACHINE LEARNING

ML can be defined as the study of computer algorithms that improve automatically through experience. In this sense, ML is an umbrella term encompassing all computational methods designed for learning from experience (available data) to improve performance and make accurate predictions. Unsupervised ML is concerned with the identification of data patterns in the absence of any pre‐defined outcome. Supervised ML involves learning a rule for predicting an outcome based on input‐output examples. Regarding unsupervised ML, data‐driven approaches using clustering methods could help characterize heterogeneous features of diseases among distinct patients. By unveiling the underlying structure of the data, cluster analysis can gather a set of samples into different clusters. For example, the k‐means algorithm is one of the most popular iterative descent clustering methods, aiming to minimize the sum of variance within clusters and maximize separation between clusters, thereby identifying distinct groups within the population. Conversely, agglomerative hierarchical clustering algorithms follow a tree structure where the elementary nodes represent the samples to be clustered, and the root node represents a supercluster containing all the samples. Regarding supervised ML, different classifiers have been implemented using regression or classification methods. The most used approaches remain linear regression for quantitative outcomes and logistic regression for categorical outcomes. However, in the era of big data, the mining potential of ML has increased substantially, and more advanced models are expanding; some of them are briefly introduced below. Regression trees are obtained by splitting the variables, constituting the tree's root node, into subsets that constitute the child nodes. Random forests are an ensemble learning method based on a family of decision trees. Support vector machines build a model that predicts new observations using a non‐probabilistic binary linear classifier. Deep learning has developed as a modern sub‐field of ML based on deep neural networks (NN), which can reproduce the mechanisms of the human brain in processing complex and high‐dimensional data such as images, video, or text. Deep learning is also linked to natural language processing, which is concerned with the analysis of large amounts of human language data to perform speech recognition and text classification.

PHENOTYPING OF PEDIATRIC ASTHMA

There is increasing recognition that pediatric asthma is a heterogeneous disease with multiple subtypes, which may have overlapping observable characteristics (phenotypes), but different underlying pathophysiological causes. In this context, ML has been recently used to uncover phenotypes of pediatric asthma and derive clusters based on a series of characteristics derived from high‐dimensional clinical data sets (Table 1). The most used techniques for phenotyping asthma in children include clustering methods (k‐means and hierarchical) and latent class analysis.
TABLE 1

ML approaches used in phenotyping asthma in children

ML approachStudy design and participantsDistinctive features of asthma clustersClusters identifiedRef.
Hierarchical clusteringCross‐sectional, 613 asthmatic childrenAge of onset, allergic sensitization, severity, and exacerbations in the previous year

Early‐onset mild atopic asthma

Early‐onset mild non‐atopic asthma

Late‐onset asthma

Difficult asthma

Exacerbation‐prone asthma

Deliu et al. 1
k‐means clusteringCross‐sectional, 351 asthmatic children from the Taiwanese Consortium of Childhood Asthma StudyLung function, symptom frequency, healthcare utilization, percentages of eosinophils and neutrophils in peripheral blood, and serum IgE

Asthma with elevated RBC and wheeze episodes

Neutrophil‐predominant asthma

Allergic asthma with preserved pulmonary function

Eosinophil‐predominant asthma with poor pulmonary function

Asthma with low wheeze episodes

Su et al. 2
LCACross‐sectional, 2593 children with mild to moderate persistent asthmaDemographic features, asthma control, sensitization, type 2 inflammatory markers, and lung function

Multiple sensitization with partially reversible airflow limitation

Multiple sensitization with reversible airflow limitation

Lesser sensitization with reversible airflow limitation

Multiple sensitization with normal lung function

Lesser sensitization with normal lung function

Fitzpatrick et al. 3

Abbreviations: LCA, latent class analysis; ML, machine learning; RBC, red blood cells.

ML approaches used in phenotyping asthma in children Early‐onset mild atopic asthma Early‐onset mild non‐atopic asthma Late‐onset asthma Difficult asthma Exacerbation‐prone asthma Asthma with elevated RBC and wheeze episodes Neutrophil‐predominant asthma Allergic asthma with preserved pulmonary function Eosinophil‐predominant asthma with poor pulmonary function Asthma with low wheeze episodes Multiple sensitization with partially reversible airflow limitation Multiple sensitization with reversible airflow limitation Lesser sensitization with reversible airflow limitation Multiple sensitization with normal lung function Lesser sensitization with normal lung function Abbreviations: LCA, latent class analysis; ML, machine learning; RBC, red blood cells. Different features of asthma have been identified to distinguish the clusters in several studies. Integrating the data derived from 613 children with asthma with clinical expert domain knowledge, Deliu et al. identified four distinctive features (age of onset, allergic sensitization, severity, and exacerbations in the previous year) informative of five different phenotypes of pediatric asthma. In the Taiwanese Consortium of Childhood Asthma Study, Su et al. combined clinical and functional features with gene expression profiles of 351 asthmatic children. They obtained five distinct phenotypes of childhood asthma, differing in lung function, symptom frequency, healthcare utilization, percentages of eosinophils and neutrophils in peripheral blood, and serum IgE. Also, five latent classes differing for demographic features, asthma control, sensitization, type 2 inflammatory markers, and lung function were identified from Fitzpatrick et al. among a large heterogeneous cohort of more than 2500 children with mild to moderate persistent asthma. Notably, multiple allergic sensitizations and partially reversible airflow limitation emerged as clinically valuable features of asthma in identifying children at the greatest risk for future exacerbation. Similarly, clusters with prominent type 2 inflammation and features of greater asthma severity have also been noted in previous cluster analyses of children with asthma. , Although valuable information has been gained, the results of these data‐driven approaches require further validation to expand our knowledge of asthma phenotypes in children.

PREDICTION OF ASTHMA AND ITS PROGRESSION

Advanced predictive analytic techniques such as ML have been very recently used to achieve the prediction of asthma and its progression. Recently, a segmented logit regression model was used to compute the probability of having asthma in children In particular, 7.9% and 14.7% resulted in the estimated bronchodilator response cutoff values at which the probability of predicting asthma was found to change significantly. ML models have also been trained on electronic health record data to distinguish between persistent and transient asthma cases in childhood. All the five models tested (naïve Bayes, logistic regression, k‐nearest neighbors, random forest, and gradient boosted trees) in the study were found to be able to predict childhood asthma persistence. ML has the potential to overcome the poor sensitivity and specificity of current prediction models for asthma exacerbations, including several associated factors such as epidemiologic, environmental, and physiologic factors. By means of an automatic ML model, Luo et al developed a predictive model for severe asthma exacerbation. The model achieved an area under the curve of 0.86 (95% CI, 0.846–0.871) and showed a negative predictive value of 97.8%, demonstrating an ability to identify children who are not at risk for exacerbation. In line with these findings, Sills et al. proved that advanced automatic ML techniques are superior to conventional ML approaches, such as random forest and logistic regression, in terms of the predictive power of the need for hospitalization of pediatric asthma patients in emergency departments (ED), suggesting that ML models could be successfully implemented to improve the ED workflow as well as to spare resources. Notwithstanding the aforementioned studies, further research is warranted to test and validate the model's generalizability on external datasets; moreover, ML models would benefit from achieving better predictive accuracy.

CONCLUSION

ML has recently become an increasingly used analytic tool to phenotyping for many disease processes. In asthma research, ML approaches have provided rapid advances in addressing pediatric asthma heterogeneity and uncover different disease phenotypes with potential translational impact in clinical practice. The ability of ML to include several data from multiple sources yields the advantage of improving predictive accuracy over conventional statistical methods. However, the number of variables required for ML models could also be a limitation since it may be challenging to choose variables that contribute most to the model. In addition, some variables may not always be available, which can limit replication of the models in external populations and their potential for application in clinical practice. Therefore, we can conclude that, in the field of pediatric asthma, ML is promising but still a work in progress.

CONFLICT OF INTEREST

Authors declared they have no conflict of interests.

AUTHOR CONTRIBUTIONS

Giovanna Cilluffo: Methodology (equal); Writing‐review & editing (equal). Salvatore Fasola: Methodology (equal); Writing‐review & editing (equal). Giuliana Ferrante: Conceptualization (equal); Writing‐review & editing (equal). Amelia Licari: Conceptualization (equal); Writing‐original draft (equal). Giuseppe Roberto Marseglia: Methodology (equal); Writing‐review & editing (equal). Andrea Albarelli: Methodology (equal); Writing‐review & editing (equal). Gian Luigi Marseglia: Supervision (lead); Writing‐review & editing (equal). Stefania La Grutta: Conceptualization (equal); Supervision (lead); Writing‐review & editing (equal).
  11 in total

1.  Heterogeneity of severe asthma in childhood: confirmation by cluster analysis of children in the National Institutes of Health/National Heart, Lung, and Blood Institute Severe Asthma Research Program.

Authors:  Anne M Fitzpatrick; W Gerald Teague; Deborah A Meyers; Stephen P Peters; Xingnan Li; Huashi Li; Sally E Wenzel; Shean Aujla; Mario Castro; Leonard B Bacharier; Benjamin M Gaston; Eugene R Bleecker; Wendy C Moore
Journal:  J Allergy Clin Immunol       Date:  2010-12-31       Impact factor: 10.793

2.  Two novel, severe asthma phenotypes identified during childhood using a clustering approach.

Authors:  Jocelyne Just; Rahele Gouvis-Echraghi; Sarah Rouve; Stephanie Wanin; David Moreau; Isabella Annesi-Maesano
Journal:  Eur Respir J       Date:  2012-01-20       Impact factor: 16.671

3.  Predicting hospitalization of pediatric asthma patients in emergency departments using machine learning.

Authors:  Marion R Sills; Mustafa Ozkaynak; Hoon Jang
Journal:  Int J Med Inform       Date:  2021-04-20       Impact factor: 4.046

4.  A model-based approach for assessing bronchodilator responsiveness in children: The conventional cutoff revisited.

Authors:  Gianluca Sottile; Giuliana Ferrante; Giovanna Cilluffo; Salvatore Fasola; Velia Malizia; Vito M R Muggeo; Giovanni Viegi; Stefania La Grutta
Journal:  J Allergy Clin Immunol       Date:  2020-08-07       Impact factor: 10.793

5.  Childhood asthma clusters reveal neutrophil-predominant phenotype with distinct gene expression.

Authors:  M-W Su; W-C Lin; C-H Tsai; B-L Chiang; Y-H Yang; Y-T Lin; L-C Wang; J-H Lee; C-C Chou; Y-F Wu; Y-L Yeh; Y L Lee
Journal:  Allergy       Date:  2018-07-31       Impact factor: 13.146

6.  Predicting Severe Asthma Exacerbations in Children: Blueprint for Today and Tomorrow.

Authors:  Nidhya Navanandan; Jonathan Hatoun; Juan C Celedón; Andrew H Liu
Journal:  J Allergy Clin Immunol Pract       Date:  2021-04-05

7.  Features of asthma which provide meaningful insights for understanding the disease heterogeneity.

Authors:  M Deliu; T S Yavuz; M Sperrin; D Belgrave; U M Sahiner; C Sackesen; O Kalayci; A Custovic
Journal:  Clin Exp Allergy       Date:  2017-09-15       Impact factor: 5.018

Review 8.  Machine learning: A modern approach to pediatric asthma.

Authors:  Giovanna Cilluffo; Salvatore Fasola; Giuliana Ferrante; Amelia Licari; Giuseppe Roberto Marseglia; Andrea Albarelli; Gian Luigi Marseglia; Stefania La Grutta
Journal:  Pediatr Allergy Immunol       Date:  2022-01       Impact factor: 5.464

9.  Heterogeneity of Mild to Moderate Persistent Asthma in Children: Confirmation by Latent Class Analysis and Association with 1-Year Outcomes.

Authors:  Anne M Fitzpatrick; Leonard B Bacharier; Daniel J Jackson; Stanley J Szefler; Avraham Beigelman; Michael Cabana; Ronina Covar; Theresa Guilbert; Fernando Holguin; Robert F Lemanske; Fernando D Martinez; Wayne Morgan; Wanda Phipatanakul; Jacqueline A Pongracic; Hengameh H Raissy; Robert S Zeiger; David T Mauger
Journal:  J Allergy Clin Immunol Pract       Date:  2020-03-07

10.  Developing a Model to Predict Hospital Encounters for Asthma in Asthmatic Patients: Secondary Analysis.

Authors:  Gang Luo; Shan He; Bryan L Stone; Flory L Nkoy; Michael D Johnson
Journal:  JMIR Med Inform       Date:  2020-01-21
View more
  1 in total

Review 1.  Machine learning: A modern approach to pediatric asthma.

Authors:  Giovanna Cilluffo; Salvatore Fasola; Giuliana Ferrante; Amelia Licari; Giuseppe Roberto Marseglia; Andrea Albarelli; Gian Luigi Marseglia; Stefania La Grutta
Journal:  Pediatr Allergy Immunol       Date:  2022-01       Impact factor: 5.464

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.