| Literature DB >> 27512723 |
Matea Deliu1, Matthew Sperrin1, Danielle Belgrave2, Adnan Custovic2.
Abstract
Asthma is a heterogeneous disease comprising a number of subtypes which may be caused by different pathophysiologic mechanisms (sometimes referred to as endotypes) but may share similar observed characteristics (phenotypes). The use of unsupervised clustering in adult and paediatric populations has identified subtypes of asthma based on observable characteristics such as symptoms, lung function, atopy, eosinophilia, obesity, and age of onset. Here we describe different clustering methods and demonstrate their contributions to our understanding of the spectrum of asthma syndrome. Precise identification of asthma subtypes and their pathophysiological mechanisms may lead to stratification of patients, thus enabling more precise therapeutic and prevention approaches.Entities:
Keywords: Adult asthma; Asthma; Clustering; Endotypes; Paediatric asthma; Phenotypes
Year: 2016 PMID: 27512723 PMCID: PMC4959136 DOI: 10.1007/s41030-016-0017-z
Source DB: PubMed Journal: Pulm Ther ISSN: 2364-1754
Fig. 1Overview of the difference between agglomerative and divisive hierarchical clustering
Most commonly used linkage criteria
| Linkage criteria | |
|---|---|
| Centroid | Measures distance between the central points of each cluster |
| Ward’s method | Measures the distance between clusters as the ANOVA sum of squares—i.e. combining information over all cluster members |
| Complete | Measures the distance between the members of clusters farthest apart |
Fig. 2A silhouette plot used for non-hierarchical clustering (k-means) (from [20], with permission). A silhouette plot shows how close observations from neighbouring clusters are to each other using a measure of −1 to +1. A value of +1 indicates that observations are far away, 0 indicates that the observations are very close to the boundary of deciding exactly which cluster they belong to, and −1 indicates that the observations may be assigned to the wrong cluster
Studies using principal components analysis/factor analysis in asthma subtyping
| Cohort/data setting | Year | Age group | Sample size | No. variables | Method, % variance | Resulting components (PCA)/factors (FA) | Author group references |
|---|---|---|---|---|---|---|---|
| Manchester Asthma and Allergy Study | 2008 | 3 | 946 | 21 | PCA | [ | |
| 5 | 904 | 32 | Age 3: 47.5% | Age 3: 4 | |||
| Age 5: 49.8% | Age 5: 5 | ||||||
| 59 rural communities in Ecuador | 2011 | 7–15 | Mean 73 | 29 | PCA | 2 | [ |
| Component 1: 54.4% | |||||||
| Component 2: 50.1% | |||||||
| Component 3: 50.7% | |||||||
| Three clinical trials | 2012 | 15–79 | 1114 | 21 | PCA | 6 | [ |
| 76% cumulative | |||||||
| Generation R study | 2012 | ≤4 | 2173 | 21 | PCA | 2 | [ |
| Component 1: 16.3% | |||||||
| 8.2% | |||||||
| Education department Sao Francisco do Conde, Brazil | 2013 | 6–12 | 1307 | 22 | PCA | 2 | [ |
| 45.7% cumulative | |||||||
| COREA | 2013 | Avg age 70.2 | 434 | 11 | PCA | Elderly: 4 | [ |
| 53.5% cumulative | |||||||
| Non-elderly: 4 | |||||||
| Avg age 44.2 | 1633 | ||||||
| Manchester Asthma and Allergy Study | 2014 | Children | 1051 | 97 | PCA | 3 | [ |
| 15.3% cumulative | |||||||
| Riyadh Cohort Study | 2014 | 7–17 | 195 | 6 | PCA | 2 | [ |
| 57.3% cumulative | |||||||
| COPSAC | 2015 | Neonates | 411 | 5 | PCA | 1 | [ |
| 41% cumulative | |||||||
| CAMP, CARE, PACT, ACRN, IMPACT, SOCS | 2015 | Children | 327 | 6 | PCA | 6 | [ |
| 100% cumulative | |||||||
| University of Alabama at Birmingham Pulmonary Medicine Clinic | 1992 | Adults | 199 | 10 | FA | 3 | [ |
| Institute of Immunoallergology, Florence IT | 1999 | 16–75 | 99 | 8 | FA | 3 | [ |
| 74.8% cumulative | |||||||
| European Community Respiratory Health Study | 2000 | 20–44 | 16,635 | 18 | FA | 4 | [ |
| 58% cumulative | |||||||
| Tucson Children’s Respiratory Study | 2001 | 6–11 | 877 | 25 | FA | 2 | [ |
| 22.6% cumulative | |||||||
| Stable chronic asthmatics | 2001 | Adults | 69 | – | FA | 3 | [ |
| 78% cumulative | |||||||
| Salmeterol Quality of Life Study Group | 2004 | >12 | 763 | 21 | FA | 4 | [ |
| 80.8% cumulative | |||||||
| Health Maintenance Organisation, Kaiser-Permanente, US | 2005 | 18–56 | 2854 | 53 | FA | 5 | [ |
| 59% cumulative | |||||||
| Paediatric outpatients, Chinese University of Hong Kong | 2005 | 7–18 | 92 | 12 | FA | 5 | [ |
| 64.6% cumulative | |||||||
| Childhood Asthma Management Program, clinical trial, Boston, USA | 2008 | 5–12 | 990 | 17 | FA | 5 | [ |
| 51.2% cumulative |
Avg average, COREA (Korea) Cohort for Reality and Evolution of Adult Asthma in Korea, COPSAC (Denmark) Copenhagen Prospective Study on Asthma in Childhood, CAMP (US) Childhood Asthma Management Program, IMPACT (US) Improving Asthma Control Trial, PACT (US) Pediatric Asthma Controller Trial, CARE (US) Childhood Asthma Research and Education Network, SOCS (US) Salmeterol or Corticosteroids Study, ACRN (US) Asthma Clinical Research Network, MAAS Manchester Asthma and Allergy Study
Studies using model-free approaches for subtyping asthma
| Cohort/data setting | Year | Age group (years) | Sample size, | Data reduction technique | Method of clustering | Number of clusters | Author group reference |
|---|---|---|---|---|---|---|---|
| Glenfield Hospital Difficult Asthma Clinic | 2008 | Avg age: 49.2 | 184 | PCA | Two-step: | 3 | [ |
| Clustering | 4 | ||||||
| Avg age: 43.4 | 187 | ||||||
|
| 3 | ||||||
| Avg age: 52.4 | 68 | ||||||
| Random selection of patients in Wellington, NZ | 2009 | 25–75 | 175 | Two-step: | Agnes: 5 | [ | |
| Diana: 4 | |||||||
| SARP | 2010 | 12–80 | 726 | Ward’s hierarchical clustering | 5 | [ | |
| Post hoc | |||||||
| Discriminant analysis for tree analysis | |||||||
| Asthma Severity Modifying Polymorphisms Project, USA | 2010 | 6–20 | 154 | PCA | Two-step: | 3 | [ |
| Hierarchical clustering | |||||||
|
| |||||||
| SARP | 2011 | 6–17 | 161 | Ward’s hierarchical clustering | 4 | [ | |
| Centroid linkage | |||||||
| Post hoc | |||||||
| Fisher discriminant analysis-predictors of cluster assignment | |||||||
| John Hunter Hospital Ambulatory Care Clinic, Newcastle, Australia | 2011 | 19–75 | 72 | Hierarchical clustering | 3 | [ | |
| Complete linkage | |||||||
| TAP | 2012 | 6–12 | 315 | PCA | Two-step: | 3 | [ |
|
| |||||||
| Ward’s hierarchical clustering | |||||||
| Korean Genome Research Centre for Allergy and Respiratory Diseases cohort | 2012 | Adults | 86 | Two-step: | 4 | [ | |
| Hierarchical cluster analysis | |||||||
|
| |||||||
| NYUBAR, New York City, Bellevue Hospital Center Asthma Clinic | 2012 | 18–75 | 471 | Ward’s hierarchical clustering | 5 | [ | |
| TAP | 2012 | 0–3 | 551 | Ward’s hierarchical clustering | 3 | [ | |
| Post hoc | |||||||
| Classification and regression trees | |||||||
| Random forest for predictors of cluster assignment | |||||||
| TAP | 2012 | <36 mos | 79 | Ward’s hierarchical clustering | 3 | [ | |
| TALC and BASALT trials, USA | 2012 | Avg age: 37.6 | 250 | Ward’s hierarchical clustering | 4 | [ | |
| Post hoc | |||||||
| Discriminant analysis for predicting cluster membership | |||||||
| TAP | 2013 | 5 | 150 | Ward’s hierarchical clustering | 4 | [ | |
| COREA | 2013 | >18 | 724 | Two-step: | 4 | [ | |
| 1843 | Ward’s hierarchical clustering | ||||||
|
| |||||||
| University of Tsukuba Hospital, Hokkaido University Hospital | 2013 | 16–84 | 800 | Ward’s hierarchical clustering | 6 | [ | |
| Post hoc | |||||||
| Random forest for predictors of cluster assignment | |||||||
| Quebec City Case–Control Asthma Cohort | 2013 | Avg age: 35.7 | 377 | Factor analysis | Two-step: | 4 | [ |
|
| |||||||
| Niigata University Hospital, Japan | 2013 | Avg age: 59.8 | 86 | Step-wise multiple regression | Ward’s hierarchical clustering | 3 | [ |
| Decision tree analysis for cluster assignment | |||||||
| Paediatric Asthma Clinic, Hacettepe University, Ankara, Turkey | 2013 | 6–18 | 383 | Factor analysis | Hierarchical clustering | 4 | [ |
| Gower, Jaccard distances | |||||||
| Logistic models | |||||||
| Dutch multi-centre study | 2013 | Adults | 200 | Factor analysis | Wards hierarchical clustering | 3 | [ |
|
| |||||||
| The epidemiology and natural history of asthma: outcomes and treatment regimens, San Diego, USA | 2014 | 6–11 | 518 | Ward’s hierarchical clustering | Children: 5 | [ | |
| SARP | 2014 | Adults | 378 | InfoGain [ |
| 6 | [ |
| Ward’s hierarchical clustering | |||||||
| Outpatient clinics, Portugal | 2015 | Avg age: 45.6 | 57 | Ward’s hierarchical clustering | 5 clusters | [ |
PCA principal components analysis, FA factor analysis, Avg average, SARP Severe Asthma Research Program (USA), GLAD (UK) GPIAG [General Practitioners in Asthma Group] and Leicester Asthma and Dysfunctional breathing study, TAP Trousseau Asthma Program (Paris, FR), COREA (Korea) Cohort for Reality and Evolution of Adult Asthma in Korea