Alvar Agusti1,2,3,4, Rosa Faner3,4. 1. Respiratory InstituteHospital ClinicBarcelona, Spain. 2. University of BarcelonaBarcelona, Spain. 3. Institut d'Investigacions Biomediques August Pi i Sunyer (IDIBAPS)Barcelona, Spainand. 4. Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBERES)Madrid, Spain.
Our understanding of chronic obstructive pulmonary disease (COPD) is changing rapidly (1). Traditionally considered a self-inflicted disease caused by tobacco smoking and characterized by an accelerated rate of lung function decline with age (2), we now know that it is not always self-inflicted, as a substantial proportion of patients with COPD have never smoked (3), and both abnormal lung development and aging determine different lung-function trajectories that can lead to COPD in adulthood (Figure 1) (4–6). Accordingly, the concept of disease progression, traditionally linked to the decline in lung function over time, needs revision (1). For instance, not all patients exhibit rapid lung function decline (7, 8), different disease components can progress independent of each other (e.g., lung function and exacerbation rate), and multimorbid conditions can contribute to disease progression independent of lung function (1). In this issue of the Journal, Young and colleagues (pp. 294–302) applied modern machine learning (ML) methods to model disease progression in COPD (9). ML is a subset of artificial intelligence in which a computer system performs specific analysis using algorithms and statistical models without preestablished explicit instructions, relying only on observed patterns and inferences.
Figure 1.
Potential lung function trajectories through life. Reprinted by permission from Reference 1.
Potential lung function trajectories through life. Reprinted by permission from Reference 1.Young and colleagues (9) used an ML tool named Subtype and Stage Inference (SuStain), capable of “reconstructing long-term temporal progression of disease from cross-sectional data” (10). Using four computed tomography (CT) imaging features (two tissue measurements [emphysema and functional small airways disease] and two airway measurements [airway wall area and thickness]) determined in patients and control individuals participating in the COPDGene study (11), the authors asked SuStain to determine the subtypes of COPD (defined as “a group of subjects who share a particular trajectory of CT measurement evolution”), stages (defined as “the position on a subtype trajectory of an individual subject at a specific time, representing the degree of abnormality in imaging measurements”), and disease progression (defined as “change in stage with time, which occurs when an CT measurements becomes more abnormal relative to a control population”) (9). Findings were validated cross-sectionally in the ECLIPSE (Evaluation of COPD Longitudinally to Identify Predictive Surrogate End-Points) cohort (12) and longitudinally in the same COPDGene cohort, using 5 years follow-up data (9). Finally, Young and colleagues also investigated whether SuStain observations in patients with COPD could be identified in a subgroup of smokers with normal spirometry.Main results showed (9) that, cross-sectionally, SuStain identified two subtypes of patients in COPDGene. The most prevalent one (70%) was characterized by emphysema and peripheral airway disease being detectable “earlier” than central airway abnormalities (so authors named it Tissue→Airway), whereas the reverse occurred in the other subtype (Airway→Tissue). The clinical characteristics of these two subtypes were broadly similar, and in both, SuStain stages were related to airflow limitation severity (9). These observations were mostly reproduced in ECLIPSE (9), and these two subtypes remained consistent in 87% of COPDGene patients after 5 years of follow-up. Individual patients tended to progress in stage within each subtype, particularly GOLD 1–2 patients. Baseline SuStain stage correlated (weakly) with lung function decline during follow-up in both subtypes (9), and SuStain identified a subpopulation of control smokers (29%) with similar imaging abnormalities (subtypes) as those determined in patients with COPD (Tissue→Airway, 63%; Airway→Tissue, 37%), despite normal spirometric values. As in patients with COPD, these subtypes remained constant at 5-year follow-up, and SuStain stage was associated with lung function both at baseline and during follow-up (9).This article is important, novel, intriguing, and a bit difficult to follow because of the large amount of complex data it includes. As with any good article, though, it raises many questions.First, that COPD progression (a concept tightly related to a time axis) can be modeled from cross-sectional data (where the time axis is absent) is difficult to grasp. Admittedly, however, authors validate their cross-sectional results in COPDGene in an independent cohort (ECLIPSE) and, more important, explored changes over time using COPDGene follow-up data (9).Second, that X occurs earlier than Y (time axis again) means that Y may occur later. Whether or not this happened here is unclear, but judging from the fact that 87% of patients remained classified in the same subtype after 5-year follow-up, it seems unlikely. If so, does SuStain really model disease progression or simply identify two different and time-stable phenotypes? Actually, these two subtypes look remarkably similar to those described by Burrows and colleagues many years ago (type A [Tissue] and type B [Airway]) (13). Is ML rediscovering the wheel?Third, the results of the study by Young and colleagues (9) have to be contrasted and reconciled with those of two other recent studies that also used ML methods (admittedly different from SuStain) in the same COPDGene cohort but provided different results: one identified four (not two) trajectories (14) and the other questioned the concept of disease subtypes in favor of a continuum of COPD manifestations or “disease axes” (15). To what extent are different ML methods complementary, concordant, or help us to better understand COPD progression? In fact, real-life (not modeled) data from the Tasmanian cohort (which includes serial spirometric data from patients aged 7 to 53 yr) recently reported the existence of 6 (not 4, not 2) different lung function trajectories (16). It is likely that the much longer period of follow-up in the Tasmanian cohort (46 vs. 5 yr) brings a more precise description of reality in which, we speculate, an infinite number of disease trajectories may exist (Figure 1).Fourth, if the clinical characteristics of the two subtypes identified by Young and colleagues were similar, and both remained basically stable over time (9), what is their clinical relevance? In this context, it should be noted that other important components of disease progression, such as changes in exacerbation frequency or mortality, were not included in their analysis.Fifth, what can we learn about the pathogenesis (endotypes) of these two subtypes in particular, and COPD in general? Why, according to SuStain, does the disease start in the parenchyma in the majority of patients (70%), but in the large airways (which are irrelevant in terms of airflow limitation) in a relative minority (30%)? This seems at variance with results obtained using micro-CT ex vivo in lung tissue samples of patients with COPD that indicate that peripheral airways are reduced in number before emphysema develops (17), although we acknowledge that SuStain cannot disentangle (and hence includes in the same subtype) parenchymal and peripheral airway changes (9).Sixth, the observation that individual stage progression was more rapid in GOLD 1–2 than GOLD 3–4 patients (9) is in keeping with previous observations and may be consistent with patients belonging to different lung function trajectories (Figure 1) (1).Finally, observations in smokers with normal spirometry suggest that SuStain can detect early COPD (9). This is an important observation that needs replication in other studies.
Authors: John E McDonough; Ren Yuan; Masaru Suzuki; Nazgol Seyednejad; W Mark Elliott; Pablo G Sanchez; Alexander C Wright; Warren B Gefter; Leslie Litzky; Harvey O Coxson; Peter D Paré; Don D Sin; Richard A Pierce; Jason C Woods; Annette M McWilliams; John R Mayo; Stephen C Lam; Joel D Cooper; James C Hogg Journal: N Engl J Med Date: 2011-10-27 Impact factor: 91.245
Authors: Gregory L Kinney; Stephanie A Santorico; Kendra A Young; Michael H Cho; Peter J Castaldi; Raul San José Estépar; James C Ross; Jennifer G Dy; Barry J Make; Elizabeth A Regan; David A Lynch; Douglas C Everett; Sharon M Lutz; Edwin K Silverman; George R Washko; James D Crapo; John E Hokanson Journal: Am J Epidemiol Date: 2018-10-01 Impact factor: 4.897
Authors: Dinh S Bui; Haydn E Walters; John A Burgess; Jennifer L Perret; Minh Q Bui; Gayan Bowatte; Adrian J Lowe; Melissa A Russell; Bruce R Thompson; Garun S Hamilton; Alan L James; Graham G Giles; Paul S Thomas; Debbie Jarvis; Cecilie Svanes; Judith Garcia-Aymerich; Bircan Erbas; Peter A Frith; Katrina J Allen; Michael J Abramson; Caroline J Lodge; Shyamali C Dharmage Journal: Ann Am Thorac Soc Date: 2018-09
Authors: Alexandra L Young; Felix J S Bragman; Bojidar Rangelov; MeiLan K Han; Craig J Galbán; David A Lynch; David J Hawkes; Daniel C Alexander; John R Hurst Journal: Am J Respir Crit Care Med Date: 2020-02-01 Impact factor: 21.405
Authors: Jørgen Vestbo; Lisa D Edwards; Paul D Scanlon; Julie C Yates; Alvar Agusti; Per Bakke; Peter M A Calverley; Bartolome Celli; Harvey O Coxson; Courtney Crim; David A Lomas; William MacNee; Bruce E Miller; Edwin K Silverman; Ruth Tal-Singer; Emiel Wouters; Stephen I Rennard Journal: N Engl J Med Date: 2011-09-26 Impact factor: 91.245
Authors: James C Ross; Peter J Castaldi; Michael H Cho; Craig P Hersh; Farbod N Rahaghi; Gonzalo V Sánchez-Ferrero; Margaret M Parker; Augusto A Litonjua; David Sparrow; Jennifer G Dy; Edwin K Silverman; George R Washko; Raúl San José Estépar Journal: Am J Respir Crit Care Med Date: 2018-10-15 Impact factor: 30.528
Authors: Andy I Ritchie; Jonathon R Baker; Trisha M Parekh; James P Allinson; Surya P Bhatt; Louise E Donnelly; Gavin C Donaldson Journal: Am J Respir Crit Care Med Date: 2021-07-01 Impact factor: 21.405