Literature DB >> 18268941

A new staging strategy for chronic obstructive pulmonary disease.

Martin R Miller1, Ole F Pedersen, Asger Dirksen.   

Abstract

BACKGROUND: The best method for expressing lung function impairment is undecided. We tested in a population of patients with chronic obstructive pulmonary disease (COPD) whether forced expiratory volume in 1 second (FEV1) or FEV1 divided by height squared (FEV1/ht2) was better than FEV1 percent predicted (FEV1PP) for predicting survival.
METHOD: FEV1, FEV1PP, and FEV1/ht2 recorded post bronchodilator were compared as predictors of survival in 1095 COPD patients followed for 15 years. A staging system for severity of COPD was defined from FEV1/ht2 and compared with the Global Initiative for Obstructive Lung Disease (GOLD) staging system. RESULT: FEV1/ht2 was a better univariate predictor of survival in COPD than FEV1 and both were better than FEV1PP. The best multivariate model for predicting survival included FEV1/ht2, age and sex. Comparing the GOLD stages with the FEV1/ht2 groups found that survival was more coherent within each FEV1/ht2 group than it was within each GOLD stage. FEV1/ht2 had 60% more people in its most severe group than the severest GOLD stage with these extra subjects having equivalently poor survival and had 155% more in the least severe group with equivalent survival. GOLD staging misclassified 51% of subjects with regard to survival.
CONCLUSION: We conclude that GOLD criteria using FEV1PP do not optimally stage COPD with regard to survival. An alternative strategy using FEV1/ht2 improves the staging of this disease. Studies which stratify COPD patients to determine the effect of interventions such as drug trials, rehabilitation, or management guidelines should consider alternatives to the GOLD classification.

Entities:  

Mesh:

Year:  2007        PMID: 18268941      PMCID: PMC2699963     

Source DB:  PubMed          Journal:  Int J Chron Obstruct Pulmon Dis        ISSN: 1176-9106


Introduction

Lung function measurements are made to help determine if a subject has an abnormality of lung function and also to monitor a disease or its response to treatment. The degree of any abnormality may be needed to determine choices about therapy or level of disability benefit but may also be needed to estimate prognosis. To determine if a lung function result is abnormal the American Thoracic Society (ATS) and European Respiratory Society (ERS) recommend (Quanjer et al 1993; ATS 1994) that the individual’s lung function data should be compared with a predicted value using the method of standardized residuals (SR). SR are calculated from: where RSD is the residual standard deviation from the regression equation used for the prediction, so that an FEV1SR of −1.645 is at the lower 90% confidence limit of normality. When it comes to expressing the degree of any such abnormality, that is the severity of impairment, there is no agreed method that gives equivalent results for all ages and both sexes. The method of percent of predicted (PP) is widely used for expressing disease severity (ATS 1987) but its validity is not based on statistical evidence (Sobol and Weinheimer 1966; Miller and Pincock 1988). SR are potentially flawed for expressing severity in that a young subject’s predicted value is much higher than that for an older subject, so a young subject can have a forced expiratory volume in 1 second (FEV1) 5 SR below predicted which for an older subject would never be found as the FEV1 would have to be below zero. The PP method has been used in many epidemiological studies (Lange et al 1990; Knuiman et al 1999; Schunemann et al 2000; Thomason and Strachan 2000) showing that FEV1 expressed as percent predicted (FEV1PP) is an important predictor of survival. However, in chronic obstructive pulmonary disease (COPD) it was raw FEV1 that was originally shown to relate to survival (Burrows and Earle 1969). The Framingham study found that FEV1 divided by height squared was an important predictor of survival (Ashley et al 1975) in the general population, a finding recently reinforced from the Reykjavik study (Chinn et al 2007). More recently the PP method has been used with arbitrary threshold values in guidelines with regard to specifying the severity of COPD (ATS 1995; Siafakas et al 1995) and asthma (British guideline 2005) and the Global Initiative for Chronic Obstructive Lung disease (GOLD) has empirically defined 5 stages of severity (NIH et al 2005) based on FEV1PP. Concern has been raised that this system is not evidence-based (Kerstjens 2004). Prediction equations such as those recommended by the ERS (Quanjer et al 1993) are used to derive a predicted value for an individual which tries to take into account the effect of age, height, and sex on lung function, but this is imprecise with wide confidence intervals and the data from which the equations are derived may not perfectly match the patient under consideration. Therefore any method of relating lung function to a predicted value may introduce some age, height or sex related error. We have therefore tested whether lung function impairment expressed as absolute FEV1 or standardized by height might be better than FEV1PP and FEV1SR in predicting survival in COPD.

Methods

We have reanalyzed the data from 1095 subjects with COPD who had been recruited from 1983 to 1988 in a respiratory clinic in Copenhagen into a longitudinal study to look at predictors of their all cause mortality (Hansen et al 1999, 2001) over a 15 year follow up. They were all assessed by a consultant respiratory physician and diagnosed as having COPD based from symptoms of cough, sputum, lack of variability in airflow limitation, lack of allergy and atopy, their smoking history and their FEV1 as a percent of forced vital capacity (FVC) had to be less than 89% of their predicted value, in accordance with ERS recommendations at that time (Siafakas et al 1995). There was no within day variation in lung function and no significant acute or corticosteroid reversibility to their airflow obstruction. We have analyzed the spirometric data on entry into the study which were obtained after both an inhaled bronchodilator and a 2 week trial of oral corticosteroids, ie, maximally bronchodilated as per GOLD recommendation (NIH et al 2005). Each subject’s data had their FEV1 expressed as percent predicted (PP) using ERS prediction equations (Quanjer et al 1993), as FEV1 divided by height squared (FEV1/ht2) and as standardized residuals (SR) (Miller et al 1985; Quanjer et al 1993). Subjects were staged by GOLD criteria (NIH et al 2005) and also arbitrarily by FEV1/ht2 into 4 stages as defined in Table 1. Because GOLD stage 1 only identified 6 subjects in this cohort the staging by FEV1/ht2 was into 4 and not 5 arbitrary groups in order to allow fairer comparison between the two classification systems.
Table 1

Classification criteria for GOLD and FEV1/ht2 staging from spirometry alone. FEV1% is the FEV1 expressed as a percentage of FVC. FEV1PP is the FEV1 expressed as a percentage of the subject’s predicted value

GOLD staging
FEV1/ht2 staging
StageCriteriaStageCriteria
0FEV1% ≥70%
1FEV1% < 70% and
FEV1PP ≥ 80%10.5 < FEV1/ht2
2FEV1% < 70% and
FEV1 PP < 80%20.4 < FEV1/ht2 ≤ 0.5
3FEV1% < 70% and
FEV1PP < 50%30.3 < FEV1/ht2 ≤ 0.4
4FEV1% < 70% and
FEV1PP < 30%4FEV1/ht2 ≤ 0.3
Statistical analysis was undertaken using SPSS for Windows version 11.5 (Microsoft, Redmond, WA). Cox proportional hazards regression analysis was used for predicting all cause mortality using quintiles of putative predictor variables. This method is a special form of multiple regression that allows for the inclusion of the censored data for those cases for which the end point of death has not yet occurred. The best prediction models were judged on the basis of highest global chi-square (χ2) value and greatest reduction in −2 times the log likelihood (−2LL) for different models (Cox 1972). The proportional hazards model assumes that the ratio of the estimated hazard functions for any two observations with different values for the independent (predictor) variables is constant over time. This assumption was checked for each model from plots of cumulative hazard against survival time stratified by the relevant FEV1 index. For the multivariate Cox models, hazard ratios were calculated by contrasting risk against that for the first stage of each index. Subjects were categorized by their GOLD and FEV1/ht2 stages and Kaplan-Meier survival analysis was used to estimate mean survival for every subgroup of subjects defined by the 4 × 4 matrix of GOLD and FEV1/ht2 stages. Mean survival was used rather than median survival since some subgroups were too small for reliable estimates of the median.

Results

Age, height, and spirometric values are shown in Table 2 for the 1095 subjects of whom 644 (59%) were female. The median survival from entry into the longitudinal study was 3208 days, with 723 (66%) dying during the period of follow up (median survival 2231 days versus 4083 days for survivors), of whom 404 were female which was slightly lower than the number (425) expected by chance (χ2 = 7.57, df = 1, p = 0.006). Univariate Cox proportional hazards models for survival were derived with the following variables as predictors: height, sex, body mass index (BMI), FEV1, FEV1/ht2, FEV1PP and FEV1SR. Table 3 shows the strength of univariate association with survival for each index as a continuous variable with FEV1/ht2 being the best univariate predictor.
Table 2

Descriptive data for age, height and spirometric values for the 1095 patients in the study (644 female) showing mean, standard deviation (SD) and median values

Mean (SD)Median
Age61.8 (9.9)63.0
Ht in meters1.66 (0.09)1.65
FVC in L2.25 (0.76)2.14
FVCSR−1.87 (1.10)−1.86
FEV1 in L1.21 (0.45)1.14
FEVSR−3.21 (1.02)−3.24
FEV1PP46.7 (14.5)45.5
FEV1/ht20.43 (0.15)0.41
FEV1%54.4 (12.2)55.0
FEV1%SR−3.31 (1.76)−3.23
PEF in L/s3.60 (1.47)3.33
PEFSR−3.11 (1.32)−3.28
Table 3

Univariate analysis: Indices that were found by Cox regression to be univariate predictors of survival with χ2 value for strength of prediction and its significance value

Indexχ2valuep value
FEV1/ht2161<0.0001
Age159<0.0001
FEV1127<0.0001
FEV1PP111<0.0001
FEV1%97<0.0001
FEV1SR18<0.0001
Sex12<0.001
BMI11<0.001
Weight80.004
Height0
To allow hazard ratios for various predictors to be calculated Cox regression models were constructed with sex and quintiles of age and the various lung function indices. The best multivariate model was with FEV1/ht2, age and sex (χ2 = 289, −2LL = 9025) and the hazard ratios for the predictors are shown in Table 4. The other FEV1 indices gave less good models with the next best being with raw FEV1, followed by FEV1PP and then FEV1SR. The model with FEV1/ht2 had a lower HR for male sex than that with raw FEV1 suggesting the standardization by height had taken into account some sex difference. The models with PP and SR found the HR for male sex was no longer significant. BMI quintiles improved each of these models slightly but only the lightest BMI quintile had a significantly increased hazard ratio (HR = 1.4, 95% CL 1.1 to 1.8) compared with the other BMI quintiles.
Table 4

Multivariate analysis: The table shows the covariate hazard ratios (HR) for death (with 95% confidence limits) for Cox proportional hazards models using sex and the quintiles of age and various methods for expressing FEV1 impairment. The HR values are contrasts against the hazard estimated for the first stage of each group and for male contrasted with female. Chi-square values are shown for each model

FEV1/ht2Chi square = 289Raw FEV1Chi square = 276FEV1PPChi square = 269FEV1SRChi square = 218
HR95% CLHR95% CLHR95% CLHR95% CL
Ageyoungest quintile1.01.01.01.0
quintile 21.8(1.3 to 2.4)1.8(1.3 to 2.4)2.0(1.5 to 2.6)2.1(1.6 to 2.9)
quintile 32.2(1.6 to 2.9)2.2(1.7 to 3.0)2.6(2.0 to 3.5)3.1(2.4 to 4.2)
quintile 42.5(1.8 to 3.3)2.5(1.9 to 3.3)3.2(2.4 to 4.3)4.1(3.1 to 5.4)
oldest quintile3.1(2.4 to 4.1)3.1(2.3 to 4.1)4.3(3.3 to 5.7)6.0(4.5 to 7.9)
FEV1 indexbest quintile1.01.01.01.0
quintile 21.6(1.2 to 2.1)1.6(1.2 to 2.1)1.5(1.1 to 1.9)1.2(0.9 to 1.5)
quintile 32.0(1.5 to 2.7)1.9(1.4 to 2.5)1.8(1.4 to 2.4)1.6(1.3 to 2.0)
quintile 42.4(1.8 to 3.2)2.4(1.8 to 3.2)2.4(1.8 to 3.0)2.0(1.5 to 2.5)
worst quintile4.0(3.0 to 5.2)3.9(3.0 to 5.1)3.4(2.6 to 4.3)2.6(2.0 to 3.3)
Male1.5(1.3 to 1.7)1.8(1.5 to 2.0)1.1(0.9 to 1.3)1.1(0.9 to 1.3)
Figure 1 shows mean survival (with 95% CL bars) for each FEV1/ht2 group compared with the GOLD stage. Figure 2 shows the numbers of subjects in each of the staging groups and the full data used in these Figures is presented in Table 5. The mean survival for the FEV1/ht2 groups decreases progressively and fairly evenly whereas the survival for GOLD 0 and 2 are very similar. There were 308 subjects in FEV1/ht2 stage 1 which was 2.5 times the number of subjects in the equivalent GOLD stage 0 (121). More than half of the subjects in each of GOLD stages 2 and 3 were in discordant FEV1/ht stages and the worst FEV1/ht2 stage included 1.6 times as many subjects as did GOLD stage 4 (199 versus 126). Overall 51% of subjects were in discordant staging groups. The poor staging by GOLD was confirmed from a Cox regression model using just the GOLD stage as predictor for survival, where the χ2 value was only 81, which was significantly inferior to a model using just the FEV1/ht2 stage which yielded χ2 value of 166.
Figure 1

The mean survival for each FEV1/ht2 group and each GOLD stage with 95% confidence limit bars and the number of subjects in each column.

Figure 2

Plot of the number of subjects in each of the groups for the FEV1/ht2 staging system on the left and the GOLD staging system on the right with the mean survival of each group stated inside the relevant column.

Table 5

Cross tabulation of concordance in the 1095 subjects between GOLD classification (GOLD stages 0 to 4) and FEV1/ht2 classification for COPD severity. The table shows the number of subjects dying (nd) under follow up as a fraction of the total number N within that group (nd/N), the estimated Kaplan-Meier mean survival for that group as years in bold and the 95% confidence limits for the estimate in parenthesis. Aggregate results for each columns and row as a whole are also shown

Staging by FEV1/ht2
Stage 1(>0.5)Stage 2 (>0.4 and0.5)Stage 3 (>0.3 and0.4)Stage 4 (0.3)
GOLD 029/7511.2(10.2–12.2)13/299.8(8.4–11.3)12/159.0(7.1–10.8)2/25.7(0.0–13.3)56/12110.6(9.8–11.4)
GOLD 288/19510.8(10.2–11.4)66/1019.3(8.5–10.2)15/186.2(4.7–7.6)169/31410.1(9.6–10.6)
GOLD 317/3710.7(9.6–11.9)101/1508.7(8.0–9.4)194/2537.9(7.4–8.4)76/856.1(5.2–6.9)388/5258.0(7.7–8.4)
GOLD 41/111/136.6(4.3–8.9)98/1125.8(5.1–6.5)110/1266.0(5.2–6.7)

135/30811.0(10.5–11.4)180/2809.1(8.6–9.7)232/2997.8(7.4–8.3)176/1995.9(5.4–6.5)

Discussion

We present the first evidence that FEV1PP is not as good as either raw FEV1 or FEV1/ht2 at predicting survival in a cohort of COPD patients. This finding confirms concerns previously expressed about the use of PP as a method for expressing impairment in FEV1 (Sobol and Weinheimer 1966; Miller and Pincock 1988). The assumption that a fixed level of FEV1PP means the same level of impairment for different subjects is clearly flawed since, for example, young adults with cystic fibrosis can survive with an FEV1 just as low in absolute terms as old subjects, with their PP values going as low as 10% (Sood et al 2001) which is much lower than that seen in the elderly. Several studies have shown that survival in COPD relates to FEV1 (Burrows and Earle 1969; Gorecka et al 1997; Thomason and Strachan 2000) with values lower than 0.75 litre having a 3 year survival of only 50% (Burrows and Earle 1969). When using the GOLD criteria to stratify our subjects there were only 11.5% subjects in stage 4 whereas the method based on FEV1/ht2 increased the proportion in the severest stage to 18.2% and yet these had a comparably poor survival. This suggests that estimating severity of COPD from FEV1PP criteria does not identify all those subjects who are most severely affected. Concern has been expressed that the GOLD criteria, which were arbitrarily chosen to define severity of COPD, are not based on any evidence of their ability to predict survival or any other aspect of COPD management (Kerstjens 2004). Our data provide strong evidence that the GOLD criteria are not adequate for correctly placing individual patients into disease stage categories that relate to survival. We do not have data on individual symptoms to determine how the two classification systems of COPD relate to symptoms and performance but this could be verified in other data sets. Furthermore we found that within the GOLD stages there were subgroups with differing survival that were better stratified by FEV1/ht 2 staging. The definition of stage 4 by GOLD criteria severity (NIH et al 2005) can include subjects with less severe spirometry results but who are in type 2 respiratory failure. Therefore it is possible that some of our subjects allocated to GOLD 3 might in fact be in GOLD stage 4 if their arterial gas results were known. However, it seems unlikely that the 85 subjects in the worst FEV1/ht2 stage but who we classified as GOLD 3 had been in type 2 respiratory failure on entry into the study because their mean survival was 6.1 years. It seems clear that FEV1/ht2 can stratify subjects with COPD into survival groups better than GOLD criteria without recourse to other data such as arterial gases. Recent work has emphasized that multivariate analysis including data on BMI, the degree of airflow obstruction as FEV1PP, dyspnoea, and exercise capacity measured by the six-minute-walk test (BODE index) was better at predicting survival in COPD than univariate analysis (Celli et al 2004). However, our data suggest that the use of FEV1/ht2 instead of FEV1PP as the index of airflow obstruction in this form of multivariate analysis would further improve the survival prediction. Additional problems with the PP and SR methods for expressing degree of abnormality occur because they require an estimate of a subject’s predicted value. The regression equations recommended by the ERS for FEV1 only account for about 58% of the variation of FEV1 in normal subjects, thus the predicted value is not a precise estimate and will include errors related to sex, age, height, and technical issues (Quanjer et al 1993). Furthermore if the subject whose predicted value is being derived is from a population different from that used to derive the prediction equation or the subject’s age is outside the limits of the population used in the equation then further errors will be incurred. Our data show FEV1/ht2 is better than both raw FEV1 and FEV1PP for expressing degree of lung function impairment. If this dataset is analyzed with the two sexes considered independently then FEV1/ht2 gives results indistinguishable from that for raw FEV1 in terms of ability to predict all cause mortality. However, in clinical practice it is advantageous to have a method for assessing risk that is applicable for both sexes in an equivalent way. FEV1/ht2 is one way of taking some of the sex differences in lung function into account without introducing the potential errors inherent in trying to predict what a given subject’s FEV1 should be (Quanjer et al 1993). A recent study of all cause mortality in a general population confirmed that FEV1/ht2 was better related to survival than FEV1PP (Chinn et al 2007) and we now confirm this finding in a cohort of COPD patients. We conclude that FEV1/ht2 is better related to survival in COPD than FEV1PP and so may be the best method for expressing degree of FEV1 impairment. Our findings indicate that applying the GOLD criteria to COPD management does not optimally classify those most severely affected. So studies which need to stratify COPD patients in order to determine the potential benefits of interventions such as drug trials, rehabilitation programmes or even discharge policies should not just consider the GOLD classification but also consider using FEV1/ht2 or other strategies to assess disease severity correctly. Our evidence suggests using GOLD criteria alone will have suboptimal power to show any benefits from such interventions. We believe future scientific endeavor in COPD must not be limited by the arbitrary GOLD staging which here failed to identify over one third of the worst prognosis group. Alternative classifications should now be prospectively compared so that the best results for managing and researching into COPD can be achieved.
  22 in total

1.  Which spirometric indices best predict subsequent death from chronic obstructive pulmonary disease?

Authors:  M J Thomason; D P Strachan
Journal:  Thorax       Date:  2000-09       Impact factor: 9.139

2.  British guideline on the management of asthma.

Authors: 
Journal:  Thorax       Date:  2003-02       Impact factor: 9.139

3.  The GOLD classification has not advanced understanding of COPD.

Authors:  Huib A M Kerstjens
Journal:  Am J Respir Crit Care Med       Date:  2004-08-01       Impact factor: 21.405

4.  Course and prognosis of chronic obstructive lung disease. A prospective study of 200 patients.

Authors:  B Burrows; R H Earle
Journal:  N Engl J Med       Date:  1969-02-20       Impact factor: 91.245

5.  Pulmonary function: relation to aging, cigarette habit, and mortality.

Authors:  F Ashley; W B Kannel; P D Sorlie; R Masson
Journal:  Ann Intern Med       Date:  1975-06       Impact factor: 25.391

6.  Optimum expression of adult lung function based on all-cause mortality: results from the Reykjavik study.

Authors:  S Chinn; T Gislason; T Aspelund; V Gudnason
Journal:  Respir Med       Date:  2006-08-04       Impact factor: 3.415

7.  Standardization of Spirometry, 1994 Update. American Thoracic Society.

Authors: 
Journal:  Am J Respir Crit Care Med       Date:  1995-09       Impact factor: 21.405

8.  Patterns of spirogram abnormality in individual smokers.

Authors:  M R Miller; A C Pincock; D M Grove
Journal:  Am Rev Respir Dis       Date:  1985-11

9.  Outcomes of intensive care unit care in adults with cystic fibrosis.

Authors:  N Sood; L J Paradowski; J R Yankaskas
Journal:  Am J Respir Crit Care Med       Date:  2001-02       Impact factor: 21.405

10.  Optimal assessment and management of chronic obstructive pulmonary disease (COPD). The European Respiratory Society Task Force.

Authors:  N M Siafakas; P Vermeire; N B Pride; P Paoletti; J Gibson; P Howard; J C Yernault; M Decramer; T Higenbottam; D S Postma
Journal:  Eur Respir J       Date:  1995-08       Impact factor: 16.671

View more
  8 in total

1.  ARTP statement on pulmonary function testing 2020.

Authors:  Karl Peter Sylvester; Nigel Clayton; Ian Cliff; Michael Hepple; Adrian Kendrick; Jane Kirkby; Martin Miller; Alan Moore; Gerrard Francis Rafferty; Liam O'Reilly; Joanna Shakespeare; Laurie Smith; Trefor Watts; Martyn Bucknall; Keith Butterfield
Journal:  BMJ Open Respir Res       Date:  2020-07

2.  Differences between absolute and predicted values of forced expiratory volumes to classify ventilatory impairment in chronic obstructive pulmonary disease.

Authors:  William Checkley; Marilyn G Foreman; Surya P Bhatt; Mark T Dransfield; MeiLan Han; Nicola A Hanania; Nadia N Hansel; Elizabeth A Regan; Robert A Wise
Journal:  Respir Med       Date:  2015-11-11       Impact factor: 3.415

3.  Palliative care or end-of-life care in advanced chronic obstructive pulmonary disease: a prospective community survey.

Authors:  Patrick White; Suzanne White; Polly Edmonds; Marjolein Gysels; John Moxham; Paul Seed; Cathy Shipman
Journal:  Br J Gen Pract       Date:  2011-06       Impact factor: 5.386

4.  Staging the severity of chronic obstructive pulmonary disease in older persons based on spirometric Z-scores.

Authors:  Carlos A Vaz Fragoso; John Concato; Gail McAvay; H Klar Yaggi; Peter H Van Ness; Thomas M Gill
Journal:  J Am Geriatr Soc       Date:  2011-09-13       Impact factor: 5.562

5.  The impact of different spirometric definitions on the prevalence of airway obstruction and their association with respiratory symptoms.

Authors:  Howraman Meteran; Martin R Miller; Simon Francis Thomsen; Kaare Christensen; Torben Sigsgaard; Vibeke Backer
Journal:  ERJ Open Res       Date:  2017-12-08

6.  How to Stage Airflow Limitation in Stable Chronic Obstructive Pulmonary Disease Male Patients?

Authors:  Ichraf Anane; Fatma Guezguez; Hend Knaz; Helmi Ben Saad
Journal:  Am J Mens Health       Date:  2020 May-Jun

7.  Influences of Two FEV1 Reference Equations (GLI-2012 and GIRH-2017) on Airflow Limitation Classification Among COPD Patients.

Authors:  Dafei Wei; Qi Wang; Shasha Liu; Xiaowu Tan; Lin Chen; Rongfang Tu; Qing Liu; Yuanhang Jia; Sha Liu
Journal:  Int J Chron Obstruct Pulmon Dis       Date:  2022-09-02

8.  Lung function indices for predicting mortality in COPD.

Authors:  Afroditi K Boutou; Dinesh Shrikrishna; Rebecca J Tanner; Cayley Smith; Julia L Kelly; Simon P Ward; Michael I Polkey; Nicholas S Hopkinson
Journal:  Eur Respir J       Date:  2013-01-24       Impact factor: 16.671

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.