Lung function tests are used in clinical practice to detect “abnormality” against a background of “normal” function. “Abnormality” may be determined either by (i) detection of some discrete abnormality (eg, the shape of the flow volume loop) or (ii) by the use of reference values to show a relative change in values (eg, forced expiratory volume in one second [FEV1] = 30% of predicted) or (iii) relative differences in several values (eg, obstructive spirometry but relatively raised lung volumes). Lung function tests are also used to longitudinally monitor therapy or disease progression.Using reference values to produce “percent of predicted” measures can lead to serious misinterpretation of results and subsequent inappropriate diagnosis and treatment. It has been recognized in Europe and North America for over 20 years (Quanjer et al 1983, 1993; Pellegrino et al 2005) that using “percent of predicted” is scientifically unsound and should therefore be replaced by or used alongside “standardized residuals” which are a more scientifically robust way to express results. Knowing that a patient’s FEV1 value is below 1.64 standardized residuals gives more confidence when interpreting a respiratory disorder. Taken together with the “percent of predicted” values the two figures can help to indicate the “severity” of abnormality without falling into the trap of the reference equations producing “false positives”, particularly in elderly patients (Miller 2007).Traditionally the AARC/ATS guidelines have recommended that utilization of reference values from a variety of populations should be verified in local healthy nonsmoking subjects to determine their “local best fit” reference values. In Europe, the adoption of a generic and averaged “euro-lung” set of reference values (Quanjer et al 1983. 1993) has been recommended with some ethnic correction factors for non-Caucasian subjects. Clearly neither of these methods is ideal or without significant errors. Whilst these ethnic correction factors were probably financially expedient in the 1980s, it is clearly unacceptable to continue this practice today, and more reference value studies on most of the major races populating Europe needs undertaking.The greatest problem for introducing new reference values is deciding that they are “better” than previously used values. “Better” usually means that the normal distribution of the values is “tightened” by the improved selection of control subjects criteria, the use of improved quality spirometry (performance or equipment) or a greater number of subjects being included. The dilemma is deciding which set of reference values is “the gold standard” and whether older values should be replaced.In this issue Sood and colleagues (2007) compare new reference values (NHANES III) (Hankinson et al 1999) from a population of non-Hispanic Caucasians living in the USA with the traditional values used by AARC/ATS for over 30 years (Kory et al 1961; Knudson et al 1976, 1983; Morris et al 1971, 1973; Morris 1976; Crapo et al 1982). Data from two populations of restrictive and obstructive patients are used to highlight the lack of agreement between sets of reference values in detecting “abnormality”, but good agreement when it comes to “severity” of disease. The authors cite the new ATS/ERS recommendation (Pellegrino et al 2005) as the driver to adopt NHANES III values, and highlight the consequences which may become costly in terms of confusion, misdiagnosis, and the practicalities of respiratory specialists and equipment manufacturers adopting the new reference values.The study begs the question whether the European (ECSC) reference values (Quanjer et al 1983, 1993) should themselves be updated for European populations. There needs to be an examination of whether the costs of developing what may amount to perhaps a subtle “intellectual” change will actually make any significant clinical difference when there are far more fundamental questions to answer with limited financial resources. The effect on the burden of lung disease must be economically beneficial to make such a fundamental change worthwhile.Whilst reference values are an important issue, they need to be considered with respect to other potential errors in lung function testing. For example, in primary care spirometry the largest errors are the poor quality of the tests performed which are often a direct result of poor training and support (Kaminsky et al 2005; Raghunath et al 2006; Tinkelman et al 2006; Miravitiles et al 2007). Growing evidence indicates that equivalence in spirometry only occurs between “lab and office” when an adequate training and mentoring programme is in place in primary care (Eaton et al 1999; Upton et al 2000; Johansen 2007).Arguably the best way to monitor lung function for both the individual and the population as a whole is to adopt the “lung health” approach to respiratory healthcare. All adults at 25 years of age (ie, when lung growth is complete) (Sherill et al 1989), should have their spirometry measured accurately by trained and experienced healthcare professionals with quality-assured spirometers. This benchmark (similar to blood pressure monitoring) can help to establish “abnormality” from “normality” in an individual against reference values. It may then be possible to distinguish which individuals have a relative reduction in lung function. These susceptible individuals can then be monitored regularly (probably each decade) to see how much capacity they have left before respiratory disease causes major disability or morbidity. The concept of “loss of functional units on the road to death” has been described eloquently by Miller and colleagues (2005a, 2005b).The data from such a lung health study like NHANES III can also be collected nationally to provide better reference values and monitor the lung health of the nation and evaluate the intervention of therapies and behavioral changes (new guidelines, smoking cessation, etc.). The savings from detecting early disease and decreasing the burden of lung disease in the elderly should offset any financial pressures from the implementation of a lung health study. However, this has to be seen as a long term strategy in health economics and not a short term “quick fix”.Sood and colleagues (2007) have boldly addressed the issue of differences in reference values and their implications for healthcare. This article will undoubtedly generate more discussion, further consideration, and careful implementation before the NHANES III data (or any other new reference values) are widely adopted.
Authors: M N Upton; C Ferrell; C Bidwell; A McConnachie; J Goodfellow; G Davey Smith; G C Watt Journal: Public Health Date: 2000-09 Impact factor: 2.427
Authors: Sheena E Martenies; Joshua W Schaeffer; Grant Erlandson; Mary Bradford; Jill A Poole; Ander Wilson; Zachary Weller; Stephen J Reynolds; Sheryl Magzamen Journal: J Occup Environ Med Date: 2020-06 Impact factor: 2.306