Literature DB >> 31367634

Evaluation of inter-observer variation for computed tomography identification of childhood interstitial lung disease.

Joseph Jacob1,2, Catherine M Owens3, Alan S Brody4, Thomas Semple5, Tom A Watson3, Alistair Calder3, Pilar Garcia-Peña6, Paolo Toma7, Anand Devaraj5, Henry Walton8, Antonio Moreno-Galdó9, Paul Aurora10, Alexandra Rice11, Timothy J Vece12, Steve Cunningham13, Andre Altmann2, Athol U Wells14, Andrew G Nicholson11, Andrew Bush15.   

Abstract

Making chILD diagnoses on CT is poorly reproducible, even amongst sub-specialists. CT might best improve diagnostic confidence in a multidisciplinary team setting when augmented with clinical, functional and haematological results. http://bit.ly/327jRCw.

Entities:  

Year:  2019        PMID: 31367634      PMCID: PMC6661316          DOI: 10.1183/23120541.00100-2019

Source DB:  PubMed          Journal:  ERJ Open Res        ISSN: 2312-0541


To the Editor: Interstitial lung diseases (ILDs) that present in childhood (chILD) are seen far less frequently than ILDs presenting in adults which themselves constitute rare disorders [1]. Histopathological [2, 3] and imaging [4] characterisation of chILD disease subtypes therefore lags behind adult ILDs. The field has also been constrained by comparisons with disease morphology in adults, despite the developmental differences in terms of growth and healing in the paediatric lung, which may alter disease patterns and distributions. The American Thoracic Society [5] and European [1] chILD management guidelines both specify a pivotal role for computed tomography (CT) imaging in the work-up of chILD patients to: 1) determine whether a chILD is present or not; and 2) where possible, to make a specific diagnosis of the underlying cause. For the second aim to be achieved, diagnostic reviews need to be reproducible between experts. Our study uniquely examined agreement between observers of varying experience in the CT evaluation of chILD to inform whether the current status of CT imaging and knowledge can be diagnostic of specific chILDs. We hypothesised that observer agreement for chILD groups and diagnoses would be limited. The study was not designed to relate CT agreement to final diagnosis. As a secondary analysis, we examined how CT interpretation differed between observers in children under and over 2 years of age. 84 patients (<2 years, n=35) from 13 countries, referred to the Royal Brompton Hospital paediatric ILD unit from 1999 to 2014 were re-evaluated by 10 observers: three chILD sub-specialist radiologists with >25 years' experience (C.M. Owens, P. Garcia-Peña, A.S. Brody); four chILD pulmonologists with >20 years' experience (S. Cunningham, T.J. Vece, P. Aurora, A. Moreno-Galdó); and three radiologists with 5–10 years' chILD experience (A. Calder, P. Toma, T.A. Watson). Consecutive patients were chosen and included in the study if clinically indicated DICOM format CTs were retrievable and of acceptable quality (high-resolution, ≤2 mm slice thickness) as determined by an independent radiologist (J. Jacob). Observers were given details of patient sex and age. Each scorer provided up to three choices when assigning a chILD diagnostic group or individual chILD diagnosis from a preselected list. For each choice, a measure of group/diagnostic likelihood was also assigned with confidence scores ranging from 0% to 100%: 100%=pathognomonic, 70–90%=high confidence, 40–60%=moderate confidence, 10–30%=low confidence, 0%=no confidence. The choices were based on previous histopathological work [2]. The Fleiss Kappa evaluated observer agreement for first-choice chILD groups/individual diagnoses. For chILD groups/individual diagnoses where agreement was at least fair, observer confidence was examined using linear weighted kappas [6]. Results were reanalysed according to patient age (<2 years versus >2 years, the conventional dichotomy in chILD classifications [7]) given the variable prevalence of chILD disease groups/individual diagnoses across these age ranges. Agreement was also examined in observer subgroups (senior versus junior radiologists versus senior pulmonologists). Statistical analyses were performed with R: A language and environment for statistical computing [8]. Approval for the study was obtained from the Institutional Research Committee of the Royal Brompton Hospital (Project 1157). 840 first-choice chILD group/diagnostic assignations were made by the 10 observers (table 1). chILD groups/diagnoses whose frequency was <10% of the total number of first-choice diagnoses in patients under and over 2 years of age, were grouped as “others”, in accord with previous studies [9]. Four chILD groups were analysed: airways disease (n=247), interstitial pneumonia (n=326), developmental/undefined aetiology disorders (n=147) and others (n=120). Five chILD diagnoses could be analysed in patients <2 years: infection (n=48), chronic pneumonitis of infancy (CPI; n=47), pulmonary alveolar proteinosis (PAP, n=53), developmental/undefined aetiology disorders (n=83) and others (n=119). Four chILD diagnoses could be analysed in patients >2 years: obliterative bronchiolitis (n=48), infection (n=86), fibrotic nonspecific interstitial pneumonia (fNSIP; n=47) and others (n=309).
TABLE 1

Assignations of first-choice individual childhood interstitial lung disease (chILD) group or diagnosis in children under 2 years of age (n=35) and over 2 years of age (n=49) by 10 observers

chILD categorychILD group/diagnosisNumbers of first-choice assignations
<2 years>2 years
chILD groupsNormal318
Airways disease#77170
Interstitial pneumonia126200
Developmental/undefined aetiology disorders9948
Unclassifiable413
Bronchopulmonary dysplasia2910
Other1231
chILD DiagnosesExudative bronchiolitis42
Constrictive obliterative bronchiolitis1048
Infection4886
Chronic pneumonitis of infancy4712
Pulmonary alveolar proteinosis5324
Desquamative interstitial pneumonia813
Nonspecific interstitial pneumonia (cellular)1634
Diffuse alveolar damage38
Fibrotic nonspecific interstitial pneumonia147
Organising pneumonia030
Diffuse alveolar haemorrhage111
Alveolar microlithiasis03
Sarcoidosis06
Langerhans cell histiocytosis112
Lung growth abnormality/chronic disease of prematurity8312
Pulmonary interstitial glycogenosis110
Neuroendocrine cell hyperplasia of infancy3015
Lymphoid interstitial pneumonia123
Diffuse lymphoid hyperplasia12
Lymphomatoid granulomatosis14
Follicular bronchiolitis06
Bronchus-associated lymphoid tissue01
Primary pulmonary arterial hypertension14
PVOD/PCH35
Thromboembolic disease13
Lymphangiomatosis11
Lymphangiectasia (primary and secondary)77
Secondary vasculopathies (e.g. due to cardiac disease)62
Primary and secondary pulmonary vasculitis114
Neoplasms09
Alternate diagnosis1124

PVOD: pulmonary veno-occlusive disease; PCH: pulmonary capillary haemangiomatosis. #: airways disease included infection and obliterative and exudative bronchiolitis; ¶: developmental/undefined aetiology disorders included growth disorders such as alveolar hypoplasia, neuroendocrine cell hyperplasia of infancy and pulmonary interstitial glycogenosis.

Assignations of first-choice individual childhood interstitial lung disease (chILD) group or diagnosis in children under 2 years of age (n=35) and over 2 years of age (n=49) by 10 observers PVOD: pulmonary veno-occlusive disease; PCH: pulmonary capillary haemangiomatosis. #: airways disease included infection and obliterative and exudative bronchiolitis; ¶: developmental/undefined aetiology disorders included growth disorders such as alveolar hypoplasia, neuroendocrine cell hyperplasia of infancy and pulmonary interstitial glycogenosis. When subanalysed according to patient age, observer agreement for first-choice chILD groups was moderate for airways disease and interstitial pneumonia in patients >2 years, but only fair in patients <2 years (table 2). There was generally less observer variation between senior radiologists than pulmonologists. In patients <2 years senior radiologists demonstrated better agreement than other observers. High confidence chILD group assignations (>70%) were more common in children >2 years and increased with observer seniority (data not shown). Encouragingly, agreement for diagnostic groups was best for infection/airways disease where conservative management without biopsy may be preferable.
TABLE 2

Fleiss kappa values for observer agreement for first choice group assignation in the 84 study cases. Sub-analyses are shown for patients under 2 years of age (n=35) and over 2 years of age (n=49)

Patient groupsObserver groupsAll casesAirways disease#Interstitial pneumoniaDevelopmental/undefined
All patientsAll observers0.30 (0.28–0.32)0.37 (0.34–0.41)0.36 (0.33–0.40)0.18 (0.15–0.21)
Senior radiologists0.35 (0.27–0.42)0.46 (0.34–0.58)0.44 (0.31–0.56)0.16 (0.04–0.29)
Senior pulmonologists0.28 (0.23–0.33)0.34 (0.25–0.42)0.36 (0.27–0.45)0.21 (0.12–0.29)
Junior radiologists0.31 (0.23–0.38)0.45 (0.33–0.57)0.39 (0.27–0.51)0.07 (−0.06–0.19)
Patients <2 years of ageAll observers0.22 (0.19–0.25)0.28 (0.23–0.33)0.30 (0.25–0.35)0.14 (0.09–0.19)
Senior radiologists0.36 (0.24–0.48)0.61 (0.42–0.80)0.46 (0.26–0.65)0.15 (−0.05–0.34)
Senior pulmonologists0.18 (0.10–0.26)0.24 (0.11–0.38)0.26 (0.12–0.39)0.16 (0.02–0.29)
Junior radiologists0.13 (0.01–0.24)0.35 (0.16–0.54)0.19 (−0.00–0.38)−0.05 (−0.24–0.14)
Patients >2 years of ageAll observers0.33 (0.31–0.36)0.40 (0.36–0.45)0.40 (0.36–0.44)0.11 (0.07–0.15)
Senior radiologists0.31 (0.21–0.41)0.36 (0.20–0.53)0.42 (0.26–0.58)0.12 (−0.04–0.29)
Senior pulmonologists0.32 (0.25–0.40)0.36 (0.24–0.47)0.43 (0.32–0.55)0.09 (−0.02–0.21)
Junior radiologists0.42 (0.32–0.53)0.52 (0.36–0.68)0.48 (0.32–0.64)0.09 (−0.07–0.25)

#: airways disease included infection and obliterative and exudative bronchiolitis; ¶: developmental/undefined aetiology disorders included growth disorders such as alveolar hypoplasia, neuroendocrine cell hyperplasia of infancy and pulmonary interstitial glycogenosis.

Fleiss kappa values for observer agreement for first choice group assignation in the 84 study cases. Sub-analyses are shown for patients under 2 years of age (n=35) and over 2 years of age (n=49) #: airways disease included infection and obliterative and exudative bronchiolitis; ¶: developmental/undefined aetiology disorders included growth disorders such as alveolar hypoplasia, neuroendocrine cell hyperplasia of infancy and pulmonary interstitial glycogenosis. Observer agreement in assigning first-choice individual chILD diagnoses was moderate for obliterative bronchiolitis (in children >2 years), and fair for other diagnoses except CPI where agreement was poor. Further analyses were only performed in individual diagnoses where agreement was at least fair. Observer agreement for diagnostic likelihoods were moderate for obliterative bronchiolitis (children >2 years of age), but were at best only fair for other diagnoses, regardless of patient age (data not shown). Observer agreement for diagnostic likelihood of infection in children <2 years was poor across all observer subgroups. We report poor agreement between even experienced thoracic radiologists for the identification of most chILD groups and individual diagnoses across ages. The lower observer agreement for individual chILD diagnoses compared with chILD diagnostic groups may reflect differing methods by which radiologists classify CT appearances. Some observers might classify according to histological pattern and distinguish features of NSIP from PAP in a child >2 years, while others might classify according to suspected aetiology and recognise both patterns as suggesting a surfactant protein disorder. Our findings stress the need for a new common uniform descriptor language for CT appearances, because the present system is not reproducible and therefore confusing. The current classification system in chILD is based on histopathological diagnoses [2, 7], and as occurred with adult fibrosing lung diseases, it is hoped that histopathological descriptors will inform those CT features that are of importance in delineating childhood disease. Two early studies examining diagnostic accuracy using radiological–histopathological correlations in a small subset of chILD patients demonstrated that CT features could identify histopathological diagnoses with acceptable accuracy [4, 10]. However, prior to the evaluation of CT diagnostic accuracy it is essential to understand whether expert readers can coalesce on a single diagnosis with sufficient frequency that a single CT diagnosis can be compared to a single histopathological diagnosis. Study limitations included the time frame over which cases were acquired, where CT imaging protocols and quality varied widely. However, imaging heterogeneity is an unavoidable constraint when examining real-world paediatric data and we actively sought to draw real-world conclusions on chILD observer agreement. Limiting clinical information to patient age and sex might have increased observer disagreement. However rather than examining diagnostic accuracy, our a priori study aim was to understand the limitations of CT interrogation of chILD and identify those conditions where pattern recognition alone is insufficient to make a confident diagnosis. Though the study case mix from a tertiary centre is likely to have been weighted towards more challenging presentations of disease, chILD cases are only seen with sufficient frequency to allow analysis in such centres. Our study is the largest examination of CT imaging in chILD cases, scored by the largest number of observers, to date, who importantly were from different specialties. We demonstrate that making chILD diagnoses on CT is difficult, even amongst sub-specialists. Considering the current state of knowledge in the field, we found that agreement for chILD diagnostic groups was generally better than for individual diagnoses and was highest amongst senior radiologists. Further detailed work is needed to understand if, with studies and training, improvements in CT diagnostic agreement are possible. Additionally, the ability of the multidisciplinary team to improve diagnostic confidence by augmenting radiology with clinical, functional and haematological results, as per guideline recommendations [1, 5] requires further study.
  9 in total

1.  Diagnostic accuracy of thin-section CT and chest radiography of pediatric interstitial lung disease.

Authors:  S J Copley; M Coren; A G Nicholson; M B Rubens; A Bush; D M Hansell
Journal:  AJR Am J Roentgenol       Date:  2000-02       Impact factor: 3.959

2.  Diffuse lung disease in infancy and childhood: expanding the chILD classification.

Authors:  Alexandra Rice; My-Anh Tran-Dang; Andrew Bush; Andrew G Nicholson
Journal:  Histopathology       Date:  2013-10-09       Impact factor: 5.087

3.  Diffuse Lung Disease in Biopsied Children 2 to 18 Years of Age. Application of the chILD Classification Scheme.

Authors:  Leland L Fan; Megan K Dishop; Csaba Galambos; Frederic B Askin; Frances V White; Claire Langston; Deborah R Liptzin; Miranda E Kroehl; Gail H Deutsch; Lisa R Young; Geoffrey Kurland; James Hagood; Sharon Dell; Bruce C Trapnell; Robin R Deterding
Journal:  Ann Am Thorac Soc       Date:  2015-10

Review 4.  European protocols for the diagnosis and initial treatment of interstitial lung disease in children.

Authors:  Andrew Bush; Steve Cunningham; Jacques de Blic; Angelo Barbato; Annick Clement; Ralph Epaud; Meike Hengst; Nural Kiper; Andrew G Nicholson; Martin Wetzke; Deborah Snijders; Nicolaus Schwerk; Matthias Griese
Journal:  Thorax       Date:  2015-07-01       Impact factor: 9.139

Review 5.  An official American Thoracic Society clinical practice guideline: classification, evaluation, and management of childhood interstitial lung disease in infancy.

Authors:  Geoffrey Kurland; Robin R Deterding; James S Hagood; Lisa R Young; Alan S Brody; Robert G Castile; Sharon Dell; Leland L Fan; Aaron Hamvas; Bettina C Hilman; Claire Langston; Lawrence M Nogee; Gregory J Redding
Journal:  Am J Respir Crit Care Med       Date:  2013-08-01       Impact factor: 21.405

6.  Pediatric diffuse lung disease: diagnosis and classification using high-resolution CT.

Authors:  D A Lynch; T Hay; J D Newell; V D Divgi; L L Fan
Journal:  AJR Am J Roentgenol       Date:  1999-09       Impact factor: 3.959

7.  HRCT diagnosis of diffuse parenchymal lung disease: inter-observer variation.

Authors:  Z A Aziz; A U Wells; D M Hansell; G A Bain; S J Copley; S R Desai; S M Ellis; F V Gleeson; S Grubnic; A G Nicholson; S P G Padley; K S Pointon; J H Reynolds; R J H Robertson; M B Rubens
Journal:  Thorax       Date:  2004-06       Impact factor: 9.139

8.  Diffuse lung disease in young children: application of a novel classification scheme.

Authors:  Gail H Deutsch; Lisa R Young; Robin R Deterding; Leland L Fan; Sharon D Dell; Judy A Bean; Alan S Brody; Lawrence M Nogee; Bruce C Trapnell; Claire Langston; Eric A Albright; Frederic B Askin; Peter Baker; Pauline M Chou; Carlyne M Cool; Susan C Coventry; Ernest Cutz; Mary M Davis; Megan K Dishop; Csaba Galambos; Kathleen Patterson; William D Travis; Susan E Wert; Frances V White
Journal:  Am J Respir Crit Care Med       Date:  2007-09-20       Impact factor: 21.405

9.  Interobserver agreement for the ATS/ERS/JRS/ALAT criteria for a UIP pattern on CT.

Authors:  Simon L F Walsh; Lucio Calandriello; Nicola Sverzellati; Athol U Wells; David M Hansell
Journal:  Thorax       Date:  2015-11-19       Impact factor: 9.139

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.