Literature DB >> 32631626

Validation of the British Society of Thoracic Imaging guidelines for COVID-19 chest radiograph reporting.

S S Hare1, A N Tavare1, V Dattani1, B Musaddaq1, I Beal1, J Cleverley1, C Cash1, E Lemoniati1, J Barnett2.   

Abstract

AIM: To validate the British Society of Thoracic Imaging issued guidelines for the categorisation of chest radiographs for coronavirus disease 2019 (COVID-19) reporting regarding reproducibility amongst radiologists and diagnostic performance.
MATERIALS AND METHODS: Chest radiographs from 50 patients with COVID-19, and 50 control patients with symptoms consistent with COVID-19 from prior to the emergence of the novel coronavirus were assessed by seven consultant radiologists with regards to the British Society of Thoracic Imaging guidelines.
RESULTS: The findings show excellent specificity (100%) and moderate sensitivity (44%) for guideline-defined Classic/Probable COVID-19, and substantial interobserver agreement (Fleiss' k=0.61). Fair agreement was observed for the "Indeterminate for COVID-19" (k=0.23), and "Non-COVID-19" (k=0.37) categories; furthermore, the sensitivity (0.26 and 0.14 respectively) and specificity (0.76, 0.80) of these categories for COVID-19 were not significantly different (McNemar's test p=0.18 and p=0.67).
CONCLUSION: An amalgamation of the categories of "Indeterminate for COVID-19" and "Non-COVID-19" into a single "not classic of COVID-19" classification would improve interobserver agreement, encompass patients with a similar probability of COVID-19, and remove the possibility of labelling patients with COVID-19 as "Non-COVID-19", which is the presenting radiographic appearance in a significant minority (14%) of patients.
Copyright © 2020. Published by Elsevier Ltd.

Entities:  

Mesh:

Year:  2020        PMID: 32631626      PMCID: PMC7298474          DOI: 10.1016/j.crad.2020.06.005

Source DB:  PubMed          Journal:  Clin Radiol        ISSN: 0009-9260            Impact factor:   2.350


Introduction

In December 2019, Wuhan City (Hubei Province, China) reported a cluster of patients displaying a febrile respiratory tract illness of unknown origin. Bronchoalveolar lavage of the patients isolated the pathogen as a novel strain of coronavirus (SARS-coronavirus-2 [SARS-CoV-2]). The pulmonary infection caused by SARS-CoV-2 was named coronavirus disease 2019 (COVID-19) by the World Health Organization (WHO). In early 2020, the unprecedented surge in UK COVID-19 cases saw the chest radiograph (CXR) emerge as the frontline diagnostic imaging test, in conjunction with clinical history and key blood biomarkers: C-reactive protein (CRP) and lymphopenia. The British Society of Thoracic Imaging (BSTI) developed a simple, internationally recognised CXR reporting template to help facilitate consistency of reporting with embedded CXR reporting codes and allow retrospective radiology information system keyword searches for audit purposes. Frontline doctors have found this standardised reporting method a useful adjunct to clinical assessment, particularly when CXRs are “hot-reported”. The BSTI reporting template has been incorporated into an NHS England (NHSE) endorsed radiology decision tool for suspected COVID-19. , Moreover, CXR has also emerged as a pivotal triage tool in proposed infection-control management strategies of inpatients as a result of reverse transcription-polymerase chain reaction (RT-PCR) result delays of up to 24–48 h and suspected initial false-negative COVID-19 nose/throat swab RT-PCR results in patients with a high clinical suspicion of COVID-19 infection. , Despite the prominence of CXR in the management of patients suspected to have SARS-CoV-2 pulmonary infection in Great Britain, to date very few studies have examined the CXR findings of COVID-19. , Saliently, no study to date has examined the diagnostic utility of CXR against non-COVID-19 controls. Furthermore, the four diagnostic categories of chest radiograph introduced in the BSTI guidelines have not been validated regarding their utility or reproducibility. The aim of the present study was to validate the BSTI COVID-19 CXR classification criteria with regards firstly to their reproducibility amongst consultant radiologists involved in the front-line care of patients with COVID-19, and secondly, to their diagnostic utility against symptomatic control patients without COVID-19.

Materials and methods

Patient selection

Consecutive adult patients with nose/throat swab RT-PCR-confirmed SARS-CoV-2 infection were identified from the microbiology database at Barnet General Hospital. Fifty consecutive patients were selected following exclusion of patients <18 years (n=0); patients with multiple organisms identified on PCR (n=0); and patients without admission CXR available on the picture archiving and communication system (PACS; n=4). As a retrospective evaluation of routinely collected clinical data, ethical approval was not required. Given the limited application of RT-PCR testing at this time in England, and the reported non-trivial false-negative rate of RT-PCR testing for SARS-CoV-2, it is difficult to identify patients who are definitively negative for SARS-CoV-2; therefore, a control cohort of patients was selected from November 2019, prior to the emergence of SARS-CoV-2. Fifty consecutive adult patients with symptoms consistent with COVID-19 (new cough and fever) and available admission chest radiograph were selected from the PACS records.

Imaging evaluation

Images were anonymised regarding both patient identifiable data and date of image acquisition and stored in a random order on the Trust's PACS. Seven consultant radiologists (median length of time on the specialist register 10 years, range 1–22 years) were recruited to participate in the study. They received training consisting of a review of the educational material available on the BSTI website regarding COVID-19 classification of CXRs. Participants were informed of the presenting complaint (new cough and fever, query COVID-19), and asked to categorise each CXR with regards to the BSTI guidelines (Fig 1 ). Patients were categorised as Classic/probable COVID-19, Indeterminate for COVID-19, Normal, and Non-COVID-19 according to Box 1 .
Figure 1

Examples of the COVID BSTI categories for plain films, in each case all radiologists agreed on the categorisation. (a) Anteroposterior (AP) erect radiograph demonstrating “Classic COVID-19”. (b) AP erect chest radiograph “Indeterminate for COVID-19”. (c) AP erect radiograph classified as “COVID normal”. (d) AP erect radiograph classified as “Non-COVID”.

Examples of the COVID BSTI categories for plain films, in each case all radiologists agreed on the categorisation. (a) Anteroposterior (AP) erect radiograph demonstrating “Classic COVID-19”. (b) AP erect chest radiograph “Indeterminate for COVID-19”. (c) AP erect radiograph classified as “COVID normal”. (d) AP erect radiograph classified as “Non-COVID”. The British Society of Thoracic Imaging chest radiography reporting criteria. Alt-text: Box 1 In order to reach a final categorisation for each radiograph, the scores of two fellowship-trained thoracic radiologists (SSH, JB) were used, with consensus reached by post-hoc agreement.

Data collection and analysis

Data were extracted electronically from clinical records and analysed in R (https://www.R-project.org). Tests of comparative statistics were selected using Levene's test for homoscedasticity of variables and quintile–quintile plots for normality. Student's t-test was utilised for homoscedastic normally distributed data. Kruskal–Wallis test was used for non-normal homoscedastic data and Welch's t-test was used in the case of heteroscedasticity. Agreement was measured using unweighted Cohen's kappa for the final categorisation, with unweighted Fleiss' kappa used for assessment all participants. Only test statistics independent of disease prevalence (sensitivity and specificity) are reported. Confidence intervals for test statistics were calculated using the methods of Simel. McNemar's test was used to compare test statistics.

Results

Patient demographics are displayed in Table 1 . Patients with SARS-CoV-2 infection were significantly older than those without, but there was no significant difference in gender. Although neither lymphocyte count nor lymphopenia (defined as lymphocyte count <1×109 l−1) were significantly different in COVID-19 patients than controls, patients with COVID-19 had significantly greater CRP levels at presentation.
Table 1

Demographic and radiological data.

COVIDNon-COVIDp-Value
Demographic data
n5050
Male, n (%)33 (66.0%)24 (48.0%)0.07
Age, years (±SD)68.6 (±17.3)55.4 (±21.3)0.001
Lymphocyte count, ×109 l−1 (±sd)1.06 (±0.68)1.22 (±0.68)0.23
Lymphopenia, n (%)27 (54%)23 (46%)0.54
CRP, (IQR)77 (114)38 (100)0.01

Proportions compared using fisher test; age and lymphocyte count expressed as mean and standard deviation and compared using Student's t-test; CRP expressed as median and interquartile range and compared with Kruskal–Wallis test. Lymphopenia defined as lymphocyte count <1.0 ×109L−1.

CRP, C-reactive protein; SD, standard deviation; IQR, interquartile range.

Demographic and radiological data. Proportions compared using fisher test; age and lymphocyte count expressed as mean and standard deviation and compared using Student's t-test; CRP expressed as median and interquartile range and compared with Kruskal–Wallis test. Lymphopenia defined as lymphocyte count <1.0 ×109L−1. CRP, C-reactive protein; SD, standard deviation; IQR, interquartile range. Amongst all radiologists, overall agreement of CXR categorisation was moderate (fleiss K = 0.50). Agreement for individual diagnostic categories was substantial for ‘Classic/Probable COVID-19’, (k=0.61) and ‘Normal’ (k = 0.68). Fair agreement was observed for the ‘Indeterminate for COVID-19’ (k=0.23), and ‘Non-COVID-19’ (k=0.37) categories. Post-hoc combination of the ‘Indeterminate for COVID-19’ and ‘Non-COVID-19’ codes into a single category was associated with improved inter-observer agreement (k=0.58). For the purposes of final classification of CXRs, scores from two fellowship-trained thoracic radiologists (SSH, JB) were used, with disagreements arbitrated by consensus. Agreement amongst these radiologists was almost perfect for “Classic/Probable COVID-19” (k=0.83), substantial for “Normal” (k=0.70), moderate for “Non-COVID-19” (k=0.50) and slight for “Indeterminate” (k=0.25). The final classifications of patients are given in Table 2 . The “Classic/Probable COVID-19” category was associated with 100% specificity for COVID-19, and detected 44% of patients with RT-PCR-confirmed SARS-CoV-2 infection. Normal CXRs were significantly more frequent in controls (p<0.001 after adjustment for multiple testing), but still occurred in 16% of patients with RT-PCR-confirmed SARS-CoV-2 infection. The frequency of “Indeterminate for COVID-19” and “Non-COVID-19” chest radiographs was not significantly different between COVID-19 patients and controls, indeed the sensitivity and specificity of these categories for COVID-19 were not significantly different (McNemar's test p=0.18 and p=0.67, respectively). Finally, 7/50 patients (14%) with RT-PCR-confirmed SARS-CoV-2 infection had an admission CXR classified as “Non-COVID-19”, examples of such patients are demonstrated in Fig 2 .
Table 2

Proportions of patients compared using fisher test with Benjamini and Hochberg method of adjustment for multiple testing.

COVIDNon-COVIDp-ValueSensitivitySpecificity
Final CXR categorisation
COVID classic220<0.0010.44 (0.30, 0.59)1.00 (0.93, 1.00)
COVID indeterminate131210.26 (0.15, 0.40)0.76 (0.62, 0.87)
COVID normal828<0.0010.16 (0.07, 0.29)0.44 (0.30, 0.59)
Non-COVID71010.14 (0.06, 0.27)0.80 (0.66, 0.90)

Sensitivity and specificity expressed with 95% confidence intervals.

CXR, chest radiography.

Figure 2

Examples of patients with SARS-CoV-2 infection, but admission chest radiographs classified as “Non-COVID”. (a) AP erect radiograph demonstrating lobar pneumonia. (b) AP erect radiograph showing congestive cardiac failure. (c) AP erect chest radiograph with unilateral pleural effusion. (d) AP semi-erect chest radiograph with left lower lung airways disease/thickening.

Proportions of patients compared using fisher test with Benjamini and Hochberg method of adjustment for multiple testing. Sensitivity and specificity expressed with 95% confidence intervals. CXR, chest radiography. Examples of patients with SARS-CoV-2 infection, but admission chest radiographs classified as “Non-COVID”. (a) AP erect radiograph demonstrating lobar pneumonia. (b) AP erect radiograph showing congestive cardiac failure. (c) AP erect chest radiograph with unilateral pleural effusion. (d) AP semi-erect chest radiograph with left lower lung airways disease/thickening.

Discussion

The results of the present study demonstrate that the BSTI “Classic/Probable COVID-19” categorisation is very specific and moderately sensitive for patients with RT-PCR-confirmed SARS-CoV-2 pulmonary infection on admission CXR, as opposed to symptom-matched controls. Furthermore, this classification is substantially agreed upon by consultant radiologists. A significant minority of patients in this study with SARS-CoV-2 infection presented with normal CXRs, findings that reinforce the BSTI “Normal” categorisation, which states that COVID-19 cannot be excluded and that RT-PCR may be required. These results, however, highlight that some refinement of the BSTI COVID-19 classification criteria may be needed, specifically the categories of “Indeterminate for COVID-19” and “Non-COVID-19”. Only fair interobserver agreement was observed for these categories, which in the case of “Indeterminate for COVID-19” fell to slight agreement when only the categorisation of fellowship-trained thoracic radiologists were used. In addition, these categories have similar diagnostic performance regarding SARS-CoV-2 infection. The potential need for an iteration of these two categories is highlighted by an inherent overlap of CXR appearances between “Indeterminate for COVID-19” and “Non-COVID-19” categories. Examples of this overlap exist in patients with limited or unilateral consolidation, which could be SARS-CoV-2 or bacterial in aetiology; and in patients with multiple radiological abnormalities, for example, fluid overload and alveolar opacity. In equivocal cases, it is expected that radiologists may also reasonably be informed by the pre-test probability of SARS-CoV-2 infection in assigning cases to the “Indeterminate” or “Non-COVID” categories. Thus, the categorisation of the same radiograph may differ depending on whether the patient presented in the first peak of the COVID-19 pandemic in London, when up to 80% of emergency department admissions were COVID related, as opposed to during a relative trough. Inclusion of non-diagnostic examinations in the “Indeterminate for COVID-19” category also adds variation to this group of patents, compounding interobserver variation with regards to diagnosis and that regarding acceptable film quality. A non-diagnostic examination should be reported as such, and no statement about COVID-19 classification is necessary or possible in this situation. There is also variability in the recommendation implied by this diagnostic category; patients with a non-diagnostic film are more likely to benefit from a repeated attempt at imaging, whereas those with a diagnostic-quality examination revealing a non-specific abnormality may not. As the prevalence of COVID-19 increases and as health-seeking behaviours of the population respond to the presence of a pandemic, atypical radiographic presentations of SARS-CoV-2 infection will become more frequent relative to non-coronavirus disease. Indeed, 14% of patients in this study with SARS-CoV-2 infection had a CXR categorised as “Non-COVID”. The amalgamation of the “Indeterminate for COVID-19” and “Non-COVID-19” categories into a single “not classic of COVID-19” category would have several advantages. Firstly, this category would increase consultant radiologist agreement. Secondly, the category would encompass a group of patients with similar probability of SARS-CoV-2 infection. Thirdly, it would remove the possibility of potentially mislabelling patients with SARS-CoV-2 infection as “Non-COVID-19”. The present study has a number of limitations. Firstly, performance of a given test varies with disease prevalence. In the present study, patients and controls were matched at a 1:1 ratio. Throughout the height of the pandemic, the authors' anecdotal experience is of patients with COVID-19 outnumbering those without. In order to minimise the effect of varying prevalence of SARS-CoV-2 on the results, only sensitivity and specificity have been presented, which are statistics independent of disease prevalence. Using PCR as the reference standard diagnosis in a study examining radiological diagnosis is necessary to avoid incorporation bias, but introduces the biases of PCR testing strategy, namely towards those patients with more severe disease requiring admission. This may have the effect of overstating the sensitivity of the “Classic/Probable COVID” category, but should not affect the specificity. It is currently uncertain whether patients with false-negative SARS-CoV-2 RT-PCR have a distinct radiological phenotype to those who are RT-PCR positive; this could also introduce bias into the data. In conclusion, the results of the present demonstrate variable performance of the BSTI COVID-19 classification criteria. The guideline defined “Classic/Probable” appearance of COVID-19 has excellent specificity and moderate sensitivity for SARS-CoV-2 pulmonary infection, and furthermore, is associated with substantial interobserver agreement. The categories of “Indeterminate for COVID-19” and “Non-COVID-19”, however, suffer from greater interobserver variability, and furthermore, have similar sensitivity and specificity for COVID-19. The authors suggest an amalgamation of these categories into a “not classic of COVID-19” category, which would increase interobserver agreement; encompass patients with similar probability of COVID-19; and remove the potential for mislabelling patients with SARS-CoV-2 infection as “Non-COVID”.

Conflicts of interest

Dr Hare is on the committee of the British Society of Thoracic Imaging.
Box 1

The British Society of Thoracic Imaging chest radiography reporting criteria.

NormalCOVID-19 not excluded, please correlate with PCR
Classic/probable COVID-19Lower lobe and peripheral predominant multiple opacities that are bilateral (>> unilateral)
Indeterminate for COVID-19Does not fit Classic or Non-COVID-19 descriptors” or “poor quality film
Non-COVID-19Pneumothorax/lobar pneumonia/pleural effusion(s)/pulmonary oedema/other
  8 in total

1.  COVID-19 in a tertiary paediatric centre in Portugal: a single-centre retrospective study.

Authors:  Tiago Milheiro Silva; Ana Margarida Garcia; Catarina Gouveia; Flora Candeias; Maria João Brito
Journal:  BMJ Paediatr Open       Date:  2022-06

2.  Time course of lung ultrasound findings in patients with COVID-19 pneumonia and cardiac dysfunction.

Authors:  Joao Leote; Tiago Judas; Ana Luísa Broa; Miguel Lopes; Francisca Abecasis; Inês Pintassilgo; Afonso Gonçalves; Filipe Gonzalez
Journal:  Ultrasound J       Date:  2022-07-07

3.  Symptomatic, biochemical and radiographic recovery in patients with COVID-19.

Authors:  Patrick Mallia; Onn Min Kon; Jamilah Meghji; Brandon Wong; Kartik Kumar; Victoria Pilkington; Shaan Chhabra; Ben Russell; Jian Chen; Karthikan Srikanthan; Mirae Park; Harriet Owles; Felicity Liew; Joana Alcada; Laura Martin; Meg Coleman; Sarah Elkin; Clare Ross; Shweta Agrawal; Thomas Gardiner; Aaron Bell; Alice White; Dominic Hampson; Gauri Vithlani; Kavina Manalan; Solange Bramer; Alejandra Martin Segura; Anushree Kucheria; Prashanthi Ratnakumar; Alexander Sheeka; Lavanya Anandan; Susan Copley; Georgina Russell; Chloe I Bloom
Journal:  BMJ Open Respir Res       Date:  2021-04

4.  COVID-19 diagnosis on CT images with Bayes optimization-based deep neural networks and machine learning algorithms.

Authors:  Murat Canayaz; Sanem Şehribanoğlu; Recep Özdağ; Murat Demir
Journal:  Neural Comput Appl       Date:  2022-02-28       Impact factor: 5.102

5.  Automated deep learning-based segmentation of COVID-19 lesions from chest computed tomography images.

Authors:  Mohammad Salehi; Mahdieh Afkhami Ardekani; Alireza Bashari Taramsari; Hamed Ghaffari; Mohammad Haghparast
Journal:  Pol J Radiol       Date:  2022-08-26

Review 6.  Review on Diagnosis of COVID-19 from Chest CT Images Using Artificial Intelligence.

Authors:  Ilker Ozsahin; Boran Sekeroglu; Musa Sani Musa; Mubarak Taiwo Mustapha; Dilber Uzun Ozsahin
Journal:  Comput Math Methods Med       Date:  2020-09-26       Impact factor: 2.238

7.  Chest X-ray in the emergency department during COVID-19 pandemic descending phase in Italy: correlation with patients' outcome.

Authors:  Chiara Moroni; Diletta Cozzi; Marco Albanesi; Edoardo Cavigli; Alessandra Bindi; Silvia Luvarà; Simone Busoni; Lorenzo Nicola Mazzoni; Stefano Grifoni; Peiman Nazerian; Vittorio Miele
Journal:  Radiol Med       Date:  2021-01-04       Impact factor: 3.469

Review 8.  Diagnosis of COVID-19 in symptomatic patients: An updated review.

Authors:  M N Zahan; H Habibi; A Pencil; J Abdul-Ghafar; S A Ahmadi; N S Juyena; M T Rahman; M S Parvej
Journal:  Vacunas       Date:  2021-07-13
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.