Literature DB >> 33409142

Very deep super-resolution for efficient cone-beam computed tomographic image restoration.

Jae Joon Hwang¹, Yun-Hoa Jung¹, Bong-Hae Cho¹, Min-Suk Heo².

Abstract

PURPOSE: As cone-beam computed tomography (CBCT) has become the most widely used 3-dimensional (3D) imaging modality in the dental field, storage space and costs for large-capacity data have become an important issue. Therefore, if 3D data can be stored at a clinically acceptable compression rate, the burden in terms of storage space and cost can be reduced and data can be managed more efficiently. In this study, a deep learning network for super-resolution was tested to restore compressed virtual CBCT images.
MATERIALS AND METHODS: Virtual CBCT image data were created with a publicly available online dataset (CQ500) of multidetector computed tomography images using CBCT reconstruction software (TIGRE). A very deep super-resolution (VDSR) network was trained to restore high-resolution virtual CBCT images from the low-resolution virtual CBCT images.
RESULTS: The images reconstructed by VDSR showed better image quality than bicubic interpolation in restored images at various scale ratios. The highest scale ratio with clinically acceptable reconstruction accuracy using VDSR was 2.1.
CONCLUSION: VDSR showed promising restoration accuracy in this study. In the future, it will be necessary to experiment with new deep learning algorithms and large-scale data for clinical application of this technology.

Entities: Chemical

Keywords: Cone-Beam Computed Tomography; Data Compression; Radiographic Image Enhancement

Year: 2020 PMID： 33409142 PMCID： PMC7758262 DOI： 10.5624/isd.2020.50.4.331

Source DB: PubMed Journal: Imaging Sci Dent ISSN： 2233-7822

Introduction

Cone-beam computed tomography (CBCT) is used in many dental hospitals and clinics for the purpose of diagnosis using 3-dimensional (3D) images. However, the storage and management of 3D data is enormously expensive in terms of time and cost because of the large file size of each scan, which may range from 500 to 1,200 MB depending on the size of the field of view. In addition, the loss of detailed information by the lossy compression of CBCT images is irreversible. Therefore, additional storage devices and technical support are required to implement effective backup strategies, incurring additional expenses and consuming additional time. Furthermore, data backup and management are not securely carried out in smaller dental clinics to the same extent as in dental hospitals. Therefore, if CBCT data can be compressed and later restored to a clinically acceptable image quality, dental and medical institutions can reduce their amount of data storage, depending on the compression rate. Expanding this technique to the entire hospital community could save astronomical amounts of time and money. However, there has not been much progress in research on efficient compression and reconstruction of CBCT images. Furthermore, no commercial algorithm is available that can reconstruct a compressed 3D CBCT image that is close to the original image quality in a highly reliable manner. Super-resolution is the process of restoring high-resolution images from low-resolution images. Super-resolution is challenging because high-frequency image content typically cannot be recovered from low-resolution images. Without high-frequency information, the quality of the high-resolution image is limited. Until now, super-resolution has been attempted using traditional image processing methods, but it did not demonstrate high accuracy. Deep learning, when applied to this field, has begun to exceed the accuracy of traditional image processing methods.1 Recently, with the use of deep network architecture and residual blocks, accuracy has rapidly increased.2 Very deep super-resolution (VDSR) is a convolutional neural network (CNN) designed to reconstruct the original resolution of a compressed image.3 The VDSR network learns the mapping between the low-resolution and the high-resolution images. This mapping is possible because low-resolution and high-resolution images have similar image content and differ primarily in their high-frequency details. For VDSR network training, a residual image is created by subtracting a downsampled low-resolution image from the original image. Next, residual learning occurs, wherein the CNN learns to estimate a residual image containing information about the high-frequency details of the residual image. No previous reports have been published on the possibility of applying the VDSR technique to the dental field, although it has shown high performance for images in general. Using the VDSR algorithm, which has a deep CNN architecture, this study aimed to explore the possibility of using a deep learning algorithm in clinical practice to restore compressed virtual CBCT images. Through this study of an efficient and accurate image restoration algorithm, it may be possible to contribute to the efficiency and cost reduction of image storage in the dental field.

Materials and Methods

Figure 1 shows the overview of the method used in this study. The simulation was carried using MATLAB 2020a (MathWorks, Natick, MA, USA) on an i7-8700K processor.

Fig. 1

Schematic view of the method used in this study. After reconstructing virtual cone-beam computed tomography (CBCT) data, the very deep super-resolution (VDSR) network learns to predict the residual image from the low-resolution image.

In order to experiment with verifiable images, virtual CBCT image data were created using TIGRE software4 (the University of Bath, Bat, UK and CERN, Geneva, Switzerland) with multidetector computed tomography (MDCT) images from the CQ500 dataset.5 The CQ500 comprises 491 head scans with 193,317 slices that are publicly available. The TIGRE software created virtual CBCT images by initially adding noise to the projection image generated from the MDCT image, which it then back-projected using a Feldkamp-type algorithm (Fig. 1).6 The intensity value of the created image was normalized between 0 and 1 using the equation below: where m and M refer to the minimum and maximum pixel intensity, respectively. In order to improve the accuracy of deep learning, the size of the dataset was increased using geometric and scale augmentation. Geometric augmentation was performed by first randomly rotating images by 90°, 180°, and 270°, and next, flipping them horizontally. By using multiple scale factors, the dataset was also augmented by scale augmentation. As the scale factor increases, restoring the original image becomes more difficult because the lower-resolution image loses more detailed image information. Scale augmentation improves the image quality at a larger compression rate because the CNN network can take advantage of the detailed information from images with a smaller compression rate. In this study, scale augmentations with factors of 2, 3, and 4 were used. The residual image is formed from the difference between the high-resolution and downsampled low-resolution images. The VDSR network learns to predict the residual image from the low-resolution image. After the VDSR network learns to estimate the residual image, high-resolution images can be reconstructed by adding the estimated residual image to the downsampled low-resolution image. The dataset of 12,000 virtual CBCT images was divided into training and test sets at a ratio of 8 : 2. An NVIDIA Titan RTX GPU with cuDNN version 5.1 acceleration was used for network training. The models were trained for 100 epochs with a stochastic gradient descent with a momentum optimizer (initial learning rate: 0.1, momentum: 0.9, learning rate drop period: 10).7 In this experiment, the peak signal-to-noise ratio (PSNR), structural similarity index (SSIM), and naturalness image quality evaluator (NIQE)8 of at scale ratios of 2, 3, and 4 were calculated. Furthermore, the loss of image information incurred via the use of VDSR was analyzed according to various compression ratios. Larger PSNR and SSIM values indicate closer image quality relative to the reference image, whereas smaller NIQE scores indicate a perceptually better image. The definitions of PSRN, SSIM, and NIQE, which are widely used in the literature for objective comparisons of image quality, are below: PSNR=10 log SSIM(x,y)=[l(x,y)] NIQE: the distance of the Gaussian model between the NSS-based features of a test image and the training dataset Where, x and y are image coordinates, MSE is mean squared error, l is luminance, c is contrast, s is structure, and NSS refers to natural scene statistics. In order to obtain a scale ratio similar to the clinically acceptable PNSR (36.36) of low-dose CBCT,9 0.1-unit intervals between compression ratios of 2 and 3 were evaluated.

Results

An example of an input virtual CBCT image is shown in Figure 2. Figure 3 shows the high-resolution reference image and restoration results using VDSL and bicubic interpolation with a scale factor of 3.

Fig. 2

Examples of a multidetector computed tomographic (MDCT) image (A) and a virtual cone-beam computed tomographic (CBCT) image (B). Virtual CBCT image made from MDCT (A) using CBCT reconstruction software (TIGRE).

Fig. 3

High-resolution image of reference, bicubic interpolation, and very deep super-resolution (VDSR) at a scale factor of 3. A. High-resolution reference image. B. Restored high-resolution image using VDSR. C. Restored high-resolution image using bicubic interpolation. The VDSR image shows finer details and sharper edges than the image upsampled using bicubic interpolation.

At most compression ratios, the PNSR and SSIM of the images restored using VDSR were higher than the PNSR and SSIM of the images restored using bicubic interpolation. Conversely, the NIQE of the images restored using VDSR tended to be lower than the NIQE of the images restored using bicubic interpolation. After upscaling low-resolution images with a scale factor of 2, the PNSR and SSIM of the images restored using VDSR exceeded 40 and 25, respectively. However, at a scale factor of 3, the PNSR and SSIM of the images restored using VDSR dropped to 32 and 22, respectively. This tendency was also found for the PNSR and SSIM of the images restored using bicubic interpolation. The PNSR difference of the images restored using VDSR between scale factors of 3 and 4 was insignificant compared to the PNSR difference of the images restored using VDSR between scale factors of 2 and 3. The PNSR and SSIM of the images restored using VDSR showed a tendency to decrease with an increasing scale factor, but did not decrease proportionally within the 2–3 interval. NIQE showed a tendency to increase with an increasing scale factor in VDSR, but there was no clear tendency for bicubic interpolation. In this experiment, the scale factor that showed the closest PNSR of the images restored using VDSR to the clinically acceptable PNSR (36.36) was 2.1 (Table 1, Fig. 4).

Table 1

Image quality comparison using peak signal-to-noise ratio (PNSR), structural similarity index (SSIM), and naturalness image quality evaluator (NIQE) at various magnifications

Fig. 4

Peak signal-to-noise ratio (PNSR, A), structural similarity index (SSIM, B), and naturalness image quality evaluator (NIQE, C) values of very deep super-resolution (VDSR) and bicubic interpolation at various scale factors. At most compression ratios, the VDSR of PNSR and SSIM was higher than that of bicubic interpolation. Conversely, the VDSR of NIQE tended to be lower than that of bicubic interpolation.

The image quality difference between the images restored using bicubic interpolation or VDSR at the scale factors of 2.1, 3, and 4 is compared in Figure 5. The images restored using VDSR showed finer details and sharper edges than the images upsampled using bicubic interpolation.

Fig. 5

High resolution image of bicubic interpolation and very deep super-resolution (VDSR) at scale factors of 2.1, 3, and 4. A. High-resolution reference image. B. 2.1× compression. C. 3× compression. D. 4× compression.

Discussion

Image compression is widely used for digital photography in everyday life, but it has not been used effectively in dental and medical imaging. This is because lossy compression of the original image diminishes image quality, and there is a concern that this degradation may cause errors in interpretation.10 However, as 3D images become more popular in the dental field, the cost of storing and managing CBCT images, each of which exceeds 500 MB in size, is increasing. As 3D CBCT has gained popularity in the dental field, the storage and management of 3D images have become an issue. At some general hospitals, old data that exceed the capacity of the data server may be stored on magnetic tape. However, it takes time to retrieve the images stored on tape, which delays the comparison and interpretation time when trying to make a diagnosis. Therefore, if Digital Imaging and Communications in Medicine images can be compressed, stored, and restored with sufficient accuracy that there is no difference in interpretation results, the efforts and costs of storing and managing 3D images can be greatly reduced. In this study, the CNN architecture, which has shown excellent performance in the dental field11, was applied to achieve super-resolution of virtual CBCT images. It was demonstrated that by using VDSR, which showed high performance, the CBCT data could be reduced by up to half the original amount without clinically significant image degradation. However, a limitation of this study is that it used virtual CBCT data generated from MDCT data rather than actual CBCT data. It is necessary to verify the appropriate compression rate in large-scale CBCT data in the future. In addition, the criterion for clinically acceptable PNSR was borrowed from results obtained by analyzing low-dose CBCT. Therefore, additional studies are required to attain an acceptable PNSR during CBCT image restoration after compression. It will also be necessary to study indicators that reflect the reader's subjective satisfaction, such as changes in the mean opinion score,2 which are relevant during the compression and restoration of CBCT data. A higher compression rate allows a smaller storage capacity, but more time is needed to compress and restore the images. When compressing and storing 3D dental and medical images, both the accuracy of restoration and the time and computing power required for compression and restoration must be considered. In recent years, deep learning networks that can restore images in real time with high accuracy have been developed;12 therefore, if performance verification is performed on large-scale datasets and radiologists agree that they do not perceive a difference during the interpretation process, VDSR technology can be applied in the clinical setting.

4 in total

1. Image Super-Resolution Using Deep Convolutional Networks.

Authors: Chao Dong; Chen Change Loy; Kaiming He; Xiaoou Tang
Journal: IEEE Trans Pattern Anal Mach Intell Date: 2016-02 Impact factor: 6.226

2. Deep learning algorithms for detection of critical findings in head CT scans: a retrospective study.

Authors: Sasank Chilamkurthy; Rohit Ghosh; Swetha Tanamala; Mustafa Biviji; Norbert G Campeau; Vasantha Kumar Venugopal; Vidur Mahajan; Pooja Rao; Prashant Warier
Journal: Lancet Date: 2018-10-11 Impact factor: 79.321

Review 3. An overview of deep learning in the field of dentistry.

Authors: Jae-Joon Hwang; Yun-Hoa Jung; Bong-Hae Cho; Min-Suk Heo
Journal: Imaging Sci Dent Date: 2019-03-25

4. Low-dose cone-beam CT (LD-CBCT) reconstruction for image-guided radiation therapy (IGRT) by three-dimensional dual-dictionary learning.

Authors: Ying Song; Weikang Zhang; Hong Zhang; Qiang Wang; Qing Xiao; Zhibing Li; Xing Wei; Jialu Lai; Xuetao Wang; Wan Li; Quan Zhong; Pan Gong; Renming Zhong; Jun Zhao
Journal: Radiat Oncol Date: 2020-08-12 Impact factor: 3.481

4 in total