| Literature DB >> 28086901 |
Yi Du1,2, Gongyi Yu1,3, Xincheng Xiang1, Xiangang Wang4.
Abstract
BACKGROUND: For cone-beam computed tomography (CBCT), which has been playing an important role in clinical applications, iterative reconstruction algorithms are able to provide advantageous image qualities over the classical FDK. However, the computational speed of iterative reconstruction is a notable issue for CBCT, of which the forward projection calculation is one of the most time-consuming components. METHOD ANDEntities:
Keywords: Cone-beam CT; Forward projection; GPU
Mesh:
Year: 2017 PMID: 28086901 PMCID: PMC5234133 DOI: 10.1186/s12938-016-0293-8
Source DB: PubMed Journal: Biomed Eng Online ISSN: 1475-925X Impact factor: 2.819
Gather and scatter operations involved in forward and back projection computes
| Approach | Forward projection | Back projection |
|---|---|---|
| Voxel-driven | Scatter | Gather |
| Line-driven | Gather | Scatter |
Fig. 1Schematic of CBCT imaging (a) and the voxel-driven forward projection algorithm (b), where the image voxel value is scattered into the four adjacent detector units
Fig. 2Schematic of the voxel-driven forward projection algorithm for adjacent voxels at β
Fig. 3Conventional threads are allocated in horizontal planes (a). In the proposed method, the threads are allocated in vertical (coronal) planes (b), and the thread-plane direction is interchanged at certain angles from coronal to sagittal (c)
Outline of the GPU acceleration method
Fig. 4Calculation time curves of different methods: threads are allocated in axial planes and racing threads are solved with atomic operations (black); threads are allocated in vertical planes and racing threads are solved with atomic operations without thread-plane direction interchange (green); threads are allocated in vertical planes and racing threads are solved with atomic operations with thread-plane direction interchange at given angles (blue); threads are allocated in vertical planes and racing threads are solved with atomic operations with thread-plane direction interchange at critical angles (red)
Computation efficiency comparison of different methods
| Projection frames/image matrix dimension | Method | Total time (s) | Average time (s) | Acceleration ratio |
|---|---|---|---|---|
| 360/512c | CPUa | 360 × 45 | 45 | 1 |
| GPUb | 214.97 | 0.597 | 75.36 | |
| GPUc | 168.49 | 0.468 | 96.15 | |
| GPUd | 153.499 | 0.426 | 105.54 |
aCPU implementation on a single thread, b GPU acceleration with threads allocated in axial planes, c GPU acceleration without thread-plane interchange, d proposed GPU acceleration method