| Literature DB >> 33380769 |
Malte Brunn1, Naveen Himthani2, George Biros2, Miriam Mehl1, Andreas Mang3.
Abstract
3D image registration is one of the most fundamental and computationally expensive operations in medical image analysis. Here, we present a mixed-precision, Gauss-Newton-Krylov solver for diffeomorphic registration of two images. Our work extends the publicly available CLAIRE library to GPU architectures. Despite the importance of image registration, only a few implementations of large deformation diffeomorphic registration packages support GPUs. Our contributions are new algorithms to significantly reduce the run time of the two main computational kernels in CLAIRE: calculation of derivatives and scattered-data interpolation. We deploy (i) highly-optimized, mixed-precision GPU-kernels for the evaluation of scattered-data interpolation, (ii) replace Fast-Fourier-Transform (FFT)-based first-order derivatives with optimized 8th-order finite differences, and (iii) compare with state-of-the-art CPU and GPU implementations. As a highlight, we demonstrate that we can register 2563 clinical images in less than 6 seconds on a single NVIDIA Tesla V100. This amounts to over 20× speed-up over the current version of CLAIRE and over 30× speed-up over existing GPU implementations.Entities:
Keywords: Diffeomorphic Image Registration; GPU computing; Gauss–Newton–Krylov Method; Mixed-Precision Solver; Parallel Optimization
Year: 2020 PMID: 33380769 PMCID: PMC7769216 DOI: 10.1016/j.jpdc.2020.11.006
Source DB: PubMed Journal: J Parallel Distrib Comput ISSN: 0743-7315 Impact factor: 3.734