Yun-Gang Luo, Ping Liu, Lin Shi, Yishan Luo, Lei Yi, Ang Li, Jing Qin, Pheng-Ann Heng, Defeng Wang.
Abstract
Neuroimage registration is crucial for brain morphometric analysis and for evaluating treatment efficacy. However, existing advanced registration algorithms such as FLIRT and ANTs are not efficient enough for clinical use. In this paper, we develop a GPU implementation of FLIRT that uses the correlation ratio (CR) as the similarity metric, and a GPU-accelerated correlation coefficient (CC) calculation for the symmetric diffeomorphic registration of ANTs. Comparison with the corresponding original tools shows that the accelerated algorithms greatly outperform the originals in computational efficiency. This demonstrates the great potential of these registration tools for clinical applications.
Year: 2015 PMID: 26352412 PMCID: PMC4564209 DOI: 10.1371/journal.pone.0136718
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1. Flowchart of the GPU implementation for a stage of FLIRT.
Fig 2. Sketch map of a block execution for computation of Bin[i].
Hardware specifications of the two workstations, equipped with a GTX 680 and a GTX 660, respectively.
| Device specification (GPU) | GTX680 | GTX660 |
|---|---|---|
| Number of multiprocessors | 8 | 2 |
| Number of cores per multi-processor | 192 | 192 |
| Total amount of global memory | 4096 MBytes | 512 MBytes |
| Total number of registers available per block | 65536 bytes | 65536 bytes |
| Total amount of shared memory per block | 49152 bytes | 49152 bytes |
| Total amount of constant memory | 65536 bytes | 65536 bytes |
| Maximum number of threads per block | 1024 | 1024 |
| Warp size | 32 | 32 |
| CPU specification | Intel(R) Xeon(R) CPU W3505 @2.53GHz (RAM 4.00GB) | Intel(R) Core(TM) i5-3470S CPU @2.90GHz (RAM 8.00GB) |
Fig 3. Registered results obtained by the accelerated FLIRT and by the original FLIRT, both with CR as the similarity metric.
Mean runtimes for registration of images from the 5 datasets with the original FLIRT and the accelerated FLIRT on Workstation 1 (time in seconds).
| Dataset | 1 | 2 | 3 | 4 | 5 |
|---|---|---|---|---|---|
| Dimensions (Width*Height*Depth) | 131×179×137 | 162×215×157 | 176×192×192 | 256×124×256 | 256×256×128 |
| Size (voxels) | 3212513 | 5468310 | 6488064 | 8126464 | 8388608 |
| Original FLIRT (s) | 46.9 | 63.74 | 75.66 | 95.51 | 118.29 |
| Accelerated FLIRT (s) | 13.72 | 23.17 | 25.05 | 25.54 | 24.56 |
| Total speedup | 3.4 | 2.8 | 3.0 | 3.7 | 4.8 |
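The speedup row can be reproduced directly from the two runtime rows. The short Python check below divides the original FLIRT runtime by the accelerated runtime for each dataset. The variable names are illustrative and the numbers are copied from the table.

```python
# Mean runtimes in seconds, copied from the FLIRT table (Workstation 1).
original_flirt = [46.90, 63.74, 75.66, 95.51, 118.29]
accelerated_flirt = [13.72, 23.17, 25.05, 25.54, 24.56]

# Speedup = original runtime / accelerated runtime, rounded to one decimal.
flirt_speedups = [
    round(orig / accel, 1)
    for orig, accel in zip(original_flirt, accelerated_flirt)
]

print(flirt_speedups)  # [3.4, 2.8, 3.0, 3.7, 4.8]
```

The accelerated runtime stays close to 25 s for datasets 3 to 5 while the original runtime keeps growing, which is why the speedup is largest for the fifth dataset.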
Fig 4. Registered results obtained by ANTs with the GPU-accelerated CC calculation for symmetric diffeomorphic registration and by the original ANTs.
Fig 5. Registered results for the region-labeled images provided by the IBSR, obtained by ANTs with the GPU-accelerated CC calculation for symmetric diffeomorphic registration and by the original ANTs.
Fig 6. Mean absolute differences between the reference image and the registered results obtained by the original ANTs, and between the reference image and the results obtained by the accelerated ANTs with CC calculated on the GPU.
Mean runtimes and speedups for registration of images from the 5 datasets with the original ANTs and the accelerated ANTs on Workstation 1 (time in seconds).
| Dataset | 1 | 2 | 3 | 4 | 5 |
|---|---|---|---|---|---|
| Dimensions (Width*Height*Depth) | 131×179×137 | 162×215×157 | 176×192×192 | 256×124×256 | 256×256×128 |
| Size (voxels) | 3212513 | 5468310 | 6488064 | 8126464 | 8388608 |
| Time for CC in Original ANTs | 1050.79 | 1699.62 | 2002.19 | 3170.54 | 3461.48 |
| Total time for Original ANTs | 1972.19 | 3062.61 | 3526.72 | 5023.48 | 6197.47 |
| Percent of time for CC | 53% | 55% | 57% | 63% | 56% |
| Time for CC in Accelerated ANTs | 11.46 | 17.45 | 19.79 | 28.87 | 29.27 |
| Total time for Accelerated ANTs | 934.81 | 1398.15 | 1566.96 | 3713.58 | 3452 |
| Speedup of CC | 91.69 | 97.4 | 101.17 | 109.82 | 118.26 |
| Total Speedup | 2.11 | 2.19 | 2.25 | 1.35 | 1.81 |
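Because only the CC calculation was moved to the GPU, the total speedup is limited by the share of runtime that CC occupied in the first place (an Amdahl's-law argument). The Python sketch below compares the observed total speedup with the speedup one would expect if the non-CC portion of the runtime were unchanged. The function name `predicted_speedup` and that unchanged-remainder assumption are ours, not the paper's. The numbers are copied from the Workstation 1 table.

```python
# Times in seconds, copied from the ANTs table for Workstation 1.
cc_original = [1050.79, 1699.62, 2002.19, 3170.54, 3461.48]
total_original = [1972.19, 3062.61, 3526.72, 5023.48, 6197.47]
cc_accelerated = [11.46, 17.45, 19.79, 28.87, 29.27]
total_accelerated = [934.81, 1398.15, 1566.96, 3713.58, 3452.00]


def predicted_speedup(total, cc_before, cc_after):
    """Total speedup if only the CC portion changes (assumption: the
    non-CC remainder of the runtime is identical in both runs)."""
    return total / (total - cc_before + cc_after)


predicted = [
    predicted_speedup(t, before, after)
    for t, before, after in zip(total_original, cc_original, cc_accelerated)
]
observed = [t / a for t, a in zip(total_original, total_accelerated)]

for i, (p, o) in enumerate(zip(predicted, observed), start=1):
    print(f"dataset {i}: predicted {p:.2f}, observed {o:.2f}")
```

For datasets 1 to 3 the prediction and the observation agree to within a few percent. For datasets 4 and 5 the observed speedup falls clearly short of the prediction, which means the non-CC part of those accelerated runs took longer than in the original runs. The table itself does not indicate why.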
Mean runtimes and speedups for registration of images from the 5 datasets with the original ANTs and the accelerated ANTs on Workstation 2 (time in seconds).
| Dataset | 1 | 2 | 3 | 4 | 5 |
|---|---|---|---|---|---|
| Dimensions (Width*Height*Depth) | 131×179×137 | 162×215×157 | 176×192×192 | 256×124×256 | 256×256×128 |
| Size (voxels) | 3212513 | 5468310 | 6488064 | 8126464 | 8388608 |
| Time for CC in Original ANTs | 591.31 | 966.86 | 1126.05 | 1742.72 | 1982.59 |
| Total time for Original ANTs | 996.81 | 1616.6 | 1838.79 | 2627.15 | 3039.34 |
| Percent of time for CC | 59% | 60% | 61% | 66% | 62% |
| Time for CC in Accelerated ANTs | 39.37 | 60.83 | 72.76 | 109.88 | 107.26 |
| Total time for Accelerated ANTs | 443.37 | 676.57 | 781.64 | 1011.11 | 1108.66 |
| Speedup of CC | 15.02 | 15.9 | 15.48 | 15.86 | 18.48 |
| Total Speedup | 2.25 | 2.38 | 2.35 | 2.6 | 2.74 |
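To illustrate why the CC calculation takes more than half of the original ANTs runtime in both tables, the following Python sketch evaluates a neighbourhood cross-correlation in one dimension. It assumes the commonly cited squared, locally normalised form of the metric, in which every voxel needs its own window sums. The function name `local_cc`, the `radius` parameter, and the 1-D simplification are illustrative assumptions. The real metric works on 3-D neighbourhoods, and this is not the authors' GPU kernel.

```python
def local_cc(fixed, moving, radius=2):
    """Mean squared normalised cross-correlation over sliding 1-D windows.

    For each position, the window covers up to `radius` samples on either
    side. Windows in which either signal is constant are skipped.
    """
    if len(fixed) != len(moving):
        raise ValueError("signals must have the same length")
    n = len(fixed)
    scores = []
    for centre in range(n):
        lo, hi = max(0, centre - radius), min(n, centre + radius + 1)
        f, m = fixed[lo:hi], moving[lo:hi]
        mean_f = sum(f) / len(f)
        mean_m = sum(m) / len(m)
        s_fm = sum((a - mean_f) * (b - mean_m) for a, b in zip(f, m))
        s_ff = sum((a - mean_f) ** 2 for a in f)
        s_mm = sum((b - mean_m) ** 2 for b in m)
        if s_ff > 0 and s_mm > 0:
            scores.append(s_fm * s_fm / (s_ff * s_mm))
    return sum(scores) / len(scores) if scores else 0.0


signal = [1, 2, 3, 4, 5, 6]
# A linear intensity change leaves every local correlation at 1.
linear_match = local_cc(signal, [2 * v + 1 for v in signal])
# A constant image carries no local structure, so no window contributes.
flat_match = local_cc([1, 2, 3, 4], [5, 5, 5, 5])
```

Each voxel's window statistics are independent of every other voxel's, so the work grows with the number of voxels times the window size and can be distributed across many threads. This is consistent with the large CC speedups reported in the tables above.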