| Literature DB >> 28781999 |
Yu Chen1,2, Zihao Wang1,2, Jingrong Zhang1,2, Lun Li1,3, Xiaohua Wan1, Fei Sun2,4,5, Fa Zhang1.
Abstract
Electron tomography (ET) plays an important role in studying in situ cell ultrastructure in three-dimensional space. Due to limited tilt angles, ET reconstruction always suffers from the "missing wedge" problem. With a validation procedure, iterative compressed-sensing optimized NUFFT reconstruction (ICON) demonstrates its power in the restoration of validated missing information for low SNR biological ET dataset. However, the huge computational demand has become a major problem for the application of ICON. In this work, we analyzed the framework of ICON and classified the operations of major steps of ICON reconstruction into three types. Accordingly, we designed parallel strategies and implemented them on graphics processing units (GPU) to generate a parallel program ICON-GPU. With high accuracy, ICON-GPU has a great acceleration compared to its CPU version, up to 83.7×, greatly relieving ICON's dependence on computing resource.Entities:
Keywords: Acceleration; Electron tomography; GPU; ICON; Missing wedge restoration
Year: 2017 PMID: 28781999 PMCID: PMC5516007 DOI: 10.1007/s41048-017-0041-z
Source DB: PubMed Journal: Biophys Rep ISSN: 2364-3439
Fig. 1The RMSREs of ICON-GPU
Fig. 2Evaluate ICON-GPU by investigating the reconstructed tomograms. A–C The XY-slices of the tomograms reconstructed by WBP, ICON-CPU, and ICON-GPU, respectively; D–F The XZ-slices of the tomograms reconstructed by WBP, ICON-CPU, and ICON-GPU, respectively
Fig. 3Evaluate ICON-GPU by the pseudo-missing-validation procedure. A The omit-projection (“Ground truth”); B–D The re-projections of the omit-tomograms reconstructed by WBP, ICON-CPU, and ICON-GPU, respectively; E The pseudo-missing-validation FRCs of WBP, ICON-CPU, and ICON-GPU
Fig. 4The comparison of time-consuming of ICON-CPU and ICON-GPU
The speedups of ICON-GPU compared to ICON-CPU
| Image size | Speedup |
|---|---|
| 5122 | 8.7× |
| 10242 | 41.5× |
| 20482 | 61.2× |
| 40962 | 83.7× |
Fig. 5Pseudo codes for calling a kernel function and the operations inside a kernel function for element-wise operations
Fig. 6The speedups of parallel NUFFTs compared to NFFT3.0