| Literature DB >> 28075343 |
Zhixin Li1, Dandan Su2, Haijiang Zhu3, Wei Li4, Fan Zhang5, Ruirui Li6.
Abstract
Synthetic Aperture Radar (SAR) raw data simulation is a fundamental problem in radar system design and imaging algorithm research. The growth of surveying swath and resolution results in a significant increase in data volume and simulation period, which can be considered to be a comprehensive data intensive and computing intensive issue. Although several high performance computing (HPC) methods have demonstrated their potential for accelerating simulation, the input/output (I/O) bottleneck of huge raw data has not been eased. In this paper, we propose a cloud computing based SAR raw data simulation algorithm, which employs the MapReduce model to accelerate the raw data computing and the Hadoop distributed file system (HDFS) for fast I/O access. The MapReduce model is designed for the irregular parallel accumulation of raw data simulation, which greatly reduces the parallel efficiency of graphics processing unit (GPU) based simulation methods. In addition, three kinds of optimization strategies are put forward from the aspects of programming model, HDFS configuration and scheduling. The experimental results show that the cloud computing based algorithm achieves 4_ speedup over the baseline serial approach in an 8-node cloud environment, and each optimization strategy can improve about 20%. This work proves that the proposed cloud algorithm is capable of solving the computing intensive and data intensive issues in SAR raw data simulation, and is easily extended to large scale computing to achieve higher acceleration.Entities:
Keywords: big data; cloud computing; distributed simulation; raw data generation; synthetic aperture radar (SAR)
Year: 2017 PMID: 28075343 PMCID: PMC5298686 DOI: 10.3390/s17010113
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1The Fast Fourier Transform (FFT) based time-domain SAR raw data simulation diagram.
Figure 2MapReduce model structure.
Figure 3Hadoop Distributed File System (HDFS) structure and its reading process.
Figure 4The cloud computing based SAR raw data simulation diagram.
Experiment configuration.
| Name | Configuration |
|---|---|
| operation system | Linux RedHat 5.3 |
| Hadoop | Version 2.5.2 |
| Java | Version 1.7 |
| NameNode | Hex-core 3.2 GHz Intel Xeon processor, 16 GB memory |
| DataNode | Bi-core 3.2 GHz Intel I5 processor, 4 GB memory |
| Network | 100 Mbps network |
Experimental algorithms.
| Name | Methods | Map Job | Combine Job | Reduce Job |
|---|---|---|---|---|
| Alg1 | − | − | − | |
| Alg2 | ||||
| Alg3 | − |
Figure 5The input target scattering file.
Figure 6The imaging results of simulated SAR raw data.
Run time and speedup comparison with different algorithms.
| Nodes | Alg1 | Alg2 | Alg3 | Alg2 | Alg3 |
|---|---|---|---|---|---|
| Run Time (s) | Run Time (s) | Run Time (s) | Speedup | Speedup | |
| 1 | 420 | 338 | 372 | 1.24 | 1.12 |
| 2 | − | 183 | 198 | 2.30 | 2.12 |
| 4 | − | 133 | 168 | 3.16 | 2.50 |
| 6 | − | 112 | 131 | 3.75 | 3.21 |
| 8 | − | 109 | 125 | 3.86 | 3.36 |
Run time comparison with distributed computing method [15] under big input file conditions.
| Nodes | Alg2 Run Time (s) | DC Run Time (s) |
|---|---|---|
| 1 | 1320 | 5482 |
| 4 | 411 | 1432 |
| 8 | 289 | 784 |
Figure 7The speedup comparison between Alg2 and Alg3.
Parallel efficiency comparison between Alg2 and Alg3.
| Nodes | Alg2 | Alg3 |
|---|---|---|
| Parallel Efficiency | Parallel Efficiency | |
| 1 | 100% | 100% |
| 2 | 92% | 93% |
| 4 | 64% | 55% |
| 6 | 50% | 47% |
| 8 | 39% | 37% |
The impact of split number on algorithm performance.
| Splits (Map) Number | Alg2 Run Time (s) |
|---|---|
| 6 | 253 |
| 9 | 193 |
| 18 | 151 |
| 36 | 122 |
| 72 | 176 |
| 144 | 251 |
Figure 8The relationship between split number and simulation time.
Experiment configuration.
| Split Number | Speculative Switch | Alg2 Run Time (s) |
|---|---|---|
| 36 | On | 172 |
| 36 | Off | 217 |
| 72 | On | 240 |
| 72 | Off | 254 |
Figure 9The FFT based time-domain SAR raw data simulation diagram.