| Literature DB >> 29495637 |
Bingyi Li1, Hao Shi2,3, Liang Chen4, Wenyue Yu5, Chen Yang6, Yizhuang Xie7, Mingming Bian8, Qingjun Zhang9, Long Pang10.
Abstract
With the development of satellite load technology and very large-scale integrated (VLSI) circuit technology, on-board real-time synthetic aperture radar (SAR) imaging systems have facilitated rapid response to disasters. A key goal of the on-board SAR imaging system design is to achieve high real-time processing performance under severe size, weight, and power consumption constraints. This paper presents a multi-node prototype system for real-time SAR imaging processing. We decompose the commonly used chirp scaling (CS) SAR imaging algorithm into two parts according to the computing features. The linearization and logic-memory optimum allocation methods are adopted to realize the nonlinear part in a reconfigurable structure, and the two-part bandwidth balance method is used to realize the linear part. Thus, float-point SAR imaging processing can be integrated into a single Field Programmable Gate Array (FPGA) chip instead of relying on distributed technologies. A single-processing node requires 10.6 s and consumes 17 W to focus on 25-km swath width, 5-m resolution stripmap SAR raw data with a granularity of 16,384 × 16,384. The design methodology of the multi-FPGA parallel accelerating system under the real-time principle is introduced. As a proof of concept, a prototype with four processing nodes and one master node is implemented using a Xilinx xc6vlx315t FPGA. The weight and volume of one single machine are 10 kg and 32 cm × 24 cm × 20 cm, respectively, and the power consumption is under 100 W. The real-time performance of the proposed design is demonstrated on Chinese Gaofen-3 stripmap continuous imaging.Entities:
Keywords: multi-nodes parallel accelerating technique; real-time processing; single FPGA node imaging processing; synthetic aperture radar (SAR)
Year: 2018 PMID: 29495637 PMCID: PMC5876905 DOI: 10.3390/s18030725
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Flowchart of the chirp scaling (CS) algorithm.
Figure 2Hierarchical decomposition mapping flow.
Figure 3Field Programmable Gate Array (FPGA)-based architecture.
Figure 4Programmable element (PE) and PE array architecture.
Figure 5The relative errors between hardware and software computation.
Figure 6Point target imaging result with no window.
Point target imaging quality assessment.
| Window | Azimuthal Direction | Range Direction | |||||
|---|---|---|---|---|---|---|---|
| PSLR (dB) | ISLR (dB) | RES (m) | PSLR (dB) | ISLR (dB) | RES (m) | ||
| −11.21 | −10.00 | 2.93 | −10.93 | −9.03 | 2.2 | ||
| −12.52 | −11.41 | 2.1 | −12.61 | −10.05 | 1.9 | ||
| −27.54 | −21.85 | 4.10 | −27.23 | −22.34 | 2.45 | ||
| −29.74 | −25.93 | 2.7 | −28.58 | −24.21 | 2.3 | ||
| −38.73 | −37.21 | 4.61 | −36.85 | −34.73 | 3.10 | ||
| −40.71 | −39.10 | 3.16 | −38.32 | −36.08 | 2.69 | ||
Time consumption of three PFE procedures.
| Stage | Cycles (Working Frequency of 100 MHz) |
|---|---|
| 1st PFE | 512 |
| 2nd PFE | 512 |
| 3rd PFE | 404 |
Figure 7Cross-mapping method.
Figure 8Block diagram of the multi-node parallel system architecture.
Figure 9The time-sequence of multi-node processing parallel processing.
Basic parameters of the single-node imaging implementation.
| Parameter | Value |
|---|---|
| FPGA main frequency | 100 MHz |
| 16,384 | |
| 16,384 | |
| 32 | |
| 32 | |
| 8 | |
| 400 MHz | |
| 200 MHz | |
| 100 MHz | |
| 64 bit |
Figure 10Two-dimensional data read mode.
Summary of the parameters related to parallel processing.
| Parameter | Range Direction Operation | Azimuthal Direction Operation | ||
|---|---|---|---|---|
| Proposed | [ | Proposed | [ | |
| 0.74 | 0.9375 | 0.74 | 0.74 | |
| 1 | 1 | 1 | 0.5 | |
| 2 | 1 | 4 | 4 | |
| 6.4 GB/s | 6.4 GB/s | 6.4 GB/s | 6.4 GB/s | |
| 4.8 GB/s | 6 GB/s | 4.8 GB/s | 2.37 GB/s | |
| 1.6 GB/s | 1.6 GB/s | 1.6 GB/s | 1.6 GB/s | |
| 0.8 GB/s | 0.8 GB/s | 0.8 GB/s | 0.8 GB/s | |
| 4 | 2 | 6 | 2 | |
FPGA resource occupation (Xilinx xc6vlx315t).
| Parameter | Value |
|---|---|
| Number of slice registers | 134,259 (34%) |
| Number of LUTs | 122,467 (62%) |
| Number of block RAMs/FIFOs | 499 (67%) |
| Number of DSP48s | 387 (28%) |
Figure 11Actual scene imaging result of the system hardware.
Actual scene imaging quality assessment.
| Parameter | Value |
|---|---|
| MSE | 23.3 |
| PSNR (dB) | 30 |
| SSIM | 0.99 |
| γ (dB) | 4.98 |
Comparison with previous works.
| Works | Year | Schemes | Data Granularity | Working Frequency | Power Consumption | Processing Time |
|---|---|---|---|---|---|---|
| Proposed | 2017 | FPGA | 16,384 × 16,384 | 100 MHZ | 17 W | 10.6 s |
| [ | 2017 | FPGA + ASIC | 16,384 × 16,384 | 100 MHZ | 21 W | 12.1 s |
| [ | 2016 | FPGA + Microprocess | 6472 × 3328 | / | 68 W | 8 s |
| [ | 2016 | CPU + GPU | 32,768 × 32,768 | / | >330 W | 2.8 s |
| [ | 2015 | Multi-DSP | 4096 × 4096 | 100 MHZ | / | 2.178 s |
| [ | 2012 | CPU + ASIC | 1024 × 1024 | 100 MHz | 10 W | / |
| [ | 2008 | Multi-DSP | 4096 × 4096 | 100 MHZ | 35 W | 13 s |
| [ | 1998 | ASIC | 1020 × 200 | 10 MHz | 2 W | / |
The real-time parameters of the multi-node system.
| Parameter | Value (s) |
|---|---|
| 7.9 | |
| 2.7 | |
| 10.6 | |
| 2.7 | |
| 4 |
Figure 12(a) Photographs of the board and (b) the prototype single machine.
The structuring indicators of the prototype single machine system.
| Indicator | Value |
|---|---|
| Weight | 10 kg |
| Volume | 32 cm × 24 cm × 20 cm |
| Power | <100 W |
Figure 13The continuous imaging results of 12 images.