| Literature DB >> 30366454 |
Yi Zhang1, Zebin Wu2,3, Jin Sun4, Yan Zhang5, Yaoqin Zhu6, Jun Liu7, Qitao Zang8, Antonio Plaza9.
Abstract
Anomaly detection aims to separate anomalous pixels from the background, and has become an important application of remotely sensed hyperspectral image processing. Anomaly detection methods based on low-rank and sparse representation (LRASR) can accurately detect anomalous pixels. However, with the significant volume increase of hyperspectral image repositories, such techniques consume a significant amount of time (mainly due to the massive amount of matrix computations involved). In this paper, we propose a novel distributed parallel algorithm (DPA) by redesigning key operators of LRASR in terms of MapReduce model to accelerate LRASR on cloud computing architectures. Independent computation operators are explored and executed in parallel on Spark. Specifically, we reconstitute the hyperspectral images in an appropriate format for efficient DPA processing, design the optimized storage strategy, and develop a pre-merge mechanism to reduce data transmission. Besides, a repartitioning policy is also proposed to improve DPA's efficiency. Our experimental results demonstrate that the newly developed DPA achieves very high speedups when accelerating LRASR, in addition to maintaining similar accuracies. Moreover, our proposed DPA is shown to be scalable with the number of computing nodes and capable of processing big hyperspectral images involving massive amounts of data.Entities:
Keywords: anomaly detection; apache spark; clouds; distributed and parallel computing; hyperspectral images
Year: 2018 PMID: 30366454 PMCID: PMC6263513 DOI: 10.3390/s18113627
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Data model of LRASR.
Figure 2Flowchart of LRASR.
Figure 3Flowchart of DPA.
Figure 4Data transmission before using the pre-merge mechanism.
Figure 5Data transmission after using the pre-merge mechanism.
Details about the four experiments.
| Experiments | HSIs | Platforms |
|---|---|---|
| Experiment 1 | HSI1 | Spark1 |
| Experiment 2 | HSI2 | Spark1 |
| Experiment 3 | HSI1 | Spark2 |
| Experiment 4 | HSI3-HSI6 | Spark3 |
Figure 6The first Hyperspectral Image (HSI1). (a) The false color image of the entire image; (b) The false color image of the chosen area for detection; (c) The ground-truth map of the chosen area [30].
Figure 7The second Hyperspectral Image (HSI2). (a) The false color image of the entire image; (b) The false color image of the chosen area for detection; (c) The ground-truth map of the chosen area [30].
AUCs and Consumed Times(s) obtained by LRASR and DPA with different numbers of nodes in Experiments 1–3.
| Number of Nodes | Times (EX1) | AUC (EX1) | Times (EX2) | AUC (EX2) | Times (EX3) | AUC (EX3) |
|---|---|---|---|---|---|---|
| LRASR | 3987 | 0.9181 | 4957 | 0.9595 | 3657 | 0.9184 |
| DPA with 2 Nodes | 1788 | 0.9202 | 2613 | 0.9596 | 1699 | 0.9218 |
| DPA with 4 Nodes | 955 | 0.9184 | 1223 | 0.9601 | 842 | 0.9141 |
| DPA with 8 Nodes | 451 | 0.9193 | 586 | 0.9609 | 418 | 0.9188 |
| DPA with 16 Nodes | 233 | 0.9195 | 290 | 0.9616 | 232 | 0.9116 |
| DPA with 32 Nodes | 116 | 0.9203 | 149 | 0.9607 | 111 | 0.9184 |
Figure 8Speedups of DPA with different numbers of nodes in Experiments 1–3.
Figure 9Memory consumption (MB) of LRASR and DPA with different number of nodes in Experiment 1.
Figure 10Speedups of DPA processing big HSIs with different data sizes.
AUCs and Consumed Times(s) obtained by HDPA with different numbers of nodes.
| Metrics | 2 Nodes | 4 Nodes | 8 Nodes | 16 Nodes | 32 Nodes |
|---|---|---|---|---|---|
| AUC | 0.9216 | 0.9166 | 0.9187 | 0.9141 | 0.9155 |
| Times | 4320 | 2524 | 2272 | 1706 | 1706 |