| Literature DB >> 29932436 |
Yi Zhou1, Rui Zhang2,3, Shixin Wang4, Futao Wang5.
Abstract
With the advent of high spatial resolution remote sensing imagery, numerous image features can be utilized. Applying a reasonable feature selection approach is critical to effectively reduce feature redundancy and improve the efficiency and accuracy of classification. This paper proposes a novel feature selection approach, in which ReliefF, genetic algorithm, and support vector machine (RFGASVM) are integrated to extract buildings. We adopt the ReliefF algorithm to preliminary filter high-dimensional features in the feature database. After eliminating the sorted features, the feature subset and the C and γ parameters of support vector machine (SVM) are encoded into the chromosome of the genetic algorithm. A fitness function is constructed considering the sample identification accuracy, the number of selected features, and the feature cost. The proposed method was applied to high-resolution images obtained from different sensors, GF-2, BJ-2, and unmanned aerial vehicles (UAV). The confusion matrix, precision, recall and F1-score were applied to assess the accuracy. The results showed that the proposed method achieved feature reduction, and the overall accuracy (OA) was more than 85%, with Kappa coefficient values of 0.80, 0.83 and 0.85, respectively. The precision of each image was more than 85%. The time efficiency of the proposed method was two-fold greater than SVM with all the features. The RFGASVM method has the advantages of large feature reduction and high extraction performance and can be applied in feature selection.Entities:
Keywords: SVM; accuracy evaluation; feature selection; genetic algorithm; object-based
Year: 2018 PMID: 29932436 PMCID: PMC6068868 DOI: 10.3390/s18072013
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Experimental data ((a) GF-2 satellite data, (b) BJ-2 satellite data, and (c) unmanned aerial vehicle (UAV) data).
Figure 2Results of segmented images ((a) GF-2 satellite data, (b) BJ-2 satellite data, and (c) UAV data).
Description of features extracted from high resolution remote-sensing images.
| Feature Name | Feature Description |
|---|---|
|
| Mean L (R, G, B, NIR); brightness; SD L (R, G, B, NIR); ratio L (R, G, B, NIR); max.diff; MBI index (Huang Xin et al.); BAI:(B − NIR)/(B + NIR); |
|
| Area; length; width; length/width; boundary length; pixel number; shape index; density; main direction; asymmetry; compactness; rectangular fit; elliptic fit; differential of morphological profiles (DMP) |
|
| GLCM entropy; GLCM angular second moment; GLCM correlation; GLCM homogeneity; GLCM contrast; GLCM mean; GLCM SD; GLCM dissimilarity; GLDV angular second moment; GLDV entropy; GLDV contrast; GLDV mean |
|
| SI:(R + G + B + NIR)/4; |
|
| Object numbers; object layers; image resolution; mean of image layers |
|
| Digital elevation model(DEM); slope; aspect; building vectors |
(Remark: Mean L: mean of the bands; SD L: Standard Deviation of the bands; ratio L: ratio of the bands; MBI: Morphological Building Index; BAI: Building Area Index; NDBI: Normalized Difference Build-up Index; NDVI: Normalized Difference Vegetation Index; DVI: Difference Vegetation Index; RVI: Ratio Vegetation Index; SAVI: Soil-Adjusted Vegetation Index; OSAVI: Optimized Soil Adjusted Vegetation Index; SBI: Soil brightness index; NDWI: Normalized Difference Water Index; GLCM: Gray Level Co-occurrence Matrix; GLDV: Grey Level Difference Vector; SI: Shadow Index; Chen: Custom features).
Description of features extracted from unmanned aerial vehicle image.
| Feature Name | Feature Description |
|---|---|
|
| Mean L (R, G, B, NIR); brightness; SD L (R, G, B, NIR); ratio L (R, G, B, NIR); max.diff; Green Index: GR = G/(R + G + B); Red-Green Vegetation Index: NGRDI = (G − R)/(G + R); GLI = (2G − R − B)/(2G + R + B) |
|
| Area; length; width; length/width; boundary length; pixel number; shape index; density; main direction; asymmetry; compactness; rectangular fit; elliptic fit; differential of morphological profiles (DMP); digital surface model(nDSM); height standard deviation |
|
| GLCM entropy; GLCM angular second moment; GLCM correlation; GLCM homogeneity; GLCM contrast; GLCM mean; GLCM SD; GLCM dissimilarity; GLDV angular second moment; GLDV entropy; GLDV contrast; GLDV mean |
|
| Chen4: (R + B)/(G − 2); Chen5: |R + G − 2B| |
|
| Object numbers; object layers; image resolution; mean of image layers |
|
| Digital elevation model(DEM); slope; aspect; building vectors |
(Remark: GR: Green Index; GLR: Green Leaf Index; NGRDI: Red-Green vegetation index).
Figure 3The optimal hyperplane.
Figure 4The chromosome design. SVM: support vector machine.
Figure 5Flow chart of the extraction of house information based on the RFGASVM optimization algorithm.
Figure 6Schematic of training and test samples.
Sample statistics selected from different high-resolution images.
| Data | Sample Category | Building | Road | Vegetation | Shadow | Water | Bare Land |
|---|---|---|---|---|---|---|---|
| GF-2 image | Training samples | 95 | 75 | 85 | 68 | 70 | 92 |
| BJ-2 image | Training samples | 95 | 80 | 87 | 79 | -- | 91 |
| UAV image | Training samples | 105 | 110 | 95 | 90 | -- | 90 |
Figure 7Extraction results of urban and rural areas.
Statistical analysis of the accuracy of proposed method for processing high-resolution imagery.
| High-Resolution Imagery | GF-2 Satellite Image | BJ-2 Satellite Image | UAV Image |
|---|---|---|---|
| Overall accuracy (OA) | 88.52 | 89.75 | 91.3 |
| Kappa coefficient | 0.8 | 0.83 | 0.85 |
| Producer’s Accuracy (PA) | 91 | 93.12 | 96.21 |
| User’s Accuracy (UA) | 89.65 | 89 | 90.38 |
| Number of features | 8 | 6 | 10 |
| Optimization time | 7.85 | 13.79 | 18 |
Figure 8Probability distribution density of different object features based on extraction from high-resolution images ((a) BJ-2 imagery, (b) UAV imagery, and (c) GF-2 imagery).
Comparison among RFGASVM and related methods.
| Experimental Data | Evaluation Index | RFGASVM | SVM (All Features) | RFSVM |
|---|---|---|---|---|
|
| Overall accuracy (OA) | 88.52 | 86.46 | 83.02 |
|
| Overall accuracy (OA) | 89.75 | 81.06 | 80 |
|
| Overall accuracy (OA) | 91.30 | 86 | 90.25 |
Results of the test samples of satellite and UAV imagery between proposed RFGASVM and related methods in terms of precision, recall and F1-score.
| Experimental Data | Method | Precision | Recall | F1-Score |
|---|---|---|---|---|
|
| 85.50 | 86.81 | 86.15 | |
|
| 89.51 | 88.12 | 88.81 | |
|
| 92.25 | 90.05 | 91.14 |
Figure 9Comparison of the efficiency of proposed RFGASVM and related methods using different iteration times.