| Literature DB >> 24204582 |
Yong Wang1, Dong Jiang, Dafang Zhuang, Yaohuan Huang, Wei Wang, Xinfang Yu.
Abstract
The classification of land cover based on satellite data is important for many areas of scientific research. Unfortunately, some traditional land cover classification methods (e.g. known as supervised classification) are very labor-intensive and subjective because of the required human involvement. Jiang et al. proposed a simple but robust method for land cover classification using a prior classification map and a current multispectral remote sensing image. This new method has proven to be a suitable classification method; however, its drawback is that it is a semi-automatic method because the key parameters cannot be selected automatically. In this study, we propose an approach in which the two key parameters are chosen automatically. The proposed method consists primarily of the following three interdependent parts: the selection procedure for the pure-pixel training-sample dataset, the method to determine the key parameters, and the optimal combination model. In this study, the proposed approach employs both overall accuracy and their Kappa Coefficients (KC), and Time-Consumings (TC, unit: second) in order to select the two key parameters automatically instead of using a test-decision, which avoids subjective bias. A case study of Weichang District of Hebei Province, China, using Landsat-5/TM data of 2010 with 30 m spatial resolution and prior classification map of 2005 recognised as relatively precise data, was conducted to test the performance of this method. The experimental results show that the methodology determining the key parameters uses the portfolio optimisation model and increases the degree of automation of Jiang et al.'s classification method, which may have a wide scope of scientific application.Entities:
Mesh:
Year: 2013 PMID: 24204582 PMCID: PMC3810380 DOI: 10.1371/journal.pone.0075852
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1General flowchart of proposed approach (KC: Kappa Coefficient; TC: Time-Consuming).
Figure 2Flowchart of the iterated procedure used to determine the key parameters ( and ) (KC: Kappa Coefficient; TC: Time-Consuming).
Figure 3Selection process of portfolio optimisation model (KC: Kappa Coefficient; TC: Time-Consuming).
Figure 4Location of the study area: Weichang County, Hebei Province, China.
The results of KC (Kappa Coefficient).
| Pbuffer\Pa | 10% | 20% | 30% | 40% | 50% | 60% | 70% | 80% | 90% | 100% |
| 10% | 0.011 | 0.013 | 0.015 | 0.017 | 0.018 | 0.021 | 0.019 | 0.018 | 0.016 | 0.015 |
| 20% | 0.099 | 0.149 | 0.207 | 0.312 | 0.353 | 0.381 | 0.331 | 0.283 | 0.197 | 0.091 |
| 30% | 0.132 | 0.282 | 0.323 | 0.401 | 0.437 | 0.476 | 0.441 | 0.317 | 0.204 | 0.169 |
| 40% | 0.201 | 0.278 | 0.391 | 0.452 | 0.528 | 0.539 | 0.516 | 0.476 | 0.361 | 0.292 |
| 50% | 0.276 | 0.324 | 0.441 | 0.524 | 0.581 | 0.692 | 0.628 | 0.528 | 0.456 | 0.308 |
| 60% | 0.332 | 0.398 | 0.492 | 0.568 | 0.621 | 0.762 | 0.760 | 0.612 | 0.489 | 0.362 |
| 70% | 0.328 | 0.364 | 0.486 | 0.557 | 0.579 | 0.623 | 0.619 | 0.573 | 0.477 | 0.213 |
| 80% | 0.294 | 0.347 | 0.424 | 0.502 | 0.544 | 0.592 | 0.585 | 0.486 | 0.392 | 0.271 |
| 90% | 0.192 | 0.297 | 0.316 | 0.392 | 0.473 | 0.492 | 0.468 | 0.395 | 0.226 | 0.107 |
| 100% | 0.019 | 0.222 | 0.248 | 0.258 | 0.287 | 0.268 | 0.257 | 0.238 | 0.201 | 0.014 |
TC (Time-Consuming, unit: second) of the proposed methodology with different parameters.
| Pbuffer\Pa | 10 | 20 | 30 | 40 | 50 | 60 | 70 | 80 | 90 | 100 |
| 10 | 25 | 26 | 31 | 39 | 42 | 51 | 80 | 103 | 111 | 149 |
| 20 | 32 | 37 | 45 | 57 | 68 | 87 | 122 | 170 | 205 | 242 |
| 30 | 47 | 57 | 71 | 89 | 93 | 119 | 181 | 251 | 285 | 346 |
| 40 | 57 | 69 | 86 | 109 | 119 | 149 | 221 | 309 | 336 | 415 |
| 50 | 65 | 83 | 102 | 111 | 138 | 168 | 273 | 337 | 403 | 476 |
| 60 | 78 | 99 | 136 | 161 | 185 | 236 | 370 | 432 | 479 | 573 |
| 70 | 88 | 115 | 148 | 168 | 199 | 247 | 375 | 445 | 492 | 582 |
| 80 | 100 | 133 | 159 | 178 | 217 | 258 | 389 | 468 | 525 | 601 |
| 90 | 157 | 183 | 212 | 239 | 273 | 318 | 467 | 547 | 621 | 698 |
| 100 | 134 | 182 | 235 | 272 | 298 | 336 | 515 | 569 | 657 | 721 |
Figure 5Relationship among , and TC (Time-Consuming, unit: second).
Figure 6Comparison of land cover classification in Weichang.
Error matrix of the combination model (80%, 80%).
| Cropland2 | Forest2 | Grassland2 | Water2 | Residential and construction land2 | Bareland2 | Sum | Omission error | |
| Cropland1 | 16.08 | 3.54 | 4.63 | 0.82 | 0.19 | 1.15 | 26.40 | 39.1 |
| Forest1 | 4.84 | 22.85 | 8.41 | 0.95 | 0.02 | 0.70 | 37.77 | 39.5 |
| Grassland1 | 3.30 | 2.97 | 16.35 | 1.00 | 0.11 | 1.12 | 24.85 | 34.2 |
| Water1 | 0.52 | 1.19 | 0.15 | 3.01 | 0.03 | 0.18 | 5.09 | 40.7 |
| Residential and construction land1 | 0.06 | 0.02 | 0.06 | 0.10 | 0.43 | 0.08 | 0.75 | 43.3 |
| Bareland1 | 0.29 | 0.46 | 0.41 | 0.49 | 0.03 | 3.46 | 5.14 | 32.7 |
| Sum | 25.09 | 31.04 | 30.01 | 6.38 | 0.80 | 6.69 | 100.00 | |
| Commission error | 35.9 | 26.4 | 45.5 | 52.7 | 46.8 | 48.2 |
Note: 1) Land cover types with number 1 (i.e. Cropland1, Forest1, Grassland1, Water1, Residential and construction land1, and Bareland1 ) stand for land cover results of the visual interpretation; Land cover types with number 2 stand for land cover results of Automatic classification. 2) For better quantitative assessment, absolute values (pixel number) were converted to percentage values in each error matrix. 3) For automatic classification result, overall accuracy = 62.2%, KC = 0.486, sample size = 8,658,588.
Error matrix of the combination model (20%, 20%).
| Cropland2 | Forest2 | Grassland2 | Water2 | Residential and construction land2 | Bareland2 | Sum | Omission error | |
| Cropland1 | 7.85 | 6.88 | 5.06 | 5.06 | 0.12 | 0.66 | 25.65 | 69.4 |
| Forest1 | 5.73 | 19.98 | 11.14 | 3.33 | 0.21 | 1.26 | 41.66 | 52.0 |
| Grassland1 | 4.64 | 7.15 | 7.58 | 4.25 | 0.49 | 1.09 | 25.20 | 69.9 |
| Water1 | 0.24 | 1.48 | 0.14 | 1.81 | 0.11 | 0.05 | 3.82 | 52.6 |
| Residential and construction land1 | 0.08 | 0.09 | 0.06 | 0.04 | 0.19 | 0.02 | 0.48 | 61.1 |
| Bareland1 | 0.43 | 0.59 | 0.56 | 1.18 | 0.12 | 0.32 | 3.20 | 89.9 |
| Sum | 18.96 | 36.18 | 24.54 | 15.67 | 1.24 | 3.41 | 100.00 | |
| Commission error | 58.6 | 44.8 | 69.1 | 88.4 | 85.0 | 90.5 |
Note: 1) Land cover types with number 1 (i.e. Cropland1, Forest1, Grassland1, Water1, Residential and construction land1, and Bareland1 ) stand for land cover results of the visual interpretation; Land cover types with number 2 stand for land cover results of Automatic classification. 2) For better quantitative assessment, absolute values (pixel number) were converted to percentage values in each error matrix. 3) For automatic classification result, overall accuracy = 37.7%, KC = 0.149, sample size = 8,658,588.
Error matrix of the combination model (60%, 60%).
| Cropland2 | Forest2 | Grassland2 | Water2 | Residential and construction land2 | Bareland2 | Sum | Omission error | |
| Cropland1 | 10.06 | 1.04 | 0.91 | 0.01 | 0.02 | 0.19 | 12.23 | 17.7 |
| Forest1 | 1.03 | 32.45 | 4.83 | 0.02 | 0.07 | 0.40 | 38.80 | 16.4 |
| Grassland1 | 0.91 | 4.84 | 29.62 | 0.08 | 0.11 | 0.43 | 35.99 | 17.7 |
| Water1 | 0.01 | 0.02 | 0.07 | 1.13 | 0.01 | 0.02 | 1.25 | 9.7 |
| Residential and construction land1 | 0.02 | 0.09 | 0.11 | 0.01 | 1.28 | 0.12 | 1.64 | 22.1 |
| Bareland1 | 0.19 | 0.42 | 0.50 | 0.02 | 0.11 | 8.85 | 10.09 | 12.3 |
| Sum | 12.23 | 38.86 | 36.04 | 1.26 | 1.61 | 10.01 | 100.00 | |
| Commission error | 17.7 | 16.5 | 17.8 | 10.4 | 20.6 | 11.6 |
Note: 1) Land cover types with number 1 (i.e. Cropland1, Forest1, Grassland1, Water1, Residential and construction land1, and Bareland1 ) stand for land cover results of the visual interpretation; Land cover types with number 2 stand for land cover results of Automatic classification. 2) For better quantitative assessment, absolute values (pixel number) were converted to percentage values in each error matrix. 3) For automatic classification result, overall accuracy = 83.4%, KC = 0.760, sample size = 8,658,588.
Error matrix of common classification approach (Maximum Likelihood Approach).
| Cropland2 | Forest2 | Grassland2 | Water2 | Residential and construction land2 | Bareland2 | Sum | Omission error | |
| Cropland1 | 13.34 | 4.92 | 3.22 | 1.04 | 1.32 | 0.12 | 23.96 | 44.3 |
| Forest1 | 1.28 | 20.69 | 9.14 | 0.59 | 3.74 | 0.25 | 35.69 | 42.0 |
| Grassland1 | 2.86 | 7.01 | 13.18 | 0.81 | 0.27 | 0.10 | 24.24 | 45.6 |
| Water1 | 0.41 | 0.84 | 0.05 | 3.93 | 0.08 | 0.08 | 5.38 | 27.0 |
| Residential and construction land1 | 0.64 | 0.98 | 0.04 | 0.20 | 5.67 | 0.08 | 7.61 | 25.5 |
| Bareland1 | 0.56 | 0.50 | 0.09 | 0.35 | 0.06 | 1.56 | 3.11 | 50.0 |
| Sum | 19.08 | 34.94 | 25.73 | 6.92 | 11.14 | 2.18 | 100.00 | |
| Commission error | 30.1 | 40.8 | 48.8 | 43.2 | 49.1 | 28.7 |
Note: 1) Land cover types with number 1 (i.e. Cropland1, Forest1, Grassland1, Water1, Residential and construction land1, and Bareland1 ) stand for land cover results of the visual interpretation; Land cover types with number 2 stand for land cover results of Maximum Likelihood approach classification. 2) For better quantitative assessment, absolute values (pixel number) were converted to percentage values in each error matrix. 3) For Maximum Likelihood approach classification, overall accuracy = 58.4%, KC = 0.4482, sample size = 8,658,588.
Comparison of the Z values in each model/approach.
| Model/Approach | Kappa | Asymptotic standard error (ASE) | 95% confidence lower limit | 95% confidence upper limit |
| combination model (80%, 80%) | 0.4856 | 0.0002 | 0.4852 | 0.4860 |
| combination model (20%, 20%) | 0.1490 | 0.0002 | 0.1485 | 0.1494 |
| combination model (60%, 60%) | 0.7607 | 0.0002 | 0.7603 | 0.7611 |
| Maximum Likelihood Approach | 0.4482 | 0.0002 | 0.4478 | 0.4487 |
Note. Sample size was 8,658,588.
The 95% confidence intervals (2.5% each side) all were less than 0.0001.
P values in bold were statistically significant (p<0.0001).
Figure 7Sketch map of automatic dataset of pure-pixel training samples.