| Literature DB >> 29225715 |
Abstract
Remote sensing images from Earth-orbiting satellites are a potentially rich data source for monitoring and cataloguing atmospheric health hazards that cover large geographic regions. A method is proposed for classifying such images into hazard and nonhazard regions using the autologistic regression model, which may be viewed as a spatial extension of logistic regression. The method includes a novel and simple approach to parameter estimation that makes it well suited to handling the large and high-dimensional datasets arising from satellite-borne instruments. The methodology is demonstrated on both simulated images and a real application to the identification of forest fire smoke.Entities:
Keywords: Autologistic regression; Forest fire smoke; Hyperspectral images; Image segmentation; Machine learning
Year: 2016 PMID: 29225715 PMCID: PMC5711969 DOI: 10.1007/s12561-016-9185-5
Source DB: PubMed Journal: Stat Biosci ISSN: 1867-1764
Fig. 1An illustration of a hyperspectral image
Fig. 2The region of interest, showing a a clear-sky composite image, and b a typical image during a significant smoke event
Fig. 3Analysis flowchart for classifier construction
Fig. 4Examples of the simulated images. Top row the images. Bottom row the true classes
Results of MPL and plug-in estimation for different image sizes
| Pixels | Method |
|
|
|
| Error rate (%) | Time (min) |
|---|---|---|---|---|---|---|---|
|
| plug-in |
|
| 1.91 | 0.90 | 20.1 | 0.25 |
| PL |
|
| 2.06 | 0.99 | 20.4 | 0.49 | |
|
| plug-in |
|
| 1.71 | 1.00 | 17.7 | 0.66 |
| PL |
|
| 1.70 | 1.19 | 17.7 | 1.5 | |
|
| plug-in |
|
| 1.63 | 1.60 | 20.1 | 2.8 |
| PL |
|
| 1.68 | 1.36 | 20.1 | 7.5 | |
|
| plug-in |
|
| 1.76 | 1.95 | 20.6 | 6.9 |
| PL |
|
| 1.79 | 1.51 | 20.4 | 20 | |
|
| plug-in |
|
| 1.58 | 1.95 | 18.8 | 12 |
| PL |
|
| 1.49 | 1.59 | 18.6 | 35 |
Symbols denote coefficients for the red, green, and blue predictors, respectively. These coefficient values correspond to the plus/minus response coding. The two estimation methods consistently give similar parameter estimates and error rates
Reported times for the plug-in method are times for a single value in a parallel computing implementation; see the comments in Sect. 6
Fig. 5Example of prediction results for an test image. Left the original image. Right the predicted probabilities using plug-in estimation. True class boundaries have been superimposed to aid visualization
Fig. 6Effect of plugging in various values on prediction error, for different autologistic model variants. Only the standard model with coding shows improvement; the other models show a strong degradation of performance. Pairwise parameter values for all variants are converted to the -coding scale
Information about the best models found
| Predictor set | Selected variables (MODIS band numbers) | plug-in |
|---|---|---|
| Main effects | 1 6 7 8 14 16 17 18 21 23 25 26 30 31 32 36 | 1.85 |
| Main effects & interactions | 7 30 2:3 5:26 6:11 7:36 8:20 8:22 8:25 8:31 13:15 13:23 16:31 18:23 22:36 32:36 | 1.75 |
Test-set prediction error rates for the smoke data
| Error rate (%) | |||
|---|---|---|---|
| Model | Nonsmoke pixels | Smoke pixels | Overall |
| Main effects, logistic | 21.1 | 25.9 | 21.6 |
| Interactions, logistic | 20.0 | 23.3 | 20.3 |
| Main effects, autologistic | 17.6 | 23.9 | 18.2 |
| Interactions, autologistic | 16.2 | 21.3 | 16.7 |
Adding interaction terms and using the autologistic model both yield improvements over the base logistic regression model
Fig. 7A scene that is relatively easy to segment, showing the colour image and the fitted probability maps for the logistic and autologistic models. Red outlines indicate the boundaries of the human-identified smoke regions
Fig. 8A scene that is harder to segment due to the presence of clouds and areas with thin smoke. The autologistic model imparts spatial smoothness but cannot correct ambiguities inherent in the logistic model