| Literature DB >> 29743081 |
Robert F Chew1, Safaa Amer2, Kasey Jones3, Jennifer Unangst2, James Cajka4, Justine Allpress4, Mark Bruhn4.
Abstract
BACKGROUND: Conducting surveys in low- and middle-income countries is often challenging because many areas lack a complete sampling frame, have outdated census information, or have limited data available for designing and selecting a representative sample. Geosampling is a probability-based, gridded population sampling method that addresses some of these issues by using geographic information system (GIS) tools to create logistically manageable area units for sampling. GIS grid cells are overlaid to partition a country's existing administrative boundaries into area units that vary in size from 50 m × 50 m to 150 m × 150 m. To avoid sending interviewers to unoccupied areas, researchers manually classify grid cells as "residential" or "nonresidential" through visual inspection of aerial images. "Nonresidential" units are then excluded from sampling and data collection. This process of manually classifying sampling units has drawbacks since it is labor intensive, prone to human error, and creates the need for simplifying assumptions during calculation of design-based sampling weights. In this paper, we discuss the development of a deep learning classification model to predict whether aerial images are residential or nonresidential, thus reducing manual labor and eliminating the need for simplifying assumptions.Entities:
Keywords: Clustering; Complex sample design; Deep learning; GIS; Machine learning; Probability based; Remote sensing; Scene classification
Mesh:
Year: 2018 PMID: 29743081 PMCID: PMC5944062 DOI: 10.1186/s12942-018-0132-1
Source DB: PubMed Journal: Int J Health Geogr ISSN: 1476-072X Impact factor: 3.918
Fig. 1Overview of an example multistage geosampling design outside of Kampala, Uganda
Count of images by grid area sizes in the Nigeria and Guatemala data sets
| Type | Size | Nigeria | Guatemala |
|---|---|---|---|
| PGC | 1 km × 1 km | 71 | 6 |
| SGC | 50 m × 50 m | 410 | 1200 |
| SGC | 100 m × 100 m | 3900 | 300 |
| SGC | 150 m × 150 m | 1044 | 0 |
Fig. 2a Nigeria PGC Image (1 km × 1 km). b Guatemala PGC Image (1 km × 1 km)
Fig. 3a Nigeria SGC Image (50 m × 50 m). b Guatemala SGC Image (100 m × 100 m)
Fig. 4a Example SGC residential scene (100 m × 100 m). b Example SGC nonresidential scene (100 m × 100 m)
Training and test data set allocation for Nigeria and Guatemala
| Nigeria | Guatemala | |
|---|---|---|
| Training set | 4550 | 1275 |
| Residential | 1676 | 417 |
| Nonresidential | 2874 | 858 |
| Test set | 800 | 225 |
| Residential | 295 | 73 |
| Nonresidential | 505 | 152 |
| Total | 5350 | 1500 |
Fig. 5Workflow diagram of modeling approach
Fig. 6Network diagram of baseline CNN
GIS derived OSM + ESA variables
| Variable name | Type | Number | Description |
|---|---|---|---|
| ContainBuildings | Binary | 1 | Whether an SGC contains an OSM building polygon |
| SemiFitBuild | Binary | 1 | Whether an SGC contains a semi-filtered OSM building polygon |
| AnyRoad | Binary | 1 | Whether the SGC intersects any OSM road |
| ResRoad | Binary | 1 | Whether the SGC intersects any OSM road labelled residential |
| ResPlusUnRoad | Binary | 1 | Whether the SGC intersects any OSM road labelled residential or unlabeled |
| Glob2015_MajLC | Categorical | 38 | ESA land-cover categories, ranging from “cropland” to “permanent snow and ice” |
Model evaluation metrics for the Nigeria and Guatemala test sets
| Model | Type | Acc. | Prec. | Recall | F1 |
|---|---|---|---|---|---|
| Nigeria | |||||
| Baseline CNN | Deep | 88.9% | 89.2% | 88.9% | 89.0% |
| VGG16 with ImageNet weights | Deep | 93.4% | 93.4% | 93.4% | 93.3% |
| InceptionV3 with ImageNet weights | Deep | 93.6% | 93.6% | 93.6% | 93.6% |
| VGG16 and InceptionV3 ensemble | Deep | 94.5% | 94.5% | 94.5% | 94.5% |
| Decision Tree | Shallow | 80.3% | 80.9% | 80.3% | 78.9% |
| Gradient Boosting | Shallow | 80.3% | 80.9% | 80.3% | 79.0% |
| AdaBoost | Shallow | 80.6% | 81.8% | 80.6% | 79.2% |
| Random forest | Shallow | 80.1% | 80.7% | 80.1% | 78.8% |
| Logistic regression | Shallow | 80.6% | 81.8% | 80.6% | 79.2% |
| Support vector machine | Shallow | 79.9% | 81.5% | 79.9% | 78.1% |
| K-nearest neighbors | Shallow | 75.6% | 81.3% | 75.6% | 71.3% |
| Human benchmark | Human | 91.0%* | – | – | – |
| Guatemala | |||||
| Baseline CNN | Deep | 93.3% | 93.3% | 93.3% | 93.3% |
| VGG16 with ImageNet weights | Deep | 96.4% | 96.7% | 96.4% | 96.5% |
| Inception V3 with ImageNet weights | Deep | 95.6% | 95.9% | 95.6% | 95.6% |
| VGG16 and InceptionV3 ensemble | Deep | 96.4% | 96.7% | 96.4% | 96.5% |
| Decision tree | Shallow | 93.8% | 94.1% | 93.8% | 93.8% |
| Gradient boosting | Shallow | 93.8% | 94.1% | 93.8% | 93.8% |
| AdaBoost | Shallow | 92.9% | 93.1% | 92.9% | 93.0% |
| Random forest | Shallow | 93.8% | 94.1% | 93.8% | 93.8% |
| Logistic regression | Shallow | 93.8% | 94.1% | 93.8% | 93.8% |
| Support vector machine | Shallow | 93.8% | 94.6% | 93.8% | 93.9% |
| K-nearest neighbors | Shallow | 92.4% | 93.7% | 92.4% | 92.6% |
| Human benchmark | Human | 97.1%* | – | – | – |
*Raw agreement between two independent coders
Test set accuracy by SGC grid size
| SGC size | Nigeria | Guatemala | ||
|---|---|---|---|---|
| Count | Accuracy | Count | Accuracy | |
| 50 × 50 m | 74 | 98.65% | 179 | 97.21% |
| 100 × 100 m | 571 | 93.52% | 46 | 93.48% |
| 150 × 150 m | 155 | 95.48% | 0 | – |
| Test for equality of proportions | ||||
Test set accuracy by image source
| Image source | Nigeria | Guatemala | ||
|---|---|---|---|---|
| Count | Accuracy | Count | Accuracy | |
| 643 | 94.25% | 179 | 97.21% | |
| Bing | 157 | 94.90% | 46 | 93.48% |
| Test for equality of proportions | ||||
Fig. 7Learning curves for Nigeria and Guatemala
Residential scene classification accuracy across studies using deep CNNs transfer learning models
| References | Scene class | Dataset | # Classes | Accuracy (%) | Relative scene accuracy ranking |
|---|---|---|---|---|---|
| Hu et al. [ | Sparse Residential | UC Merced | 21 | 85 | 19 of 21* |
| Med. Residential | UC Merced | 21 | 85 | 19 of 21* | |
| Dense Residential | UC Merced | 21 | 90 | 17 of 21** | |
| Han et al. [ | Sparse Residential | UC Merced | 21 | 95 | 12 of 21*** |
| Med. Residential | UC Merced | 21 | 90 | 19 of 21**** | |
| Dense Residential | UC Merced | 21 | 85 | 21 of 21 | |
| Residential | SIRI-WHU | 12 | 93 | 10 of 12***** | |
| Residential | WHU-RS | 19 | 88 | 19 of 19 | |
| Chew et al. (in this study) | Residential | Nigeria | 2 | 94.5 | NA |
| Residential | Guatemala | 2 | 96.4 | NA |
*Medium and Sparse residential tied for 19th/20th place
**Tied with “intersection” for 17th/18th place
***Tied with seven other classes for 12th–18th place
****Tied with “storage tank” for 19th/20th place
*****Tied with “idle land” for 10th/11th place