| Literature DB >> 31296224 |
Roger Hillson1, Austin Coates2, Joel D Alejandre3, Kathryn H Jacobsen4, Rashid Ansumana5,6, Alfred S Bockarie5,6, Umaru Bangura6, Joseph M Lamin6, David A Stenger7.
Abstract
BACKGROUND: This is the third paper in a 3-paper series evaluating alternative models for rapidly estimating neighborhood populations using limited survey data, augmented with aerial imagery.Entities:
Keywords: Landsat; MCMC; Population Estimation; Remote Sensing; Sierra Leone; Spatial Epidemiology; urban footprint
Mesh:
Year: 2019 PMID: 31296224 PMCID: PMC6625010 DOI: 10.1186/s12942-019-0180-1
Source DB: PubMed Journal: Int J Health Geogr ISSN: 1476-072X Impact factor: 3.918
Bo municipal survey data
| (1) Section | (2) Persons | (3) Area (km | (4) Persons/area | (5) Residential structures | (6) Total structures |
|---|---|---|---|---|---|
| Moibawo Farm | 135 | 0.5 | 270.04 | 17 | 43 |
| Roma | 139 | 0.04 | 3510.72 | 4 | 52 |
| Bo Central | 273 | 0.066 | 4137.9 | 33 | 103 |
| Toubu | 454 | 0.016 | 28089.66 | 34 | 46 |
| Salina | 580 | 0.467 | 1242.8 | 59 | 231 |
| Dodo | 597 | 0.049 | 12126.22 | 26 | 88 |
| Reservation | 637 | 2.329 | 273.5 | 66 | 252 |
| Kpetewoma | 640 | 0.197 | 3250.74 | 46 | 105 |
| Lewabu | 879 | 0.479 | 1836.16 | 105 | 117 |
| Tengbewabu | 1068 | 0.68 | 1571.17 | 136 | 233 |
| New York | 1088 | 1.513 | 719.3 | 116 | 605 |
| Komende | 1103 | 0.196 | 5622.16 | 56 | 258 |
| Kindia Town | 1160 | 0.146 | 7972.46 | 102 | 278 |
| New Site | 1248 | 0.686 | 1818.17 | 136 | 194 |
| Yemoh Town | 1858 | 0.404 | 4602.47 | 152 | 284 |
| Njai Town | 2298 | 0.216 | 10641.33 | 127 | 269 |
| Kissi Town | 2490 | 0.196 | 12709.81 | 154 | 287 |
| Nduvuibu | 2552 | 0.493 | 5177.21 | 205 | 343 |
| New London | 2873 | 0.597 | 4813.01 | 208 | 495 |
| Kulanda Town | 3882 | 0.294 | 13216.15 | 197 | 314 |
| TOTAL | 25,954 | – | – | 1979 | 4597 |
Residential and household survey data for 20 municipal sections of Bo, ordered by population, showing the persons per municipal section, section area, and the total number of residential and non-residential structures [2, 3]
The min, mid and max values
Fig. 1Bo sections ordered by population density. The 20 surveyed sections of Bo City ordered by population density . The inset in the upper left includes a map of Africa, with Sierra Leone highlighted in red and Bo City in blue (Source: OCHA/ReliefWeb). The larger map shows Bo City, the survey location
Landsat 5 thematic mapper bands used in this study
| (1) Bands | (2) Wavelength ( | (3) Resolution (m) | (4) Applications |
|---|---|---|---|
| Band 1—blue | 0.45–0.52 | 30 | Because blue light is absorbed by chlorophyll, this band can be used to discriminate between vegetation, which will appear to be dark, and soil and/or roads and buildings, which are more reflective. Light in this band penetrates water deeply, which is useful for monitoring aquatic systems and for bathymetric mapping |
| Band 2—green | 0.52–0.60 | 30 | Green light is reflected strongly by chlorophyll, so this band is useful for appraising the state of vegetation. There will be high contrast between clear and turbid water, because light in this band penetrates water well, but with less scattering than in band 1 |
| Band 3—red | 0.63–0.69 | 30 | This is the chlorophyll absorption band, and live vegetation will appear to be dark. This band is useful for differentiating between live plants with chlorophyll, which absorb red light, and dead foliage, which does not |
| Band 4—near infrared (NIR) | 0.76–0.90 | 30 | Useful for delineating shorelines, because water is strongly absorbent in this band, while soil and vegetation reflect brightly. It is a good band for differentiating soil from crops, and for quantifying biomasses |
| Band 5—shortwave infrared (SWIR) 1 | 1.55–1.75 | 30 | This band is useful for measuring the moisture content of soil and vegetation. SWIR measurements can also help differentiate between snow on the ground, ice, and clouds in the air |
| Band 7—shortwave infrared (SWIR) 2 | 2.08–2.35 | 30 | Similar in utility to Band 5, but also useful for differentiating between different types of rock formations |
Landsat 5 thematic mapper band names, wavelengths, resolutions, and nominal domains. In this study, only measurements collected in Bands 1 through 5 and Band 7 were used. Band 6 measures thermal emissions, and was not used.
Fig. 2False-color near infrared (NIR) image of Bo city. Composite images are created by assigning the visible colors red (R), green (G), and blue (B) to TM greyscale bands [49]. The image shown is near infrared (NIR): [4,3,2]. Vegetation appears bright red in NIR images because near infrared band 4 is assigned to the color red, and chlorophyll is a good reflector of infrared
Moibawo normalized pixel amplitude distribution
Moibawo is approximately 0.5 km. In each of the 6 bands utilized, about 1000 pixels were scanned. This figure shows the magnitude and grid locations of the normalized pixel values (scaled from 0.0 to 100.0 for readability) measured in Band 3. Compare the shape of the non-zero elements in the above table to the outline of Moibawo in Fig. 1
Moibawo normalized pixel amplitude distribution
New London section is approximately 0.5 km. In each of the 6 bands utilized, about 1050 pixels were scanned. This figure shows the magnitude and grid locations of the normalized pixel values (scaled from 0.0 to 100.0 for readability) measured in Band 3. The matrix of pixel values is rotated counterclockwise. Compare the shape of the non-zero elements in the above table to the outline of New London in Fig. 1
Landsat 5 thematic mapper candidate covariates
| Variables | Covariate subset list | Variable definition | Number of variables | Non-spectral mean values | Spectral (pixel-level) transform |
|---|---|---|---|---|---|
|
| |||||
| B | Mean value of Landsat 5 thematic mapper | 6 | * | ||
| Bs | SD of | 6 | * | ||
| Bv | Variance of | 6 | * | ||
| Bc | Coefficent of Variation [CV]: | 6 | * | ||
| S | Square of | 6 | * | ||
| P | Non-spectral cross products of the means | 15 | * | ||
| R.re | Non-spectral ratios of means | 15 | * | ||
| D | Non-spectral ratio of the difference-to-the-sum of the mean value | 15 | * | ||
|
| |||||
| NB | Mean value of the min-max normalized | 6 | * | ||
| NBs | SD of | 6 | * | ||
| NBv | Variance of | 6 | * | ||
| NBc | Coefficent of Variation [CV]: | 6 | * | ||
| R | 15 | * | |||
| Rs | SD of | 15 | * | ||
| Rv | variance of | 15 | * | ||
| Rc | Coefficient of Variance [CV] of | 15 | * | ||
| DS | 15 | * | |||
| DSs | 15 | * | |||
| DSv | 15 | * | |||
| DSc | 15 | * | |||
| CH | Cylindrical transform of composite | 20 | * | ||
| CHs | 20 | * | |||
| CHv | 20 | * | |||
| CHc | 20 | * | |||
| RH | Rectangular transform transform of composite | 20 | * | ||
| RHs | 20 | * | |||
| RHv | 20 | * | |||
| RHc | 20 | * | |||
| Total variables: | 379 | 75 | 304 | ||
A summary of the 379 Landsat 5 thematic mapper variables calculated for this study. Only measurements collected in Bands 1 through 5 and Band 7 are used
Bo municipal survey population density data and abbreviated Landsat band data, tabulated by section
| (1) Section | (2) Persons/km2 | (3) b1 | (4) b1c | (5) – | (6)[col 380] s5 | (7) [col. 381] s7 |
|---|---|---|---|---|---|---|
| Moibawo farm | 135 | 0.04988 | 0.11564 | – | 0.04902 | 0.01545 |
| Roma | 139 | 0.0661 | 0.06879 | – | 0.0497 | 0.02889 |
| Bo Central | 273 | 0.07276 | 0.09094 | – | 0.04855 | 0.03639 |
| Toubu | 454 | 0.0657 | 0.11885 | – | 0.04008 | 0.02383 |
| Salina | 580 | 0.05845 | 0.14618 | – | 0.05418 | 0.02302 |
| Dodo | 597 | 0.06779 | 0.06786 | – | 0.04359 | 0.03019 |
| Reservation | 637 | 0.05072 | 0.13245 | – | 0.0442 | 0.01411 |
| Kpetewoma | 640 | 0.05775 | 0.10814 | – | 0.04936 | 0.0209 |
| Lewabu | 879 | 0.05604 | 0.14217 | – | 0.04967 | 0.01897 |
| Tengbewabu | 1068 | 0.05798 | 0.13686 | – | 0.04854 | 0.01896 |
| New York | 1088 | 0.05675 | 0.1109 | – | 0.05054 | 0.01932 |
| Komende | 1103 | 0.06226 | 0.10095 | – | 0.04478 | 0.02323 |
| Kindia Town | 1160 | 0.05992 | 0.07469 | – | 0.04318 | 0.0223 |
| New Site | 1248 | 0.05861 | 0.14711 | – | 0.05713 | 0.02422 |
| Yemoh Town | 1858 | 0.05627 | 0.1299 | – | 0.03849 | 0.01577 |
| Njai Town | 2298 | 0.0587 | 0.10871 | – | 0.04057 | 0.01859 |
| Kissi Town | 2490 | 0.06258 | 0.08847 | – | 0.04161 | 0.02462 |
| Nduvuibu | 2552 | 0.05408 | 0.10372 | – | 0.03665 | 0.01464 |
| New London | 2873 | 0.05782 | 0.12842 | – | 0.04221 | 0.01777 |
| Kulanda Town | 3882 | 0.05779 | 0.11445 | – | 0.04022 | 0.01746 |
Table entries are ordered by population. The section (municipality) name is in column 1, and the population density d = persons/km is in column 2. See [2, 3]. Appended to the first 2 columns are representative Landsat band filter measurements 1–2 and 378–379 (see text). The measured number of persons/km in the ith section (i.e. ) is the dependent variable to be estimated
This table summarizes the number of candidate covariates retained at each stage of model development
| (1) Step | (2) Number of covariates |
|---|---|
| Initial data set | 379 |
| Use spectral transforms only | 304 |
| Remove redundant highly-correlated variables | 159 |
| MCMC best regression model | 6 |
Fig. 3Back-transformed estimators of the and of the population density . (a) No transform (b) back-transform of (c) back-transform of
The best regression model found by the MCMC sampler
| (1) Covariates | (2) Estimated coefficient | (3) Std. Error | (4) t value | (5) Pr( | (6) Max. Prob. | (7) VIF | (8) PIP |
|---|---|---|---|---|---|---|---|
| (intercept) | − 174.02 | 20.16 | − 8.63 | 9.63E−7 | < .0001 | – | – |
| nb7v | − 1656.91 | 85.58 | − 19.36 | 5.72E−11 | < .0001 | 6.29 | 0.9837 |
| r_sp37 | 532.23 | 31.27 | 17.02 | 2.88E−10 | < .0001 | 1.34 | 0.9790 |
| nb1v | 1686.14 | 71.28 | 23.66 | 4.52E−12 | < .0001 | 5.36 | 0.9500 |
| r_sp15s | − 2744.29 | 159.21 | − 17.24 | 2.46E−10 | < .0001 | 1.08 | 0.4711 |
| ch245c | 44775.32 | 2636.73 | 16.98 | 2.96E−10 | < .0001 | 2.72 | 0.9835 |
| r_sp14c | − 246.75 | 23.79 | − 10.37 | 1.18E−07 | < .0001 | 1.55 | 0.5381 |
This table summarizes the best regression equation returned by the MCMC sampler for the estimation of . The values of the variance inflation factor (VIF) are less than 7.0, which demonstrates the low collinearity between the covariates. Four of Posterior Inclusion Probabilities (PIPs) are close to 1.0, quantifying their importance as predictive variables of , as discussed in the text
Measured and estimated values of population and population density
| (1) Name | 2) p = Population | (3) Measured area in km | (4) d = Persons per km | (5) Estimated | (6) Estimated | (7) |
|---|---|---|---|---|---|---|
| Moibawo Farm | 135 | 0.5 | 270 | 206.13 | 103 | − 24 |
| Roma | 139 | 0.04 | 3475 | 3818.48 | 153 | 10 |
| Bo Central | 273 | 0.066 | 4136.36 | 3753.27 | 248 | − 9 |
| Toubu | 454 | 0.016 | 28375 | 27852.13 | 446 | − 2 |
| Salina | 580 | 0.467 | 1241.97 | 1164.25 | 544 | − 6 |
| Dodo | 597 | 0.049 | 12183.67 | 12234.85 | 600 | 0 |
| Reservation | 637 | 2.329 | 273.51 | 282.25 | 657 | 3 |
| Kpetewoma | 640 | 0.197 | 3248.73 | 2918.91 | 575 | − 10 |
| Lewabu | 879 | 0.479 | 1835.07 | 2100.07 | 1006 | 14 |
| Tengbewabu | 1068 | 0.68 | 1570.59 | 1770.75 | 1204 | 13 |
| New York | 1088 | 1.513 | 719.1 | 585.41 | 886 | − 19 |
| Komende | 1103 | 0.196 | 5627.55 | 5780.18 | 1133 | 3 |
| Kindia Town | 1160 | 0.146 | 7945.21 | 8806.6 | 1286 | 11 |
| New Site | 1248 | 0.686 | 1819.24 | 2061.33 | 1414 | 13 |
| Yemoh Town | 1858 | 0.404 | 4599.01 | 5379.96 | 2174 | 17 |
| Njai Town | 2298 | 0.216 | 10638.89 | 10524.21 | 2273 | − 1 |
| Kissi Town | 2490 | 0.196 | 12704.08 | 12479.6 | 2446 | − 2 |
| Nduvuibu | 2552 | 0.493 | 5176.47 | 4803.58 | 2368 | − 7 |
| New London | 2873 | 0.597 | 4812.4 | 4487.98 | 2679 | − 7 |
| Kulanda Town | 3882 | 0.294 | 13204.08 | 12452.93 | 3661 | − 6 |
This table lists the (1) section name (2) measured section population (3) measured section area in (4) population density (5) the regression-estimated population density (6) the estimated population/section . (7) the % Relative Error (RE) for the density estimation
Fig. 4Back-transformed regressive estimations of and . (a) The back-transformed estimates of the square root of the population density , shown with back-transformed 95% confidence intervals. (b) The estimated section populations
Fig. 5The Relative Errors REs for the back-transformed estimate and cross-validation trials. (a) The absolute value of the RE (% Relative Error) for the back-transformed estimate (b) The absolute value of the RE for the 20 “Leave-one-out cross-validation” (LOOCV) trials
This table summarizes the results from the k − 1 ’LOOCV’ cross-validation analysis
| Name | (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) | (10) |
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| |
| Roma | 139 | 0.04 | 3510.72 | 59.25 | 70.39 | 4954.9 | 11.14 | 18.8 | 1444.18 | 41.14 |
| Bo Central | 273 | 0.066 | 4137.9 | 64.33 | 60.44 | 3653.53 | 3.88 | 6.03 | 484.37 | − 11.71 |
| Toubu | 454 | 0.016 | 28089.66 | 167.6 | 160.05 | 25617.12 | 7.55 | 4.5 | 2472.54 | − 8.80 |
| Salina | 580 | 0.467 | 1242.8 | 35.25 | 33.94 | 1151.93 | 1.31 | 3.73 | 90.87 | − 7.31 |
| Dodo | 597 | 0.049 | 12126.22 | 110.12 | 111.22 | 12370.87 | 1.11 | 1 | 244.65 | 2.02 |
| Reservation | 637 | 2.329 | 273.5 | 16.54 | 16.91 | 285.8 | 0.37 | 2.22 | 12.3 | 4.50 |
| Kpetewoma | 640 | 0.197 | 3250.74 | 57.02 | 52.63 | 2769.82 | 4.39 | 7.69 | 480.92 | − 14.79 |
| Lewabu | 879 | 0.479 | 1836.16 | 42.85 | 46.54 | 2166.06 | 3.69 | 8.61 | 329.9 | 17.97 |
| Tengbewabu | 1068 | 0.68 | 1571.17 | 39.64 | 42.41 | 1798.53 | 2.77 | 6.99 | 227.36 | 14.47 |
| New York | 1088 | 1.513 | 719.3 | 26.82 | 23.01 | 529.41 | 3.81 | 14.21 | 189.89 | − 26.40 |
| Komende | 1103 | 0.196 | 5622.16 | 74.98 | 76.52 | 5856.04 | 1.54 | 2.06 | 233.88 | 4.16 |
| Kindia Town | 1160 | 0.146 | 7972.46 | 89.29 | 94.36 | 8903.89 | 5.07 | 5.68 | 931.43 | 11.68 |
| New Site | 1248 | 0.686 | 1818.17 | 42.64 | 47.27 | 2234.43 | 4.63 | 10.86 | 416.26 | 22.89 |
| Yemoh Town | 1858 | 0.404 | 4602.47 | 67.84 | 75.01 | 5626.04 | 7.17 | 10.56 | 1023.57 | 22.24 |
| Median (abs) | 1078.00 | 0.35 | 4370.19 | 66.09 | 67.11 | 4504.31 | 3.85 | 5.57 | 424.25 | 11.15 |
| Mean (abs) | 1297.70 | 0.48 | 6180.05 | 69.14 | 69.00 | 6098.19 | 3.89 | 7.41 | 568.21 | 14.58 |
For each section, col (1) is the section population, Col (2) is the area in , and column (3) is the population density . Col (4) is the transform , col (5) is the estimate , and (6) is the back-transformed estimate of . Cols (7) and (8) are the error (Er) and RE) of the estimated variable ; and cols (9) and (10) are the corresponding error functions for the back-transformed estimate of d
Measured and estimated values of population and population density using EBL estimator
| y(1) Name | (2) p = Population | (3) Measured area in km | (4) d = Persons per km | (5) Estimated | (6) Estimated | (7) |
|---|---|---|---|---|---|---|
| Moibawo Farm | 135 | 0.5 | 270.04 | 203.5 | 101.75 | − 24.64 |
| Roma | 139 | 0.04 | 3510.72 | 3353.39 | 134.1356 | − 4.48 |
| Bo Central | 273 | 0.066 | 4137.9 | 4224.7 | 278.8302 | 2.10 |
| Toubu | 454 | 0.016 | 28089.66 | 27109.01 | 433.7442 | − 3.49 |
| Salina | 580 | 0.467 | 1242.8 | 1537.59 | 718.0545 | 23.72 |
| Dodo | 597 | 0.049 | 12126.22 | 12417.3 | 608.4477 | 2.40 |
| Reservation | 637 | 2.329 | 273.5 | 188 | 437.852 | − 31.26 |
| Kpetewoma | 640 | 0.197 | 3250.74 | 3374.54 | 664.7844 | 3.81 |
| Lewabu | 879 | 0.479 | 1836.16 | 2057.56 | 985.5712 | 12.06 |
| Tengbewabu | 1068 | 0.68 | 1571.17 | 1557.68 | 1059.222 | − 0.86 |
| New York | 1088 | 1.513 | 719.3 | 604.88 | 915.1834 | − 15.91 |
| Komende | 1103 | 0.196 | 5622.16 | 5924.81 | 1161.263 | 5.38 |
| Kindia Town | 1160 | 0.146 | 7972.46 | 8357.53 | 1220.199 | 4.83 |
| New Site | 1248 | 0.686 | 1818.17 | 1815.95 | 1245.742 | − 0.12 |
| Yemoh Town | 1858 | 0.404 | 4602.47 | 4455.86 | 1800.167 | − 3.19 |
| Njai Town | 2298 | 0.216 | 10641.33 | 10061.91 | 2173.373 | − 5.44 |
| Kissi Town | 2490 | 0.196 | 12709.81 | 12463.32 | 2442.811 | − 1.94 |
| Nduvuibu | 2552 | 0.493 | 5177.21 | 5661.29 | 2791.016 | 9.35 |
| New London | 2873 | 0.597 | 4813.01 | 4308.02 | 2571.888 | − 10.49 |
| Kulanda Town | 3882 | 0.294 | 13216.15 | 13149.33 | 3865.903 | − 0.51 |
| Mean | 1297.7 | 0.48 | 6180.05 | 6141.31 | 2947.829 | − 0.63 |
| Median | 1078 | 0.35 | 4370.18 | 4266.36 | 1493.226 | − 2.38 |
This table lists the (1) section name (2) measured section population (3) measured section area in (4) population density (5) the regression-estimated population density (6) the estimated population/section . (7) the Relative Error (RE) for the density estimation
The best regression model found by the MCMC sampler for the EBL estimator
| (1) Covariates | (2) Estimated coefficient | (3) Std. error | (4) t value | (5) Pr( | (6) Max. Prob. | (7) VIF | (8) PIP |
|---|---|---|---|---|---|---|---|
| intercept | 129.60 | 6.17 | 21.00 | 0.00 | < .001 | – | – |
| nb3s | 1076.00 | 28.09 | 33.32 | 0.00 | < .001 | 8.97 | 0.31 |
| nb7v | − 1882.00 | 56.70 | − 33.20 | 0.00 | < .001 | 7.39 | 0.48 |
| nb7c | 61.42 | 4.95 | 12.39 | 0.00 | < .001 | 1.25 | 0.15 |
| ds15s | − 2112.00 | 109.10 | − 19.36 | 0.00 | < .001 | 2.17 | 0.22 |
| ds35c | 239.90 | 40.13 | 5.98 | 0.00 | < .001 | 2.58 | 0.10 |
| ch127 | 0.57 | 0.02 | − 26.76 | 0.00 | < .001 | 2.19 | 0.23 |
| ch357 | − 0.57 | 0.02 | − 31.22 | 0.00 | < .001 | 1.50 | 0.24 |
This table summarizes the best regression equation returned by the MCMC sampler for the estimation of . The values of the variance inflation factor (VIF) are less than 7.0, which demonstrates the low collinearity between the covariates. Four of Posterior Inclusion Probabilities (PIPs) are close to 1.0, quantifying their importance as predictive variables of , as discussed in the text.
Fig. 6The Relative Errors REs for the back-transformed estimate and cross-validation trials using a Local Empirical Bayesian estimator. The same results shown in Fig. 5a, b, except a Local Empirical Bayesian (EBL) estimator was used