| Literature DB >> 36241886 |
Mingxing Chen1,2, Yue Xian1,2, Yaohuan Huang1, Xiaoping Zhang2, Maogui Hu3, Shasha Guo1,2,4, Liangkan Chen1,2, Longwu Liang1,2.
Abstract
Accurate location-based big data has a high resolution and a direct interaction with human activities, allowing for fine-scale population spatial data to be realized. We take the average of Tencent user location big data as a measure of ambient population. The county-level statistical population data in 2018 was used as the assigned input data. The log linear spatially weighted regression model was used to establish the relationship between location data and statistical data to allocate the latter to a 0.01° grid, and the ambient population data of mainland China was obtained. Extracting street-level (lower than county-level) statistics for accuracy testing, we found that POP2018 has the best fit with the actual permanent population (R2 = 0.91), and the error is the smallest (MSEPOP2018 = 22.48 <MSEWorldPop = 37.24 <MSELandScan = 100.91). This research supplemented in the refined spatial distribution data of people between census years, as well as presenting the application technique of big data in ambient population estimation and zoning mapping.Entities:
Mesh:
Year: 2022 PMID: 36241886 PMCID: PMC9568591 DOI: 10.1038/s41597-022-01740-5
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1The research and production framework of population spatial distribution map.
Fig. 2County-level permanent population (a) and Tencent positioning data (b).
Fig. 3County-level statistical population and Tencent location number (a) and their logarithmic (b) kernel density plots.
Fig. 4Geographically weighted regression fit residuals in counties (a) and local R² (b).
Categories of data used to fit the model and evaluate the accuracy of the new population density map.
| Dataset | Format | Source | Reference link |
|---|---|---|---|
| National Population Sample Survey data (2018) | Excel | National Bureau of Statistics | |
| Tencent user location big data | GEOTIFF | Tencent Cloud | The real-time data application can be accessed at |
| Boundary maps | Polygon features | National Catalogue Service for Geographic Information | |
| LandScan (2018) | Raster | Oak Ridge National Laboratory | |
| WorldPop China Mainland (2018) | Raster | WorldPop, School of Geography and Environmental Science, University of Southampton | 10.5258/SOTON/WP00674[ |
Fig. 50.01° resolution spatial population data for 2018 across mainland China (POP2018 dataset).
Fig. 6Estimated population spatial distribution in cities with population of more than 5 million.
Fig. 7Scatter plot of POP2018 (a), LandScan (b) and WorldPop (c) and Dongguan township statistical population.
Fig. 8Population distribution of POP2018 (a), WorldPop (b) and LandScan (c) in the three cities of Huangshi, Xi’an and Shanghai.
| Measurement(s) | population |
| Technology Type(s) | location-based big data |
| Factor Type(s) | spatial region |
| Sample Characteristic - Environment | spatial region |
| Sample Characteristic - Location | China |