| Literature DB >> 32528065 |
Michael F Meyer1, Stephanie G Labou2,3, Alli N Cramer4, Matthew R Brousil2, Bradley T Luff4.
Abstract
An increasing population in conjunction with a changing climate necessitates a detailed understanding of water abundance at multiple spatial and temporal scales. Remote sensing has provided massive data volumes to track fluctuations in water quantity, yet contextualizing water abundance with other local, regional, and global trends remains challenging by often requiring large computational resources to combine multiple data sources into analytically-friendly formats. To bridge this gap and facilitate future freshwater research opportunities, we harmonized existing global datasets to create the Global Lake area, Climate, and Population (GLCP) dataset. The GLCP is a compilation of lake surface area for 1.42 + million lakes and reservoirs of at least 10 ha in size from 1995 to 2015 with co-located basin-level temperature, precipitation, and population data. The GLCP was created with FAIR (findable, accessible, interoperable, reusable) data principles in mind and retains unique identifiers from parent datasets to expedite interoperability. The GLCP offers critical data for basic and applied investigations of lake surface area and water quantity at local, regional, and global scales.Entities:
Year: 2020 PMID: 32528065 PMCID: PMC7289843 DOI: 10.1038/s41597-020-0517-4
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Number of HydroLAKES polygons matched with HydroBASINS polygons by Pfafstetter level.
| HydroBASIN Pfafstetter level | Number of lakes | Percent of valid lakes | Median basin size (km2) |
|---|---|---|---|
| 2 | 2595 | 0.18% | 2198593 |
| 3 | 6650 | 0.46% | 548263 |
| 4 | 8912 | 0.62% | 225882 |
| 5 | 15948 | 1.12% | 58705 |
| 6 | 23262 | 1.63% | 16967 |
| 7 | 41001 | 2.88% | 5042 |
| 8 | 55107 | 3.87% | 1632 |
| 9 | 55061 | 3.87% | 518 |
| 10 | 9747 | 0.69% | 353 |
| 11 | 196 | 0.01% | 333 |
| 12 | 1204020 | 84.64% | 156 |
Fig. 1Conceptual diagram of data harmonization process. JRC Water area was calculated in Google Earth Engine (GEE), and then aggregated locally within the R environment. Similarly, basin population was calculated on GEE, and then aggregated locally within the R environment. Climate data from MERRA-2, however, was first aggregated from hourly to monthly and annual values, then resampled for 1/10th of their original resolutions with a subsequent and raster clipping was performed within the R environment. We then imported the geotiff outputs to GEE to calculate basin climate variables. GEE output was then downloaded locally and aggregated within the R environment.
Ratios of JRC-derived total water area to water area as reported in HydroLAKES for five year intervals (N = 1,422,499 lakes).
| Buffer (m) | Year | Min. | 1st Qu. | Median | Mean | 3rd Qu. | Max. |
|---|---|---|---|---|---|---|---|
| 0.000 | 0.414 | 0.874 | 0.690 | 0.968 | 2.106 | ||
| 0.000 | 0.795 | 0.906 | 0.863 | 0.991 | 2.205 | ||
| 0.000 | 0.813 | 0.919 | 0.870 | 0.997 | 2.181 | ||
| 0.000 | 0.803 | 0.915 | 0.864 | 0.997 | 2.301 | ||
| 0.000 | 0.818 | 0.921 | 0.871 | 0.997 | 2.515 | ||
| 0.000 | 0.424 | 0.879 | 0.703 | 0.975 | 3.185 | ||
| 0.000 | 0.805 | 0.917 | 0.891 | 1.011 | 3.481 | ||
| 0.000 | 0.824 | 0.930 | 0.898 | 1.017 | 3.367 | ||
| 0.000 | 0.814 | 0.925 | 0.892 | 1.017 | 3.769 | ||
| 0.000 | 0.829 | 0.931 | 0.900 | 1.017 | 4.095 | ||
| 0.000 | 0.432 | 0.883 | 0.714 | 0.981 | 4.208 | ||
| 0.000 | 0.812 | 0.924 | 0.913 | 1.025 | 4.769 | ||
| 0.000 | 0.831 | 0.937 | 0.921 | 1.032 | 4.579 | ||
| 0.000 | 0.821 | 0.932 | 0.915 | 1.031 | 5.287 | ||
| 0.000 | 0.835 | 0.938 | 0.924 | 1.032 | 5.667 | ||
| 0.000 | 0.440 | 0.889 | 0.726 | 0.989 | 5.349 | ||
| 0.000 | 0.818 | 0.931 | 0.935 | 1.041 | 6.240 | ||
| 0.000 | 0.839 | 0.945 | 0.944 | 1.048 | 5.793 | ||
| 0.000 | 0.829 | 0.940 | 0.938 | 1.047 | 6.891 | ||
| 0.000 | 0.843 | 0.946 | 0.948 | 1.049 | 7.213 |
Values greater than one indicate observed values greater than expected, and values smaller than one indicate smaller than expected values.
Buffer comparisons for five year intervals with percent of lakes (N = 1,422,499) <1% increase, 1–5% increase, 5–10% increase, and >10% increase in total water surface area.
| Buffer:Buffer | Size Change | 1995 | 2000 | 2005 | 2010 | 2015 |
|---|---|---|---|---|---|---|
| 55.00% | 59.62% | 58.32% | 58.98% | 58.93% | ||
| 15.64% | 20.16% | 20.81% | 20.18% | 20.22% | ||
| 4.49% | 8.62% | 8.78% | 8.58% | 8.49% | ||
| 3.65% | 9.86% | 10.14% | 10.23% | 10.44% | ||
| 0.01% | 0.01% | 0.01% | 0.01% | 0.01% | ||
| 21.21% | 1.74% | 1.93% | 2.02% | 1.91% | ||
| 58.01% | 64.68% | 63.62% | 64.04% | 63.86% | ||
| 14.64% | 19.65% | 20.10% | 19.53% | 19.49% | ||
| 3.72% | 7.66% | 7.85% | 7.83% | 7.83% | ||
| 2.44% | 6.29% | 6.52% | 6.61% | 6.95% | ||
| 0.03% | 0.03% | 0.03% | 0.03% | 0.03% | ||
| 21.16% | 1.70% | 1.88% | 1.96% | 1.84% | ||
| 53.23% | 61.18% | 59.89% | 60.30% | 59.90% | ||
| 18.11% | 23.72% | 24.37% | 23.86% | 23.92% | ||
| 4.91% | 8.53% | 8.78% | 8.72% | 8.91% | ||
| 2.61% | 4.88% | 5.10% | 5.18% | 5.46% | ||
| 0.02% | 0.02% | 0.02% | 0.02% | 0.02% | ||
| 21.12% | 1.65% | 1.83% | 1.91% | 1.79% |
Fig. 2Comparison of 90 m buffer total water areas from JRC to reported lake areas in HydroLAKES. The grey diagonal line is the 1:1 line, where a value on the 1:1 grey line implies no difference between data sources. Color of hexbin indicates the number of lakes contained within a bin. The orange line is a linear regression for all points. The orange regression line tends to deviate more markedly for comparisons among the mid- to late-1990s. This is likely due to sparse LANDSAT coverage during the mid-to-late 1990s. Starting in 1999, though, the orange and grey lines are nearly identical, which would be expected as LANDSAT coverage in 1999 and 2000 incorporated more complete global coverage, particularly in Siberia and Greenland. Scales have been log-transformed in order to show a more even spread of data points, as most lakes are less than 10 km2.
Summary table (minimum, 1st quartile, median, mean, 3rd quartile, and maximum) of interannual percent difference for 250 spatially stratified lakes used for manual QA/QC.
| Variable | Min. | 1st Qu. | Median | Mean | 3rd Qu. | Max. |
|---|---|---|---|---|---|---|
| total_precip_mm | −98.64 | −13.38 | −0.62 | −0.41 | 11.85 | 95.77 |
| mean_monthly_precip_mm | −98.64 | −12.86 | −0.24 | 0.02 | 12.16 | 95.77 |
| mean_annual_temp_k | −1.74 | −0.34 | 0.02 | 0.02 | 0.36 | 1.78 |
| pop_sum | −200.00 | −42.27 | 11.88 | −42.08 | 2.17 | 198.98 |
| seasonal_km2 | −200.00 | −31.89 | 0.88 | 3.34 | 37.63 | 200.00 |
| permanent_km2 | −200.00 | −2.58 | 0.00 | 1.58 | 2.65 | 200.00 |
| total_km2 | −200.00 | −2.54 | 0.00 | 2.14 | 2.83 | 200.00 |
The range of percent differences was bounded between −200 and +200 percent, implying a year-to-year change in value from zero to non-zero.
| Measurement(s) | area of open water • temperature of air • volume of hydrological precipitation • population |
| Technology Type(s) | digital curation |
| Factor Type(s) | year • geographic location |
| Sample Characteristic - Environment | lake |