| Literature DB >> 32917890 |
Hylke E Beck1, Seth Westra2, Jackson Tan3,4, Florian Pappenberger5, George J Huffman6, Tim R McVicar7,8, Gaby J Gründemann9, Noemi Vergopolan10, Hayley J Fowler11, Elizabeth Lewis11, Koen Verbist12, Eric F Wood10.
Abstract
We introduce the Precipitation Probability DISTribution (PPDIST) dataset, a collection of global high-resolution (0.1°) observation-based climatologies (1979-2018) of the occurrence and peak intensity of precipitation (P) at daily and 3-hourly time-scales. The climatologies were produced using neural networks trained with daily P observations from 93,138 gauges and hourly P observations (resampled to 3-hourly) from 11,881 gauges worldwide. Mean validation coefficient of determination (R2) values ranged from 0.76 to 0.80 for the daily P occurrence indices, and from 0.44 to 0.84 for the daily peak P intensity indices. The neural networks performed significantly better than current state-of-the-art reanalysis (ERA5) and satellite (IMERG) products for all P indices. Using a 0.1 mm 3 h-1 threshold, P was estimated to occur 12.2%, 7.4%, and 14.3% of the time, on average, over the global, land, and ocean domains, respectively. The highest P intensities were found over parts of Central America, India, and Southeast Asia, along the western equatorial coast of Africa, and in the intertropical convergence zone. The PPDIST dataset is available via www.gloh2o.org/ppdist .Entities:
Year: 2020 PMID: 32917890 PMCID: PMC7486373 DOI: 10.1038/s41597-020-00631-x
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1The >0.5 mm d−1 daily P occurrence according to (a) the gauge observations, (b) the PPDIST dataset, (c) the ERA5 reanalysis, and (d) the IMERG satellite product. Supplement Fig. S2 presents an equivalent figure for the >0.1 mm 3 h−1 3-hourly P occurrence. The other PPDIST P occurrence indices can be viewed by accessing the dataset.
Fig. 2The 15-year return-period daily P intensity according to (a) the gauge observations, (b) the PPDIST dataset, (c) the ERA5 reanalysis, and (d) the IMERG satellite product. IMERG has data gaps at high latitudes (>60°N/S) precluding the calculation of the 15-year return-period daily P intensity. Supplement Fig. S3 presents an equivalent figure for the 15-year return-period 3-hourly P intensity. The other PPDIST P intensity indices can be viewed by accessing the dataset.
Predictors used in the neural networks to estimate the P occurrence and peak P intensity indices.
| Predictor | Description (units | Resolution | Data source | Used for daily | Used for 3-hourly |
|---|---|---|---|---|---|
| CAPE | Convective available potential energy (J kg−1) | 0.28° | ERA5-HRES reanalysis (mean over 1979–2018)[ | ✓ | ✓ |
| CldCov | Cloud cover frequency (%) | 1 km | Wilson | ✓ | ✓ |
| Elev | Square-root transformed surface elevation smoothed using 10-km filter (—) | 90 m | MERIT[ | ✓ | ✓ |
| MAP1a | Square-root-transformed mean annual precipitation (—) from WorldClim V2 | 1 km (land); 0.28° (ocean) | WorldClim V2[ | ✓ | ✓ |
| MAP2a | Difference in square-root-transformed mean annual precipitation (—) between WorldClim V2 and CHPclim V2 | 0.05° (land); 0.28° (ocean) | CHPclim V1[ | ✓ | ✓ |
| MAT | Mean annual air temperature (°C) | 1 km (land); 0.28° (ocean) | See MAP1 | ✓ | ✓ |
| Snow Frac | Long-term fraction of total | 1 km (land); 0.28° (ocean) | See MAP1 | ✓ | ✓ |
| Lat | Absolute geographical latitude (°) | 0.1° | — | ✓ | ✓ |
| ERA5b | ERA5-HRES (1979–2018) reanalysis | 0.28° | Hersbach | ✓ | ✓ |
| IMERGb | IMERG Late run (IMERGHHL) V06 (2000–2018) satellite-based | 0.1° | Huffman | ✓ | ✓ |
| Daily PPDIST estimates of the | 0.1° | The PPDIST dataset derived in this study ( | ✗ | ✓ |
aTwo mean annual P climatologies were used to account the uncertainty in mean annual P estimates.
bThe ERA5 and IMERG predictors represent estimates of the P index subject of the estimation.
cDaily PPDIST estimates of the P index subject of the estimation were used as predictor for the 3-hourly estimates.
Fig. 3Performance of the PPDIST dataset, the ERA5 reanalysis, and the IMERG satellite product in estimating the daily P occurrence and peak P intensity indices. For the PPDIST dataset, we calculated the mean of the ten validation scores (one for each cross-validation iteration). The training scores were not shown as they were nearly identical to the validation scores. All scores were calculated using square-root-transformed observed and estimated values. Supplement Fig. S4 presents an equivalent figure for the 3-hourly P indices.
Fig. 4PPDIST uncertainty estimates for (a) the >0.5 mm d−1 daily P occurrence and (b) the 15-year return-period daily P intensity. The uncertainty represents the spread of the ten cross-validation iterations. Supplement Fig. S5 presents an equivalent figure for the >0.1 mm 3 h−1 P occurrence and the 15-year return-period 3-hourly P intensity.
| Measurement(s) | hydrological precipitation process • precipitation occurrence • precipitation intensity |
| Technology Type(s) | Neural networks models • Gauge or Meter Device • satellite imaging of a planet • reanalyses |
| Factor Type(s) | daily and 3-hourly time-scales • year of data collection |
| Sample Characteristic - Environment | climate system |
| Sample Characteristic - Location | Earth (planet) |