| Literature DB >> 36060709 |
Aniruddha Basak1, Kevin M Schmidt2, Ole Jakob Mengshoel3.
Abstract
Soil moisture is critical to agricultural business, ecosystem health, and certain hydrologically driven natural disasters. Monitoring data, though, is prone to instrumental noise, wide ranging extrema, and nonstationary response to rainfall where ground conditions change. Furthermore, existing soil moisture models generally forecast poorly for time periods greater than a few hours. To improve such forecasts, we introduce two data-driven models, the Naive Accumulative Representation (NAR) and the Additive Exponential Accumulative Representation (AEAR). Both of these models are rooted in deterministic, physically based hydrology, and we study their capabilities in forecasting soil moisture over time periods longer than a few hours. Learned model parameters represent the physically based unsaturated hydrological redistribution processes of gravity and suction. We validate our models using soil moisture and rainfall time series data collected from a steep gradient, post-wildfire site in southern California. Data analysis is complicated by rapid landscape change observed in steep, burned hillslopes in response to even small to moderate rain events. The proposed NAR and AEAR models are, in forecasting experiments, shown to be competitive with several established and state-of-the-art baselines. The AEAR model fits the data well for three distinct soil textures at variable depths below the ground surface (5, 15, and 30 cm). Similar robust results are demonstrated in controlled, laboratory-based experiments. Our AEAR model includes readily interpretable hydrologic parameters and provides more accurate forecasts than existing models for time horizons of 10-24 h. Such extended periods of warning for natural disasters, such as floods and landslides, provide actionable knowledge to reduce loss of life and property.Entities:
Keywords: Data analysis; Interpretable machine learning; Model optimization and fitting; Monitoring; Post-fire landslides; Soil moisture forecasting
Year: 2022 PMID: 36060709 PMCID: PMC9427440 DOI: 10.1007/s41060-022-00347-8
Source DB: PubMed Journal: Int J Data Sci Anal
Fig. 1Left: Destructive post-Thomas Fire debris flow impacts of January 9, 2018 in Montecito, California. Right: U.S. Geological Survey researcher responds to deadly January 9, 2018, debris flows in Montecito, California. Distributed rainfall runoff in the absence of vegetation allows readily erodible material to transport downslope gaining mass and velocity (momentum) through the channel network resulting in damaging and deadly debris flows. Photographs by USGS
Fig. 2Left: Photograph of a burned hillslope with rainfall, overland flow, and soil moisture monitoring instrumentation after the 2007 Canyon fire, but before rainfall, on the Pepperdine University Malibu, California campus. Right: Photograph from December 19, 2007, of a small post-fire debris flow and flood on the Pepperdine University campus following minor rainfall measured by USGS monitoring array to left. Photographs by USGS
Fig. 3Rainfall intensity (blue line) and soil moisture measurements at three depths below ground surface (black, green, and red lines at 5, 15, and 30 cm, respectively) from the Canyon Fire field monitoring site depicted in Fig. 2
Fig. 4Inputs, outputs, and steps of our soil moisture data analysis process; see Sect. 3 for further explanations
Summary of the six soil moisture datasets used in our analyses; see Sect. 3.1 for additional information
| Location | Depth (cm) | Dataset Id |
|---|---|---|
| Field: Canyon Fire | 5 | Canyon-5 |
| 15 | Canyon-15 | |
| 30 | Canyon-30 | |
| Controlled experiment: Bucket | 10 | Bucket-10 |
| 20 | Bucket-20 | |
| 28 | Bucket-28 |
Fig. 5Single-sided amplitude spectrum of 5 cm soil moisture data from the Canyon Fire study area
Fig. 6Two different forecasts at , namely = , illustrating the difference between the regular and irregular approaches. For clarity, we are showing regular observed measurements (model inputs, blue circles) for both cases, even though they are only present for forecasting purposes in the regular case. Left: Forecast for (model output, green rectangle) using regular measurement = 0.213. Right: Forecast for (model output, red rectangle) using an irregular “pseudo-measurement” = 0.185
Fig. 7Input hyperparameters of STL’s R implementation as used in experiments, see Sect. 5.2
Hyperparameter settings for stochastic optimization and statistical parameter tuning algorithms
| Parameter | Algorithm | |||
|---|---|---|---|---|
| RGA | GenSA | DE | IRACE | |
| Population size ( | 50 | 1 | 50 | 1 |
| Num. generations ( | 100 | 5000 | 100 | 5000 |
| Crossover prob. ( | 0.8 | N/A | 0.5 | N/A |
| Mutation prob. ( | 0.1 | N/A | 0.8 | N/A |
Fig. 8Results from 30 cm soil moisture data for AEAR and NAR models trained by four different optimization algorithms. Comparisons are of forecast error (a, b) and runtime of training (c). The mean and variance of all performance metrics are computed over 30 independent runs
Comparison of novel soil moisture models (NAR and AEAR) with classic machine learning models, state-of-the-art time series models, and soil moisture models with respect to Mean Absolute Percent Error (MAPE) and Maximum Error in forecasting soil moisture at 30 cm depth and with a 24 h time horizon
| Forecasting model | MAPE | Maximum error ( |
|---|---|---|
| Polynomial Regression | 23.15% ( | 0.2341 ( |
| Random Forest | 19.70% ( | 0.1901 ( |
| SVM | 18.36% ( | 0.1660 ( |
| ARMAX(1,10) | 17.83% ( | 0.0701 ( |
| ARMAX(5,10) | 18.71% ( | 0.0782 ( |
| ARMAX(10,10) | 18.13% ( | 0.0744 ( |
| LSTM(2) | 9.8% ( | 0.2046 ( |
| LSTM(4) | 6.3% ( | 0.1440 ( |
| LSTM(8) | 13.95% ( | 0.1511 ( |
| LSTM(16) | 17.37% ( | 0.1453 ( |
| SEM | 19.25% ( | 0.1384 ( |
| NAR | 7.83% ( | 0.0802 ( |
| AEAR | 7.16% ( | 0.0597 ( |
Mean and standard deviation of 30 independent runs are reported in each cell
Fig. 14Soil moisture forecasts at 30 cm depth using an LSTM(2) model in the regular setting. Points on the left of blue dashed vertical line are in the training set and on the right in the forecasting test set
Fig. 9Soil moisture forecasts for 5 and 15 cm depths using the AEAR model, based on a one-point measurement of soil moisture at . The points to the left of each of the blue dashed vertical lines are in the training set , and the points to the right are in the test set
Fig. 10Moisture forecasts for 30 cm depth using AEAR model. Points to left of blue dashed vertical line are in the training set and on the right in forecasting test set
Fig. 12Forecast errors as a function of prediction horizon (), varying on the x-axis, for regular measurements. Top: Standard error is on the y-axis. Bottom: Maximum error is on the y-axis
Fig. 11Comparison of different model forecasts (SEM, NAR, AEAR) at 30 cm depth. The standard error in these forecasts is , respectively. The vertical blue dashed line separates the training set (to the left) and forecasting set (to the right)
Fig. 13Forecast error for a 30 cm probe as function of prediction horizon (), varying on the x-axis, for regular measurements. Top: MAPE on the y-axis. Bottom: Maximum error on the y-axis
Fig. 15Results for controlled experimental bucket data. Forecast errors for different models are on the y-axis, as a function of varying (in regular measurements) on the x-axis. Top: Standard error on the y-axis. Bottom: Maximum error on the y-axis
Fig. 16Partition of Canyon Fire soil moisture data from 30 cm deep probe measurements and forecasts of trained AEAR models for each period. To compare the model parameters for different periods, we partitioned the almost four months of the winter 2007–2008 season (December–March) into Period 1 (“December”), Period 2 (“January”), Period 3 (“February”), and Period 4 (“March”)
Fig. 17Variation in wetting and drying properties of soil in AEAR model parameters. We depict the variations over a winter season’s data, shown in Fig. 16, in both time—different wetting rates over “months,” as shown in (a) and space—drying rates at three different depths, as shown in (b). To observe the spatial variation, the model parameters for various soil depths below ground surface are shown in (b)