Literature DB >> 30003049

Wavelet regression: An approach for undertaking multi-time scale analyses of hydro-climate relationships.

Jianhua Xu1,2.   

Abstract

Previous studies showed that hydro-climate processes are stochastic and complex systems, and it is difficult to discover the hidden patterns in the and non-stationary data and thoroughly understand the hydro-climate relationships. For the purpose to show multi-time scale responses of a hydrological variable to climate change, we developed an integrated approach by combining wavelet analysis and regression method, which is called wavelet regression (WR). The customization and the advantage of this approach over the existing methods are presented below: •The patterns in the data series of a hydrological variable and its related climatic factors are revealed by the wavelet analysis at different time scales.•The hydro-climate relationship of each pattern is revealed by the regression method based on the results of wavelet analysis.•The advantage of this approach over the existing methods is that the approach provides a routing to discover the hidden patterns in the stochastic and non-stationary data and quantitatively describe the hydro-climate relationships at different time scales.

Entities:  

Keywords:  Hydro-climate relationship; Multi-time scale analyses; Northwest China; Wavelet regression

Year:  2018        PMID: 30003049      PMCID: PMC6041364          DOI: 10.1016/j.mex.2018.05.005

Source DB:  PubMed          Journal:  MethodsX        ISSN: 2215-0161


Wavelet analysis

Wavelet transformation has been shown to be a powerful technique for characterizing the frequency, intensity, scale, and duration of variations in hydro-climatic process [[1], [2], [3]]. Wavelet analysis can also reveal localized time and frequency information without requiring the signal time series to be stationary, as required by the Fourier transform and other spectral methods [4]. The first task of WR is to approximate the variation patterns of a hydrological variable and its related climatic factors by using wavelet decomposition and reconstruction at different time scales. The principle of wavelet decomposition and reconstruction is as follows [5,6]. Considering a given signal X(t), such as streamflow, temperature and precipitation, etc., which can be built up as a sequence of projections onto Father and Mother wavelets indexed by both k {k = 1, 2, ……} and s {s = 2j, j = 1, 2, ……}. The coefficients in the expansion are given by the projectionswhere J is the maximum scale sustainable by the number of data points, is father wavelet, and is mother wavelet. Generally, father wavelet is used for the lowest-frequency smooth components, which requires wavelet with the widest support; mother wavelet is used for the highest-frequency detailed components. In other words, father wavelet is used for the major trend components, and mother wavelet is used for all deviations from the trend [6,7]. Once a mother wavelet is selected, the wavelet transform can be used to decompose a signal according to scale, allowing separation of the fine-scale behavior (detail) from the large-scale behavior (approximation) of the signal [6,7]. The relationship between scale and signal behavior is designated as follows: a low scale corresponds to compressed wavelet as well as rapidly changing details, namely high frequency, whereas a high scale corresponds to stretched wavelet and slowly changing coarse features, namely low frequency. Signal decomposition is typically conducted in an iterative fashion using a series of scales such as a = 2, 4, 8, ……, 2L, with successive approximations being split in turn so that one signal is broken down into many lower resolution components [6]. The representation of the signal X (t) now can be given by:where and . In general, we have the relationship aswhere {S, S-1, …, S1}is a sequence of multi-resolution approximations of the function X(t) at ever-increasing levels of refinement. The corresponding multi-resolution decomposition of X (t) is given by {, , ……,……, } [6]. Selecting a proper wavelet function is a prerequisite for wavelet analysis. The actual criteria for wavelet selection include self-similarity, compactness, and smoothness [8,9]. Choosing the Symmlet family [6], we experimented with alternative choices of scaling functions, and found that the results from ‘Sym8’ are robust. Therefore, ‘sym8’ is used for approximating the variation patterns of the hydrological variable and its related climatic factors at different time scales.

Regression analysis based on the results of wavelet analysis

The second task of WR is to fit the regression equation describing the hydro-climate relationship between a hydrological variable with its related climatic factors for each pattern at the chosen time scale based on the results of wavelet analysis. Because the hydrological variable (e.g. streamflow or groundwater table, etc.) is affected by climatic factors (e.g. temperature, precipitation, etc.), we generally suppose the hydrological variable as dependent variable, Y, and climatic factors as independent variables, X, X, …, X. The linear regression equation is as follows:where, is a constant, and and are partial regression coefficient, which can be fitted by the Least Squares [10]. The significance of the regression Eq. (4) should be tested by the F-test with a significance level [11]. If the linear regression equation cannot well describe the hydro-climate relationship between a hydrological variable with its related climatic factors, we must fit a nonlinear regression equation instead. Using the above approach, we can fit a most suitable regression equation for each pattern at the chosen time scale to describe the hydro-climate relationship between a hydrological variable with its related climatic factors based on the results of wavelet analysis. We called the above regression equation based on the results of wavelet analysis as wavelet regression equation (WRE).

The test of wavelet regression equation

We suggest to use the coefficient of determination and Akaike information criterion (AIC) to test the fitting effect of the above WRE. In order to identify the uncertainty of the wavelet regression equation for a given time scale, the coefficient of determination, also known as the goodness of fit, was calculated as follows [11]:where is the coefficient of determination; and are the simulate value by the WRE and observed data of the hydrological variable; is the mean of (); is the residual sum of squares; is the total sum of squares. The coefficient of determination is a measure of how well the simulate results represent the actual data. A bigger indicates a higher certainty and lower uncertainty of the WRE. To compare the relative goodness between different WREs, we also used the measure of Akaike information criterion (AIC) [12]. The formula of AIC is as follows:where k is the number of parameters estimated in the model; n is the number of samples; RSS is the same as in formula (5). A smaller AIC indicates a better goodness of the WRE [12]. For small sample sizes (i.e., n/k ≤ 40), the second-order Akaike Information Criterion (AICc) should be used insteadwhere n is the sample size. As the sample size increases, the last term of the approaches zero, and the tends to yield the same conclusions as the AIC [13].

An application case

As known, the observation data from hydrological and climatic stations always present the stochastic and non-stationary characteristic. How can we reveal the patterns hidden in the stochastic and non-stationary data? Several application cases [[14], [15], [16], [17]] in Northwest China have proved that the WR is effective approach, which present a good performance. Fig.1 shows the observation data of annual average temperature (AAT), annual precipitation (AP) and annual runoff (AR) in the Yarkand River basin of Northwest China. It is evident that all the data series of AAT, AP and AR are fluctuating, and difficult to identify the patterns hidden in the raw data. In order to uncover the patterns hidden in the raw data, we now analyze hydro-climate relationships at different time scales using the wavelet regression.
Fig. 1

Hydrological and climatic observation data of Yarkland river basin, Northwest China.

Hydrological and climatic observation data of Yarkland river basin, Northwest China. The five scales of time are designated as s1 to s5, the Fig. 2(a) presents five variation patterns of AR. The s1 curve retains a large amount of residual from the raw data, and drastic fluctuations exist in the study period. These characteristics indicate that, although the runoff varied greatly throughout the study period, there was a hidden increasing trend. The s2 curve still retains a considerable amount of residual, as indicated by the presence of 4 peaks and 4 valleys. However, the s2 curve is much smoother than the s1 curve, which allows the hidden increasing trend to be more apparent. The s3 curve retains much less residual, as indicated by the presence of 2 peaks and 2 valleys. Compared to s2, the increase in runoff over time is more apparent in s3. Finally, the s5 curve presents an ascending tendency, whereas the increasing trend is obvious in the s4 curve. Fig. 2(b) and (c) present five variation patterns of AAT and AP, which show the similar variation patterns to AR at the five scales of time.
Fig. 2

Variation patterns at different time scales of (a) annual runoff, (b) annual average temperature, and (c) annual precipitation.

Variation patterns at different time scales of (a) annual runoff, (b) annual average temperature, and (c) annual precipitation. Based on the results of wavelet analysis, five linear regression equations to describe the hydro-climate relationships between AR with AAT and AP were fitted at the five scales of time (Table 1), which show multi-time scale responses of annual runoff to regional climatic change in the Yarkand River basin of Northwest China.
Table 1

Wavelet regression equations to describe the relationships between annual runoff and annual average temperature & annual precipitation at different time scales.

Time scaleRegression equationR2FSignificance level αAIC
s0AR = 8.8150AAT + 0.0793AP − 9.62030.366612.73400.01400.528
s1AR = 10.7510AAT + 0.0742AP  − 26.56510.576529.94780.001209.924
s2AR = 11.9308AAT + 0.0706AP  − 36.94120.675445.77530.001143.263
s3AR = 11.4623AAT + 0.1397AP − 39.22540.799187.50660.00112.960
s4AR = −10.6715AAT + 0.6335AP + 113.19790.9509425.69540.001−96.714
s5AR = 25.6166AAT − 0.2411AP − 130.87050.99991598.9850.001−209.172

Notes: AR – annual runoff, AAT – annual average temperature, and AP – annual precipitation; s0, s1, s2, s3, s4 and s5 represent 1-year, 2-year, 4-year, 8-year, 16-year and 32-year scale, respectively.

Wavelet regression equations to describe the relationships between annual runoff and annual average temperature & annual precipitation at different time scales. Notes: AR – annual runoff, AAT – annual average temperature, and AP – annual precipitation; s0, s1, s2, s3, s4 and s5 represent 1-year, 2-year, 4-year, 8-year, 16-year and 32-year scale, respectively. Table 1 tells us that the significant level of wavelet regression equations (WREs) at the five scales of time from s1 (2-year scale) to s5 (32-year scale) achieved as high as α = 0.001, and the WRE at 1-year scale are also achieved the significant level of α = 0.01. By comparing their AIC values, we also compare the fitting effects of WREs at different time scales. The order of fitting effect of WREs at different time scales is as follows: the fitting effect of the WRE at s5 (32-year scale) is the best, that at s4 (16-year scale) is second, that at s3 (8-year scale) is third, that at s4 (4-year scale) is fourth, that at s1 (4-year scale) is the penult, and that at s0 is the worst.

Summary

Combining wavelet analysis and regression method, we developed an integrated approach, the wavelet regression (WR), which can be used to show the multi-time scale responses of a hydrological variable to climate change. The principle of the approach is that the wavelet analysis is used to reveal the variation patterns of a hydrological variable and its related climatic factors at different time scales, and then the regression method is used to show the hydro-climate relationship between the hydrological variable with its related climatic factors for each pattern at the chosen time scale. To illustrate the application of the approach, the hydro-climate relationships between annual runoff (AR) with annual average temperature (AAT) and annual precipitation (AP) in the Yarkand River basin of Northwest China have been analyzed at different time scales. Though the effects of wavelet regression equations (WREs) at time scales are different (the larger time scale, the better effect), overall the wavelet regression equations (WREs) can significantly describe the relations between AR with AAT and AP based at chosen time scales.

Additional information

Previous studies showed that hydro-climate processes are complex systems [[18], [19], [20], [21], [22]], and the observed data of climate and hydrology are non-stationary and stochastic [[23], [24], [25], [26]]. How can we find out the patterns hidden in the stochastic and non-stationary data? Multi-time scale analysis is an approach to attempt [27,28]. To show the multi-time scale responses of a hydrological variable (e.g. runoff, evaporation, or groundwater level, etc.) to climate change, we developed an integrated approach by combining wavelet analysis and regression method, which is called wavelet regression (WR). The main idea of the approach is that the wavelet analysis is used to reveal the variation patterns of a hydrological variable and its related climatic factors at different time scales, and then the regression method is used to show the hydro-climate relationship between the hydrological variable and its related climatic factors for each pattern at the chosen time scale. The procedure of this approach is shown as Fig. 3, and the effectiveness of the approach has been verified in some case studies [[14], [15], [16], [17],[29], [30], [31], [32]].
Fig. 3

The procedure of wavelet regression for multi-time scale analyses of hydro-climate relationship.

The procedure of wavelet regression for multi-time scale analyses of hydro-climate relationship.

Conflicts of interest

The author declares that they have no conflicts of interest.
  1 in total

1.  The vegetation coverage dynamic coupling with climatic factors in Northeast China Transect.

Authors:  Qin Nie; Jianhua Xu; Minhe Ji; Lei Cao; Yang Yang; Yulian Hong
Journal:  Environ Manage       Date:  2012-06-22       Impact factor: 3.266

  1 in total
  1 in total

1.  Understanding shallow soil moisture variation in the data-scarce area and its relationship with climate change by GLDAS data.

Authors:  Jingping Zuo; Jianhua Xu; Weihong Li; Dongyang Yang
Journal:  PLoS One       Date:  2019-05-22       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.