| Literature DB >> 30018395 |
Abstract
Several cases of failure in the prediction of Indian Summer Monsoon Rainfall (ISMR) are the major concern for long-lead prediction. We propose that this is due to the temporal evolution of association/linkage (inherent concept of temporal networks) with various factors and climatic indices across the globe, such as El Niño-Southern Oscillation (ENSO), Equatorial Indian Ocean Oscillation (EQUINOO), Atlantic Multidecadal Oscillation (AMO), North Atlantic Oscillation (NAO), Pacific Decadal Oscillation (PDO) etc. Static models establish time-invariant (permanent) connections between such indices (predictors) and predictand (ISMR), whereas we hypothesize that such systems are temporally varying in nature. Considering hydroclimatic teleconnection with two major climate indices, ENSO and EQUINOO, we showed that the temporal persistence of the association is as low as three years. As an application of this concept, a statistical time-varying model is developed and the prediction performance is compared against its static counterpart (time-invariant model). The proposed approach is able to capture the ISMR anomalies and successfully predicts the severe drought years too. Specifically, 64% more accurate performance (in terms of RMSE) is achievable by the recommended time-varying approach as compared to existing time-invariant concepts.Entities:
Year: 2018 PMID: 30018395 PMCID: PMC6050344 DOI: 10.1038/s41598-018-28972-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Methodological concept and flowchart. (a) Model development involves two major steps – development of the conditional independence structure and development of the C-Vine copula model. The conditional independence structure is utilized for identification of the potential predictors. In this process, initially a fully saturated model is considered where the red nodes signify the predictors and the green nodes signify the target variable. After eliminating the insignificant edges (at 95% significance level), the blue nodes designate the parents of the target variable and the purple node designates conditionally independent variables. Next, the C-Vine copula model is utilized to evaluate the conditional dependence of the target variable given the potential predictors (directly influencing variables). (b) The time-varying concept utilizes a series of moving model development periods that utilizes the concept shown in a. The time-varying concept is incorporated in the model by sliding the model development period by n years (optimum prediction horizon) and updating the input variables and the model parameters (Table 1).
Details of the model development period, testing period and the probabilistic models developed from the dependence structure for the respective time period using optimal time horizon of 3 years.
| Sl. No | Model Development Period | Model Testing Period | Probabilistic Model with Inputs |
|---|---|---|---|
| 1 | 1950–1979 | 1980–1982 |
|
| 2 | 1953–1982 | 1983–1985 |
|
| 3 | 1956–1985 | 1986–1988 |
|
| 4 | 1959–1988 | 1989–1991 |
|
| 5 | 1962–1991 | 1992–1994 |
|
| 6 | 1965–1994 | 1995–1997 |
|
| 7 | 1968–1997 | 1998–2000 |
|
| 8 | 1971–2000 | 2001–2003 |
|
| 9 | 1974–2003 | 2004–2006 |
|
| 10 | 1977–2006 | 2007–2009 |
|
Figure 2Selection of optimum time horizon for the time-varying model. (a) ISMR prediction during the testing period (1980–2009) is carried using the time-varying C-Vine model considering the prediction time horizons of 1 to 10 years for the selection of the optimal time horizon. A grouped bar-plot showing different performance statistics for the predicted results using each time horizon clearly depict 3 years as the optimal prediction time horizon, (b) a heat map showing the year-wise deviation of the predicted values from observed ISMR for each time horizon and entire model testing period, and (c) the map is associated to the total error for the entire testing period, which shows prediction time horizon of 3 years is optimum. As the prediction time horizon increases beyond 3 years, the value of error increases indicating the decreasing temporal persistence of the identified model.
Figure 3The significant edges between the climatic indices and ISMR for time horizon of 3 years. (a) The presence or absence of association between the different lags of ENSO and ISMR is investigated through their respective edge strengths. The significant edges and the corresponding edge strength for a particular lag of ENSO with ISMR is shown by the bar plot. It is observed that the August (9th lag) ENSO of the previous year appears in 1950–1979, however with very low strength of association. The remaining five lags (8th, 10th, 11th, 12th and 13th lag) appear in 1959–1988 and show an increasing strength of association till 1962–1991 and then gradually reduce. Again the edges appear around 1971–2000 and the edge strength for all the lags (8th to 13th lag) shows an increasing association till 2009, and (b) the significant edges and the corresponding edge strength for a particular lag of EQUINOO with ISMR is shown by the bar plot. It can be observed that only July and June (10th and 11th lags) EQUINOO of the previous year, show significant association with ISMR. The July EQUINOO of previous year appears in the time period 1959–1988, right around the years lags of ENSO showed an appearance, and is consistently showing an increase in the edge strength till 2009. The June EQUINOO of previous year also shows significant edge strength right from 1950, then gradually reduces, again reappears around 1971 and again disappears by 2009.
Figure 4Comparison of the performance between time-varying and time-invariant approaches. (a) Observed and predicted rainfall anomaly (deviation from long-term mean) obtained using the time-varying C-Vine model, time-varying SVR model and time-invariant C-Vine model. For clarity time-invariant SVR model is not included which performs much poorer (Table 2), (b) the percentage absolute error resulted from aforementioned three prediction models. The time-varying C-Vine model is found to be the best performing model, and (c) comparison of the observed and predicted (50th percentile) ISMR using the time-varying C-Vine model. The uncertainty band of the predicted ISMR is shown as an envelope where the lower and the upper limits corresponds to the 5th and 95th percentile respectively.
Model performances during the testing period (1980–2009).
| Performance Statistics | Model Used | |||
|---|---|---|---|---|
| Time-varying C-Vine | Time-invariant C-Vine | Time-varying SVR | Time-invariant SVR | |
| R | 0.85 | 0.28 | 0.69 | 0.12 |
| RMSE | 30.16 | 84.74 | 58.28 | 103.88 |
| NSE | 0.72 | 0.02 | 0.42 | 0.01 |
| Dr | 0.75 | 0.53 | 0.65 | 0.36 |
| R2 | 0.72 | 0.07 | 0.48 | 0.01 |
In general, the time-varying concept provides better results and time-varying C-Vine model provides best results.