| Literature DB >> 35017537 |
George D C Cavalcanti1, Domingos S de O Santos Júnior1, Eraylson G Silva1, Paulo S G de Mattos Neto2.
Abstract
The sea surface temperature (SST) is an environmental indicator closely related to climate, weather, and atmospheric events worldwide. Its forecasting is essential for supporting the decision of governments and environmental organizations. Literature has shown that single machine learning (ML) models are generally more accurate than traditional statistical models for SST time series modeling. However, the parameters tuning of these ML models is a challenging task, mainly when complex phenomena, such as SST forecasting, are addressed. Issues related to misspecification, overfitting, or underfitting of the ML models can lead to underperforming forecasts. This work proposes using hybrid systems (HS) that combine (ML) models using residual forecasting as an alternative to enhance the performance of SST forecasting. In this context, two types of combinations are evaluated using two ML models: support vector regression (SVR) and long short-term memory (LSTM). The experimental evaluation was performed on three datasets from different regions of the Atlantic Ocean using three well-known measures: mean square error (MSE), mean absolute percentage error (MAPE), and mean absolute error (MAE). The best HS based on SVR improved the MSE value for each analyzed series by [Formula: see text], [Formula: see text], and [Formula: see text] compared to its respective single model. The HS employing the LSTM improved [Formula: see text], [Formula: see text], and [Formula: see text] concerning the single LSTM model. Compared to literature approaches, at least one version of HS attained higher accuracy than statistical and ML models in all study cases. In particular, the nonlinear combination of the ML models obtained the best performance among the proposed HS versions.Entities:
Year: 2022 PMID: 35017537 PMCID: PMC8752630 DOI: 10.1038/s41598-021-04238-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1NoLiC training and testing phases.
Figure 2Buoys locations of the PIRATA project in the Atlantic Ocean. Font: https://www.pmel.noaa.gov/tao/drupal/disdel/.
Figure 3Sea surface temperature time series.
Properties of the SST time series used in this work.
| Adopted acronym | S1 | S2 | S3 |
|---|---|---|---|
| Localization | |||
| Time unit | Daily | Daily | Daily |
| Start date | Oct 08, 00 | Sep 09, 99 | May 10, 07 |
| Start date (test set) | Oct 01, 06 | Sep 07, 10 | Jun 30, 16 |
| End date | Oct 07, 07 | Sep 08, 11 | Jul 02, 17 |
| Total size | 2545 | 4376 | 3699 |
| Training sample size | 1815 | 3646 | 2969 |
| Validation sample size | 365 | 365 | 365 |
| Testing sample size | 365 | 365 | 365 |
Values of the parameters of the SVR and LSTM models.
| Model | Parameters | Values |
|---|---|---|
| SVR | Gamma | [0.001, 1] |
| Cost | [0.1, 1, 100] | |
| Tolerance | [0.001, 0.01, 0.1] | |
| Kernel | Radial basis function[ | |
| LSTM | Units in hidden layer | [2, 5, 10] |
| Algorithm | Adam[ |
Selected parameters for SVR and LSTM in the combination approaches using a grid search in the validation set for 1 day ahead SST forecasting.
| Time series | Model | Parameters | Combination approaches | ||||
|---|---|---|---|---|---|---|---|
| Perturbative[ | NoLiC[ | ||||||
| S1 | SVR | Gamma | 1 | 1 | 0.001 | 1 | 1 |
| Cost | 1 | 0.1 | 1 | 1 | 100 | ||
| Tolerance | 0.001 | 0.001 | 0.01 | 0.001 | 0.01 | ||
| Inputs | 2 | 2 | 1 | 1 | 2 | ||
| LSTM | Units in hidden layer | 2 | 5 | 5 | – | 10 | |
| Inputs | 1 | 2 | 2 | – | 2 | ||
| S2 | SVR | Gamma | 0.001 | 1 | 1 | 0.001 | 1 |
| Cost | 100 | 1 | 0.1 | 100 | 1 | ||
| Tolerance | 0.001 | 0.01 | 0.01 | 0.001 | 0.001 | ||
| Inputs | 23 | 2 | 2 | 2 | 2 | ||
| LSTM | Units in hidden layer | 5 | 10 | 5 | – | 5 | |
| Inputs | 5 | 2 | 2 | – | 2 | ||
| S3 | SVR | Gamma | 1 | 0.001 | 1 | 1 | 1 |
| Cost | 1 | 100 | 1 | 100 | 100 | ||
| Tolerance | 0.001 | 0.01 | 0.01 | 0.01 | 0.01 | ||
| Inputs | 3 | 2 | 2 | 2 | 2 | ||
| LSTM | Units in hidden layer | 5 | 5 | 5 | – | 10 | |
| Inputs | 5 | 2 | 2 | – | 2 | ||
Comparison in terms of MSE, MAPE, and MAE of the combination approaches with single statistical and Machine Learning models of the literature applied to the SST daily forecasting.
| Dataset | Approach | Model | MSE | MAPE | MAE |
|---|---|---|---|---|---|
| Perturbative | SVR | 6.89E−04 | 3.61 | 1.85E−02 | |
| LSTM | 6.89E−04 | 3.61 | 1.85E−02 | ||
| NoLiC | SVR | 6.71E−04 | 3.60 | 1.84E−02 | |
| LSTM | |||||
| Literature | ETS[ | 5.36E−03 | 12.30 | 5.85E−02 | |
| Single SVR[ | 3.78E−03 | 9.40 | 4.56E−02 | ||
| Single LSTM[ | 5.06E−03 | 10.83 | 5.21E−02 | ||
| ConvLSTM[ | 1.39E−03 | 5.53 | 2.72E−02 | ||
| NARX[ | 8.77E−04 | 5.28 | 2.51E−02 | ||
| Perturbative | SVR | ||||
| LSTM | 1.98 | 7.86E−03 | |||
| NoLiC | SVR | 1.01E−03 | 4.52 | 2.08E−02 | |
| LSTM | 1.31E−04 | 2.19 | 8.82E−03 | ||
| Literature | ETS[ | 3.87E−03 | 13.93 | 5.43E−02 | |
| Single SVR[ | 1.01E−02 | 20.83 | 8.78E−02 | ||
| Single LSTM[ | 8.30E−03 | 18.86 | 7.93E−02 | ||
| ConvLSTM[ | 8.59E−04 | 5.92 | 2.33E−02 | ||
| NARX[ | 2.03E−04 | 3.04 | 1.16E−02 | ||
| Perturbative | SVR | 9.38E−04 | 3.80 | 2.36E−02 | |
| LSTM | 7.91E−04 | ||||
| NoLiC | SVR | 9.02E−04 | 3.75 | 2.34E−02 | |
| LSTM | 3.43 | 2.15E−02 | |||
| Literature | ETS[ | 5.78E−03 | 9.00 | 5.56E−02 | |
| Single SVR[ | 2.58E−03 | 6.20 | 3.75E−02 | ||
| Single LSTM[ | 1.15E−03 | 3.96 | 2.44E−02 | ||
| ConvLSTM[ | 1.27E−03 | 4.30 | 2.60E−02 | ||
| NARX[ | 9.18E−04 | 3.89 | 2.35E−02 |
For each data set, the best value of the metrics is highlighted in bold.
Percentage difference between the perturbative approach and literature models for MSE, MAPE, and MAE.
| Dataset | Model | Pertubative approach | |||||
|---|---|---|---|---|---|---|---|
| SVR | LSTM | ||||||
| MSE | MAPE | MAE | MSE | MAPE | MAE | ||
| S1 | ETS[ | 87.14 | 70.63 | 68.27 | 87.14 | 70.63 | 68.27 |
| Single SVR[ | 81.78 | 61.59 | 59.29 | 81.78 | 61.59 | 59.29 | |
| Single LSTM[ | 86.37 | 66.65 | 64.41 | 86.37 | 66.65 | 64.41 | |
| ConvLSTM[ | 50.25 | 34.73 | 31.85 | 50.25 | 34.73 | 31.85 | |
| NARX[ | 21.44 | 31.63 | 26.15 | 21.44 | 31.63 | 26.15 | |
| S2 | ETS[ | 97.20 | 85.88 | 85.59 | 97.20 | 85.79 | 85.51 |
| Single SVR[ | 98.93 | 90.56 | 91.09 | 98.92 | 90.50 | 91.05 | |
| Single LSTM[ | 98.70 | 89.57 | 90.13 | 98.69 | 89.51 | 90.08 | |
| ConvLSTM[ | 87.39 | 66.75 | 66.40 | 87.38 | 66.55 | 66.21 | |
| NARX[ | 46.76 | 35.27 | 32.71 | 46.71 | 34.88 | 32.34 | |
| S3 | ETS[ | 83.76 | 57.73 | 57.62 | 86.31 | 62.14 | 61.98 |
| Single SVR[ | 63.64 | 38.66 | 37.20 | 69.35 | 45.06 | 43.65 | |
| Single LSTM[ | 18.07 | 3.94 | 3.36 | 30.93 | 13.97 | 13.30 | |
| ConvLSTM[ | 25.90 | 11.48 | 9.25 | 37.54 | 20.72 | 18.57 | |
| NARX[ | − 2.15 | 2.16 | − 0.30 | 13.89 | 12.37 | 10.01 | |
Percentage difference between the NoLic and literature models for MSE, MAPE, and MAE.
| Dataset | Model | NoLiC | |||||
|---|---|---|---|---|---|---|---|
| SVR | LSTM | ||||||
| MSE | MAPE | MAE | MSE | MAPE | MAE | ||
| S1 | ETS[ | 87.48 | 70.76 | 68.58 | 92.59 | 78.46 | 76.36 |
| Single SVR[ | 82.26 | 61.75 | 59.69 | 89.51 | 71.83 | 69.67 | |
| Single LSTM[ | 86.73 | 66.79 | 64.75 | 92.15 | 75.54 | 73.48 | |
| ConvLSTM[ | 51.57 | 35.00 | 32.51 | 71.35 | 52.13 | 49.23 | |
| NARX[ | 23.51 | 31.91 | 26.87 | 54.75 | 49.86 | 44.98 | |
| S2 | ETS[ | 73.98 | 67.52 | 61.60 | 96.62 | 84.29 | 83.74 |
| Single SVR[ | 90.01 | 78.28 | 76.28 | 98.70 | 89.50 | 89.95 | |
| Single LSTM[ | 87.87 | 76.01 | 73.72 | 98.42 | 88.40 | 88.87 | |
| ConvLSTM[ | − 17.24 | 23.53 | 10.49 | 84.76 | 63.02 | 62.09 | |
| NARX[ | − 395.08 | − 48.86 | − 79.26 | 35.64 | 28.00 | 24.09 | |
| S3 | ETS[ | 84.38 | 58.30 | 57.94 | 86.60 | 61.86 | 61.29 |
| Single SVR[ | 65.03 | 39.50 | 37.68 | 70.00 | 44.66 | 42.64 | |
| Single LSTM[ | 21.20 | 5.25 | 4.10 | 32.41 | 13.34 | 11.74 | |
| ConvLSTM[ | 28.73 | 12.69 | 9.94 | 38.87 | 20.14 | 17.11 | |
| NARX[ | 1.75 | 3.50 | 0.47 | 15.73 | 11.73 | 8.39 | |
Figure 4One day ahead forecasting for the SST time series on the test set with Perturbative approach, NoLiC and the respective single model.
Results of the comparison of the hybrid systems using SVR and LSTM with single and literature models using Diebold–Mariano hypothesis test.
| Dataset | Model | Perturbative | NoLiC | ||
|---|---|---|---|---|---|
| SVR | LSTM | SVR | LSTM | ||
| S1 | ETS[ | + | + | + | + |
| Single SVR | + | + | + | + | |
| Single LSTM | + | + | + | + | |
| NARX[ | + | + | + | + | |
| ConvLSTM[ | + | + | + | + | |
| S2 | ETS[ | + | + | + | + |
| Single SVR | + | + | + | + | |
| Single LSTM | + | + | + | + | |
| NARX[ | + | + | − | + | |
| ConvLSTM[ | + | + | − | + | |
| S3 | ETS[ | + | + | + | + |
| Single SVR | + | + | + | + | |
| Single LSTM | + | + | + | + | |
| NARX[ | + | + | + | + | |
| ConvLSTM[ | + | + | + | + | |
Testing time in seconds of the single models and combination approaches for 1 day ahead forecasting.
| Datasets | Model | Approach | Execution time |
|---|---|---|---|
| Mean (Std) | |||
| S1 | SVR | Single | 0.014 (0.002) |
| 0.053 (0.005) | |||
| NoLiC | 0.064 (0.120) | ||
| LSTM | Single | 0.117 (0.037) | |
| 0.371 (0.088) | |||
| NoLiC | 0.261 (0.057) | ||
| S2 | SVR | Single | 0.011 (0.005) |
| 0.062 (0.011) | |||
| NoLiC | 0.303 (0.027) | ||
| LSTM | Single | 0.105 (0.009) | |
| 0.371 (0.036) | |||
| NoLiC | 0.271 (0.016) | ||
| S3 | SVR | Single | 0.020 (0.005) |
| 0.100 (0.012) | |||
| NoLiC | 0.282 (0.024) | ||
| LSTM | Single | 0.098 (0.025) | |
| 0.421 (0.107) | |||
| NoLiC | 0.338 (0.043) |
For each approach is presented the mean testing time and the respective standard deviation.