| Literature DB >> 35898102 |
Chen Gu1, Jifeng Qi2,3, Yizhi Zhao1, Wenming Yin1,4, Shanliang Zhu1,4.
Abstract
The effective estimation of mixed-layer depth (MLD) plays a significant role in the study of ocean dynamics and global climate change. However, the methods of estimating MLD still have limitations due to the sparse resolution of the observed data. In this study, a hybrid estimation method that combines the K-means clustering algorithm and an artificial neural network (ANN) model was developed using sea-surface parameter data in the Indian Ocean as a case study. The oceanic datasets from January 2012 to December 2019 were obtained via satellite observations, Argo in situ data, and reanalysis data. These datasets were unified to the same spatial and temporal resolution (1° × 1°, monthly). Based on the processed datasets, the K-means classifier was applied to divide the Indian Ocean into four regions with different characteristics. For ANN training and testing in each region, the gridded data of 84 months were used for training, and 12-month data were used for testing. The ANN results show that the optimized NN architecture comprises five input variables, one output variable, and four hidden layers, each of which has 40 neurons. Compared with the multiple linear regression model (MLR) with a root-mean-square error (RMSE) of 5.2248 m and the HYbrid-Coordinate Ocean Model (HYCOM) with an RMSE of 4.8422 m, the RMSE of the model proposed in this study was reduced by 27% and 22%, respectively. Three typical regions with high variability in their MLDs were selected to further evaluate the performance of the ANN model. Our results showed that the model could reveal the seasonal variation trend in each of the selected regions, but the estimation accuracy showed room for improvement. Furthermore, a correlation analysis between the MLD and input variables showed that the surface temperature and salinity were the main influencing factors of the model. The results of this study suggest that the pre-clustering ANN method proposed could be used to estimate and analyze the MLD in the Indian Ocean. Moreover, this method can be further expanded to estimate other internal parameters for typical ocean regions and to provide effective technical support for ocean researchers when studying the variability of these parameters.Entities:
Keywords: Indian Ocean; K-means clustering; artificial neural network model; mixed-layer depth
Mesh:
Year: 2022 PMID: 35898102 PMCID: PMC9371055 DOI: 10.3390/s22155600
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Flowchart of the methodologies used in this research.
Figure 2The study area used in this research.
Data summary.
| Data | Dimension | Usage | Data Source |
|---|---|---|---|
| SST, SSS (2012–2018) | 2869 × 84 months | Training and verifying model | Argo |
| SSH (2012–2018) | 2869 × 84 months | Training and verifying model | Aviso |
| UW, VW (2012–2018) | 2869 × 84 months | Training and verifying model | ASCAT |
| SST, SSS (2019) | 2869 × 12 months | Testing model | Argo |
| SSH (2019) | 2869 × 12 months | Testing model | Aviso |
| UW, VW (2019) | 2869 × 12 months | Testing model | ASCAT |
Samples of the ocean datasets in January 2019.
| Location | SSS (PSU) | SST (°C) | SSH (m) | UW (m/s) | VW (m/s) | MLD (m) |
|---|---|---|---|---|---|---|
| (27.5° S, 38.5° E) | 35.40 | 26.50 | −0.04 | 2.62 | −4.01 | 15.35 |
| (14.5° S, 44.5° E) | 34.99 | 29.48 | 0.30 | −2.87 | 0.10 | 17.37 |
| (11.5° S, 42.5° E) | 35.04 | 29.55 | 0.09 | −4.03 | 0.21 | 20.81 |
| (10.5° S, 41.5° E) | 35.08 | 29.56 | 0.06 | −4.39 | 0.34 | 22.42 |
| (0.5° S, 64.5° E) | 35.06 | 28.75 | −0.01 | −2.35 | −0.92 | 19.49 |
| (0.5° N, 61.5° E) | 35.19 | 28.41 | −0.01 | −2.38 | −1.36 | 12.22 |
| (1.5° N, 62.5° E) | 35.18 | 28.39 | −0.02 | −2.50 | −1.98 | 15.15 |
| (2.5° N, 63.5° E) | 35.15 | 28.42 | 0.00 | −2.87 | −2.89 | 19.89 |
| (5.5° N, 66.5° E) | 35.15 | 28.58 | 0.09 | −4.08 | −4.36 | 28.89 |
| (7.5° N, 69.5° E) | 35.16 | 28.67 | 0.05 | −3.54 | −3.56 | 25.45 |
| (9.5° N, 71.5° E) | 35.21 | 28.64 | 0.22 | −3.43 | −1.81 | 19.79 |
| (12.5° N, 72.5° E) | 34.68 | 28.53 | 0.12 | −3.24 | −0.60 | 21.41 |
| (7.5° S, 74.5° E) | 34.20 | 29.25 | 0.19 | 0.47 | −2.70 | 26.87 |
| (6.5° S, 76.5° E) | 34.16 | 29.10 | 0.12 | 0.68 | −1.59 | 23.33 |
| (5.5° S, 77.5° E) | 34.16 | 29.10 | 0.12 | 0.68 | −1.59 | 23.33 |
Design of experiments.
| Cluster Number | Clustering Variables | Training Models | RMSE |
|---|---|---|---|
| 1 | CSST, CSSS, CSSH, CUW, CVW | MLD = NN (SST, SSS, SSH, UW, VW) | 4.4002 |
| 2 | CSST, CSSS, CSSH, CUW, CVW | MLD = NN (SST, SSS, SSH, UW, VW) | 4.3451 |
| 3 | CSST, CSSS, CSSH, CUW, CVW | MLD = NN (SST, SSS, SSH, UW, VW) | 4.2770 |
| 4 | CSST, CSSS, CSSH, CUW, CVW | MLD = NN (SST, SSS, SSH, UW, VW) | 3.7936 |
| 5 | CSST, CSSS, CSSH, CUW, CVW | MLD = NN (SST, SSS, SSH, UW, VW) | 4.4384 |
Figure 3The outcome of the grid search strategy: (a) the RMSE variation with the number of neurons; (b) the RMSE variation with the number of hidden layers. Error bars have been included for each RMSE value based upon the standard deviation.
Figure 4The performance of the ANN model when using the multisource observation datasets in January 2019.
Figure 5The training and testing performance of the ANN model using multisource observation datasets.
The RMSE and R2 values of Case 1 to Case 5.
| Case | Clustering Variables | Training Models | Testing RMSE | Testing R2 |
|---|---|---|---|---|
| Case 1 | SST | MLD = NN (SST) | 5.1439 | 0.3223 |
| Case 2 | SST, SSS | MLD = NN (SST, SSS) | 4.8482 | 0.5150 |
| Case 3 | SST, SSS, SSH | MLD = NN (SST, SSS, SSH) | 4.1205 | 0.6392 |
| Case 4 | SST, SSS, SSH, SSW | MLD = NN (SST, SSS, SSH, SSW) | 3.8964 | 0.6859 |
| Case 5 | SST, SSS, SSH, UW, VW | MLD = NN (SST, SSS, SSH, UW, VW) | 3.7936 | 0.6664 |
Figure 6Annual mean MLD in the Indian Ocean estimated from (a) Argo data and (b) a pre-clustering ANN model in 2019.
Figure 7The RMSE (unit: m) (above) and R2 (below) for Case 5. The computation is based on the estimated and Argo MLDs.
Figure 8The monthly averaged MLD for the (a) SEAS, (b) BoB, and (c) EEIO in 2019. Error bars have been included for each ANN-estimated MLD based upon the standard deviation.
Figure 9Average annual MLD in the Indian Ocean estimated from (a) the HYCOM and (b) the MLR model in 2019.
Figure 10Spatial distribution of Pearson correlation coefficients between the estimated MLD and sea-surface parameters (SST, SSS, SSH, and SSW).
Figure 11Average annual Pearson correlation coefficients between the estimated MLD and sea-surface parameters (SST, SSS, SSH, and SSW).