| Literature DB >> 28956864 |
Yu Jiang1, Yu Gou2, Tong Zhang3, Kai Wang4, Chengquan Hu5.
Abstract
With the rapid development of sensor networks, big marine data arises. To efficiently use these data to predict thermoclines, we propose a machine learning approach. We firstly focus on analyzing how temperature, salinity, and geographic location features affect the formation of thermocline. Then, an improved model based on entropy value method for the thermocline selection is demonstrated. The experiments adopt BOA Argo data sets and the experimental results show that our novel model can predict thermoclines and related data effectively.Entities:
Keywords: entropy value calculation; machine learning; statistical learning; thermocline
Year: 2017 PMID: 28956864 PMCID: PMC5676641 DOI: 10.3390/s17102225
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Marine data analytical process.
Figure 2Thermocline.
Figure 3Inverse Thermocline.
Figure 4Multi-Thermocline.
Figure 5Mix Thermocline.
Characteristics and their descriptions.
| Characteristics | Brief Descriptions |
|---|---|
| Latitude/Longitude | Where the data were collected |
| Year/Month | When the data were collected |
| Depth | Distance from sea surface to the point |
| Temperature | Temperature of the point |
| Strength | Temperature difference/depth difference |
| Salinity | Hydrological characteristics |
Figure 6Three-dimensional state space describing underwater thermocline.
Figure 7The samples distribution map.
Accuracy vs size of training data.
| Train Set | Test Set | Accuracy |
|---|---|---|
| 5000 | 95,000 | 0.99245 |
| 10,000 | 90,000 | 0.99486 |
| 15,000 | 85,000 | 0.99570 |
| 20,000 | 80,000 | 0.99623 |
| 25,000 | 75,000 | 0.99671 |
| 30,000 | 70,000 | 0.99718 |
| 40,000 | 60,000 | 0.99741 |
| 50,000 | 50,000 | 0.99763 |
| 70,000 | 30,000 | 0.99801 |
Figure 8Three-dimensional Correlation among temperature, depth and time.
Figure 9Correlation between depth and temperature.
Figure 10Correlation between depth and strength.
Figure 11Attribute space made up of temperature, strength and depth.
Weights of each attribute when forming thermocline monthly.
| w | Latitude | Longitude | Depth | Temperature | Strength | Salinity |
|---|---|---|---|---|---|---|
| January | 0.1266 | 0.1266 | 0.0752 | 0.1737 | 0.4867 | 0.0112 |
| February | 0.1360 | 0.1360 | 0.0808 | 0.1810 | 0.4554 | 0.0108 |
| March | 0.1391 | 0.1391 | 0.0826 | 0.1827 | 0.4454 | 0.0112 |
| April | 0.1388 | 0.1388 | 0.0825 | 0.1867 | 0.4388 | 0.0144 |
| May | 0.1361 | 0.1361 | 0.0809 | 0.1885 | 0.4365 | 0.0218 |
| June | 0.1349 | 0.1349 | 0.0802 | 0.1866 | 0.4399 | 0.0235 |
| July | 0.1350 | 0.1350 | 0.0802 | 0.1905 | 0.4500 | 0.0094 |
| August | 0.1338 | 0.1338 | 0.0795 | 0.1926 | 0.4490 | 0.0112 |
| September | 0.1339 | 0.1339 | 0.0796 | 0.1910 | 0.4534 | 0.0082 |
| October | 0.1339 | 0.1339 | 0.0796 | 0.1898 | 0.4519 | 0.0110 |
| November | 0.1337 | 0.1337 | 0.0795 | 0.1909 | 0.4553 | 0.0069 |
| December | 0.1342 | 0.1342 | 0.0797 | 0.1908 | 0.4544 | 0.0067 |
Figure 12Weights s of each point from sea surface to 500 m underwater.
Figure 13Correlation between strength and score.
Figure 14Marine data curve and the fitting function curve.
Figure 15Residual plot.
Fitting function and all the residuals.
| Fitting Function | y = p1 × z10 + p2 × z9 + p3 × z8 + p4 × z7 + p5 × z6 + p6 × z5 + p7 × z4 + p8 × z3 + p9 × z2 + p10 × z + p11 | |
|---|---|---|
| Coefficient | P1 | −1.1173 |
| P2 | 0.75808 | |
| P3 | 8.3512 | |
| P4 | −6.2657 | |
| P5 | −20.714 | |
| P6 | 17.92 | |
| P7 | 15.847 | |
| P8 | −18.203 | |
| P9 | 6.6455 | |
| P10 | −6.0711 | |
| P11 | 7.3533 | |
| Residual | 7.2057 | |
| z | (x-mu)/sigma | |
| mu | 249.5 | |
| sigma | 144.48 | |