| Literature DB >> 31480449 |
Julius Venskus1, Povilas Treigys2, Jolita Bernatavičienė1, Gintautas Tamulevičius1, Viktor Medvedev1.
Abstract
The automated identification system of vessel movements receives a huge amount of multivariate, heterogeneous sensor data, which should be analyzed to make a proper and timely decision on vessel movements. The large number of vessels makes it difficult and time-consuming to detect abnormalities, thus rapid response algorithms should be developed for a decision support system to identify abnormal movements of vessels in areas of heavy traffic. This paper extends the previous study on a self-organizing map application for processing of sensor stream data received by the maritime automated identification system. The more data about the vessel's movement is registered and submitted to the algorithm, the higher the accuracy of the algorithm should be. However, the task cannot be guaranteed without using an effective retraining strategy with respect to precision and data processing time. In addition, retraining ensures the integration of the latest vessel movement data, which reflects the actual conditions and context. With a view to maintaining the quality of the results of the algorithm, data batching strategies for the neural network retraining to detect anomalies in streaming maritime traffic data were investigated. The effectiveness of strategies in terms of modeling precision and the data processing time were estimated on real sensor data. The obtained results show that the neural network retraining time can be shortened by half while the sensitivity and precision only change slightly.Entities:
Keywords: SOM data batching; marine traffic anomaly detection; model sensitivity and precision; neural network retrain time; streaming sensors data
Year: 2019 PMID: 31480449 PMCID: PMC6749247 DOI: 10.3390/s19173782
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Data split scheme.
Selection of learning rate.
| Learning Rate | TP | FP | TN | FN | Precision | Sensitivity |
|---|---|---|---|---|---|---|
| 0.005 | 924 | 519 | 26,648 | 757 | 0.6403 | 0.5497 |
| 0.010 | 943 | 505 | 26,662 | 738 | 0.6512 | 0.5610 |
| 0.015 | 957 | 498 | 26,669 | 724 | 0.6577 | 0.5693 |
| 0.020 | 963 | 487 | 26,680 | 718 | 0.6641 | 0.5729 |
| 0.025 | 968 | 478 | 26,689 | 713 | 0.6694 | 0.5758 |
| 0.030 | 976 | 471 | 26,696 | 705 | 0.6745 | 0.5806 |
| 0.035 | 986 | 468 | 26,699 | 695 | 0.6781 | 0.5866 |
| 0.040 | 998 | 461 | 26,706 | 683 | 0.6840 | 0.5937 |
| 0.050 | 1025 | 445 | 26,722 | 656 | 0.6973 | 0.6098 |
| 0.060 | 1066 | 413 | 26,754 | 615 | 0.7208 | 0.6341 |
| 0.070 | 1109 | 394 | 26,773 | 572 | 0.7379 | 0.6597 |
| 0.100 | 1197 | 303 | 26,864 | 484 | 0.7980 | 0.7121 |
| 0.200 | 1431 | 135 | 27,032 | 250 | 0.9138 | 0.8513 |
| 0.300 | 1486 | 81 | 27,086 | 195 | 0.9483 | 0.8840 |
| 0.400 | 1500 | 55 | 27,112 | 181 | 0.9646 | 0.8923 |
|
|
|
|
|
|
|
|
| 0.600 | 1507 | 54 | 27,113 | 174 | 0.9654 | 0.8965 |
| 0.700 | 1502 | 59 | 27,108 | 179 | 0.9622 | 0.8935 |
Training Strategy I performance at learning rate 0.5.
| Stage | TP | FP | TN | FN | Precision | Sensitivity |
|---|---|---|---|---|---|---|
| Testing (model error) | 1510 | 52 | 27,115 | 171 | 0.9667 | 0.8983 |
| Testing (general error) | 1868 | 69 | 33,890 | 233 | 0.9644 | 0.8891 |
Strategy II performance on model test data.
| No. | TP | FP | TN | FN | Precision | Sensitivity |
|---|---|---|---|---|---|---|
| 1 | 1364 | 241 | 26,926 | 317 | 0.8498 | 0.8114 |
| 2 | 1329 | 280 | 26,887 | 352 | 0.8260 | 0.7906 |
| 3 | 1359 | 252 | 26,915 | 322 | 0.8436 | 0.8084 |
| 4 | 1364 | 274 | 26,893 | 317 | 0.8327 | 0.8114 |
| 5 | 1356 | 253 | 26,914 | 325 | 0.8428 | 0.8067 |
| 6 | 1335 | 253 | 26,914 | 346 | 0.8407 | 0.7942 |
| 7 | 1314 | 251 | 26,916 | 367 | 0.8396 | 0.7817 |
| 8 | 1332 | 258 | 26,909 | 349 | 0.8377 | 0.7924 |
|
|
|
|
|
|
|
|
| 10 | 1338 | 240 | 26,927 | 343 | 0.8497 | 0.7960 |
| max | 0.8522 | 0.8132 | ||||
| min | 0.8260 | 0.7817 | ||||
| average | 0.8413 | 0.8011 | ||||
| stdev | 0.0079 | 0.0115 |
Retraining Strategy II performance at learning rate 0.025.
| Stage | TP | FP | TN | FN | Precision | Sensitivity |
|---|---|---|---|---|---|---|
| Testing (model error) | 1500 | 98 | 27,069 | 181 | 0.9387 | 0.8923 |
| Testing (general error) | 1836 | 122 | 33,837 | 265 | 0.9377 | 0.8739 |
Partitioning of dataset (Strategy III).
| Data Batches | % of Train and Validation Data | New Data Items | All Data Items |
|---|---|---|---|
| T1 | 60% | 69,235 | 69,235 |
| Tm2 | 10% | 11,539 | 23,078 |
| Tm3 | 10% | 11,539 | 23,078 |
| Tm4 | 10% | 11,539 | 23,078 |
| Tm5 | 10% | 11,539 | 23,078 |
Retraining Strategy III performance at learning rate 0.003.
| Stage | TP | FP | TN | FN | Precision | Sensitivity |
|---|---|---|---|---|---|---|
| Testing (model error) | 1527 | 73 | 27,094 | 154 | 0.9544 | 0.9084 |
| Testing (general error) | 1866 | 91 | 33,868 | 235 | 0.9535 | 0.8881 |
Retraining Strategies I–III performance on Cargo dataset.
| Strategy | Precision | Sensitivity | Relative Time |
|---|---|---|---|
| Strategy I | 0.9644 | 0.8891 | 1 |
| Strategy II | 0.9377 | 0.8739 | 0.4471 |
| Strategy III | 0.9535 | 0.8881 | 0.6832 |
Retraining Strategies I–III performance on Passenger dataset.
| Strategy | Precision | Sensitivity | Relative Time |
|---|---|---|---|
| Strategy I | 0.9795 | 0.8897 | 1 |
| Strategy II | 0.9802 | 0.8870 | 0.4478 |
| Strategy III | 0.9817 | 0.8888 | 0.6817 |