| Literature DB >> 35590809 |
Tomasz Kogut1, Arkadiusz Tomczak1, Adam Słowik2, Tomasz Oberski3.
Abstract
An important problem associated with the aerial mapping of the seabed is the precise classification of point clouds characterizing the water surface, bottom, and bottom objects. This study aimed to improve the accuracy of classification by addressing the asymmetric amount of data representing these three groups. A total of 53 Synthetic Minority Oversampling Technique (SMOTE) algorithms were adjusted and evaluated to balance the amount of data. The prepared data set was used to train the Multi-Layer Perceptron (MLP) neural network used for classifying the point cloud. Data balancing contributed to significantly increasing the accuracy of classification. The best overall classification accuracy achieved varied from 95.8% to 97.0%, depending on the oversampling algorithm used, and was significantly better than the classification accuracy obtained for unbalanced data and data with downsampling (89.6% and 93.5%, respectively). Some of the algorithms allow for 10% increased detection of points on the objects compared to unbalanced data or data with simple downsampling. The results suggest that the use of selected oversampling algorithms can aid in improving the point cloud classification and making the airborne laser bathymetry technique more appropriate for seabed mapping.Entities:
Keywords: SMOTE; airborne laser bathymetry; classification; imbalanced learning; oversampling
Mesh:
Year: 2022 PMID: 35590809 PMCID: PMC9100212 DOI: 10.3390/s22093121
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Location of the test area (approximately 25 km north of the city of Rostock in Germany).
Figure 2Three classes in the ALB point cloud (blue—water surface, class 1; green—seabed, class 2; red—points on the object on the seabed, class 3).
Description of features used to train the ANN.
| Ui | Description | Formula | |
|---|---|---|---|
| U1 | Amplitude—the maximal peak of the Gaussian curve and is closely associated with the reflectance intensity [ | ||
| U2 | Echo width—( |
| (1) |
| U3 | Return number ( | ||
| U4 | Number of returns ( | ||
| U5 | Normalized echo |
| (2) |
| U6 | Height difference ( |
| (3) |
| U7 | Height variance
|
| (4) |
| U8 | Eigenvalue | ||
| U9 | Eigenvalue | ||
| U10 | Eigenvalue | ||
| U11 | Sphericity—a property that describes the convexity or concavity of the analyzed point relative to points inside the cylinder |
| (5) |
| U12 | Planarity—a characteristic that represents the planar aspect of a point arrangement |
| (6) |
| U13 | Linearity—a characteristic indicating that the distribution of points is linear (continuous). |
| (7) |
| U14 | Eigentropy—defined as entropy computed from eigenvalues |
| (8) |
| U15 | Omnivariance—a property whose low values are associated with flat terrain or linear structures, while high values are associated with point spatial dispersion [ |
| (9) |
Figure 3Visualization of the cylinder and analyzed points (red—analyzed point, green—points inside the cylinder used to compute the features, grey—other points in the point cloud, r—radius).
Description of outputs from the ANN.
| Ui | Description |
|---|---|
| U38 | Class 1: “water surface” |
| U39 | Class 2: “seabed” |
| U40 | Class 3: “seabed object” |
Figure 4The architecture of ANN.
Results of classification with balanced learning.
| Name | Year | Best | Worst | Mean | Median | All Vectors | Class 1 | Class 2 | Class 3 | |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | SMOTE [ | 2002 | 93.4 | 91.5 | 92.7 | 92.9 | 10,188 | 3396 | 3396 | 3396 |
| 2 | SMOTE + Tomek [ | 2004 | 93.0 | 91.9 | 92.6 | 92.6 | 10,135 | 3396 | 3396 | 3343 |
| 3 | SMOTE + ENN [ | 2004 | 93.5 | 90.6 | 92.2 | 92.1 | 9990 | 3396 | 3396 | 3198 |
| 4 | Borderline-SMOTE1 [ | 2005 | 93.3 | 91.5 | 92.2 | 92.1 | 9191 | 2729 | 3396 | 3066 |
| 5 | Borderline-SMOTE2 [ | 2005 |
| 93.4 | 94.7 |
| 9191 | 2729 | 3396 | 3066 |
| 6 | SMOTE + LLE [ | 2006 | 91.1 | 88.2 | 89.7 | 89.7 | 10,188 | 3396 | 3396 | 3396 |
| 7 | Distance-SMOTE [ | 2007 | 93.5 | 91.9 | 92.5 | 92.5 | 10,188 | 3396 | 3396 | 3396 |
| 8 | Polynomial-SMOTE [ | 2008 | 91.0 | 88.7 | 90.3 | 90.4 | 13,234 | 5458 | 3396 | 4380 |
| 9 | ADOMS [ | 2008 | 94.2 | 91.4 | 93.3 | 93.5 | 10,188 | 3396 | 3396 | 3396 |
| 10 | Safe Level SMOTE [ | 2009 | 66.7 | 66.7 | 66.7 | 66.7 | 6573 | 2729 | 3396 | 448 |
| 11 | MSMOTE [ | 2009 | 94.1 | 92.0 | 92.9 | 92.9 | 10,188 | 3396 | 3396 | 3396 |
| 12 | SMOBD [ | 2011 | 95.0 | 92.7 | 93.3 | 93.0 | 10,188 | 3396 | 3396 | 3396 |
| 13 | SVM balance [ | 2012 | 94.2 | 91.9 | 92.7 | 92.5 | 10,172 | 3396 | 3396 | 3380 |
| 14 | TRIM SMOTE [ | 2012 | 92.4 | 91.5 | 92.0 | 92.0 | 10,188 | 3396 | 3396 | 3396 |
| 15 | SMOTE RSB [ | 2012 | 81.7 | 66.7 | 71.4 | 67.6 | 7716 | 3396 | 3396 | 924 |
| 16 | ProWSyn [ | 2013 | 93.6 | 90.6 | 92.4 | 92.5 | 10,188 | 3396 | 3396 | 3396 |
| 17 | SL graph SMOTE [ | 2013 | 92.1 | 91.1 | 91.6 | 91.6 | 9191 | 2729 | 3396 | 3066 |
| 18 | NRSBoundary SMOTE [ | 2013 | 92.6 | 91.4 | 91.8 | 91.8 | 9191 | 2729 | 3396 | 3066 |
| 19 | LVQ SMOTE [ | 2013 |
| 94.7 | 96.3 |
| 10,188 | 3396 | 3396 | 3396 |
| 20 | ROSE [ | 2014 |
| 92.5 | 94.6 |
| 10,188 | 3396 | 3396 | 3396 |
| 21 | SMOTE OUT [ | 2014 | 93.5 | 91.2 | 92.2 | 92.1 | 10,188 | 3396 | 3396 | 3396 |
| 22 | SMOTE Cosine [ | 2014 | 93.2 | 89.6 | 91.2 | 90.9 | 10,188 | 3396 | 3396 | 3396 |
| 23 | Selected SMOTE [ | 2014 | 94.9 | 92.7 | 93.6 | 93.6 | 10,188 | 3396 | 3396 | 3396 |
| 24 | LN SMOTE [ | 2011 | 94.4 | 66.7 | 86.3 | 93.5 | 9282 | 3396 | 3396 | 2490 |
| 25 | MWMOTE [ | 2014 | 91.5 | 90.4 | 91.0 | 91.0 | 10,188 | 3396 | 3396 | 3396 |
| 26 | PDFOS [ | 2014 |
| 92.9 | 94.6 |
| 10,188 | 3396 | 3396 | 3396 |
| 27 | RWO sampling [ | 2014 | 93.0 | 88.6 | 91.0 | 91.5 | 10,188 | 3396 | 3396 | 3396 |
| 28 | NEATER [ | 2014 | 88.0 | 75.8 | 84.8 | 86.5 | 8728 | 3396 | 3396 | 1936 |
| 29 | DEAGO [ | 2015 | 85.8 | 85.8 | 85.8 | 85.8 | 10,188 | 3396 | 3396 | 3396 |
| 30 | MCT [ | 2015 | 95.4 | 93.5 | 94.5 | 94.5 | 10,188 | 3396 | 3396 | 3396 |
| 31 | SMOTE IPF [ | 2015 | 94.1 | 92.5 | 93.2 | 93.4 | 10,188 | 3396 | 3396 | 3396 |
| 32 | OUPS [ | 2016 | 93.1 | 91.4 | 92.0 | 92.0 | 9493 | 3396 | 3396 | 2701 |
| 33 | SMOTE D [ | 2016 | 81.4 | 78.7 | 80.1 | 80.1 | 10,189 | 3398 | 3396 | 3395 |
| 34 | CE SMOTE [ | 2010 | 94.8 | 66.7 | 86.2 | 90.1 | 8647 | 2729 | 3396 | 2522 |
| 35 | Edge Det SMOTE [ | 2010 | 93.8 | 92.6 | 93.2 | 93.5 | 10,188 | 3396 | 3396 | 3396 |
| 36 | ASMOBD [ | 2012 | 88.2 | 86.8 | 87.4 | 87.3 | 10,188 | 3396 | 3396 | 3396 |
| 37 | Assembled SMOTE [ | 2013 | 93.0 | 90.9 | 91.6 | 91.5 | 9191 | 2729 | 3396 | 3066 |
| 38 | SDSMOTE [ | 2014 | 94.4 | 92.0 | 93.4 | 93.5 | 10,188 | 3396 | 3396 | 3396 |
| 39 | G SMOTE [ | 2014 | 94.4 | 92.5 | 93.2 | 93.2 | 10,188 | 3396 | 3396 | 3396 |
| 40 | NT SMOTE [ | 2014 | 93.7 | 92.8 | 93.1 | 93.1 | 10,188 | 3396 | 3396 | 3396 |
| 41 | Lee [ | 2015 | 93.8 | 92.9 | 93.3 | 93.3 | 10,188 | 3396 | 3396 | 3396 |
| 42 | MDO [ | 2016 | 92.1 | 90.3 | 91.3 | 91.4 | 10,188 | 3396 | 3396 | 3396 |
| 43 | Random SMOTE [ | 2011 | 94.4 | 92.5 | 93.3 | 93.2 | 10,188 | 3396 | 3396 | 3396 |
| 44 | VIS RST [ | 2016 | 66.7 | 66.6 | 66.7 | 66.7 | 7119 | 3396 | 3396 | 327 |
| 45 | AND SMOTE [ | 2016 | 92.0 | 90.4 | 91.1 | 91.0 | 10,188 | 3396 | 3396 | 3396 |
| 46 | NRAS [ | 2017 | 90.2 | 88.5 | 89.1 | 89.0 | 10,188 | 3396 | 3396 | 3396 |
| 47 | NDO sampling [ | 2011 | 95.1 | 93.6 | 94.5 | 94.6 | 10,189 | 3397 | 3396 | 3396 |
| 48 | Gaussian SMOTE [ | 2017 | 92.2 | 90.3 | 91.1 | 91.0 | 10,188 | 3396 | 3396 | 3396 |
| 49 | Kmeans SMOTE [ | 2018 | 92.1 | 90.8 | 91.5 | 91.6 | 10,188 | 3396 | 3396 | 3396 |
| 50 | Supervised SMOTE [ | 2014 | 92.8 | 91.5 | 92.1 | 92.1 | 10,188 | 3396 | 3396 | 3396 |
| 51 | SN SMOTE [ | 2012 | 95.2 | 92.3 | 93.7 | 93.7 | 10,188 | 3396 | 3396 | 3396 |
| 52 | CCR [ | 2017 | 88.9 | 87.1 | 88.0 | 88.2 | 9191 | 2729 | 3396 | 3066 |
| 53 | ANS [ | 2017 | 91.3 | 88.7 | 90.0 | 90.1 | 9191 | 2729 | 3396 | 3066 |
Confusion matrix of unbalanced data and data with downsampling.
| Class | Water Surface | Seabed | Seabed Object | |||
|---|---|---|---|---|---|---|
| (Points) | (%) | (Points) | (%) | (Points) | (%) | |
| Unbalance | ||||||
| Water surface | 10,612 | 100 | 0 | 0 | 0 | 0 |
| Seabed | 0 | 0 | 13,057 | 98.0 | 261 | 2.0 |
| Seabed object | 0 | 0 | 62 | 29.2 | 150 |
|
| Downsampling [ | ||||||
| Water surface | 10,612 | 100 | 0 | 0 | 0 | 0 |
| Seabed | 0 | 0 | 13,119 | 98.5 | 199 | 1.5 |
| Seabed object | 0 | 0 | 38 | 17.9 | 174 |
|
Confusion matrix of the four algorithms with best object detection.
| Class | Water Surface | Seabed | Seabed Object | |||
|---|---|---|---|---|---|---|
| (Points) | (%) | (Points) | (%) | (Points) | (%) | |
|
| ||||||
| Water surface | 10,612 | 100 | 0 | 0 | 0 | 0 |
| Seabed | 0 | 0 | 12,986 | 97.5 | 332 | 2.5 |
| Seabed object | 0 | 0 | 14 | 6.6 | 198 |
|
|
| ||||||
| Water surface | 10,612 | 100 | 0 | 0 | 0 | 0 |
| Seabed | 0 | 0 | 13,149 | 98.7 | 169 | 1.3 |
| Seabed object | 0 | 0 | 23 | 10.8 | 189 |
|
|
| ||||||
| Water surface | 10,612 | 100 | 0 | 0 | 0 | 0 |
| Seabed | 1 | 0.0 | 13,143 | 98.7 | 174 | 1.3 |
| Seabed object | 0 | 0 | 24 | 11.3 | 188 |
|
|
| ||||||
| Water surface | 10,612 | 100 | 0 | 0 | 0 | 0 |
| Seabed | 6 | 0.05 | 13,104 | 98.4 | 208 | 1.6 |
| Seabed object | 0 | 0 | 26 | 12.3 | 186 |
|
Confusion matrix for the algorithms with the highest median.
| Class | Water Surface | Seabed | Seabed Object | |||
|---|---|---|---|---|---|---|
| (Points) | (%) | (Points) | (%) | (Points) | (%) | |
|
| ||||||
| Water surface | 10,612 | 100 | 0 | 0 | 0 | 0 |
| Seabed | 0 | 0 | 13,003 | 97.6 | 315 | 2.4 |
| Seabed object | 0 | 0 | 16 | 7.5 | 196 |
|
|
| ||||||
| Water surface | 10,612 | 100 | 0 | 0 | 0 | 0 |
| Seabed | 0 | 0 | 13,160 | 98.8 | 158 | 1.2 |
| Seabed object | 0 | 0 | 29 | 13.7 | 183 |
|
|
| ||||||
| Water surface | 10,612 | 100 | 0 | 0 | 0 | 0 |
| Seabed | 3 | 0.02 | 13,175 | 98.9 | 140 | 1.1 |
| Seabed object | 0 | 0 | 31 | 14.6 | 181 |
|
|
| ||||||
| Water surface | 10,612 | 100 | 0 | 0 | 0 | 0 |
| Seabed | 1 | 0.01 | 13,212 | 99.2 | 105 | 0.8 |
| Seabed object | 0 | 0 | 32 | 15.1 | 180 |
|
Confusion matrix of median results for algorithm MDO.
| Class | Water Surface | Seabed | Seabed Object | |||
|---|---|---|---|---|---|---|
| (Points) | (%) | (Points) | (%) | (Points) | (%) | |
| MDO | ||||||
| Water surface | 10,612 | 100 | 0 | 0 | 0 | 0 |
| Seabed | 0 | 0 | 13,266 | 99.6 | 52 | 0.4 |
| Seabed object | 0 | 0 | 54 | 25.5 | 158 | 74.5 |