| Literature DB >> 24699553 |
David Stephens1, Markus Diesing1.
Abstract
Detailed seabed substrate maps are increasingly in demand for effective planning and management of marine ecosystems and resources. It has become common to use remotely sensed multibeam echosounder data in the form of bathymetry and acoustic backscatter in conjunction with ground-truth sampling data to inform the mapping of seabed substrates. Whilst, until recently, such data sets have typically been classified by expert interpretation, it is now obvious that more objective, faster and repeatable methods of seabed classification are required. This study compares the performances of a range of supervised classification techniques for predicting substrate type from multibeam echosounder data. The study area is located in the North Sea, off the north-east coast of England. A total of 258 ground-truth samples were classified into four substrate classes. Multibeam bathymetry and backscatter data, and a range of secondary features derived from these datasets were used in this study. Six supervised classification techniques were tested: Classification Trees, Support Vector Machines, k-Nearest Neighbour, Neural Networks, Random Forest and Naive Bayes. Each classifier was trained multiple times using different input features, including i) the two primary features of bathymetry and backscatter, ii) a subset of the features chosen by a feature selection process and iii) all of the input features. The predictive performances of the models were validated using a separate test set of ground-truth samples. The statistical significance of model performances relative to a simple baseline model (Nearest Neighbour predictions on bathymetry and backscatter) were tested to assess the benefits of using more sophisticated approaches. The best performing models were tree based methods and Naive Bayes which achieved accuracies of around 0.8 and kappa coefficients of up to 0.5 on the test set. The models that used all input features didn't generally perform well, highlighting the need for some means of feature selection.Entities:
Mesh:
Year: 2014 PMID: 24699553 PMCID: PMC3974812 DOI: 10.1371/journal.pone.0093950
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Study area, acoustic data and ground-truth samples.
A: Bathymetry data with ground-truth samples overlaid, colours indicating the test and validation sets. B: Backscatter data with ground-truth samples overlaid, colours indicating the substrate class.
Ground-truth data.
| Class | Training set | Test set | Total |
| mS | 16 | 2 | 18 |
| S | 144 | 50 | 194 |
| gS | 29 | 10 | 39 |
| sG | 5 | 2 | 7 |
| Total | 194 | 64 | 258 |
Secondary acoustic features generated from bathymetry and backscatter.
| Derivative | Description | Layer names |
| Bathymetric position index (BPI) | The vertical difference between a cell and the mean of the local neighbourhood | BPI_600m; BPI_200m |
| Roughness | The difference between the minimum and maximum of cell and its 8 neighbours | backscatter_roughness, bathymetry_roughness |
| Curvature | Curvature describes the rate of change of the slope. | curvature, curvature_planar, curvature_profile |
| Northness | Equals the cosine of the aspect, which is the direction of the steepest slope measured in clockwise degrees from north. | northness |
| Eastness | Equals the sine of the aspect. | eastness |
| Moran's I | Spatial auto-correlation in a 5×5 neighbourhood | bathymetry_moran, backscatter_moran |
| Sobel filter | A directional filter that emphasises areas of large spatial frequency (edges) running horizontally (X) or vertically (Y) across the image. | bathy_sobelY, bathy_sobelX |
Correlation matrix of input features.
| f1 | f2 | f3 | f4 | f5 | f6 | f7 | f8 | f9 | f10 | f11 | f12 | f13 | f14 | f15 | |
| f1 | 1 | 0.24 | 0.33 | −0.67 | −0.26 | 0.01 | 0.14 | 0.03 | 0.15 | −0.14 | −0.12 | 0.03 | 0.52 | 0.28 | 0.29 |
| f2 | 0.24 | 1 | 0.8 | −0.21 | −0.05 | 0.02 | 0.13 | 0.05 | 0.01 | −0.43 | 0.4 | 0.52 | 0.2 | 0.15 | 0.12 |
| f3 | 0.33 | 0.8 | 1 | −0.3 | −0.11 | 0.04 | 0.05 | 0.06 | −0.01 | −0.29 | 0.17 | 0.3 | 0.3 | 0.24 | 0.17 |
| f4 | −0.67 | −0.21 | −0.3 | 1 | 0.14 | 0.02 | −0.05 | 0.02 | −0.07 | 0.15 | 0.05 | −0.07 | −0.23 | −0.01 | −0.24 |
| f5 | −0.26 | −0.05 | −0.11 | 0.14 | 1 | −0.31 | −0.36 | −0.16 | −0.13 | 0.03 | −0.06 | −0.06 | 0.1 | 0.1 | 0.13 |
| f6 | 0.01 | 0.02 | 0.04 | 0.02 | −0.31 | 1 | −0.03 | 0.7 | 0.03 | 0.02 | −0.02 | −0.02 | −0.01 | −0.08 | −0.09 |
| f7 | 0.14 | 0.13 | 0.05 | −0.05 | −0.36 | −0.03 | 1 | 0.05 | 0.69 | −0.11 | 0.23 | 0.21 | 0.02 | −0.02 | 0.06 |
| f8 | 0.03 | 0.05 | 0.06 | 0.02 | −0.16 | 0.7 | 0.05 | 1 | 0.09 | −0.02 | 0.04 | 0.04 | 0.14 | 0.05 | −0.07 |
| f9 | 0.15 | 0.01 | −0.01 | −0.07 | −0.13 | 0.03 | 0.69 | 0.09 | 1 | 0.02 | 0.06 | 0.02 | 0.07 | 0.07 | 0.02 |
| f10 | −0.14 | −0.43 | −0.29 | 0.15 | 0.03 | 0.02 | −0.11 | −0.02 | 0.02 | 1 | −0.27 | −0.84 | −0.03 | −0.02 | −0.03 |
| f11 | −0.12 | 0.4 | 0.17 | 0.05 | −0.06 | −0.02 | 0.23 | 0.04 | 0.06 | −0.27 | 1 | 0.75 | 0.01 | −0.06 | −0.02 |
| f12 | 0.03 | 0.52 | 0.3 | −0.07 | −0.06 | −0.02 | 0.21 | 0.04 | 0.02 | −0.84 | 0.75 | 1 | 0.03 | −0.02 | 0.01 |
| f13 | 0.52 | 0.2 | 0.3 | −0.23 | 0.1 | −0.01 | 0.02 | 0.14 | 0.07 | −0.03 | 0.01 | 0.03 | 1 | 0.62 | 0.26 |
| f14 | 0.28 | 0.15 | 0.24 | −0.01 | 0.1 | −0.08 | −0.02 | 0.05 | 0.07 | −0.02 | −0.06 | −0.02 | 0.62 | 1 | −0.07 |
| f15 | 0.29 | 0.12 | 0.17 | −0.24 | 0.13 | −0.09 | 0.06 | −0.07 | 0.02 | −0.03 | −0.02 | 0.01 | 0.26 | −0.07 | 1 |
f1 = bathymetry, f2 = BPI200, f3 = BPI600, f4 = bathymetry_moran, f5 = bathymetry_roughness, f6 = bathy_sobely, f7 = bathy_sobelx, f8 = Northness, f9 = Eastness, f10 = Curvature_profile, f11 = Curvature_planar, f12 = Curvature, f13 = backscatter, f14 = backscatter_moran, f15 = backscatter_roughness.
Model Parameters.
| Model | Parameters | Parameter Description | Tuning Range |
| k-NN | k | The number of neighbours considered in the classification | 1∶20 |
| SVM |
| The cost parameter determining how much data is included in creating the decision boundary, a small value will consider more observations | 2-5∶15 |
| γ | The kernel smoothing parameter which defines the shape and complexity of the resulting decision boundary | 2−15∶5 | |
| RF | nodesize | The minimum number of cases allowed in each of the terminal nodes of each tree | 1∶10 |
| mtry | The number of features tested at each split | 2∶15 | |
| CT | cp | The | 2−10:−1 |
| minsplit | Nodes of this size or smaller are no longer split | 1∶10 | |
| ANN | size | The number of units in the hidden layer | 21∶6 |
| decay | The | 10−5:−1 |
Figure 2Exploration of training data.
A: Comparing the distributions of bathymetry values between training samples (dashed) and raster grid (solid). B: Comparing the distributions of backscatter values between training samples (dashed) and raster grid (solid). C: Comparing the distribution of bathymetry values between substrate classes. D: Comparing the distribution of backscatter values between substrate classes.
Output from Boruta feature selection algorithm.
| Feature | Z-score |
| backscatter |
|
| bathymetry |
|
| backscatter_moran |
|
| curvature |
|
| curvature_planar |
|
| bathymetry_moran |
|
| curvature_profile |
|
| backscatter_roughness |
|
| northness |
|
| BPI_200m | 4.97 |
| BPI_600m | 4.93 |
| bathy_sobelY | 3.25 |
| bathy_sobelX | 1.45 |
| eastness | 1.11 |
| bathymetry_roughness | 0.79 |
Scores that were significantly higher (p<0.001) than scores of random features are indicated in bold.
Model Performance Comparison.
| Model | BER | Accuracy | Kappa |
| NB2 |
| 0.80 | 0.50 |
| RF2 |
| 0.81 | 0.45 |
| RF1 |
| 0.80 | 0.45 |
| CT1 |
| 0.80 |
|
| RF3 |
| 0.78 | 0.36 |
| NB3 | 0.43 | 0.78 | 0.38 |
| CT3 | 0.48 | 0.69 | 0.21 |
| CT2 | 0.48 | 0.69 | 0.27 |
| NN1 |
| 0.80 |
|
| SVM1 | 0.53 | 0.78 | 0.39 |
|
| 0.54 | 0.77 | 0.33 |
| k-NN2 | 0.61 | 0.72 | 0.19 |
| NB1 | 0.64 | 0.75 | 0.34 |
| SVM2 | 0.67 | 0.78 | 0.27 |
| NN3 | 0.69 | 0.78 | 0.21 |
| k-NN3 | 0.69 | 0.78 | 0.22 |
| SVM3 | 0.70 | 0.77 | 0.20 |
| NN2 | 0.77 | 0.73 | −0.07 |
BER, Kappa coefficients and Accuracy statistics calculated on the test set. Values indicated as being significantly better than the baseline (p≤0.05) are indicated in bold. Model numbers indicate the input features used; 1 indicates primary features; 2 indicates subset of features chosen by Boruta; 3 indicates all input features were used. K-NN1 is not included as the LOOcv indicated that 1 nearest neighbour gave the best performance making it the same as the baseline 1-NN model. (NB = Naive Bayes; RF = Random Forest; CT = Classification Tree; NN = Neural Network; SVM = Support Vector Machine; k-NN = k-nearest neighbour).
Figure 3Comparing model performance on the test data.
The dashed lines represent the performance of the baseline model. The best performing models are to the top-right.
Error matrices for the three best performing models.
| CT1 | mS | S | gS | sG | Error | |
| mS | 2 | 0 | 0 | 0 | 0.00 | |
| S | 1 | 44 | 3 | 2 | 0.12 | |
| gS | 1 | 3 | 5 | 1 | 0.50 | |
| sG | 0 | 1 | 1 | 0 | 1.00 |
Rows show the predicted class frequencies and columns show the observed frequencies.
Figure 4Output predictions from top three models and agreement between them.
A: NB2, B: RF2, C∶CT1, D: Agreement.