| Literature DB >> 28963498 |
K M Cooper1, J Barry2.
Abstract
In this study we produce a standardised dataset for benthic macrofauna and sediments through integration of data (33,198 samples) from 777 grab surveys. The resulting dataset is used to identify spatial and temporal patterns in faunal distribution around the UK, and the role of sediment composition and other explanatory variables in determining such patterns. We show how insight into natural variability afforded by the dataset can be used to improve the sustainability of activities which affect sediment composition, by identifying conditions which should remain favourable for faunal recolonisation. Other big data applications and uses of the dataset are discussed.Entities:
Mesh:
Year: 2017 PMID: 28963498 PMCID: PMC5622093 DOI: 10.1038/s41598-017-11377-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(a) Study area showing locations of aggregate dredging interest (blue - Humber dredging region, green - Anglian dredging region, yellow - Thames dredging region, orange - Goodwin Sands dredging licence, purple - eastern English Channel dredging region, red - South Coast dredging region, dark pink - Bristol Channel/Severn Estuary dredging region, brown - North-West dredging region). (b) Sample locations and extent of submaps used in Fig. 7. Underlying bathymetry is from the 2015 updated version of Defra’s Digital Elevation Model (DEM)[54]. Maps created in RStudio (version 1.0.143, https://www.rstudio.com/).
Figure 7Faunal cluster group and diversity (taxon Richness) for samples by sub region (see Fig. 1b for submap extents). Areas of aggregate dredging interest (licensed and application areas) shown as solid black lines, whilst areas of potential secondary effect are shown as dashed black lines. Maps created in RStudio (version 1.0.143, https://www.rstudio.com/).
Figure 2Heat maps based on a ranked ordering of samples for taxon richness and abundance per 0.1 m2 (see Fig. 7 for more detail of the aggregate producing regions). Maps created in RStudio (version 1.0.143, https://www.rstudio.com/).
Figure 3Elbow plots and dendrograms associated with a k-means clustering of (a) macrofaunal and (b) physical variables. The labels and colours used in the faunal dendrogram were chosen to reflect the relationship (similarity/dissimilarity) between the different cluster groups.
Figure 4Spatial distribution of macrofaunal assemblages with all samples (a) and by individual cluster group (b). Assemblage groups are based on a k-means clustering of fourth-root transformed macrofaunal abundance data (colonials included). Maps created in RStudio (version 1.0.143, https://www.rstudio.com/).
Biological characteristics of the macrofaunal assemblages identified through a k-means clustering of macrofaunal data (colonials included, forth-root transformation).
| Cluster | Characteristic taxa | Richness (Family) | Abundance |
| |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Max. | Min. | Mean | Max. | Min. | Mean | ||||||
|
| A1 |
| 120 | 35 | 70 (±14) | ↑ | 6738 | 186 | 1122 (±789) | ↑ | 366 |
|
| A2a |
| 106 | 21 | 52 (±14) | ↑ | 11821 | 104 | 972 (±1151) | ↑ | 707 |
|
| A2b |
| 104 | 29 | 58 (±12) | ↑ | 3458 | 73 | 366 (±265) | ↑ | 1191 |
|
| B1a |
| 117 | 35 | 64 (±12) | ↑ | 879 | 58 | 227 (±109) | → | 1099 |
|
| B1b |
| 86 | 24 | 44 (±9) | → | 2311 | 56 | 206 (±121) | → | 1687 |
|
| C1a |
| 66 | 10 | 31 (±9) | → | 6056 | 19 | 153 (±252) | → | 2462 |
|
| C1b |
| 195 | 18 | 44 (±11) | → | 4611 | 57 | 243 (±187) | → | 1850 |
|
| D1 |
| 107 | 12 | 40 (±11) | → | 7607 | 113 | 637 (±665) | ↑ | 962 |
|
| D2a |
| 56 | 6 | 24 (±8) | ↓ | 4471 | 13 | 91 (±126) | ↓ | 3288 |
|
| D2b |
| 56 | 9 | 27 (±8) | ↓ | 4023 | 24 | 139 (±126) | ↓ | 2631 |
|
| D2c |
| 32 | 1 | 8 (±5) | ↓ | 2098 | 1 | 26 (±61) | ↓ | 8286 |
|
| D2d |
| 45 | 4 | 19 (±6) | ↓ | 5563 | 13 | 100 (±183) | ↓ | 2903 |
Characterising species were identified through a SIMPER analysis and include taxa up to a total of 50% contribution. Letters in parenthesis identify the higher level taxonomic group: Amphipod crustacean (A), Ascidian tunicate (AT), Broyzoa (B), Bivalve Mollusc (BM), Crustacean (C), Decapod Crustacean (DC), Echinoderm (E), Polychaete (P), Phoronida (Ph), Nematoda (Ne). Values for Richness and Abundance are means and standard deviations. Arrows indicate the relative size of a value (↑ - High, ↓- Low, →- Medium).
Figure 5Faunal cluster distribution by year (a) and season (b). Maps created in RStudio (version 1.0.143, https://www.rstudio.com/).
Results of a best analysis identifying the subset of environmental variables which are most correlated with the macrofaunal data.
| Variable(s) | Size | Correlation (ρ) |
|---|---|---|
| Sand | 1 | 0.2998 |
| AvCur, Sand | 2 | 0.3842 |
|
|
|
|
| Lat, AvCur, Mud, Sand | 4 | 0.4161 |
| Lat, Depth, AvCur, Mud, Sand | 5 | 0.4089 |
| Lat, SPM, Depth, AvCur, Mud, Sand | 6 | 0.3973 |
| Lat, SPM, Depth, WOV, AvCur, Mud, Sand | 7 | 0.3847 |
| Lat, Sal, SPM, Depth, WOV, AvCur, Mud, Sand | 8 | 0.3704 |
Figure 6Distance-based redundancy analysis (dbRDA) ordination showing sampling sites (coloured by faunal assemblage group) and vectors for the main environmental predictor variables. Note that the variables Sand and AvCur were highly correlated with Gravel and Stress respectively.
Figure 8(a) Physical cluster group identity for individual sample stations. Analysis based on a k-means clustering of non-sediment environmental variables for: Sal, Temp, Chl a, SPM, Depth, WOV, AvCur and Stress (see Table 5 for details). (b) Box and whisker plots for physical variables by physical cluster group. Map created in RStudio (version 1.0.143, https://www.rstudio.com/).
Explanatory variables used in the study.
| Variable | Detail | Data Source | |
|---|---|---|---|
| 1 | Sal | Mean annual near-bed salinity (ppt) | A |
| 2 | Temp | Mean annual near-bed temperature (°C) | A |
| 3 | Chl a | Mean annual chlorophyll-a concentration (2002–2010) | B |
| 4 | SPM | Mean annual concentration of mineral origin suspended matter (g·m−3) (2002–2010) | B |
| 5 | Depth | Bathymetry (m) | C |
| 6 | WOV | Peak wave orbital velocity (m·s−1) | D |
| 7 | AvCur | Average current velocity (m·s−1) | D |
| 8 | Stress | Peak wave/current stress (N·m−2) | D |
| 9 | Gravel | Proportion of gravel (%) from sediment particle size data | E |
| 10 | Sand | Proportion of sand (%) from sediment particle size data | E |
| 11 | Mud | Proportion of silt/clay (%) from sediment particle size data | E |
Data sources: A (ICES climatology of surface and near-bed temperature and salinity 1971–2000)[56]; B (Data from MODIS satellite sensor)[57]; C (Defra DEM 500 m pixel resolution)[54]; D (POLCOM model)[58,59]; E (data from sediment particle size analysis).
Figure 9(a) Mean cumulative sediment distribution plots, with accompanying histogram, for each faunal cluster group. (b) Mean cumulative sediment distribution plots by physical cluster group, faceted by faunal cluster group.
Mean percentage composition of sediments by Wentworth size class for each faunal cluster group.
| Bio Cluster |
| % Mud | % Sand | % Gravel | Description | MVDISP | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sum | fS | mS | cS | Sum | fG | mG | cG | Sum | |||||
|
| A1 | 290 | 4 | 16 | 16 | 13 | 46 | 17 | 12 | 21 | 50 | v slightly muddy sandy gravel | 0.74 |
|
| A2a | 458 | 8 | 29 | 21 | 13 | 63 | 14 | 7 | 7 | 28 | slightly muddy gravelly sand | 0.73 |
|
| A2b | 731 | 7 | 11 | 14 | 15 | 40 | 15 | 12 | 25 | 52 | slightly muddy sandy gravel | 0.81 |
|
| B1a | 1010 | 1 | 5 | 24 | 27 | 56 | 23 | 12 | 8 | 43 | v slightly muddy gravelly sand | 0.44 |
|
| B1b | 774 | 2 | 5 | 24 | 25 | 54 | 23 | 13 | 9 | 44 | v slightly muddy gravelly sand | 0.44 |
|
| C1a | 1327 | 5 | 12 | 18 | 17 | 46 | 16 | 13 | 20 | 49 | slightly muddy sandy gravel | 0.90 |
|
| C1b | 1018 | 10 | 17 | 22 | 16 | 55 | 16 | 10 | 10 | 36 | slightly muddy gravelly sand | 0.80 |
|
| D1 | 145 | 15 | 44 | 22 | 9 | 75 | 6 | 2 | 2 | 10 | slightly gravelly slightly muddy sand | 0.99 |
|
| D2a | 1522 | 3 | 10 | 27 | 31 | 68 | 16 | 7 | 6 | 29 | v slightly muddy gravelly sand | 0.94 |
|
| D2b | 652 | 17 | 48 | 20 | 9 | 77 | 3 | 1 | 2 | 6 | slightly gravelly slightly muddy sand | 1.05 |
|
| D2c | 3485 | 4 | 22 | 44 | 18 | 84 | 6 | 3 | 3 | 12 | v slightly muddy slightly gravelly sand | 1.15 |
|
| D2d | 1014 | 3 | 60 | 27 | 6 | 93 | 2 | 1 | 1 | 4 | v slightly gravelly v slightly muddy sand | 0.82 |
Sediment descriptions are in accordance with Blott and Pye (2011)[55]. Number of samples is shown by column n. Variability in sediment composition is indicated by the Multivariate Index of Dispersion (MVDISP).
Figure 10Scatter plots for taxon richness by percentage gravel for all samples (a) and by faunal cluster group (b). Black lines show the estimated slope for the relationship between richness and percentage gravel. Line intercepts relate to the median survey intercept for each plot. Plot annotations are provided for number of samples (n) and slope (ß). Slope values include statistical significance (***p < 0.001).
Summary of a Mahalanobis distance test for the sediment composition of sample ‘EEC2010_Site 102’.
| Test sample | S/C | fS | mS | cS | fG | mG | cG | |
|---|---|---|---|---|---|---|---|---|
| Values (v) | 1.9 | 17.8 | 36.3 | 23.7 | 13.4 | 3.9 | 3.1 | |
| Distribution data (B1a_4) | S/C | fS | mS | cS | fG | mG | cG | |
| Means | 1.0 | 4.3 | 24.2 | 26.8 | 22.8 | 12.4 | 8.6 | |
| Covariance Matrix |
| 2.1 | ||||||
|
| 0.9 | 6.4 | ||||||
|
| −1.4 | 9.2 | 58.5 | |||||
|
| −2.7 | −10.2 | 1.8 | 106.6 | ||||
|
| −0.6 | −4.1 | −15.7 | 5.9 | 34.7 | |||
|
| 1.2 | 0.3 | −15.7 | −43.4 | 6.6 | 41.2 | ||
|
| 0.6 | −2.6 | −36.8 | −58.0 | −26.8 | 9.9 | 113.7 | |
| Difference | S/C | fS | mS | cS | fG | mG | cG | |
| Difference | +0.9 | +13.5 | +12.1 | −3.1 | −9.4 | −8.5 | −5.5 | |
| % Difference | +90 | +314 | +50 | −12 | −41 | −69 | −64 | |
Table shows the sediment composition of the test sample, and the sediment means and covariance matrix for the wider distribution data (group B1a_4, n = 948). The p-value from the test was <0.05, indicating the test sample was unlikely to be associated with wider cluster group distribution. Sediment fractions responsible for the failure are identified by subtracting test values for each fraction from the wider cluster group means.
Figure 11Dataset summary maps (a) and associated histograms (b) for Sector (industry or government data), Source (data owner), Data (newly acquired or existing data), Year (year of sample collection), Month (month of sample collection), Gear (device used to obtain sample), Sieve (mm) (size of sieve used for macrofaunal sample processing), Programme (reason for sample collection) and Treatment (sample relates to known ‘impact’ or ‘reference’ condition). See Supplementary Table S1 for list of abbreviations. Maps created in RStudio (version 1.0.143, https://www.rstudio.com/).
Standard sieve classes and sediment descriptions based on the Wentworth scale[52].
| Sieve (mm) | Wentworth Size Class | |
|---|---|---|
| 64 | Cobbles |
|
| 32 | Coarse gravel (cG) | |
| 16 | ||
| 8 | Medium gravel (mG) | |
| 4 | Fine gravel (fG) | |
| 2 | ||
| 1 | Coarse sand (cS) |
|
| 0.5 | ||
| 0.25 | Medium sand (mS) | |
| 0.125 | Fine sand (fS) | |
| 0.063 | ||
| Pan | Silt/Clay (S/C) |
|