| Literature DB >> 29710607 |
S Pulley1, A L Collins2.
Abstract
The mitigation of diffuse sediment pollution requires reliable provenance information so that measures can be targeted. Sediment source fingerprinting represents one approach for supporting these needs, but recent methodological developments have resulted in an increasing complexity of data processing methods rendering the approach less accessible to non-specialists. A comprehensive new software programme (SIFT; SedIment Fingerprinting Tool) has therefore been developed which guides the user through critical data analysis decisions and automates all calculations. Multiple source group configurations and composite fingerprints are identified and tested using multiple methods of uncertainty analysis. This aims to explore the sediment provenance information provided by the tracers more comprehensively than a single model, and allows for model configurations with high uncertainties to be rejected. This paper provides an overview of its application to an agricultural catchment in the UK to determine if the approach used can provide a reduction in uncertainty and increase in precision. Five source group classifications were used; three formed using a k-means cluster analysis containing 2, 3 and 4 clusters, and two a-priori groups based upon catchment geology. Three different composite fingerprints were used for each classification and bi-plots, range tests, tracer variability ratios and virtual mixtures tested the reliability of each model configuration. Some model configurations performed poorly when apportioning the composition of virtual mixtures, and different model configurations could produce different sediment provenance results despite using composite fingerprints able to discriminate robustly between the source groups. Despite this uncertainty, dominant sediment sources were identified, and those in close proximity to each sediment sampling location were found to be of greatest importance. This new software, by integrating recent methodological developments in tracer data processing, guides users through key steps. Critically, by applying multiple model configurations and uncertainty assessment, it delivers more robust solutions for informing catchment management of the sediment problem than many previously used approaches.Entities:
Keywords: Catchment management; Sediment; Sediment fingerprinting; Sediment source tracing; Uncertainty
Mesh:
Substances:
Year: 2018 PMID: 29710607 PMCID: PMC6024566 DOI: 10.1016/j.scitotenv.2018.04.126
Source DB: PubMed Journal: Sci Total Environ ISSN: 0048-9697 Impact factor: 7.963
Fig. 1The study catchment and sampling points.
Fig. 2Flow diagram of the stages of the SIFT methodology.
The five source group classifications used.
| Classification | Structure |
|---|---|
| Two-cluster | Each source sample is assigned into one of two clusters according to the results of a k-means cluster analysis. |
| Three-cluster | Each source sample is assigned into one of three clusters according to the results of a k-means cluster analysis. |
| Four-cluster | Each source sample is assigned into one of four clusters according to the results of a k-means cluster analysis. |
| Geology classification 1 | The original four geology source groups are combined to produce as many new source groups as possible with the requirement of very strong discrimination between them. |
| Geology classification 2 | The original four geology source groups are combined to produce as many new source groups as possible with the requirement of moderate discrimination between them. |
Fig. 3Bi-plot of the two largest discriminant functions generated by the initial LDA of the five geology-based source groups, with confusion matrix.
Fig. 4The mapped cluster analysis based sediment source classifications.
Fig. 5Bi-plots (A) and maps (B) of potentially misclassified source samples as identified by the LDA. A labelled sample was potentially misclassified. The label colour identifies the source group the sample is a better fit in.
Median, median absolute deviation, mean and maximum variability ratios for all pair combinations of source groups; ratios in bold pass the threshold values for further inclusion in the apportionment modelling.
| Two-cluster | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Median | χlf | χfd | χARM | SIRM | BackIRM | HIRM | R | G | B | HRGB | IRGB | SRGB | SI | HI | CI | RI |
| Cluster 1 (35 samples) | 0.72 | 43.57 | 6.32 | 7.33 | 6.01 | 0.68 | 182.1 | 147.1 | 124.4 | 2.98 | 151.73 | 28.75 | 0.19 | 4.03 | 0.11 | 0.84 |
| Cluster 2 (64 samples) | 0.34 | 16.85 | 2.73 | 4.08 | 3.09 | 0.49 | 187.3 | 158.2 | 135.05 | 1.26 | 160.22 | 24.85 | 0.16 | 3.46 | 0.08 | 0.64 |
| Median absolute deviation | ||||||||||||||||
| Cluster 1 | 0.53 | 46.23 | 6.35 | 6.03 | 5.32 | 0.38 | 4 | 3.11 | 3.11 | 0.37 | 2.77 | 2.59 | 0.02 | 0.2 | 0.01 | 0.05 |
| Cluster 2 | 0.19 | 13.64 | 2.28 | 2.32 | 2.12 | 0.18 | 5.93 | 4.89 | 4.74 | 0.78 | 4.87 | 2.52 | 0.01 | 0.27 | 0.01 | 0.07 |
| Mean variability ratio | 0.8 | |||||||||||||||
| Max variability ratio | 1.7 | 1.7 | 1.43 | 1.15 | 1.2 | 0.8 | 1.06 | 1.64 | ||||||||
Bold values signify values exceeding the threshold of 1 for the mean variability ratio and 2 for the maximum variability ratio.
Fig. 6Bi-plots of source (black) and sediment (red) samples, plots bordered in red failed to achieve the required correlation coefficient for progressing in the analysis.
The percentage of sediment samples falling within the maximum + one MAD to minimum − one MAD range of values for each tracer in the source classifications.
| χlf | χfd | χARM | SIRM | BackIRM | HIRM | R | G | B | HRGB | IRGB | SRGB | SI | HI | CI | RI | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Two-cluster | 83 | 100 | 100 | 83 | 83 | 83 | 83 | 67 | 67 | 83 | 67 | 100 | 67 | 83 | 67 | 67 |
| Three-cluster | 100 | 100 | 100 | 83 | 83 | 83 | 83 | 67 | 67 | 83 | 67 | 100 | 67 | 83 | 67 | 67 |
| Four-cluster | 100 | 83 | 100 | 83 | 83 | 83 | 83 | 83 | 83 | 83 | 83 | 100 | 100 | 83 | 83 | 83 |
| Geology classification 1 | 100 | 100 | 100 | 83 | 83 | 83 | 67 | 67 | 67 | 67 | 67 | 100 | 67 | 67 | 83 | 67 |
| Geology classification 2 | 100 | 100 | 100 | 83 | 83 | 83 | 83 | 67 | 67 | 83 | 67 | 83 | 67 | 83 | 67 | 83 |
| Percent within minimum - maximum | 100 | 100 | 100 | 100 | 83 | 100 | 83 | 83 | 100 | 83 | 83 | 100 | 100 | 100 | 100 | 100 |
Fig. 7Mean percentage differences between each source sample and the mean of all sediment samples for all tracers.
Fig. 8Mean percentage differences between each source sample and the mean of all sediment samples for individual tracers.
Fig. S1Percentile distributions of tracer concentrations in the source groups.
The percentage of source samples correctly classified into their respective groups by optimum composite fingerprints selected by the stepwise LDA.
| Signature | Two-cluster | Three-cluster | Four-cluster | Geology classification 1 | Geology classification 2 |
|---|---|---|---|---|---|
| Basic | 90.2 | 89.6 | 89.6 | 97.6 | 83.8 |
| Conservative | 95.9 | 91 | 89.3 | 96.6 | 82 |
| High variability | 90.1 | 90.5 | 87.3 | 97.1 | 74.6 |
The optimum composite fingerprints identified by the LDA.
| Basic | |
|---|---|
| Two-cluster | B, HI, RI |
| Three-cluster | HI, SIRM, BackIRM, G, HRGB |
| Four-cluster | RI, SIRM, BackIRM, G, HRGB, R |
| Geology classification 1 | SIRM, BackIRM, HRGB, SI, CI |
| Geology classification 2 | B, HI, RI, SIRM, BackIRM, G, HRGB, SI, CI, χlf, χfd, χARM, SRGB |
Fig. 9Bi-plots of the two largest discriminant functions for the source groups and sediment samples with the final composite fingerprints for each source classification.
Fig. S2Bi-plots of the two largest discriminant functions for the source groups and sediment samples with the final composite fingerprints for each source classification.
Fig. S3Probability density functions of the virtual mixtures.
The results of the manually selected 3× weightings on virtual mixture source apportionment.
| Tracers weighted | Improved basic fingerprint apportionment | Improved conservative fingerprint apportionment | Improved high variability fingerprint apportionment | |
|---|---|---|---|---|
| Two-cluster | RI | Yes | Yes | Yes |
| Three-cluster | HRGB, CI | Yes | Yes | No |
| Four-cluster | χARM, BackIRM | No | No | No |
| Geology classification 1 | BackIRM | Yes | Yes | No |
| Geology classification 2 | BackIRM, G | No | – | – |
Fig. S5Percentage of Monte Carlo iterations above the 0.35 threshold and mean GOF of iterations passing the threshold.
Fig. 10Estimated bed sediment provenance using the two-cluster source classification; median with 25th and 75th percentile uncertainties.
Fig. 11Estimated bed sediment provenance using the three-cluster source groups; median with 25th and 75th percentile uncertainties.
Fig. 12Estimated bed sediment provenance using the four-cluster source groups; median with 25th and 75th uncertainties.
Fig. 13Estimated bed sediment provenance using the Classification 1 source groups; median with 25th and 75th uncertainties.
Fig. 14Mapped mean contribution of each source sample to the bed sediment samples predicted by the un-mixing models.
A summary of key results.
| Two-cluster | Three-cluster | Four-cluster | Geology classification 1 | Geology classification 2 | |
|---|---|---|---|---|---|
| Sediment sample screening | For sample Bed 5, R, G, HRGB and IRGB fell outside of the range of values found in the source samples. For sample Bed 3 Back IRM fell outside of this range. | ||||
| Source group classification | Cluster 1: Predominantly contained lower catchment topsoils, Cluster 2: Lower catchment channel banks and upper catchment topsoils. | Cluster 1: Predominantly ironstone samples, Clusters 2 and 3: divide the middle and upper catchment into two sources which appear unrelated to geology but appear spatially grouped. | Comparable to the three-cluster solution, however, it identified an additional cluster of only eight samples with its samples primarily located in the centre of the catchment. | Group 1: Ironstone and Group 2: Sandstone, Limestone, Clays and Channel Banks. | Group 1: Ironstone, Group 2: Sandstone, and Group 3: Limestone, Clays and Channel Banks. |
| Misclassified samples | Sample S1 (sandstone) was identified as potentially misclassified and was a better fit to the ironstone group so was deleted as it did not fall close to the area of the catchment over ironstone, the Ironstone samples I18 and I19 were also identified as potentially misclassified and fit better as sandstone, clay or limestone samples and reclassified as they were close to the boundary of two geologies | ||||
| Mean variability ratios | 2.1 | 3.8 | 4.6 | 3.1 | 2.5 |
| Maximum variability ratio | HRGB, 3.7 | χlfd, 11.86 | χlfd, 15.71 | χlfd, 6.63 | χlfd, 7.81 |
| Tracers failing to achieve the variability ratio threshold values | χlf, χlfd, χlARM, SIRM, BackIRM, HIRM, R | R | None | R, IRGB | R |
| Bi-plot conservatism testing | For sample Bed 5 most colour tracers fall outside of the relationships found in the source samples. For sample Bed 3 SIRM and BackIRM fell outside of the relationships in the source samples. | ||||
| Range test | All tracers passed the range test for source classifications by tracer values in 40% of sediment samples falling within the median +/− one MAD range of the source groups and in 80% of sediment samples falling within the minimum to maximum range of the sources. | ||||
| Mapped differences between source and sediment tracer concentrations | Ironstone source samples in the lower catchment are very dissimilar to the mean tracer values of the sampled sediments, BackIRM has more variability in the middle and upper catchment whilst XARM shows little variability, Blue is able to differentiate between samples throughout the entire catchment, but with a different trend to χARM | ||||
| Distributions of tracers in source groups | With the mineral magnetic tracers there was a large difference between the percentile distribution of values in the source groups/clusters representing ironstone and the other source groups. In contrast non-ironstone sources were poorly separated. Colour tracers separated the non‑ironstone sources more effectively; however, all tracers placed the source groups into the same highest to lowest value order, suggesting that problems of equifinality may be present in model outputs when a large number of source groups are used. | ||||
| Source discrimination (percent correctly classified) (basic, conservative, high variability fingerprints) | 90.2%, 95.9%, 90.1% (only contains colour tracers) | 89.6%, 91%, 90.5% | 89.6%, 89.3%, 87.3% | 97.6%, 96.6%, 97.1% | 83.8%, 82%, 74.6% |
| Bi-plots of sources and sediments | Cluster 2 likely dominates contributions to the bed sediment, discrimination appears good. | A combination of clusters 2 and 3 likely dominates contributions to three of the sediment samples and cluster 3 appears to dominate contributions to two samples. Discrimination is good however, discrimination between clusters 2 and 3 is only achieved using DF2, which represents 8.79–9.13% of the total discriminatory power | Clusters 1 and 2 appear to dominate contributions to three samples and inputs from cluster 4 dominate contributions to two of the samples. DF2 representing 20% of total discrimination, is able to discriminate clusters 1 and 4 from clusters 2 and 3. Discrimination between clusters 2 and 3 is limited to a small amount by DF1, therefore equifinality related uncertainties are likely in model outputs. | Sediment provenance is dominated by non‑ironstone sources and source discrimination is good. | Ironstone contributes significantly to one sediment sample. The other sediment samples are likely composed of a combination of sandstone, limestone clays and channel banks. Discrimination between ironstone topsoils and other sources is good, discrimination between the sandstone topsoil and limestone, clays and channel banks group is poor and is only provided by DF2, which accounts for ~5% of total discriminatory power. |
| Virtual mixture source apportionment | Un-mixing models produced the correct provenance of the virtual mixtures. Uncertainties for the mixtures of 100% of each cluster were low; however, with the equal proportions of each cluster they were high. | Mixture apportionment was generally accurate but with a higher associated range of uncertainty than the two-cluster classification. Uncertainty was especially high when apportioning a 100% contribution from cluster 2, with significant estimated contributions from cluster 3 present. | The un-mixing models correctly identified contributions from clusters 3 and 4. However, when apportioning contributions from clusters 1 and 2 uncertainties were high, with significant overlap between the probability density functions for the two sources. The conservative fingerprint failed to identify Cluster 1 as the dominant source when 100% of the mixture was composed of this cluster. | Produced comparable results to the two-cluster groups but with a higher range of uncertainty. | Source apportionment with all three fingerprints for geology-based Classification 2 was unsuccessful. A 100% contribution from clays, limestone and channel banks was not represented in the un-mixing model results and a mixture of equal proportions of the sources produced an output heavily biased towards high sandstone topsoil contributions. |
| Weightings | A weighting of RI increased the accuracy of mixture apportionment for all three fingerprints. | A weighting of HRGB and CI improved mixture apportionment with the Basic and Conservative fingerprints. | No composite fingerprint improved mixture apportionment. Use of the Conservative fingerprint was discontinued due to its poor performance. | A weighting of BackIRM increased the accuracy of mixture apportionment for the Basic and Conservative fingerprints. | No composite fingerprint improved mixture apportionment. Due to the poor performance of Classification 2, its results were not considered for further analysis. |
| Goodness of fit | For the cluster analysis derived source classifications, >50% of model iterations exceeded the 0.35 GOF threshold. With the exception of those for sample Bed 5 where in all but four of the models run all iterations failed to achieve a GOF higher than 0.35 and therefore were rejected. The mean GOF of the model iterations passing the threshold was high (>0.75). GOF for geology classification 1 was generally lower than for the cluster-based classifications, the conservative fingerprint for sample Bed 5 had no iterations which exceeded the 0.35 threshold. | ||||
| Sediment provenance | For sediment samples in the lower half of the catchment and sample Bed 6 in the upper catchment similar contributions were estimated to originate from cluster 1 and cluster 2. Cluster 2 dominated contributions to samples Bed 4 and 5 in the middle catchment. All three composite fingerprints produced similar results although contributions varied by ~20%. | Contributions from cluster 1 are low in all models apart from sample Bed 3 with the basic fingerprint. Topsoil inputs in the lower catchment are primarily from areas which are not over the ironstone geology. Sediment contributions to sample Bed 3 likely originate from localised channel bank inputs. Bed 3 basic fingerprint estimates a much higher contribution from cluster 1 than the other fingerprints, but consistency is reasonable for all other samples. Uncertainties associated with conservative and high variability fingerprints were high for sample Bed 2. Both clusters 2 and 3 are important sediment sources. | There were some large discrepancies between the results of the two composite fingerprints used. Clusters 2 and 3 appear to dominate contributions in to samples Bed 1, however, the basic fingerprint estimated high contributions from cluster 1. For samples Bed 2, 3, and 5 there was either very poor consistency between the composite fingerprints or no model with an acceptable GOF could be produced. For sample Bed 4 cluster 4 which covers a small area in the centre of the catchment dominates contributions, and for Bed 6 cluster 2 dominates. | Ironstone topsoils a minor source in all but sample Bed 3. The basic fingerprint estimates a larger contribution from ironstone than the other fingerprints. | No result produced |