| Literature DB >> 28344443 |
K Gallacher1, C Miller1, E M Scott1, R Willows1, L Pope2, J Douglass2.
Abstract
Measurements recorded over monitoring networks often possess spatial and temporal correlation inducing redundancies in the information provided. For river water quality monitoring in particular, flow-connected sites may likely provide similar information. This paper proposes a novel approach to principal components analysis to investigate reducing dimensionality for spatiotemporal flow-connected network data in order to identify common spatiotemporal patterns. The method is illustrated using monthly observations of total oxidized nitrogen for the Trent catchment area in England. Common patterns are revealed that are hidden when the river network structure and temporal correlation are not accounted for. Such patterns provide valuable information for the design of future sampling strategies.Entities:
Keywords: PCA; connected monitoring networks; flow direction
Year: 2016 PMID: 28344443 PMCID: PMC5347935 DOI: 10.1002/env.2434
Source DB: PubMed Journal: Environmetrics ISSN: 1099-095X Impact factor: 1.900
Figure 1River network in Trent catchment area (gray lines) with 566 monitoring sites (black dots), (left). Location of Trent catchment area in England and Wales (middle). Diagram of simple river network with three monitoring sites and corresponding proportional influence (PI) values for upstream segments (right)
Figure 2Principal component scores for TPCA (unweighted PCA) and TPCA (spatially weighted PCA), for PCs 1 (left), 2 (middle), and 13 (right). Note: plots are on different scales
Results from SPCA(unweighted PCA), SPCA(spatial weights), and SPCA(spatial and temporal weights)
| PCA | PC1 (%) | PC2 (%) | PC3 (%) | var3 (%) |
| var |
|
|---|---|---|---|---|---|---|---|
| SPCA | 42 | 9 | 6 | 57 | 8 | 70.8 | 9,069 |
| SPCA | 38 | 9 | 5 | 52 | 12 | 70.5 | 8,354 |
| SPCA | 31 | 7 | 5 | 43 | 23 | 70.1 | 6,910 |
PC1‐3 contains % variability explained for each of the PCs, respectively; var3 is the % variability explained by the first three principal components; k is the number of principal components retained to explain at least 70% of the variance of the data; var is the amount of variance explained by k principal components; is the reconstruction error from k principal components.
Figure 3Glyph plots with loadings for the first three principal components from SPCA (top – no weights) and SPCA (bottom – weights for discharge and time) for a zoomed in section of the network. (Red indicates negative values, and blue indicates positive values, in online version). Length of line indicates relative magnitude of loading. Starting at the 12 o'clock position, the length of the line reflects the magnitude of the loading for the first PC, and moving clockwise, the other lines represent the loadings for subsequent PCs