Human agriculture, wastewater, and use of fossil fuels have saturated ecosystems with nitrogen and phosphorus, threatening biodiversity and human water security at a global scale. Despite efforts to reduce nutrient pollution, carbon and nutrient concentrations have increased or remained high in many regions. Here, we applied a new ecohydrological framework to ~12,000 water samples collected by the U.S. Environmental Protection Agency from streams and lakes across the contiguous U.S. to identify spatial and temporal patterns in nutrient concentrations and leverage (an indicator of flux). For the contiguous U.S. and within ecoregions, we quantified trends for sites sampled repeatedly from 2000 to 2019, the persistence of spatial patterns over that period, and the patch size of nutrient sources and sinks. While we observed various temporal trends across ecoregions, the spatial patterns of nutrient and carbon concentrations in streams were persistent across and within ecoregions, potentially because of historical nutrient legacies, consistent nutrient sources, and inherent differences in nutrient removal capacity for various ecosystems. Watersheds showed strong critical source area dynamics in that 2-8% of the land area accounted for 75% of the estimated flux. Variability in nutrient contribution was greatest in catchments smaller than 250 km2 for most parameters. An ensemble of four machine learning models confirmed previously observed relationships between nutrient concentrations and a combination of land use and land cover, demonstrating how human activity and inherent nutrient removal capacity interactively determine nutrient balance. These findings suggest that targeted nutrient interventions in a small portion of the landscape could substantially improve water quality at continental scales. We recommend a dual approach of first prioritizing the reduction of nutrient inputs in catchments that exert disproportionate influence on downstream water chemistry, and second, enhancing nutrient removal capacity by restoring hydrological connectivity both laterally and vertically in stream networks.
Human agriculture, wastewater, and use of fossil fuels have saturated ecosystems with nitrogen and phosphorus, threatening biodiversity and human water security at a global scale. Despite efforts to reduce nutrient pollution, carbon and nutrient concentrations have increased or remained high in many regions. Here, we applied a new ecohydrological framework to ~12,000 water samples collected by the U.S. Environmental Protection Agency from streams and lakes across the contiguous U.S. to identify spatial and temporal patterns in nutrient concentrations and leverage (an indicator of flux). For the contiguous U.S. and within ecoregions, we quantified trends for sites sampled repeatedly from 2000 to 2019, the persistence of spatial patterns over that period, and the patch size of nutrient sources and sinks. While we observed various temporal trends across ecoregions, the spatial patterns of nutrient and carbon concentrations in streams were persistent across and within ecoregions, potentially because of historical nutrient legacies, consistent nutrient sources, and inherent differences in nutrient removal capacity for various ecosystems. Watersheds showed strong critical source area dynamics in that 2-8% of the land area accounted for 75% of the estimated flux. Variability in nutrient contribution was greatest in catchments smaller than 250 km2 for most parameters. An ensemble of four machine learning models confirmed previously observed relationships between nutrient concentrations and a combination of land use and land cover, demonstrating how human activity and inherent nutrient removal capacity interactively determine nutrient balance. These findings suggest that targeted nutrient interventions in a small portion of the landscape could substantially improve water quality at continental scales. We recommend a dual approach of first prioritizing the reduction of nutrient inputs in catchments that exert disproportionate influence on downstream water chemistry, and second, enhancing nutrient removal capacity by restoring hydrological connectivity both laterally and vertically in stream networks.
Nutrient pollution is one of the most serious and costly environmental crises [1-3]. The overabundance of carbon (C), nitrogen (N), and phosphorus (P) in many terrestrial, aquatic, and marine environments has created eutrophic (over-nourished) conditions that harm human health and economy, degrade aquatic habitat, and exacerbate climate change [4-8]. Many human activities release nutrients into the environment, but agriculture, human sewage, industrial wastewater, and fossil fuel combustion are the primary sources [9-11]. Pioneering water quality legislation, such as the Clean Water Act in the U.S., has been effective at reducing discrete pollutant sources (e.g., wastewater and industrial effluent) in many regions, but nutrient legacies and continued diffuse nutrient sources from agriculture and atmospheric deposition sustain widespread eutrophication [12-18]. Additionally, two-way interactions between eutrophication and climate change are complicating mitigation efforts on both fronts [5,19-21]. Consequently, while major water quality improvements occurred in the U.S. immediately after the implementation of the Clean Water Act and other policies, recent efforts to reduce the occurrence of harmful algal blooms and nutrient loading more generally appear to have plateaued [22-24].Continued improvement of water quality in the U.S. will require an understanding of both discrete and diffuse nutrient sources and sinks across the landscape. For example, legacy nutrients stored in soils and groundwater can create time lags between changes in nutrient inputs and the response of nutrient outputs [25-28]. These lags mean that current nutrient conditions can be a result of nutrient overuse from previous decades that arise as discrete nutrient sources from the past. There is compelling evidence that nutrient legacies are driving continued eutrophication in many environments, particularly those with long nutrient residence times in soil and groundwater [16,29-31]. However, it is unclear if spatiotemporal patterns of nutrient concentrations correspond with the legacy nutrient hypothesis (that nutrient pollution will only improve after decades of patient persistence)—or if contemporary management strategies aimed at reducing non-point source pollution could rapidly improve nutrient conditions [26,29,32-34]. The diversity of nutrient recovery trajectories following reductions in nutrient loading suggest that nutrient removal capacity, contemporary nutrient loading, and nutrient-specific transport dynamics are as or more important than nutrient legacy in determining nutrient fluxes in some watersheds [22,35-38].While detailed inventories of nutrient loading are gaining accuracy and expanding in temporal and spatial coverage [39-42], simple diagnostic tools are still needed to assess nutrient trends and patterns in groundwater and surface water ecosystems [16,17,23,24,35,37]. Recently, Abbott and others [43] proposed a new analytical framework to assess spatiotemporal nutrient dynamics from infrequent, spatially extensive surveys (hereafter synoptic samplings). This synoptic framework measures the persistence of nutrient sources and sinks, the spatial scale or patch size of variation in nutrient concentrations, and the influence or leverage of individual subcatchments on watershed-level nutrient flux. The persistence of spatial patterns in the surface water network is quantified using rank correlations between instantaneous concentrations of nutrients at the same locations across samplings, with higher correlations indicating spatial patterns of nutrient sources and sinks are conserved over time [6,43-45]. The patch size of nutrient retention, removal, or release within an ecosystem can also be evaluated by analyzing the spatial scale (i.e., catchment size) where variance in concentration decreases rapidly or “collapses” toward the watershed mean [43,46,47]. Shogren and others [48] expanded on this framework by calculating the leverage of each subcatchment on overall watershed nutrient transport, allowing a first-order estimation of watershed-scale nutrient budgets and the identification of critical source areas from infrequent but spatially-extensive synoptic samplings [49]. While these metrics cannot and do not seek to replace the important work of measuring nutrient inputs and outputs through time [22,50,51], they allow robust assessment of current nutrient state. When measured repeatedly through time, the persistence, patch size, and leverage metrics could enable detection of trends and shed light on the effectiveness of changes in management and the inherent nutrient removal capacity of different ecosystems [36,44,45,52].In this context, we applied a synoptic ecohydrological approach [6,43,44,48] to data from infrequent but spatially extensive surveys covering the continental U.S. We focused on four primary questions: 1) Is water quality improving across surveys (i.e., have C, N, and P concentrations decreased at national and ecoregion scales over the 2000–2019 period)? 2) How effective are infrequent, spatially extensive surveys at characterizing water chemistry patterns and drivers in the U.S.? 3) What is the net nutrient release and uptake throughout surface waters of the U.S. (i.e., where are the nutrient sources and sinks)? 4) What landscape characteristics, including human land use, are associated with nutrient source and sink dynamics across the U.S.? We hypothesized that improved nutrient management, declines in atmospheric NOX deposition, and reductions in point source loads would cause decreases in nutrient concentrations through time—with some regional variation because of poorly constrained diffuse nutrient sources and variability in nutrient removal capacity and hydrological residence time [36,50,53-56]. Specifically, we hypothesized that these reductions in nutrient loading would create low spatial persistence among catchments (i.e., restructuring of nutrient concentration patterns) due to variable rates of nutrient decline and uneven elimination of point and diffuse nutrient sources. Conversely, spatial persistence could remain high despite overall decreases in concentration if local discrete and diffuse nutrient sources are less important than nutrient legacies, atmospheric deposition, and differences in nutrient removal capacity. Finally, we hypothesized that differences in land use and climate would result in distinct patch sizes among ecoregions [43], and that headwater streams and lakes would be disproportionately responsible for nutrient flux, in line with observations from other regions [43,48,57,58].
Materials and methods
Collection and analysis of water samples
We analyzed data collected as part of the National Aquatic Resource Surveys (NARS) by the U.S. Environmental Protection Agency (EPA) Office of Water to understand the hydrochemical patterns of water quality in the U.S. during seven surveys spanning 2000–2019 [59]. Detailed information about site selection and sampling can be found in the NARS documentation [59], and we include an abbreviated description below. All surveys used a weighted probability-based sample design for site selection that included 8,085 river and stream samples (hereafter referred to simply as streams) and 3,766 lake samples. The data were collected as part of the Wadeable Streams Assessment (WSA; 2000–2004), the National Rivers and Streams Assessments (NRSA; 2008–2009, 2013–2014, and 2018–2019), and the National Lake Assessments (NLA; 2007, 2012, and 2017). During the WSA (2000–2004), 1,392 samples were collected from 1,391 unique streams. From the NRSA, the 2008–2009 survey consisted of 2,320 samples collected from 2,114 streams, the 2013–2014 survey included 2,261 samples from 1,985 streams, and the 2018–2019 survey included 2,112 samples from 1,914 streams. The WSA shared 357 sites with the NRSA 2008 survey but had no shared sites with any of the other NRSA surveys. The NRSA 2008 survey shared 790 sites with NRSA 2013, and the NRSA 2013 survey shared 931 sites with the NRSA 2018 survey. For lakes, the NLA 2007 survey included 1,326 water chemistry samples from 1,148 lakes, the NLA 2012 survey included 1,230 samples from 1,127 lakes, and the NLA 2017 survey included 1,210 samples from 1,099 lakes. The NLA 2007 survey shared 406 sites with the NLA 2012 survey, and the NLA 2012 survey shared 495 sites with the NLA 2017 survey (See S1 and S2 Tables for summary statistics). Because catchment area data were not yet released by the EPA for the most recent surveys (NLA 2017 and NRSA 2018), we used catchment area for the sites that were sampled in any previous surveys. Using this method, we were able to concatenate area for 56% of the sites sampled in the most recent surveys for inclusion in the leverage and patch size analyses.EPA site selection criteria changed for both lakes and streams between surveys, affecting the range of catchment sizes studied. In the 2007 NLA survey, only lakes larger than 4 hectares were sampled, while in the 2012 and 2017 surveys, lakes larger than 1 hectare (a minimum of 0.1 hectares of open water) and at least 1-meter deep were sampled. For streams, the 2004 WSA study included only small, wadeable streams that had an average Strahler stream order of 2. The 2008, 2013, and 2018 NRSA studies included a larger range of catchment sizes and sampled large rivers such as the Mississippi, Columbia, Colorado, and Snake Rivers. Because of these adjustments to site selection, the WSA and NRSA stream surveys had unequal variance for catchment size but the lake surveys were less affected (S1 Fig). We accounted for these changes in sampling strategies by using paired t-tests and rank correlations for the temporal analysis, which rely solely on repeat samples between surveys.EPA field crews sampled lakes during the spring and summer, and streams were sampled in summer during baseflow conditions. At each site, a 4 L grab sample was collected from the surface at designated coordinates. Lake samples were collected from the euphotic zone (generally the first 2 meters or less as calculated by the Secchi Disk Transparency method) using an integrated sampler at the deepest point of natural lakes or from the midpoint of reservoirs. All samples were shipped on ice and analyzed within 24–48 hours of collection. Samples were analyzed for a suite of indicators including water chemistry, land use classification, physical habitat, benthic community, and fish ecology. In this study, we focused on dissolved organic carbon (DOC), nitrate (NO3-), total nitrogen (TN), and total phosphorus (TP), which are important water quality parameters relating to eutrophication, pollutant transport, aquatic habitat, and human drinking water [60-62]. While these parameters can serve as nutrient or energy sources [61,63,64], we refer to them as nutrients for convenience. We replaced zeroes in the concentration data with the measurement detection limit divided by 2.
Temporal change in nutrients
We assessed temporal changes of nutrient concentrations (i.e., changes from survey to survey) at the continental and regional scales in the contiguous U.S. To account for variability in site selection across samplings, we used paired t-tests to evaluate changes between repeat sample sites only (S3 Table). This paired approach provided a sensitive and robust method to quantify change through time, which we visualized with boxplots (Fig 1). We log-transformed non-normal distributions (determined by histogram and normal Q-Q plot analysis) prior to running t-tests and used a decision criterion of α = 0.05 to determine statistical significance. For sites that were sampled multiple times over individual survey periods (some locations were intentionally sampled repeatedly to quantify seasonal and inter-annual variability), we used the mean concentration of each parameter across the period.
Fig 1
Map (panel A) of stream and lake sampling points and distributions of nutrient concentrations at continental (panel B) and ecoregion levels (panel C). The map (panel A) shows EPA NARS sample sites (black circles) included in the analysis. The map area is shaded by Level 1 Ecoregion, and those included in the analysis are labelled with the ecoregion name and number. Boxplots (panels B and C) show change in nutrient concentrations for all sites sampled in each survey. The middle horizontal line in the boxplots shows the median and the boxes represent the interquartile range. Silver diamonds represent mean concentration. Asterisks indicate significant differences between repeat samples as determined by paired t-tests (p < 0.05)—orange asterisks mark significant increases and blue asterisks mark significant decreases.
Map (panel A) of stream and lake sampling points and distributions of nutrient concentrations at continental (panel B) and ecoregion levels (panel C). The map (panel A) shows EPA NARS sample sites (black circles) included in the analysis. The map area is shaded by Level 1 Ecoregion, and those included in the analysis are labelled with the ecoregion name and number. Boxplots (panels B and C) show change in nutrient concentrations for all sites sampled in each survey. The middle horizontal line in the boxplots shows the median and the boxes represent the interquartile range. Silver diamonds represent mean concentration. Asterisks indicate significant differences between repeat samples as determined by paired t-tests (p < 0.05)—orange asterisks mark significant increases and blue asterisks mark significant decreases.We tested for temporal changes at the scale of the contiguous U.S. and for EPA Level I Ecoregions. We used ecoregions to group samples because these divisions are commonly used to implement ecosystem management strategies across federal agencies, state agencies, and nongovernmental organizations [65]. Additionally, ecoregions correspond with general ecosystem characteristics, including climate, vegetation, and land-use parameters [60], grouping areas with common ecological configurations and human disturbance patterns in North America. Dividing the analysis by ecoregion also reduced uncertainty from differences in specific discharge (the amount of runoff per unit area), which is one of the assumptions of the leverage analysis (see below). Our analysis focused on ecoregions 5–13, which are the main ecoregions found in the contiguous U.S., though ecoregion 12 did not have enough sampling points to calculate all the metrics. We divided the analysis by aquatic environment (i.e., lakes and streams) because of differences in nutrient dynamics between flowing systems with lower residence times (streams), and standing or lotic systems (lakes) with higher residence times and hence opportunities for uptake or release of different nutrients [61,66].
Ecohydrological metrics
To understand how nutrients enter and propagate through freshwater landscapes, we calculated several ecohydrological metrics that evaluate spatiotemporal nutrient dynamics based on infrequent, spatially extensive surveys, such as the NARS [6,23,43,44,48]. These metrics quantify the spatial persistence, patch size, and leverage (relative influence) of nutrient sources and sinks, allowing comparison of nutrient state among different ecoregions and watershed sizes. When applied repeatedly, these metrics can indicate trends and effectiveness of management interventions. We present a general description and method of calculation for each of these metrics in the following paragraphs.Spatial persistence is the degree to which spatial patterns within a watershed or region stay the same through time (e.g., the same locations always have higher nutrient concentrations relative to other locations). Spatial persistence indicates the representativeness of infrequent samplings, which can be associated with the synchronicity of changes in nutrient concentration through time, or the absolute difference in nutrient concentration among sampling locations [6,43,44,48]. For example, if nutrients have low spatial persistence (perhaps due to shifts in local sources or sinks) then analyses of patch size and nutrient contribution made for a single moment in time may not be representative of long-term nutrient status [6,43,67]. We calculated spatial persistence of water chemistry parameters for lakes and streams across the contiguous U.S. using Spearman correlations of nutrient concentration following Equation 3 from Abbott et al [43]. This nonparametric method compares only the rank of the data (e.g., the lowest value is assigned a rank of 1), making it robust to extreme values, which are common in aquatic chemistry. We calculated the persistence of nutrient concentrations at the national level and by ecoregion for each waterbody (i.e., lakes and streams) and nutrient. We use a spearman correlation of √0.5 (i.e., 0.71) as a qualitative threshold, indicating that more than 50% of the spatial pattern persisted across sampling dates for that nutrient (Fig 2A). To understand differences between lake and stream persistence, we regressed lake and stream ecoregion persistence with the null hypothesis that persistence values between waterbodies would not be different from each other (i.e., R2 ~ 1, Fig 2B). We also analyzed the ions and physiochemical parameters (i.e., pH, turbidity, total suspended solids, and acid neutralizing capacity) included in the water chemistry dataset (S2 Fig).
Fig 2
Spatial persistence for C, N, and P by ecoregion for lakes and streams.
In panel A, points that fall above the orange line (y = √0.5) maintain more than half the spatial pattern for that parameter in pairwise comparisons of the two sampling dates. Horizontal gray lines show continental-scale persistence for that parameter. Panel B compares stream and lake persistence for each nutrient (i.e., each green dot represents mean DOC persistence for an ecoregion). The variable relationship between lake and stream persistence suggests that different factors affect persistence of nutrients in lakes and streams even within the same ecoregion.
Spatial persistence for C, N, and P by ecoregion for lakes and streams.
In panel A, points that fall above the orange line (y = √0.5) maintain more than half the spatial pattern for that parameter in pairwise comparisons of the two sampling dates. Horizontal gray lines show continental-scale persistence for that parameter. Panel B compares stream and lake persistence for each nutrient (i.e., each green dot represents mean DOC persistence for an ecoregion). The variable relationship between lake and stream persistence suggests that different factors affect persistence of nutrients in lakes and streams even within the same ecoregion.Patch size analysis evaluates what spatial scales express the greatest variability in nutrient concentration, revealing the size of inherent ecosystem characteristics and human disturbances that create nutrient retention, removal, or release [36,43,46,47]. For example, if large-scale atmospheric deposition was the primary source of a nutrient, it would have relatively large patches with high nutrient concentration, whereas if discrete point sources were dominant, there would be relatively smaller patches of high and low nutrients. We identified the patch size of nutrient sources and sinks for lake and stream catchments by performing a variance collapse analysis on nutrient concentration data [43,48]. First, we scaled the nutrient concentrations by subtracting the ecoregion mean and dividing by the ecoregion standard deviation so that the concentration of each parameter had a mean of zero and a variance of 1. This allowed comparison of nutrients with very different mean concentrations. Next, we ordered the scaled concentration data by catchment size from smallest to largest and used the changepoint package in R to determine breakpoints in the data [68,69]. All breakpoints were then manually reviewed to check for false positives, which can occur when a single extreme point triggers the algorithm. The resulting breakpoints identified decreases of variance (i.e., collapse thresholds) with spatial scale, indicating the patch size of nutrient sources or sinks [43]. We assessed breakpoints for the entire national dataset (Fig 3A and 3B) and by ecoregion (Figs 3C, 3D and S3), except for ecoregion 12, which had few or limited samples.
Fig 3
Variance collapse for nutrients in lakes and streams in the U.S.
Panel A shows variance collapse thresholds (marked by vertical gray bars) of the entire national dataset. Panel B is a zoomed-in version of panel A showing variance collapse thresholds for C, N and P at the national scale. Violin plots show the distribution of ecoregion variance collapse thresholds for lakes (panel C) and streams (panel D) with short horizontal bars bounding the interquartile range and a longer horizontal bar showing the median. Colored point and whisker plots show the median and range of variance collapse thresholds for each ecoregion within the violin plots. Gray lines connect between ecoregion thresholds for different nutrients to show a general relationship among nutrients. Thresholds were determined by partial exact linear time (PELT) analysis.
Variance collapse for nutrients in lakes and streams in the U.S.
Panel A shows variance collapse thresholds (marked by vertical gray bars) of the entire national dataset. Panel B is a zoomed-in version of panel A showing variance collapse thresholds for C, N and P at the national scale. Violin plots show the distribution of ecoregion variance collapse thresholds for lakes (panel C) and streams (panel D) with short horizontal bars bounding the interquartile range and a longer horizontal bar showing the median. Colored point and whisker plots show the median and range of variance collapse thresholds for each ecoregion within the violin plots. Gray lines connect between ecoregion thresholds for different nutrients to show a general relationship among nutrients. Thresholds were determined by partial exact linear time (PELT) analysis.Subcatchment leverage is a metric of the contribution of nutrients from each subcatchment relative to the overall watershed nutrient flux [43,48]. It combines concentration with catchment size (a proxy of discharge assuming specific discharge is similar across sites) to estimate whether each subcatchment increases (positive value) or decreases (negative value) the nutrient flux at the outlet, assuming conservative nutrient transport [43]. We calculated subcatchment leverage by multiplying the difference in subcatchment and watershed outlet nutrient concentrations with the ratio of subcatchment area to watershed outlet area [43]. We used ecoregion instead of topographic watershed, defining the largest measured watershed within the ecoregion as the watershed outlet. This infringes an assumption in the calculation (all subcatchments are usually nested), but we chose this method for three reasons. First, even though each EPA sampling included thousands of locations, there were usually only a few points within individual watersheds, except for large rivers such as the Mississippi and Colorado. Second, many of the sampled locations were not nested within larger watersheds, such as those draining directly to the ocean or occurring within endorheic (terminal) basins. Small watersheds are influential in determining lateral nutrient transport, and are key indicators of local nutrient management and conditions [43,57,70]. Third, splitting by ecoregion allows grouping of sampling points with more similar land uses and climatic conditions than would be seen within a single large watershed. This allows for more sensitive comparison of nutrient source and sink dynamics in the terrestrial environment and aquatic network. For example, two sampling locations that are adjacent but occur in different watersheds may have more in common than nested watersheds that are more geographically separated. However, we acknowledge that some ecoregions are quite large (e.g., ecoregions 8 and 9) and encompass areas with diverse land use and climate. Ultimately, we opted to calculate subcatchment leverage primarily by ecoregion instead of large watersheds following the reasons above. However, we recognize that this is a compromise that limits some interpretation in our results. Consequently, we discuss primarily the relative differences among watersheds within an ecoregion.Because discharge measurements were not available at most of the sampled locations, we assumed constant intra-ecoregion specific discharge [43,48]. We tested this assumption with discharge data for each ecoregion from USGS reference stations. We found comparable specific discharge across the sampling dates and for most ecoregions except ecoregions 6 and 7 (Northwestern Forested Mountains and Marine West Coast Forests), which showed substantial differences across sampling dates and between sites (S4 Fig). Raw leverage values were converted to percentages to estimate whether a specific subcatchment increased (leverage > 0) or decreased (leverage < 0) nutrient flux at the ecoregion outlet [48]. Thus, subcatchments with positive leverage indicated nutrient sources and negative leverage values indicated nutrient sinks. Very large leverage values can occur—for example, well over 100%—because some material is retained or removed as the water flows over and through the landscape before reaching the outlet. Conversely, estimates of leverage could be influenced by the differences in specific discharge mentioned above. For example, watersheds with high specific discharge tend to have shorter nutrient legacies [36]. Mean leverage values for a watershed (or in this case ecoregion) can estimate network-level mass balance. When the mean leverage is positive, this implies there has been nutrient removal within the surface water network (i.e., there are more sources than can be accounted for at the outlet), whereas a mean negative leverage value indicates production within the surface water network [48]. Given the multiple assumptions in the analysis, we interpreted the leverage values cautiously.
Using machine learning to predict nutrient concentration and flux
Machine learning models are increasingly used to make hydrological predictions [71,72], and the most accurate versions tend to utilize ensemble models that combine inputs from independent algorithms before making final decisions [73-75]. Machine learning models can also be used to explore complex, non-linear relationships between predictor and target variables. In these situations, the behavior of the model is probed to identify the importance of predictor variables, such as catchment size or ecoregion, to predict various targets (in this case nutrient concentration). While previous work has used single models in this kind of analysis, in this study we used four different machine learning models to produce more robust results and compare performance of the models. We included some of the most widely used models, including a decision tree regressor, a multi-layer perceptron, a random forest, and a gradient boosting regressor. For interpretation, we combined the feature importance and partial dependence plots of all four models into an ensemble, though we visualized the individual outputs to compare similarity qualitatively.We used the ensemble model to explore relationships between the predictor variables (e.g., catchment characteristics and climate variables) and target variables (e.g., nutrient concentrations and subcatchment leverage, though we focus on concentrations because of the leverage assumptions mentioned above). Data for catchment characteristics were available through the NARS dataset including elevation; EPA ecoregion; and percent cover of agriculture, wetlands, forest, and urban land as estimated from areal NLCD land use classes. We planned to include land use data from each of the sampling campaigns to assess the legacy hypothesis directly (i.e., does land use from 5 or 10 years ago predict water quality better than current land use), but catchment characteristics were only available for the earlier surveys. Therefore, the models included concentration and leverage data from the first two lake and the first three stream surveys (i.e., NLA 2007 and 2012, and WSA 2004 and NRSA 2008 and 2013) as a case study. In addition to catchment characteristics, we extracted mean annual air temperature (MAAT) and precipitation from the WorldClim repository using the raster package in R. To account for differing biogeochemical processes among aquatic environments, we ran separate models for lakes and streams. We accounted for temporal differences between EPA surveys (e.g., whether the sample was collected during the 2007 or 2012 NLA) by including EPA survey as a categorical variable. Samples with missing stream chemistry data were removed from the dataset (<2% of samples with catchment characteristics data). Predictor variables were normalized to have values between 0 and 1. The model trained on a random subset of 80% of the concentration and leverage data and predicted the other 20%. Because the chemistry data were right-skewed, we binned the data and reweighted the probability with which each sample was considered, so that the model would more effectively learn to train on the feature data instead of predicting the most likely target values [76]. The multi-layer perceptron was the exception because the model could not use sample weights, thus the data were randomly drawn, agnostic to data distribution. We compared the predicted output versus the actual data and report metrics of model fit and relative importance of predictors.Statistical and analytical methods were performed using R version 3.5.4 [77], except for the machine learning models that were completed using custom python (version 3.7) scripts and the scikit-learn package [78]. Code and data for the machine learning analysis are available at https://github.com/Populustremuloides/EPA_Synoptic_Water_Chemistry.git.
Results
Change in nutrient concentrations from 2000 to 2019
The temporal analysis of paired survey sites showed differing patterns among nutrients and between lakes and streams at the national and ecoregion scales (Fig 1). There were negligible changes in national and ecoregion lake DOC concentration (p > 0.05) during the 2007–2017 lake survey period ions (on average, ecoregions experienced a 7% increase in DOC concentration across surveys; Fig 1C). NO3- steadily decreased in lakes nationwide from 2007–2017, with a 52% decrease from 2007 to 2012 and a 36% decrease from 2012 to 2017. Individual ecoregions followed a similar trend with an average decrease of 39% across ecoregions and surveys. Ecoregion 10 was the only outlier with a 36% significant increase in NO3- concentration from 2012 to 2017. TN concentration did not change significantly nationwide from 2007 to 2012 but increased by 9% from 2012 to 2017 (p < 0.05). Individual ecoregions reflected this trend with slight but significant increases in ecoregions 5, 6, 8, and 9. TN concentration in ecoregion 10 followed a similar trajectory as NO3-: a decrease in concentration from 2007 to 2012 and then an increase from 2012 to 2017. TP significantly increased by 57% in lakes nationwide between 2007 and 2012 and then decreased by 17% from 2012 to 2017. Individual ecoregions exhibited similar patterns except ecoregions 11 and 13 which showed non-statistically significant increases in concentration from 2012 to 2017.Streams followed the same pattern as lakes for TP, with stream TP concentration increasing significantly from 2000 to 2013 (156% increase from 2004 to 2008 and 17% increase from 2008 to 2013) and then decreasing by 61% from 2013 to 2018. All ecoregions showed significant decreases in TP concentration in streams from 2013 to 2018 except ecoregion 11, which did not have a significant change but still decreased by 22%. DOC and TN concentrations in streams also followed the pattern of increase from 2000 to 2013 and then decreased from 2013 to 2018, though the magnitude of change was less than for TP. For example, TP concentration increased from 2000 to 2013 by 62% across ecoregions on average, while DOC increased by 3% and TN decreased by 0.5%. Likewise, TP decreased by 46% on average across ecoregions from 2013 to 2018, but DOC and TN decreased by only by 3% and 9%, respectively. Additionally, there were fewer statistically significant changes across surveys and ecoregions for stream DOC and TN than for TP. Stream NO3- concentration followed the opposite trend as stream DOC, TN, and TP, decreasing from 2004 to 2008 by 53% and then increasing by 16% and 61% in the subsequent surveys (i.e., 2008 to 2013 and 2013 to 2018). Summary statistics for all paired t-tests are reported in the supplementary material (S3 Table).
Spatial persistence of nutrients
Most major nutrients (DOC, TN, and TP; Fig 2) and ions (S2 Fig) at the national scale had an average spearman rank correlation above the threshold of ρ = 0.71 (the square root of 0.5), representing the persistence of more than half of the spatial pattern. NO3- was above the minimum threshold for streams but not for lakes (ρ = 0.77 and 0.37 respectively). Lakes and streams across ecoregions exhibited similar nutrient persistence for DOC, TN, and TP, but NO3- was less persistent for lakes in most ecoregions (Fig 2). A simple linear regression between lake and stream ecoregion rank correlations showed a positive relationship with an R2 of 0.46 (Fig 2B). This result is largely driven by low rank correlations for NO3- and a few other low (e.g., ρ < 0.30) rank correlations for DOC, TN, and TP lake values from ecoregions 11 and 13. Spatial patterns of DOC concentration were persistent in every ecoregion except for ecoregions 11 and 13. Spatial patterns of NO3- persisted for most streams except in ecoregion 6, 9, 11, and 13; and spatial patterns were not persistent for NO3- concentration in lakes for all ecoregions. TP exhibited spatial persistence for all ecoregions except for lakes in ecoregion 7 and streams in ecoregion 10, 11, and 13.
Patch size of nutrient sources and sinks
Scaled concentrations tended to converge towards 0 with increasing catchment area (Figs 3 and S3), indicating a representative sampling of source and sink tributaries across ecoregions. Variance collapse thresholds or patch size was nutrient dependent and ranged from 500 to 4,000 km2 for national data and 4 to 924 km2 for ecoregions. Median patch size of nutrient source and sink dynamics was less than 250 km2 across nutrients and ecoregions, suggesting relatively fine-grained sources of variation even at continental scales (Fig 3). Lakes and streams had similar variance collapse thresholds for all major nutrients except TP, which had a much smaller patch size for streams than for lakes. Likewise, ecoregion 9 showed almost a 3-fold difference in patch sizes for nitrogen and phosphorus (~125 km2 vs ~350 km2). We note that not all ecoregions had sufficient sample size to determine variance collapse thresholds through PELT analysis (specifically ecoregions 12 and 13).
Quantifying sources and sinks by ecoregion
Because the EPA NARS were not designed using a nested subcatchment framework, we used ecoregions to organize the leverage analysis, though this introduces some limitations as discussed in the methods. Subcatchment leverage generally followed the expected pattern wherein spatial variance of leverage decreased as subcatchment area increased (i.e., longitudinal averaging or mixing), and therefore most catchments had low leverage. Similarly, subcatchment leverage decreased with Strahler stream order. In other words, headwater catchments (Strahler stream order < 3; S5 Fig) largely controlled summer nutrient flux, including in downstream reaches (net change in longitudinal flux was largely ±1%, Figs 4, 5 and S6). Streams followed this spatial pattern the most closely, while lakes exhibited more spatial variability with some influential medium-sized subcatchments (Fig 4A). A notable exception to the headwater dominance occurred in the lower reaches of the Mississippi River where extremely large subcatchments (greater than 2,000,000 km2) exhibited surprising variability in subcatchment leverage for stream TN (Fig 4A). These high leverage values indicate additions of TN sufficient to change mainstem concentration. Most nutrients exhibited moderate to low leverage (both positive and negative), such that 92–98% of subcatchments accounted for less than 25% of ecoregion outflow concentration. Stated conversely, 2–8% of subcatchments accounted for 75% of the observed nutrient flux. An even smaller proportion of subcatchments (0.05–4%) had extremely large positive leverage values (> 100% leverage, and in some cases upwards of 3,000%), indicating critical source areas and inferring substantial retention or mixing of these nutrients in the surface water network (i.e., removal or dilution processes are needed to close the mass balance). The spatial analysis of the distribution of subcatchment leverage further highlighted that ecoregion nutrient dynamics were largely controlled by a handful of highly influential subcatchments (Fig 5).
Fig 4
Leverage for lakes and streams at the national level.
Positive leverage indicates nutrient sources and negative leverage indicates nutrient sinks (or weaker sources). In panel A, note the surprisingly low and high leverage values for DOC and TN in the large catchments of ecoregion 8. These leverage values correspond with reservoirs in the downstream reaches of the Mississippi River before entering the Gulf of Mexico. Panel B shows zoomed-in distributions of leverage values for nutrients in lake and stream catchments in the U.S. The horizontal gray bars represent the 95% confidence interval about the median, and the horizontal black bars represent the mean. A positive mean can be interpreted as the percentage of the total nutrient mass added to the stream network that did not leave the network (net removal), while a negative mean represents net production. Black arrows show that positive mean leverage values indicate nutrient removal and negative values indicate nutrient production.
Fig 5
Subcatchment leverage mapped across the contiguous U.S.
Points represent lakes and streams that were sampled over the study period (2000–2019). The points are colored by leverage value with cool colors representing negative leverage values (e.g., concentrations lower than the ecoregion outlet) and warm colors representing positive leverage values (e.g., concentrations higher than the ecoregion outlet).
Leverage for lakes and streams at the national level.
Positive leverage indicates nutrient sources and negative leverage indicates nutrient sinks (or weaker sources). In panel A, note the surprisingly low and high leverage values for DOC and TN in the large catchments of ecoregion 8. These leverage values correspond with reservoirs in the downstream reaches of the Mississippi River before entering the Gulf of Mexico. Panel B shows zoomed-in distributions of leverage values for nutrients in lake and stream catchments in the U.S. The horizontal gray bars represent the 95% confidence interval about the median, and the horizontal black bars represent the mean. A positive mean can be interpreted as the percentage of the total nutrient mass added to the stream network that did not leave the network (net removal), while a negative mean represents net production. Black arrows show that positive mean leverage values indicate nutrient removal and negative values indicate nutrient production.
Subcatchment leverage mapped across the contiguous U.S.
Points represent lakes and streams that were sampled over the study period (2000–2019). The points are colored by leverage value with cool colors representing negative leverage values (e.g., concentrations lower than the ecoregion outlet) and warm colors representing positive leverage values (e.g., concentrations higher than the ecoregion outlet).At the continental scale, our results pointed to net removal or retention of N and P, and net conservative transport of DOC. Mean subcatchment leverage for NO3-, TN, and TP was positive and much greater than 0, indicating net removal or, at the minimum, retention in the surface water network (Fig 4B). Stated differently, the flux observed at the catchment—or in this case ecoregion—outflow was substantially lower than the inputs from the subcatchments, implying net removal. Mean leverage for DOC was close to zero (1% and 0.1% for lakes and streams), representing a neutral mass-balance (i.e., conservative mixing with little net production or removal).
Modelling nutrients with catchment characteristics and climate
Nutrient concentrations showed a variety of relationships with catchment characteristics and climate based on the machine learning analyses (Fig 6). We synthesized the results from the four models—a decision tree regressor, multi-layer perceptron, random forest, and gradient boosting regressor—to democratize predictions of nutrient concentration. The average R2 among the four models was 0.66 and 0.32 for lakes and streams respectively. Additionally, model predictions were compared to a random number generator that sampled from a normal distribution centered at the mean for each nutrient, and lake and stream models, on average, were 5.36 and 3.23 times better respectively at predicting water chemistry over the random number generator. Interestingly, lake models performed the best when predicting TP concentration (mean R2 = 0.81), but stream model performance was the worst for the same parameter (mean R2 = 0.03). Lake models also generally outperformed stream models when predicting DOC, NO3-, and TN concentrations (S7A Fig).
Fig 6
Machine learning models illustrate the relationship between catchment characteristics, climate variables, and nutrient concentration.
Biplots between lake and stream model predictor importance (panels A, C, E, and G) show the most important variables that predict nutrient concentration. Partial dependence plots (panels B, D, F, and H) show the relationship between the dominant model predictors (e.g., mean annual air temperature (MAAT), agriculture, etc.), and nutrient concentration. The y-axes of the partial dependence plots have been normalized between 0 and 1 for model comparison. Four different machine learning models are shown to democratize the results and interpretation.
Machine learning models illustrate the relationship between catchment characteristics, climate variables, and nutrient concentration.
Biplots between lake and stream model predictor importance (panels A, C, E, and G) show the most important variables that predict nutrient concentration. Partial dependence plots (panels B, D, F, and H) show the relationship between the dominant model predictors (e.g., mean annual air temperature (MAAT), agriculture, etc.), and nutrient concentration. The y-axes of the partial dependence plots have been normalized between 0 and 1 for model comparison. Four different machine learning models are shown to democratize the results and interpretation.One of the benefits of machine learning is the ability to interpret non-linear relationships between predictors and target variables. Partial dependency plots showed the relationship of major predictors for all four models, and agreement among plots helped strengthen or weaken our confidence in the observed relationships. For example, DOC concentration was strongly associated with precipitation in lake models and wetland cover in stream models (Fig 6A). DOC partial dependency plots (Fig 6B) showed that lake catchments with annual precipitation of ~400–600 mm had the highest DOC concentration, and that catchments with high annual precipitation (e.g., > 800 mm) were predicted to have the lowest DOC. Partial dependency plots for wetland cover in stream catchments showed a strong positive trend between wetland cover and DOC concentration, though the trend is more logarithmic than linear. NO3- lake models showed strong associations with agriculture, forest, and urban areas, and stream models were strongly associated with agriculture and elevation (Fig 6C). As to be expected, the models agreed that greater proportions of agricultural land cover increased NO3- concentration in lakes and streams (Fig 6D). Partial dependency plots for elevation, however, showed less agreement among models. For streams, three of the models predicted highest concentrations at low elevations (e.g., 0–500 m), but the multi-layer perceptron model predicted an almost linear relationship with the highest concentrations at the highest elevations (> 2,000 m; Fig 6D). For lakes, elevation partial dependency plots also showed varying results, and the multi-layer perceptron model predicted a linear but opposite relationship to the stream model (i.e., lowest concentrations at highest elevations). TN was also associated with catchment characteristics such as forests, agriculture, and elevation, though none of these were particularly strong predictors on their own (Fig 6E). TP concentration in lakes was strongly associated with annual precipitation and to a lesser extent with elevation. Streams were also loosely associated with precipitation and MAAT (Fig 6F). Precipitation had a generally negative relationship with TP concentration, and like DOC, the highest concentration occurred in moderately wet climates (e.g., 400–800 mm; Fig 6H). We also modeled subcatchment leverage to gain insight into the drivers of flux dynamics and observed similar trends. The subcatchment leverage model results are reported in the supporting information (S7B, S8 and S9 Figs).
Discussion
In this study, we used spatially extensive stream and lake chemistry data collected by the EPA to test hypotheses about trends and drivers of surface water nutrient concentrations in the U.S. Contrary to our hypotheses, we did not observe monotonic declines in most nutrient concentrations despite known decreases in nutrient inputs for many regions [23,40]. In the following sections, we draw on various results from this study and the broader literature to evaluate what these complex spatiotemporal patterns could mean for nutrient management and water quality in the U.S. and elsewhere.
Have excess nutrients peaked or plateaued?
Environmental legislation such as the U.S. Clean Water Act has been largely successful in reducing point source pollutants, including wastewater effluent, feedlot outflows, and industrial discharge [12]. These decreases in nutrient inputs are having measurable effects on some water quality parameters, which should be recognized and celebrated [24,79,80]. However, our study and other recent work [14,23,81] highlight that the goal of the Clean Water Act “to restore the chemical, physical, and biological integrity of U.S. waters” has not been fully realized. From a management perspective, one of the primary questions is now whether the lack of progress in nutrient concentrations is a temporary condition preceding a gradual decline or a sign of stabilization at a new plateau.Though reductions in NOx emissions and wastewater nutrient release have improved nutrient pollution in some areas, persistent loading from diffuse sources and long-term nutrient legacies have hindered progress [35,54,56,82]. Indeed, the results in this study showed that nutrient concentrations in lakes and streams nationwide either increased or remained static until the most recent NLA 2017 and NRSA 2018 surveys, which exhibited downward trends for most nutrients. This downward trajectory is certainly encouraging, but it largely represents an incremental step toward the range of the earliest surveys’ nutrient conditions, and could be due to interannual hydrological variability or non-stationarity more generally associated with other factors. One explanation for this water quality plateau could be that we will not see major improvements until nutrient legacies are depleted [16,80]. If legacies are the cause of the lack of progress, current nutrient management efforts should be sustained where in place and applied to areas that currently are not managed.Another explanation for the nutrient plateau is that nutrient inputs have not decreased sufficiently to tip watersheds into net nutrient decline [26,35,83]. If this is the case, we need a major change to our overall management approach regarding method, intensity, and/or location. For example, this study and much previous work suggests focusing best practices in the watersheds acting as critical source areas [58,84]. While this may not seem tractable at regional to continental scales, we suggest that it would be much more cost effective to invest in adequate monitoring and spatiotemporal analysis prior to targeted intervention in the subcatchments with highest leverage.In either scenario (i.e., peak or plateau), the spatiotemporal complexity of water quality trends requires renewed innovation in data analysis to assess the effectiveness of current conservation measures. To determine whether the current lack of progress is a transition into a new water quality era or not, we need creative analysis of monitoring data and new modeling approaches to assess mass-balance [38,79,85]. New methods for combining spatiotemporal water chemistry monitoring with nutrient inventories and reactive transport models [17,34,37,44,86] could test many of the hypotheses raised by this study. Likewise, repeated and spatially extensive sampling such as these EPA surveys combined with remote sensing and nutrient retention modeling could be sensitive indicators of progress and management priorities [18,34,43,60,87]. Because the question of whether current management is adequate to result in ecological recovery is so fundamental to the goals of the Clean Water Act, we suggest more specific measures later in the discussion to hopefully generate more rapid water quality improvements.
Spatially persistent nutrients across ecoregions
We found that nutrients in many ecoregions are spatially persistent, increasing confidence that our analysis is representative of spatial nutrient dynamics and are not just an artifact of climatic variability. This high degree of spatial persistence could be evidence of three non-exclusive factors. First, it could be due to nutrient legacies. Even though nutrient inputs have changed through time in certain regions of the country, the long residence time of water and nutrients in watersheds could result in persistent spatial patterns [27,88,89]. Second, ecological differences in watershed characteristics could create large differences in nutrient retention, contributing to spatial persistence. There is increasing evidence that these watershed-scale differences in retention are associated with subsurface characteristics, including the soil, vadose zone, and groundwater [36,37]. Third, the location of nutrient sources may not have changed as much as expected. The locations that are best suited for agricultural or urban development are not randomly distributed in space, nor can they change easily through time [90]. Though practices have improved through time and atmospheric deposition has decreased in some areas [90], the relative distribution of nutrient loading and nutrient retention appears to have persisted.These results are consistent with national inventories that found that areas of high N inputs and surpluses were the same from 2002 to 2012 [50]. High spatial persistence further increases our confidence in the efficacy of infrequent, spatially extensive surveys [6]. While large surveys, such as the EPA NARS dataset require substantial investment of time and analytical capacity, the spatial persistence of most nutrients demonstrates the efficacy of the NARS design and provides critical information to improve watershed management. Citizen science approaches could allow continued and even expanded synoptic sampling at low cost, while also providing valuable experiences that can improve public understanding and willingness to support water conservation and protection measures [35,45,91-93].One important exception to the pattern of spatial persistence was NO3- in lakes. This result was particularly interesting because the opposite was observed for NO3- in streams and for TN in both waterbody types. We expect that low spatial persistence of NO3- is primarily due to high temporal variability of biogeochemical and hydrological processes in lakes (e.g. stratification and turnover) [48,94]. Indeed, in situ nitrification and denitrification rates in lakes are highly variable and depend on water depth, residence time, and microbial community [94,95]. Moreover, lakes were sampled across the spring and summer months when lake stratification and turnover could have contributed to low persistence of NO3-, which is particularly sensitive to redox conditions [36,96]. These dynamics could mean that more frequent spatially extensive sampling of lakes is needed to adequately represent short- and long-term variability, and consequently there is inherent uncertainty in lake nutrient trends identified here.
Locally sourced nutrients
Our analysis of the spatial scale of variance found that landscape patches that contribute or retain nutrients were relatively small (~250 km2), though this varied across ecoregions (Figs 3 and S6). We note that patch size can also change on event, seasonal, and inter-annual timescales [43,48,97-99], and we do not imply that the patch sizes determined here are inherent or permanent. However, the consistent pattern of patches at the tens to hundreds of square kilometer scales reinforces the conclusion that these intermediate scales are ecologically relevant to the release and retention of nutrients in watersheds of the U.S. From an administrative perspective, these scales (~250 km2) correspond to regional governmental units such as municipalities, counties, or states, emphasizing the importance of collaboration with local stakeholders to develop and implement durable and effective management approaches. Differences in patch size among nutrients offered some insight into possible drivers of specific nutrients. In particular, patch size of TP sources and sinks were consistently smaller than TN, highlighting potential hotspots of legacy P eroding from the landscape (e.g., in areas surrounding concentrated animal feeding operations or P-rich minerals).
Prioritizing highly influential catchments to improve water quality
This study provides further evidence that U.S. headwater catchments set the initial nutrient state as the majority of downstream reaches were neither major sources nor sinks in the stream network [42,100]. A surprising exception to this spatial pattern was in the lower Mississippi River Basin where nutrient concentrations suggested a strong removal signature along the mainstem of the river. Some of the major control points occurred at the confluence of the Arkansas and Ohio Rivers with the Mississippi River, as well as further downstream where retention reservoirs have been constructed for nutrient capture. The continental scale analysis of nutrient source and sink dynamics raised a long-standing question for water quality management: how much of the observed changes in nutrient concentrations can be accounted for by decreased retention versus increased loading in a catchment? In other words, would more water quality improvements be possible if we decreased nutrient loading instead of (or in conjunction with) focusing on landscape management strategies such as constructing riparian zone buffers? Our analysis of subcatchment leverage showed that most catchments exhibit neutral effects on overall nutrient export, as such downstream nutrient loads into surface water largely maintain observed nutrient concentrations, even in many areas where nutrient inputs are known to be quite large (Figs 4 and 5). This suggests that most catchments are highly efficient nutrient processors and could respond favorably to decreased loading. To reduce waterbody impairment, our data support a dual approach that first and foremost prioritizes reducing nutrient inputs to catchments that exert disproportionate influence on downstream water chemistry and increasing hydrological connectivity within stream networks (where effectual) to enhance nutrient removal capacity [36,101].Because headwaters largely determine downstream water quality [102], regional efforts to decrease nutrient inputs and surpluses could yield promising results [42,49,103]. Indeed, localized management efforts in the Chesapeake Bay clean up and elsewhere in the U.S. have led to improvements in water quality that can be replicated elsewhere [50,56,104-107]. This emphasizes the need to craft regional solutions and ensure entities or individuals at the local level are empowered to achieve them (i.e., subsidiarity). The needed information and resources to achieve further improvements include: 1) continued innovation in nutrient use efficiency in agriculture [55], 2) educational outreach and incentives to further improve nutrient management on farms [108,109], 3) documenting nutrient inputs and surpluses across the landscape to identify inefficiencies in the handling and use of N and P [50], 4) identifying areas of the landscape disproportionately degrading downstream water quality [80,98], and 5) establishing clear watershed-wide nutrient goals towards which local communities can strive [81,110]. Certainly, to assess the effectiveness of local decisions in achieving water quality goals, standardized tracking of management actions (e.g., fertilizer use, cover crops) and continued water quality monitoring is essential. The latter is generally fulfilled by the EPA National Aquatic Resource Surveys (complimented by a network of intensively monitored sites from other state and federal organizations), and the former is largely addressed with recent nutrient inventory work though standardized accounting of other management actions like implementation of cover crops and tillage practices across the landscape [50,53].
Conclusions
This study advances the understanding of spatial and temporal nutrient dynamics across the U.S. and encourages water quality managers to consider the watershed context and implement nutrient reduction strategies locally. Our analysis of the EPA National Aquatic Resource Surveys at the national and ecoregion levels revealed spatially persistent nutrient dynamics in most regions of the country. We also determined the patch size of nutrient sources and sinks and identified likely areas of water quality degradation across the United States. These analyses indicated a need for local interventions within highly influential catchments, in addition to general reductions of nutrient loading. Most catchments are already efficient nutrient processors; although nutrient retention pathways are not linear functions, we expect that reducing nutrient loads will lead to a net reduction in nutrient fluxes and concentrations at regional and national levels.
Variability of catchment sizes among the National Aquatic Resource Surveys.
Boxplots show the interquartile range and median (middle horizontal line) and notches show the 95% confidence interval around the median.(DOCX)Click here for additional data file.
Spatial persistence at the national level for all major nutrients and ions.
The red line (y = √0.5) delineates where at least half of the spatial pattern is preserved across samplings.(DOCX)Click here for additional data file.
Variance collapse thresholds determined by PELT analysis for each ecoregion.
(DOCX)Click here for additional data file.
Variability of specific discharge at neighboring USGS reference stations for the EPA National Aquatic Resource Surveys by ecoregion.
Boxplots show the interquartile range and median (middle horizontal line) of discharge data. Notches show the 95% confidence interval around the median.(DOCX)Click here for additional data file.
Boxplots of Strahler stream order and catchment area.
The horizontal gray line marks where the 250 km2 patch size lies in relation to stream order. Stream orders ≤ 3 are considered headwater streams.(DOCX)Click here for additional data file.
Subcatchment leverage by ecoregion.
(DOCX)Click here for additional data file.
R-squared values for machine learning models.
Models predicted nutrient concentration (plot A) and subcatchment leverage (an estimate of nutrient flux; plot B) using catchment characteristics and climate variables from the EPA National Aquatic Resource Surveys and WorldClim database.(DOCX)Click here for additional data file.
Machine learning models illustrating the relationship between catchment characteristics, climate variables, and subcatchment leverage (an estimate of nutrient flux).
Bi-plots between lake and stream model predictor importance show the most important predictors among the models predicting subcatchment leverage.(DOCX)Click here for additional data file.
Partial dependence plots show the modeled relationship between subcatchment leverage (an estimate of nutrient flux) and the primary predictors for lakes and streams.
They show how the predictions partially depend on the input variables of interest and are able to depict whether the relationship is linear, curvilinear, or a step function. The plots above show the partial dependence plots for the major predictors of DOC (A), NO3- (B), TN (C), and TP (D) subcatchment leverage for lakes and streams.(DOCX)Click here for additional data file.
Summary of the number of sites in each EPA National Aquatic Resource Survey (NARS).
(DOCX)Click here for additional data file.
Nutrient summary statistics per ecoregion.
(DOCX)Click here for additional data file.
Paired t-test results (p-values) and percent change from repeat sample sites between surveys.
(DOCX)Click here for additional data file.12 May 2021PONE-D-21-01607Limited progress in nutrient pollution in the U.S. caused by persistent nutrient sourcesPLOS ONEDear Dr. Frei,Thank you for submitting your manuscript to PLOS ONE. First, I apologize for the delay in returning this manuscript due to the difficulty in getting reviewers' reports. Several reviewers agreed to review the manuscript but never provided comments; thus, I had to seek more reviewers, causing further delay. Second, after careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.Here are some additional comments:One reviewer has made excellent comments, including (1) using watersheds than ecoregions as the latter does not always follow watershed boundaries and (2) making the study novel by using the reviewer's suggestion on restructuring the manuscript.The dataset used in the study is till 2012. Is there no data available after 2012?The words persistence and leverage are not common in biogeochemistry research. I noticed that you indicated a brief definition in the introduction. If possible, attempt to simplify the definition and include it in the results and discussion sections, as appropriate.I found the article difficult to read (including methods), see what could be done to facilitate reading.One reviewer included a Word file with comments.Please submit your revised manuscript by Jun 26 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.Please include the following items when submitting your revised manuscript:A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.We look forward to receiving your revised manuscript.Kind regards,Gurpal S. Toor, Ph.D.Academic EditorPLOS ONEJournal requirements:When submitting your revision, we need you to address these additional requirements.1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found atand2. We note that Figures 1 and 5 in your submission contain map/satellite images which may be copyrighted.All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For these reasons, we cannot publish previously copyrighted maps or satellite images created using proprietary data, such as Google software (Google Maps, Street View, and Earth). For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.We require you to either (a) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (b) remove the figures from your submission:a. You may seek permission from the original copyright holder of Figures 1 and 5 to publish the content specifically under the CC BY 4.0 license.We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission.In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”b. If you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.The following resources for replacing copyrighted map figures may be helpful:USGS National Map Viewer (public domain): http://viewer.nationalmap.gov/viewer/The Gateway to Astronaut Photography of Earth (public domain): http://eol.jsc.nasa.gov/sseop/clickmap/Maps at the CIA (public domain): https://www.cia.gov/library/publications/the-world-factbook/index.html and https://www.cia.gov/library/publications/cia-maps-publications/index.htmlNASA Earth Observatory (public domain): http://earthobservatory.nasa.gov/Landsat: http://landsat.visibleearth.nasa.gov/USGS EROS (Earth Resources Observatory and Science (EROS) Center) (public domain): http://eros.usgs.gov/#Natural Earth (public domain): http://www.naturalearthdata.com/[Note: HTML markup is below. Please do not edit.]Reviewers' comments:Reviewer's Responses to QuestionsComments to the Author1. Is the manuscript technically sound, and do the data support the conclusions?The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.Reviewer #1: YesReviewer #2: YesReviewer #3: Yes**********2. Has the statistical analysis been performed appropriately and rigorously?Reviewer #1: YesReviewer #2: YesReviewer #3: Yes**********3. Have the authors made all data underlying the findings in their manuscript fully available?The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.Reviewer #1: YesReviewer #2: YesReviewer #3: Yes**********4. Is the manuscript presented in an intelligible fashion and written in standard English?PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.Reviewer #1: YesReviewer #2: YesReviewer #3: Yes**********5. Review Comments to the AuthorPlease use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)Reviewer #1: I was really excited to read this paper about the persistence of nutrient loading across the United States. The paper introduction raises the interesting question of variation in nutrient persistence pathways: how surface water responds to reductions in nutrient inputs might follow various pathways depending on X, Y, Z. Unfortunately, the authors didn’t address this exciting question with their analyses – instead the analysis focused on understanding what land use was associated with concentrations of N, P and DOC. I think there is a lot of potential here, but I struggled with the disconnect between the gap articulated in the introduction and the analyses. I think there are two pathways forward: 1) keep the analysis as is. The gap could then be that there have been analyses of controls on nutrients, but they could be affected by temporal variation among sampling dates. This analysis tests that assumption – for the most part spatial patterns are persistent, so we can really trust the machine learning analysis. I think this would be a less interesting paper. We already know that land use affects nutrient concentrations, that these affects differ among nutrients. I wasn’t really clear what was novel about that part of the analysis. 2) What I think would be a more novel contribution would be to explore different nutrient trajectories, and where those are happening. I could imagine a similar ML approach, but using changes in nutrient concentrations (or changes in rank within a watershed?) as the response variable. Would allow you to address – what are the different pathways? What land uses are associated with increases/decreases? Are there differences among watersheds (or regions, but see my comments on that below) that are related to regional/local nutrient management approaches? For example, it would be interesting to see if agriculture is associated with increases in some regions and decreases in others (or no change in others) and if that were linked to differences in policies, etc. I recognize that this would be a substantially different paper, but I think the questions raised in the introduction are exciting and addressable using this dataset with some modified analyses.Some more major comments about the analysis approach:1. Spatial persistencea. I struggled with the various ways that spatial persistence could be interpreted. If patterns are the same between sampling points, then that seems straightforward - the patterns are the same, there hasn't been a change. But if the patterns are different, how do you know if that was because the sampling events were not representative of the first time period or because there was a change in the spatial patterns of inputs (or sinks). If you expect differences in responsiveness to point vs diffuse inputs of nutrients, then it would make sense that spatial patterns would change (assume that point sources and diffuse inputs are all reduced at the same rate [this is unrealistic, but a useful thought exercise], areas most affected by point sources might experience a greater decrease in concentration compared to those most affected by diffuse sources). You are trying to use one analysis to assess both spatial and temporal patterns, but I don’t think you can use this analysis to say whether or not snapshot surveys are representative of temporal trends (your question 2).b. A second question about the use of these stems from the statement in the text that: “if solutes have low spatial persistence (perhaps due to asynchronous variation) then analyses of patch size and nutrient contribution made for a single moment in time are not reliable indicators of long-term nutrient dynamics.” If low spatial persistence means that you cannot assess things like patch size, were regions/analytes with low persistence excluded from patch size and leverage analyses? Or, would it be interesting to see how patch size and leverage change for those between the two sampling times?2. Use of ecoregionsa. These ecoregions are really large and encompass a wide range of ecosystems, climate, nutrient management strategies, nutrient management challenges. Some more focused discussion about how you expected patterns to vary among ecoregions based on their hydrology, climate, vegetation, or land use would help justify the use of ecoregions and interpret differences among ecoregions. As is, there are interesting differences among regions, but the reasons for these differences are not explore.b. I was very confused about the estimates of leverage using the ecoregions. These are not watersheds, so it doesn’t make sense how or why you would calculate leverage for all the catchments within a region relative to the largest catchment in the region. For the temperate eastern forests, the ecoregion “outlet” would be the Mississippi River, but many (most?) of the “subcatchments” are not within the Mississippi River basin, and many of the subcatchments of the Mississippi River basin are not within that ecoregion. It seems to me that using watershed boundaries rather than ecoregions would make much more sense for these analyses which are designed for making comparisons within watersheds.3. Mass balance approacha. This really relies on specific discharge being consistent within a watershed or region, which is not the case (as your Figure S4 illustrates well, especially for regions 5,6,7 and 8 in which specific discharge varies by what looks like an order of magnitude).Specific line comments:Line 32: how are you defining nutrient resilience? I'm not sure how you reach this conclusion from the analysesLine 53: Spatiotemporal patterns of what?Lines 54-57: Can you say more about these trajectories - what are they? Is the point here to refute the legacy hypoth? Or just say that there are more patterns?Lines 57-59: unclear here exactly what is and isn't known, what the gap is, besides a need for expanded analysis. What will the expanded analysis tell us?Line 64: unclear what the correlations are between - the locations at diff times?Line 69: Can you define leverage here?Lines 77-79: I don’t think you can really address Q2 since you aren't comparing the snapshots with more temporally resolved dataLine 80 (Q4): ok - but you focus your results on the concentration results, not the leverage results. If the leverage results are what are most important, then it might make more sense to put those in the paper and put the concentration results in the supplementalLines 113-117: how would other results change if you restricted the 2008 study to include the same range of catchment sizes?Line 129: boxplot is a visualization tool, not a hypothesis test, but more importantly, unclear what is being compared to what in these boxplots and t-testsLine 131: is there evidence that nutrient management would vary across these regions?Do you have specific hypotheses about how patterns of nutrient persistence might vary across the regions? I understand that they are different, but how do you expect those differences to manifest in the response variables that you are focusing on? I'd be concerned that the differences are by chance, rather than because of something relevant to the ecoregion or how the ecoregion was designated.Line 170-171: Does this mean that other analyses only valid for areas with spatial persistence?Line 181: a table with the different parameters would be really helpful – how they are calculated, inferencesLine 186-187: What was analyzed for these parameters?Line 197: are these done for all samples/sites? Or just where there was spatial persistence? Was it done for a single sampling point (early/later?) or all data together?Line 226: how are you calculating leverage if not all the subwatersheds are within the largest watershed within each region?Lines 227-229: you found similar specific discharge between dates, but not within ecoregions, which was the assumption that you were testing.Lines 241-243: these interpretations are based on assumption of consistent within-region specific discharge. How might violations of that assumption (documented in S4, but maybe worth pulling the data that are associated with Q data and seeing how sensitive your results are to this assumption?) affect your interpretations?Line 252: how were these models selected? What are the important differences among them?Line 268: what data were missing? Wasn't this dataset based on the water quality data? How many data points were removed for missing values?Line 285: really over the two 5-year study periods, since you only compared changes over 5 years.Lines 285-289: reminder needed here - are these based on the paired sites only?Line 315-316: curious about how you interpret the differences in patch-size between continental and regional analyses - patch is scale-dependent (this isn't really surprising), but what does it mean for nutrient management?Lines 335-336: shouldn't this be stated, “most catchments had low leverage”?Line 341: dilution doesn't change the mass balance. I suspect that some of these extreme outliers are due to violations of the assumption that specific discharge is the same within ecoregions.Line 403: has MAAT been defined? If so, it's been a while and worth spelling out here.Line 426: how do we know persistence is due to diffuse sources and not point sources (or both)? If concentrations are increasing or staying the same, that might indicate no progress on either front.Line 431: drivers of nutrient concentrationsLine 435: this seems like a distinct point from the first one, but I'm not sure I understand it.Lines 445: Or....5 years is not a very long time to measure the impacts of nutrient management. I would be really interested to see an analysis that focuses in on the outliers here. I know the persistence analysis is for the whole dataset (or by region), but could you pull out those sites that changed a lot in rank (or even just look at change in rank as a response variable?) to understand why nutrients persist in some places and not in others. The threshold for persistence seems liberal in my view - at the threshold there is still a lot of change! At any rate, I'm confused about the interpretation of the persistence story, if it's the same, then there hasn't been a change, and you can do all the other analyses, if not then you can say that these are real spatial patterns and do the land use analysis. But I don't think you can use this (esp with a 5 year period) to say that we're not making progress with nutrient management. It's not a long time period and the analysis approach can be interpreted in importantly different ways.Lines 455-456: also to me means that ALL sources of nutrients have persisted, and that there isn't a difference in the persistence of point and diffuse sources over this time periodLines 460-462: BUT - this also only captures spatial patterns of nutrient inputs during baseflow. Seasonal high flows or storm events may activate different sources that may or may not be spatially correlated with baseflow sources and may nore may not be spatially persistent.Lines 479: but differences in patch size between regional and continental scale analyses suggests that patch size varies with scale of analysis. If you had access to data for smaller catchments, you might find yet another answer.Line 511: Can you define scale of headwaters again here?Lines 517 onward: Can you link these implications specifically to the model results?Lines 532-533: spatial patterns were persistent, but the increases in concentrations is relevant too!Lines 534: can you highlight some of these? Is this new? Comparisons with other analyses?Fig 2. Is the persistence value for NO3 in region 13 missing? X-axis label missing for panel aFig 4a. these are really hard to see because the axes are such large ranges and most of the values are smallFig 4b. The removal/production don't seem quite right here since you are assuming constant specific discharge. I see the value of making the simplifying assumption, but it does challenge this interpretationFig 6 scatterplots: maybe because of the transparency - the wetland points look like the urban color, and other shades are also inaccurate between plot and legend.how did the year of sampling play out in these models? Would be really neat if you could see how the models vary from year to yearReviewer #2: Line 86. Is a five year period really adequate to determine temporal trends. Please justify.Line 398-399. SUrprised ag did not have greater impact on nitrate.Line 436-438. WHere is the analysis on what the evolution of ag practices was during your time frame? Many areas would have had very little change during your study period. To call out the potential role of ag management changes during this period need to have more information on this.Line 526-528. THere is little on fertilizer use especially distribution of use and rates. Might be worth calling out there is a need in this area.Reviewer #3: Some of the supplementary figures embedded in the MS are in EPS format which was inaccessible from my device. Make sure to save it in correct format and check its accessibility. Specifically, check the format of the figures S1 through S10.**********6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.If you choose “no”, your identity will remain anonymous but your review may still be made public.Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.Reviewer #1: NoReviewer #2: NoReviewer #3: No[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.Submitted filename: Limited progress in nutrient pollution_Review.docxClick here for additional data file.6 Aug 2021Dear Reviewers,Thank you for your careful and thorough reviews. We have addressed each point in the attached Response to Reviewers document.Thank you,Rebecca Frei on behalf of all coauthorsSubmitted filename: Frei_ResponsetoReviewers.docxClick here for additional data file.28 Sep 2021PONE-D-21-01607R1Limited progress in nutrient pollution in the U.S. caused by spatially persistent nutrient sourcesPLOS ONEDear Dr. Frei,Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.==============================ACADEMIC EDITOR COMMENTS:Dear Dr. Frei,You have done an outstanding job addressing most of the reviewer's comments, except for one comment on providing a solid justification for using ecoregions rather than watersheds. I agree with the reviewer's comment that using watersheds rather than ecoregions may have allowed you to compare and make interpretations across regions. However, at this point in the review, I am asking that you include the limitations of using ecoregions and make a point that future studies should use watersheds for the reasons suggested by the reviewer and others.I hope you can address this comment quickly so that the manuscript can move forward to the next step.Best wishes.Gurpal==============================Please submit your revised manuscript by Nov 12 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.Please include the following items when submitting your revised manuscript:If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.We look forward to receiving your revised manuscript.Kind regards,Gurpal S. Toor, Ph.D.Academic EditorPLOS ONEJournal Requirements:Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.[Note: HTML markup is below. Please do not edit.]Reviewers' comments:Reviewer's Responses to QuestionsComments to the Author1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.Reviewer #1: (No Response)Reviewer #3: All comments have been addressed**********2. Is the manuscript technically sound, and do the data support the conclusions?The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.Reviewer #1: PartlyReviewer #3: Yes**********3. Has the statistical analysis been performed appropriately and rigorously?Reviewer #1: YesReviewer #3: Yes**********4. Have the authors made all data underlying the findings in their manuscript fully available?The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.Reviewer #1: (No Response)Reviewer #3: Yes**********5. Is the manuscript presented in an intelligible fashion and written in standard English?PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.Reviewer #1: YesReviewer #3: Yes**********6. Review Comments to the AuthorPlease use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)Reviewer #1: The authors have done a thorough revision, and the resulting manuscript is much stronger and clearer. I was also excited to see the inclusion of new data and the longer time period of their analysis. They did a great job revising the introduction and discussion to better match their analyses, and the methods now provide better detail about the many metrics that they use in the paper. The revisions address nearly all of my concerns.My one remaining concern is the use of ecoregions as the unit of analysis for the leverage analysis. I would suggest that your rationale for using them include the good reasons that you can't use watersheds - but it doesn't mean that ecoregions are an appropriate substitute. The analysis doesn't make physical sense if the watersheds being compared aren't hydrologically connected. I don't see how you can make any valid inferences from this analysis. I'm also confused by your third point - many of the ecoregions you use are very large and don't have similar land use or climate (e.g., Maine and Florida). While you acknowledge this in the methods, the results and discussion interpret the data as if these were nested watersheds – referring to smaller watersheds as “subwatersheds” and making inferences about nutrient retention between non-nested watersheds. I don’t think the leverage analysis is critical to the story here, and I would suggest cutting it. Otherwise, I think the paper is really nice.For the trends, I’m curious whether the authors think these are due to variation in nutrient inputs and retention alone, or how much could be caused by interannual variability in hydrology. This could be a small point for the discussion.Reviewer #3: (No Response)**********7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.If you choose “no”, your identity will remain anonymous but your review may still be made public.Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.Reviewer #1: NoReviewer #3: No[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.4 Oct 2021Please see Response to Reviewers document.Submitted filename: Response to reviews round 2.docxClick here for additional data file.11 Oct 2021Limited progress in nutrient pollution in the U.S. caused by spatially persistent nutrient sourcesPONE-D-21-01607R2Dear Dr. Frei,We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.Kind regards,Gurpal S. Toor, Ph.D.Academic EditorPLOS ONEAdditional Editor Comments (optional):Congratulations!Reviewers' comments:16 Nov 2021PONE-D-21-01607R2Limited progress in nutrient pollution in the U.S. caused by spatially persistent nutrient sourcesDear Dr. Frei:I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.If we can help with anything else, please email us at plosone@plos.org.Thank you for submitting your work to PLOS ONE and supporting open access.Kind regards,PLOS ONE Editorial Office Staffon behalf ofDr. Gurpal S. ToorAcademic EditorPLOS ONE
Authors: Camille Vautier; Tamara Kolbe; Tristan Babey; Jean Marçais; Benjamin W Abbott; Anniet M Laverman; Zahra Thomas; Luc Aquilina; Gilles Pinay; Jean-Raynald de Dreuzy Journal: Sci Total Environ Date: 2021-05-08 Impact factor: 7.963
Authors: Xin Zhang; Eric A Davidson; Denise L Mauzerall; Timothy D Searchinger; Patrice Dumas; Ye Shen Journal: Nature Date: 2015-11-23 Impact factor: 49.962
Authors: Andrew Sharpley; Helen P Jarvie; Anthony Buda; Linda May; Bryan Spears; Peter Kleinman Journal: J Environ Qual Date: 2013-09 Impact factor: 2.751
Authors: Matthew J Ascott; Daren C Gooddy; Owen Fenton; Sara Vero; Rob S Ward; Nandita B Basu; Fred Worrall; Kimberly Van Meter; Ben W J Surridge Journal: Sci Total Environ Date: 2021-03-24 Impact factor: 7.963
Authors: Jana E Compton; Ryan A Hill; Alan T Herlihy; Robert D Sabo; J Renée Brooks; Marc Weber; Brian Pickard; Steve G Paulsen; John L Stoddard Journal: Environ Sci Technol Date: 2021-06-01 Impact factor: 11.357