| Literature DB >> 22457586 |
Gregory E Schwarz, Richard B Alexander, Richard A Smith, Stephen D Preston.
Abstract
This analysis modifies the parsimonious specification of recently published total nitrogen (TN) and total phosphorus (TP) national-scale SPAtially Referenced Regressions On Watershed attributes models to allow each model coefficient to vary geographically among three major river basins of the conterminous United States. Regionalization of the national models reduces the standard errors in the prediction of TN and TP loads, expressed as a percentage of the predicted load, by about 6 and 7%. We develop and apply a method for combining national-scale and regional-scale information to estimate a hybrid model that imposes cross-region constraints that limit regional variation in model coefficients, effectively reducing the number of free model parameters as compared to a collection of independent regional models. The hybrid TN and TP regional models have improved model fit relative to the respective national models, reducing the standard error in the prediction of loads, expressed as a percentage of load, by about 5 and 4%. Only 19% of the TN hybrid model coefficients and just 2% of the TP hybrid model coefficients show evidence of substantial regional specificity (more than ±100% deviation from the national model estimate). The hybrid models have much greater precision in the estimated coefficients than do the unconstrained regional models, demonstrating the efficacy of pooling information across regions to improve regional models.Entities:
Year: 2011 PMID: 22457586 PMCID: PMC3307635 DOI: 10.1111/j.1752-1688.2011.00581.x
Source DB: PubMed Journal: J Am Water Resour Assoc ISSN: 1093-474X
FIGURE 1The Residuals Estimated From the 1992 National SPARROW Models for (A) Total Nitrogen and (B) Total Phosphorus (Alexander ). The residuals for total nitrogen are weighted to account for uncertainty in the monitored load estimates (Alexander ). A negative residual implies overprediction of the SPARROW model. The number of monitoring sites in the region is given in parentheses.
The Reduction in the Root Mean Squared Error (RMSE) of Regional Models Incorporating Varying Degrees of Region-Specific Fixed Effects for Land and Aquatic Processes, as Compared to the 1992 National SPARROW Models for Total Nitrogen (TN) and Total Phosphorus (TP) (Alexander )
| Alternative Model Specification | TN | TP | |||
|---|---|---|---|---|---|
| Regional Effects | Process Coefficients for Which Region-Specific Fixed Effects Are Incorporated | Reduction in RMSE Compared to National Model Without Regional Fixed Effects | Number of Fixed-Effects Coefficients | Reduction in RMSE Compared to National Model Without Regional Fixed Effects | Number of Fixed-Effects Coefficients |
| All regions | Land and aquatic | 0.055 | 54 | 0.073 | 45 |
| Land only | 0.046 | 50 | 0.058 | 41 | |
| Aquatic only | 0.008 | 22 | 0.033 | 19 | |
| East region only | Land and aquatic | 0.025 | 36 | 0.039 | 30 |
| Land only | 0.026 | 34 | 0.037 | 28 | |
| Aquatic only | <0.001 | 20 | <0.001 | 17 | |
| Northwest region only | Land and aquatic | 0.026 | 36 | 0.044 | 30 |
| Land only | 0.018 | 34 | 0.024 | 28 | |
| Aquatic only | 0.007 | 20 | 0.034 | 17 | |
| Southwest region only | Land and aquatic | 0.035 | 36 | 0.053 | 30 |
| Land only | 0.032 | 34 | 0.041 | 28 | |
| Aquatic only | 0.007 | 20 | 0.029 | 17 | |
Notes: Statistical significance of a reduction is determined using an F-test, with
denoting significance at the 0.001 level
denoting significance at the 0.01 level; RMSE reductions without asterisks are not significant at the 0.05 level.
Calculated as the national model RMSE minus the regional fixed-effects model RMSE. Reduction in RMSE of the natural log residuals can be interpreted as the approximate reduction in the standard error of a prediction of load for a reach, expressed as a share of the predicted load in the reach.
The national model for TN has 18 coefficients.
The national model for TP has 15 coefficients.
Summary Statistics for the National, Regional Fixed-Effects, and Hybrid SPARROW Models for Total Nitrogen (TN) and Total Phosphorus (TP)
| TN | TP | |||||
|---|---|---|---|---|---|---|
| National Model | Regional Fixed-Effects Model | Hybrid Model | National Model | Regional Fixed-Effects Model | Hybrid Model | |
| Number of observations | 425 | 425 | 425 | 425 | 425 | 425 |
| Number of coefficients | 18 | 54 | 54 | 15 | 45 | 45 |
| Number of coefficients with binding physical constraints | 0 | 5 | 4 | 0 | 1 | 1 |
| Number of cross-region constraints | 0 | 0 | 22 | 0 | 0 | 28 |
| Error degrees of freedom | 407 | 376 | 397 | 410 | 381 | 409 |
| Percent of coefficients with | 72 | 57 | 76 | 80 | 41 | 80 |
| Average coefficient of variation | 0.403 | 0.891 | 0.549 | 0.369 | 2.416 | 0.389 |
| Sum of squared errors (SSE) | 124.3 | 93.3 | 101.3 | 238.1 | 180.9 | 215.8 |
| Mean squared error (MSE) | 0.305 | 0.248 | 0.255 | 0.581 | 0.475 | 0.528 |
| Root mean squared error (RMSE) | 0.553 | 0.498 | 0.505 | 0.762 | 0.689 | 0.726 |
| 0.933 | 0.953 | 0.945 | 0.870 | 0.902 | 0.883 | |
| Adjusted | 0.930 | 0.947 | 0.942 | 0.866 | 0.891 | 0.878 |
| Yield | 0.866 | 0.906 | 0.891 | 0.684 | 0.760 | 0.714 |
| Median absolute pct. prediction error | 42.3 | 40.2 | 38.4 | 67.1 | 52.6 | 59.1 |
| Significance of Moran's | <0.001 | 0.144 | 0.026 | <0.001 | 0.023 | <0.001 |
For the calculation of the percent of coefficients with p ≤ 0.05, a one-tailed p-value is used for coefficients subject to physical bounds (all source and aquatic removal coefficients); otherwise, a two-tailed p-value is used.
The standard error of a coefficient estimate divided by its estimated value, averaged over all coefficients in the model.
Yield R2 adjusts the R2 to account for the area of each observation's upstream basin, thereby removing from R2 the inflating effects of a wide range of basin scales (Schwarz ).
Median absolute error in prediction, expressed as a percent of the monitored load, for 678 TN and 865 TP stations not used in the estimation of the national, regional fixed-effects, or hybrid models.
FIGURE 2Flow Diagram Describing the Intermediate Steps Applied to the Results of the Regional Fixed-Effects Model to Obtain the Hybrid Model.
FIGURE 3A Comparison of Regional Fixed-Effects and Hybrid Model Estimates for Total Nitrogen (TN). Models are based on data from Alexander . Plotted values represent the coefficient estimate expressed as a percent difference from the national model estimate (see Alexander , for the national model coefficient estimates). Circled values pertain to regional fixed-effect and hybrid model coefficients that are constrained in the stepwise procedure and consequently generate a constraint in the estimation of the hybrid model. Note that the horizontal scale is compressed in both the right and left margins, so much so that values on the left and rightmost reference lines are unresolved.
FIGURE 4A Comparison of Regional Fixed-Effects and Hybrid Model Estimates for Total Phosphorus (TP). Models are based on data from Alexander . Plotted values represent the coefficient estimate expressed as a percent difference from the national model estimate (see Alexander , for the national model coefficient estimates). Circled values pertain to regional fixed-effect and hybrid model coefficients that are constrained in the stepwise procedure and consequently generate a constraint in the estimation of the hybrid model. Note that the horizontal scale is compressed in both the right and left margins, so much so that values on the left and rightmost reference lines are unresolved.