| Literature DB >> 29670010 |
Tiffany VanDerwerker1, Lin Zhang2, Erin Ling3, Brian Benham4, Madeline Schreiber5.
Abstract
We investigated if geologic factors are linked to elevated arsenic (As) concentrations above 5 &mu;g/L in well water in the state of Virginia, USA. Using geologic unit data mapped within GIS and two datasets of measured As concentrations in well water (one from public wells, the other from private wells), we evaluated occurrences of elevated As (above 5 &mu;g/L) based on geologic unit. We also constructed a logistic regression model to examine statistical relationships between elevated As and geologic units. Two geologic units, including Triassic-aged sedimentary rocks and Triassic-Jurassic intrusives of the Culpeper Basin in north-central Virginia, had higher occurrences of elevated As in well water than other geologic units in Virginia. Model results support these patterns, showing a higher probability for As occurrence above 5 &mu;g/L in well water in these two units. Due to the lack of observations (<5%) having elevated As concentrations in our data set, our model cannot be used to predict As concentrations in other parts of the state. However, our results are useful for identifying areas of Virginia, defined by underlying geology, that are more likely to have elevated As concentrations in well water. Due to the ease of obtaining publicly available data and the accessibility of GIS, this study approach can be applied to other areas with existing datasets of As concentrations in well water and accessible data on geology.Entities:
Keywords: drinking water; groundwater management; logistic regression; statistical modeling; water quality
Mesh:
Substances:
Year: 2018 PMID: 29670010 PMCID: PMC5923829 DOI: 10.3390/ijerph15040787
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Summary of datasets used for this study. VDH = Virginia Department of Health; VAHWQP = Virginia Household Water Quality Program.
| Dataset Variable | VDH Dataset | VAHWQP Dataset |
|---|---|---|
| Number of samples in dataset | 10,261 | 6739 |
| Number of duplicates removed | 9043 | 2325 |
| Final number of samples | 1218 | 4414 |
| Arsenic Reporting Limit (RL) | 5 μg/L | 1 μg/L |
| Sample collection dates | 1973–2013 | 2008–2015 |
| Filtration | None | None |
| Number of samples below RL | 1157 | 4144 |
| Number of samples above RL | 61 | 270 |
Figure 1Distribution of arsenic (As) concentrations in well water in Virginia based on the Virginia Department of Health (VDH) and Virginia Household Water Quality Program (VAHWQP) datasets. Outline of the U.S. showing Virginia in the inset map.
Concentration ranges for each dataset and their respective percentages.
| Concentration Range (μg/L) | VDH Dataset | VAHWQP Dataset |
|---|---|---|
| ≤5.00 | 95% | 99.25% |
| 5.01–10.00 | 2.7% | 0.52% |
| >10.00 | 2.3% | 0.23% |
Distribution of As concentrations from VDH and VAHWQP datasets for geologic units in Virginia.
| Geologic Unit | 5 < | Total | % | ||
|---|---|---|---|---|---|
| C—Cambrian shales and limestones | 0 | 11 | 569 | 580 | 2 |
| Ce—Cambrian metamorphic/volcanic | 3 | 15 | 369 | 387 | 5 |
| Cq—lower Cambrian clastic rocks | 0 | 0 | 83 | 83 | 0 |
| Cv—Cambrian volcanic rocks | 0 | 6 | 192 | 198 | 3 |
| D—Devonian-aged shales/sandstones | 1 | 9 | 115 | 125 | 8 |
| DS—Devonian-Silurian shales and limestones | 1 | 1 | 39 | 41 | 5 |
| lK—lower Cretaceous metamorphic rocks | 0 | 5 | 31 | 36 | 14 |
| M—Mississippian dolostone and sandstone | 2 | 0 | 52 | 54 | 4 |
| Mm1—felsic paragneiss and schist | 0 | 9 | 303 | 312 | 3 |
| Mm4—granite gneiss | 1 | 7 | 157 | 165 | 5 |
| O—Ordovician shales and dolostones | 0 | 7 | 504 | 511 | 1 |
| Oe—Ordovician metamorphic rocks | 0 | 0 | 30 | 30 | 0 |
| PP1—Atokan and Morrowan Series | 0 | 0 | 18 | 18 | 0 |
| Pzg1—lower Paleozoic granitic/metamorphic | 0 | 5 | 118 | 123 | 4 |
| Pzg2—middle Paleozoic granitic/metamorphic | 1 | 4 | 48 | 53 | 9 |
| Pzmi—Paleozoic mafic intrusives | 0 | 3 | 27 | 30 | 10 |
| Qp—Pleistocene sands | 5 | 15 | 321 | 341 | 6 |
| S—Silurian shales and limestones | 3 | 9 | 57 | 69 | 17 |
| Te—Eocene sands and gravels | 0 | 2 | 48 | 50 | 4 |
| Tm—Tertiary gravels and sands | 3 | 78 | 746 | 827 | 10 |
| Tr—Triassic sedimentary rocks | 7 | 24 | 180 | 211 | 15 |
| Tri—Triassic-Jurassic intrusives | 4 | 10 | 46 | 60 | 23 |
| Tx—Paleocene sands and gravels | 0 | 3 | 37 | 40 | 8 |
| Um—ultramafic rocks | 0 | 0 | 6 | 6 | 0 |
| Ya—Anorthosite | 0 | 0 | 3 | 3 | 0 |
| Ygn—Proterozoic volcanic/metamorphic rocks | 0 | 12 | 572 | 584 | 2 |
| Ym—Paragneiss and schist | 0 | 0 | 6 | 6 | 0 |
| Z—sedimentary and metamorphic rocks | 2 | 22 | 441 | 465 | 5 |
| Zg—granitic and metamorphic rocks | 0 | 0 | 10 | 10 | 0 |
| Zv—volcanic rocks | 2 | 6 | 206 | 214 | 4 |
Total number of samples, number of samples that exceed 5 µg/L As, and percent of samples that exceed 5 µg/L from the combined dataset, separated by physiographic province in Virginia.
| Physiographic Province |
| % | |
|---|---|---|---|
| Coastal Plain | 1162 | 29 | 2.5 |
| Piedmont | 2211 | 47 | 2.1 |
| Blue Ridge | 749 | 2 | 0.3 |
| Valley and Ridge | 1496 | 15 | 1.0 |
| Appalachian Plateau | 14 | 1 | 7.1 |
|
|
|
|
|
Results of variable selection.
| Geologic Unit | Coefficient (Mean) | 95% Confidence Interval, Lower Bound | 95% Confidence Interval, Upper Bound |
|---|---|---|---|
| C—Cambrian shales and limestones | −0.779 | −2.220 | −0.027 |
| Ce—Cambrian metamorphic and volcanic rocks | 0.817 | 0.050 | 1.714 |
| Cq—lower Cambrian clastic rocks | −0.668 | −1.693 | −0.100 |
| D—Devonian shales/sandstones | 1.060 | −0.659 | 2.317 |
| DS—Devonian and Silurian shales and limestones | 0.777 | −1.106 | 2.578 |
| lK—lower Cretaceous metamorphic rocks | 0.854 | −0.998 | 2.668 |
| M—Mississippian dolostone and sandstone | 0.485 | −1.174 | 2.280 |
| O—Ordovician shales and dolostones | −1.633 | −2.664 | −0.905 |
| Pzg2—middle Paleozoic granitic/metamorphic rocks | 0.420 | −1.266 | 2.275 |
| Pzmi—Paleozoic mafic intrusives | 1.065 | −0.989 | 2.750 |
| Qp—Pleistocene sands | 1.151 | 0.240 | 2.130 |
| S—Silurian shales and limestones | 1.490 | −0.510 | 2.841 |
| Te—Eocene sands and gravels | −0.527 | −1.529 | −0.039 |
| Tm—Tertiary gravels and sands | 0.891 | 0.137 | 1.822 |
| Tr—Triassic sedimentary rocks | 2.054 | 1.155 | 3.018 |
| Tri—Triassic-Jurassic intrusives | 2.745 | 1.368 | 3.875 |
| Tx—Paleocene sands and gravels | 0.781 | −1.065 | 2.625 |
| Ygn—Proterozoic volcanic and metamorphic rocks | −1.694 | −2.669 | −0.971 |
Results of the logistic regression model using both datasets (λ = 0.002, using cross validation). Positive coefficients reflect increased probability of As occurrence.
| Geologic Unit | Coefficient ( | Exp ( | Standard Error | ||
|---|---|---|---|---|---|
| Tri—Triassic-Jurassic intrusives | 2.5621 | 12.963 | 0.4602 | 5.567 | 0.0000 |
| Tr—Triassic sedimentary rocks | 1.8032 | 6.069 | 0.275 | 5.507 | 0.0000 |
Figure 2Spatial extent of significant geologic units (Tr: Triassic sedimentary rocks; Tri: Triassic-Jurassic intrusives) in the final logistic regression model overlaid on the spatial distribution of As concentrations.
Classification functions for model. TP = True Positive; P = Positive Instances; TN = True Negative; N = Negative Instances; FP = False Positive and; FN = False Negative. NA= not applicable (TP and FP are = 0).
| Classification Function | Formula | Value |
|---|---|---|
| True Positive Rate (Sensitivity) | TP/P | 0% |
| True Negative Rate (Specificity) | TN/N | 100% |
| Positive Predictive Value (Precision) | TP/(TP + FP) | NA |
| Negative Predictive Value | TN/(TN + FN) | 94.99% |
| False Positive Rate | FP/N | 0% |
| False Negative Rate | FN/(TP + FN) | 100% |
| Accuracy | (TP + TN)/(TP + FN + FP + TN) | 97.87% |
Figure 3Location of the Mesozoic rift basin complex along the east coast of the U.S. GIS data obtained from the U.S. Geological Survey.
Figure 4The Culpeper Basin located in north-central Virginia (inset) with overlays of Triassic-aged sedimentary rocks (green), the Triassic-Jurassic intrusives (grey) and As concentrations in well water.