Peter B McMahon1, Andrea K Tokranov2, Laura M Bexfield3, Bruce D Lindsey4, Tyler D Johnson5, Melissa A Lombard6, Elise Watson5. 1. U.S. Geological Survey, Bldg. 53, MS 415, Lakewood, Colorado, 80225, United States. 2. U.S. Geological Survey, 10 Bearfoot Rd., Northborough, Massachusetts 01532, United States. 3. U.S. Geological Survey, 6700 Edith Blvd NE, Albuquerque, New Mexico 87113, United States. 4. U.S. Geological Survey, 215 Limekiln Road, New Cumberland, Pennsylvania 17070, United States. 5. U.S. Geological Survey, 4165 Spruance Road, San Diego, California 92101, United States. 6. U.S. Geological Survey, 331 Commerce Way, Pembroke, New Hampshire 03275, United States.
Abstract
In 2019, 254 samples were collected from five aquifer systems to evaluate perfluoroalkyl and polyfluoroalkyl substance (PFAS) occurrence in groundwater used as a source of drinking water in the eastern United States. The samples were analyzed for 24 PFAS, major ions, nutrients, trace elements, dissolved organic carbon (DOC), volatile organic compounds (VOCs), pharmaceuticals, and tritium. Fourteen of the 24 PFAS were detected in groundwater, with 60 and 20% of public-supply and domestic wells, respectively, containing at least one PFAS detection. Concentrations of tritium, chloride, sulfate, DOC, and manganese + iron; percent urban land use within 500 m of the wells; and VOC and pharmaceutical detection frequencies were significantly higher in samples containing PFAS detections than in samples with no detections. Boosted regression tree models that consider 57 chemical and land-use variables show that tritium concentration, distance to the nearest fire-training area, percentage of urban land use, and DOC and VOC concentrations are the top five predictors of PFAS detections, consistent with the hydrologic position, geochemistry, and land use being important controls on PFAS occurrence in groundwater. Model results indicate that it may be possible to predict PFAS detections in groundwater using existing data sources.
In 2019, 254 samples were collected from five aquifer systems to evaluate perfluoroalkyl and polyfluoroalkyl substance (PFAS) occurrence in groundwater used as a source of drinking water in the eastern United States. The samples were analyzed for 24 PFAS, major ions, nutrients, trace elements, dissolved organic carbon (DOC), volatile organic compounds (VOCs), pharmaceuticals, and tritium. Fourteen of the 24 PFAS were detected in groundwater, with 60 and 20% of public-supply and domestic wells, respectively, containing at least one PFAS detection. Concentrations of tritium, chloride, sulfate, DOC, and manganese + iron; percent urban land use within 500 m of the wells; and VOC and pharmaceutical detection frequencies were significantly higher in samples containing PFAS detections than in samples with no detections. Boosted regression tree models that consider 57 chemical and land-use variables show that tritium concentration, distance to the nearest fire-training area, percentage of urban land use, and DOC and VOC concentrations are the top five predictors of PFAS detections, consistent with the hydrologic position, geochemistry, and land use being important controls on PFAS occurrence in groundwater. Model results indicate that it may be possible to predict PFAS detections in groundwater using existing data sources.
The
contamination of groundwater with perfluoroalkyl and polyfluoroalkyl
substances (PFAS) is a concern in many countries[1−4] because PFAS persist in the environment,[5−8] PFAS sources are widespread,[5,9,10] and some PFAS are known or suspected to be associated with adverse
human-health effects.[11−13] Moreover, PFAS in the unsaturated zone can be sources
to underlying groundwater systems for decades,[7,14] making
PFAS in groundwater a long-term public-health concern. The United
States does not currently (2022) have nationally enforceable drinking-water
standards for PFAS, but in 2016, the U.S. Environmental Protection
Agency (EPA) established a drinking-water health advisory level of
70 ng/L for the combined concentrations of perfluorooctane sulfonate
(PFOS) and perfluorooctanoate (PFOA),[15] two of the most commonly measured PFAS in groundwater.Conceptually,
the occurrence of PFAS in groundwater should be controlled
by land-use (source), hydrologic (transport), and biogeochemical (fate
and transport) factors. Several local-scale studies have provided
deeper understanding of the source, transport, and fate of PFAS in
soils and groundwater by combining PFAS data with data for those factors.[7,16,17] A study in Massachusetts used
data on groundwater-flow directions and measurements of dissolved
oxygen (O2) and boron (B) to help differentiate PFAS in
groundwater affected by wastewater effluent from that affected by
a fire-training area.[7] A study in Japan
used measurements of tritium (3H), pharmaceuticals, and
personal care products to link PFAS in modern groundwater to sewage
sources.[18] Measurements of benzene and
PFAS and information on enhanced bioremediation activities related
to the hydrocarbon contamination improved the understanding of PFAS
precursor transformations at a fire-training facility in South Dakota.[17] We hypothesize that data for the controlling
factors can improve our understanding of the occurrence of PFAS at
regional and national scales, such that those data could be used to
build large-scale statistical models that predict PFAS occurrence
in unmonitored areas, as has been done for arsenic and nitrate.[19−21] Such models could be used to guide future sampling efforts and identify
areas of high risk for human exposure.At least one study demonstrated
the value of PFAS-source information
in predicting PFAS occurrence in drinking-water supplies at the national
scale.[10] A state-wide study in New Hampshire
showed that data on hydrology and PFAS sources can also be predictive
of PFAS occurrence in groundwater.[22] We
are aware of one other study that combined source, hydrologic, and
geochemical data to test their power to predict PFAS occurrences in
groundwater at a regional scale (state of California).[23] Our study expands this type of analysis to a
different region of the United States and includes additional geochemical
parameters found to be useful in understanding and predicting PFAS
in groundwater (e.g., dissolved organic carbon (DOC) and pharmaceuticals).The purpose of this article is threefold: (1) explore hydrologic,
geochemical, and land-use controls on PFAS occurrence in groundwater
used as a source of drinking water in the eastern United States, (2)
examine chemical co-occurrences with PFAS that could improve understanding
of PFAS occurrences, and (3) identify which hydrologic, geochemical,
and land-use factors are the strongest predictors of PFAS occurrence
using boosted regression tree (BRT) models. Well depth and tritium
(3H) provide information on the hydrologic position of
samples in the groundwater-flow systems. pH, iron [Fe], B, DOC, pharmaceuticals,
VOCs, and other parameters provide information on geochemical conditions,
co-occurring chemicals, and potential PFAS sources. Geospatial data
provide locations for potential PFAS sources like fire-training facilities,
landfills, airports, military bases, and other features.
Materials and
Methods
Well Networks
In 2019, 254 wells were sampled in seven
networks (Figures and S1; Table S1). The well networks
cover 2383 to 164,619 km2 in five principal-aquifer systems:
Glacial, Mississippi Embayment, Southeastern Coastal Plain, Stream
Valley, and Surficial aquifer systems. The networks were established
by the U.S. Geological Survey’s National Water-Quality Assessment
(USGS NAWQA) project in 1999 to 2019 and are composed of public-supply
(64%), domestic (19%), monitoring (12%), and other (5%) well types
(Table S1). A stratified random approach
was used to select wells for sampling. Hydrogeologic units that are
important sources of water supply were targeted for sampling (stratification
step; Table S1). To facilitate random well
selection, each hydrogeologic unit was subdivided into 30 to 60 equal-area
cells, and one well in each cell was randomly selected to sample from
a population of existing wells.[24] All wells
within the NECBS network are water-table monitoring wells installed
in 1999 by randomly selecting one location in each equal-area cell.
Figure 1
Locations
of aquifer systems and well networks. Wells shown with
white symbols indicate that PFAS were not detected, and those shown
with other colors indicate that PFAS were detected. See Figure S1 for detailed maps of the networks.
Locations
of aquifer systems and well networks. Wells shown with
white symbols indicate that PFAS were not detected, and those shown
with other colors indicate that PFAS were detected. See Figure S1 for detailed maps of the networks.In 2012, median percentages of urban land use around
the wells
ranged from ∼48 (MIAM) to 92% (NECBS) (Table S1).[25] Three well networks
located in the Glacial aquifer system represent valley-fill aquifers
(MIAM) and relatively shallow (NECBS) and deep (NECBD) parts of the
stratified-drift aquifer. Metropolitan areas in those networks include
Dayton (MIAM) and Boston (NECBS; NECBD). Network MISE represents the
Memphis Sand aquifer in the Mississippi Embayment aquifer system.
The Memphis metropolitan area is a major user of groundwater from
the Memphis Sand aquifer. Network MOBL represents the Black Warrior
River aquifer in the Southeastern Coastal Plain aquifer system. Network
STRV represents the Ohio River alluvial aquifer in the Stream Valley
aquifer system. Metropolitan areas in STRV include Louisville, Cincinnati,
and Pittsburgh. Network SURF represents the Surficial aquifer system.
Metropolitan areas in SURF include Fort Lauderdale and West Palm Beach.
Networks were sampled as a part of the USGS NAWQA, National Hydrologic
Monitoring project.
Sample Collection and Laboratory Analysis
Standard
USGS protocols were used to collect groundwater samples from the wells
prior to any treatment, blending, or pressure tanks.[26,27] Samples for major-ion, nutrient, trace-element, and DOC analyses
were filtered (0.45 μm Versapor filters) and acidified with
nitric acid (major cations and trace elements) or sulfuric acid (DOC),
and/or chilled (nutrients, DOC) in the field. Pharmaceutical samples
were filtered (0.7 μm glass fiber) and chilled in the field.[28] Samples for VOC analysis were unfiltered, acidified
with hydrochloric acid, and chilled in the field. 3H samples
were unfiltered and unpreserved. Samples for PFAS analysis were unfiltered
and chilled in the field. Details of the PFAS sampling protocols are
provided in Section S1.Major-ion,
nutrient, trace-element, DOC, pharmaceutical, and VOC samples were
analyzed at the USGS National Water-Quality Laboratory in Lakewood,
Colorado. Major cations were analyzed by inductively coupled plasma-atomic
emission spectroscopy.[29] Major anions were
analyzed by ion chromatography.[29] Trace
elements were analyzed by inductively coupled plasma-mass spectroscopy.[30] DOC was analyzed by UV-promoted persulfate oxidation
and infrared spectrometry.[31] Pharmaceuticals
were analyzed by liquid chromatography/tandem mass spectrometry using
an electrospray ionization source operated in the positive ion mode.[32] VOCs were analyzed by purge-and-trap gas chromatography/mass
spectrometry.[33]3H was analyzed
at the USGS Tritium Laboratory in Menlo Park, California, using electrolytic
enrichment and liquid-scintillation counting.[34] PFAS were analyzed at the Orlando, Florida SGS Laboratory by liquid
chromatography/tandem mass spectrometry with isotope dilution (see Section S2 for details).[35] Analyzed compounds include 11 perfluoroalkyl carboxylates (PFCAs),
7 perfluoroalkyl sulfonates (PFSAs), perfluorooctane sulfonamide,
2 perfluorooctane sulfonamidoacetates, and 3 fluorotelomer sulfonates
(FTS). Reported concentrations are the sum of linear and branched
isomers. The chemical data are listed in Table S2, along with PFAS-specific reporting levels (3.8 to 40 ng/L),
and are available in ref (36).
PFAS Quality-Control Data and Analysis
Details about
the evaluation of quality-control samples associated with this study
are provided in Section S3 and Tables S3–S6. Briefly, field and laboratory
blanks were used to examine potential sources of contamination. No
PFAS detections were reported for any blank types collected in the
field, with the exception of one perfluorobutanoate (PFBA) detection
reported for a field blank; detection of PFBA in the associated laboratory
method blank indicated that this result was likely affected by contamination
at the laboratory. Laboratory-blank results prompted censoring of
reported detections for PFBA in 15 groundwater samples in two analytical
batches and for a detection of 6:2 FTS in one groundwater sample in
a separate batch. Examination of data for routine laboratory reagent
spikes and matrix spikes indicated little bias (all median recovery
values for PFAS between 85 and 103%); reagent spikes exhibited low
variability in recovery. Field replicates and laboratory matrix spike
duplicates indicated low variability in PFAS detection and (or) concentration.
Geospatial Data
Geospatial data from publicly available
sources and from U.S. Government proprietary sources were used to
analyze spatial relations between PFAS detections in groundwater and
potential sources of PFAS (Table S7). The
potential PFAS sources, data availability, and geospatial analysis
are described in Section S4. Five-hundred-meter
circular buffers around the wells were used to extract and assign
selected land-use data to the wells to examine relations between water
chemistry and land use (see Section S4 for
more information).
Statistical Methods
Mann–Whitney
and Kruskal–Wallis
tests, as implemented in the software OriginPro 2018,[37] were used on ranked data to test for significant differences
in concentrations between selected geochemical groups. Spearman correlation
analysis was used to examine relations between concentrations and
other variables. An α value of 0.05 was used for each test.
BRT Modeling
BRT models were fit to 57 variables including
water-quality parameters, land use, and distance from wells to identified
places that are potential sources of PFAS to the environment (Tables S2 and S8). Laboratory results for PFAS
were converted to a binary (detect or nondetect), and models were
fit using methods like those described in ref (19). Models were developed
using the R computing environment (R Team. R: A language and environment
for statistical computing version 4.0.3. https://www.R-project.org).
The dataset was split into training data (80%) and holdout data (20%)
to evaluate model performance (Section S5). The model parameters adjusted during tuning were interaction.depth
(the number of levels of trees), n.minobsinnode (the minimum number
of observations in terminal nodes), shrinkage (the learning rate),
and n.trees (number of trees in the model). Metrics for model performance
were calculated using fivefold cross validation and include accuracy,
sensitivity, specificity, and the area under the receiver operator
characteristics curve (ROC) (Section S5 and Tables S9 and S10). Accuracy is the
percentage of total correct predictions, sensitivity is the percentage
of successfully predicted detections (true positives), and specificity
is the percentage of successfully predicted nondetections (true negatives).
The ROC curve is plotted as the true positive divided by the false
positive rate for a range of probability thresholds, including the
0.5 threshold chosen. A binary variable indicating whether any of
the 24 PFAS were detected was used as the primary dependent variable.
However, for comparison, multiple models were constructed with the
dependent variable being a binary variable indicating the detection
of PFOS, PFOA, PFBS, long chain-length perfluoroalkyl acids (PFAA),
short chain-length PFAA, PFCA, or PFSA. Long-chain PFCAs and PFSAs
are those with ≥7 and ≥6 perfluorinated carbons, respectively.[38]
Results and Discussion
PFAS Occurrence in Groundwater
Fourteen of the 24 analyzed
PFAS were detected at least once in groundwater samples (Tables S2 and S11). At least one PFAS was detected
in 54% of the samples (n = 254), and ≥2 PFAS
were detected in 47% of the samples. For drinking-water wells, PFAS
were detected in 60% of public-supply wells and 20% of domestic wells.
At least two PFAS were detected in 53% of public-supply wells and
10% of domestic wells. The most commonly detected PFAS in our samples
include the six measured by the EPA’s Third Unregulated Contaminant
Monitoring Rule (UCMR3) program (perfluorobutane sulfonate [PFBS],
perfluorohexane sulfonate [PFHxS], PFOS, perfluoroheptanoate [PFHpA],
PFOA, and perfluorononanoate [PFNA]),[39] plus several other PFSA and PFCA compounds in the 4- to 9-carbon
range, most notably PFBA, perfluoropentanoate (PFPeA), and perfluorohexanoate
(PFHxA) (Figure A).
PFOA and PFOS represent two of the three most frequently detected
PFAS, and 2.4% (n = 6) of the samples have PFOA+PFOS
concentrations greater than the 70 ng/L health advisory level (all
are from public-supply wells). Eight of the 10 undetected PFAS are
PFAA precursors (n = 5) or PFCA with 12+ carbon atoms
(n = 3) (Table S11).
Figure 2
(A) Detection
frequency for perfluoroalkyl sulfonates (PFSA) and
perfluoroalkyl carboxylates (PFCA) in relation to the carbon number,
based on all data, (B) ΣPFAS in relation to PFAS detection frequency
by the well network, and (C) ΣPFAS in relation to the number
of PFAS detected in samples. In (B), the horizontal lines show median
concentrations; n is the number of samples with at
least one PFAS detection. In (B and C), ΣPFAS is the summed
concentration of detected PFAS.
(A) Detection
frequency for perfluoroalkyl sulfonates (PFSA) and
perfluoroalkyl carboxylates (PFCA) in relation to the carbon number,
based on all data, (B) ΣPFAS in relation to PFAS detection frequency
by the well network, and (C) ΣPFAS in relation to the number
of PFAS detected in samples. In (B), the horizontal lines show median
concentrations; n is the number of samples with at
least one PFAS detection. In (B and C), ΣPFAS is the summed
concentration of detected PFAS.Substantial differences in PFAS detection frequencies and summed
concentrations of detected PFAS (ΣPFAS) are observed between
the well networks. PFAS detection frequencies range from 3.7 (MISE
network) to 92.9% (NECBS) (Figure B). Although the wells in NECBS are monitoring wells
screened near the water table, not wells that supply drinking water,
the data from NECBS are important because they provide information
on the quality of recharge that may eventually reach the deeper public-supply
wells in NECBD. ΣPFAS medians range from 2.2 (MISE) to 40.0
ng/L (SURF) (Figure B). MOBL has a relatively high ΣPFAS median despite its low
detection frequency; nevertheless, there is a significant positive
correlation (rho = 0.86; p = 0.014) between detection
frequency and ΣPFAS medians. The highest ΣPFAS (1645 ng/L)
occurs in a public-supply well from STRV.Relatively little
is known about potential effects of complex PFAS
mixtures in drinking-water sources on human health,[13,40] but better understanding of the composition of those mixtures could
help inform toxicity studies. The relatively common occurrence of
multiple PFAS in the samples has implications for ΣPFAS and
the complexity of PFAS mixtures in groundwater, consistent with previous
point-of-use drinking-water studies.[41,42] ΣPFAS
exhibit a significant positive Spearman correlation with the number
of PFAS detected in the samples (rho = 0.91; p <
0.001) (Figure C),
also consistent with results from the UCMR3 program.[3] Networks NECBS, NECBD, and SURF have relatively large fractions
of samples containing >6 PFAS (37 to 54%) (Figure S2A), which may be related to the large fractions of urban
land use in 500 m buffers around those wells (74 to 92%) (Table S1). Not surprisingly, networks with the
largest numbers of co-occurring PFAS (NECBS, NECBD, SURF, and STRV)
also have the largest numbers of unique PFAS mixtures relative to
the number of samples in the networks (Figure S2B). In NECBS, 68% of the samples contain combinations of
two or more PFAS that are unique to that sample. Overall, three PFAS
occur in ≥80% of the mixtures (PFOA > PFBS > PFOS), but
the
dominant PFAS in mixtures vary between networks. Network NECBS has
three PFAS that occur in ≥80% of the mixtures (PFOA > PFOS
> PFBS), whereas SURF has 7 PFAS that occur in ≥80% of the
mixtures (PFOA > PFHxS > PFOS = PFHxA > PFBS = PFPeA = PFHpA)
(Figure S2C).
Hydrologic Controls on
PFAS Occurrence
Hydrologic characteristics
of groundwater systems could influence the occurrence of PFAS in groundwater,
yet data describing those characteristics are not commonly examined
in regional PFAS studies.[22] The position
of a groundwater sample in the flow system relative to the land surface
and the age of groundwater relative to the age of PFAS sources are
important with respect to PFAS occurrence in groundwater because PFAS
are derived from modern land-surface sources. Data for well depth
and 3H in groundwater indicate that the samples collected
for this study represent a broad spectrum of hydrologic positions
and age categories.There are significant differences in well
depths between the networks (Figure S3A), with median depths ranging from 7.7 (NECBS) to 88.4 m (MISE).
There are also significant differences in 3H concentrations
between the well networks (Figure S3B),
with median concentrations ranging from 0.1 tritium units (TU) (MISE)
to 4.7 TU (STRV). Generally, 3H concentrations less than
about 0.1 to 0.5 TU in groundwater collected in 2019 indicate that
the groundwater was recharged before the start of above-ground nuclear
weapon testing in 1953 (referred to as premodern water in this article).[43] More precise estimates of the threshold 3H concentration for premodern water depend on a sample’s
location and collection date due to spatial variations in 3H concentrations in precipitation and radioactive decay. Groundwater 3H data are used to assign the samples to one of three age
categories using previously developed methodology and datasets for 3H in U.S. precipitation:[43,44] premodern
(pre-1953 recharge), modern (recharged during or after 1953), or mixed
(mixture of premodern and modern recharge). Threshold 3H concentrations (TU) for the premodern and modern age categories
are 0.3, 2.8 (NECBS; NECBD); 0.2, 2.3 (MIAM); 0.2, 1.9 (STRV); 0.1,
0.6 (SURF); and 0.1, 1.2 (MISE; MOBL). The cutoff date for premodern
water (pre-1953) is close to the date when PFAS started to be widely
used (∼1950);[9] thus, knowing age
categories of the samples could help characterize the risk of groundwater
contamination with PFAS. Samples of modern and mixed-age groundwater
contain at least some water recharged after the start of widespread
PFAS use, whereas samples in the premodern age category were probably
recharged prior to the widespread use of PFAS. Moreover, modern samples
are from wells with significantly shallower depths than premodern
samples (Figure S2C). Only the shallowest
network (NECBS), with the highest PFAS detection frequency (Figure B), has 100% of its
samples assigned to the modern age category (Figure A and Table S2). Two other networks (NECBD; MIAM) have >95% of their samples
assigned
to the modern category. The four remaining networks have samples in
all three age categories, with MISE and MOBL having 44 to 54% of their
samples in the premodern age category and the lowest PFAS detection
frequencies.
Figure 3
(A) Percentage of samples in each of three groundwater
age categories
(modern, mixed, and premodern), by well networks and (B) percentage
of PFAS detections that occur in each age category. See the text for 3H concentrations used to define age categories in each well
network.
(A) Percentage of samples in each of three groundwater
age categories
(modern, mixed, and premodern), by well networks and (B) percentage
of PFAS detections that occur in each age category. See the text for 3H concentrations used to define age categories in each well
network.The relations between depth, age
categories, and onset of widespread
PFAS use imply that the likelihood of detecting PFAS in groundwater
decreases with increasing well depth. For example, network NECBS is
characterized by very shallow wells, high fraction of modern water,
and a high PFAS detection frequency, whereas MISE is characterized
by deep wells, low fraction of modern water, and a low PFAS detection
frequency (Figures B, 3A and S3A).
Overall, samples that contain a PFAS detection have significantly
shallower well depths and higher 3H concentrations than
samples with no detections (Table S12),
consistent with the hydrologic position and groundwater age being
controlling factors for PFAS occurrence in groundwater. In samples
that have both 3H data and PFAS detections (n = 136), 92% of the detections occur in modern water and about 99%
occur in samples with at least some modern water (modern + mixed)
(Figure B). The level
of protection against PFAS contamination provided by depth and age
could decrease over time as shallow groundwater moves deeper into
an aquifer system.
Geochemical Controls on PFAS Occurrence
Geochemical
characteristics of groundwater like pH and concentrations of DOC,
divalent cations, and chloride (Cl) could affect PFAS sorption to
aquifer solids. PFAS sorption, for example, can be greater under conditions
of low pH,[7,45] elevated concentrations of divalent cations,[46,47] or lower concentrations of DOC and Cl.[47−49] Reductive dissolution
of manganese (Mn) and Fe oxides in anoxic groundwater could reduce
the sorption capacity of solids or mobilize PFAS already adsorbed
to those solids.[7] Values of pH and concentrations
of selected inorganic constituents and DOC indicate that there are
significant differences in geochemical characteristics between the
networks (Figure S4). For example, median
pH values range from 5.6 (NECBS) to 7.1 (STRV). Median Ca + Mg, DOC,
and Mn + Fe concentrations range from 2.88 (MOBL) to 128 (MIAM) mg/L,
<0.23 (MISE) to 7.6 (SURF) mg/L, and 0.8 (MISE) to 357 (SURF) μg/L,
respectively.Concentrations of DOC, Cl, SO4, Ca
+ Mg, and Mn + Fe are significantly higher in samples containing PFAS
detections than in samples with no detections; however, pH is not
significantly different between the two PFAS groups (Table S12). The similarity in pH between the two PFAS groups
is due in part to high PFAS detection frequencies in networks with
low (NECBS, NECBD median pH = 5.9) and high (STRV, SURF median pH
= 7.0) pH values. In those networks, PFAS-source characteristics may
be more dominant controls on PFAS occurrence than pH. Higher DOC concentrations
in samples containing PFAS detections are consistent with previous
studies indicating that elevated DOC concentrations could promote
PFAS mobility through electrostatic or hydrophobic interactions in
the dissolved phase or by competing with PFAS for sorption sites.[48,49] Competition for sorption sites between negatively charged PFAS and
inorganic anions could also help explain the higher Cl and SO4 concentrations in samples containing PFAS detections.[47,48] In contrast, previous studies indicate that increasing concentrations
of divalent cations could increase PFAS sorption by reducing the negative
charge on soil surfaces, thereby reducing the electrostatic repulsion
between the surface and negatively charged PFAS.[45] The higher Ca + Mg concentrations in samples containing
PFAS detections appear to be inconsistent with those results, but
the observation could be influenced by Cl and SO4 concentrations,
given the competing effects of anions and cations on PFAS sorption.
The relation between PFAS detections and Ca + Mg appears to be more
consistent with previous studies when Ca + Mg concentrations are normalized
to Cl or SO4 concentrations. Ca + Mg/Cl and Ca + Mg/SO4 ratios are significantly lower in samples that contain PFAS
detections than in samples with no detections (Table S12). The relation between PFAS detections and Mn +
Fe is consistent with reductive dissolution of Mn and Fe oxides in
anoxic groundwater that could reduce the sorption capacity of solids
or mobilize PFAS already adsorbed to those solids. Concentrations
of Mn + Fe, Mn, and Fe are not significantly correlated with urban
land use, but there is a significant inverse correlation (p < 0.001) between well depth and Mn (but not Fe). Thus,
the PFAS, Mn + Fe relation could also more generally reflect higher
pollution loading near the land surface in some settings that results
in more reducing conditions and Mn reduction.
Land-Use Controls on PFAS
Occurrence
Urban areas could
be expected to have a greater density of potential PFAS sources compared
to agricultural and undeveloped landscapes, although biosolids are
recognized as a potential PFAS source on agricultural lands.[50] There is a significantly larger percentage of
urban land within 500 m of wells that contain PFAS detections than
near wells with no detections (Table S13), whereas there are significantly smaller percentages of agricultural
and undeveloped lands within 500 m of wells that contain PFAS detections
than near wells with no detections. The relation between PFAS and
urbanization is important because it implies that the risk of PFAS
contamination of groundwater could increase as urbanization encroaches
on the agricultural and undeveloped land.More specific to PFAS
occurrences in groundwater than land use is information on potential
PFAS sources near the sampled wells. U.S. Government proprietary databases
provided this study with geospatial information for facility categories
that could be potential PFAS sources (Table S8). Among the source categories are public-use airports, chemical
manufacturing facilities, fire-training facilities, and landfills.
These source categories and most others listed in Table S8 are significantly closer to wells that contain PFAS
detections than to wells with no detections (Table S13). Moreover, the cumulative number of potential PFAS sources
<5 km from the sampled wells is significantly higher for wells
containing PFAS detections than for wells with no detections. It is
unknown which, if any, of the potential sources listed in Table S8 used or released PFAS to the environment;
nevertheless, the results are consistent with the conceptual model
of PFAS concentrations in groundwater increasing with decreasing distance
to PFAS sources and increasing density of PFAS sources. Similar spatial
relations have been observed in other regional studies.[10]Differences in the number of unique PFAS
mixtures in the well networks
could be related to the diversity of PFAS sources in the networks
that is not captured in general land-use data. NECBD has a larger
fraction of samples with unique PFAS mixtures than SURF (Figure S2B), even though SURF (86%) has a larger
fraction of urban land within 500 m of its wells than NECBD (74%).
The number of potential PFAS sources within 5 km of NECBD wells is
significantly higher than the number within 5 km of SURF wells (Figure S5A). NECBD and SURF appear to have similar
numbers of septic systems in 500 m buffers around their wells (Figure S5B), based on estimates of N input to
septic systems in the United States,[51] which
is used here as a proxy for septic-system density. Septic systems
are a suspected source of PFAS in groundwater.[52] Differences in the number and composition of PFAS mixtures
between the networks could also be affected by PFAS degradation and
sorption processes. As previously discussed, the differences in geochemical
characteristics of groundwater in the networks could result in different
amounts of PFAS sorption to aquifer solids in the networks.
PFAS Co-Occurrence
with Other Chemicals
Linking the
PFAS detections in our samples to specific PFAS sources is beyond
the scope of this study, given its regional scale. Nevertheless, PFAS
co-occurrences with other chemicals can provide useful information
about the chemical complexity of PFAS-contaminated groundwater and,
more generally, about PFAS sources. VOC and pharmaceutical detection
frequencies are substantially higher in samples that contain PFAS
detections (62 and 33%, respectively) than in samples with no detections
(21 and 2%, respectively) (Table S12).
For samples from public-supply wells that have PFAS detections, 68,
37, and 24% also have at least one detection of a VOC, pharmaceutical,
or VOC + pharmaceutical, respectively. ΣVOCs (median = 0.07
μg/L) and Σpharmaceuticals (median = 0.025 μg/L)
are low, but their presence indicates that the chemical inventory
in groundwater contaminated with PFAS is complex. This finding is
important because estimating the toxicity of complex chemical mixtures
in water can be challenging.[53] The top
10 VOCs co-occurring with PFAS include chlorodifluoromethane (CDFM),
the most common co-occurring VOC, and 1,1-dichloroethane (Figure S6A). The co-occurrence of CDFM and 1,1-dichloroethane
with PFAS was also noted in the UCMR3 dataset.[3] The most common co-occurring pharmaceuticals include carbamazepine
> gabapentin > meprobamate (Figure S6B).The co-occurrence of PFAS with other chemicals does not
necessarily
indicate that they are from the same source, particularly in urban
areas where independent sources could coexist in proximity to each
other. PFAS and pharmaceuticals, however, are known to co-occur in
sources associated with human-waste disposal (e.g., septic-system/landfill
leachate and wastewater treatment plants).[10,54−56] Those types of sources could account for some of
the PFAS in the 37% of PFAS-contaminated samples from public-supply
wells that contain at least one pharmaceutical. In NECBD, 41% of the
public-supply wells with PFAS detections also have at least one detection
of a pharmaceutical. That network has a relatively high abundance
of septic systems within 500 m of the wells (Figure S5B), and nitrate and B concentrations, for the most part,
are significantly higher in NECBD samples that contain PFAS or pharmaceutical
detections than in samples with no detections (Table S14). Nitrate and B can have elevated concentrations
in groundwater due to contamination with septic leachate.[55] The overall dataset also appears to be consistent
with at least some PFAS in the samples with co-occurring PFAS and
pharmaceutical detections being associated with septic/landfill leachate,
wastewater effluent, or stormwater runoff.[52,54,56,57] Samples with
co-occurring detections of PFAS and pharmaceuticals have significantly
higher concentrations of B and total N than samples that do not have
co-occurring PFAS and pharmaceutical detections (Figure S7). All three sources can have elevated B and total
N concentrations,[55,58,59] so they could produce similar chemical patterns with respect to
PFAS, pharmaceuticals, B, and total N.VOCs known to be associated
with PFAS sources could also be potentially
useful PFAS-source indicators in groundwater. Fuel hydrocarbons like
benzene co-occur with PFAS in groundwater in some fire-training areas.[17] CDFM, the most common co-occurring VOC in our
dataset, is used to manufacture monomers like tetrafluoroethene, which
is an important building block in fluoropolymer production. Historically,
the polymerization process used PFOA and PFOS as emulsifiers.[60] Thus, co-occurrences of CDFM, PFOA, and PFOS
in groundwater could indicate that some of the PFAS are from fluoropolymer
manufacturing. CDFM has also been used as a replacement for chlorofluorocarbons
in air conditioning, refrigerants, and aerosols, resulting in increasing
atmospheric CDFM concentrations.[61] Atmospherically
derived CDFM could enter groundwater in recharge that is unrelated
to PFAS sources, although CDFM concentrations in air-saturated water
(0.0009 μg/L at 20 °C)[61] are
very low compared to the concentrations measured in this study. In
this study, the median detected CDFM concentration is 0.075 μg/L,
∼85 times higher than the concentration in air-saturated water.
PFAS detections co-occur in 90% of the samples that contain a CDFM
detection, and 96% of the samples with co-occurring CDFM and PFAS
also contain PFOA and/or PFOS. In the SURF network, where 70% of the
CDFM detections occur, wells with co-occurring CDFM and PFAS detections
are significantly closer to chemical manufacturing facilities (median
distance = 4.3 km) than wells without co-occurring detections (median
= 12.1 km) (Figure S8). We do not know
if PFAS and CDFM were used or released to the environment at those
facilities, but the chemical and spatial data are consistent with
at least some of the PFAS in samples with co-occurring CDFM and PFAS
detections being sourced from chemical manufacturing. SURF samples
with co-occurring PFAS and CDFM detections and samples with only CDFM
detections do not have significantly higher B or total N concentrations
than samples without those detections (Figure S9), suggesting that the PFAS + CDFM co-occurrences are not
associated with human-waste disposal. The regional analyses of co-occurring
PFAS-pharmaceuticals and PFAS-CDFM provide internally consistent hypotheses
about some PFAS being sourced from human-waste-disposal activities
and chemical manufacturing, respectively, that need to be tested with
data from local-scale studies.
BRT Models
Models
were developed to predict PFAS detections
using geochemical and geospatial data as predictor variables. The
model selected, with the dependent variable being a binary variable
indicating whether one or more of the 24 PFAS were detected, was constructed
using 1000 trees, an interaction depth of 2, shrinkage (learning rate)
of 0.002, and 10 as the minimum number of observations in terminal
nodes of a tree. The accuracy, sensitivity, specificity, and ROC were
0.91, 0.93, 0.89, and 0.97 for the training data and 0.84, 0.96, 0.72,
and 0.90 for the holdout data, respectively, indicating excellent
model performance (Table S9). The relative
importance of the model variables (Figure ) indicates that the top five predictors
of PFAS detection are 3H concentration, distance to the
nearest fire-training area, DOC concentration, percentage of urban
land use, and ΣVOC. Variables potentially related to PFAS sorption
(e.g., SO4 and Cl) are also among the top 10 predictors.
Overall, the model results are consistent with the hydrologic position,
groundwater geochemistry, and land use being important controls on
PFAS occurrence in groundwater. 3H is rarely collected
in assessments of PFAS contamination of groundwater, but given its
apparent predictive power, inclusion of 3H in such efforts
seems warranted.
Figure 4
Relative influence of potential predictor variables sorted
from
largest to smallest contribution to the model is shown in blue bars.
Inset panels are partial dependence plots for tritium, distance to
the nearest fire-training area, DOC, the percentage urban land use
around each well, and the sum of detected VOC concentrations. Partial
dependence plots show the relationship between probability of PFAS
detection and the predictor variable. Blue tick marks along the x axis of the inset panels indicate the variable minimum,
maximum, and deciles of the model training dataset. Plateaus in the
inset graphs are usually in areas with little or no data. FRS, facility
registry service.
Relative influence of potential predictor variables sorted
from
largest to smallest contribution to the model is shown in blue bars.
Inset panels are partial dependence plots for tritium, distance to
the nearest fire-training area, DOC, the percentage urban land use
around each well, and the sum of detected VOC concentrations. Partial
dependence plots show the relationship between probability of PFAS
detection and the predictor variable. Blue tick marks along the x axis of the inset panels indicate the variable minimum,
maximum, and deciles of the model training dataset. Plateaus in the
inset graphs are usually in areas with little or no data. FRS, facility
registry service.Partial dependence plots
can help visualize the relationship between
the probability of detection and the predictor variable.[62] Partial dependence plots (Figure ) indicate increasing probability of PFAS
detection with increasing 3H, DOC, and VOC concentrations
and increasing urban land-use percentages. Conversely, the probability
of PFAS detection decreases with increasing distance from a fire-training
area. These partial dependence plots are in line with expectations
discussed in this article. Higher 3H concentrations are
indicative of more modern water, urban areas have been associated
with PFAS occurrence, and fire-training areas are known point sources
of PFAS.[7,17,63] As discussed
above, VOC detection frequencies and DOC concentrations were higher
in samples containing PFAS detections: these associations are reflected
in the model, which found that DOC and VOCs have large relative influence
on the model.Different models were constructed with the dependent
variable being
a binary variable indicating the detection of PFOS, PFOA, PFBS, long
chain-length PFAA, short chain-length PFAA, PFCA, or PFSA. In all
scenarios, 3H, DOC, and VOC concentrations and distance
to the nearest fire-training area were in the top 10 most influential
predictor variables (Table S10). The percentage
of urban land use was also in the top 10 most influential predictor
variables for all scenarios except for the model predicting PFOA,
where it was number 11. These results indicate robustness of the model.The model developed here indicates that it may be possible to predict
PFAS detections in drinking-water wells using data sources already
available, consistent with another recent study that used statistical
models to predict susceptibility of private wells to PFAS contamination.[22] The USGS National Water Information System (NWIS)[64] contains many years of water-quality data, including
the chemical parameters used here, from wells across the U.S. Distance
metrics could likewise be developed from existing geospatial datasets.
Limitations and Implications
The data represent a conservative
estimate of human exposure to PFAS because samples were collected
prior to treatment. Nevertheless, many water-treatment processes do
not effectively remove PFAS.[41,42] Also, treatment is
often absent for households that rely on private domestic wells as
a source of drinking water. Thus, the PFAS results are likely to be
relevant to human exposure at the tap. Although the dataset represents
a broad spectrum of hydrogeologic settings in the eastern United States,
important aquifer systems like the Northern Atlantic Coastal Plain
system and the fracture-flow dominated Floridan carbonate-rock and
Piedmont crystalline-rock systems are not represented. Likewise, the
24 PFAS analyzed here may not represent the total PFAS inventory in
groundwater, given the large number of PFAS reported in commercial
use and the environment.[9] Nevertheless,
the dataset provides insight into the relations between PFAS in groundwater
and large numbers of hydrologic, geochemical, and geospatial variables.
Broadly, groundwater affected by modern anthropogenic activity appears
to be associated with PFAS, given significant relations between PFAS
detections and variables such as 3H, urban land use, VOCs,
and pharmaceuticals. BRT modeling indicates that it is possible to
predict PFAS occurrence based on the explanatory variables investigated
here. Therefore, future sampling related to PFAS may consider adding
targeted analyses such as 3H, DOC, SO4, and
Cl to build comprehensive datasets that may allow for national prediction
of PFAS occurrence.
Authors: Paul M Bradley; Larry B Barber; Jimmy M Clark; Joseph W Duris; William T Foreman; Edward T Furlong; Carrie E Givens; Laura E Hubbard; Kasey J Hutchinson; Celeste A Journey; Steffanie H Keefe; Dana W Kolpin Journal: Sci Total Environ Date: 2016-06-24 Impact factor: 7.963
Authors: Meghan E McGuire; Charles Schaefer; Trenton Richards; Will J Backe; Jennifer A Field; Erika Houtz; David L Sedlak; Jennifer L Guelfo; Assaf Wunsch; Christopher P Higgins Journal: Environ Sci Technol Date: 2014-06-09 Impact factor: 9.028
Authors: Andrea K Weber; Larry B Barber; Denis R LeBlanc; Elsie M Sunderland; Chad D Vecitis Journal: Environ Sci Technol Date: 2017-04-06 Impact factor: 9.028
Authors: Ning Wang; Jinxia Liu; Robert C Buck; Stephen H Korzeniowski; Barry W Wolstenholme; Patrick W Folsom; Lisa M Sulecki Journal: Chemosphere Date: 2010-11-26 Impact factor: 7.086
Authors: Jonathan P Benskin; Belinda Li; Michael G Ikonomou; John R Grace; Loretta Y Li Journal: Environ Sci Technol Date: 2012-10-23 Impact factor: 9.028
Authors: Robert C Buck; James Franklin; Urs Berger; Jason M Conder; Ian T Cousins; Pim de Voogt; Allan Astrup Jensen; Kurunthachalam Kannan; Scott A Mabury; Stefan P J van Leeuwen Journal: Integr Environ Assess Manag Date: 2011-10 Impact factor: 2.992