Literature DB >> 35113548

Perfluoroalkyl and Polyfluoroalkyl Substances in Groundwater Used as a Source of Drinking Water in the Eastern United States.

Peter B McMahon¹, Andrea K Tokranov², Laura M Bexfield³, Bruce D Lindsey⁴, Tyler D Johnson⁵, Melissa A Lombard⁶, Elise Watson⁵.

Abstract

In 2019, 254 samples were collected from five aquifer systems to evaluate perfluoroalkyl and polyfluoroalkyl substance (PFAS) occurrence in groundwater used as a source of drinking water in the eastern United States. The samples were analyzed for 24 PFAS, major ions, nutrients, trace elements, dissolved organic carbon (DOC), volatile organic compounds (VOCs), pharmaceuticals, and tritium. Fourteen of the 24 PFAS were detected in groundwater, with 60 and 20% of public-supply and domestic wells, respectively, containing at least one PFAS detection. Concentrations of tritium, chloride, sulfate, DOC, and manganese + iron; percent urban land use within 500 m of the wells; and VOC and pharmaceutical detection frequencies were significantly higher in samples containing PFAS detections than in samples with no detections. Boosted regression tree models that consider 57 chemical and land-use variables show that tritium concentration, distance to the nearest fire-training area, percentage of urban land use, and DOC and VOC concentrations are the top five predictors of PFAS detections, consistent with the hydrologic position, geochemistry, and land use being important controls on PFAS occurrence in groundwater. Model results indicate that it may be possible to predict PFAS detections in groundwater using existing data sources.

Entities: Chemical

Keywords: boosted regression tree; hydrologic position; land use; modeling; pharmaceuticals; tritium

Mesh：

Substances：

Year: 2022 PMID： 35113548 PMCID： PMC8970425 DOI： 10.1021/acs.est.1c04795

Source DB: PubMed Journal: Environ Sci Technol ISSN： 0013-936X Impact factor: 9.028

Introduction

The contamination of groundwater with perfluoroalkyl and polyfluoroalkyl substances (PFAS) is a concern in many countries[1−4] because PFAS persist in the environment,[5−8] PFAS sources are widespread,[5,9,10] and some PFAS are known or suspected to be associated with adverse human-health effects.[11−13] Moreover, PFAS in the unsaturated zone can be sources to underlying groundwater systems for decades,[7,14] making PFAS in groundwater a long-term public-health concern. The United States does not currently (2022) have nationally enforceable drinking-water standards for PFAS, but in 2016, the U.S. Environmental Protection Agency (EPA) established a drinking-water health advisory level of 70 ng/L for the combined concentrations of perfluorooctane sulfonate (PFOS) and perfluorooctanoate (PFOA),[15] two of the most commonly measured PFAS in groundwater. Conceptually, the occurrence of PFAS in groundwater should be controlled by land-use (source), hydrologic (transport), and biogeochemical (fate and transport) factors. Several local-scale studies have provided deeper understanding of the source, transport, and fate of PFAS in soils and groundwater by combining PFAS data with data for those factors.[7,16,17] A study in Massachusetts used data on groundwater-flow directions and measurements of dissolved oxygen (O2) and boron (B) to help differentiate PFAS in groundwater affected by wastewater effluent from that affected by a fire-training area.[7] A study in Japan used measurements of tritium (3H), pharmaceuticals, and personal care products to link PFAS in modern groundwater to sewage sources.[18] Measurements of benzene and PFAS and information on enhanced bioremediation activities related to the hydrocarbon contamination improved the understanding of PFAS precursor transformations at a fire-training facility in South Dakota.[17] We hypothesize that data for the controlling factors can improve our understanding of the occurrence of PFAS at regional and national scales, such that those data could be used to build large-scale statistical models that predict PFAS occurrence in unmonitored areas, as has been done for arsenic and nitrate.[19−21] Such models could be used to guide future sampling efforts and identify areas of high risk for human exposure. At least one study demonstrated the value of PFAS-source information in predicting PFAS occurrence in drinking-water supplies at the national scale.[10] A state-wide study in New Hampshire showed that data on hydrology and PFAS sources can also be predictive of PFAS occurrence in groundwater.[22] We are aware of one other study that combined source, hydrologic, and geochemical data to test their power to predict PFAS occurrences in groundwater at a regional scale (state of California).[23] Our study expands this type of analysis to a different region of the United States and includes additional geochemical parameters found to be useful in understanding and predicting PFAS in groundwater (e.g., dissolved organic carbon (DOC) and pharmaceuticals). The purpose of this article is threefold: (1) explore hydrologic, geochemical, and land-use controls on PFAS occurrence in groundwater used as a source of drinking water in the eastern United States, (2) examine chemical co-occurrences with PFAS that could improve understanding of PFAS occurrences, and (3) identify which hydrologic, geochemical, and land-use factors are the strongest predictors of PFAS occurrence using boosted regression tree (BRT) models. Well depth and tritium (3H) provide information on the hydrologic position of samples in the groundwater-flow systems. pH, iron [Fe], B, DOC, pharmaceuticals, VOCs, and other parameters provide information on geochemical conditions, co-occurring chemicals, and potential PFAS sources. Geospatial data provide locations for potential PFAS sources like fire-training facilities, landfills, airports, military bases, and other features.

Materials and Methods

Well Networks

In 2019, 254 wells were sampled in seven networks (Figures and S1; Table S1). The well networks cover 2383 to 164,619 km2 in five principal-aquifer systems: Glacial, Mississippi Embayment, Southeastern Coastal Plain, Stream Valley, and Surficial aquifer systems. The networks were established by the U.S. Geological Survey’s National Water-Quality Assessment (USGS NAWQA) project in 1999 to 2019 and are composed of public-supply (64%), domestic (19%), monitoring (12%), and other (5%) well types (Table S1). A stratified random approach was used to select wells for sampling. Hydrogeologic units that are important sources of water supply were targeted for sampling (stratification step; Table S1). To facilitate random well selection, each hydrogeologic unit was subdivided into 30 to 60 equal-area cells, and one well in each cell was randomly selected to sample from a population of existing wells.[24] All wells within the NECBS network are water-table monitoring wells installed in 1999 by randomly selecting one location in each equal-area cell.

Figure 1

Locations of aquifer systems and well networks. Wells shown with white symbols indicate that PFAS were not detected, and those shown with other colors indicate that PFAS were detected. See Figure S1 for detailed maps of the networks. In 2012, median percentages of urban land use around the wells ranged from ∼48 (MIAM) to 92% (NECBS) (Table S1).[25] Three well networks located in the Glacial aquifer system represent valley-fill aquifers (MIAM) and relatively shallow (NECBS) and deep (NECBD) parts of the stratified-drift aquifer. Metropolitan areas in those networks include Dayton (MIAM) and Boston (NECBS; NECBD). Network MISE represents the Memphis Sand aquifer in the Mississippi Embayment aquifer system. The Memphis metropolitan area is a major user of groundwater from the Memphis Sand aquifer. Network MOBL represents the Black Warrior River aquifer in the Southeastern Coastal Plain aquifer system. Network STRV represents the Ohio River alluvial aquifer in the Stream Valley aquifer system. Metropolitan areas in STRV include Louisville, Cincinnati, and Pittsburgh. Network SURF represents the Surficial aquifer system. Metropolitan areas in SURF include Fort Lauderdale and West Palm Beach. Networks were sampled as a part of the USGS NAWQA, National Hydrologic Monitoring project.

Sample Collection and Laboratory Analysis

Standard USGS protocols were used to collect groundwater samples from the wells prior to any treatment, blending, or pressure tanks.[26,27] Samples for major-ion, nutrient, trace-element, and DOC analyses were filtered (0.45 μm Versapor filters) and acidified with nitric acid (major cations and trace elements) or sulfuric acid (DOC), and/or chilled (nutrients, DOC) in the field. Pharmaceutical samples were filtered (0.7 μm glass fiber) and chilled in the field.[28] Samples for VOC analysis were unfiltered, acidified with hydrochloric acid, and chilled in the field. 3H samples were unfiltered and unpreserved. Samples for PFAS analysis were unfiltered and chilled in the field. Details of the PFAS sampling protocols are provided in Section S1. Major-ion, nutrient, trace-element, DOC, pharmaceutical, and VOC samples were analyzed at the USGS National Water-Quality Laboratory in Lakewood, Colorado. Major cations were analyzed by inductively coupled plasma-atomic emission spectroscopy.[29] Major anions were analyzed by ion chromatography.[29] Trace elements were analyzed by inductively coupled plasma-mass spectroscopy.[30] DOC was analyzed by UV-promoted persulfate oxidation and infrared spectrometry.[31] Pharmaceuticals were analyzed by liquid chromatography/tandem mass spectrometry using an electrospray ionization source operated in the positive ion mode.[32] VOCs were analyzed by purge-and-trap gas chromatography/mass spectrometry.[33]3H was analyzed at the USGS Tritium Laboratory in Menlo Park, California, using electrolytic enrichment and liquid-scintillation counting.[34] PFAS were analyzed at the Orlando, Florida SGS Laboratory by liquid chromatography/tandem mass spectrometry with isotope dilution (see Section S2 for details).[35] Analyzed compounds include 11 perfluoroalkyl carboxylates (PFCAs), 7 perfluoroalkyl sulfonates (PFSAs), perfluorooctane sulfonamide, 2 perfluorooctane sulfonamidoacetates, and 3 fluorotelomer sulfonates (FTS). Reported concentrations are the sum of linear and branched isomers. The chemical data are listed in Table S2, along with PFAS-specific reporting levels (3.8 to 40 ng/L), and are available in ref (36).

PFAS Quality-Control Data and Analysis

Details about the evaluation of quality-control samples associated with this study are provided in Section S3 and Tables S3–S6. Briefly, field and laboratory blanks were used to examine potential sources of contamination. No PFAS detections were reported for any blank types collected in the field, with the exception of one perfluorobutanoate (PFBA) detection reported for a field blank; detection of PFBA in the associated laboratory method blank indicated that this result was likely affected by contamination at the laboratory. Laboratory-blank results prompted censoring of reported detections for PFBA in 15 groundwater samples in two analytical batches and for a detection of 6:2 FTS in one groundwater sample in a separate batch. Examination of data for routine laboratory reagent spikes and matrix spikes indicated little bias (all median recovery values for PFAS between 85 and 103%); reagent spikes exhibited low variability in recovery. Field replicates and laboratory matrix spike duplicates indicated low variability in PFAS detection and (or) concentration.

Geospatial Data

Geospatial data from publicly available sources and from U.S. Government proprietary sources were used to analyze spatial relations between PFAS detections in groundwater and potential sources of PFAS (Table S7). The potential PFAS sources, data availability, and geospatial analysis are described in Section S4. Five-hundred-meter circular buffers around the wells were used to extract and assign selected land-use data to the wells to examine relations between water chemistry and land use (see Section S4 for more information).

Statistical Methods

Mann–Whitney and Kruskal–Wallis tests, as implemented in the software OriginPro 2018,[37] were used on ranked data to test for significant differences in concentrations between selected geochemical groups. Spearman correlation analysis was used to examine relations between concentrations and other variables. An α value of 0.05 was used for each test.

BRT Modeling

BRT models were fit to 57 variables including water-quality parameters, land use, and distance from wells to identified places that are potential sources of PFAS to the environment (Tables S2 and S8). Laboratory results for PFAS were converted to a binary (detect or nondetect), and models were fit using methods like those described in ref (19). Models were developed using the R computing environment (R Team. R: A language and environment for statistical computing version 4.0.3. https://www.R-project.org). The dataset was split into training data (80%) and holdout data (20%) to evaluate model performance (Section S5). The model parameters adjusted during tuning were interaction.depth (the number of levels of trees), n.minobsinnode (the minimum number of observations in terminal nodes), shrinkage (the learning rate), and n.trees (number of trees in the model). Metrics for model performance were calculated using fivefold cross validation and include accuracy, sensitivity, specificity, and the area under the receiver operator characteristics curve (ROC) (Section S5 and Tables S9 and S10). Accuracy is the percentage of total correct predictions, sensitivity is the percentage of successfully predicted detections (true positives), and specificity is the percentage of successfully predicted nondetections (true negatives). The ROC curve is plotted as the true positive divided by the false positive rate for a range of probability thresholds, including the 0.5 threshold chosen. A binary variable indicating whether any of the 24 PFAS were detected was used as the primary dependent variable. However, for comparison, multiple models were constructed with the dependent variable being a binary variable indicating the detection of PFOS, PFOA, PFBS, long chain-length perfluoroalkyl acids (PFAA), short chain-length PFAA, PFCA, or PFSA. Long-chain PFCAs and PFSAs are those with ≥7 and ≥6 perfluorinated carbons, respectively.[38]

Results and Discussion

PFAS Occurrence in Groundwater

Fourteen of the 24 analyzed PFAS were detected at least once in groundwater samples (Tables S2 and S11). At least one PFAS was detected in 54% of the samples (n = 254), and ≥2 PFAS were detected in 47% of the samples. For drinking-water wells, PFAS were detected in 60% of public-supply wells and 20% of domestic wells. At least two PFAS were detected in 53% of public-supply wells and 10% of domestic wells. The most commonly detected PFAS in our samples include the six measured by the EPA’s Third Unregulated Contaminant Monitoring Rule (UCMR3) program (perfluorobutane sulfonate [PFBS], perfluorohexane sulfonate [PFHxS], PFOS, perfluoroheptanoate [PFHpA], PFOA, and perfluorononanoate [PFNA]),[39] plus several other PFSA and PFCA compounds in the 4- to 9-carbon range, most notably PFBA, perfluoropentanoate (PFPeA), and perfluorohexanoate (PFHxA) (Figure A). PFOA and PFOS represent two of the three most frequently detected PFAS, and 2.4% (n = 6) of the samples have PFOA+PFOS concentrations greater than the 70 ng/L health advisory level (all are from public-supply wells). Eight of the 10 undetected PFAS are PFAA precursors (n = 5) or PFCA with 12+ carbon atoms (n = 3) (Table S11).

Figure 2

(A) Detection frequency for perfluoroalkyl sulfonates (PFSA) and perfluoroalkyl carboxylates (PFCA) in relation to the carbon number, based on all data, (B) ΣPFAS in relation to PFAS detection frequency by the well network, and (C) ΣPFAS in relation to the number of PFAS detected in samples. In (B), the horizontal lines show median concentrations; n is the number of samples with at least one PFAS detection. In (B and C), ΣPFAS is the summed concentration of detected PFAS. Substantial differences in PFAS detection frequencies and summed concentrations of detected PFAS (ΣPFAS) are observed between the well networks. PFAS detection frequencies range from 3.7 (MISE network) to 92.9% (NECBS) (Figure B). Although the wells in NECBS are monitoring wells screened near the water table, not wells that supply drinking water, the data from NECBS are important because they provide information on the quality of recharge that may eventually reach the deeper public-supply wells in NECBD. ΣPFAS medians range from 2.2 (MISE) to 40.0 ng/L (SURF) (Figure B). MOBL has a relatively high ΣPFAS median despite its low detection frequency; nevertheless, there is a significant positive correlation (rho = 0.86; p = 0.014) between detection frequency and ΣPFAS medians. The highest ΣPFAS (1645 ng/L) occurs in a public-supply well from STRV. Relatively little is known about potential effects of complex PFAS mixtures in drinking-water sources on human health,[13,40] but better understanding of the composition of those mixtures could help inform toxicity studies. The relatively common occurrence of multiple PFAS in the samples has implications for ΣPFAS and the complexity of PFAS mixtures in groundwater, consistent with previous point-of-use drinking-water studies.[41,42] ΣPFAS exhibit a significant positive Spearman correlation with the number of PFAS detected in the samples (rho = 0.91; p < 0.001) (Figure C), also consistent with results from the UCMR3 program.[3] Networks NECBS, NECBD, and SURF have relatively large fractions of samples containing >6 PFAS (37 to 54%) (Figure S2A), which may be related to the large fractions of urban land use in 500 m buffers around those wells (74 to 92%) (Table S1). Not surprisingly, networks with the largest numbers of co-occurring PFAS (NECBS, NECBD, SURF, and STRV) also have the largest numbers of unique PFAS mixtures relative to the number of samples in the networks (Figure S2B). In NECBS, 68% of the samples contain combinations of two or more PFAS that are unique to that sample. Overall, three PFAS occur in ≥80% of the mixtures (PFOA > PFBS > PFOS), but the dominant PFAS in mixtures vary between networks. Network NECBS has three PFAS that occur in ≥80% of the mixtures (PFOA > PFOS > PFBS), whereas SURF has 7 PFAS that occur in ≥80% of the mixtures (PFOA > PFHxS > PFOS = PFHxA > PFBS = PFPeA = PFHpA) (Figure S2C).

Hydrologic Controls on PFAS Occurrence

Hydrologic characteristics of groundwater systems could influence the occurrence of PFAS in groundwater, yet data describing those characteristics are not commonly examined in regional PFAS studies.[22] The position of a groundwater sample in the flow system relative to the land surface and the age of groundwater relative to the age of PFAS sources are important with respect to PFAS occurrence in groundwater because PFAS are derived from modern land-surface sources. Data for well depth and 3H in groundwater indicate that the samples collected for this study represent a broad spectrum of hydrologic positions and age categories. There are significant differences in well depths between the networks (Figure S3A), with median depths ranging from 7.7 (NECBS) to 88.4 m (MISE). There are also significant differences in 3H concentrations between the well networks (Figure S3B), with median concentrations ranging from 0.1 tritium units (TU) (MISE) to 4.7 TU (STRV). Generally, 3H concentrations less than about 0.1 to 0.5 TU in groundwater collected in 2019 indicate that the groundwater was recharged before the start of above-ground nuclear weapon testing in 1953 (referred to as premodern water in this article).[43] More precise estimates of the threshold 3H concentration for premodern water depend on a sample’s location and collection date due to spatial variations in 3H concentrations in precipitation and radioactive decay. Groundwater 3H data are used to assign the samples to one of three age categories using previously developed methodology and datasets for 3H in U.S. precipitation:[43,44] premodern (pre-1953 recharge), modern (recharged during or after 1953), or mixed (mixture of premodern and modern recharge). Threshold 3H concentrations (TU) for the premodern and modern age categories are 0.3, 2.8 (NECBS; NECBD); 0.2, 2.3 (MIAM); 0.2, 1.9 (STRV); 0.1, 0.6 (SURF); and 0.1, 1.2 (MISE; MOBL). The cutoff date for premodern water (pre-1953) is close to the date when PFAS started to be widely used (∼1950);[9] thus, knowing age categories of the samples could help characterize the risk of groundwater contamination with PFAS. Samples of modern and mixed-age groundwater contain at least some water recharged after the start of widespread PFAS use, whereas samples in the premodern age category were probably recharged prior to the widespread use of PFAS. Moreover, modern samples are from wells with significantly shallower depths than premodern samples (Figure S2C). Only the shallowest network (NECBS), with the highest PFAS detection frequency (Figure B), has 100% of its samples assigned to the modern age category (Figure A and Table S2). Two other networks (NECBD; MIAM) have >95% of their samples assigned to the modern category. The four remaining networks have samples in all three age categories, with MISE and MOBL having 44 to 54% of their samples in the premodern age category and the lowest PFAS detection frequencies.

Figure 3

(A) Percentage of samples in each of three groundwater age categories (modern, mixed, and premodern), by well networks and (B) percentage of PFAS detections that occur in each age category. See the text for 3H concentrations used to define age categories in each well network. The relations between depth, age categories, and onset of widespread PFAS use imply that the likelihood of detecting PFAS in groundwater decreases with increasing well depth. For example, network NECBS is characterized by very shallow wells, high fraction of modern water, and a high PFAS detection frequency, whereas MISE is characterized by deep wells, low fraction of modern water, and a low PFAS detection frequency (Figures B, 3A and S3A). Overall, samples that contain a PFAS detection have significantly shallower well depths and higher 3H concentrations than samples with no detections (Table S12), consistent with the hydrologic position and groundwater age being controlling factors for PFAS occurrence in groundwater. In samples that have both 3H data and PFAS detections (n = 136), 92% of the detections occur in modern water and about 99% occur in samples with at least some modern water (modern + mixed) (Figure B). The level of protection against PFAS contamination provided by depth and age could decrease over time as shallow groundwater moves deeper into an aquifer system.

Geochemical Controls on PFAS Occurrence

Geochemical characteristics of groundwater like pH and concentrations of DOC, divalent cations, and chloride (Cl) could affect PFAS sorption to aquifer solids. PFAS sorption, for example, can be greater under conditions of low pH,[7,45] elevated concentrations of divalent cations,[46,47] or lower concentrations of DOC and Cl.[47−49] Reductive dissolution of manganese (Mn) and Fe oxides in anoxic groundwater could reduce the sorption capacity of solids or mobilize PFAS already adsorbed to those solids.[7] Values of pH and concentrations of selected inorganic constituents and DOC indicate that there are significant differences in geochemical characteristics between the networks (Figure S4). For example, median pH values range from 5.6 (NECBS) to 7.1 (STRV). Median Ca + Mg, DOC, and Mn + Fe concentrations range from 2.88 (MOBL) to 128 (MIAM) mg/L, <0.23 (MISE) to 7.6 (SURF) mg/L, and 0.8 (MISE) to 357 (SURF) μg/L, respectively. Concentrations of DOC, Cl, SO4, Ca + Mg, and Mn + Fe are significantly higher in samples containing PFAS detections than in samples with no detections; however, pH is not significantly different between the two PFAS groups (Table S12). The similarity in pH between the two PFAS groups is due in part to high PFAS detection frequencies in networks with low (NECBS, NECBD median pH = 5.9) and high (STRV, SURF median pH = 7.0) pH values. In those networks, PFAS-source characteristics may be more dominant controls on PFAS occurrence than pH. Higher DOC concentrations in samples containing PFAS detections are consistent with previous studies indicating that elevated DOC concentrations could promote PFAS mobility through electrostatic or hydrophobic interactions in the dissolved phase or by competing with PFAS for sorption sites.[48,49] Competition for sorption sites between negatively charged PFAS and inorganic anions could also help explain the higher Cl and SO4 concentrations in samples containing PFAS detections.[47,48] In contrast, previous studies indicate that increasing concentrations of divalent cations could increase PFAS sorption by reducing the negative charge on soil surfaces, thereby reducing the electrostatic repulsion between the surface and negatively charged PFAS.[45] The higher Ca + Mg concentrations in samples containing PFAS detections appear to be inconsistent with those results, but the observation could be influenced by Cl and SO4 concentrations, given the competing effects of anions and cations on PFAS sorption. The relation between PFAS detections and Ca + Mg appears to be more consistent with previous studies when Ca + Mg concentrations are normalized to Cl or SO4 concentrations. Ca + Mg/Cl and Ca + Mg/SO4 ratios are significantly lower in samples that contain PFAS detections than in samples with no detections (Table S12). The relation between PFAS detections and Mn + Fe is consistent with reductive dissolution of Mn and Fe oxides in anoxic groundwater that could reduce the sorption capacity of solids or mobilize PFAS already adsorbed to those solids. Concentrations of Mn + Fe, Mn, and Fe are not significantly correlated with urban land use, but there is a significant inverse correlation (p < 0.001) between well depth and Mn (but not Fe). Thus, the PFAS, Mn + Fe relation could also more generally reflect higher pollution loading near the land surface in some settings that results in more reducing conditions and Mn reduction.

Land-Use Controls on PFAS Occurrence

Urban areas could be expected to have a greater density of potential PFAS sources compared to agricultural and undeveloped landscapes, although biosolids are recognized as a potential PFAS source on agricultural lands.[50] There is a significantly larger percentage of urban land within 500 m of wells that contain PFAS detections than near wells with no detections (Table S13), whereas there are significantly smaller percentages of agricultural and undeveloped lands within 500 m of wells that contain PFAS detections than near wells with no detections. The relation between PFAS and urbanization is important because it implies that the risk of PFAS contamination of groundwater could increase as urbanization encroaches on the agricultural and undeveloped land. More specific to PFAS occurrences in groundwater than land use is information on potential PFAS sources near the sampled wells. U.S. Government proprietary databases provided this study with geospatial information for facility categories that could be potential PFAS sources (Table S8). Among the source categories are public-use airports, chemical manufacturing facilities, fire-training facilities, and landfills. These source categories and most others listed in Table S8 are significantly closer to wells that contain PFAS detections than to wells with no detections (Table S13). Moreover, the cumulative number of potential PFAS sources <5 km from the sampled wells is significantly higher for wells containing PFAS detections than for wells with no detections. It is unknown which, if any, of the potential sources listed in Table S8 used or released PFAS to the environment; nevertheless, the results are consistent with the conceptual model of PFAS concentrations in groundwater increasing with decreasing distance to PFAS sources and increasing density of PFAS sources. Similar spatial relations have been observed in other regional studies.[10] Differences in the number of unique PFAS mixtures in the well networks could be related to the diversity of PFAS sources in the networks that is not captured in general land-use data. NECBD has a larger fraction of samples with unique PFAS mixtures than SURF (Figure S2B), even though SURF (86%) has a larger fraction of urban land within 500 m of its wells than NECBD (74%). The number of potential PFAS sources within 5 km of NECBD wells is significantly higher than the number within 5 km of SURF wells (Figure S5A). NECBD and SURF appear to have similar numbers of septic systems in 500 m buffers around their wells (Figure S5B), based on estimates of N input to septic systems in the United States,[51] which is used here as a proxy for septic-system density. Septic systems are a suspected source of PFAS in groundwater.[52] Differences in the number and composition of PFAS mixtures between the networks could also be affected by PFAS degradation and sorption processes. As previously discussed, the differences in geochemical characteristics of groundwater in the networks could result in different amounts of PFAS sorption to aquifer solids in the networks.

PFAS Co-Occurrence with Other Chemicals

Linking the PFAS detections in our samples to specific PFAS sources is beyond the scope of this study, given its regional scale. Nevertheless, PFAS co-occurrences with other chemicals can provide useful information about the chemical complexity of PFAS-contaminated groundwater and, more generally, about PFAS sources. VOC and pharmaceutical detection frequencies are substantially higher in samples that contain PFAS detections (62 and 33%, respectively) than in samples with no detections (21 and 2%, respectively) (Table S12). For samples from public-supply wells that have PFAS detections, 68, 37, and 24% also have at least one detection of a VOC, pharmaceutical, or VOC + pharmaceutical, respectively. ΣVOCs (median = 0.07 μg/L) and Σpharmaceuticals (median = 0.025 μg/L) are low, but their presence indicates that the chemical inventory in groundwater contaminated with PFAS is complex. This finding is important because estimating the toxicity of complex chemical mixtures in water can be challenging.[53] The top 10 VOCs co-occurring with PFAS include chlorodifluoromethane (CDFM), the most common co-occurring VOC, and 1,1-dichloroethane (Figure S6A). The co-occurrence of CDFM and 1,1-dichloroethane with PFAS was also noted in the UCMR3 dataset.[3] The most common co-occurring pharmaceuticals include carbamazepine > gabapentin > meprobamate (Figure S6B). The co-occurrence of PFAS with other chemicals does not necessarily indicate that they are from the same source, particularly in urban areas where independent sources could coexist in proximity to each other. PFAS and pharmaceuticals, however, are known to co-occur in sources associated with human-waste disposal (e.g., septic-system/landfill leachate and wastewater treatment plants).[10,54−56] Those types of sources could account for some of the PFAS in the 37% of PFAS-contaminated samples from public-supply wells that contain at least one pharmaceutical. In NECBD, 41% of the public-supply wells with PFAS detections also have at least one detection of a pharmaceutical. That network has a relatively high abundance of septic systems within 500 m of the wells (Figure S5B), and nitrate and B concentrations, for the most part, are significantly higher in NECBD samples that contain PFAS or pharmaceutical detections than in samples with no detections (Table S14). Nitrate and B can have elevated concentrations in groundwater due to contamination with septic leachate.[55] The overall dataset also appears to be consistent with at least some PFAS in the samples with co-occurring PFAS and pharmaceutical detections being associated with septic/landfill leachate, wastewater effluent, or stormwater runoff.[52,54,56,57] Samples with co-occurring detections of PFAS and pharmaceuticals have significantly higher concentrations of B and total N than samples that do not have co-occurring PFAS and pharmaceutical detections (Figure S7). All three sources can have elevated B and total N concentrations,[55,58,59] so they could produce similar chemical patterns with respect to PFAS, pharmaceuticals, B, and total N. VOCs known to be associated with PFAS sources could also be potentially useful PFAS-source indicators in groundwater. Fuel hydrocarbons like benzene co-occur with PFAS in groundwater in some fire-training areas.[17] CDFM, the most common co-occurring VOC in our dataset, is used to manufacture monomers like tetrafluoroethene, which is an important building block in fluoropolymer production. Historically, the polymerization process used PFOA and PFOS as emulsifiers.[60] Thus, co-occurrences of CDFM, PFOA, and PFOS in groundwater could indicate that some of the PFAS are from fluoropolymer manufacturing. CDFM has also been used as a replacement for chlorofluorocarbons in air conditioning, refrigerants, and aerosols, resulting in increasing atmospheric CDFM concentrations.[61] Atmospherically derived CDFM could enter groundwater in recharge that is unrelated to PFAS sources, although CDFM concentrations in air-saturated water (0.0009 μg/L at 20 °C)[61] are very low compared to the concentrations measured in this study. In this study, the median detected CDFM concentration is 0.075 μg/L, ∼85 times higher than the concentration in air-saturated water. PFAS detections co-occur in 90% of the samples that contain a CDFM detection, and 96% of the samples with co-occurring CDFM and PFAS also contain PFOA and/or PFOS. In the SURF network, where 70% of the CDFM detections occur, wells with co-occurring CDFM and PFAS detections are significantly closer to chemical manufacturing facilities (median distance = 4.3 km) than wells without co-occurring detections (median = 12.1 km) (Figure S8). We do not know if PFAS and CDFM were used or released to the environment at those facilities, but the chemical and spatial data are consistent with at least some of the PFAS in samples with co-occurring CDFM and PFAS detections being sourced from chemical manufacturing. SURF samples with co-occurring PFAS and CDFM detections and samples with only CDFM detections do not have significantly higher B or total N concentrations than samples without those detections (Figure S9), suggesting that the PFAS + CDFM co-occurrences are not associated with human-waste disposal. The regional analyses of co-occurring PFAS-pharmaceuticals and PFAS-CDFM provide internally consistent hypotheses about some PFAS being sourced from human-waste-disposal activities and chemical manufacturing, respectively, that need to be tested with data from local-scale studies.

BRT Models

Models were developed to predict PFAS detections using geochemical and geospatial data as predictor variables. The model selected, with the dependent variable being a binary variable indicating whether one or more of the 24 PFAS were detected, was constructed using 1000 trees, an interaction depth of 2, shrinkage (learning rate) of 0.002, and 10 as the minimum number of observations in terminal nodes of a tree. The accuracy, sensitivity, specificity, and ROC were 0.91, 0.93, 0.89, and 0.97 for the training data and 0.84, 0.96, 0.72, and 0.90 for the holdout data, respectively, indicating excellent model performance (Table S9). The relative importance of the model variables (Figure ) indicates that the top five predictors of PFAS detection are 3H concentration, distance to the nearest fire-training area, DOC concentration, percentage of urban land use, and ΣVOC. Variables potentially related to PFAS sorption (e.g., SO4 and Cl) are also among the top 10 predictors. Overall, the model results are consistent with the hydrologic position, groundwater geochemistry, and land use being important controls on PFAS occurrence in groundwater. 3H is rarely collected in assessments of PFAS contamination of groundwater, but given its apparent predictive power, inclusion of 3H in such efforts seems warranted.

Figure 4

Relative influence of potential predictor variables sorted from largest to smallest contribution to the model is shown in blue bars. Inset panels are partial dependence plots for tritium, distance to the nearest fire-training area, DOC, the percentage urban land use around each well, and the sum of detected VOC concentrations. Partial dependence plots show the relationship between probability of PFAS detection and the predictor variable. Blue tick marks along the x axis of the inset panels indicate the variable minimum, maximum, and deciles of the model training dataset. Plateaus in the inset graphs are usually in areas with little or no data. FRS, facility registry service. Partial dependence plots can help visualize the relationship between the probability of detection and the predictor variable.[62] Partial dependence plots (Figure ) indicate increasing probability of PFAS detection with increasing 3H, DOC, and VOC concentrations and increasing urban land-use percentages. Conversely, the probability of PFAS detection decreases with increasing distance from a fire-training area. These partial dependence plots are in line with expectations discussed in this article. Higher 3H concentrations are indicative of more modern water, urban areas have been associated with PFAS occurrence, and fire-training areas are known point sources of PFAS.[7,17,63] As discussed above, VOC detection frequencies and DOC concentrations were higher in samples containing PFAS detections: these associations are reflected in the model, which found that DOC and VOCs have large relative influence on the model. Different models were constructed with the dependent variable being a binary variable indicating the detection of PFOS, PFOA, PFBS, long chain-length PFAA, short chain-length PFAA, PFCA, or PFSA. In all scenarios, 3H, DOC, and VOC concentrations and distance to the nearest fire-training area were in the top 10 most influential predictor variables (Table S10). The percentage of urban land use was also in the top 10 most influential predictor variables for all scenarios except for the model predicting PFOA, where it was number 11. These results indicate robustness of the model. The model developed here indicates that it may be possible to predict PFAS detections in drinking-water wells using data sources already available, consistent with another recent study that used statistical models to predict susceptibility of private wells to PFAS contamination.[22] The USGS National Water Information System (NWIS)[64] contains many years of water-quality data, including the chemical parameters used here, from wells across the U.S. Distance metrics could likewise be developed from existing geospatial datasets.

Limitations and Implications

The data represent a conservative estimate of human exposure to PFAS because samples were collected prior to treatment. Nevertheless, many water-treatment processes do not effectively remove PFAS.[41,42] Also, treatment is often absent for households that rely on private domestic wells as a source of drinking water. Thus, the PFAS results are likely to be relevant to human exposure at the tap. Although the dataset represents a broad spectrum of hydrogeologic settings in the eastern United States, important aquifer systems like the Northern Atlantic Coastal Plain system and the fracture-flow dominated Floridan carbonate-rock and Piedmont crystalline-rock systems are not represented. Likewise, the 24 PFAS analyzed here may not represent the total PFAS inventory in groundwater, given the large number of PFAS reported in commercial use and the environment.[9] Nevertheless, the dataset provides insight into the relations between PFAS in groundwater and large numbers of hydrologic, geochemical, and geospatial variables. Broadly, groundwater affected by modern anthropogenic activity appears to be associated with PFAS, given significant relations between PFAS detections and variables such as 3H, urban land use, VOCs, and pharmaceuticals. BRT modeling indicates that it is possible to predict PFAS occurrence based on the explanatory variables investigated here. Therefore, future sampling related to PFAS may consider adding targeted analyses such as 3H, DOC, SO4, and Cl to build comprehensive datasets that may allow for national prediction of PFAS occurrence.

39 in total

1. Pre/post-closure assessment of groundwater pharmaceutical fate in a wastewater-facility-impacted stream reach.

Authors: Paul M Bradley; Larry B Barber; Jimmy M Clark; Joseph W Duris; William T Foreman; Edward T Furlong; Carrie E Givens; Laura E Hubbard; Kasey J Hutchinson; Celeste A Journey; Steffanie H Keefe; Dana W Kolpin
Journal: Sci Total Environ Date: 2016-06-24 Impact factor: 7.963

2. A Mathematical Model for the Release, Transport, and Retention of Per- and Polyfluoroalkyl Substances (PFAS) in the Vadose Zone.

Authors: Bo Guo; Jicai Zeng; Mark L Brusseau
Journal: Water Resour Res Date: 2020-01-10 Impact factor: 5.240

3. Evidence of remediation-induced alteration of subsurface poly- and perfluoroalkyl substance distribution at a former firefighter training area.

Authors: Meghan E McGuire; Charles Schaefer; Trenton Richards; Will J Backe; Jennifer A Field; Erika Houtz; David L Sedlak; Jennifer L Guelfo; Assaf Wunsch; Christopher P Higgins
Journal: Environ Sci Technol Date: 2014-06-09 Impact factor: 9.028

4. Geochemical and Hydrologic Factors Controlling Subsurface Transport of Poly- and Perfluoroalkyl Substances, Cape Cod, Massachusetts.

Authors: Andrea K Weber; Larry B Barber; Denis R LeBlanc; Elsie M Sunderland; Chad D Vecitis
Journal: Environ Sci Technol Date: 2017-04-06 Impact factor: 9.028

5. 6:2 fluorotelomer sulfonate aerobic biotransformation in activated sludge of waste water treatment plants.

Authors: Ning Wang; Jinxia Liu; Robert C Buck; Stephen H Korzeniowski; Barry W Wolstenholme; Patrick W Folsom; Lisa M Sulecki
Journal: Chemosphere Date: 2010-11-26 Impact factor: 7.086

6. Influence of dissolved organic matter concentration and composition on the removal efficiency of perfluoroalkyl substances (PFASs) during drinking water treatment.

Authors: Dolly N Kothawala; Stephan J Köhler; Anna Östlund; Karin Wiberg; Lutz Ahrens
Journal: Water Res Date: 2017-05-24 Impact factor: 11.236

7. Per- and polyfluoroalkyl substances in landfill leachate: patterns, time trends, and sources.

Authors: Jonathan P Benskin; Belinda Li; Michael G Ikonomou; John R Grace; Loretta Y Li
Journal: Environ Sci Technol Date: 2012-10-23 Impact factor: 9.028

8. Investigating sources and pathways of perfluoroalkyl acids (PFAAs) in aquifers in Tokyo using multiple tracers.

Authors: Keisuke Kuroda; Michio Murakami; Kumiko Oguma; Hideshige Takada; Satoshi Takizawa
Journal: Sci Total Environ Date: 2014-05-09 Impact factor: 7.963

Review 9. Perfluoroalkyl and polyfluoroalkyl substances in the environment: terminology, classification, and origins.

Authors: Robert C Buck; James Franklin; Urs Berger; Jason M Conder; Ian T Cousins; Pim de Voogt; Allan Astrup Jensen; Kurunthachalam Kannan; Scott A Mabury; Stefan P J van Leeuwen
Journal: Integr Environ Assess Manag Date: 2011-10 Impact factor: 2.992

10. Application of a Framework for Grouping and Mixtures Toxicity Assessment of PFAS: A Closer Examination of Dose-Additivity Approaches.

Authors: Philip E Goodrum; Janet K Anderson; Anthony L Luz; Graham K Ansell
Journal: Toxicol Sci Date: 2021-01-28 Impact factor: 4.849