Literature DB >> 31244084

Nontarget Screening Reveals Time Trends of Polar Micropollutants in a Riverbank Filtration System.

Vittorio Albergamo1, Jennifer E Schollée2, Emma L Schymanski2,3, Rick Helmus1, Harrie Timmer4, Juliane Hollender2,5, Pim de Voogt1,6.   

Abstract

The historic emissions of polar micropollutants in a natural drinking water source were investigated by nontarget screening with high-resolution mass spectrometry and open cheminformatics tools. The study area consisted of a riverbank filtration transect fed by the river Lek, a branch of the lower Rhine, and exhibiting up to 60-year travel time. More than 18,000 profiles were detected. Hierarchical clustering revealed that 43% of the 15 most populated clusters were characterized by intensity trends with maxima in the 1990s, reflecting intensified human activities, wastewater treatment plant upgrades and regulation in the Rhine riparian countries. Tentative structure annotation was performed using automated in silico fragmentation. Candidate structures retrieved from ChemSpider were scored based on the fit of the in silico fragments to the experimental tandem mass spectra, similarity to openly accessible accurate mass spectra, associated metadata, and presence in a suspect list. Sixty-seven unique structures (72 over both ionization modes) were tentatively identified, 25 of which were confirmed and included contaminants so far unknown to occur in bank filtrate or in natural waters at all, such as tetramethylsulfamide. This study demonstrates that many classes of hydrophilic organics enter riverbank filtration systems, persisting and migrating for decades if biogeochemical conditions are stable.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31244084      PMCID: PMC6610556          DOI: 10.1021/acs.est.9b01750

Source DB:  PubMed          Journal:  Environ Sci Technol        ISSN: 0013-936X            Impact factor:   9.028


Introduction

Thousands of anthropogenic chemicals are released into the aquatic environment via wastewater treatment plant (WWTP) effluents, runoffs, and accidental spills.[1−5] Transformation products (TPs) formed during water treatment and under environmental conditions increase the complexity of the chemical mixtures that occur in the environment.[6−8] Freshwater systems are particularly vulnerable to contamination by polar organic micropollutants (MPs) exhibiting low or negative pH-adjusted log distribution coefficients (logD) as they preferentially partition into the water phase. When persistent, polar MPs can migrate indefinitely throughout the water cycle and reach drinking water sources.[2,9,10] In Europe, riverbank filtration (RBF) is a common drinking water pretreatment with potential to remove dissolved MPs mainly by sorption and biodegradation as surface water infiltrates through the hyporheic zone.[11−19] Sorption to organic matter can delay the transport of neutral and moderately hydrophobic MPs (logD > 3) in RBF systems.[7] The ion-exchange capacity of soils can result in the retention of cationic MPs, but it is not effective on anionic MPs. Biodegradation is favored by a redox potential gradient and long travel time, as they result in greater biodiversity of microbial communities and longer time for adaptation.[7,13,14,20−23] Liquid chromatography coupled to high-resolution tandem mass spectrometry (LC-HRMS/MS) is the preferred system to analyze most polar MPs in aqueous matrices. The capability of recent mass analyzers to achieve sensitive detection with high resolving power (>20,000) and high mass accuracy (<5 ppm) is pivotal to tentatively identify unknown ions via accurate mass spectra without the use of reference standards.[24−28] In environmental research, these approaches are known as suspect screening and nontarget screening (NTS). Suspect screening and NTS are increasingly being applied to environmental samples and are gradually becoming harmonized.[24−28] Suspect screening aims at identifying pollutants expected in a sample. Commonly, HRMS1 data (mass-to-charge ratios of ionized analytes) is searched for masses of interest suspected to occur (e.g., for study-specific reasons), typically included in a suspect list, and then accompanying isotopic (and adduct) peaks, HRMS2 spectra (fragment ions), and retention time (tR) are used to support the identification and confirmation of suspect hits.[24,25,28,29] Initiatives to improve screening efforts include platforms to share suspect lists, e.g., the NORMAN Suspect List Exchange and the U.S. Environmental Protection Agency (EPA) CompTox Dashboard, and openly accessible accurate mass spectral libraries, e.g., MassBank and MassBank of North America (MoNA). In contrast to suspect screening, NTS aims at identifying compounds without searching for certain masses/substances up front, but rather using the measured data to reveal the masses of interest. Since thousands of ions are acquired indiscriminately in HRMS1 full scans, a prioritization strategy is required to select masses of interest. Tentative structures are generally assigned based on candidate searching in chemical databases, e.g., PubChem and ChemSpider,[30,31] often using HRMS2 spectra.[26,28] State-of-the-art NTS benefits from the increased availability of computational tools for prioritization, e.g., statistical analysis methods,[32−35] and cheminformatics tools for high-throughput structure annotation, e.g., in silico fragmentation querying openly accessible chemical databases and accurate mass spectral libraries.[36] An overview on state-of-the-art cheminformatics tools for structure annotation can be found in the literature.[37−39] In this study, we investigated a natural drinking water source consisting of a riverbank filtrate originated from the Lek, a branch of the river Rhine in The Netherlands. Bank filtration at this site exhibits up to 60-year travel time from the riverbank to the furthest of a series of wells built by a drinking water utility. This site can be regarded as a hydrogeological archive, where persistent anthropogenic chemicals from the “post-1950s acceleration” to the present are preserved.[40] Our goal was to detect major pollution trends across the bank filtration transect and characterize the identities of mobile MPs by applying state-of-the art nontarget screening. To the best of our knowledge, no previous studies have attempted to investigate a time series of nontarget polar contaminants in a natural bank filtrate with such an extended travel time. Exposure to over a half-century of anthropogenic emissions from intensified industrial and agricultural activities followed by mitigation measures such as wastewater treatment upgrades in the 1990s make this bank filtration system a unique location to detect contamination time series and investigate persistent and mobile MPs in the aquatic environment using nontarget analytical approaches. The occurrence of these chemicals is rationalized and their emission sources discussed. Compounds not previously known to occur in bank filtrate were identified, including chemicals that were not known to occur in the water cycle at all.

Materials and Methods

Standards and Reagents

Detailed information on the analytical standards and reagents used for this study are included in the Supporting Information (SI) section S-1.

Sampling Site and Sample Collection

Anaerobic bank filtrate with residence times from 1 to 60 years was abstracted from a series of wells located in two adjacent production fields, named Schuwacht and Tiendweg, fed by the river Lek, a branch of the Rhine (Figure ). The traveling time at each well was previously investigated by the drinking water utility Oasen (Gouda, The Netherlands) by means of isotopic age dating and hydrogeological modeling.[41] The well fields are located in the municipality of Krimpen aan de Lek, South Holland, The Netherlands, and provide raw water for approximately 7000 m3 of drinking water per day. Further details on the riverbank filtration site including a map of the groundwater flow lines with modeled travel time are given in SI section S-2.
Figure 1

Map of The Netherlands showing the location of the abstraction wells used in the present study with well code and traveling time in parentheses

Map of The Netherlands showing the location of the abstraction wells used in the present study with well code and traveling time in parentheses Bank filtrate samples from nine wells of increasing travel time (n = 3) were collected in 5 L polypropylene bottles from sampling faucets built on each well and immediately transported to the University of Amsterdam, where they were kept in the dark at 2 °C and extracted next day.

Solid-Phase Extraction

The samples were allowed to reach room temperature, then 100 mL was transferred to 250 mL polypropylene bottles and spiked with 100 μL of a mix of 128 isotope-labeled internal standards (IS) available at Eawag at concentration of 1 ng/μL. This resulted in IS concentrations of 1 μg/L in the samples before enrichment by solid-phase extraction (SPE). An offline extraction protocol relying on hydrophilic–lipophilic balance (HLB) sorbent with Oasis cartridges by Waters (Etten-Leur, The Netherlands) was adapted from a procedure described elsewhere[42] for the enrichment of moderately hydrophobic and polar organics. The adjustments to the extraction protocol were the spike volume and concentrations of IS and the final concentration step, which in the present study resulted in an enrichment factor of 200 as the extracts were diluted 5-fold with deionized water prior LC-HRMS analysis.

LC-HRMS Analysis

The samples were analyzed at Eawag using LC-HRMS. A high-performance liquid chromatography system (HPLC) consisting of a PAL Autosampler (CTC Analytics, Zwingen, Switzerland), an Accela 1250 mixing pump (Thermo Fisher Scientific, San Jose, United States), and a Waters Xbridge C18 column (2.1 × 100 mm, 3.5 μm) was used. The mobile phase consisted of water (A) and methanol (B), both acidified with 0.1% formic acid. The gradient program expressed as A:B was 90:10 at 0 min, 50:50 at 4 min, 5:95 from 17 to 25 min, and 90:10 from 25.1 to 29 min. The flow rate was 200 μL/min and the column temperature was 30 °C. The sample injection volume was 20 μL. This LC method was adopted from a previous application to biologically treated wastewater,[32] where its effectiveness for a wide range of polar and moderately polar MPs was shown. Highly polar MPs may not be sufficiently retained and would require dedicated extraction and chromatographic methods. HRMS detection was achieved with a Q-Exactive hybrid quadrupole-Orbitrap (Thermo Fisher) equipped with an electrospray ionization source (ESI). HRMS1 spectra were acquired for masses ranging from m/z 100 to 1000, with a resolving power of 140,000 at m/z 200 and a mass error below 5 ppm. HRMS2 spectra were recorded in data-dependent mode with a resolving power of 17,500 (more details in SI section S-3.1). Separate analysis runs were conducted for positive and negative ionization modes with a spray voltage of +4 kV and −4 kV, respectively, and a capillary temperature of 300 °C. Confirmation of the prioritized structures was conducted at the University of Amsterdam. Reference standard materials and sample extracts were analyzed with an ultra-high-performance liquid chromatography (UHPLC) system (Nexera, Shimadzu, Den Bosch, The Netherlands) coupled to a maXis 4G q-ToF/HRMS equipped with an ESI source (Bruker Daltonics, Wormer, The Netherlands). Further details on this system are provided in SI section S-3.2).

Nontarget Screening Workflow

The NTS workflow consisted of three steps dealing with (i) HRMS1 data preprocessing, (ii) prioritization, and (iii) structure elucidation. Unless stated otherwise, the steps were automated and computed within R (v 3.3.2).[43] For data preprocessing, the analyses raw files were converted to centroided mzXML format with ProteoWizard (v 3.0)[44] and imported into enviMass (v 3.4).[45] The enviMass settings used for this study are given in SI section S-4). Separate projects were created for positive and negative ESI data. Peak picking was performed to determine the nontarget features, i.e., unique m/z and retention time (tR) pairs. The 128 isotope-labeled ISs were used for mass recalibration, tR alignment, and intensity normalization; i.e., for each measurement, the intensities of the picked peaks were normalized using the median deviation of all ISs from their individual median profile intensity.[46] The features were profiled; i.e., unique IDs were assigned to m/z and tR pairs detected across different samples. Features detected in analysis blanks, procedural blanks, and deionized water samples spiked with isotope-labeled ISs (termed blind samples) were profiled and subtracted. Profiled features that were not detected in all three replicates or whose tR was <2 min and >24 min were filtered out. The tR filter ensured that high polarity (tR < 2 min) and low polarity organics (tR > 24 min) were excluded from the data processing, as data of substances eluting either early or late are known to be of lesser quality with the existing chromatographic method, making assignment of (quasi-)isobaric substances challenging. For positive data, grouping of the most common single-charge ESI adducts, i.e., [M + H]+, [M + Na]+, [M + NH4]+, and [M + K]+, was additionally performed to define components.[46] Other positive adducts, negative adducts, and isotopic peaks may increase the final number of profiled features, but were not included here. The profiled features were prioritized by hierarchical cluster analysis (HCA) based on successful applications to lake sediments[33] and to ozonation in a WWTP.[34] Briefly, profile intensities were normalized to the maximum value detected in the whole data set, and dissimilarities expressed as Euclidean distance were calculated with the stats package (daisy function). The hierarchical classification of the profiled features was computed with the cluster package (hclust function). More details on the application of HCA to profile prioritization can be found elsewhere.[33,34] The optimal number of clusters (k) was investigated by silhouette analysis computed with the cluster package (silhouette function). The average silhouette width, i.e., a dimensionless value indicating whether an object truly belongs to the cluster it was assigned to, was calculated at different k values.[47] Once the optimal number of clusters was defined, the 15 most populated clusters were considered for further prioritization. In order to obtain good quality spectra for structure annotation, the top 50% most intense ions in each cluster were prioritized and their experimental HRMS1 and HRMS2 data extracted using the RMassBank package.[48] Tentative structure elucidation was performed with MetFrag command line v 2.3 in batch mode.[36] Neutral monoisotopic masses of the prioritized features were used to retrieve structures within 5 ppm mass accuracy from ChemSpider.[31] The maximum number of candidates per feature was set to 5000. The maximum tree depth (MSn) was set to 2. The candidate structures retrieved by MetFrag are fragmented in silico in a combinatorial manner and the fragments matched to the experimental HRMS2 spectra. Additionally, spectral similarities with records in the MoNA spectral library built into MetFrag were calculated using the MetFusion approach.[49] All suspect lists available on the NORMAN Suspect List Exchange as of November 2016 were merged into one large list of 11,922 unique InChIKey codes. The candidate structures were scored based on seven terms: FragScore (in silico fragmentation score); MetFusionOffline (MoNA spectral similarities using the MetFusion approach[49]); CSRefsScores (4 scoring terms: reference count on ChemSpider, reference count on PubMed, reference count on Royal Society of Chemistry and ChemSpider data sources); Suspects (hit in suspect list). The seven scoring terms were normalized by the highest value found among the proposed candidates, and an equal weighting of 1 was used to calculate the MetFrag Combined Score. An in-house script was used to check for agreement of formulas as calculated by MetFrag and the GenFormR package.[50] Tentatively annotated spectra and extracted ion chromatograms (EICs) of the nontarget features were plotted with the packages ReSOLUTION(51) and patRoon,[52] respectively, for quality control. Finally, identification confidence levels were assigned to all the tentatively annotated features.[29]

Results and Discussion

Data Preprocessing, Prioritization, and Structure Annotation

An overview on detection, mass deviation, and intensity ranges of the ISs is given in SI section S-5. Preprocessing of HRMS1 spectra with enviMass resulted in 10,850 positive and 7412 negative profiled features across the transect. The HCA results were visualized by plotting heat maps and dendrograms, shown in Figure .
Figure 2

Heat maps of the clustered profile intensities across the riverbank filtration transect in positive (panel a) and negative (panel b) ESI data. Well codes are shown on the x-axes and represent the time line from 1-year (LS-P12) to 60-year (LT-P18) -old water. Dendrograms are shown on the y-axes, where the prioritized clusters are marked in red. The color scale used for profile intensities is illustrated in the legend (upper left).

Heat maps of the clustered profile intensities across the riverbank filtration transect in positive (panel a) and negative (panel b) ESI data. Well codes are shown on the x-axes and represent the time line from 1-year (LS-P12) to 60-year (LT-P18) -old water. Dendrograms are shown on the y-axes, where the prioritized clusters are marked in red. The color scale used for profile intensities is illustrated in the legend (upper left). The HCA classified intensity profiles based on similarities between the detected trends. The results revealed that the data in both polarities were characterized by a series of dynamic trends (clustered at the top of the heat maps) with a high-intensity region between wells LT-P09 and LT-P11, thus in water that originated from the river Lek throughout the 1990s. Additionally, each well seemed to display a set of unique features, visible on the heat maps as high-intensity spots. In the positive data, the spots displayed overall increasing dissimilarity with increasing RBF travel time (Figure a), whereas this behavior was not observed in the negative data (Figure b). While it could not be excluded that some compounds would occur only in one well, these profiles might result from detection above the peak-picking threshold value exclusively in one well or from lack of detection in all three replicates from adjacent wells. The optimal number of clusters (k) was determined by silhouette analysis. The average silhouette width values for k = 80 were 0.690 and 0.687 for positive and negative data, respectively, indicating reasonable separation of the data.[47] The clusters were sorted by the number of profiled features and the 15 most populated clusters were inspected. Inspection of the prioritized clusters is discussed in the following section (see Interpretation of Clustered Time Trends). The prioritized clusters were populated with 7525 positive and 5123 negative profiled features, of which 3764 and 3845 were prioritized based on their intensity. The RMassBank package was used to extract experimental HRMS1 and HRMS2 information, resulting in 1348 and 983 positive and negative profiled features, respectively, with associated fragmentation data. It should be noted that a higher percentage fraction of negative features was prioritized from the 15 most populated clusters on the basis of their intensities, as only about 25% of these ions triggered an HRMS2 acquisition. An overview of the population of each prioritized cluster, the number of features prioritized from these clusters and how many ions triggered HRMS2 acquisitions is shown in SI section S-6. Candidate structures were assigned to 884 and 550 positive and negative features, respectively, using MetFrag, whereas 369 positive and 345 negative features were excluded from further identification efforts as the maximum number of candidate structures was exceeded (set to 5000 to reduce runtime issues and eliminate cases with poor likelihood of success, based on experience). The MetFrag results were initially reduced by filtering out candidates with combined score lower than 4 (out of 7) and without any explained HRMS2 peaks. The minimum score filtering criterion proved too strict for negative data, which displayed lower scores overall, mostly due to lack of metadata, poor fit of the in silico fragments to the experimental data, or absence from the suspect list. Consequently, for negative data the minimum score was set to 3, including at least one explained HRMS2 peak. The MetFrag Scores plots were visualized for all tentatively annotated features to ease the assessment of the MetFrag results. An example of MetFrag Scores plot for the positive feature m/z 116.0165 detected at tR 2.5 min is provided in Figure a.
Figure 3

Panel a: MetFrag Scores plot of candidate structures to elucidate feature m/z 116.0165 at tR 2.5 min detected in the positive ESI data. CombScore: Combined Score; NoExplPeaks: number of explained peaks; FragScore: in silico fragmentation score; MetFusionOffline: score for spectral similarities against MassBank of North America (MoNA); AVGCSRefs: average ChemSpider references score (only averaged in the plots); Suspects: hit in suspect list. All plotted scores were normalized to 1 except the MetFusionOffline score (which was normalized during score calculation, but not during plotting, for diagnostic purposes). Panel b: HRMS2 spectrum of methylisothiazolinone tentatively annotated by MetFrag to elucidate m/z 116.0164 [M + H]+. Experimental HRMS2 (MSMS, black line); explained MSMS (ExplMSMS, red dashed line); HRMS1 (MS1, green dashed line).

Panel a: MetFrag Scores plot of candidate structures to elucidate feature m/z 116.0165 at tR 2.5 min detected in the positive ESI data. CombScore: Combined Score; NoExplPeaks: number of explained peaks; FragScore: in silico fragmentation score; MetFusionOffline: score for spectral similarities against MassBank of North America (MoNA); AVGCSRefs: average ChemSpider references score (only averaged in the plots); Suspects: hit in suspect list. All plotted scores were normalized to 1 except the MetFusionOffline score (which was normalized during score calculation, but not during plotting, for diagnostic purposes). Panel b: HRMS2 spectrum of methylisothiazolinone tentatively annotated by MetFrag to elucidate m/z 116.0164 [M + H]+. Experimental HRMS2 (MSMS, black line); explained MSMS (ExplMSMS, red dashed line); HRMS1 (MS1, green dashed line). The MetFrag Scores plot in Figure a shows individual and combined scores assigned to 59 candidates retrieved from ChemSpider and tentatively annotated to elucidate the nontarget feature of interest, in this example, positive feature m/z 116.0165 (tR = 2.5 min). The top 3 ranking structures were methylisothiazolinone, thiazole-5-methanol and 2-methoxy thiazole, which displayed a rounded score of 5.4, 4.4, and 3.8, respectively (CombScore, black line). The structure of methylisothiazolinone could explain 7/16 peaks, whereas the second and third candidates could explain 4/16 peaks (NoExplPeaks, red dashed line). The higher number of matching fragments resulted in a higher in silico fragmentation score for methylisothiazolinone (FragScore, red line). The top 3 candidates did not differ substantially in terms of spectral similarity scores (MetFusionOffline, blue dotted line), which was low overall and indicated a lack of similar or matching HRMS2 spectra in the library. The reference scores of the top 3 candidates were comparable, although higher for the first candidate (AvgCSRefs, green dot-dash line). The top 2 candidates also had a suspect hit (Suspects, purple dot). Further processing involved generating a tentatively annotated HRMS2 spectrum from the output of MetFrag and GenFormR. MetFrag retrieved candidate structures from ChemSpider, generated fragments in silico, back-calculated their (de)protonated monoisotopic masses, and fitted them to the experimental HRMS2 data. GenFormR instead performed an algebraic calculation on spectral data to find the best formulas fitting the precursor and product ions. As these approaches are complementary, GenFormR was used to gain additional information to MetFrag to enhance the interpretation of the spectra. An example of tentatively annotated spectrum of the highest ranking candidate to elucidate the structure of positive feature m/z 116.0164 is shown in Figure b. The structure of methylisothiazolinone was eventually confirmed with a reference standard. Applying the NTS workflow resulted in the tentative annotation of 72 nontarget features (all Level 3 at this point), 45 from positive clusters and 27 from negative clusters. The full list of tentatively annotated features, including final identification confidence levels, is given in SI section S-7. An overview of the data reduction at the different steps of the NTS workflow is shown in Figure .
Figure 4

Data reduction charts for positive ESI (left) and negative ESI (right) data.

Data reduction charts for positive ESI (left) and negative ESI (right) data. Reference standards of 42 compounds were obtained based on availability and price. All chemicals were highest ranking candidates of their respective nontarget features, with the exception of atrazine-desethyl-2-hydroxy and metamitron-desamino, which were the second-highest ranking candidates, selected based on expert knowledge and following inspection of the MetFrag scores. Atrazine-desethyl-2-hydroxy was chosen for its higher in silico fragmentation score and spectral similarity score compared to the top candidate. The identification of metamitron-desamino represented an interesting case to demonstrate the importance of analyst judgment. This compound was the second-top ranking structure to explain positive ESI feature m/z 188.0817 at tR 6.55 min with a MetFrag Combined Score of 3.25 and 11/17 explained HRMS2 peaks. The highest ranking structure for this feature was the phosphodiesterase 3 inhibitor amrinone, with a score of 5.74 and 10/17 annotated HRMS2 peaks. The most pronounced differences in the MetFrag scores of these two structures were found in the number of references. Since one of the scoring criteria was the number of PubMed references, it was not surprising that the pharmaceutical amrinone had a higher score than a pesticide TP. However, since nearly all tentatively annotated chemicals originated from either industrial or agricultural activities, metamitron-desamino was thought to be the more likely structure. Reference standards were obtained for both compounds, leading to confirmation of the pesticide TP. It is noteworthy that metamitron-desamino would have been missed without amrinone being the first candidate, as its MetFrag score was below the cutoff value used for prioritization of the MetFrag results of the positive ESI data. The candidate structures of 25 out of 42 nontarget features were confirmed, resulting in a success rate of 60% against the confirmation subset. An extensive evaluation of the performance of MetFrag can be found in the literature.[36,37] Details of the successfully confirmed compounds are shown in Table . Of the 17 compounds that were not confirmed, 4 could not be detected by the UHPLC-ESI-q-TOF/HRMS system and 13 eluted at a different tR.
Table 1

List of Nontarget Contaminants Confirmed with Reference Standards, Their Formula, Neutral Monoisotopic Mass, Detected Adduct, Retention Time (tR) in Samples and Standards, and ChemSpider ID

CompoundFormulaNeutral monoisotopic massDetected adducttR sampletR standardChemSpider ID
MethylisothiazolinoneC4H5NOS115.009[M + H]+2.52.536393
1,3-BenzothiazoleC7H5NS135.014[M + H]+7.57.56952
TetramethylsulfamideaC4H12N2O2S152.062[M + H]+4.84.8121689
Atrazine-desethyl-2-hydroxyaC6H11N5O169.096[M + H]+3.13.096906
4-ToluenesulfonamideaC7H9NO2S171.035[M + H]+5.65.66033
Simazine-2-hydroxyC7H13N5O183.112[M + H]+4.64.516505
Metamitron-desaminoaC10H9N3O187.075[M + H]+6.06.0157884
BenzoguanamineC9H9N5187.086[M + H]+5.04.96797
2,6-dichlorobenzamideaC7H5Cl2NO188.975[M + H]+5.05.015359
CarbendazimaC9H9N3O2191.069[M + H]+4.84.723741
ChlortoluronC10H13ClN2O212.072[M + H]+9.19.125472
Diuron-desmethylC8H8Cl2N2O218.001[M + H]+9.89.818040
Diphenylphosphinic acidaC12H11O2P218.050[M + H]+ and [M − H]7.77.714810
5-Amino-2-chlorotoluene-4-sulfonic acidaC7H8ClNO3S220.991[M + H]+ and [M − H]5.45.46670
ChloridazonC10H8ClN3O221.036[M + H]+6.06.014790
Naphthionic acidaC10H9NO3S223.030[M + H]+ and [M − H]2.02.06532
LamotrigineC9H7Cl2N5255.008[M + H]+6.76.63741
Tributyl phosphateaC12H27O4P266.165[M + H]+ and [M − H]14.915.029090
AcesulfameC4H5NO4S162.994[M − H]2.52.533607
O,O-diethyl thiophosphateaC4H11O3PS170.017[M − H]3.63.6635
p-Toluidine-m-sulfonic acidaC7H9NO3S187.030[M − H]3.33.360405
Dibutyl phosphateaC8H19O4P210.102[M − H]8.88.87593
Camphorsulfonic acidaC10H16O4S232.077[M − H]5.05.017438
4-Amino-2,5-dichlorobenzenesulfonic acidaC6H5Cl2NO3S240.937[M − H]2.62.659986
4-Dodecylbenzenesulfonic acidC18H30O3S326.192[M − H]17.317.38172

Identified in cluster with intensity maxima in the 1990s.

Identified in cluster with intensity maxima in the 1990s.

Interpretation of Clustered Time Trends

Several prioritized clusters displayed trends with intensity maxima in the 1990s (wells LT-P09 and LT-P11); there were seven in the positive ESI data and six in the ESI negative data. The presence of these clusters could be rationalized by the history of anthropogenic emissions and restoration measures in the Rhine basin from the 1950s onward. River protection programs coordinated by the International Commission for the Protection of the Rhine (ICPR) resulted in domestic and industrial WWTPs being built between 1970 and 1990 in the Rhine riparian countries.[53] A turning point in international river basin management was the 1986 Sandoz accident when 15,000 m3 of water mixed with organophosphorus and organochlorine compounds including pesticides, dyes, solvents, and intermediates accidentally entered the upper Rhine in Switzerland, resulting in widespread mortality of fish, macroinvertebrate, and plankton communities in the riparian countries downstream.[53−55] Following the Sandoz accident, the ICPR established the Rhine Action Programme (RAP) to coordinate and implement measures to lower the discharge of hazardous substances. Their goals were achieved in the mid-1990s when substantial reduction of organic emissions, improved oxygen content, and increased biodiversity were reported.[53] Given the high number of (tentatively) identified substances, the discussion of the results is limited to the most significant findings from the confirmed identities, whose profiles are highlighted in Figure . The cluster shown in Figure a indicated a net increase of emissions from the 1950s to the 1990s, followed by a moderate but constant decrease in the 2000s. In this cluster, a negligible number of profiles displayed intensity maxima in the early 2000s; however, these are not discussed further, as no candidate structures could be associated with these trends. Based on the known sources and environmental fate of 2,6-dichlorbenzamide[56] (Figure a–green line), nonpoint source emissions from agricultural applications of biocides or sewage farming could explain some of these profiles.
Figure 5

Examples of prioritized clusters with identified nontarget compounds. On the x-axis the well codes are shown. The bank filtrate travel time was as follows: ∼1 year (LS-P12), 6–11 years (LT-P01), 8–11 years (LT-P03), 8–13 years (LT-P05), 10–16 years (LT-P07), 15–20 years (LT-P09), 19–25 (LT-P11), ∼40 years (LT-P14), 50–60 years (LT-P18). Panel a: Positive ESI cluster with maxima in the 1990s accounting for 231 profiles. The profiles of tetramethylsulfamide (blue), 4-toluenesulfonamide (red), 2,6-dichlorobenzamide (green), and tributyl phosphate (gray) are shown in color. Panel b: Negative ESI cluster with maxima in the 1990s accounting for 286 profiles. The profiles of 4-amino-2,5-dichlorobenzenesulfonic acid (green), camphorsulfonic acid (blue), O,O-diethyl thiophosphate (red), and 5-amino-2-chlorotoluene-4-sulfonic acid (gray) are shown in color. Panel c: Positive ESI cluster with gradual increase from the 1970s and displaying stable intensities throughout the 1990s and early 2000s accounting for 238 profiles. The profiles of metamitron-desamino (yellow), diphenylphosphinic acid (green), and atrazine-desethyl-2-hydroxy (blue) are shown in color. Panel d: Positive ESI cluster with maxima in bank filtrate with 1-year travel time (late 2015) accounting for 834 profiles. All trends significantly overlapped, and no plots were manipulated for display purposes. The profiles of lamotrigine (blue) and simazine-2-hydroxy (red) are shown in color.

Examples of prioritized clusters with identified nontarget compounds. On the x-axis the well codes are shown. The bank filtrate travel time was as follows: ∼1 year (LS-P12), 6–11 years (LT-P01), 8–11 years (LT-P03), 8–13 years (LT-P05), 10–16 years (LT-P07), 15–20 years (LT-P09), 19–25 (LT-P11), ∼40 years (LT-P14), 50–60 years (LT-P18). Panel a: Positive ESI cluster with maxima in the 1990s accounting for 231 profiles. The profiles of tetramethylsulfamide (blue), 4-toluenesulfonamide (red), 2,6-dichlorobenzamide (green), and tributyl phosphate (gray) are shown in color. Panel b: Negative ESI cluster with maxima in the 1990s accounting for 286 profiles. The profiles of 4-amino-2,5-dichlorobenzenesulfonic acid (green), camphorsulfonic acid (blue), O,O-diethyl thiophosphate (red), and 5-amino-2-chlorotoluene-4-sulfonic acid (gray) are shown in color. Panel c: Positive ESI cluster with gradual increase from the 1970s and displaying stable intensities throughout the 1990s and early 2000s accounting for 238 profiles. The profiles of metamitron-desamino (yellow), diphenylphosphinic acid (green), and atrazine-desethyl-2-hydroxy (blue) are shown in color. Panel d: Positive ESI cluster with maxima in bank filtrate with 1-year travel time (late 2015) accounting for 834 profiles. All trends significantly overlapped, and no plots were manipulated for display purposes. The profiles of lamotrigine (blue) and simazine-2-hydroxy (red) are shown in color. Confirming the environmental occurrence of tetramethylsulfamide (TMS) was a novel discovery of the present study (Figure a–blue line), highlighting the environmental significance of NTS. No information about production or use of TMS was found on the European Chemical Agency (ECHA) or the US EPA Web sites. TMS was not present in the NORMAN Suspect Lists Exchange or in accurate mass spectral libraries. A search on PubChem returned 50 patents detailing formulations for the synthesis of dyes, flame retardants, and pesticides. TMS can be used for the synthesis of sulfur trioxide-dimethylamine complex, a sulfating reagent for dyestuffs.[57] TMS can also be a byproduct of the synthesis of dimethylsulfamyl chloride, a chemical used in a variety of industrial applications,[58] or a byproduct of the synthesis of sulfur-containing aziridines chemosterilants.[59] The exact source of emissions of TMS, although likely industrial, remains so far unknown. The identity of 4-toluenesulfonamide (4-TSA) in a cluster with maxima in the 1990s was confirmed with a reference standard (Figure a–red line). 4-TSA is a plasticizer, an intermediate for the synthesis of pesticides and disinfection byproduct of the antimicrobial agent N-sodium-N-chloro-p-toluenesulfonamide. 4-TSA was detected in the Berlin area in surface water, groundwater, and bank filtrate at concentrations up to 0.9 μg/L, 14.1 μg/L, and 0.24 ng/L, respectively.[60] The authors of that study pointed to untreated WWTP effluents and former sewage farms as sources of sulfonamides and concluded that 4-TSA can help identify bank filtrate originating from polluted surface water,[60] supporting the results of the present work. A more dynamic intensity trend showing increasing emission from the 1950s to the 1990s followed by a substantial decrease in the 2000s is shown in Figure b. In this cluster, camphorsulfonic acid (CSA) was tentatively identified and later confirmed, along with the metabolite O,O-diethyl thiophosphate (DETP), the industrial chemicals 5-amino-2-chlorotoluene-4-sulfonic acid (ACTSA), and 4-amino-2,5-dichlorobenzenesulfonic acid (ADCBSA). To the best of our knowledge, this is the first time CSA has been confirmed in bank filtrate and its environmental persistence demonstrated (Figure b–blue line), highlighting the environmental significance of NTS. CSA was not included in the suspect list, and only one reference was found reporting its occurrence in a WWTP effluent from a rubber manufacturing site in Spain.[61] CSA is used as a dopant in the synthesis of polyaniline, a conductive polymer.[62,63] Camphor derivatives, such as CSA, are used as UV filters in cosmetic products and eventually reach surface waters via insufficiently treated domestic WWTP effluents.[64] The UV filter terephthalydene dicamphorsulfonic acid (TPDCSA) is unstable under photolysis in aqueous media and uncharacterized degradation product(s) with UV absorbance <290 nm were reported.[65] CSA has a UV/vis absorbance of 285 nm,[66] suggesting that it may originate from TPDCSA in the environment, rather than exclusively from industrial sources. ACTSA (Figure b–gray line) is an important building block for the synthesis of dyes reported to persist in chemical and biological wastewater treatment.[67] Its 2-amino isomer can be obtained from the cleavage of the azo dye Pigment Red 53 and was included in a priority list of 23 unregulated aromatic amines of toxicological concern.[68] Limited literature references were found for this compound, and to the best of our knowledge, this is the first time it has been identified in a riverbank filtrate. The REACH registration dossier of ACTSA was submitted by a dye manufacturer with a production site at the river Main, the longest tributary of the Rhine, approximately 500 km upstream of the RBF site. The identity of another dyestuff synthesis intermediate, ADCBSA, was confirmed in this cluster (Figure b–green line). This chemical occurs in liquid waste from manufacturing processes; however, liquid waste containing ADCBSA is usually treated separately before being sent to WWTPs, because this compound is toxic to microorganisms and inhibits biological treatment.[69] This is the first time ADCBSA is identified in a riverbank filtrate and its persistence reported. Its decreasing intensities in the 2000s might be explained by upgrades of industrial WWTPs with ion exchange resins and/or by decreased production volumes. DETP is a product of mammal metabolism and biological wastewater treatment, as well as an environmental TP of insecticides, flame retardants, plasticizers, and industrial chemicals.[70,71] Even if nonpoint sources cannot be excluded, the intensity profile of DETP (Figure b–red line) suggests industrial or domestic effluents as possible sources. The decrease in the young bank filtrate might be explained by the upgrade of WWTPs and the implementation of more effective regulation in the 2000s in the riparian Rhine countries. The hypothesis about the point sources of DETP was supported by a recent wastewater-based epidemiological study on the human exposure to pesticides, which reported a detection frequency of 7% in domestic WWTP effluents across Europe in concentrations in the low ng/L range.[72] We demonstrate that DETP can enter RBF systems, where it can persist and migrate for at least two decades in the dark anaerobic conditions. Contaminants that entered the RBF in the mid-1970s and displayed stable intensities along the transect were found among the prioritized clusters. In the cluster shown in Figure c, the metabolites metamitron-desamino, atrazine-desethyl-2-hydroxy, and diphenylphosphinic acid (DPPS) were confirmed with their respective reference standard. Metamitron-desamino is the main biodegradation product of the herbicide metamitron, a mobile chemical known to reach surface waters via polluted runoffs or WWTP effluents and with high potential to leach into groundwater. Concentrations of metamitron-desamino up to 680 ng/L have been reported from rivers impacted by urban and agricultural activities.[73] Recent research showed that this metabolite can originate from biodegradation in water–sediment systems.[74] The profile of metamitron-desamino (Figure c–yellow line) matched the sales data of its parent compound metamitron, which was introduced in the European Union in 1975 and has displayed a stable sales trend from the mid-1990s onward.[75] The lower intensities in the first well might reflect the recent introduction of herbicide formulations with lower concentrations of metamitron combined with other active substances.[76] DPPS (Figure c–green line) is a degradation product of the pharmaceutical precursor triphenylphosphine oxide (TPPO). TPPO was quantified at concentrations below 300 ng/L in bank filtrate with up to 4-year travel time from the same area investigated in our study.[77] Literature indicated that DPPS was fully degraded within 30 days in a fixed-bed bioreactor filled with aerobic Rhine water and that a major source of TPPO is located approximately 400 km upstream of the RBF system investigated in the present manuscript.[78] DPPS is a highly hydrophilic anionic compound (logDpH7.4 = −1.69; pKa = 2.3), so it is not retarded by bank filtration. For the first time, we found that DPPS can be persistent and mobile in the dark anaerobic environment. Profiles displaying intensity maxima in well LS-P12 (1-year-old water) followed by a sudden decrease in the rest of the transect were assigned to the second and fourth most populated clusters in positive and negative ESI data, respectively (Figure d). It could be assumed that such clusters would include profiles of chemicals possibly infiltrated only recently, infiltrated and diluted to below detection level in the older bank filtrate, or formed at the riverbank within 1-year travel time and either degraded further or diluted to undetectable concentrations. In this cluster, lamotrigine, simazine-2-hydroxy, diuron-desmethyl, and 1,3-benzothiazole were confirmed with reference standards. The anticonvulsant lamotrigine (Figure d–blue line) is known to be persistent to biological wastewater treatment and has been previously detected in surface water[79] and in bank filtrate with short travel time.[13] Detection limited to the first well was not expected, because lamotrigine was first marketed in the EU already in 1993. Literature data on degradation of lamotrigine in anaerobic conditions was not found. Although reductive dehalogenation of aryl halide groups might occur in such conditions,[80] dechlorinated TPs of lamotrigine were screened for and not detected in the experimental HRMS1 data. A recent study on the fate of pharmaceuticals in soils irrigated with reclaimed wastewater found that lamotrigine (logDpH7.4 = 1.68) displayed the highest sorption affinity to soil compared to carbamazepine (logDpH7.4 = 2.28) and its metabolites.[81] The 6- to 11-year travel time between wells LS-P12 and LT-P01 might have maximized adsorptive interactions and restrained lamotrigine mobility in the subsurface. Ultimately, the contribution of dilution to undetectable levels in the older bank filtrate could not be determined. It is noteworthy how simazine-2-hydroxy was not detected after at least 11-year travel time (Figure d–red line), whereas atrazine-desethyl-2-hydroxy showed persistence across the transect (Figure c–blue line). Previous studies found that simazine occurred in surface water and groundwater at concentrations up to 10 times lower than atrazine.[82] It is unclear whether simazine-2-hydroxy was removed during RBF, transformed further or diluted to undetectable concentrations in the older bank filtrate. Both triazine herbicides that lead to formation of these TPs were banned in the European Union in 2004. The detection of atrazine-desethyl-2-hydroxy at constant levels might be attributed to the release of atrazine (or TPs) from contaminated river sediments prior to infiltration in the RBF system.[83] Other known metabolites of atrazine and simazine were screened for in HRMS1 data, but were not detected. These TPs might have either been absent from the RBF transect or occurred at undetectable concentrations. It cannot be excluded that atrazine-desethyl-2-hydroxy prevailed over other more commonly detected dealkylated metabolites,[84] as the aquifer screened in this study differs from others for being both confined and anaerobic. For example, in a recent screening of three aerobic bank filtration sites, atrazine-2-hydroxy and atrazine-desethyl were both found in low ng/L concentration, whereas atrazine-desethyl-2-hydroxy and atrazine-desisopropyl were not detected.[19]

Environmental Implications

This study contributes to the mounting evidence of environmental persistence of hydrophilic organic compounds and shows that polar substances can be highly mobile in RBF systems with long travel time at stable biogeochemical conditions. More research should be done on this or comparable RBF transects to investigate the fate of the most polar MPs, e.g., with even lower logD values than those identified in this study, which might have been lost during enrichment or insufficiently retained by reversed-phase chromatography. We showed that state-of-the-art NTS relying on open computational tools and performed in a semiautomated manner can be an extremely powerful method to explore water contaminants by HRMS. Spectra of the substances identified in this study will be uploaded to openly accessible accurate mass spectral libraries to contribute to future screenings. The list of confirmed contaminants has been shared with local drinking water utilities to assess their removal in drinking water treatment. The trend analysis presented here can be useful to manage bank filtration systems with long travel times in catchment areas impacted by anthropogenic activities. In these cases, to avoid contamination with many legacy pollutants from the 1990s, which overall displayed higher normalized intensities in the older water and thus likely occurred at higher concentrations, the use of young groundwater to produce potable water is recommended, whereas advanced treatment should be applied to the old groundwater. In the case of drinking water treatment at the Tiendweg well field, reverse osmosis is applied to maximize micropollutant removal.
  58 in total

1.  Polyaniline nanofibers: facile synthesis and chemical sensors.

Authors:  Jiaxing Huang; Shabnam Virji; Bruce H Weiller; Richard B Kaner
Journal:  J Am Chem Soc       Date:  2003-01-15       Impact factor: 15.419

2.  Quantification of more than 150 micropollutants including transformation products in aqueous samples by liquid chromatography-tandem mass spectrometry using scheduled multiple reaction monitoring.

Authors:  Nina Hermes; Kevin S Jewell; Arne Wick; Thomas A Ternes
Journal:  J Chromatogr A       Date:  2017-11-13       Impact factor: 4.759

3.  Non-target screening to trace ozonation transformation products in a wastewater treatment train including different post-treatments.

Authors:  Jennifer E Schollée; Marc Bourgin; Urs von Gunten; Christa S McArdell; Juliane Hollender
Journal:  Water Res       Date:  2018-05-25       Impact factor: 11.236

4.  Occurrence of Atrazine and Related Compounds in Sediments of Upper Great Lakes.

Authors:  Jiehong Guo; Zhuona Li; Prabha Ranasinghe; Solidea Bonina; Soheil Hosseini; Margaret B Corcoran; Colin Smalley; Rajashankar Kaliappan; Yan Wu; Da Chen; Andy L Sandy; Yawei Wang; Karl J Rockne; Neil C Sturchio; John P Giesy; An Li
Journal:  Environ Sci Technol       Date:  2016-07-01       Impact factor: 9.028

5.  Perfluorinated compounds in infiltrated river rhine water and infiltrated rainwater in coastal dunes.

Authors:  Christian Eschauzier; Joris Haftka; Pieter J Stuyfzand; Pim de Voogt
Journal:  Environ Sci Technol       Date:  2010-10-01       Impact factor: 9.028

Review 6.  Multiresidue methods using solid-phase extraction techniques for monitoring priority pesticides, including triazines and degradation products, in ground and surface waters.

Authors:  H Sabik; R Jeannot; B Rondeau
Journal:  J Chromatogr A       Date:  2000-07-14       Impact factor: 4.759

7.  The effect of redox conditions and adaptation time on organic micropollutant removal during river bank filtration: A laboratory-scale column study.

Authors:  C Bertelkamp; A R D Verliefde; K Schoutteten; L Vanhaecke; J Vanden Bussche; N Singhal; J P van der Hoek
Journal:  Sci Total Environ       Date:  2015-12-03       Impact factor: 7.963

8.  Exploring the Behaviour of Emerging Contaminants in the Water Cycle using the Capabilities of High Resolution Mass Spectrometry.

Authors:  Juliane Hollender; Marc Bourgin; Kathrin B Fenner; Philipp Longrée; Christa S Mcardell; Christoph Moschet; Matthias Ruff; Emma L Schymanski; Heinz P Singer
Journal:  Chimia (Aarau)       Date:  2014-11       Impact factor: 1.509

9.  PubChem Substance and Compound databases.

Authors:  Sunghwan Kim; Paul A Thiessen; Evan E Bolton; Jie Chen; Gang Fu; Asta Gindulyte; Lianyi Han; Jane He; Siqian He; Benjamin A Shoemaker; Jiyao Wang; Bo Yu; Jian Zhang; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2015-09-22       Impact factor: 16.971

Review 10.  Software Tools and Approaches for Compound Identification of LC-MS/MS Data in Metabolomics.

Authors:  Ivana Blaženović; Tobias Kind; Jian Ji; Oliver Fiehn
Journal:  Metabolites       Date:  2018-05-10
View more
  7 in total

Review 1.  Analysis of mobile chemicals in the aquatic environment-current capabilities, limitations and future perspectives.

Authors:  Daniel Zahn; Isabelle J Neuwald; Thomas P Knepper
Journal:  Anal Bioanal Chem       Date:  2020-02-21       Impact factor: 4.142

2.  Organic contaminants in human breast milk identified by non-targeted analysis.

Authors:  Cuong D Tran; Nathan G Dodder; Penelope J E Quintana; Kayo Watanabe; Jae H Kim; Melbourne F Hovell; Christina D Chambers; Eunha Hoh
Journal:  Chemosphere       Date:  2019-08-26       Impact factor: 7.086

3.  Non-targeted tandem mass spectrometry enables the visualization of organic matter chemotype shifts in coastal seawater.

Authors:  Daniel Petras; Jeremiah J Minich; Lucia B Cancelada; Ralph R Torres; Emily Kunselman; Mingxun Wang; Margot E White; Eric E Allen; Kimberly A Prather; Lihini I Aluwihare; Pieter C Dorrestein
Journal:  Chemosphere       Date:  2021-01-02       Impact factor: 7.086

4.  patRoon: open source software platform for environmental mass spectrometry based non-target screening.

Authors:  Rick Helmus; Thomas L Ter Laak; Annemarie P van Wezel; Pim de Voogt; Emma L Schymanski
Journal:  J Cheminform       Date:  2021-01-06       Impact factor: 5.514

5.  From Centroided to Profile Mode: Machine Learning for Prediction of Peak Width in HRMS Data.

Authors:  Saer Samanipour; Phil Choi; Jake W O'Brien; Bob W J Pirok; Malcolm J Reid; Kevin V Thomas
Journal:  Anal Chem       Date:  2021-11-29       Impact factor: 6.986

6.  Evaluation of Sample Preparation Methods for Non-Target Screening of Organic Micropollutants in Urban Waters Using High-Resolution Mass Spectrometry.

Authors:  Nina Huynh; Emilie Caupos; Caroline Soares Peirera; Julien Le Roux; Adèle Bressy; Régis Moilleron
Journal:  Molecules       Date:  2021-11-23       Impact factor: 4.411

7.  Machine learning for identification of silylated derivatives from mass spectra.

Authors:  Milka Ljoncheva; Tomaž Stepišnik; Tina Kosjek; Sašo Džeroski
Journal:  J Cheminform       Date:  2022-09-15       Impact factor: 8.489

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.