Literature DB >> 34783167

Identifying organic chemicals not subject to bioaccumulation in air-breathing organisms using predicted partitioning and biotransformation properties.

Frank Wania¹, Ying Duan Lei¹, Sivani Baskaran¹, Alessandro Sangion¹.

Abstract

Because the respiration processes contributing to the elimination of organic chemicals deviate between air- and water-breathing organisms, existing and widely used procedures for identifying chemicals not subject to bioaccumulation in aquatic organisms based on the octanol-water partition ratio KOW need to be complemented with similar procedures for organisms respiring air. Here, we propose such a procedure that relies on the comparison of a compound's predicted KOW , octanol-air partition ratio KOA , and biotransformation half-life HLB with three threshold values, below which elimination is judged to be sufficiently rapid to prevent bioaccumulation. The method allows for the consideration of the effect of dissociation on the efficiency of urinary and respiratory elimination. Explicit application of different types of the prediction error, such as the 95% prediction interval or the standard error, allows for variable tolerance for false-negative decisions, that is, the potential to judge a chemical as not bioaccumulative even though it is. A test with a set of more than 1000 diverse organic chemicals confirms the applicability of the prediction methods for a wide range of compounds and the procedure's ability to categorize approximately four-fifth of compounds as being of no bioaccumulation concern, suggesting its usefulness to screen large numbers of commercial chemicals to identify those worthy of further scrutiny. The test also demonstrates that a screening based solely on KOW and KOA would be far less effective because the fraction of chemicals that can be judged as sufficiently volatile and/or sufficiently water soluble for rapid respiratory and urinary elimination based on the partitioning properties predicted for their neutral form is relatively small. Future improvements of the proposed procedure depend largely on the development of prediction methods for the biotransformation kinetics in air-breathing organisms and for the potential for renal reabsorption. Integr Environ Assess Manag 2022;18:1297-1312.

© 2021 The Authors. Integrated Environmental Assessment and Management published by Wiley Periodicals LLC on behalf of Society of Environmental Toxicology & Chemistry (SETAC). © 2021 The Authors. Integrated Environmental Assessment and Management published by Wiley Periodicals LLC on behalf of Society of Environmental Toxicology & Chemistry (SETAC).

Entities: Chemical

Keywords: Bioaccumulation; Biotransformation; Chemical assessment; Partitioning

Mesh：

Substances：

Year: 2021 PMID： 34783167 PMCID： PMC9541168 DOI： 10.1002/ieam.4555

Source DB: PubMed Journal: Integr Environ Assess Manag ISSN： 1551-3777 Impact factor: 3.084

INTRODUCTION

The process that causes a chemical's concentration in an organism to exceed the concentration in the medium that this organism respires or in its diet, or both, has been termed bioaccumulation (B) (Gobas et al., 2009). While bioaccumulation is not a prerequisite for toxicity, a bioaccumulating substance is more likely to achieve levels within an organism that result in adverse effects than a substance that does not bioaccumulate. Because adverse effects of chemicals on organisms and ecosystems are often difficult to establish, bioaccumulation itself is regarded as an important element of chemical risk assessment (Beek et al., 2000; Franke et al., 1994). In fact, it is generally regarded as one of the most troublesome characteristics of a compound and should be avoided in commercially produced and used substances. Consequently, it is important to screen such substances to identify those that are potentially bioaccumulative. Traditionally, chemical regulation sought to identify compounds that are bioaccumulative in aquatic organisms. The first step in that identification process is to confirm that the partitioning properties of a chemical allow for B. The relationship between chemical partitioning properties, such as the octanol–water partitioning ratio K OW and bioaccumulation in aquatic organisms, has been known for a long time (Chiou et al., 1977; Mackay, 1982) and is a result of lipophilic compounds not being readily eliminated through the gills of a fish. Bioaccumulation is therefore often judged by comparing a chemical's log K OW against a threshold value. A limitation of this approach is that there are compounds that are bioaccumulative in air‐breathing animals, but not in aquatic organisms (Czub & McLachlan, 2004; Kelly et al., 2003, 2007). Such chemicals can be eliminated efficiently by respiration of water, which is an elimination process not available to organisms respiring air. Therefore, a B assessment based on K OW alone is not protective of air‐breathing animals. Conversely, there may also be chemicals readily eliminated by respiration of air that accumulate in water‐breathing organisms. This means that separate B assessments for air‐ and water‐breathing organisms are necessary. One of the difficulties associated with decisions made based on a comparison of a chemical property with a threshold value is that every chemical property value has an uncertainty, irrespective of whether it is derived experimentally or through a prediction method. Zhang et al. (2010) have, for example, shown that such decisions can depend on which property prediction method is applied. Ideally, therefore, a method that relies on the comparison of chemical property values with thresholds should (i) quantify the uncertainty of the property value and (ii) estimate the likelihood of making a wrong decision because of that uncertainty. In summary, it is important to have a procedure allowing for the identification of chemicals that may bioaccumulate in air‐breathing organisms that complements existing methods for B assessment in fish. Here, we propose and describe such a method, whereby the method's development was guided by the following considerations: Taking its cue from the B assessments relying on a log K OW threshold, the method should ideally be simple and be based, to the extent possible, on chemical equilibrium partitioning properties. Because a method based on partitioning alone proved quite inefficient, we also show how elimination by biotransformation can be integrated into the assessment process. The assessment procedure should be efficient in that it requires little time and financial resources. For a procedure to be efficient, experiments to measure property values should only be necessary in exceptional cases. The method should therefore rely on predicted partitioning and biotransformation properties, so that it could be used both for (i) the relatively rapid screening of fairly large numbers of chemicals, and therefore, the focusing of more refined and detailed assessments on chemicals that are initially judged to be potentially bioaccumulative, and (ii) the quick assessment of individual chemicals, with the objective being to decide whether experimental determinations are required to appraise their potential for B. The method should explicitly take into account the uncertainty of predicted properties. Consideration should be given to the confidence with which these chemical properties are known and therefore the statistical likelihood that a value is indeed on one side of a threshold. We also tested the efficiency of the proposed procedure by applying it to a set of more than 1000 organic compounds.

METHODS

Outline of the assessment procedure

The purpose of the procedure is to identify chemicals that are clearly not bioaccumulative in air‐breathing organisms, and therefore, no further assessment of their potential for bioaccumulation should be necessary. Importantly, this is not synonymous with classifying chemicals as bioaccumulative. It is thus a procedure for de‐prioritization rather than prioritization. The basic premise of the proposed method is that a chemical is not bioaccumulative in air‐breathing organisms if the organism is capable of rapidly eliminating the compound. The log K OW threshold used in the assessment of B in aquatic organisms could be seen as being based on the same premise, as chemicals with low log K OW are more readily eliminated by respiration across the gills of a fish. There are primarily three pathways for air‐breathing animals to eliminate a chemical: urinary excretion, respiratory exhalation, and enzymatic biotransformation. While it is also possible for compounds to be excreted with feces (Rozman, 1985), this process is too slow to prevent the bioaccumulation of hydrophobic substances. This is because the efficient assimilation of lipids during digestion assures that dietary intake rates always markedly exceed fecal elimination rates for bioaccumulative substances. While there are also substances that do not bioaccumulate by virtue of inefficient uptake rather than efficient elimination, we do not account for this possibility here because of the very large uncertainty of both the relevant threshold and the high log K OW values of such substances. Furthermore, screening approaches for bioaccumulation in aquatic organisms generally neither include an upper log K OW threshold indicative of inefficient dietary uptake. A chemical's propensity for urinary excretion and respiratory exhalation can be judged based on its equilibrium partitioning properties. In particular, ECHA (2017a, 2017b) recommends that chemicals with a log K OW > 2 and a logarithmic octanol–air equilibrium partitioning ratio log K OA > 5 are considered potentially bioaccumulative in air‐breathing organisms. These thresholds are based on model calculations for terrestrial food webs that have suggested that any chemical with a log K OW below 2 can be readily eliminated by urination, and any chemical with a log K OA below 5 can be readily eliminated by exhalation (Kelly & Gobas, 2003; Kelly et al., 2003). Bioaccumulation in humans has been shown to be strongly determined by a compound's susceptibility to enzymatic biotransformation (McLachlan et al., 2011) and this is likely also true for other air‐breathing organisms. The potential for rapid elimination by biotransformation cannot be assessed with partitioning properties, but requires a parameter describing the kinetics of biotransformation in the organism. While this is generally not done in a tier 1 assessment of B in aquatic organisms, there has been considerable progress in predicting biotransformation kinetics of organic chemicals (Arnot et al., 2014; Brown et al., 2012; Papa et al., 2018). Furthermore, models can be used to estimate biotransformation half‐life HL thresholds that assure rapid elimination, whereby the numerical thresholds depend on the size of the organism (Goss et al., 2013). The proposed procedure therefore relies on the comparison of the compounds' log K OW, log K OA, and HL with three thresholds. If the properties for a chemical are above all three of them, it is judged to be potentially bioaccumulative and therefore in need of further, more detailed assessment. If one of the chemical properties can, with confidence, be said to fall below any one threshold, it is deemed not to be susceptible to bioaccumulation in air‐breathing organisms. While it is conceivable that any one elimination process is insufficient to prevent bioaccumulation of a chemical, whereas the combination of two or more of these processes would be sufficient, we ignore this possibility. This is defensible as it only increases the chance of false‐positive decisions (i.e., chemicals being recommended for higher‐tiered assessments even though they are not bioaccumulative), but not false negatives (i.e., chemicals are judged not B, even though they are). One complication arises for compounds that dissociate, that is, are not completely neutral at the physiological pH prevailing in the body of the air‐breathing organism. The partitioning of a dissociating compound between octanol and water is described with the distribution ratio, D, which is the ratio of the sum of the concentrations of the charged and neutral forms of a compound in octanol and water at equilibrium. D depends on the pH of the aqueous phase. A chemical that is predominantly present in the charged form is more likely to be eliminated by urinary excretion, but less likely eliminated by respiration than its neutral conjugate. Our approach therefore includes two steps that consider the potential for dissociation. If dissociation is ignored, it might result in some substances being assessed as requiring further assessment of their bioaccumulative potential, even though they could be readily eliminated via urination in their charged forms. For chemicals whose neutral form is judged to have a log K OW above the threshold for efficient urinary excretion, we therefore also compare the log D at the physiological pH of 7.4 with this threshold. Even more troublesome is that ignoring dissociation might lead to some substances being deemed not B by having a predicted log K OA below the threshold, even though in their charged form they are not readily exhaled. Therefore, a chemical with a log K OA for the neutral species judged to be below the threshold is not assessed to be readily lost by exhalation if it is predicted to be entirely present in charged form at a pH of 7.4, that is, if the neutral fraction in blood is estimated to be less than 1%.

Detailed description of procedure

Comparing predicted values against threshold values

The core elements of the approach are comparisons of a predicted property of chemical i (X ) with a threshold (T) to establish whether the property is above or below T. Because it is preferred to have false positives than to have false negatives, all the comparisons take the form: where PE is the prediction error of X . The choice of the parameter to use as PE in these comparisons depends on the tolerance for different types of errors in an assessment. A very stringent approach would use the 95% prediction interval (PI95 ) for PE : Using such an approach, one minimizes the potential of a false‐negative assessment, that is, reduces the chance of “missing” a potentially bioaccumulative substance. There is a 95% likelihood that the true value of a property is between X minus the lower PI95 and X plus the upper PI95 . There is thus a 2.5% likelihood that the true value is higher than X plus the upper PI95 . For every comparison of type (1′), there is thus at most a 1 in 40 chance of categorizing a chemical wrongly as not B, and only if X + upper PI95 equals T (Figure 1). In most cases, when this categorization of not B is made, X + upper PI95 is smaller (often very much smaller) than T and the likelihood of misclassification is in fact much smaller than 1 in 40. This approach therefore is very cautious.

Figure 1

Illustration of the use of different quantities to estimate the error of a predicted property X and of the maximum likelihood of making a false‐negative decision when comparing X with a threshold T

Illustration of the use of different quantities to estimate the error of a predicted property X and of the maximum likelihood of making a false‐negative decision when comparing X with a threshold T If one were willing to tolerate a higher number of false negatives, for example, if the threshold value T itself is already cautiously selected, one could use the standard error of the prediction (SE ) as the PE : This choice leads to fewer false‐positive assessments, that is, reduces the fraction of chemicals requiring a higher‐tier assessment. However, for every comparison of type (1″), there is at most a 1 in 6 chance of categorizing a chemical wrongly as not B, and only if X + SE equals T. It is of course also possible, and presumably quite common, to ignore the uncertainty of a prediction, that is, use the comparison: For every comparison of type (1‴), there is at most a 1 in 2 chance of categorizing a chemical wrongly as not B, and only if X equals T. Specifically, the proposed approach relies on the following four comparisons: where , , and are suitably selected thresholds for log K OW, log K OA, and biotransformation half‐life, respectively. Whereas the upper prediction error is multiplied by the property value in Equation (5), in Equations (2)–(4) it is added to the property value. This is because the error is always assumed to be log‐normally distributed. In principle, the sequence of these comparisons should not matter. However, if any one of the comparisons leads to a decision of not B for a chemical, the subsequent comparisons become obsolete for that chemical. It is therefore desirable to start with comparisons involving properties that can be predicted with techniques that (i) yield predictions for a wide range of diverse compounds and (ii) are accessible easily and for free. Therefore, we start with the comparisons involving K OW and K OA. Note that the approach can still be applied if a prediction technique for log D or HL is not accessible or if it does not yield a result for a compound. In that case, any comparison involving log D and HL can be omitted from the procedure, which is equivalent to assuming that the chemical exceeds the threshold for that property. It simply means that the likelihood of a judgment of “potential B,” that is, the number of false positives, increases. To perform comparisons (2)–(5), we therefore need numerical values for the three thresholds , , and as well as, for each assessed chemical, values of log K OW, log K OA, log D, and HL , and, importantly, an estimate of the error for those predicted values. The following sections describe how we propose these values to be obtained.

The thresholds for log K OA, log K OW, and the biotransformation half‐lives HL

Selecting numerical threshold values

For the thresholds and , we used the values of 2 and 5, which we believe have their origin in the food chain bioaccumulation calculations by Kelly and Gobas (2003) and Kelly et al. (2003). This choice is based on the fact that these values have been adopted by ECHA (2017) and thus have regulatory relevance. These should, however, not be considered necessarily as set in stone. Armitage and Gobas (2007) proposed a of 1.75 and a of 5.25 based on simulations of the soil−earthworm−shrew food chain. Simulations for food webs leading up to large, warm‐blooded carnivores by Czub and McLachlan (2004) and Kelly et al. (2007) yielded a higher log K OA threshold of 6. We further note that Czub and McLachlan (2004) refer to their choice of thresholds as “reasonable, but not especially conservative.” Selecting the threshold is trickier, as no regulatory recommendation currently exists and because elimination half‐lives for different organisms (especially of different sizes and physiologies) are not directly comparable. In other words, the selected HL threshold has to be appropriate for a particular organism. Because the prediction methods for HL s that we propose to use are for humans, the threshold should be appropriate for humans. Goss et al. (2013) have suggested that a total elimination half‐life of 70 days, equivalent to an elimination rate constant of 0.01 day−1, corresponds to a lipid‐normalized biomagnification factor (BMF) of 1 under a certain set of assumptions, such as the daily food uptake rate equaling 1% of body mass, a dietary uptake efficiency of 95%, and a dietary lipid content that is equal to the lipid content of the organisms. However, while these assumptions seem reasonably appropriate for humans, equally plausible values for the food intake rate or the lipid content of the food would yield different numerical values for the . Of course, different air‐breathing organisms can also differ in their dietary characteristics. It is furthermore possible that the biotransformation rates in animals deviate from those in humans (Watkins & Klaasen, 1986; Williams, 1974). Nevertheless, we assume here that if an air‐breathing animal with the average body size of a human can biotransform a chemical with an HL of 70 days, it is unlikely to be bioaccumulative. There is some conservatism built in this assumption because biotransformation half‐lives are compared with a threshold for the total elimination half‐life, that is, the contribution of other elimination processes is neglected. There is some hesitation to perform an assessment of B on elimination half‐lives. The primary concerns are that the half‐life that produces no biomagnification is highly dependent on the species and a number of assumptions, and that chemicals that are not efficiently eliminated do not necessarily bioaccumulate (e.g., if they are not taken up efficiently). However, the numerical value of BMF or the thresholds for log K OA and log K OW are similarly dependent on a number of assumptions, such as the lipid content of an organism or the temperature of exhaled air. Furthermore, here, we propose the use a threshold that is specific for a particular type of organism and we apply this threshold in such a way that if HL < , then the chemical is not B, which does not imply that a chemical with an HL > is necessarily B. After all, a chemical with a log K OA > and/or a log K OW or logD > is not necessarily bioaccumulative either.

Consistency of threshold values

It is interesting to explore whether the three thresholds for respiratory, urinary, and biotransformation loss that we adopt here are internally consistent. It is possible to estimate the respiratory elimination half‐life of chemicals from the human body by assuming that exhaled air is in chemical equilibrium with the human body. where G respiration is the body mass‐normalized respiration rate (L air kg−1 body mass day−1), f lipid is the lipid‐equivalent fraction of the body (L lipid kg−1 of body mass), and K OA applies to the temperature of exhaled air. Similarly, we can estimate the urinary elimination half‐life of chemicals assuming that urine is in chemical equilibrium with the human body. where G urination is the body mass‐normalized urination rate (L urine kg−1 body mass day−1). Using Equations (6) and (7) with values for f lipid of 0.22 L kg−1, G urination of 0.017 L kg−1 day−1, and G respiration of 250 L kg−1 day−1 (Goss et al., 2013), we estimate that an elimination half‐life in humans of 70 days corresponds to a log K OA at body temperature of 5.06 and a log K OW of 0.90. Using a slightly different approach, Goss et al. (2013) obtained similar values of log K OW 1.3 and log K OA 5.5. If we assume a body temperature of 37 °C and an internal energy of phase transfer from octanol to the gas‐phase ΔU OA of 50 kJ/mol (Goss & Schwarzenbach, 1999), this corresponds to a log K OA at 25 °C of 5.40. In other words, the log K OA threshold is broadly consistent with the chosen HL threshold, while the log K OW threshold seems to be too high. A chemical with a log K OW of 2 is estimated to have a whole‐body elimination half‐life by urination in humans of more than two years. A closer look at the boundaries in the chemical partitioning space plots displaying modeling results on biomagnification (Czub & McLachlan, 2004; Kelly et al., 2007) actually suggests that thresholds of a log K OA of 6 and a log K OW of 2 should be indicative of similar food chain biomagnification behavior. While these thresholds were derived from food web simulations and Equations (6) and (7) apply to a single human, both approaches agree that a of 2 and a of 5 are inconsistent. It may thus be worthwhile to revisit the regulatory thresholds for B in air‐breathing organisms (ECHA, 2017). Incidentally, Equations (6) and (7) illustrate that fixed log K OW and log K OA thresholds result in different elimination half‐lives in different air‐breathing organisms because organisms differ in terms of f lipid, G respiration, and G urination, as well as in body temperature (poikilotherms vs. homeotherms). In other words, when applied to different organisms, not only will require adjustment to achieve threshold consistency but quite possibly also and . The reasons are differences in allometric scaling with respect to the lipid content and the rates of feeding, respiration and urination. In summary, while we apply here numerical values of 2, 5, and 70 days for , , and , we stress that these values should be considered tentative because a of 2 appears not to be consistent with a of 5, and because all thresholds, and especially , are dependent on a number of assumptions. The proposed procedure can obviously be applied to different thresholds, should a set of consistent ones be agreed upon in the future.

Predicting log K OW and log K OA

There are several techniques for predicting the log K OA of a chemical (Baskaran et al., 2021a) and many more for the prediction of log K OW (e.g., Cappelli et al., 2015). The criteria that we applied for selecting the prediction technique for these two parameters included the following: (i) It should have a good predictive performance, particularly within the range close to the thresholds of interest. (ii) It should be possible to make these predictions easily, without having to access commercial software. (iii) It should be possible to derive a confidence interval on the predictions. The work by Endo and Goss (2014) and co‐workers (Stenzel et al., 2013) has demonstrated the superior performance of poly‐parameter linear free‐energy relationships (ppLFER) for the prediction of the solvation properties of a large diversity of neutral organic compounds. For example, Stenzel et al. (2013) reported a root mean square error (RMSE) of 0.39 log units from the comparison of measured and predicted K OW for 111 substances (86 pesticides and pesticide transformation products, 10 PCBs, six heterocyclic aromatic compounds containing either nitrogen (N) or oxygen (O), and seven nitroaromatic compounds, one hormone and one antibacterial). In the specific case of the log K OA, Baskaran et al. (2021a) have shown that ppLFERs yield predictions with smaller confidence intervals than other easily accessible and commonly used prediction techniques. The noncommercial UFZ LSER website further assures that anyone can make such predictions and derive their confidence limits (Ulrich et al., 2017). Therefore, our proposed procedure relies on the following ppLFERs to predict the K OA and K OW values of compound i: where S , A , B , V , and L are solute descriptors (SDs) quantifying a compound i's ability to undergo various types of intermolecular interactions. Specifically, S is the polarizability/dipolarity parameter, A and B are the H‐bond acidity and basicity, V is the McGowan molar volume, and L is the decadic logarithm of the hexadecane–air partitioning ratio. Equation (8) is taken from Endo and Goss (2014), whereas Equation (9) is a version of a ppLFER for log K OA presented by Endo and Goss (2014), which eliminates the term containing V , because it has no statistical significance (Baskaran et al., 2021a). Please note that Equations (8) and (9) are currently not implemented on the UFZ LSER website (Ulrich et al., 2017). The performance of ppLFER predictions depends on the origin of the SDs, with smaller prediction errors obtained with experimentally derived SDs. In the case of predicting log K OA with Equation (9), this has been shown by Baskaran et al. (2021a). Experimentally determined SDs, including those in the ABSOLV database (ACD/Labs), are available for close to 10 000 organic chemicals and accessible through the UFZ LSER database (Ulrich et al., 2017). If no experimental SDs are available, they can be predicted with quantitative structure–property relationships (QSPRs) (e.g., Platts et al., 1999, 2000), but one should then expect considerably higher PE s, in particular, if a compound's structure requires an extrapolation beyond the applicability domain (AD) of the QSPRs used for SD prediction. The QSPRs, developed using the Iterative Fragment Selection (IFS) algorithm (Brown et al., 2012) for the SDs in Equations (8) and (9), are also available (e.g., Brown, 2014) and accessible through the UFZ LSER website (Ulrich et al., 2017) and only require a chemical's molecular structure as a simplified molecular input line entry system (SMILES) notation as input. The proposed procedure uses experimental SDs to predict log K OW and log K OA, if they are available, and otherwise uses the IFS‐QSPR predicted ones. The upper PE is are obtained differently depending on whether experimental or predicted SDs are used.

Estimating the error of log K OW and log K OA

Estimating the error of log K OW and log K OA when using experimental SDs

If experimental SDs are used, we propose using a PE that is empirically derived from the comparison of measured and ppLFER‐predicted partition ratios in the threshold region (i.e., within two and a half orders of magnitude of T) and is a constant for all chemicals. In the case of log K OA, we extracted all measured values within the range 2.5 to 7.5 from the database compiled by Baskaran et al. (2021b). In the case of log K OW, we extracted all measured values, excluding salts, within the range −0.5 to 4.5 from the OPERA database (Mansouri et al., 2018). To remove any inconsistency and correctly assign a structure to a chemical, SMILES and CAS number data were standardized and curated using an in‐house routine based on the software OpenBabel v3.1.1 (O'Boyle et al., 2011) and the R package Webchem v 0.5.0 (Szöcs et al., 2020). We then calculated the difference between the experimental values and the values calculated using Equations (8) and (9) together with experimental SDs from the UFZ LSER database (giving preference to UFZ‐preselected values over values from the ABSOLV database). The statistics on these differences are presented in Table 1. The PE is derived as either PI95 = mean ± 1.96 · StDev or SE = mean ± StDev, whereby mean and StDev refer to the mean and the standard deviation of the differences between the measured and predicted values.

Table 1

	log K _OA	log K _OW
N	492	1964
Standard deviation	0.18	0.33
Mean	0.00	−0.07
Upper SE	0.18	0.26
Upper PI95	0.35	0.58

Statistical data on the discrepancy between measured partition ratios within two and a half orders of magnitude of the threshold of interest to B assessment and partition ratios predicted using Equations (8) and (9) and experimental solute descriptors Based on this analysis, we use the following versions of Equations (2) and (3) when experimental SDs are available. If we apply the 95% prediction interval PI95: Also, when using the standard error of the prediction SE: Estimating the error of log K OW and log K OA when using predicted SDs. If predicted SDs are used, the error of the prediction is expected to vary based on the fit of a chemical with the AD of the QSPRs used in the prediction of the SDs. The IFS‐based QSPRs quantify this fit and assign a standard error SE to an SD X predicted for compound i (X = S , A , B , (V ), L ). As can be seen in Equations (8) and (9), the system constants x in a ppLFER have an uncertainty SE x as well (x = s, a, b, (v), l, and intercept c). These constants are derived during the original calibration of a ppLFER, which involves multiple linear regression of measured partition ratios against experimental SDs. In other words, in a ppLFER equation, uncertain system constants are combined with uncertain SDs to predict a chemical property. We therefore propose to estimate the compound‐specific standard error of the predicted log K OAi by propagating the standard errors SE Xi of the SDs and the standard error SE x of the fitted system constants in Equation (9) as: where SE c is the error of the intercept in Equation (9). An equivalent equation with additional square terms for the error of vV i in Equation (8) is used to estimate the of the predicted log K OWi. In doing so, we neglect the contribution of any covariance terms. Finally, we multiply the or by 1.96 to estimate the and . Note that the same compound‐specific approach is not feasible for predictions relying on experimental SDs because the uncertainty of those SDs is rarely known quantitatively. The uncertainty of experimentally determined SDs has occasionally been reported for individual chemicals (Tülp et al., 2008). In other cases, an estimate of the uncertainty of a set of SDs S , A , B , V , and L for a compound is estimated from the RMSE between the ppLFER predicted and experimental log K for the systems used in the calibration of that set of SDs (called “internal consistency” by Stenzel et al., 2013). Often, however, no uncertainty information is available, which is the case for the bulk of the SDs available through the UFZ LSER database (Ulrich et al., 2017). Therefore, it is generally not possible to estimate the uncertainty of a partition ratio predicted with a ppLFER and experimental SDs on a compound‐specific basis. To confirm the appropriateness of the error propagation calculation using Equation (10), and specifically of the assumption that the SDs are independent and not correlated with each other and that the covariance terms can therefore be neglected, we used two other methods to estimate the error of a ppLFER prediction using predicted SDs. One of these relies on a comparison of measured and predicted K OA and K OW values, that is, it uses the approach applied to predictions with experimental SDs. The other is a Monte Carlo simulation. Details for both are provided in the Supporting Information. Monte Carlo simulations yielded PI95 that are essentially identical to those obtained by error propagation. The SE and PI95 obtained by error propagation are of the same order of magnitude as those obtained from the model–measurement comparison, even though they relate to completely different sets of chemicals. Again, details are provided in the Supporting Information. Accordingly, the following versions of Equations (2) and (3) are used when predicted SDs are used. If we apply the 95% prediction interval: Also, when using the standard error of the prediction: Finally, irrespective of the use of experimental or predicted SDs, we use: when ignoring the error of the prediction. How does this approach account for the AD of the individual models? Because different models quantify the AD differently, they are generally not directly comparable. The AD information serves as a warning that the prediction for a particular chemical is less reliable; it is not necessarily wrong, but simply has a higher uncertainty. Thus, by including the uncertainty in the assessment, we indirectly include information about the AD of the models. Chemicals that have a poor fit with the AD are assigned larger errors. This also serves as an incentive to seek out models appropriate for each chemical, since this will reduce the error associated with a prediction and thereby increase confidence in the assessment.

Accounting for dissociation and the formation of charged species

Dissociation and the formation of charged species can affect the elimination of organic chemicals from air‐breathing organisms in two ways. It will increase the water solubility of a compound and therefore increase the possibility for urinary excretion and it will decrease the volatility of a compound and therefore might prevent exhalation loss, especially if the chemical is entirely present in its charged form. Because Equations (8) and (9) only predict the partitioning behavior of the neutral form of a compound, this could result in misclassifications. If it is possible to gain access to a technique predicting the dissociation of neutral chemicals, the proposed procedure should also contain additional steps. If a chemical has been judged not B based on Equations (3′)–(3″″′), that is, if its neutral species is assessed to have a log K OA below the threshold of 5, it should be confirmed that the compound can exist in neutral form at the physiological pH of 7.4. If the fraction of the neutral species is predicted to be less than 1%, the chemical cannot be assumed to be not B. Underlying this approach is the assumption that 1% of a chemical with a log K OA less than 5 being present in the neutral form is sufficient for rapid respiratory elimination. We used the Global, Adjusted Locally According to Similarity (GALAS) algorithm (Sazonovas et al., 2010) by ACD/Percepta (2021) to estimate the percent of a chemical in the neutral form. Even if a chemical has been judged to have a log K OW above 2 based on Equations (2′)–(2″″′), it may still be possible to exclude it as a bioaccumulation concern if it is dissociating and has a log D below that threshold. We therefore include a comparison of a chemical i's log D at pH 7.4, again predicted using the GALAS algorithm by ACD/Percepta (2021), with the log K OW threshold as in Equation (4). This software does not provide an estimate of the reliability of a log D prediction that would be suitable to estimate a prediction error. However, it does provide a reliability index (RI) for its log K OW predictions. We used the same data set of measured log K OW values that we used in the derivation of a prediction interval for the ppLFER‐predicted log K OW to evaluate the performance of GALAS for predicting log K OW values in the threshold region between −0.5 and 4.5. We calculated the statistics on the differences between the measured and predicted log K OW for four groups of compounds, depending on the value of the RI (Table S2). The PE of the log D will have to be larger than the PE of the log K OW from which it is calculated because of the additional uncertainty introduced by the prediction of the acid dissociation constant(s) pK A and the calculation of dissociation. On this basis, we apply an SE log of 0.38, 0.75, and 1.5 and a PI95 log of 0.75, 1.5, and 3, if the RI for log K OW is greater than 0.75 (high reliability), between 0.5 and 0.75 (medium) and below 0.5 (low/poor), respectively. Depending on what error is applied, Equation (4) thus takes either the form: or

Estimating biotransformation loss

Predicting biotransformation half‐lives HL and their error: We are not aware of techniques predicting rates of biotransformation in air‐breathing animals (Papa et al., 2017). However, Arnot et al. (2014) and Papa et al. (2018) have presented prediction techniques for the biotransformation half‐life HL in humans, which are based on the IFS algorithm and global molecular descriptors selected by the Genetic Algorithm implemented in the QSARINS software (Gramatica et al., 2013), respectively. Both were trained with the same empirical data set (Papa et al., 2018). In the absence of prediction techniques for air‐breathing animals, we suggest that it is defensible to rely on predicted biotransformation rates in humans because humans have enzymatic systems for the biotransformation of xenobiotic compounds that are not fundamentally different from those of other air‐breathing organisms (Williams, 1974). Because it is not known whether the IFS‐QSPR or QSARINS provides more reliable predictions, we propose relying on the geometric mean of the HL values provided by the two techniques: The uncertainty of the HL prediction is related to how well a molecule fits within the AD of the prediction methods, and both IFS and QSARINS estimate a prediction error from that fit. Specifically, a prediction interval of indicates that there is a 95% likelihood that the true value of HL is between HL / and HL · . Similarly, we can estimate standard errors using log10()/1.96, such that there is a 68% likelihood that the true value of HL is between HL / and HL · . The value that should be compared with the threshold therefore is the geometric mean of the upper limits of the two techniques, that is, HL · PE HL and HL · PE HL . When using the 95% prediction error, statement (5) therefore reads: When using the standard error, we use: When we ignore the error of the prediction, we simply compare the geometric mean of the predictions with the threshold:

Schematic of the overall approach

Figure 2 shows the entire approach in a flow chart. In particular, it clarifies the sequence of the various comparisons with threshold values. The approach is hierarchical in the sense that if a chemical is judged by any one comparison to be not bioaccumulative, subsequent comparisons are no longer required. This is why the comparisons involving the freely and easily accessible ppLFER predictions precede those requiring the commercial ACD/Labs predictions.

Figure 2

Flowchart delineating the entire assessment approach consisting of several comparisons of predicted chemical properties with threshold values indicative of rapid elimination from an organism by urination, exhalation, and biotransformation. The threshold of 70 days is based on the use of a predicted biotransformation half‐life in humans. When experimental solute descriptors are available, a fixed prediction error is applied for all compounds, whereas a compound‐specific prediction error PE is applied when solute descriptors have to be estimated We suggest that the comparison involving the log K OW precede the one involving the log K OA because the former turns out to be more effective than the latter, that is, it can judge a large fraction of chemicals as being not B (see the Results section below). The comparison involving the biotransformation half‐life is last because not everyone may agree that a first tier should include information beyond partitioning ratios (in analogy to a tier 1 assessment for aquatic bioaccumulation solely based on a log K OW value).

Testing the applicability and effectiveness of the approach

We applied the approach illustratively to 1158 organic chemicals to test (i) how readily applicable the various property estimation techniques are to a large number of diverse organic chemicals and (ii) how effective the approach is in eliminating chemicals as potential bioaccumulation threats in air‐breathing organisms. The data set was extracted from a list of compounds collected in the context of the CEFIC LRI ECO41 and ECO44 projects (https://cefic-lri.org/projects/) aimed at gathering in vitro and in vivo toxicokinetic data for mammals and included in the EAS‐E Suite database (www.eas-e-suite.com). The complete data set is very heterogeneous, comprising pharmaceuticals and environmentally relevant compounds (e.g., PCBs, PBDEs, and PFAs), covering a broad range of chemistries (e.g., neutral and ionizable compounds) and featuring partitioning properties (i.e., K OW and K OA) spanning over 10 orders of magnitude. To test our approach on a relevant area of the chemical space (i.e., substances that could possibly be subject to respiratory elimination), we selected compounds with a log K OA estimated with EPISuite's KOAWIN (US EPA, 2021) below 10, thus limiting the test data set to the 1158 chemicals reported in the Supporting Information. By excluding very involatile substances, the data set is biased toward relatively volatile organic compounds. We performed the screening procedure for the set of chemicals three times, either using the PI95 or the SE as the prediction error or ignoring the uncertainty of the predicted property values altogether. Using SE instead of PI95 corresponds to a larger tolerance of false‐negative categorizations. A justification for proceeding this way, could be, for example, if the threshold values have already been selected conservatively, that is, are in fact lower than they really need to be protective. Using the SE to express the prediction error can be used to assess how much more effective the procedure would be in identifying rapidly eliminated substances if more room for a wrong decision is provided. The screening without consideration of prediction errors can be used to assess the number of false negatives that one would incur, if one does not quantify and consider the prediction uncertainty.

RESULTS

Applicability of property prediction techniques

The ppLFER predictions of log K OW and log K OA could be made for all except one of the 1158 chemicals, that is, the prediction only failed for paraquat (CAS 004685‐14‐7). Experimental SDs could be found for 532 chemicals, implying that they had to be predicted for the remaining 625 substances. The HL predictions by QSARINS were successful for all chemicals, whereas the IFS‐HL QSPR failed again only for paraquat. Predictions of the neutral fraction and log D could be made by ACD/Labs for all chemicals. Overall, the fraction of chemicals with successful application of the property prediction techniques was therefore very high, and the likelihood that no judgement can be passed as a result of the failure to make a prediction is very low.

Effectiveness of eliminating nonbioaccumulative substances

The flow of the 1157 chemicals through the total procedure is depicted in Figure 3. The three panels within that diagram indicate the flow through the procedure by applying either the 95% PIs or the SEs or by assuming that the predicted properties have no error. The flow path of the 532 chemicals with experimental SDs is shown in red, whereas the 625 chemicals for which SDs needed to be predicted are shown in blue. Arrows leaving a diamond towards the left indicate a “Yes” decision, generally indicating that a chemical is judged to be below a threshold, whereas an arrow leaving a diamond on the right indicates a “No,” in most cases indicating that a chemical could not be confidently judged to have a property below a threshold.

Figure 3

Application of the proposed B assessment procedure to 1157 organic chemicals, applying either the 95% prediction intervals or the standard errors or by assuming that the predicted properties have no error If the most stringent uncertainty estimate (PI95) are applied, 960 out of 1157 chemicals (83%) are eliminated as not being a bioaccumulation concern in air‐breathing organisms. This fraction increases to 89% and 95% if the standard error or no error is applied. Accordingly, 17%, 11%, or 5% of screened chemicals are judged to require a more sophisticated B assessment. In other words, tolerating a potential for a larger number of false negatives decreases the number of chemicals requiring higher‐tier assessments by ~6% (both when moving from PI95 to SE and from SE to no error). By categorizing approximately four‐fifth of compounds as being of no bioaccumulation concern, the procedure functions as a reasonably effective filter; comparison of the three different panels in Figure 3 indicates that this effectiveness as a filter is not unduly dependent on the parameter chosen to represent the prediction error. Not too much emphasis should be placed on the numerical value of the percentages shown in Figure 3 (and also Figure 4), as they will clearly depend on the specific compound data set being assessed. Because of a relatively large number of polychlorinated biphenyls in our test set, the proportion of chemicals assigned to the “not B” category would likely be even larger for a data set that is truly representative of the universe of commercial chemicals and not biased by a high proportion of highly halogenated compounds. By using a reasonably diverse and large data set, this analysis should at least be broadly suggestive of the effectiveness of the proposed procedure, and especially allow for the comparison of the effectiveness of different variations (e.g., including biotransformation or not, including log D or not, using different types of error), as will be done in the next section.

Figure 4

Categorization of the 1157 chemicals in the test set based on the proposed procedure using different choices for the error of the property predictions (PI95, 95% prediction interval; SE, standard error; noE, no error). The brackets on the right group the chemicals that are categorized as not B based on (i) partition ratios of the neutral species, (ii) any partitioning property, and (iii) any criterion A valid question is whether chemicals are correctly categorized by the procedure. Because the stated goal is de‐prioritization, success can only be assessed by confirming that de‐prioritized chemicals are indeed not B. In other words, whether the chemicals earmarked for higher‐tier assessment are in fact B cannot be a measure of success; it is sufficient for them to be plausibly B. The Microsoft® Excel sheet in the supporting information identifies all of the chemicals that are judged not B and potentially B. While it is difficult to state affirmatively that all de‐prioritized chemicals cannot bioaccumulate in air‐breathing organisms, we can note the absence of substances among them that are known to bioaccumulate. Furthermore, compounds with known bioaccumulation potential in air‐breathing organisms, in particular, organochlorinated pesticides (e.g., endosulfan, β‐hexachlorocyclohexane) and a large number of polychlorinated biphenyls and benzenes (e.g., 1,2,4,5‐tetrachlorobenzene), are identified as requiring higher‐tier assessment (see Table S3). Food web modeling had identified trifluralin as potentially bioaccumulative in air‐breathing organisms (see Tab. 1 in Kelly et al., 2007), whereas it was de‐prioritized here based on rapid biotransformation. Field studies examining the potential bioaccumulation of trifluralin in food webs composed of air‐breathing organisms have found no evidence of bioaccumulation (e.g., Morris et al., 2016; Vorkamp & Rigét, 2014).

Processes responsible for eliminating nonbioaccumulative substances

Figure 4 summarizes the categorization of 1157 chemicals by the proposed procedure. There are three versions depending on the values that were used for the prediction error. The largest fractions of chemicals are declared not B because of rapid elimination by urination (blue bars) and biotransformation (green bars). Only a small fraction of chemicals is judged not B because of rapid elimination by respiration (yellow bars). A screening method based solely on partitioning properties would be quite inefficient, as only between a third (PI95) and half (no E) of the chemicals would be classified as not B. If the screening method were to rely only on log K OW and log K OA as elimination criteria, that is, if dissociation and log D were not taken into account, a mere 21%–37% of chemicals would be classified as not B. In other words, the consideration of biotransformation as an elimination process is important, as it accounts for 60%–50% of the decisions to declare a chemical not B. The inefficiency of screening based on log K OA and log K OW can be illustrated by placing the 1157 chemicals in a two‐dimensional partitioning space defined by these two partition ratios (Figure 5). The whiskers on each datapoint represent the PI95, either being the same for all chemicals with experimental SDs (orange markers) or with compound‐specific prediction errors for chemicals with predicted SDs (blue markers). The thresholds of log K OW 2 and log K OA 5 are shown by colored shading. Only for those chemicals for which at least one of the two whiskers falls fully within the green shaded area of the plot is a judgment of not B reached based on ppLFER‐predicted log K OX values.

Figure 5

Placement of the 1157 test chemicals in the chemical partitioning space defined by the log K OW and log K OA, based on the partitioning properties of the neutral version of the compound, as predicted with poly‐parameter linear free‐energy relationships using either experimental SDs (orange markers) or SDs estimated with the IFS‐QSPRs implemented in the UFZ LSER website (blue markers). The whiskers indicate the upper standard error. If a chemical, more precisely either its right or its upper whisker, falls in the green shaded area of the plot, it is judged to be not B in air‐breathing organisms. The small black outline close to the threshold values indicates chemical partitioning properties, where exhalation and urination act in concert to prevent bioaccumulation Very few chemicals fall in the green area at the bottom of the plots in Figure 5, which indicates that the number of chemicals classified as not B based on the volatility threshold is small (Figure 4). Among the multitude of organic chemicals, only a very small fraction is sufficiently volatile for respiration to be an efficient elimination pathway. For example, among mono‐functional linear alkyl compounds, the C6 alcohol, the C8 primary amine, and the C9 aldehyde and ketones have a log K OA above 5. Very few compounds with two functional groups have a log K OA below that threshold. For example, 1,2‐ethanediol, 2‐nitropropanal, 3‐aminopropanol, and 5‐amino‐pentanal all have log K OA above 5, as do aniline, nitro‐toluene, and para‐benzoquinone. Despite the small number of chemicals eliminated by the volatility criterion, we believe it nevertheless advisable to include it in a screening assessment, as experiments that may be required at a higher tier of assessment (e.g., an in vitro assay to determine the biotransformation kinetics of a compound) could be particularly challenging to conduct for such volatile compounds. Considerably more chemicals fall in the green part to the left of those plots (Figure 5), which is consistent with a larger fraction of substances being sufficiently water soluble for efficient urinary elimination. However, the bulk of the chemicals fall in the upper right of the plots and therefore cannot be judged not B based on partitioning properties alone. The plots also make it apparent that screening for B in fish based solely on partitioning (i.e., based on a log K OW threshold of 4.5 [ECHA, 2017b] or 5 [UNEP, 2017]) is more effective than screening for B in air‐breathing organisms based solely on partitioning because of the large number of chemicals with a log K OW between 2 and 4.5 or 5. This had already been highlighted by Kelly et al. (2003), who noted that 36% of the chemicals on the Canadian domestic substances list are deemed not B based on a log K OW threshold of 5, but could conceivably be B in air‐breathing organisms. If there is a large increase in the number of chemicals being judged not B upon relaxing the error stringency (i.e., moving from PI95 to SE to no error in Figure 4), this implies that there is a potential for a relatively large number of false negatives, if uncertainty is not considered. However, it also implies that the efficiency of the procedure would potentially benefit from a prediction technique with a smaller error. This is apparent for the ppLFER prediction of log K OW with estimated SDs (intermediate blue bars in Figure 4). The number of chemicals that can be excluded using the log K OW threshold increases from 91 to 163 and 241 if PI95, SE, and no error are applied. The fairly large error of a log K OW predicted with estimated SDs is also apparent from the large blue whiskers in the right panel of Figure 5. If we compare which elimination processes are responsible for a chemical being classified as not B, we can observe a shift of increasing importance of urinary excretion relative to the importance of biotransformation with decreasing assumed error (Figure 4). For example, 36% and 60% of the not B decisions are based on urinary excretion and biotransformation, respectively, when PI95 is applied, but these shift to 50% and 47% when the error of the predicted properties is ignored. The reason is simply the sequence of the various comparisons in the procedure. If a chemical is already sorted out in an earlier step of the procedure (e.g., based on a comparison of the ppLFER‐predicted log K OW), the relative importance of subsequent steps in identifying not B chemicals is obviously diminished.

DISCUSSION

Substituting the prediction methods proposed here

While we relied entirely on predicted property values, it is obviously possible to substitute a measured value in any of the decisions 2–5, if such a value is available. However, measured property values also have an uncertainty and it would be appropriate to apply an equivalent expression of measurement uncertainty, such as a 95% confidence interval or a standard error. Importantly, precision as obtained by replicate measurements often underestimates the true experimental uncertainty because of unknown measurement bias. Also, not every measured value is necessarily better than a prediction. For example, only high‐quality measurements will be able to match or reduce the uncertainty of a log K OA prediction with Equation (9) and experimental SDs. Similarly, it is possible to substitute any of the prediction methods proposed here with one perceived to be superior or with one that is more easily accessible. As mentioned earlier, the log K OW prediction with ppLFERs incurs quite high uncertainties when estimated SDs are used, in particular, if a molecule does not fall within the AD of one of the IFS‐QSPR used for the prediction of the SDs. It is conceivable that methods exist that are capable of predicting log K OW from molecular structures with a smaller prediction error, but they may often be commercial and/or fail to provide a quantitative estimate of prediction errors. The latter is a key prerequisite for the use of a prediction method in the proposed procedure. A different method obviously could also substitute for a prediction that failed. For example, in contrast to the IFS‐QSPR, the commercial software ACD/LABS ABSOLV succeeds in predicting SDs for paraquat. Equations (6) and (7) can be used to estimate log K OW and log K OA values for the neutral form of 4.70 and 5.99, respectively, but no information on the error of these predictions is available. In fact, a measured value is almost 10 orders of magnitudes lower (log K OW = −5, Platford, 1983), presumably because of the two positive charges in the molecule. In the future, it is likely that QSPRs will be developed that predict the biotransformation half‐life of organic chemicals in air‐breathing organisms, for example, based on the extensive data sets that have been generated on the toxicokinetics of organic compounds in rats and mice. It may then be advisable to substitute, or rather complement, predictions of HL in such organisms for the HL in humans used here. This seems prudent because while all air‐breathing organisms share the basic pattern of metabolism, large variations between species can arise within this pattern (Williams, 1974). In this case, it is imperative that the threshold value is adjusted to the body size of the organism. Here, we proposed a procedure that uses K OA and K OW to judge a chemical's propensity for elimination, which implies the assumption that n‐octanol is a suitable surrogate for describing the solvation properties of the tissues of air‐breathing organisms. There are ppLFERs available that predict solvation of neutral organic substances in different biological materials such as storage and membrane lipids and proteins (Endo et al., 2011, 2012; Geisler et al., 2012), which allows for the estimation of an entire organism's uptake capacity from an organism's biological composition and a chemical's SDs (Endo et al., 2013), that is, the same data that we propose to use for K OW and K OA predictions. It should be feasible to formulate comparisons 2 and 3 in terms of partitioning ratios between the whole organism and the gas and aqueous phases, and predict these partition ratios with the same approaches used here and thus avoid the use of octanol as a surrogate entirely. While such an approach would be more mechanistically realistic, thresholds in K organism/A and K organism/W that correspond to rapid elimination would still need to be established (Goss et al., 2013). Also, current regulatory thresholds tend to be based on parameters that not only can be predicted but also are readily accessible to reproducible experimental determination.

Strength of the proposed procedure

Of course, one could imagine many different approaches for a tier 1 assessment of bioaccumulation in air‐breathing animals, including many based on the same predicted properties as used here. For example, one could envisage using all of the comparisons of a chemical's log K OW and log K OA and HL with the threshold values, that is, to not impose a sequence to the comparisons and not disregard a comparison, if an earlier one had already classified a chemical as not B. One could also imagine using the predicted chemical properties as input to a mass balance model of contaminant fate in a representative air‐breathing organism and use a model‐derived parameter as a criterion for assessing a chemical as potentially B or not, for example, a total elimination half‐life (Goss et al., 2013) or a BMF. Among the advantages of the proposed procedure are: Its conceptual simplicity and transparency. Its reliance on partition ratio thresholds that have already been enshrined in chemical regulation (e.g., ECHA, 2017a). Its use of properties that can be predicted for a very wide range of organic chemicals. The potential to flexibly exclude certain threshold comparisons, if the prediction is not accessible (e.g., because of the need to access a commercial software) or because of conceptual preferences (e.g., not wanting to include the potential for biotransformation in a tier 1 assessment). Its tolerance for missing property values, a often a decision of not B can be rendered based on predicted values that are available. For example, although no IFS‐based prediction could be obtained for paraquat, QSARIN predicted an HL · of 12 days and an HL · of three days, rendering it not B. The relative ease of considering the prediction uncertainty in the assessment process. The possibility to account for dissociation, if relevant, under physiological conditions. In particular, the latter two aspects are more difficult to implement when using an integrative bioaccumulation model. For example, the consideration of parameter uncertainty presumably would require a full Monte Carlo analysis to propagate the uncertainty of the chemical properties to the model output that is to serve as the bioaccumulation criterion. While this is feasible, it is operationally difficult to implement when screening large numbers of chemicals. Bioaccumulation models treating dissociating organic compounds are only in their infancy (Armitage et al., 2017) and may not even exist yet for air‐breathing organisms. Also, consensus would have to be developed on (i) the organism to be modeled, (ii) the model approach as well as the model parameterization, (iii) the model‐generated output parameter to serve as the B criterion, and (iv) the threshold value to identify potentially bioaccumulative substances.

Weaknesses of the proposed procedure

There are also shortcomings of our approach: By using separate thresholds for efficient elimination by respiration, urination, and biotransformation, our scheme classifies a chemical as potentially B if none of the three processes by itself is fast enough to prevent bioaccumulation. If such a chemical is efficiently eliminated by a combination of more than one elimination process, the approach would thus lead to a false‐positive categorization. Whereas an approach using an integrated bioaccumulation model that considers the various processes working in concert would avoid such errors, false positives are not a major concern in a screening aimed at de‐prioritization. Furthermore, the number of chemicals affected is likely to be very small. For example, only for chemicals with partitioning properties falling into the small triangular area close to the threshold values in Figure 5 would the combination of exhalation and urination prevent bioaccumulation. The proposed approach would also categorize some perfluorinated acids as not B based on their very low log D values, which would suggest that they are rapidly eliminated by urinary excretion. Our procedure does not consider the phenomenon of renal reabsorption, which essentially prevents the efficient urinary excretion of some such substances that therefore are potentially highly bioaccumulative. Examples are perfluorooctanesulfonate (CAS 2795‐39‐3), perfluorobutane‐sulfonate (CAS 29420‐49‐3), perfluorohexanoic acid (CAS 307‐24‐4), perfluoropentanoic acid (2706‐90‐3), and perfluorobutanoic acid (375‐22‐4), all of which have predicted log D values below 0. In general, the treatment of dissociating compounds in the procedure proposed here should be considered as preliminary. For example, while we use here the pH of blood to calculate the log D, the pH of urine is often lower, that is, the dissociation equilibrium will be different in blood and urine for compounds with a pK or pK around 7. As the permeability between blood and urine is different for charged and neutral species, this can lead to ion trap phenomena and, accordingly, to a pH dependence in the rate of urinary excretion (e.g., Kiddie et al., 1972). In fact, it may not be entirely valid to compare a log D value against a threshold for efficient urinary excretion expressed in terms of log K OW. The assessment of the bioaccumulation potential of dissociating compounds may ultimately need to rely on toxicokinetic models specifically designed for such compounds (Armitage et al., 2017). The success of the proposed procedure relies very strongly on the ability to predict biotransformation half‐lives. The uncertainty of the prediction methods applied here is large and therefore the screening results depend considerably on the error value chosen. The list of chemicals deemed potentially bioaccumulative is strongly dominated by highly halogenated compounds and only includes one substance with less than two halogens (D5, decamethylcyclo‐pentasiloxane, CAS 000541‐02‐6) (Table S3). It is conceivable that the calibration data set for the QSPRs for HL was not sufficiently diverse to allow identification of poorly biotransformed structures that do not contain multiple halogens. Of course, it is also possible that our test data set of 1158 chemicals did not contain such substances.

Higher tiers of assessment

Chemicals that cannot be excluded as potentially B using a simple tier 1 procedure based on predicted chemical properties, as described here, will need to be subjected to a more sophisticated screening. The current study suggests that the focus of higher tiers should be on better characterization of biotransformation loss, and not necessarily on measured as opposed to predicted partitioning properties. The most promising route for this is the determination of biotransformation rates in in vitro assays using a system (e.g., S9 fraction, hepatocytes) for air‐breathing organisms and the application of in vitro in vivo extrapolation methods to estimate the rate of elimination by biotransformation.

DATA REPOSITORY

A Microsoft® Excel spreadsheet distributed as supplemental data contains the entire data set of 1158 chemicals, most of the predicted properties and their error, as well as the decisions made on a comparison of the predictions with threshold values.

CONFLICT OF INTEREST

The authors declare that there are no conflicts of interest. This article includes online‐only Supporting information. Three tables with additional information on the proposed procedure and on the chemicals identified as potentially bioaccumulative. Click here for additional data file. Click here for additional data file.

32 in total

1. Estimation of molecular linear free energy relationship descriptors by a group contribution approach. 2. Prediction of partition coefficients

Authors:
Journal: J Chem Inf Comput Sci Date: 2000-01

2. Correlation of bioconcentration factors.

Authors: D Mackay
Journal: Environ Sci Technol Date: 1982-05-01 Impact factor: 9.028

3. Iterative fragment selection: a group contribution approach to predicting fish biotransformation half-lives.

Authors: Trevor N Brown; Jon A Arnot; Frank Wania
Journal: Environ Sci Technol Date: 2012-07-20 Impact factor: 9.028

4. Assessment of chemical screening outcomes based on different partitioning property estimation methods.

Authors: Xianming Zhang; Trevor N Brown; Frank Wania; Eldbjørg S Heimstad; Kai-Uwe Goss
Journal: Environ Int Date: 2010-05-06 Impact factor: 9.621

5. Estimation of reliability of predictions and model applicability domain evaluation in the analysis of acute toxicity (LD50).

Authors: A Sazonovas; P Japertas; R Didziapetris
Journal: SAR QSAR Environ Res Date: 2010-01-01 Impact factor: 3.000

6. General model for estimating partition coefficients to organisms and their tissues using the biological compositions and polyparameter linear free energy relationships.

Authors: Satoshi Endo; Trevor N Brown; Kai-Uwe Goss
Journal: Environ Sci Technol Date: 2013-05-29 Impact factor: 9.028

Review 7. A review of new and current-use contaminants in the Arctic environment: evidence of long-range transport and indications of bioaccumulation.

Authors: Katrin Vorkamp; Frank F Rigét
Journal: Chemosphere Date: 2014-05-14 Impact factor: 7.086