| Literature DB >> 33300210 |
Silke Gabbert1, Miriam Mathea2, Susanne N Kolle2, Robert Landsiedel2.
Abstract
For hazard classifications of chemicals, continuous data from animal- or nonanimal testing methods are often dichotomized into binary positive/negative outcomes by defining classification thresholds (CT). Experimental data are, however, subject to biological and technical variability. Each test method's precision is limited resulting in uncertainty of the positive/negative outcome if the experimental result is close to the CT. Borderline ranges (BR) around the CT were suggested, which represent ranges in which the study result is ambiguous, that is, positive or negative results are equally likely. The BR reflects a method's precision uncertainty. This article explores and compares different approaches to quantify the BR. Besides using the pooled standard deviation, we determine the BR by means of the median absolute deviation (MAD), with a sequential combination of both methods, and by using nonparametric bootstrapping. Furthermore, we quantify the BR for different hazardous effects, including nonanimal tests for skin corrosion, eye irritation, skin irritation, and skin sensitization as well as for an animal test on skin sensitization (the local lymph node assay, LLNA). Additionally, for one method (direct peptide reactivity assay) the BR was determined experimentally and compared to calculated BRs. Our results demonstrate that (i) the precision of the methods is determining the size of their BRs, (ii) there is no "perfect" method to derive a BR, alas, (iii) a consensus on BR is needed to account for the limited precision of testing methods.Entities:
Keywords: Borderline range; OECD test guideline; classification threshold; decision-making; precision uncertainty; prediction; toxicity testing
Mesh:
Year: 2020 PMID: 33300210 PMCID: PMC9292900 DOI: 10.1111/risa.13648
Source DB: PubMed Journal: Risk Anal ISSN: 0272-4332 Impact factor: 4.302
Overview of Test Methods Used for Determining the BR of Their Prediction Models
| Method Name (abbreviation) | Hazardous Effect/Endpoint | Animal/Nonanimal Test | OECD Test Guideline (TG); Reference | BR in Test Guideline? | Prediction Model's Classification Threshold |
|---|---|---|---|---|---|
| Local lymph node assay (LLNA) | Skin sensitization | Animal test | 429: OECD ( | No | Skin sensitizing potential if stimulation index (SI) of 3H‐thymidine incorporation ≥ 3 compared to the concurrent vehicle control group |
| Direct Peptide Reactivity Assay (DPRA) | Skin sensitization | Nonanimal ( | 442C; OECD ( | Mean Lys/Cys |
Positive if mean of Lys Positive if Cys‐depletion >13.89% |
| LuSens assay | Skin sensitization | Nonanimal ( | 442D; OECD ( | No | Positive if statistically significant induction of the luciferase activity > 1.5 |
| Human Cell Line Activation Test (h‐CLAT) | Skin sensitization | Nonanimal ( | 442E: OECD ( | No | Positive if cell surface markers CD54 and CD86 show an expression above 2.0‐fold and/or 1.5‐fold, respectively, after test substance treatment relative to concurrent vehicle controls and at relative cell viabilities of at least 50% |
| EpiDerm Skin Corrosion test (SCT) | Skin corrosion | Nonanimal ( | 431; OECD ( | No | Corrosive if relative cell viability < 50% after 3 minute exposure and/or < 15% after one hour exposure |
| Corrositex® | Skin corrosion | Nonanimal ( | 435; OECD ( | No |
Corrosive optional category 1A if mean breakthrough time of 0–3 minutes. Corrosive optional category 1B if mean breakthrough time > 3 and < 60 minutes. Corrosive optional category 1C if mean breakthrough time > 60 and < 240 minutes. Noncorrosive if mean breakthrough time > 240 minutes.
Corrosive optional category 1A if mean breakthrough time of 0–3 minutes. Corrosive optional category 1B if mean breakthrough time > 3 and < 30 minutes. Corrosive optional category 1C if mean breakthrough time > 30 and < 60 minutes. Noncorrosive if mean breakthrough time > 60 minutes. |
| EpiDerm® Skin Irritation test (SIT) | Skin irritation | Nonanimal ( | 439: OECD ( | Yes (45–55%) | Irritant if relative cell viability is < 50%. |
| Bovine corneal opacity and permeability test (BCOP) | Eye irritation | Nonanimal ( | 437; OECD ( | Yes |
Seriously eye damaging if in vitro irritation score (IVIS) > 55 No prediction can be made if 3 < IVIS ≤ 55 Nonirritant if IVIS ≤ 3 |
| EpiOcular® Eye Irritation test (EIT) | Eye irritation | Nonanimal ( | 492: OECD ( |
Yes (55–65%) | Nonirritant to the eye if relative cell viability is > 60% |
No numerical BR is described in OECD TG 437, but it is described that a “testing run is considered borderline if the predictions from the three corneas were nonconcordant, such that (i) two of the three corneas gave discordant predictions from the mean of all three corneas, OR, (ii) one of the three corneas gave a discordant prediction from the mean of all three corneas, and the discordant result was >10 IVIS units from the cut‐off threshold of 55. If the repeat testing run corroborates the prediction of the initial testing run (based upon the mean IVIS value), then a final decision can be taken without further testing. If the repeat testing run results in a nonconcordant prediction from the initial testing run (based upon the mean IVIS value), then a third and final testing run should be conducted to resolve equivocal predictions, and to classify the test chemical. It may be permissible to waive further testing for classification and labeling in the event any testing run results in a UN GHS Category 1 prediction,” cf. OECD (2017).
Lys/Cys: Lysine and cysteine.
Borderline Ranges (BR) of Various In Vitro Methods and One In Vivo Method Quantified by Different Methods
| Borderline Range | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Method 1 | Method 2 | Method 3 | Method 4 | Method 5 | Method 6 | Method 7 | Method 8 | |||||
| Test Method | Test Result | Cut‐off | Tg Borderline Range | Data Set Size | Pooled | log pooled | pooled MAD | log pooled MAD | MAD + bootstrap | log MAD + bootstrap | MAD + percentile | log MAD + percentile |
| Skin Corrosion/Irritation Tests | ||||||||||||
| EpiDerm SCT (OECD TG 431) | relative viability after 3 minutes exposure [%] | 50 | 45–55 | 275 | 44–56 | 46 ‐ 55 | 46–54 | 47–53 | 45–55 | 47 ‐ 54 | 40–60 | 43–58 |
| relative viability after 60 min exposure [%] | 15 | 10–20 | 281 | 8 –22 | 13–18 | 10–20 | 14–17 | 10–20 | 13–17 | 4–26 | 12–19 | |
| EpiDerm SIT (OECD TG 439) | relative viability after 60 minutes exposure [%] | 50 | 45–55 | 718 | 38–62 | 35–71 | 45–55 | 45–55 | 45–55 | 45–56 | 39–61 | 40–62 |
| Corrositex (OECD TG 435) | Corrositex Category 1 | 3 | n/a | 49 | 1.1–4.9 | 2.6–3.5 | 2.0–4.0 | 2.8–3.3 | 1.8–4.2 | 2.7–3.3 | 2.6–3.5 | 0.5–5.5 |
| 60 | n/a | 58.1–61.9 | 51.2–70.3 | 59.0–61.0 | 55.2–65.2 | 58.8–61.2 | 54.0–66.7 | 51.6–69.7 | 57.5–62.5 | |||
| 240 | n/a | 238.1–241.9 | 204.9–281.2 | 239.0–241.0 | 220.7–261.0 | 238.8–241.2 | 216.0–266.7 | 206.5–279.0 | 237.5–242.5 | |||
| Corrositex Category 2 | 3 | n/a | 5 | ‐6.2–12.2 | 2.7–3.3 | 0.7–5.3 | 2.9–3.2 | 2.0–4.0 | 2.9–3.1 | ‐2.08.0 | 2.7–3.3 | |
| 30 | n/a | 20.8–39.2 | 27.1–33.2 | 27.7–32.3 | 28.5–31.5 | 29.0–31.0 | 29.4–30.6 | 25.0–35.0 | 27.5–32.7 | |||
| 60 | n/a | 50.8–69.2 | 54.3–66.3 | 57.7–62.3 | 57.1–63.1 | 59.0–61.0 | 58.8–61.2 | 55.0–65.0 | 55.0–65.0 | |||
| Eye Irritation Tests | ||||||||||||
| EpiOcular EIT (OECD TG 492) | relative viability [%] | 60 | 55– 65 | 907 | 55–65 | 50–72 | 56–64 | 54–67 | 56–64 | 53–68 | 51–69 | 47–77 |
| BCOP (OECD TG 437) | IVIS | 55 | n/a | 814 | 46–64 | 29–125 | 51–59 | 38–80 | 50–60 | 35–71 | 46–64 | 22–136 |
| IVIS | 3 | n/a | 1–7 | ‐6–12 | ‐1–7 | 2–4 | ‐2–8 | 2–4 | ‐6–12 | 1–7 | ||
| Skin Sensitization Tests | ||||||||||||
| DPRA (OECD TG 442C) | Mean peptide depletion [%] | 6.38 | 3–10 | 385 | 4.13–8.63 | 4.33 ‐ 9.40 | 4.79–7.98 | 5.29–7.69 | 4.70–8.06 | 5.20–7.82 | 2.69–10.07 | 3.68–8.01 |
| Cysteine‐only depletion [%] | 13.89 | 9–17 | 9.95– 17.83 | 9.82–19.64 | 11.15–16.63 | 11.62–16.61 | 11.00–16.78 | 11.47–16.82 | 7.99–19.79 | 8.20–23.52 | ||
| LuSens (OECD TG 442D) | Luciferase fold induction | 1.5 | n/a | 2,514 | 1.2– 1.8 | 1.3–1.7 | 1.4–1.6 | 1.4–1.6 | 1.4 ‐ 1.6 | 1.4–1.6 | 1.3–1.7 | 1.3–1.8 |
| h‐CLAT (OECD TG 442E) | CD54 induction | 200 | n/a | 2,379 | ‐1–401 | 158–254 | 125–275 | 170–235 | 121–279 | 170–236 | 51–349 | 142–283 |
| CD86 induction | 150 | n/a | 2,415 | 127– 173 | 125–180 | 135–165 | 132–170 | 135–165 | 132–171 | 117–183 | 115–196 | |
| LLNA (OECD TG 429) | Thymidine incorporation | 3 | n/a | 277 | 0.9–5.1 | 2.0–4.6 | 2.1–3.9 | 2.1–4.2 | 2.0–4.0 | 2.1–4.3 | 2.3–3.7 | 2.0–4.5 |
| Lymph node cell counts | 1.5 | n/a | 1.2–1.8 | 1.2–1.9 | 1.3–1.7 | 1.3–1.8 | 1.3–1.7 | 1.3–1.8 | 0.8–2.2 | 1.0–2.2 | ||
| Lymph node weight | 1.5 | n/a | 1.3–1.7 | 1.3–1.7 | 1.4–1.7 | 1.3–1.7 | 1.4–1.7 | 1.3–1.7 | 0.8–2.2 | 1.0–2.2 | ||
| Ear weight | 1.25 | n/a | 1.19–1.31 | 1.18–1.32 | 1.21–1.30 | 1.20–1.30 | 1.20–1.30 | 1.19–1.31 | 0.59–1.91 | 0.84–1.87 | ||
All experimental data used in the current study was generated during routine testing in the GLP and ISO 17020 certified laboratories of BASF SE's Experimental Toxicology and Ecology, Germany.
The Corrositex Category is determined in the categorization screen. Depending on the outcome one or the other prediction model is used for the final prediction.
No numerical BR is described in OECD TG 437, but it is described that a “testing run is considered borderline if the predictions from the three corneas were nonconcordant, such that (i) two of the three corneas gave discordant predictions from the mean of all three corneas, OR, (ii) one of the three corneas gave a discordant prediction from the mean of all three corneas, AND the discordant result was >10 IVIS units from the cut‐off threshold of 55. If the repeat testing run corroborates the prediction of the initial testing run (based upon the mean IVIS value), then a final decision can be taken without further testing. If the repeat testing run results in a nonconcordant prediction from the initial testing run (based upon the mean IVIS value), then a third and final testing run should be conducted to resolve equivocal predictions, and to classify the test chemical. It may be permissible to waive further testing for classification and labeling in the event any testing run results in a UN GHS Category 1 prediction”, cf. OECD (2017).
Fig 1DPRA BRs for mean peptide depletion (panel A) and Cys‐only depletion (panel B) based on log transformed (indicated by “_log”) and nontransformed data of the mean absolute deviation (MAD), pooled standard deviation (SD), the bootstrapped method and the 90% percentile of all MADs. The interrupted line marks the cut‐offs as described in OECD TG 442C (i.e., 6.38% mean peptide depletion and 13.89% Cys‐only depletion) and the red lines indicate the upper and lower boundaries of the borderline range for the individual calculation methods. All experimental data used in the current study was generated during routine testing in the GLP and ISO 17020 certified laboratories of BASF SE's Experimental Toxicology and Ecology, Germany.
Fig 2LLNA BRs for based on data of the stimulation index for thymidine incorporation (panel A) and the stimulation index for lymph node cell counts (panel B) based on log transformed (indicated by “_log”) and nontransformed data of the mean absolute deviation (MAD), pooled standard deviation (SD), the bootstrapped method and the 90% percentile of all MADs. The interrupted line marks the cut‐offs as described in OECD TG 429 (i.e., an SI of 3 for thymidine incorporation) and in Basketter et al. (2012). The red lines indicate the upper and lower boundaries of the borderline range for the individual calculation methods. All experimental data used in the current study was generated during routine testing in the GLP and ISO 17020 certified laboratories of BASF SE's Experimental Toxicology and Ecology, Germany.
Fig 3Mean rank of methods across all data sets based on log transformed (indicated by “_log”) and nontransformed data of the mean absolute deviation (MAD), pooled standard deviation (SD), the bootstrapped method and the 90% percentile of all MADs.
Experimental Borderline Range (BR) of DPRA Close to the Cut‐Offs at 6.38% and 13.89%, Respectively
| Test Method | Test Substance Stock Concentration | Size of Data Set | Mean Test Result | Result Range [mean + | Result Range linearly transformed to the cut‐offs at 6.38% and 13.89%, respectively[mean + | |
|---|---|---|---|---|---|---|
| DPRA (OECD TG 442C) | 8 µM EGDMA | 27 | Mean peptide depletion 9%) | 6.85 | 5.92–7.78 | 5.45–7.31 |
| 27 | Cysteine‐only depletion (%) | 12.15 | 10.15–14.16 | 11.89–15.90 | ||
| DPRA (OECD TG 442C) | 10 µM EGDMA | 27 | Mean peptide depletion (%) | 8.27 | 7.26–9.28 | 5.37–7.39 |
| 27 | Cysteine‐only depletion (%) | 14.15 | 12.03–16.01 | 11.77–15.75 | ||
Notation Used for the Statistical Analysis
| Notation | Explanation |
|---|---|
|
| Classification threshold in a prediction model of a testing method |
|
| Substance |
|
| Concentration tested per substance |
|
| Number of replicates per substance |
|
| Replicate per substance |
|
| Test result of substance |
|
| Arithmetic mean of test results for substance |