| Literature DB >> 27752442 |
Robert Kase1, Muris Korkaric2, Inge Werner1, Marlene Ågerstrand3.
Abstract
BACKGROUND: The regulatory evaluation of ecotoxicity studies for environmental risk and/or hazard assessment of chemicals is often performed using the method established by Klimisch and colleagues in 1997. The method was, at that time, an important step toward improved evaluation of study reliability, but lately it has been criticized for lack of detail and guidance, and for not ensuring sufficient consistency among risk assessors.Entities:
Keywords: Ecotoxicology; Hazard/risk assessment; Klimisch method; Relevance evaluation; Reliability evaluation; Water quality criteria
Year: 2016 PMID: 27752442 PMCID: PMC5044958 DOI: 10.1186/s12302-016-0073-x
Source DB: PubMed Journal: Environ Sci Eur ISSN: 2190-4715 Impact factor: 5.893
General characteristics of the Klimisch and the final CRED evaluation methods
| Characteristics | Klimisch | CRED |
|---|---|---|
| Data type | Toxicity and ecotoxicity | Aquatic ecotoxicity |
| Number of reliability criteria | 12–14 (ecotoxicity) | Evaluating 20 (reporting 50) |
| Number of relevance criteria | 0 | 13 |
| Number of OECD reporting criteria includeda | 14 (of 37) | 37 (of 37) |
| Additional guidance | No | Yes |
| How to summarize the evaluation | Qualitative for reliability | Qualitative for reliability and relevance |
aAccording to [13]
List of ecotoxicity studies evaluated in the ring test
| Study | Test organism | Taxonomic group | Tested substance | Chemical substance classes | Evaluated endpoint | Peer reviewed | GLP study | References | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cyanobacteria | Algae | Higher plant | Crustacea | Fish | Industrial chemical | Biocide | Plant protection product | Pharmaceutical | Steroidal estrogens | |||||||
| A |
| X | Deltamethrin | X | EC50 48 h Immobilization | X | [ | |||||||||
| B |
| X | Erythromycin | X | NOEC 7 days growth | X | [ | |||||||||
| Ca |
| X | Erythromycin | X | NOEC 144 h growth | X | [ | |||||||||
| D |
| X | Deltamethrin | X | NOEC 72 h growth | X | [ | |||||||||
| Ea |
| X | Estrone | X | NOEC 40 days sex ratio | X | [ | |||||||||
| Fa |
| X | Estrone | X | NOEC 40 days sex ratio | X | [ | |||||||||
| Ga |
| X | Nonylphenol | X | NOEC 60 days hatching success | X | [ | |||||||||
| Ha |
| X | Cybutryne | X | NOEC 14 days growth | X | [ | |||||||||
aUnder discussion as a key study for EQS derivation
Fig. 1a Overall reliability categorization results using the Klimisch (n = 121) and the CRED (n = 104) evaluation methods. b Study-specific reliability categories assigned using the Klimisch and CRED evaluation methods for studies A–H. Significant differences were found for studies D and E (**p ≤ 0.01) and study G (*p ≤ 0.05) using the exact (permutation) version of the Chi-square test in R [38]
Fig. 2a Overall relevance categorization results using the Klimisch (n = 120) and CRED (n = 104) evaluation methods. b Study- specific relevance categories assigned using the Klimisch and CRED evaluation methods for studies A–H. Study H differed significantly (*p ≤ 0.05) using the exact (permutation) version of the Chi-square test in R [37]
Percentage fulfilled criteria for the assigned reliability and relevance categories using the CRED evaluation method
| Percentage of fulfilled criteriaa | |||||
|---|---|---|---|---|---|
| Mean | SD | Min | Max | n | |
| Reliability categories | |||||
| Reliable without restrictions | 93 | 12 | 79 | 100 | 3 |
| Reliable with restrictions | 72 | 12 | 47 | 90 | 24 |
| Not reliable | 60 | 15 | 21 | 90 | 58 |
| Not assignable | 51 | 15 | 21 | 64 | 19 |
| Relevance categoriesb | |||||
| Relevant without restrictions | 84 | 8 | 64 | 100 | 50 |
| Relevant with restrictions | 73 | 14 | 27 | 91 | 42 |
| Not relevant | 61 | 14 | 46 | 82 | 12 |
Shown are arithmetic mean, standard deviation (SD), minimum and maximum, and the number of evaluations assigned to each reliability and relevance category
aThe ring test was performed with a draft version of the CRED evaluation method including critical and non-critical criteria; this was not differentiated in this table
bThe non-conclusive category “not assignable” (C4) is excluded from this analysis, because it was not part of the ring test
Fig. 3Participants’ (n = 41) agreement regarding the accuracy, applicability, consistency, and dependence on expert judgement for the Klimisch (white) and CRED (gray) evaluation methods. Results are shown in Whisker plots. 1 totally agree, 2 mainly agree, 3 partially agree, 4 mainly disagree, 5 totally disagree; arithmetic mean of rating is indicated by +. Significant differences (Wilcoxon test) between ratings are indicated by *α < 0.05; **α < 0.01; and ***α < 0.001; na not applicable (question was only asked in phase II of the ring test)
Fig. 4Participants’ confidence in the results of their reliability (a) and relevance (b) evaluations when using the Klimisch method (n = 121) and the CRED evaluation method (n = 103). Chi-square analysis shows significant differences in the distribution of the responses between the two evaluation methods regarding reliability (p < 0.01) and relevance (p < 0.001)