Literature DB >> 24662204

Methods for assigning confidence to toxicity data with multiple values--Identifying experimental outliers.

Fabian P Steinmetz1, Steven J Enoch1, Judith C Madden1, Mark D Nelms1, Neus Rodriguez-Sanchez1, Phil H Rowe1, Yang Wen2, Mark T D Cronin3.   

Abstract

The assessment of data quality is a crucial element in many disciplines such as predictive toxicology and risk assessment. Currently, the reliability of toxicity data is assessed on the basis of testing information alone (adherence to Good Laboratory Practice (GLP), detailed testing protocols, etc.). Common practice is to take one toxicity data point per compound - usually the one with the apparently highest reliability. All other toxicity data points (for the same experiment and compound) from other sources are neglected. To show the benefits of incorporating the "less reliable" data, a simple, independent, statistical approach to assess data quality and reliability on a mathematical basis was developed. A large data set of toxicity values to Aliivibrio fischeri was assessed. The data set contained 1813 data points for 1227 different compounds, including 203 identified as non-polar narcotic. Log KOW values were calculated and non-polar narcosis quantitative structure-activity relationship (QSAR) models were built. A statistical approach to data quality assessment, which is based on data outlier omission and confidence scoring, improved the linear QSARs. The results indicate that a beneficial method for using large data sets containing multiple data values per compound and highly variable study data has been developed. Furthermore this statistical approach can help to develop novel QSARs and support risk assessment by obtaining more reliable values for biological endpoints.
Copyright © 2014 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Confidence; Conflicting data; Data quality; Microtox; Predictive toxicology; QSAR

Mesh:

Substances:

Year:  2014        PMID: 24662204     DOI: 10.1016/j.scitotenv.2014.02.115

Source DB:  PubMed          Journal:  Sci Total Environ        ISSN: 0048-9697            Impact factor:   7.963


  4 in total

1.  Interpreting physicochemical experimental data sets.

Authors:  Nicola Colclough; Mark C Wenlock
Journal:  J Comput Aided Mol Des       Date:  2015-06-09       Impact factor: 3.686

Review 2.  In silico toxicology: From structure-activity relationships towards deep learning and adverse outcome pathways.

Authors:  Jennifer Hemmerich; Gerhard F Ecker
Journal:  Wiley Interdiscip Rev Comput Mol Sci       Date:  2020-03-31

3.  CERAPP: Collaborative Estrogen Receptor Activity Prediction Project.

Authors:  Kamel Mansouri; Ahmed Abdelaziz; Aleksandra Rybacka; Alessandra Roncaglioni; Alexander Tropsha; Alexandre Varnek; Alexey Zakharov; Andrew Worth; Ann M Richard; Christopher M Grulke; Daniela Trisciuzzi; Denis Fourches; Dragos Horvath; Emilio Benfenati; Eugene Muratov; Eva Bay Wedebye; Francesca Grisoni; Giuseppe F Mangiatordi; Giuseppina M Incisivo; Huixiao Hong; Hui W Ng; Igor V Tetko; Ilya Balabin; Jayaram Kancherla; Jie Shen; Julien Burton; Marc Nicklaus; Matteo Cassotti; Nikolai G Nikolov; Orazio Nicolotti; Patrik L Andersson; Qingda Zang; Regina Politi; Richard D Beger; Roberto Todeschini; Ruili Huang; Sherif Farag; Sine A Rosenberg; Svetoslav Slavov; Xin Hu; Richard S Judson
Journal:  Environ Health Perspect       Date:  2016-02-23       Impact factor: 9.031

4.  QSAR modelling of a large imbalanced aryl hydrocarbon activation dataset by rational and random sampling and screening of 80,086 REACH pre-registered and/or registered substances.

Authors:  Kyrylo Klimenko; Sine A Rosenberg; Marianne Dybdahl; Eva B Wedebye; Nikolai G Nikolov
Journal:  PLoS One       Date:  2019-03-14       Impact factor: 3.240

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.