| Literature DB >> 23614543 |
Stefan Brandmaier1, Sergii Novotarskyi, Iurii Sushko, Igor V Tetko.
Abstract
The importance of reliable methods for representative sub-sampling in terms of experimental design and risk assessment within the European Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH) system is crucial. We developed experimental design approaches, by utilising predicted properties and the 'distance to model' parameter, to estimate the benefits of certain compounds to the quality of a resulting model. A statistical evaluation of four regression data sets and one classification data set showed that the adaptive concept of iteratively refining the representation of the chemical space contributes to a more efficient and more reliable selection in comparison to traditional approaches. The evaluation of compounds with regard to the uncertainty and the correlation of prediction is beneficial, and in particular, for regression data sets of sufficient size, whereas the use of predicted properties to define the chemical space is beneficial for classification models. 2013 FRAME.Mesh:
Substances:
Year: 2013 PMID: 23614543 DOI: 10.1177/026119291304100106
Source DB: PubMed Journal: Altern Lab Anim ISSN: 0261-1929 Impact factor: 1.303