| Literature DB >> 21660515 |
Iurii Sushko1, Sergii Novotarskyi, Robert Körner, Anil Kumar Pandey, Matthias Rupp, Wolfram Teetz, Stefan Brandmaier, Ahmed Abdelaziz, Volodymyr V Prokopenko, Vsevolod Y Tanchuk, Roberto Todeschini, Alexandre Varnek, Gilles Marcou, Peter Ertl, Vladimir Potemkin, Maria Grishina, Johann Gasteiger, Christof Schwab, Igor I Baskin, Vladimir A Palyulin, Eugene V Radchenko, William J Welsh, Vladyslav Kholodovych, Dmitriy Chekmarev, Artem Cherkasov, Joao Aires-de-Sousa, Qing-You Zhang, Andreas Bender, Florian Nigsch, Luc Patiny, Antony Williams, Valery Tkachenko, Igor V Tetko.
Abstract
The Online Chemical Modeling Environment is a web-based platform that aims to automate and simplify the typical steps required for QSAR modeling. The platform consists of two major subsystems: the database of experimental measurements and the modeling framework. A user-contributed database contains a set of tools for easy input, search and modification of thousands of records. The OCHEM database is based on the wiki principle and focuses primarily on the quality and verifiability of the data. The database is tightly integrated with the modeling framework, which supports all the steps required to create a predictive model: data search, calculation and selection of a vast variety of molecular descriptors, application of machine learning methods, validation, analysis of the model and assessment of the applicability domain. As compared to other similar systems, OCHEM is not intended to re-implement the existing tools or models but rather to invite the original authors to contribute their results, make them publicly available, share them with other users and to become members of the growing research community. Our intention is to make OCHEM a widely used platform to perform the QSPR/QSAR studies online and share it with other users on the Web. The ultimate goal of OCHEM is collecting all possible chemoinformatics tools within one simple, reliable and user-friendly resource. The OCHEM is free for web users and it is available online at http://www.ochem.eu.Entities:
Mesh:
Year: 2011 PMID: 21660515 PMCID: PMC3131510 DOI: 10.1007/s10822-011-9440-2
Source DB: PubMed Journal: J Comput Aided Mol Des ISSN: 0920-654X Impact factor: 3.686
Fig. 1A schematic overview of an individual record in the OCHEM database
Fig. 2A screenshot of the second step of the batch upload process
Fig. 3A screenshot of data preview, the third step of the batch upload
Fig. 4An overview of different user levels in the OCHEM. User’s rights decrease with level. Users can edit/delete entities of the same or lower user levels
Fig. 5The workflow of a typical QSAR research in the OCHEM system
Fig. 6The first step of model creation: selection of a training and validation set, a machine learning method and a validation protocol
Fig. 7A screenshot of the descriptor selection and configuration panel
Fig. 8The list of pending models. The models being calculated and the completed models waiting for an inspection by the user are listed here
Fig. 9Basic statistics for a predictive model. The training set has a link that opens a browser of experimental records where a user can examine properties of all compounds used in the model. A click on a dot in the observed versus predicted chart opens a similar browser information window for the corresponding compound
Fig. 10Statistics of a classification model. Summarized are the prediction accuracies for the training and test sets as well as confusion matrices
Fig. 11The plot shows residuals of the predictions for the training set, mapped against the selected “distance to model”, in this case ASNN-STDEV, the standard deviation of ensemble prediction vector. This information is used to estimate accuracy of predictions, when the model is applied to new compounds
Fig. 12Selection of models to predict properties for new compounds