Literature DB >> 24160896

Estimating error rates in bioactivity databases.

Pekka Tiikkainen1, Louisa Bellis, Yvonne Light, Lutz Franke.   

Abstract

Bioactivity databases are routinely used in drug discovery to look-up and, using prediction tools, to predict potential targets for small molecules. These databases are typically manually curated from patents and scientific articles. Apart from errors in the source document, the human factor can cause errors during the extraction process. These errors can lead to wrong decisions in the early drug discovery process. In the current work, we have compared bioactivity data from three large databases (ChEMBL, Liceptor, and WOMBAT) who have curated data from the same source documents. As a result, we are able to report error rate estimates for individual activity parameters and individual bioactivity databases. Small molecule structures have the greatest estimated error rate followed by target, activity value, and activity type. This order is also reflected in supplier-specific error rate estimates. The results are also useful in identifying data points for recuration. We hope the results will lead to a more widespread awareness among scientists on the frequencies and types of errors in bioactivity data.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24160896     DOI: 10.1021/ci400099q

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  13 in total

Review 1.  Predicting drug metabolism: experiment and/or computation?

Authors:  Johannes Kirchmair; Andreas H Göller; Dieter Lang; Jens Kunze; Bernard Testa; Ian D Wilson; Robert C Glen; Gisbert Schneider
Journal:  Nat Rev Drug Discov       Date:  2015-04-24       Impact factor: 84.694

2.  Workflow for defining reference chemicals for assessing performance of in vitro assays.

Authors:  Richard S Judson; Russell S Thomas; Nancy Baker; Anita Simha; Xia Meng Howey; Carmen Marable; Nicole C Kleinstreuer; Keith A Houck
Journal:  ALTEX       Date:  2018-12-17       Impact factor: 6.043

3.  LBVS: an online platform for ligand-based virtual screening using publicly accessible databases.

Authors:  Minghao Zheng; Zhihong Liu; Xin Yan; Qianzhi Ding; Qiong Gu; Jun Xu
Journal:  Mol Divers       Date:  2014-09-03       Impact factor: 2.943

Review 4.  Chemical databases: curation or integration by user-defined equivalence?

Authors:  Anne Hersey; Jon Chambers; Louisa Bellis; A Patrícia Bento; Anna Gaulton; John P Overington
Journal:  Drug Discov Today Technol       Date:  2015-03-11

5.  The CARLSBAD database: a confederated database of chemical bioactivities.

Authors:  Stephen L Mathias; Jarrett Hines-Kay; Jeremy J Yang; Gergely Zahoransky-Kohalmi; Cristian G Bologa; Oleg Ursu; Tudor I Oprea
Journal:  Database (Oxford)       Date:  2013-06-21       Impact factor: 3.451

6.  Activity, assay and target data curation and quality in the ChEMBL database.

Authors:  George Papadatos; Anna Gaulton; Anne Hersey; John P Overington
Journal:  J Comput Aided Mol Des       Date:  2015-07-23       Impact factor: 3.686

7.  Proteochemometric modeling in a Bayesian framework.

Authors:  Isidro Cortes-Ciriano; Gerard Jp van Westen; Eelke Bart Lenselink; Daniel S Murrell; Andreas Bender; Thérèse Malliavin
Journal:  J Cheminform       Date:  2014-06-28       Impact factor: 5.514

8.  A reliable computational workflow for the selection of optimal screening libraries.

Authors:  Yocheved Gilad; Katalin Nadassy; Hanoch Senderowitz
Journal:  J Cheminform       Date:  2015-12-11       Impact factor: 5.514

9.  VB-MK-LMF: fusion of drugs, targets and interactions using variational Bayesian multiple kernel logistic matrix factorization.

Authors:  Bence Bolgár; Péter Antal
Journal:  BMC Bioinformatics       Date:  2017-10-04       Impact factor: 3.169

10.  Orthologue chemical space and its influence on target prediction.

Authors:  Lewis H Mervin; Krishna C Bulusu; Leen Kalash; Avid M Afzal; Fredrik Svensson; Mike A Firth; Ian Barrett; Ola Engkvist; Andreas Bender
Journal:  Bioinformatics       Date:  2018-01-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.