Literature DB >> 29559364

Redundancy in two major compound databases.

Dimitar Yonchev1, Dilyana Dimova1, Dagmar Stumpfe1, Martin Vogt1, Jürgen Bajorath2.   

Abstract

Public repositories of compounds and activity data are of prime importance for pharmaceutical research in academic and industrial settings. Major databases have evolved over the years. Their growth is accompanied by an increasing tendency toward data sharing. This is a positive development but not without potential problems. Using ChEMBL and PubChem as examples, we show that crosstalk between databases also leads to substantial data redundancy that might not be obvious. Redundancy is an important issue because it biases data analysis and knowledge extraction and leads to inflated views of available compounds, assays and activity data. Going forward it will be important to further refine data exchange and deposition criteria and make redundancy as transparent as possible.
Copyright © 2018 Elsevier Ltd. All rights reserved.

Mesh:

Substances:

Year:  2018        PMID: 29559364     DOI: 10.1016/j.drudis.2018.03.005

Source DB:  PubMed          Journal:  Drug Discov Today        ISSN: 1359-6446            Impact factor:   7.851


  1 in total

Review 1.  Assessing the public landscape of clinical-stage pharmaceuticals through freely available online databases.

Authors:  Rebekah H Griesenauer; Constantino Schillebeeckx; Michael S Kinch
Journal:  Drug Discov Today       Date:  2019-01-25       Impact factor: 7.851

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.