Literature DB >> 22035187

Efficient substructure searching of large chemical libraries: the ABCD chemical cartridge.

Dimitris K Agrafiotis1, Victor S Lobanov, Maxim Shemanarev, Dmitrii N Rassokhin, Sergei Izrailev, Edward P Jaeger, Simson Alex, Michael Farnum.   

Abstract

Efficient substructure searching is a key requirement for any chemical information management system. In this paper, we describe the substructure search capabilities of ABCD, an integrated drug discovery informatics platform developed at Johnson & Johnson Pharmaceutical Research & Development, L.L.C. The solution consists of several algorithmic components: 1) a pattern mapping algorithm for solving the subgraph isomorphism problem, 2) an indexing scheme that enables very fast substructure searches on large structure files, 3) the incorporation of that indexing scheme into an Oracle cartridge to enable querying large relational databases through SQL, and 4) a cost estimation scheme that allows the Oracle cost-based optimizer to generate a good execution plan when a substructure search is combined with additional constraints in a single SQL query. The algorithm was tested on a public database comprising nearly 1 million molecules using 4,629 substructure queries, the vast majority of which were submitted by discovery scientists over the last 2.5 years of user acceptance testing of ABCD. 80.7% of these queries were completed in less than a second and 96.8% in less than ten seconds on a single CPU, while on eight processing cores these numbers increased to 93.2% and 99.7%, respectively. The slower queries involved extremely generic patterns that returned the entire database as screening hits and required extensive atom-by-atom verification.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22035187     DOI: 10.1021/ci200413e

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  2 in total

1.  Sachem: a chemical cartridge for high-performance substructure search.

Authors:  Miroslav Kratochvíl; Jiří Vondrášek; Jakub Galgonek
Journal:  J Cheminform       Date:  2018-05-23       Impact factor: 5.514

2.  Systematic benchmark of substructure search in molecular graphs - From Ullmann to VF2.

Authors:  Hans-Christian Ehrlich; Matthias Rarey
Journal:  J Cheminform       Date:  2012-07-31       Impact factor: 5.514

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.