Literature DB >> 21854053

Anatomy of high-performance 2D similarity calculations.

Imran S Haque1, Vijay S Pande, W Patrick Walters.   

Abstract

Similarity measures based on the comparison of dense bit vectors of two-dimensional chemical features are a dominant method in chemical informatics. For large-scale problems, including compound selection and machine learning, computing the intersection between two dense bit vectors is the overwhelming bottleneck. We describe efficient implementations of this primitive as well as example applications using features of modern CPUs that allow 20-40× performance increases relative to typical code. Specifically, we describe fast methods for population count on modern x86 processors and cache-efficient matrix traversal and leader clustering algorithms that alleviate memory bandwidth bottlenecks in similarity matrix construction and clustering. The speed of our 2D comparison primitives is within a small factor of that obtained on GPUs and does not require specialized hardware.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21854053      PMCID: PMC4839782          DOI: 10.1021/ci200235e

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  17 in total

1.  Reoptimization of MDL keys for use in drug discovery.

Authors:  Joseph L Durant; Burton A Leland; Douglas R Henry; James G Nourse
Journal:  J Chem Inf Comput Sci       Date:  2002 Nov-Dec

2.  SCISSORS: a linear-algebraical technique to rapidly approximate chemical similarities.

Authors:  Imran S Haque; Vijay S Pande
Journal:  J Chem Inf Model       Date:  2010-06-28       Impact factor: 4.956

3.  ZINC--a free database of commercially available compounds for virtual screening.

Authors:  John J Irwin; Brian K Shoichet
Journal:  J Chem Inf Model       Date:  2005 Jan-Feb       Impact factor: 4.956

4.  Small molecule shape-fingerprints.

Authors:  James A Haigh; Barry T Pickup; J Andrew Grant; Anthony Nicholls
Journal:  J Chem Inf Model       Date:  2005 May-Jun       Impact factor: 4.956

Review 5.  Molecular similarity and diversity in chemoinformatics: from theory to applications.

Authors:  Ana G Maldonado; J P Doucet; Michel Petitjean; Bo-Tao Fan
Journal:  Mol Divers       Date:  2006-02       Impact factor: 2.943

6.  Bounds and algorithms for fast exact searches of chemical fingerprints in linear and sublinear time.

Authors:  S Joshua Swamidass; Pierre Baldi
Journal:  J Chem Inf Model       Date:  2007-02-28       Impact factor: 4.956

Review 7.  Strategies for compound selection.

Authors:  Marius M Olah; Cristian G Bologa; Tudor I Oprea
Journal:  Curr Drug Discov Technol       Date:  2004-10

8.  970 million druglike small molecules for virtual screening in the chemical universe database GDB-13.

Authors:  Lorenz C Blum; Jean-Louis Reymond
Journal:  J Am Chem Soc       Date:  2009-07-01       Impact factor: 15.419

9.  Clustering a large number of compounds. 1. Establishing the method on an initial sample.

Authors:  L Hodes
Journal:  J Chem Inf Comput Sci       Date:  1989-05

10.  SIML: a fast SIMD algorithm for calculating LINGO chemical similarities on GPUs and CPUs.

Authors:  Imran S Haque; Vijay S Pande; W Patrick Walters
Journal:  J Chem Inf Model       Date:  2010-04-26       Impact factor: 4.956

View more
  5 in total

1.  GSA: a GPU-accelerated structure similarity algorithm and its application in progressive virtual screening.

Authors:  Xin Yan; Qiong Gu; Feng Lu; Jiabo Li; Jun Xu
Journal:  Mol Divers       Date:  2012-10-19       Impact factor: 2.943

2.  Second-generation PLINK: rising to the challenge of larger and richer datasets.

Authors:  Christopher C Chang; Carson C Chow; Laurent Cam Tellier; Shashaank Vattikuti; Shaun M Purcell; James J Lee
Journal:  Gigascience       Date:  2015-02-25       Impact factor: 6.524

3.  The chemfp project.

Authors:  Andrew Dalke
Journal:  J Cheminform       Date:  2019-12-05       Impact factor: 5.514

4.  USRCAT: real-time ultrafast shape recognition with pharmacophoric constraints.

Authors:  Adrian M Schreyer; Tom Blundell
Journal:  J Cheminform       Date:  2012-11-06       Impact factor: 5.514

5.  Blazing Signature Filter: a library for fast pairwise similarity comparisons.

Authors:  Joon-Yong Lee; Grant M Fujimoto; Ryan Wilson; H Steven Wiley; Samuel H Payne
Journal:  BMC Bioinformatics       Date:  2018-06-11       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.