MOTIVATION: The ability to accurately measure structural similarities among small molecules is important for many analysis routines in drug discovery and chemical genomics. Algorithms used for this purpose include fragment-based fingerprint and graph-based maximum common substructure (MCS) methods. MCS approaches provide one of the most accurate similarity measures. However, their rigid matching policies limit them to the identification of perfect MCSs. To eliminate this restriction, we introduce a new mismatch tolerant search method for identifying flexible MCSs (FMCSs) containing a user-definable number of atom and/or bond mismatches. RESULTS: The fmcsR package provides an R interface, with the time-consuming steps of the FMCS algorithm implemented in C++. It includes utilities for pairwise compound comparisons, structure similarity searching, clustering and visualization of MCSs. In comparison with an existing MCS tool, fmcsR shows better time performance over a wide range of compound sizes. When mismatching of atoms or bonds is turned on, the compute times increase as expected, and the resulting FMCSs are often substantially larger than their strict MCS counterparts. Based on extensive virtual screening (VS) tests, the flexible matching feature enhances the enrichment of active structures at the top of MCS-based similarity search results. With respect to overall and early enrichment performance, FMCS outperforms most of the seven other VS methods considered in these tests. AVAILABILITY: fmcsR is freely available for all common operating systems from the Bioconductor site (http://www.bioconductor.org/packages/devel/bioc/html/fmcsR.html). CONTACT: thomas.girke@ucr.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: The ability to accurately measure structural similarities among small molecules is important for many analysis routines in drug discovery and chemical genomics. Algorithms used for this purpose include fragment-based fingerprint and graph-based maximum common substructure (MCS) methods. MCS approaches provide one of the most accurate similarity measures. However, their rigid matching policies limit them to the identification of perfect MCSs. To eliminate this restriction, we introduce a new mismatch tolerant search method for identifying flexible MCSs (FMCSs) containing a user-definable number of atom and/or bond mismatches. RESULTS: The fmcsR package provides an R interface, with the time-consuming steps of the FMCS algorithm implemented in C++. It includes utilities for pairwise compound comparisons, structure similarity searching, clustering and visualization of MCSs. In comparison with an existing MCS tool, fmcsR shows better time performance over a wide range of compound sizes. When mismatching of atoms or bonds is turned on, the compute times increase as expected, and the resulting FMCSs are often substantially larger than their strict MCS counterparts. Based on extensive virtual screening (VS) tests, the flexible matching feature enhances the enrichment of active structures at the top of MCS-based similarity search results. With respect to overall and early enrichment performance, FMCS outperforms most of the seven other VS methods considered in these tests. AVAILABILITY: fmcsR is freely available for all common operating systems from the Bioconductor site (http://www.bioconductor.org/packages/devel/bioc/html/fmcsR.html). CONTACT: thomas.girke@ucr.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Chenghao Zhu; Lisa Sawrey-Kubicek; Elizabeth Beals; Riley L Hughes; Chris H Rhodes; Romina Sacchi; Angela M Zivkovic Journal: Metabolomics Date: 2019-08-17 Impact factor: 4.290
Authors: Jennifer E Schollée; Emma L Schymanski; Michael A Stravs; Rebekka Gulde; Nikolaos S Thomaidis; Juliane Hollender Journal: J Am Soc Mass Spectrom Date: 2017-09-26 Impact factor: 3.109
Authors: Michalis Hadjithomas; I-Min Amy Chen; Ken Chu; Anna Ratner; Krishna Palaniappan; Ernest Szeto; Jinghua Huang; T B K Reddy; Peter Cimermančič; Michael A Fischbach; Natalia N Ivanova; Victor M Markowitz; Nikos C Kyrpides; Amrita Pati Journal: mBio Date: 2015-07-14 Impact factor: 7.867
Authors: Zeynep Kurkcuoglu; Panagiotis I Koukos; Nevia Citro; Mikael E Trellet; J P G L M Rodrigues; Irina S Moreira; Jorge Roel-Touris; Adrien S J Melquiond; Cunliang Geng; Jörg Schaarschmidt; Li C Xue; Anna Vangone; A M J J Bonvin Journal: J Comput Aided Mol Des Date: 2017-08-22 Impact factor: 3.686
Authors: Mohammad Tauqeer Alam; Viridiana Olin-Sandoval; Anna Stincone; Markus A Keller; Aleksej Zelezniak; Ben F Luisi; Markus Ralser Journal: Nat Commun Date: 2017-07-10 Impact factor: 14.919