| Literature DB >> 21576229 |
Tyler W H Backman1, Yiqun Cao, Thomas Girke.
Abstract
ChemMine Tools is an online service for small molecule data analysis. It provides a web interface to a set of cheminformatics and data mining tools that are useful for various analysis routines performed in chemical genomics and drug discovery. The service also offers programmable access options via the R library ChemmineR. The primary functionalities of ChemMine Tools fall into five major application areas: data visualization, structure comparisons, similarity searching, compound clustering and prediction of chemical properties. First, users can upload compound data sets to the online Compound Workbench. Numerous utilities are provided for compound viewing, structure drawing and format interconversion. Second, pairwise structural similarities among compounds can be quantified. Third, interfaces to ultra-fast structure similarity search algorithms are available to efficiently mine the chemical space in the public domain. These include fingerprint and embedding/indexing algorithms. Fourth, the service includes a Clustering Toolbox that integrates cheminformatic algorithms with data mining utilities to enable systematic structure and activity based analyses of custom compound sets. Fifth, physicochemical property descriptors of custom compound sets can be calculated. These descriptors are important for assessing the bioactivity profile of compounds in silico and quantitative structure-activity relationship (QSAR) analyses. ChemMine Tools is available at: http://chemmine.ucr.edu.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21576229 PMCID: PMC3125754 DOI: 10.1093/nar/gkr320
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Illustration of the functionalities provided by ChemMine Tools. The utilities of the five application domains (i–v) are listed in more detail in Table 1.
List of services provided by ChemMine Tools
| Functions | Program | Input | Output | Comments |
|---|---|---|---|---|
| (i) Compound workbench | ||||
| Structure import/export | Mouse clicks | SMILES/SDF | One or many compounds | |
| Format interconversions | SDF/SMILES | SMILES/SDF | One or many compounds | |
| Bioactivity data import | Tabular data | Table/heat map | SAR table | |
| Structure depictions | SMILES/SDF | Image file (GIF) | One or many compounds | |
| Structure drawing | Mouse clicks | SMILES/SDF | Single compound | |
| Database import | XML/SDF | SMILES/SDF | PubChem | |
| Scriptable access from | SDF, tabular data | Online viewing | SAR table | |
| (ii) Similarity toolbox | ||||
| Fragment-based similarity | SDF/SMILES | Similarity coefficients | Pairwise comparisons | |
| Maximum common substructure | SDF/SMILES | MCS (SDF), similarity coefficient | Pairwise comparisons | |
| (iii) Search toolbox | ||||
| Embedding and indexing | Mouse clicks, SDF/SMILES | Ranked compound list | Database search | |
| Fingerprint search | Mouse clicks, SDF/SMILES | Ranked compound list | Database search | |
| (iv) Clustering toolbox | ||||
| Binning clustering | SDF/SMILES, custom table | Cluster table | ||
| Hierarchical clustering | SDF/SMILES, custom table | Tree, distance matrix | Optional heat map | |
| Multidimensional scaling | SDF/SMILES, custom table | Scatter plot | Interactive | |
| (v) Property toolbox | ||||
| Physicochemical descriptors | SDF/SMILES | Property table | 38 descriptors |
The names of software tools, libraries and environments are italicized.
aPrograms developed by the ChemMine Tools project. Acronyms defined in text.