Literature DB >> 26138567

MegaMiner: A Tool for Lead Identification Through Text Mining Using Chemoinformatics Tools and Cloud Computing Environment.

Muthukumarasamy Karthikeyan1, Yogesh Pandit, Deepak Pandit, Renu Vyas.   

Abstract

Virtual screening is an indispensable tool to cope with the massive amount of data being tossed by the high throughput omics technologies. With the objective of enhancing the automation capability of virtual screening process a robust portal termed MegaMiner has been built using the cloud computing platform wherein the user submits a text query and directly accesses the proposed lead molecules along with their drug-like, lead-like and docking scores. Textual chemical structural data representation is fraught with ambiguity in the absence of a global identifier. We have used a combination of statistical models, chemical dictionary and regular expression for building a disease specific dictionary. To demonstrate the effectiveness of this approach, a case study on malaria has been carried out in the present work. MegaMiner offered superior results compared to other text mining search engines, as established by F score analysis. A single query term 'malaria' in the portlet led to retrieval of related PubMed records, protein classes, drug classes and 8000 scaffolds which were internally processed and filtered to suggest new molecules as potential anti-malarials. The results obtained were validated by docking the virtual molecules into relevant protein targets. It is hoped that MegaMiner will serve as an indispensable tool for not only identifying hidden relationships between various biological and chemical entities but also for building better corpus and ontologies.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26138567     DOI: 10.2174/1386207318666150703113525

Source DB:  PubMed          Journal:  Comb Chem High Throughput Screen        ISSN: 1386-2073            Impact factor:   1.339


  1 in total

1.  ChemEngine: harvesting 3D chemical structures of supplementary data from PDF files.

Authors:  Muthukumarasamy Karthikeyan; Renu Vyas
Journal:  J Cheminform       Date:  2016-12-29       Impact factor: 5.514

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.