Literature DB >> 25502379

AutoWeka: toward an automated data mining software for QSAR and QSPR studies.

Chanin Nantasenamat1, Apilak Worachartcheewan, Saksiri Jamsak, Likit Preeyanon, Watshara Shoombuatong, Saw Simeon, Prasit Mandi, Chartchalerm Isarankura-Na-Ayudhya, Virapong Prachayasittikul.   

Abstract

UNLABELLED: In biology and chemistry, a key goal is to discover novel compounds affording potent biological activity or chemical properties. This could be achieved through a chemical intuition-driven trial-and-error process or via data-driven predictive modeling. The latter is based on the concept of quantitative structure-activity/property relationship (QSAR/QSPR) when applied in modeling the biological activity and chemical properties, respectively, of compounds. Data mining is a powerful technology underlying QSAR/QSPR as it harnesses knowledge from large volumes of high-dimensional data via multivariate analysis. Although extremely useful, the technicalities of data mining may overwhelm potential users, especially those in the life sciences. Herein, we aim to lower the barriers to access and utilization of data mining software for QSAR/QSPR studies. AutoWeka is an automated data mining software tool that is powered by the widely used machine learning package Weka. The software provides a user-friendly graphical interface along with an automated parameter search capability. It employs two robust and popular machine learning methods: artificial neural networks and support vector machines. This chapter describes the practical usage of AutoWeka and relevant tools in the development of predictive QSAR/QSPR models. AVAILABILITY: The software is freely available at http://www.mt.mahidol.ac.th/autoweka.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25502379     DOI: 10.1007/978-1-4939-2239-0_8

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  6 in total

Review 1.  Preparing next-generation scientists for biomedical big data: artificial intelligence approaches.

Authors:  Jason H Moore; Mary Regina Boland; Pablo G Camara; Hannah Chervitz; Graciela Gonzalez; Blanca E Himes; Dokyoon Kim; Danielle L Mowery; Marylyn D Ritchie; Li Shen; Ryan J Urbanowicz; John H Holmes
Journal:  Per Med       Date:  2019-02-14       Impact factor: 2.512

Review 2.  Using Big Data to Discover Diagnostics and Therapeutics for Gastrointestinal and Liver Diseases.

Authors:  Benjamin Wooden; Nicolas Goossens; Yujin Hoshida; Scott L Friedman
Journal:  Gastroenterology       Date:  2016-10-20       Impact factor: 33.883

3.  Classification of P-glycoprotein-interacting compounds using machine learning methods.

Authors:  Veda Prachayasittikul; Apilak Worachartcheewan; Watshara Shoombuatong; Virapong Prachayasittikul; Chanin Nantasenamat
Journal:  EXCLI J       Date:  2015-08-19       Impact factor: 4.068

Review 4.  Unraveling the bioactivity of anticancer peptides as deduced from machine learning.

Authors:  Watshara Shoombuatong; Nalini Schaduangrat; Chanin Nantasenamat
Journal:  EXCLI J       Date:  2018-07-25       Impact factor: 4.068

Review 5.  Towards reproducible computational drug discovery.

Authors:  Nalini Schaduangrat; Samuel Lampa; Saw Simeon; Matthew Paul Gleeson; Ola Spjuth; Chanin Nantasenamat
Journal:  J Cheminform       Date:  2020-01-28       Impact factor: 5.514

Review 6.  The promise of automated machine learning for the genetic analysis of complex traits.

Authors:  Elisabetta Manduchi; Joseph D Romano; Jason H Moore
Journal:  Hum Genet       Date:  2021-10-28       Impact factor: 5.881

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.