Literature DB >> 24010266

Evaluating data mining algorithms using molecular dynamics trajectories.

Vasileios A Tatsis1, Christos Tjortjis, Panagiotis Tzirakis.   

Abstract

Molecular dynamics simulations provide a sample of a molecule's conformational space. Experiments on the mus time scale, resulting in large amounts of data, are nowadays routine. Data mining techniques such as classification provide a way to analyse such data. In this work, we evaluate and compare several classification algorithms using three data sets which resulted from computer simulations, of a potential enzyme mimetic biomolecule. We evaluated 65 classifiers available in the well-known data mining toolkit Weka, using 'classification' errors to assess algorithmic performance. Results suggest that: (i) 'meta' classifiers perform better than the other groups, when applied to molecular dynamics data sets; (ii) Random Forest and Rotation Forest are the best classifiers for all three data sets; and (iii) classification via clustering yields the highest classification error. Our findings are consistent with bibliographic evidence, suggesting a 'roadmap' for dealing with such data.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24010266     DOI: 10.1504/ijdmb.2013.055499

Source DB:  PubMed          Journal:  Int J Data Min Bioinform        ISSN: 1748-5673            Impact factor:   0.667


  3 in total

1.  An empirical study of a hybrid imbalanced-class DT-RST classification procedure to elucidate therapeutic effects in uremia patients.

Authors:  You-Shyang Chen
Journal:  Med Biol Eng Comput       Date:  2016-04-06       Impact factor: 2.602

2.  Development and application of a Chinese webpage suicide information mining system (sims).

Authors:  Penglai Chen; Jing Chai; Lu Zhang; Debin Wang
Journal:  J Med Syst       Date:  2014-09-30       Impact factor: 4.460

3.  Machine learning for pattern and waveform recognitions in terahertz image data.

Authors:  Dmitry S Bulgarevich; Miezel Talara; Masahiko Tani; Makoto Watanabe
Journal:  Sci Rep       Date:  2021-01-13       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.