| Literature DB >> 34549368 |
Abstract
While machine learning models have become a mainstay in Cheminformatics, the field has yet to agree on standards for model evaluation and comparison. In many cases, authors compare methods by performing multiple folds of cross-validation and reporting the mean value for an evaluation metric such as the area under the receiver operating characteristic. These comparisons of mean values often lack statistical rigor and can lead to inaccurate conclusions. In the interest of encouraging best practices, this tutorial provides an example of how multiple methods can be compared in a statistically rigorous fashion.Entities:
Keywords: Classification model; Machine learning; QSAR; Statistical validation; Tutorial
Mesh:
Year: 2021 PMID: 34549368 DOI: 10.1007/s10822-021-00417-2
Source DB: PubMed Journal: J Comput Aided Mol Des ISSN: 0920-654X Impact factor: 4.179