Modelling methods and cross-validation variants in QSAR: a multi-level analysis

Abstract

Prediction performance often depends on the cross-validation and test-set validation protocols applied. Several combinations of different cross-validation variants and model-building techniques were used to reveal their complexity. Two case studies (acute toxicity data) were examined, applying five-fold cross-validation (with random, contiguous and Venetian blind forms) and leave-one-out cross-validation (CV). External test sets showed the effects and differences between the validation protocols. The models were generated with multiple linear regression (MLR), principal component regression (PCR), partial least squares (PLS) regression, artificial neural networks (ANN) and support vector machines (SVM). The comparisons were made by the sum of ranking differences (SRD) and factorial analysis of variance (ANOVA). The largest bias and variance could be assigned to the MLR method and contiguous block cross-validation. SRD can provide a unique and unambiguous ranking of methods and CV variants. Venetian blind cross-validation is a promising tool. The generated models were also compared based on their basic performance parameters (r² and Q²). MLR produced the largest gap between r² and Q², while PCR gave the smallest. Although PCR is the best validated and balanced technique, SVM always outperformed the other methods when experimental values were the benchmark. Variable selection was advantageous, and the modelling method had a larger influence than the CV variant.
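The cross-validation variants named in the abstract differ only in how samples are assigned to folds. The following is a minimal sketch of that assignment, not the authors' implementation. The function names, the one-descriptor least-squares model and the Q² convention used here (1 − PRESS/TSS, with TSS taken around the overall mean) are assumptions made for illustration; the abstract does not state which Q² definition was applied, and the toy values are not data from the study.

```python
import random
from statistics import mean


def contiguous_folds(n, k):
    """Contiguous blocks: consecutive samples share a fold."""
    return [i * k // n for i in range(n)]


def venetian_blind_folds(n, k):
    """Venetian blind: every k-th sample goes to the same fold."""
    return [i % k for i in range(n)]


def random_folds(n, k, seed=0):
    """Random: a balanced fold assignment, shuffled."""
    folds = venetian_blind_folds(n, k)
    random.Random(seed).shuffle(folds)
    return folds


def leave_one_out_folds(n):
    """Leave-one-out: each sample is its own fold."""
    return list(range(n))


def fit_line(x, y):
    """Least-squares line for one descriptor; returns (slope, intercept)."""
    xm, ym = mean(x), mean(y)
    sxx = sum((xi - xm) ** 2 for xi in x)
    sxy = sum((xi - xm) * (yi - ym) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, ym - slope * xm


def q2(x, y, folds):
    """Cross-validated Q2 = 1 - PRESS / TSS (assumed convention)."""
    press = 0.0
    for f in sorted(set(folds)):
        train = [i for i, g in enumerate(folds) if g != f]
        slope, intercept = fit_line([x[i] for i in train],
                                    [y[i] for i in train])
        # Accumulate squared errors on the held-out fold only.
        press += sum((y[i] - (slope * x[i] + intercept)) ** 2
                     for i, g in enumerate(folds) if g == f)
    ym = mean(y)
    tss = sum((yi - ym) ** 2 for yi in y)
    return 1.0 - press / tss


print(contiguous_folds(10, 5))      # [0, 0, 1, 1, 2, 2, 3, 3, 4, 4]
print(venetian_blind_folds(10, 5))  # [0, 1, 2, 3, 4, 0, 1, 2, 3, 4]

# Hypothetical toy values, ordered by x, for illustration only.
x = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0]
y = [1.2, 2.9, 5.3, 6.8, 9.1, 11.2, 12.7, 15.3, 16.9, 19.2]
for name, folds in [("contiguous", contiguous_folds(10, 5)),
                    ("venetian blind", venetian_blind_folds(10, 5)),
                    ("random", random_folds(10, 5)),
                    ("leave-one-out", leave_one_out_folds(10))]:
    print(name, round(q2(x, y, folds), 4))
```

When the samples are ordered (for example by activity or by compound class), contiguous blocks force the model to predict a region it was not trained on, whereas the Venetian blind scheme spreads each fold across the whole range. That difference is one plausible reading of why the abstract attributes the largest bias and variance to contiguous block cross-validation.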

Entities: Disease

Keywords: ANN; MLR; PCR; PLS; QSAR; SRD; SVM; cross-validation; toxicity; validation

Year: 2018 PMID: 30160175 DOI: 10.1080/1062936X.2018.1505778

Source DB: PubMed Journal: SAR QSAR Environ Res ISSN: 1026-776X Impact factor: 3.000

Cited by

4 in total

Review 1. Machine Learning of Raman Spectroscopy Data for Classifying Cancers: A Review of the Recent Literature.

Authors: Nathan Blake; Riana Gaifulina; Lewis D Griffin; Ian M Bell; Geraint M H Thomas
Journal: Diagnostics (Basel) Date: 2022-06-17

2. Intercorrelation Limits in Molecular Descriptor Preselection for QSAR/QSPR.

Authors: Anita Rácz; Dávid Bajusz; Károly Héberger
Journal: Mol Inform Date: 2019-04-04 Impact factor: 3.353

3. Pretreatment DCE-MRI-Based Deep Learning Outperforms Radiomics Analysis in Predicting Pathologic Complete Response to Neoadjuvant Chemotherapy in Breast Cancer.

Authors: Yunsong Peng; Ziliang Cheng; Chang Gong; Chushan Zheng; Xiang Zhang; Zhuo Wu; Yaping Yang; Xiaodong Yang; Jian Zheng; Jun Shen
Journal: Front Oncol Date: 2022-03-10 Impact factor: 6.244

Review 4. Machine learning models for classification tasks related to drug safety.

Authors: Anita Rácz; Dávid Bajusz; Ramón Alain Miranda-Quintana; Károly Héberger
Journal: Mol Divers Date: 2021-06-10 Impact factor: 3.364
