Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Design and analysis of classifier learning experiments in bioinformatics: survey and case studies.

Literature DB >> 22908127

Design and analysis of classifier learning experiments in bioinformatics: survey and case studies.

Ozan Irsoy¹, Olcay Taner Yildiz, Ethem Alpaydin.

Abstract

In many bioinformatics applications, it is important to assess and compare the performances of algorithms trained from data, to be able to draw conclusions unaffected by chance and are therefore significant. Both the design of such experiments and the analysis of the resulting data using statistical tests should be done carefully for the results to carry significance. In this paper, we first review the performance measures used in classification, the basics of experiment design and statistical tests. We then give the results of our survey over 1,500 papers published in the last two years in three bioinformatics journals (including this one). Although the basics of experiment design are well understood, such as resampling instead of using a single training set and the use of different performance metrics instead of error, only 21 percent of the papers use any statistical test for comparison. In the third part, we analyze four different scenarios which we encounter frequently in the bioinformatics literature, discussing the proper statistical methodology as well as showing an example case study for each. With the supplementary software, we hope that the guidelines we discuss will play an important role in future studies.

Mesh：

Substances：
Proteins

Year: 2012 PMID： 22908127 DOI： 10.1109/TCBB.2012.117

Source DB: PubMed Journal: IEEE/ACM Trans Comput Biol Bioinform ISSN： 1545-5963 Impact factor: 3.710

Keyword Cloud
Cited

5 in total

1. Visualizing Validation of Protein Surface Classifiers.

Authors: A Sarikaya; D Albers; J Mitchell; M Gleicher
Journal: Comput Graph Forum Date: 2014-06 Impact factor: 2.078

2. Choosing the Most Effective Pattern Classification Model under Learning-Time Constraint.

Authors: Priscila T M Saito; Rodrigo Y M Nakamura; Willian P Amorim; João P Papa; Pedro J de Rezende; Alexandre X Falcão
Journal: PLoS One Date: 2015-06-26 Impact factor: 3.240

Review 3. Biologically inspired intelligent decision making: a commentary on the use of artificial neural networks in bioinformatics.

Authors: Timmy Manning; Roy D Sleator; Paul Walsh
Journal: Bioengineered Date: 2013-12-16 Impact factor: 3.269

4. rMSIcleanup: an open-source tool for matrix-related peak annotation in mass spectrometry imaging and its application to silver-assisted laser desorption/ionization.

Authors: Gerard Baquer; Lluc Sementé; María García-Altares; Young Jin Lee; Pierre Chaurand; Xavier Correig; Pere Ràfols
Journal: J Cheminform Date: 2020-07-22 Impact factor: 5.514

5. Diagnostic and prognostic utility of a DNA hypermethylated gene signature in prostate cancer.

Authors: Liang Kee Goh; Natalia Liem; Aadhitthya Vijayaraghavan; Gengbo Chen; Pei Li Lim; Kae-Jack Tay; Michelle Chang; John Soon Wah Low; Adita Joshi; Hong Hong Huang; Emarene Kalaw; Puay Hoon Tan; Wen-Son Hsieh; Wei Peng Yong; Joshi Alumkal; Hong Gee Sim
Journal: PLoS One Date: 2014-03-13 Impact factor: 3.240

5 in total