Literature DB >> 20150679

On the importance of comprehensible classification models for protein function prediction.

Alex A Freitas1, Daniela C Wieser, Rolf Apweiler.   

Abstract

The literature on protein function prediction is currently dominated by works aimed at maximizing predictive accuracy, ignoring the important issues of validation and interpretation of discovered knowledge, which can lead to new insights and hypotheses that are biologically meaningful and advance the understanding of protein functions by biologists. The overall goal of this paper is to critically evaluate this approach, offering a refreshing new perspective on this issue, focusing not only on predictive accuracy but also on the comprehensibility of the induced protein function prediction models. More specifically, this paper aims to offer two main contributions to the area of protein function prediction. First, it presents the case for discovering comprehensible protein function prediction models from data, discussing in detail the advantages of such models, namely, increasing the confidence of the biologist in the system's predictions, leading to new insights about the data and the formulation of new biological hypotheses, and detecting errors in the data. Second, it presents a critical review of the pros and cons of several different knowledge representations that can be used in order to support the discovery of comprehensible protein function prediction models.

Mesh:

Substances:

Year:  2010        PMID: 20150679     DOI: 10.1109/TCBB.2008.47

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  10 in total

1.  Learning Classification Models of Cognitive Conditions from Subtle Behaviors in the Digital Clock Drawing Test.

Authors:  William Souillard-Mandar; Randall Davis; Cynthia Rudin; Rhoda Au; David J Libon; Rodney Swenson; Catherine C Price; Melissa Lamar; Dana L Penney
Journal:  Mach Learn       Date:  2015-10-20       Impact factor: 2.940

2.  A data mining approach for classifying DNA repair genes into ageing-related or non-ageing-related.

Authors:  Alex A Freitas; Olga Vasieva; João Pedro de Magalhães
Journal:  BMC Genomics       Date:  2011-01-12       Impact factor: 3.969

3.  Mining flexible-receptor docking experiments to select promising protein receptor snapshots.

Authors:  Karina S Machado; Ana T Winck; Duncan D A Ruiz; Osmar Norberto de Souza
Journal:  BMC Genomics       Date:  2010-12-22       Impact factor: 3.969

4.  Context-based preprocessing of molecular docking data.

Authors:  Ana T Winck; Karina S Machado; Osmar Norberto de Souza; Duncan D Ruiz
Journal:  BMC Genomics       Date:  2013-10-25       Impact factor: 3.969

Review 5.  Hierarchical ensemble methods for protein function prediction.

Authors:  Giorgio Valentini
Journal:  ISRN Bioinform       Date:  2014-05-04

Review 6.  A review of supervised machine learning applied to ageing research.

Authors:  Fabio Fabris; João Pedro de Magalhães; Alex A Freitas
Journal:  Biogerontology       Date:  2017-03-06       Impact factor: 4.277

7.  An interpretable machine learning model for diagnosis of Alzheimer's disease.

Authors:  Diptesh Das; Junichi Ito; Tadashi Kadowaki; Koji Tsuda
Journal:  PeerJ       Date:  2019-03-01       Impact factor: 2.984

8.  Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data.

Authors:  Rodrigo C Barros; Ana T Winck; Karina S Machado; Márcio P Basgalupp; André C P L F de Carvalho; Duncan D Ruiz; Osmar Norberto de Souza
Journal:  BMC Bioinformatics       Date:  2012-11-21       Impact factor: 3.169

9.  Predicting volume of distribution with decision tree-based regression methods using predicted tissue:plasma partition coefficients.

Authors:  Alex A Freitas; Kriti Limbu; Taravat Ghafourian
Journal:  J Cheminform       Date:  2015-02-26       Impact factor: 5.514

10.  An infrastructure to mine molecular descriptors for ligand selection on virtual screening.

Authors:  Vinicius Rosa Seus; Giovanni Xavier Perazzo; Ana T Winck; Adriano V Werhli; Karina S Machado
Journal:  Biomed Res Int       Date:  2014-04-09       Impact factor: 3.411

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.