Literature DB >> 20671325

A study of hierarchical and flat classification of proteins.

Arthur Zimek1, Fabian Buchwald, Eibe Frank, Stefan Kramer.   

Abstract

Automatic classification of proteins using machine learning is an important problem that has received significant attention in the literature. One feature of this problem is that expert-defined hierarchies of protein classes exist and can potentially be exploited to improve classification performance. In this article, we investigate empirically whether this is the case for two such hierarchies. We compare multiclass classification techniques that exploit the information in those class hierarchies and those that do not, using logistic regression, decision trees, bagged decision trees, and support vector machines as the underlying base learners. In particular, we compare hierarchical and flat variants of ensembles of nested dichotomies. The latter have been shown to deliver strong classification performance in multiclass settings. We present experimental results for synthetic, fold recognition, enzyme classification, and remote homology detection data. Our results show that exploiting the class hierarchy improves performance on the synthetic data but not in the case of the protein classification problems. Based on this, we recommend that strong flat multiclass methods be used as a baseline to establish the benefit of exploiting class hierarchies in this area.

Mesh:

Substances:

Year:  2010        PMID: 20671325     DOI: 10.1109/TCBB.2008.104

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  2 in total

1.  Bacterial species identification using MALDI-TOF mass spectrometry and machine learning techniques: A large-scale benchmarking study.

Authors:  Thomas Mortier; Anneleen D Wieme; Peter Vandamme; Willem Waegeman
Journal:  Comput Struct Biotechnol J       Date:  2021-11-09       Impact factor: 7.271

2.  Using Domain Knowledge for Interpretable and Competitive Multi-Class Human Activity Recognition.

Authors:  Sebastian Scheurer; Salvatore Tedesco; Kenneth N Brown; Brendan O'Flynn
Journal:  Sensors (Basel)       Date:  2020-02-22       Impact factor: 3.576

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.