Literature DB >> 28245947

A novel hierarchical selective ensemble classifier with bioinformatics application.

Leyi Wei1, Shixiang Wan1, Jiasheng Guo2, Kelvin Kl Wong3.   

Abstract

Selective ensemble learning is a technique that selects a subset of diverse and accurate basic models in order to generate stronger generalization ability. In this paper, we proposed a novel learning algorithm that is based on parallel optimization and hierarchical selection (PTHS). Our novel feature selection method is based on maximize the sum of relevance and distance (MSRD) for solving the problem of high dimensionality. Specifically, we have a PTHS algorithm that employs parallel optimization and candidate model pruning based on k-means and a hierarchical selection framework. We combine the prediction result of each basic model by majority voting, which employs the divide-and-conquer strategy to save computing time. In addition, the PT algorithm is capable to transform a multi-class problem into a binary classification problem, and thereby allowing our ensemble model to address multi-class problems. Empirical study shows that MSRD is efficient in solving the high dimensionality problem, and PTHS exhibits better performance than the other existing classification algorithms. Most importantly, our classifier achieved high-level performance on several bioinformatics problems (e.g. tRNA identification, and protein-protein interaction prediction, etc.), demonstrating efficiency and robustness.
Copyright © 2017 Elsevier B.V. All rights reserved.

Keywords:  Bioinformatics; Divide and conquer; Multi-class classification; Parallel optimization; Selective ensemble learning

Mesh:

Substances:

Year:  2017        PMID: 28245947     DOI: 10.1016/j.artmed.2017.02.005

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  50 in total

1.  ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning.

Authors:  Shihu Jiao; Zheng Chen; Lichao Zhang; Xun Zhou; Lei Shi
Journal:  Amino Acids       Date:  2022-03-14       Impact factor: 3.520

2.  Evaluating hierarchical machine learning approaches to classify biological databases.

Authors:  Pâmela M Rezende; Joicymara S Xavier; David B Ascher; Gabriel R Fernandes; Douglas E V Pires
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

3.  iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool.

Authors:  Xiao Yang; Xiucai Ye; Xuehong Li; Lesong Wei
Journal:  Front Genet       Date:  2021-03-31       Impact factor: 4.599

4.  Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways.

Authors:  Lei Chen; Yu-Hang Zhang; ShaoPeng Wang; YunHua Zhang; Tao Huang; Yu-Dong Cai
Journal:  PLoS One       Date:  2017-09-05       Impact factor: 3.240

5.  Accurate identification of RNA D modification using multiple features.

Authors:  Lijun Dou; Wenyang Zhou; Lichao Zhang; Lei Xu; Ke Han
Journal:  RNA Biol       Date:  2021-03-17       Impact factor: 4.652

6.  4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism.

Authors:  Rao Zeng; Song Cheng; Minghong Liao
Journal:  Front Cell Dev Biol       Date:  2021-05-10

7.  Cervical Cancer Prediction by Merging Features of Different Colposcopic Images and Using Ensemble Classifier.

Authors:  Elham Nikookar; Ebrahim Naderi; Ali Rahnavard
Journal:  J Med Signals Sens       Date:  2021-05-24

8.  AmPEP: Sequence-based prediction of antimicrobial peptides using distribution patterns of amino acid properties and random forest.

Authors:  Pratiti Bhadra; Jielu Yan; Jinyan Li; Simon Fong; Shirley W I Siu
Journal:  Sci Rep       Date:  2018-01-26       Impact factor: 4.379

9.  Discovery of novel therapeutic properties of drugs from transcriptional responses based on multi-label classification.

Authors:  Lingwei Xie; Song He; Yuqi Wen; Xiaochen Bo; Zhongnan Zhang
Journal:  Sci Rep       Date:  2017-08-02       Impact factor: 4.379

10.  Analysis of Bioactive Amino Acids from Fish Hydrolysates with a New Bioinformatic Intelligent System Approach.

Authors:  Mohamed Abd Elaziz; Ahmed Monem Hemdan; AboulElla Hassanien; Diego Oliva; Shengwu Xiong
Journal:  Sci Rep       Date:  2017-09-07       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.