Literature DB >> 19088122

A genetic programming-based approach to the classification of multiclass microarray datasets.

Kun-Hong Liu1, Chun-Gui Xu.   

Abstract

MOTIVATION: Feature selection approaches have been widely applied to deal with the small sample size problem in the analysis of micro-array datasets. For the multiclass problem, the proposed methods are based on the idea of selecting a gene subset to distinguish all classes. However, it will be more effective to solve a multiclass problem by splitting it into a set of two-class problems and solving each problem with a respective classification system.
RESULTS: We propose a genetic programming (GP)-based approach to analyze multiclass microarray datasets. Unlike the traditional GP, the individual proposed in this article consists of a set of small-scale ensembles, named as sub-ensemble (denoted by SE). Each SE consists of a set of trees. In application, a multiclass problem is divided into a set of two-class problems, each of which is tackled by a SE first. The SEs tackling the respective two-class problems are combined to construct a GP individual, so each individual can deal with a multiclass problem directly. Effective methods are proposed to solve the problems arising in the fusion of SEs, and a greedy algorithm is designed to keep high diversity in SEs. This GP is tested in five datasets. The results show that the proposed method effectively implements the feature selection and classification tasks.

Mesh:

Year:  2008        PMID: 19088122     DOI: 10.1093/bioinformatics/btn644

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  8 in total

1.  Alterations of tumor-related genes do not exactly match the histopathological grade in gastric adenocarcinomas.

Authors:  Guo-Yan Liu; Kun-Hong Liu; Yong Zhang; Yu-Zhi Wang; Xiao-Hong Wu; Yi-Zhuo Lu; Chao Pan; Ping Yin; Hong-Feng Liao; Ji-Qin Su; Qing Ge; Qi Luo; Bin Xiong
Journal:  World J Gastroenterol       Date:  2010-03-07       Impact factor: 5.742

2.  Methods for optimizing statistical analyses in pharmacogenomics research.

Authors:  Stephen D Turner; Dana C Crawford; Marylyn D Ritchie
Journal:  Expert Rev Clin Pharmacol       Date:  2009-09-01       Impact factor: 5.045

3.  An Algorithm Framework for Drug-Induced Liver Injury Prediction Based on Genetic Algorithm and Ensemble Learning.

Authors:  Bowei Yan; Xiaona Ye; Jing Wang; Junshan Han; Lianlian Wu; Song He; Kunhong Liu; Xiaochen Bo
Journal:  Molecules       Date:  2022-05-12       Impact factor: 4.927

4.  Multiclass classification of microarray data samples with a reduced number of genes.

Authors:  Elizabeth Tapia; Leonardo Ornella; Pilar Bulacio; Laura Angelone
Journal:  BMC Bioinformatics       Date:  2011-02-22       Impact factor: 3.169

5.  Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems.

Authors:  Kim-Anh Lê Cao; Simon Boitard; Philippe Besse
Journal:  BMC Bioinformatics       Date:  2011-06-22       Impact factor: 3.169

6.  Genetic programming based ensemble system for microarray data classification.

Authors:  Kun-Hong Liu; Muchenxuan Tong; Shu-Tong Xie; Vincent To Yee Ng
Journal:  Comput Math Methods Med       Date:  2015-02-25       Impact factor: 2.238

Review 7.  A survey of computational intelligence techniques in protein function prediction.

Authors:  Arvind Kumar Tiwari; Rajeev Srivastava
Journal:  Int J Proteomics       Date:  2014-12-11

8.  Integrating gut microbiome and host immune markers to understand the pathogenesis of Clostridioides difficile infection.

Authors:  Shanlin Ke; Nira R Pollock; Xu-Wen Wang; Xinhua Chen; Kaitlyn Daugherty; Qianyun Lin; Hua Xu; Kevin W Garey; Anne J Gonzales-Luna; Ciarán P Kelly; Yang-Yu Liu
Journal:  Gut Microbes       Date:  2021 Jan-Dec
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.