Literature DB >> 21567124

Enhanced cancer recognition system based on random forests feature elimination algorithm.

Akin Ozcift1.   

Abstract

Accurate classifiers are vital to design precise computer aided diagnosis (CADx) systems. Classification performances of machine learning algorithms are sensitive to the characteristics of data. In this aspect, determining the relevant and discriminative features is a key step to improve performance of CADx. There are various feature extraction methods in the literature. However, there is no universal variable selection algorithm that performs well in every data analysis scheme. Random Forests (RF), an ensemble of trees, is used in classification studies successfully. The success of RF algorithm makes it eligible to be used as kernel of a wrapper feature subset evaluator. We used best first search RF wrapper algorithm to select optimal features of four medical datasets: colon cancer, leukemia cancer, breast cancer and lung cancer. We compared accuracies of 15 widely used classifiers trained with all features versus to extracted features of each dataset. The experimental results demonstrated the efficiency of proposed feature extraction strategy with the increase in most of the classification accuracies of the algorithms.

Entities:  

Mesh:

Year:  2011        PMID: 21567124     DOI: 10.1007/s10916-011-9730-1

Source DB:  PubMed          Journal:  J Med Syst        ISSN: 0148-5598            Impact factor:   4.460


  8 in total

1.  Classification of breast tissue by electrical impedance spectroscopy.

Authors:  J E da Silva; J P de Sá; J Jossinet
Journal:  Med Biol Eng Comput       Date:  2000-01       Impact factor: 2.602

2.  Computer aided diagnosis system for the Alzheimer's disease based on partial least squares and random forest SPECT image classification.

Authors:  J Ramírez; J M Górriz; F Segovia; R Chaves; D Salas-Gonzalez; M López; I Alvarez; P Padilla
Journal:  Neurosci Lett       Date:  2010-02-01       Impact factor: 3.046

3.  Where are linear feature extraction methods applicable?

Authors:  Aleix M Martinez; Manli Zhu
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2005-12       Impact factor: 6.226

Review 4.  A review of feature selection techniques in bioinformatics.

Authors:  Yvan Saeys; Iñaki Inza; Pedro Larrañaga
Journal:  Bioinformatics       Date:  2007-08-24       Impact factor: 6.937

5.  Using random forest for reliable classification and cost-sensitive learning for medical diagnosis.

Authors:  Fan Yang; Hua-zhen Wang; Hong Mi; Cheng-de Lin; Wei-wen Cai
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

6.  Random forests classification analysis for the assessment of diagnostic skill.

Authors:  James D Katz; Gulnara Mamyrova; Olena Guzhva; Lena Furmark
Journal:  Am J Med Qual       Date:  2010-02-08       Impact factor: 1.852

7.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays.

Authors:  U Alon; N Barkai; D A Notterman; K Gish; S Ybarra; D Mack; A J Levine
Journal:  Proc Natl Acad Sci U S A       Date:  1999-06-08       Impact factor: 11.205

8.  Random forests ensemble classifier trained with data resampling strategy to improve cardiac arrhythmia diagnosis.

Authors:  Akin Ozçift
Journal:  Comput Biol Med       Date:  2011-03-17       Impact factor: 4.589

  8 in total
  2 in total

1.  Hypergraph Based Feature Selection Technique for Medical Diagnosis.

Authors:  Nivethitha Somu; M R Gauthama Raman; Kannan Kirthivasan; V S Shankar Sriram
Journal:  J Med Syst       Date:  2016-09-24       Impact factor: 4.460

2.  Prediction of acute kidney injury risk after cardiac surgery: using a hybrid machine learning algorithm.

Authors:  Yelena Petrosyan; Thierry G Mesana; Louise Y Sun
Journal:  BMC Med Inform Decis Mak       Date:  2022-05-18       Impact factor: 2.796

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.