Literature DB >> 19647098

A novel feature selection approach for biomedical data classification.

Yonghong Peng1, Zhiqing Wu, Jianmin Jiang.   

Abstract

This paper presents a novel feature selection approach to deal with issues of high dimensionality in biomedical data classification. Extensive research has been performed in the field of pattern recognition and machine learning. Dozens of feature selection methods have been developed in the literature, which can be classified into three main categories: filter, wrapper and hybrid approaches. Filter methods apply an independent test without involving any learning algorithm, while wrapper methods require a predetermined learning algorithm for feature subset evaluation. Filter and wrapper methods have their, respectively, drawbacks and are complementary to each other in that filter approaches have low computational cost with insufficient reliability in classification while wrapper methods tend to have superior classification accuracy but require great computational power. The approach proposed in this paper integrates filter and wrapper methods into a sequential search procedure with the aim to improve the classification performance of the features selected. The proposed approach is featured by (1) adding a pre-selection step to improve the effectiveness in searching the feature subsets with improved classification performances and (2) using Receiver Operating Characteristics (ROC) curves to characterize the performance of individual features and feature subsets in the classification. Compared with the conventional Sequential Forward Floating Search (SFFS), which has been considered as one of the best feature selection methods in the literature, experimental results demonstrate that (i) the proposed approach is able to select feature subsets with better classification performance than the SFFS method and (ii) the integrated feature pre-selection mechanism, by means of a new selection criterion and filter method, helps to solve the over-fitting problems and reduces the chances of getting a local optimal solution.

Entities:  

Mesh:

Year:  2009        PMID: 19647098     DOI: 10.1016/j.jbi.2009.07.008

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  10 in total

1.  Identification of tissue-specific tumor biomarker using different optimization algorithms.

Authors:  Shib Sankar Bhowmick; Debotosh Bhattacharjee; Luis Rato
Journal:  Genes Genomics       Date:  2018-12-08       Impact factor: 1.839

2.  A software framework for building biomedical machine learning classifiers through grid computing resources.

Authors:  Raúl Ramos-Pollán; Miguel Angel Guevara-López; Eugénio Oliveira
Journal:  J Med Syst       Date:  2011-04-09       Impact factor: 4.460

3.  Medical data set classification using a new feature selection algorithm combined with twin-bounded support vector machine.

Authors:  Márcio Dias de Lima; Juliana de Oliveira Roque E Lima; Rommel M Barbosa
Journal:  Med Biol Eng Comput       Date:  2020-01-04       Impact factor: 2.602

4.  A machine learning approach to epileptic seizure prediction using Electroencephalogram (EEG) Signal.

Authors:  Marzieh Savadkoohi; Timothy Oladunni; Lara Thompson
Journal:  Biocybern Biomed Eng       Date:  2020-07-16       Impact factor: 5.687

5.  Bias and Stability of Single Variable Classifiers for Feature Ranking and Selection.

Authors:  Shobeir Fakhraei; Hamid Soltanian-Zadeh; Farshad Fotouhi
Journal:  Expert Syst Appl       Date:  2014-11-01       Impact factor: 6.954

6.  Analysis of hepatitis C infection using Raman spectroscopy and proximity based classification in the transformed domain.

Authors:  Anabia Sohail; Saranjam Khan; Rahat Ullah; Shahzad Ahmad Qureshi; Muhammad Bilal; Asifullah Khan
Journal:  Biomed Opt Express       Date:  2018-04-03       Impact factor: 3.732

7.  Exploiting heterogeneous features to improve in silico prediction of peptide status - amyloidogenic or non-amyloidogenic.

Authors:  Smitha Sunil Kumaran Nair; N V Subba Reddy; K S Hareesha
Journal:  BMC Bioinformatics       Date:  2011-11-30       Impact factor: 3.169

8.  Preprocessing Breast Cancer Data to Improve the Data Quality, Diagnosis Procedure, and Medical Care Services.

Authors:  Zeinab Sajjadnia; Raof Khayami; Mohammad Reza Moosavi
Journal:  Cancer Inform       Date:  2020-05-27

9.  A New Strategy for Analyzing Time-Series Data Using Dynamic Networks: Identifying Prospective Biomarkers of Hepatocellular Carcinoma.

Authors:  Xin Huang; Jun Zeng; Lina Zhou; Chunxiu Hu; Peiyuan Yin; Xiaohui Lin
Journal:  Sci Rep       Date:  2016-08-31       Impact factor: 4.379

10.  Classification Accuracy of Hepatitis C Virus Infection Outcome: Data Mining Approach.

Authors:  Mario Frias; Jose M Moyano; Antonio Rivero-Juarez; Jose M Luna; Ángela Camacho; Habib M Fardoun; Isabel Machuca; Mohamed Al-Twijri; Antonio Rivero; Sebastian Ventura
Journal:  J Med Internet Res       Date:  2021-02-24       Impact factor: 5.428

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.