Literature DB >> 21493051

A fuzzy-based data transformation for feature extraction to increase classification performance with small medical data sets.

Der-Chiang Li1, Chiao-Wen Liu, Susan C Hu.   

Abstract

OBJECTIVE: Medical data sets are usually small and have very high dimensionality. Too many attributes will make the analysis less efficient and will not necessarily increase accuracy, while too few data will decrease the modeling stability. Consequently, the main objective of this study is to extract the optimal subset of features to increase analytical performance when the data set is small.
METHODS: This paper proposes a fuzzy-based non-linear transformation method to extend classification related information from the original data attribute values for a small data set. Based on the new transformed data set, this study applies principal component analysis (PCA) to extract the optimal subset of features. Finally, we use the transformed data with these optimal features as the input data for a learning tool, a support vector machine (SVM). Six medical data sets: Pima Indians' diabetes, Wisconsin diagnostic breast cancer, Parkinson disease, echocardiogram, BUPA liver disorders dataset, and bladder cancer cases in Taiwan, are employed to illustrate the approach presented in this paper.
RESULTS: This research uses the t-test to evaluate the classification accuracy for a single data set; and uses the Friedman test to show the proposed method is better than other methods over the multiple data sets. The experiment results indicate that the proposed method has better classification performance than either PCA or kernel principal component analysis (KPCA) when the data set is small, and suggest creating new purpose-related information to improve the analysis performance.
CONCLUSION: This paper has shown that feature extraction is important as a function of feature selection for efficient data analysis. When the data set is small, using the fuzzy-based transformation method presented in this work to increase the information available produces better results than the PCA and KPCA approaches.
Copyright © 2011 Elsevier B.V. All rights reserved.

Entities:  

Mesh:

Year:  2011        PMID: 21493051     DOI: 10.1016/j.artmed.2011.02.001

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  14 in total

1.  A deep learning approach for prediction of Parkinson's disease progression.

Authors:  Afzal Hussain Shahid; Maheshwari Prasad Singh
Journal:  Biomed Eng Lett       Date:  2020-04-16

2.  The role of uropathogenic Escherichia coli adhesive molecules in inflammatory response- comparative study on immunocompetent hosts and kidney recipients.

Authors:  Bartosz Wojciuk; Karolina Majewska; Bartłomiej Grygorcewicz; Żaneta Krukowska; Ewa Kwiatkowska; Kazimierz Ciechanowski; Barbara Dołęgowska
Journal:  PLoS One       Date:  2022-05-23       Impact factor: 3.752

3.  A decision support system to improve medical diagnosis using a combination of k-medoids clustering based attribute weighting and SVM.

Authors:  Musa Peker
Journal:  J Med Syst       Date:  2016-03-21       Impact factor: 4.460

4.  New fuzzy support vector machine for the class imbalance problem in medical datasets classification.

Authors:  Xiaoqing Gu; Tongguang Ni; Hongyuan Wang
Journal:  ScientificWorldJournal       Date:  2014-03-23

5.  Detecting representative data and generating synthetic samples to improve learning accuracy with imbalanced data sets.

Authors:  Der-Chiang Li; Susan C Hu; Liang-Sian Lin; Chun-Wu Yeh
Journal:  PLoS One       Date:  2017-08-03       Impact factor: 3.240

6.  A New Intelligent Medical Decision Support System Based on Enhanced Hierarchical Clustering and Random Decision Forest for the Classification of Alcoholic Liver Damage, Primary Hepatoma, Liver Cirrhosis, and Cholelithiasis.

Authors:  Aman Singh; Babita Pandey
Journal:  J Healthc Eng       Date:  2018-02-01       Impact factor: 2.682

7.  Feature selection method based on artificial bee colony algorithm and support vector machines for medical datasets classification.

Authors:  Mustafa Serter Uzer; Nihat Yilmaz; Onur Inan
Journal:  ScientificWorldJournal       Date:  2013-07-28

8.  An efficient diagnosis system for Parkinson's disease using kernel-based extreme learning machine with subtractive clustering features weighting approach.

Authors:  Chao Ma; Jihong Ouyang; Hui-Ling Chen; Xue-Hua Zhao
Journal:  Comput Math Methods Med       Date:  2014-11-18       Impact factor: 2.238

9.  A Multiple-Classifier Framework for Parkinson's Disease Detection Based on Various Vocal Tests.

Authors:  Mahnaz Behroozi; Ashkan Sami
Journal:  Int J Telemed Appl       Date:  2016-04-12

10.  Accuracy Improvement for Predicting Parkinson's Disease Progression.

Authors:  Mehrbakhsh Nilashi; Othman Ibrahim; Ali Ahani
Journal:  Sci Rep       Date:  2016-09-30       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.