Literature DB >> 20411285

An approach for classification of highly imbalanced data using weighting and undersampling.

Ashish Anand1, Ganesan Pugalenthi, Gary B Fogel, P N Suganthan.   

Abstract

Real-world datasets commonly have issues with data imbalance. There are several approaches such as weighting, sub-sampling, and data modeling for handling these data. Learning in the presence of data imbalances presents a great challenge to machine learning. Techniques such as support-vector machines have excellent performance for balanced data, but may fail when applied to imbalanced datasets. In this paper, we propose a new undersampling technique for selecting instances from the majority class. The performance of this approach was evaluated in the context of several real biological imbalanced data. The ratios of negative to positive samples vary from ~9:1 to ~100:1. Useful classifiers have high sensitivity and specificity. Our results demonstrate that the proposed selection technique improves the sensitivity compared to weighted support-vector machine and available results in the literature for the same datasets.

Mesh:

Substances:

Year:  2010        PMID: 20411285     DOI: 10.1007/s00726-010-0595-2

Source DB:  PubMed          Journal:  Amino Acids        ISSN: 0939-4451            Impact factor:   3.520


  10 in total

1.  Distance Metric Based Oversampling Method for Bioinformatics and Performance Evaluation.

Authors:  Meng-Fong Tsai; Shyr-Shen Yu
Journal:  J Med Syst       Date:  2016-05-16       Impact factor: 4.460

2.  Patterns and predictions of drinking water nitrate violations across the conterminous United States.

Authors:  Michael J Pennino; Scott G Leibowitz; Jana E Compton; Ryan A Hill; Robert D Sabo
Journal:  Sci Total Environ       Date:  2020-03-05       Impact factor: 7.963

3.  Recognition of multiple imbalanced cancer types based on DNA microarray data using ensemble classifiers.

Authors:  Hualong Yu; Shufang Hong; Xibei Yang; Jun Ni; Yuanyuan Dan; Bin Qin
Journal:  Biomed Res Int       Date:  2013-08-26       Impact factor: 3.411

4.  Automatic lung nodule detection using multi-scale dot nodule-enhancement filter and weighted support vector machines in chest computed tomography.

Authors:  Yu Gu; Xiaoqi Lu; Baohua Zhang; Ying Zhao; Dahua Yu; Lixin Gao; Guimei Cui; Liang Wu; Tao Zhou
Journal:  PLoS One       Date:  2019-01-10       Impact factor: 3.240

5.  SVM recursive feature elimination analyses of structural brain MRI predicts near-term relapses in patients with clinically isolated syndromes suggestive of multiple sclerosis.

Authors:  Viktor Wottschel; Declan T Chard; Christian Enzinger; Massimo Filippi; Jette L Frederiksen; Claudio Gasperini; Antonio Giorgio; Maria A Rocca; Alex Rovira; Nicola De Stefano; Mar Tintoré; Daniel C Alexander; Frederik Barkhof; Olga Ciccarelli
Journal:  Neuroimage Clin       Date:  2019-10-22       Impact factor: 4.881

6.  A Spectral-Based Approach for BCG Signal Content Classification.

Authors:  Mohamed Chiheb Ben Nasr; Sofia Ben Jebara; Samuel Otis; Bessam Abdulrazak; Neila Mezghani
Journal:  Sensors (Basel)       Date:  2021-02-02       Impact factor: 3.576

7.  Image enhancement techniques on deep learning approaches for automated diagnosis of COVID-19 features using CXR images.

Authors:  Ajay Sharma; Pramod Kumar Mishra
Journal:  Multimed Tools Appl       Date:  2022-08-01       Impact factor: 2.577

8.  Comparing two machine learning approaches in predicting lupus hospitalization using longitudinal data.

Authors:  Yijun Zhao; Dylan Smith; April Jorge
Journal:  Sci Rep       Date:  2022-09-30       Impact factor: 4.996

9.  NMFBFS: A NMF-Based Feature Selection Method in Identifying Pivotal Clinical Symptoms of Hepatocellular Carcinoma.

Authors:  Zhiwei Ji; Guanmin Meng; Deshuang Huang; Xiaoqiang Yue; Bing Wang
Journal:  Comput Math Methods Med       Date:  2015-10-12       Impact factor: 2.238

Review 10.  Foundation and methodologies in computer-aided diagnosis systems for breast cancer detection.

Authors:  Afsaneh Jalalian; Syamsiah Mashohor; Rozi Mahmud; Babak Karasfi; M Iqbal B Saripan; Abdul Rahman B Ramli
Journal:  EXCLI J       Date:  2017-02-20       Impact factor: 4.068

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.