
Transfer Learning for Class Imbalance Problems with Inadequate Data.

Samir Al-Stouhi, Chandan K Reddy.

Abstract

A fundamental problem in data mining is building robust classifiers in the presence of skewed data distributions. Class imbalance classifiers are trained specifically for datasets with skewed distributions. Existing methods assume an ample supply of training examples as a prerequisite for constructing an effective classifier. When sufficient data is not readily available, however, developing a representative classification algorithm becomes even more difficult because of the unequal distribution between classes. We provide a unified framework that can take advantage of auxiliary data through a transfer learning mechanism and simultaneously build a robust classifier that tackles the imbalance issue when few training samples are available in the target domain of interest. Transfer learning methods use auxiliary data to augment learning when training examples are insufficient; in this paper we develop a method optimized to simultaneously augment the training data and induce balance in skewed datasets. We propose a novel boosting-based instance-transfer classifier with a label-dependent update mechanism that simultaneously compensates for class imbalance and incorporates samples from an auxiliary domain to improve classification. We provide theoretical and empirical validation of our method and apply it to healthcare and text classification applications.
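
The abstract does not give the update rule, so the following is only a rough sketch of what one round of boosting-based instance transfer with a label-dependent weight update might look like. It follows the well-known TrAdaBoost pattern (a Weighted Majority style shrink for misclassified auxiliary instances, an AdaBoost style boost for misclassified target instances) and adds a per-class multiplier on the auxiliary weights. The function name `transfer_boost_round`, the `class_factor` mapping, and the choice of multiplier are all assumptions made for illustration; they are not the authors' algorithm.

```python
import math


def transfer_boost_round(w_src, y_src, miss_src, w_tgt, miss_tgt,
                         n_rounds, class_factor):
    """One illustrative weight update (hypothetical helper, not from the paper).

    w_src, w_tgt  : current weights of auxiliary (source) and target instances
    y_src         : class labels of the auxiliary instances
    miss_*        : 1 if the weak learner misclassified the instance, else 0
    n_rounds      : total number of boosting rounds planned
    class_factor  : assumed per-label multiplier, e.g. > 1 for the rare class
    """
    # Weighted error of the weak learner, measured on target data only.
    eps = sum(w * m for w, m in zip(w_tgt, miss_tgt)) / sum(w_tgt)
    eps = min(max(eps, 1e-10), 0.499)  # keep beta_tgt finite and below 1

    # AdaBoost-style factor for target instances.
    beta_tgt = eps / (1.0 - eps)
    # Weighted Majority style factor for auxiliary instances (as in TrAdaBoost).
    beta_src = 1.0 / (1.0 + math.sqrt(2.0 * math.log(len(w_src)) / n_rounds))

    # Auxiliary instances: shrink when misclassified, then apply the
    # assumed label-dependent multiplier.
    new_src = [w * (beta_src ** m) * class_factor[y]
               for w, y, m in zip(w_src, y_src, miss_src)]
    # Target instances: boost when misclassified.
    new_tgt = [w * (beta_tgt ** (-m)) for w, m in zip(w_tgt, miss_tgt)]

    # Renormalize so that all weights sum to 1.
    total = sum(new_src) + sum(new_tgt)
    return [w / total for w in new_src], [w / total for w in new_tgt]


# Toy example: four auxiliary and four target instances, label 1 is the rare class.
src, tgt = transfer_boost_round(
    w_src=[0.125] * 4, y_src=[1, 1, 0, 0], miss_src=[1, 0, 1, 0],
    w_tgt=[0.125] * 4, miss_tgt=[1, 0, 0, 0],
    n_rounds=10, class_factor={0: 1.0, 1: 1.5},
)
```

In this toy run, misclassified auxiliary instances lose weight relative to correctly classified ones of the same label, the misclassified target instance gains weight, and rare-class auxiliary instances are favored over majority-class ones. A constant per-class multiplier is only one possible way to make the update label-dependent.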


Keywords:  AdaBoost; Class imbalance; HealthCare informatics; Rare class; Text mining; Transfer learning; Weighted Majority Algorithm

Year:  2015        PMID: 27378821      PMCID: PMC4929860          DOI: 10.1007/s10115-015-0870-3

Source DB:  PubMed          Journal:  Knowl Inf Syst        ISSN: 0219-3116            Impact factor:   2.822


