Literature DB >> 31331881

Multiset Feature Learning for Highly Imbalanced Data Classification.

Xiao-Yuan Jing, Xinyu Zhang, Xiaoke Zhu, Fei Wu, Xinge You, Yang Gao, Shiguang Shan, Jing-Yu Yang.   

Abstract

With the expansion of data, increasing imbalanced data has emerged. When the imbalance ratio (IR) of data is high, most existing imbalanced learning methods decline seriously in classification performance. In this paper, we systematically investigate the highly imbalanced data classification problem, and propose an uncorrelated cost-sensitive multiset learning (UCML) approach for it. Specifically, UCML first constructs multiple balanced subsets through random partition, and then employs the multiset feature learning (MFL) to learn discriminant features from the constructed multiset. To enhance the usability of each subset and deal with the non-linearity issue existed in each subset, we further propose a deep metric based UCML (DM-UCML) approach. DM-UCML introduces the generative adversarial network technique into the multiset constructing process, such that each subset can own similar distribution with the original dataset. To cope with the non-linearity issue, DM-UCML integrates deep metric learning with MFL, such that more favorable performance can be achieved. In addition, DM-UCML designs a new discriminant term to enhance the discriminability of learned metrics. Experiments on eight traditional highly class-imbalanced datasets and two large-scale datasets indicate that: the proposed approaches outperform state-of-the-art highly imbalanced learning methods and are more robust to high IR.

Year:  2019        PMID: 31331881     DOI: 10.1109/TPAMI.2019.2929166

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  5 in total

1.  Deep learning and computer vision will transform entomology.

Authors:  Toke T Høye; Johanna Ärje; Kim Bjerge; Oskar L P Hansen; Alexandros Iosifidis; Florian Leese; Hjalte M R Mann; Kristian Meissner; Claus Melvad; Jenni Raitoharju
Journal:  Proc Natl Acad Sci U S A       Date:  2021-01-12       Impact factor: 11.205

2.  SMOTE-DRNN: A Deep Learning Algorithm for Botnet Detection in the Internet-of-Things Networks.

Authors:  Segun I Popoola; Bamidele Adebisi; Ruth Ande; Mohammad Hammoudeh; Kelvin Anoh; Aderemi A Atayero
Journal:  Sensors (Basel)       Date:  2021-04-24       Impact factor: 3.576

3.  Predictors of Maternal Death Among Women With Pulmonary Hypertension in China From 2012 to 2020: A Retrospective Single-Center Study.

Authors:  Ling-Ling Dai; Tian-Ci Jiang; Peng-Fei Li; Hua Shao; Xi Wang; Yu Wang; Liu-Qun Jia; Meng Liu; Lin An; Xiao-Gang Jing; Zhe Cheng
Journal:  Front Cardiovasc Med       Date:  2022-04-18

4.  A novel method for credit scoring based on feature transformation and ensemble model.

Authors:  Hongxiang Li; Ao Feng; Bin Lin; Houcheng Su; Zixi Liu; Xuliang Duan; Haibo Pu; Yifei Wang
Journal:  PeerJ Comput Sci       Date:  2021-06-04

5.  In silico prediction of chemical-induced hematotoxicity with machine learning and deep learning methods.

Authors:  Yuqing Hua; Yinping Shi; Xueyan Cui; Xiao Li
Journal:  Mol Divers       Date:  2021-07-01       Impact factor: 2.943

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.