Literature DB >> 31403330

Clustering-based undersampling with random over sampling examples and support vector machine for imbalanced classification of breast cancer diagnosis.

Jue Zhang1,2, Li Chen1.   

Abstract

To overcome the two-class imbalanced classification problem existing in the diagnosis of breast cancer, a hybrid of Random Over Sampling Example, K-means and Support vector machine (RK-SVM) model is proposed which is based on sample selection. Random Over Sampling Example (ROSE) is utilized to balance the dataset and further improve the diagnosis accuracy by Support Vector Machine (SVM). As there is one different sample selection factor via clustering that encourages selecting the samples near the class boundary. The purpose of clustering here is to reduce the risk of removing useful samples and improve the efficiency of sample selection. To test the performance of the new hybrid classifier, it is implemented on breast cancer datasets and the other three datasets from the University of California Irvine (UCI) machine learning repository, which are commonly used datasets in class imbalanced learning. The extensive experimental results show that our proposed hybrid method outperforms most of the competitive algorithms in term of G-mean and accuracy indices. Additionally, experimental results show that this method also performs superiorly for binary problems.

Entities:  

Keywords:  Breast cancer diagnosis; class-imbalance problem; sample selection

Mesh:

Year:  2019        PMID: 31403330     DOI: 10.1080/24699322.2019.1649074

Source DB:  PubMed          Journal:  Comput Assist Surg (Abingdon)        ISSN: 2469-9322            Impact factor:   1.787


  4 in total

1.  A Clinical Decision Support System (CDSS) for Unbiased Prediction of Caesarean Section Based on Features Extraction and Optimized Classification.

Authors:  Ashir Javeed; Liaqat Ali; Abegaz Mohammed Seid; Arif Ali; Dilpazir Khan; Yakubu Imrana
Journal:  Comput Intell Neurosci       Date:  2022-06-06

2.  Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble.

Authors:  Yueling Xiong; Mingquan Ye; Changrong Wu
Journal:  Comput Math Methods Med       Date:  2021-04-24       Impact factor: 2.238

3.  Risk Prediction of Major Adverse Cardiovascular Events Occurrence Within 6 Months After Coronary Revascularization: Machine Learning Study.

Authors:  Jinwan Wang; Shuai Wang; Mark Xuefang Zhu; Tao Yang; Qingfeng Yin; Ya Hou
Journal:  JMIR Med Inform       Date:  2022-04-20

4.  Novel Insights on Establishing Machine Learning-Based Stroke Prediction Models Among Hypertensive Adults.

Authors:  Xiao Huang; Tianyu Cao; Liangziqian Chen; Junpei Li; Ziheng Tan; Benjamin Xu; Richard Xu; Yun Song; Ziyi Zhou; Zhuo Wang; Yaping Wei; Yan Zhang; Jianping Li; Yong Huo; Xianhui Qin; Yanqing Wu; Xiaobin Wang; Hong Wang; Xiaoshu Cheng; Xiping Xu; Lishun Liu
Journal:  Front Cardiovasc Med       Date:  2022-05-06
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.