Literature DB >> 33461685

EKNN: Ensemble classifier incorporating connectivity and density into kNN with application to cancer diagnosis.

Mohamed A Mahfouz1, Amin Shoukry2, Mohamed A Ismail3.   

Abstract

In the microarray-based approach for automated cancer diagnosis, the application of the traditional k-nearest neighbors kNN algorithm suffers from several difficulties such as the large number of genes (high dimensionality of the feature space) with many irrelevant genes (noise) relative to the small number of available samples and the imbalance in the size of the samples of the target classes. This research provides an ensemble classifier based on decision models derived from kNN that is applicable to problems characterized by imbalanced small size datasets. The proposed classification method is an ensemble of the traditional kNN algorithm and four novel classification models derived from it. The proposed models exploit the increase in density and connectivity using K1-nearest neighbors table (KNN-table) created during the training phase. In the density model, an unseen sample u is classified as belonging to a class t if it achieves the highest increase in density when this sample is added to it i.e. the unseen sample can replace more neighbors in the KNN-table for samples of class t than other classes. In the other three connectivity models, the mean and standard deviation of the distribution of the average, minimum as well the maximum distance to the K neighbors of the members of each class are computed in the training phase. The class t to which u achieves the highest possibility of belongness to its distribution is chosen, i.e. the addition of u to the samples of this class produces the least change to the distribution of the corresponding decision model for class t. Combining the predicted results of the four individual models along with traditional kNN makes the decision space more discriminative. With the help of the KNN-table which can be updated online in the training phase, an improved performance has been achieved compared to the traditional kNN algorithm with slight increase in classification time. The proposed ensemble method achieves significant increase in accuracy compared to the accuracy achieved using any of its base classifiers on Kentridge, GDS3257, Notterman, Leukemia and CNS datasets. The method is also compared to several existing ensemble methods and state of the art techniques using different dimensionality reduction techniques on several standard datasets. The results prove clear superiority of EKNN over several individual and ensemble classifiers regardless of the choice of the gene selection strategy.
Copyright © 2020 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Cancer diagnosis; Ensemble classification; Gene expression analysis; Nearest neighbors

Year:  2020        PMID: 33461685     DOI: 10.1016/j.artmed.2020.101985

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  3 in total

1.  GraphChrom: A Novel Graph-Based Framework for Cancer Classification Using Chromosomal Rearrangement Endpoints.

Authors:  Golrokh Mirzaei
Journal:  Cancers (Basel)       Date:  2022-06-22       Impact factor: 6.575

2.  Automatic COVID-19 detection mechanisms and approaches from medical images: a systematic review.

Authors:  Amir Masoud Rahmani; Elham Azhir; Morteza Naserbakht; Mokhtar Mohammadi; Adil Hussein Mohammed Aldalwie; Mohammed Kamal Majeed; Sarkhel H Taher Karim; Mehdi Hosseinzadeh
Journal:  Multimed Tools Appl       Date:  2022-03-31       Impact factor: 2.577

3.  An Ensemble-Based Deep Convolutional Neural Network for Computer-Aided Polyps Identification From Colonoscopy.

Authors:  Pallabi Sharma; Bunil Kumar Balabantaray; Kangkana Bora; Saurav Mallik; Kunio Kasugai; Zhongming Zhao
Journal:  Front Genet       Date:  2022-04-26       Impact factor: 4.772

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.