Analysis of k-means clustering approach on the breast cancer Wisconsin dataset.

Ashutosh Kumar Dubey¹, Umesh Gupta², Sonal Jain².

Abstract

PURPOSE: Breast cancer is one of the most common cancers worldwide and is found most frequently in women. Early detection of breast cancer improves the possibility of a cure; therefore, many studies are under way to identify methods that can detect breast cancer in its early stages. This study aimed to determine the effects of the k-means clustering algorithm under different computational measures (centroid, distance, split method, epoch, attribute, and iteration) and to identify the combination of measures with the potential for highly accurate clustering.
METHODS: The k-means algorithm was used to evaluate the impact of centroid initialization, distance measures, and split methods on clustering. The experiments were performed using the breast cancer Wisconsin (BCW) diagnostic dataset. Foggy and random centroids were used for centroid initialization. For the foggy centroid, the first centroid was calculated from random values. For the random centroid, the initial centroid was taken as (0, 0).
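The clustering procedure named in METHODS can be sketched as follows. This is a minimal illustration, not the authors' implementation. The abstract does not fully define the foggy and random initializations or the split methods, so the sketch assumes that the initial centroids are data points drawn at random, uses Euclidean distance, and stops either after a fixed number of epochs or when the centroids stop changing. The names `kmeans` and `euclidean` and the toy data are hypothetical.

```python
import math
import random


def euclidean(a, b):
    """Euclidean distance between two equal-length points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def kmeans(points, k, max_epochs=10, seed=0):
    """Plain k-means (Lloyd's algorithm) on a list of tuples.

    Assumption: the initial centroids are k data points chosen at random;
    the paper's foggy/random schemes are not specified in the abstract.
    """
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(max_epochs):  # fixed-epoch stopping rule
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda j: euclidean(p, centroids[j]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centroids.append(
                    tuple(sum(col) / len(members) for col in zip(*members))
                )
            else:
                # An empty cluster keeps its previous centroid.
                new_centroids.append(centroids[j])
        if new_centroids == centroids:  # unchanged-centroid stopping rule
            break
        centroids = new_centroids
    return centroids, labels


# Toy data (not from the BCW dataset): two well-separated groups.
data = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (8.0, 8.0), (8.2, 7.9), (7.8, 8.1)]
centroids, labels = kmeans(data, k=2)
```

Swapping `euclidean` for another distance function is the only change needed to compare distance measures in this sketch.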
RESULTS: The results were obtained with the k-means algorithm and are discussed for different cases with variable parameters. The calculations were based on the centroid (foggy/random), distance (Euclidean/Manhattan/Pearson), split (simple/variance), threshold (constant epoch/same centroid), attribute (2-9), and iteration (4-10). An average positive prediction accuracy of approximately 92% was obtained with this approach. Better results were found for the same centroid and the highest variance. The results achieved using the Euclidean and Manhattan distances were better than those achieved using the Pearson correlation.
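The three distance measures compared in RESULTS have standard definitions, sketched below for two attribute vectors. The Pearson variant is written here as one minus the correlation coefficient, a common convention that the abstract does not confirm. The last function assumes that "positive prediction accuracy" means positive predictive value, TP / (TP + FP), which is also an assumption. All function names and the sample vectors are hypothetical.

```python
import math


def euclidean(a, b):
    """Square root of the summed squared attribute differences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Sum of the absolute attribute differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


def pearson_distance(a, b):
    """1 - Pearson r: near 0 for strongly correlated vectors, up to 2 for
    anti-correlated ones. Assumes neither vector is constant."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return 1.0 - cov / math.sqrt(var_a * var_b)


def positive_predictive_value(tp, fp):
    """Share of predicted positives that are true positives."""
    return tp / (tp + fp)


# Illustrative vectors only; these are not BCW attribute values.
u = (1.0, 2.0, 3.0)
v = (2.0, 4.0, 7.0)
d_euc = euclidean(u, v)
d_man = manhattan(u, v)
d_cor = pearson_distance(u, v)
```

Unlike the Euclidean and Manhattan distances, the correlation-based distance ignores differences in scale and offset between the two vectors, which is one reason it can group samples differently.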
CONCLUSIONS: The findings of this work provide an extensive understanding of the computational parameters that can be used with k-means. The results indicate that k-means has the potential to classify the BCW dataset.

Keywords: Breast cancer; Breast cancer Wisconsin (BCW) diagnostic dataset; Foggy and random centroid; K-means

Year: 2016 PMID: 27311823 DOI: 10.1007/s11548-016-1437-9

Source DB: PubMed Journal: Int J Comput Assist Radiol Surg ISSN: 1861-6410 Impact factor: 2.924

5 in total

1. A biased random-key genetic algorithm for data clustering.

Authors: P Festa
Journal: Math Biosci Date: 2013-07-26 Impact factor: 2.144

2. Breast cancer patient stratification using a molecular regularized consensus clustering method.

Authors: Chao Wang; Raghu Machiraju; Kun Huang
Journal: Methods Date: 2014-03-18 Impact factor: 3.608

3. Breast cancer statistics and prediction methodology: a systematic review and analysis. (Review)

Authors: Ashutosh Kumar Dubey; Umesh Gupta; Sonal Jain
Journal: Asian Pac J Cancer Prev Date: 2015

4. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012.

Authors: Jacques Ferlay; Isabelle Soerjomataram; Rajesh Dikshit; Sultan Eser; Colin Mathers; Marise Rebelo; Donald Maxwell Parkin; David Forman; Freddie Bray
Journal: Int J Cancer Date: 2014-10-09 Impact factor: 7.396

5. A novel hierarchical clustering algorithm for gene sequences.

Authors: Dan Wei; Qingshan Jiang; Yanjie Wei; Shengrui Wang
Journal: BMC Bioinformatics Date: 2012-07-23 Impact factor: 3.169

8 in total

1. THALIS: Human-Machine Analysis of Longitudinal Symptoms in Cancer Therapy.

Authors: Carla Floricel; Nafiul Nipu; Mikayla Biggs; Andrew Wentzel; Guadalupe Canahuate; Lisanne Van Dijk; Abdallah Mohamed; C David Fuller; G Elisabeta Marai
Journal: IEEE Trans Vis Comput Graph Date: 2021-12-24 Impact factor: 4.579

2. Medulla oblongata volume as a promising predictor of survival in amyotrophic lateral sclerosis.

Authors: Giammarco Milella; Alessandro Introna; Alma Ghirelli; Domenico Maria Mezzapesa; Ucci Maria; Eustachio D'Errico; Angela Fraddosio; Isabella Laura Simone
Journal: Neuroimage Clin Date: 2022-04-22 Impact factor: 4.891

3. An Alignment-Free Algorithm in Comparing the Similarity of Protein Sequences Based on Pseudo-Markov Transition Probabilities among Amino Acids.

Authors: Yushuang Li; Tian Song; Jiasheng Yang; Yi Zhang; Jialiang Yang
Journal: PLoS One Date: 2016-12-05 Impact factor: 3.240

4. BCD-WERT: a novel approach for breast cancer detection using whale optimization based efficient features and extremely randomized tree algorithm.

Authors: Shafaq Abbas; Zunera Jalil; Abdul Rehman Javed; Iqra Batool; Mohammad Zubair Khan; Abdulfattah Noorwali; Thippa Reddy Gadekallu; Aqsa Akbar
Journal: PeerJ Comput Sci Date: 2021-03-12

5. Diagnosis of Breast Cancer Pathology on the Wisconsin Dataset with the Help of Data Mining Classification and Clustering Techniques.

Authors: Walid Theib Mohammad; Ronza Teete; Heyam Al-Aaraj; Yousef Saleh Yousef Rubbai; Majd Mowafaq Arabyat
Journal: Appl Bionics Biomech Date: 2022-04-01 Impact factor: 1.781

6. Diagnostic Strategies for Breast Cancer Detection: From Image Generation to Classification Strategies Using Artificial Intelligence Algorithms. (Review)

Authors: Jesus A Basurto-Hurtado; Irving A Cruz-Albarran; Manuel Toledano-Ayala; Mario Alberto Ibarra-Manzano; Luis A Morales-Hernandez; Carlos A Perez-Ramirez
Journal: Cancers (Basel) Date: 2022-07-15 Impact factor: 6.575

7. Unsupervised Hierarchical Classification Approach for Imprecise Data in the Breast Cancer Detection.

Authors: Mario Fordellone; Paolo Chiodini
Journal: Entropy (Basel) Date: 2022-07-03 Impact factor: 2.738

8. Predicting Breast Cancer Leveraging Supervised Machine Learning Techniques.

Authors: Sanam Aamir; Aqsa Rahim; Zain Aamir; Saadullah Farooq Abbasi; Muhammad Shahbaz Khan; Majed Alhaisoni; Muhammad Attique Khan; Khyber Khan; Jawad Ahmad
Journal: Comput Math Methods Med Date: 2022-08-16 Impact factor: 2.809
