| Literature DB >> 35401789 |
Walid Theib Mohammad1, Ronza Teete2, Heyam Al-Aaraj3, Yousef Saleh Yousef Rubbai1, Majd Mowafaq Arabyat4.
Abstract
Breast cancer must be addressed by a multidisciplinary team aiming at the patient's comprehensive treatment. Recent advances in science make it possible to evaluate tumor staging and point out the specific treatment. However, these advances must be combined with the availability of resources and the easy operability of the technique. This study is aimed at distinguishing and classifying benign and malignant cells, which are tumor types, from the data on the Wisconsin Diagnostic Breast Cancer (WDBC) dataset by applying data mining classification and clustering techniques with the help of the Weka tool. In addition, various algorithms and techniques used in data mining were measured with success percentages, and the most successful ones on the dataset were determined and compared with each other.Entities:
Year: 2022 PMID: 35401789 PMCID: PMC8993572 DOI: 10.1155/2022/6187275
Source DB: PubMed Journal: Appl Bionics Biomech ISSN: 1176-2322 Impact factor: 1.781
Figure 1Image of a benign tumor cell.
Figure 2Image of a malignant tumor cell.
Characteristics of the samples.
| (i) Radius: radius of all cells are shown by the mean, standard deviation, and worst value |
| (ii) Texture: the mean, standard deviation, and worst value of the grayscale change rates of interior surfaces are shown in the table below. |
| (iii) Perimeter: the perimeters of the cells were measured for the mean, standard deviation, and worst value |
| (iv) Area: the mean, standard deviation, and worst-case value of the surface areas of the cells are all calculated and displayed |
| (v) SVMothness: The average, standard deviation, and worst value of the radius lengths of neighbouring cells are all displayed in the graph |
| (vii) Concavity: the mean, standard deviation, and worst value of the indentations and protrusions around the cell are all displayed on this graph |
| (viii) Concave points: the mean, standard deviation, and worst value for the number of indentation and protrusion sites around the cell are all calculated using this data |
| (ix) Symmetry: the mean, standard deviation, and worst value of the change in ellipse shape of cells were calculated |
| (x) Fractal dimension: there are three values for this ratio: the mean, standard deviation, and worst value. There are three values for this ratio. |
Figure 3Data of the Naive Bayes technique.
The complexity matrix of the Naive Bayes technique.
| Estimated value | |||
|---|---|---|---|
| Malignant | Benign | ||
| Actual value | Malignant | 191 | 21 |
| Benign | 16 | 341 | |
Figure 4Accuracy and sensitivity information of the Naive Bayes technique.
Complexity matrix of the multilayer perceptron technique.
| Estimated value | |||
|---|---|---|---|
| Malignant | Benign | ||
| Actual value | Malignant | 200 | 10 |
| Benign | 9 | 342 | |
Figure 5Success percentages of the multilayer perceptron technique.
Figure 6Accuracy and sensitivity information of the multilayer perceptron technique.
Complexity matrix of the SVM technique.
| Estimated value | |||
|---|---|---|---|
| Malignant | Benign | ||
| Actual value | Malignant | 200 | 10 |
| Benign | 3 | 356 | |
Figure 7Success percentages of the SVM technique.
Figure 8Accuracy and sensitivity information of the SVM technique.
Complexity matrix of the J48 technique.
| Estimated value | |||
|---|---|---|---|
| Malignant | Benign | ||
| Actual value | Malignant | 195 | 17 |
| Benign | 24 | 333 | |
Figure 9Success percentages of the J48 technique.
Figure 10Accuracy and sensitivity information of the J48 technique.
Complexity matrix of the KNN technique.
| Estimated value | |||
|---|---|---|---|
| Malignant | Benign | ||
| Actual value | Malignant | 199 | 13 |
| Benign | 12 | 345 | |
Figure 11Percentages of success of the KNN technique.
Figure 12Accuracy and sensitivity information of the KNN technique.
Complexity matrix of the K-means technique.
| Estimated value | |||
|---|---|---|---|
| Malignant | Benign | ||
| Actual value | Malignant | 179 | 31 |
| Benign | 10 | 349 | |
Figure 13Success percentages of the K-means technique.
Complexity matrix of the hierarchical clustering technique.
| Estimated value | |||
|---|---|---|---|
| Malignant | Benign | ||
| Actual value | Malignant | 2 | 211 |
| Benign | 0 | 356 | |
Figure 14Success percentages of the hierarchical clustering technique.
Figure 15Comparison of classifiers.