| Literature DB >> 31182028 |
Hui Huang1, Xi'an Feng1, Suying Zhou2, Jionghui Jiang3, Huiling Chen4, Yuping Li5, Chengye Li6.
Abstract
BACKGROUND: It is of great clinical significance to develop an accurate computer aided system to accurately diagnose the breast cancer. In this study, an enhanced machine learning framework is established to diagnose the breast cancer. The core of this framework is to adopt fruit fly optimization algorithm (FOA) enhanced by Levy flight (LF) strategy (LFOA) to optimize two key parameters of support vector machine (SVM) and build LFOA-based SVM (LFOA-SVM) for diagnosing the breast cancer. The high-level features abstracted from the volunteers are utilized to diagnose the breast cancer for the first time.Entities:
Keywords: Breast cancer diagnosis; Fruit fly optimization; Levy flight; Parameter optimization; Support vector machine
Mesh:
Year: 2019 PMID: 31182028 PMCID: PMC6557762 DOI: 10.1186/s12859-019-2771-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Flowchart of LFOA-SVM
The brief descriptions and quantization of features used in this study
| Feature | Brief description | Quantization |
|---|---|---|
| X1 | Age | <=40: 1; > 40: 2 |
| X2 | Locality | unilateral: 1; bilateral: 2 |
| X3 | Boundary | clear: 1; not clear: 2; obscure: 3 |
| X4 | Calcification | no: 1; yes: 2 |
| X5 | Cystic | no: 1; yes: 2 |
| X6 | Nucleus/cytoplasm ratio(N/C) | low: 1; medium: 2; high: 3 |
| X7 | Nucleolus size | small: 1; big: 2 |
| X8 | Nucleolus number | Single: 1; Multi: 2 |
| X9 | Cell density | low: 1; medium: 2; high: 3 |
| X10 | Presence of fiber focal | no: 1:; yes: 2 |
| X11 | Inflammatory cell infiltration | few: 1; more: 2; many: 3 |
| X12 | Infiltration of the gland | no: 1; yes: 2 |
| X13 | Perineural invasion | no: 1; yes: 2 |
| X14 | Glands clearance around | no: 1; yes: 2 |
The parameter settings for the relevant methods
| Algorithm | Parameter | Value |
|---|---|---|
| LFOA |
| 20,20 |
|
| 10,10 | |
| FOA |
| 20,20 |
|
| 10,10 | |
| GA |
| 0.4 |
|
| 0.01 | |
| PSO |
| 2 |
|
| 1 | |
|
| 6 | |
| MFO |
| |
|
| 1 | |
| BA (bat algorithm) |
| |
|
| 0.5 | |
|
| 0.5 | |
| DA (dragon fly algorithm) |
| |
|
| 0.1 | |
|
| 0.1 | |
|
| 0.7 | |
|
| 1 | |
|
| 1 | |
| FPA (flower pollination algorithm) |
| 0.8 |
|
| 1.5 | |
| SCA (sine cosine algorithm) |
| 2 |
| RF |
| 500 |
|
| 3 | |
| BPNN |
| 8 |
|
| Sigmoid | |
|
| tranlim | |
| ELM |
| 50 |
|
| Sigmoid |
Unimodal benchmark functions
| Function | Dim | Range |
|
|---|---|---|---|
|
| 30 | [− 100, 100] | 0 |
|
| 30 | [− 10, 10] | 0 |
|
| 30 | [− 100, 100] | 0 |
| 30 | [− 100, 100] | 0 | |
|
| 30 | [− 30, 30] | 0 |
|
| 30 | [− 100, 100] | 0 |
|
| 30 | [− 1.28, 1.28] | 0 |
Multimodal benchmark functions
| Function | Dim | Range |
|
|---|---|---|---|
|
| 30 | [− 500,500] | −418.9829 × 5 |
|
| 30 | [− 5.12,5.12] | 0 |
|
| 30 | [− 32,32] | 0 |
|
| 30 | [− 600,600] | 0 |
|
| 30 | [− 50,50] | 0 |
|
| 30 | [− 50,50] | 0 |
Fixed-dimension multimodal benchmark functions
| Function | Dim | Range |
|
|---|---|---|---|
|
| 2 | [− 65,65] | 1 |
|
| 4 | [− 5, 5] | 0.00030 |
|
| 2 | [− 5,5] | −1.0316 |
|
| 2 | [− 5,5] | 0.398 |
|
| 2 | [− 2,2] | 3 |
|
| 3 | [1, 3] | −3.86 |
|
| 6 | [0,1] | −3.32 |
|
| 4 | [0,10] | −10.1532 |
|
| 4 | [0,10] | −10.4028 |
|
| 4 | [0,10] | −10.5363 |
Results of testing benchmark functions
| Function | Metric | LFOA | FOA | MFO | BA | DA | FPA | PSO | SCA |
|---|---|---|---|---|---|---|---|---|---|
|
| mean | 1.53E-09 | 9.27E-07 | 1.00E+ 03 | 1.53E+ 01 | 1.36E+ 03 | 1.71E+ 03 | 1.39E+ 02 | 7.65E+ 00 |
| std | 3.71E-09 | 7.92E-09 | 3.05E+ 03 | 2.32E+ 00 | 1.11E+ 03 | 4.11E+ 02 | 1.77E+ 01 | 1.07E+ 01 | |
| rank | 1 | 2 | 6 | 4 | 7 | 8 | 5 | 3 | |
|
| mean | 2.20E-04 | 5.26E-03 | 3.51E+ 01 | 1.84E+ 01 | 2.20E+ 01 | 4.90E+ 01 | 7.48E+ 01 | 1.33E-02 |
| std | 2.25E-04 | 3.13E-05 | 2.09E+ 01 | 1.99E+ 00 | 1.69E+ 01 | 1.20E+ 01 | 1.43E+ 01 | 1.47E-02 | |
| rank | 1 | 2 | 6 | 4 | 5 | 7 | 8 | 3 | |
|
| mean | 1.85E-07 | 2.91E-04 | 2.12E+ 04 | 9.62E+ 01 | 1.51E+ 04 | 2.59E+ 03 | 5.14E+ 02 | 7.18E+ 03 |
| std | 3.20E-07 | 2.66E-06 | 1.25E+ 04 | 2.99E+ 01 | 1.13E+ 04 | 8.39E+ 02 | 1.13E+ 02 | 4.60E+ 03 | |
| rank | 1 | 2 | 8 | 3 | 7 | 5 | 4 | 6 | |
|
| mean | 4.12E-06 | 1.76E-04 | 5.95E+ 01 | 2.25E+ 00 | 2.38E+ 01 | 2.91E+ 01 | 4.69E+ 00 | 3.36E+ 01 |
| std | 4.15E-06 | 7.28E-07 | 1.27E+ 01 | 4.90E-01 | 7.36E+ 00 | 3.35E+ 00 | 2.77E-01 | 1.05E+ 01 | |
| rank | 1 | 2 | 8 | 3 | 5 | 6 | 4 | 7 | |
|
| mean | 8.32E+ 00 | 2.87E+ 01 | 5.35E+ 06 | 4.07E+ 03 | 5.02E+ 05 | 2.44E+ 05 | 1.64E+ 05 | 1.84E+ 05 |
| std | 6.92E+ 00 | 1.10E-04 | 2.92E+ 07 | 1.10E+ 03 | 7.48E+ 05 | 9.91E+ 04 | 3.81E+ 04 | 8.00E+ 05 | |
| rank | 1 | 2 | 8 | 3 | 7 | 6 | 4 | 5 | |
|
| mean | 4.47E-03 | 7.51E+ 00 | 1.34E+ 03 | 1.57E+ 01 | 1.47E+ 03 | 1.64E+ 03 | 1.36E+ 02 | 2.00E+ 01 |
| std | 3.29E-03 | 2.67E-05 | 3.46E+ 03 | 1.99E+ 00 | 9.07E+ 02 | 4.54E+ 02 | 1.62E+ 01 | 3.46E+ 01 | |
| rank | 1 | 2 | 6 | 3 | 7 | 8 | 5 | 4 | |
|
| mean | 2.41E-04 | 2.16E-04 | 2.23E+ 00 | 1.01E+ 01 | 4.73E-01 | 4.75E-01 | 9.93E+ 01 | 6.57E-02 |
| std | 2.33E-04 | 1.16E-04 | 4.13E+ 00 | 6.23E+ 00 | 3.03E-01 | 1.58E-01 | 2.44E+ 01 | 4.57E-02 | |
| rank | 2 | 1 | 6 | 7 | 4 | 5 | 8 | 3 | |
|
| mean | −1.00E+ 03 | −9.36E+ 01 | −8.83E+ 03 | −7.41E+ 03 | − 5.19E+ 03 | −7.63E+ 03 | −6.89E+ 03 | −3.78E+ 03 |
| std | 7.16E+ 02 | 4.62E+ 01 | 7.31E+ 02 | 8.71E+ 02 | 5.72E+ 02 | 1.67E+ 02 | 8.35E+ 02 | 2.90E+ 02 | |
| rank | 7 | 8 | 1 | 3 | 5 | 2 | 4 | 6 | |
|
| mean | 3.83E-07 | 1.84E-04 | 1.59E+ 02 | 2.65E+ 02 | 1.53E+ 02 | 1.39E+ 02 | 3.71E+ 02 | 3.65E+ 01 |
| mean | 7.64E-07 | 1.86E-06 | 4.23E+ 01 | 1.96E+ 01 | 6.13E+ 01 | 2.07E+ 01 | 2.43E+ 01 | 3.09E+ 01 | |
| rank | 1 | 2 | 6 | 7 | 5 | 4 | 8 | 3 | |
|
| mean | 1.65E-05 | 7.04E-04 | 1.48E+ 01 | 4.94E+ 00 | 9.29E+ 00 | 1.44E+ 01 | 8.52E+ 00 | 1.20E+ 01 |
| std | 2.21E-05 | 4.06E-06 | 7.11E+ 00 | 2.71E+ 00 | 1.86E+ 00 | 1.12E+ 00 | 4.74E-01 | 9.24E+ 00 | |
| rank | 1 | 2 | 8 | 3 | 5 | 7 | 4 | 6 | |
|
| mean | 1.99E-10 | 6.19E-08 | 6.89E+ 00 | 5.98E-01 | 1.71E+ 01 | 1.53E+ 01 | 1.03E+ 00 | 8.30E-01 |
| std | 5.63E-10 | 8.26E-10 | 2.28E+ 01 | 5.98E-02 | 1.46E+ 01 | 3.27E+ 00 | 5.56E-03 | 4.20E-01 | |
| rank | 1 | 2 | 6 | 3 | 8 | 7 | 5 | 4 | |
|
| mean | 4.13E-04 | 1.67E+ 00 | 4.65E+ 00 | 1.36E+ 01 | 2.00E+ 03 | 1.71E+ 02 | 5.22E+ 00 | 3.14E+ 02 |
| std | 2.30E-04 | 5.08E-06 | 3.79E+ 00 | 6.00E+ 00 | 8.44E+ 03 | 5.53E+ 02 | 8.48E-01 | 1.06E+ 03 | |
| rank | 1 | 2 | 3 | 5 | 8 | 6 | 4 | 7 | |
|
| mean | 6.07E-03 | 6.18E-01 | 1.37E+ 07 | 2.47E+ 00 | 1.04E+ 05 | 9.76E+ 04 | 2.61E+ 01 | 4.49E+ 04 |
| std | 3.92E-03 | 9.19E-02 | 7.49E+ 07 | 3.21E-01 | 2.72E+ 05 | 1.03E+ 05 | 4.32E+ 00 | 1.55E+ 05 | |
| rank | 1 | 2 | 8 | 3 | 7 | 6 | 4 | 5 | |
|
| mean | 1.25E+ 01 | 1.27E+ 01 | 1.53E+ 00 | 4.84E+ 00 | 1.69E+ 00 | 9.98E-01 | 2.71E+ 00 | 1.53E+ 00 |
| std | 9.16E-01 | 3.27E-15 | 1.31E+ 00 | 4.74E+ 00 | 1.01E+ 00 | 2.63E-04 | 2.02E+ 00 | 8.92E-01 | |
| rank | 6 | 7 | 2 | 5 | 3 | 1 | 4 | 2 | |
|
| mean | 4.82E-04 | 8.04E-04 | 9.76E-04 | 5.15E-03 | 9.75E-03 | 7.60E-04 | 1.37E-03 | 1.01E-03 |
| std | 1.81E-04 | 2.93E-04 | 3.11E-04 | 7.79E-03 | 1.34E-02 | 1.05E-04 | 3.33E-04 | 3.58E-04 | |
| rank | 1 | 3 | 4 | 7 | 8 | 2 | 6 | 5 | |
|
| mean | −5.77E-01 | −1.68E-01 | −1.03E+ 00 | −1.03E+ 00 | −1.03E+ 00 | −1.03E+ 00 | −1.03E+ 00 | −1.03E+ 00 |
| std | 2.82E-01 | 1.32E-01 | 6.78E-16 | 5.25E-04 | 2.03E-09 | 1.69E-08 | 1.82E-03 | 3.43E-05 | |
| rank | 2 | 3 | 1 | 1 | 1 | 1 | 1 | 1 | |
|
| mean | 5.33E-01 | 1.98E+ 00 | 3.98E-01 | 3.98E-01 | 3.98E-01 | 3.98E-01 | 3.99E-01 | 3.99E-01 |
| std | 2.75E-01 | 1.01E+ 00 | 0.00E+ 00 | 5.57E-04 | 1.00E-13 | 6.22E-09 | 1.13E-03 | 1.15E-03 | |
| rank | 3 | 4 | 1 | 1 | 1 | 1 | 2 | 2 | |
|
| mean | 3.04E+ 01 | 6.00E+ 02 | 3.00E+ 00 | 3.06E+ 00 | 3.00E+ 00 | 3.00E+ 00 | 3.10E+ 00 | 3.00E+ 00 |
| std | 3.42E+ 00 | 1.73E-03 | 1.70E-15 | 5.86E-02 | 1.89E-14 | 7.88E-07 | 1.29E-01 | 5.38E-05 | |
| rank | 4 | 5 | 1 | 2 | 1 | 1 | 3 | 1 | |
|
| mean | −3.71E+ 00 | −3.63E+ 00 | −3.86E+ 00 | −3.84E+ 00 | −3.85E+ 00 | −3.86E+ 00 | −3.85E+ 00 | −3.85E+ 00 |
| std | 1.20E-01 | 2.23E-01 | 2.71E-15 | 1.50E-02 | 6.36E-02 | 1.09E-06 | 1.37E-02 | 1.96E-03 | |
| rank | 4 | 5 | 1 | 3 | 2 | 1 | 2 | 2 | |
|
| mean | −2.15E+ 00 | −1.81E+ 00 | −3.23E+ 00 | −2.84E+ 00 | − 3.23E+ 00 | − 3.32E+ 00 | −2.86E+ 00 | −2.95E+ 00 |
| std | 5.03E-01 | 4.24E-01 | 5.40E-02 | 1.29E-01 | 1.22E-01 | 4.20E-03 | 1.74E-01 | 3.53E-01 | |
| rank | 6 | 7 | 2 | 5 | 2 | 1 | 4 | 3 | |
|
| mean | −5.54E+ 00 | −3.47E+ 00 | −6.55E+ 00 | −4.60E+ 00 | −9.31E+ 00 | −1.01E+ 01 | −3.88E+ 00 | − 3.29E+ 00 |
| std | 9.25E-01 | 7.98E-01 | 3.33E+ 00 | 2.32E+ 00 | 1.92E+ 00 | 4.78E-02 | 1.14E+ 00 | 1.82E+ 00 | |
| rank | 4 | 7 | 3 | 5 | 2 | 1 | 6 | 8 | |
|
| mean | −6.16E+ 00 | −3.42E+ 00 | −7.45E+ 00 | −5.51E+ 00 | −9.52E+ 00 | −1.03E+ 01 | −4.25E+ 00 | −4.00E+ 00 |
| std | 1.39E+ 00 | 8.57E-01 | 3.49E+ 00 | 2.86E+ 00 | 2.00E+ 00 | 1.06E-01 | 1.26E+ 00 | 1.68E+ 00 | |
| rank | 4 | 8 | 3 | 5 | 2 | 1 | 6 | 7 | |
|
| mean | −5.55E+ 00 | −3.51E+ 00 | −8.54E+ 00 | −6.10E+ 00 | −9.64E+ 00 | −1.04E+ 01 | −4.02E+ 00 | − 4.20E+ 00 |
| std | 8.64E-01 | 7.50E-01 | 3.41E+ 00 | 3.15E+ 00 | 2.04E+ 00 | 1.28E-01 | 1.27E+ 00 | 1.24E+ 00 | |
| rank | 5 | 8 | 3 | 4 | 2 | 1 | 7 | 6 | |
| Sum of rank | 59 | 88 | 101 | 89 | 104 | 88 | 108 | 99 | |
| Average rank | 2.5652 | 3.8261 | 4.3913 | 3.8696 | 4.5217 | 3.8261 | 4.6957 | 4.3043 | |
| Overall rank | 1 | 2 | 5 | 3 | 6 | 2 | 7 | 4 | |
Fig. 2Convergence curves of LFOA and other algorithms for f1, f2, f3 and f4
Fig. 3Convergence curves of LFOA and other algorithms for f10, f11, f12 and f13
Classification performance of LFOA-SVM
| Fold | ACC | Sensitivity | Specificity | MCC |
|---|---|---|---|---|
| #1 | 0.9362 | 0.9167 | 0.9565 | 0.8732 |
| #2 | 0.8936 | 0.8148 | 1.0000 | 0.8074 |
| #3 | 0.9362 | 0.8500 | 1.0000 | 0.8746 |
| #4 | 0.9574 | 0.9565 | 0.9583 | 0.9149 |
| #5 | 0.9574 | 1.0000 | 0.9167 | 0.9183 |
| #6 | 0.9149 | 0.8400 | 1.0000 | 0.8431 |
| #7 | 0.9787 | 1.0000 | 0.9565 | 0.9583 |
| #8 | 0.9362 | 0.9000 | 0.9630 | 0.8694 |
| #9 | 0.9362 | 0.9130 | 0.9583 | 0.8730 |
| #10 | 0.9362 | 0.9310 | 0.9444 | 0.8672 |
| Avg. | 0.9383 | 0.9122 | 0.9653 | 0.8799 |
| Std. | 0.0234 | 0.0636 | 0.0272 | 0.0419 |
Fig. 4Classification performance obtained by the seven methods in terms of ACC, sensitivity, specificity and MCC
Confusion matrix obtained by the proposed LFOA-SVM and FOA-SVM
| No. of fold | LFOA-SVM | FOA-SVM | ||
|---|---|---|---|---|
| 1 | 20 | 3 | 27 | 2 |
| 0 | 24 | 3 | 15 | |
| 2 | 25 | 3 | 11 | 2 |
| 1 | 18 | 3 | 31 | |
| 3 | 26 | 3 | 25 | 2 |
| 1 | 17 | 1 | 19 | |
| 4 | 24 | 2 | 22 | 3 |
| 1 | 20 | 2 | 20 | |
| 5 | 16 | 3 | 24 | 2 |
| 2 | 26 | 0 | 21 | |
| 6 | 17 | 2 | 20 | 2 |
| 1 | 27 | 0 | 25 | |
| 7 | 25 | 0 | 22 | 1 |
| 0 | 22 | 2 | 22 | |
| 8 | 20 | 2 | 18 | 3 |
| 0 | 25 | 0 | 26 | |
| 9 | 22 | 2 | 18 | 4 |
| 0 | 23 | 1 | 24 | |
| 10 | 21 | 2 | 28 | 2 |
| 1 | 23 | 0 | 17 | |
| Sum | 216 | 22 | 215 | 23 |
| 7 | 225 | 12 | 220 | |
Fig. 5Relationship between the iteration and training accuracy of LFOA-SVM, FOA-SVM, PSO-SVM, and GA-SVM