| Literature DB >> 29297357 |
Hao Jiang1, Wai-Ki Ching2, Yushan Qiu3, Xiao-Qing Cheng4.
Abstract
BACKGROUND: Positive semi-definiteness is a critical property in kernel methods for Support Vector Machine (SVM) by which efficient solutions can be guaranteed through convex quadratic programming. However, a lot of similarity functions in applications do not produce positive semi-definite kernels.Entities:
Keywords: Bregman matrix divergence; Indefinite kernel; Projection method; SVM
Mesh:
Year: 2017 PMID: 29297357 PMCID: PMC5751553 DOI: 10.1186/s12918-017-0479-0
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Data set information
| Data set | Number of instances | Number of attributes |
|---|---|---|
| Sonar | 208 | 60 |
| Live disorder | 345 | 6 |
| Breast cancer | 680 | 10 |
| Cystic fibrosis | 177 | Depends on q |
| Leukemia | 355 | Depends on q |
| Lung cancer | 91 | 54675 |
Fig. 1Attribute distribution for different values of q in leukemia data and cystic fibrosis data
Averaged AUC values (%) of projection method and GHI kernel using sonar data, live disorder data, breast cancer data and NSCLC data
| Data sets | Parameters | Projection method | GHI kernel |
|---|---|---|---|
| Sonar |
| 82.87 ± 0.99 | 82.87 ± 0.99 |
|
|
| 53.42 ± 4.94 | |
|
|
| 54.10 ± 4.92 | |
|
| 84.29 ± 1.54 | 84.29 ± 1.54 | |
|
|
| 83.06 ± 2.04 | |
|
| 83.62 ± 1.17 | 83.62 ± 1.17 | |
| Live |
| 82.87 ± 0.99 | 82.87 ± 0.99 |
|
|
| 53.42 ± 4.94 | |
|
|
| 54.10 ± 4.92 | |
|
| 84.29 ± 1.54 | 84.29 ± 1.54 | |
|
|
| 83.06 ± 2.04 | |
|
| 83.62 ± 1.17 | 83.62 ± 1.17 | |
| Breast |
| 96.73 ± 0.11 | 96.73 ± 0.11 |
|
|
| 90.12 ± 4.78 | |
|
|
| 75.61 ± 7.44 | |
|
| 96.71 ± 0.11 | 96.71 ± 0.11 | |
|
| 96.92 ± 0.01 | 96.96 ± 0.01 | |
|
| 96.63 ± 0.10 | 96.63 ± 0.10 | |
| NSCLC |
| 100 ± 0 | 100 ± 0 |
|
|
| 64.07 ± 7.42 | |
|
|
| 51.47 ± 5.53 | |
|
| 100 ± 0 | 100 ± 0 | |
|
|
| 73.07 ± 8.17 | |
|
| 100 ± 0 | 100 ± 0 |
Bold face represents best performance, and no marks are made if two methods show comparable performance
Averaged AUC values (%) of projection method and GHI kernel using cystic fibrosis data
| Parameters | Projection method( | GHI( | Projection method( | GHI( |
|
| 78.57 ± 1.75 | 78.57 ± 1.75 | 81.32 ± 1.25 | 81.32 ± 1.25 |
|
| 78.94 ± 1.86 | 78.94 ± 1.86 | 81.74 ± 1.60 | 81.74 ± 1.60 |
|
| 78.64 ± 1.01 | 78.63 ± 1.01 | 80.82 ± 1.30 | 80.82 ± 1.29 |
|
| 79.33 ± 1.42 | 79.33 ± 1.41 | 80.53 ± 1.72 | 80.53 ± 1.72 |
|
| 79.32 ± 1.19 | 79.32 ± 1.19 | 81.06 ± 1.37 | 81.06 ± 1.36 |
|
| 78.14 ± 1.11 | 78.13 ± 1.11 | 80.79 ± 1.12 | 80.78 ± 1.12 |
| Parameters | Projection method( | GHI( | Projection method( | GHI( |
|
| 80.77 ± 1.44 | 80.76 ± 1.44 | 83.10 ± 2.10 | 83.09 ± 2.10 |
|
| 80.98 ± 1.81 | 80.97 ± 1.81 | 82.11 ± 1.77 | 82.13 ± 1.77 |
|
| 81.20 ± 1.95 | 81.19 ± 1.94 | 83.54 ± 1.46 | 83.51 ± 1.48 |
|
| 81.32 ± 1.26 | 81.30 ± 1.27 | 82.75 ± 2.14 | 82.79 ± 2.15 |
|
| 81.10 ± 1.10 | 81.09 ± 1.11 | 83.62 ± 1.61 | 83.65 ± 1.65 |
|
| 81.06 ± 1.39 | 81.04 ± 1.39 | 83.49 ± 0.77 | 83.56 ± 0.82 |
| Parameters | Projection method( | GHI( | Projection method( | GHI( |
|
| 74.03 ± 2.18 | 74.00 ± 2.18 | 72.30 ± 1.93 | 72.50 ± 1.87 |
|
| 71.67 ± 2.52 | 71.62 ± 2.58 | 73.62 ± 2.69 | 73.80 ± 2.70 |
|
| 74.77 ± 2.27 | 74.73 ± 2.28 | 71.94 ± 1.77 | 72.11 ± 1.65 |
|
| 73.73 ± 1.36 | 73.73 ± 1.38 | 71.49 ± 2.78 | 71.60 ± 2.84 |
|
| 72.62 ± 2.97 | 72.61 ± 2.92 | 72.81 ± 1.91 | 73.01 ± 1.92 |
|
| 75.23 ± 2.64 | 75.20 ± 2.55 | 73.53 ± 2.62 | 73.80 ± 2.67 |
| Parameters | Projection method( | GHI( | Projection method( | GHI( |
|
| 67.99 ± 2.78 | 67.60 ± 2.87 | 60.65 ± 4.20 | 60.90 ± 4.36 |
|
| 68.28 ± 3.51 | 67.89 ± 3.60 | 58.19 ± 3.72 | 58.33 ± 3.77 |
|
| 67.75 ± 2.20 | 67.25 ± 2.19 | 58.98 ± 3.67 | 59.28 ± 3.69 |
|
| 67.90 ± 3.11 | 67.23 ± 3.04 | 58.28 ± 4.20 | 58.34 ± 4.13 |
|
| 67.58 ± 2.91 | 66.96 ± 2.88 | 58.66 ± 2.40 | 58.86 ± 2.37 |
|
| 68.85 ± 2.28 | 68.44 ± 2.13 | 59.62 ± 3.34 | 59.77 ± 3.37 |
| Parameters | Projection method( | GHI( | ||
|
| 53.25 ± 3.99 | 53.25 ± 3.99 | ||
|
| 52.12 ± 4.28 | 52.12 ± 4.28 | ||
|
| 52.54 ± 3.22 | 52.54 ± 3.22 | ||
|
| 51.16 ± 2.37 | 51.16 ± 2.37 | ||
|
| 51.62 ± 4.18 | 51.62 ± 4.18 | ||
|
| 51.96 ± 5.01 | 51.96 ± 5.01 |
Averaged AUC values (%) of projection method and GHI kernel using leukemia data
| Parameters | Projection method( | GHI( | Projection method( | GHI( |
|
| 93.68 ± 0.62 | 93.68 ± 0.62 | 95.90 ± 0.84 | 95.90 ± 0.84 |
|
|
| 87.00 ± 4.10 |
| 94.93 ± 0.81 |
|
|
| 86.94 ± 3.37 |
| 94.53 ± 0.89 |
|
| 93.33 ± 0.74 | 93.32 ± 0.74 | 95.61 ± 0.46 | 95.61 ± 0.46 |
|
| 93.31 ± 0.47 | 93.51 ± 0.46 | 95.17 ± 0.69 | 95.66 ± 0.85 |
|
| 93.54 ± 0.66 | 93.54 ± 0.66 | 95.77 ± 0.40 | 95.77 ± 0.40 |
| Parameters | Projection method( | GHI( | Projection method( | GHI( |
|
| 95.07 ± 0.64 | 95.08 ± 0.65 | 93.51 ± 0.54 | 93.54 ± 0.54 |
|
| 95.13 ± 0.46 | 95.10 ± 0.471 | 93.86 ± 0.77 | 93.88 ± 0.77 |
|
| 94.83 ± 0.41 | 94.77 ± 0.42 | 94.13 ± 0.52 | 94.15 ± 0.52 |
|
| 95.13 ± 0.53 | 95.13 ± 0.53 | 94.05 ± 0.49 | 94.06 ± 0.49 |
|
| 94.84 ± 0.67 | 94.85 ± 0.67 | 93.81 ± 0.69 | 93.82 ± 0.69 |
|
| 94.77 ± 0.61 | 94.77 ± 0.61 | 93.98 ± 0.36 | 93.99 ± 0.36 |
| Parameters | Projection method( | GHI( | Projection method( | GHI( |
|
| 93.40 ± 0.58 | 93.44 ± 0.58 | 93.23 ± 0.26 | 93.38 ± 0.26 |
|
| 93.12 ± 0.70 | 93.16 ± 0.70 | 93.07 ± 0.75 | 93.21 ± 0.74 |
|
| 93.20 ± 0.27 | 93.25 ± 0.28 | 93.05 ± 0.63 | 93.18 ± 0.63 |
|
| 93.61 ± 0.73 | 93.64 ± 0.74 | 93.21 ± 0.48 | 93.35 ± 0.48 |
|
| 93.78 ± 0.56 | 93.83 ± 0.56 | 93.26 ± 0.70 | 93.41 ± 0.72 |
|
| 93.71 ± 0.72 | 93.75 ± 0.73 | 93.38 ± 0.65 | 93.51 ± 0.67 |
| Parameters | Projection method( | GHI( | Projection method( | GHI( |
|
| 92.15 ± 0.68 | 92.37 ± 0.67 | 90.10 ± 0.71 | 90.36 ± 0.70 |
|
| 92.33 ± 0.57 | 92.53 ± 0.59 | 90.68 ± 1.14 | 90.92 ± 1.13 |
|
| 92.11 ± 0.86 | 92.31 ± 0.86 | 90.72 ± 0.73 | 90.96 ± 0.73 |
|
| 92.01 ± 0.50 | 92.23 ± 0.50 | 90.67 ± 1.06 | 90.93 ± 1.04 |
|
| 92.06 ± 0.45 | 92.27 ± 0.43 | 90.31 ± 0.90 | 90.53 ± 0.89 |
|
| 92.28 ± 0.71 | 92.48 ± 0.73 | 90.66 ± 0.65 | 90.92 ± 0.67 |
| Parameters | Projection method( | GHI( | ||
|
| 88.92 ± 0.59 | 89.20 ± 0.62 | ||
|
| 89.61 ± 0.62 | 89.86 ± 0.63 | ||
|
| 89.33 ± 0.68 | 89.60 ± 0.67 | ||
|
| 89.54 ± 0.96 | 89.80 ± 0.96 | ||
|
| 88.57 ± 0.67 | 88.82 ± 0.68 | ||
|
| 88.56 ± 0.63 | 88.84 ± 0.63 |
Bold face represents best performance for leukemia data in the compared two methods: Projection method and GHI Kernel method, and no marks are made if two methods show comparable performance
Averaged AUC values (%) of projection method and Cosine kernel for the considered datasets
| Dataset | Projection method | Cosine kernel |
|---|---|---|
| Live disorder data |
| 65.63 ± 2.75 |
| Sonar data |
| 67.46 ± 4.32 |
| Breast data |
| 97.99 ± 3.09 |
| Cystic ( |
| 76.89 ± 3.24 |
| Cystic ( |
| 79.80 ± 1.84 |
| Cystic ( |
| 70.10 ± 4.01 |
| Cystic ( |
| 58.52 ± 4.95 |
| Cystic ( |
| 52.13 ± 4.30 |
| Cystic ( |
| 60.72 ± 5.36 |
| Cystic ( |
| 58.54 ± 3.80 |
| Cystic ( | 63.17 ± 2.89 | 63.66 ± 3.21 |
| Cystic ( |
| 43.05 ± 2.38 |
| Leukemia ( |
| 90.73 ± 1.94 |
| Leukemia ( |
| 69.45 ± 4.81 |
| Leukemia ( |
| 69.97 ± 6.58 |
| Leukemia ( |
| 73.33 ± 5.99 |
| Leukemia ( |
| 71.81 ± 9.62 |
| Leukemia ( |
| 79.08 ± 6.96 |
| Leukemia ( |
| 65.26 ± 6.90 |
| Leukemia ( |
| 58.31 ± 2.87 |
| Leukemia ( |
| 55.88 ± 3.82 |
| NSCLC |
| 48.64 ± 5.30 |
Bold face represents best performance for the considered data sets in the compared two methods: Projection method and Cosine Kernel method, and no marks are made if two methods show comparable performance
Fig. 2Averaged AUC values for different values of λ in projection method with two considered kernels using sonar data set
Fig. 3Averaged AUC values for different values of λ in projection method with two considered kernels using live disorder data set
Fig. 4Averaged AUC values for different values of λ in projection method with two considered kernels using breast cancer data set
Fig. 5Averaged AUC values for different values of λ in projection method with two considered kernels using cystic fibrosis data set
Fig. 6Averaged AUC values for different values of λ in projection method with two considered kernels using leukemia data set
Fig. 7Averaged AUC values for different values of λ in projection method with two considered kernels using NSCLC data set
Optimal λ suggested in projection method with considered kernels
| Methods | GHI Kernel | Cosine kernel | ||
|---|---|---|---|---|
| Dataset |
|
|
| |
| Live disorder data | 2.38 | 2.37 | 2.45 | 2.17 |
| Sonar data | 2 | 2 | 2 | 2 |
| Breast data | 4.6 | 6.57 | 4.06 | 4.29 |
| Cystic ( | 71 | 71 | 100 | 5.8 |
| Cystic ( | 100 | 100 | 1 | 2.5 |
| Cystic ( | 100 | 100 | 1 | 2.8 |
| Cystic ( | 100 | 100 | 1 | 3.67 |
| Cystic ( | 100 | 100 | 100 | 6.2 |
| Cystic ( | 1 | 1 | 1 | 14 |
| Cystic ( | 1 | 1 | 1 | 21 |
| Cystic ( | 1 | 1 | 1 | 37 |
| Cystic ( | 1 | 1 | 1 | 85 |
| Leukemia ( | 47.33 | 47.33 | 46.67 | 10 |
| Leukemia ( | 28.25 | 28.25 | 22.80 | 7.4 |
| Leukemia ( | 46.5 | 46.5 | 47 | 5.42 |
| Leukemia ( | 1 | 1 | 1 | 3.06 |
| Leukemia ( | 1 | 1 | 1 | 2.33 |
| Leukemia ( | 1 | 1 | 1 | 2.39 |
| Leukemia ( | 100 | 100 | 100 | 2.56 |
| Leukemia ( | 100 | 100 | 100 | 2.67 |
| Leukemia ( | 1 | 1 | 1 | 2.98 |
| NSCLC | 2.0 | 46 | 2 | 2 |
Optimal λ comparison in projection method with considered kernels in sonar data, live disorder data, breast cancer data and NSCLC data
|
|
|
| Cosine | ||
|---|---|---|---|---|---|
| Sonar | ( | (2.00,0.8266) | (2.00,0.8787) |
| (2.00,0.9034) |
| ( | (2.59,0.8284) | (2.16,0.8784) | (4.32,0.8486) |
| |
| Live | ( | (2.38,0.7559) | (2.37,0.7397) | (2.45,0.7543) | (2.17,0.7292) |
| ( | (2.08,0.7571) | (2.04,0.7415) | (2.09,0.7542) | (6.70,0.7249) | |
| Breast | ( | (4.60,0.9689) | (6.57,0.9659) | (4.06,0.9684) | (4.29,0.9937) |
| ( | (2.03,0.9702) | (2.02,0.9675) | (2.20,0.9686) | (13.04,0.9936) | |
| NSCLC | ( | (2.00,0.9996) | (2.00,0.9959) | (2.00,0.9903) | (2.00,0.4059) |
| ( | (4.96,0.9990) | (3.58,0.9978) | (2.69,0.9910) | (2.30,0.4010) |
The italicize represents visible difference detected for projection methods with different optimal λ
Optimal λ comparison in projection method with considered kernels in cystic fibrosis data
|
|
|
| Cosine | ||
|---|---|---|---|---|---|
|
| ( | (71,0.7771) | (71,0.7711) | (100,0.7829) | (5.8,0.7889) |
| ( | (36.5,0.7775) | (36.5,0.7713) | (100,0.7829) | (28.3,0.7912) | |
|
| ( | (100,0.8031) | (100,0.8114) | (1,0.8209) | (2.5,0.7951) |
| ( | (100,0.8031) | (100,0.8114) | (1,0.8209) | (43.38,0.7959) | |
|
| ( | (100,0.8103) | (100,0.8140)) | (1,0.8033) | (2.8,0.7978) |
| ( | (100,0.8103) | (100,0.8140) | (1,0.8033) |
| |
|
| ( | (100,0.8296) | (100,0.8356) | (1,0.8286) | (3.67,0.7825) |
| ( | (100,0.8296) | (100,0.8356) | (1,0.8286) |
| |
|
| ( | (100,0.7400) | (100,0.7272) | (100,0.7405) | (6.2,0.6973) |
| ( | (100,0.7400) | (100,0.7272) | (100,0.7405) |
| |
|
| ( | (1,0.7173) | (1,0.7164) | (1,0.7224) | (14,0.7144) |
| ( | (1,0.7173) | (1,0.7164) | (1,0.7224) | (34.16,0.7156) | |
|
| ( | (1,0.6702) | (1,0.6721) | (1,0.6713) | (21,0.6620) |
| ( | (1,0.6702) | (1,0.6721) | (1,0.6713) | (22.17,0.6616) | |
|
| ( | (1,0.5928) | (1,0.5791) | (1,0.5935) | (37,0.6388) |
| ( | (1,0.5928) | (1,0.5791) | (1,0.5935) | (19.25,0.6387) | |
|
| ( | (1,0.5146) | (1,0.5107) | (1,0.5254) | (85,0.5637) |
| ( | (1,0.5146) | (1,0.5107) | (1,0.5254) | (17.5,0.5688) |
The italicize represents visible difference detected for projection methods with different optimal λ
Optimal λ comparison in projection method with considered kernels in leukemia data
|
|
|
| Cosine | ||
|---|---|---|---|---|---|
|
| ( | (47.3,0.9418) | (47.3,0.9377) | (46.7,0.9365) | (10,0.9469) |
| ( | (26.8,0.9419) | (26.8,0.9377) | (27.8,0.9367) | (16.6,0.9472) | |
|
| ( | (28.3,0.9551) | ((28.3,0.9551) | (22.8,0.9582) | (7.4,0.9541) |
| ( | (35.9,0.9550) | (35.9,0.9551) | (29.3,0.9582) | (24.7,0.9555) | |
|
| ( | (46.5,0.9512) | (46.5,0.9540) | (47,0.9500) | (5.42,0.9573) |
| ( | (87.8,0.9512) | (87.8,0.9551) | (88.8,0.9500) | (23.1,0.9593) | |
|
| ( | (1,0.9427) | (1,0.9416) | (1,0.9405) | (3.06,0.9485) |
| ( | (1,0.9427) | (1,0.9416) | (1,0.9405) | (18.6,0.9522) | |
|
| ( | (1,0.9352) | (1,0.9362) | (1,0.9363) | (2.33,0.9175) |
| ( | (1,0.9352) | (1,0.9362) | (1,0.9363) | (11.08,0.9259) | |
|
| ( | (1,0.9310) | (1,0.9319) | (1,0.9311) | (2.39,0.9333) |
| ( | (1,0.9310) | (1,0.9319) | (1,0.9311) | (7.74,0.9337) | |
|
| ( | (100,0.9201) | (100,0.9236) | (1,0.9212) | (2.56,0.8993) |
| ( | (100,0.9201) | (100,0.9236) | (1,0.9212) | (5.03,0.8921) | |
|
| ( | (100,0.9035) | (100,0.9096) | (100,0.9059) | (2.67,0.8795) |
| ( | (100,0.9035) | (100,0.9096) | (100,0.9059) | (3.56,0.8845) | |
|
| ( | (1,0.8936) | (1,0.8915) | (1,0.8899) | (2.98,0.8734) |
| ( | (1,0.8936) | (1,0.8915) | (1,0.8899) | (3.72,0.8735) |
The boldface represents best performance detected for projection methods with different optimal λ, and no marks are made if two methods show comparable performance