| Literature DB >> 34764591 |
Peng Gao1,2, Weifei Wu1, Jingmei Li1.
Abstract
Knowledge in the source domain can be used in transfer learning to help train and classification tasks within the target domain with fewer available data sets. Therefore, given the situation where the target domain contains only a small number of available unlabeled data sets and multi-source domains contain a large number of labeled data sets, a new Multi-source Fast Transfer Learning algorithm based on support vector machine(MultiFTLSVM) is proposed in this paper. Given the idea of multi-source transfer learning, more source domain knowledge is taken to train the target domain learning task to improve classification effect. At the same time, the representative data set of the source domain is taken to speed up the algorithm training process to improve the efficiency of the algorithm. Experimental results on several real data sets show the effectiveness of MultiFTLSVM, and it also has certain advantages compared with the benchmark algorithm.Entities:
Keywords: Classification; Multi-source transfer learning; Support vector machine
Year: 2021 PMID: 34764591 PMCID: PMC8023540 DOI: 10.1007/s10489-021-02194-9
Source DB: PubMed Journal: Appl Intell (Dordr) ISSN: 0924-669X Impact factor: 5.019
Fig. 1Difference between transfer learning and traditional machine learning
Fig. 2Multi-source transfer learning
Fig. 3Framework of MultiFTLSVM
Steps of MultiTLGP algorithm
20-Newsgroups Data set
| Data set | Category | Subcategory | Samples | Features |
|---|---|---|---|---|
| 20-Newsgroups | comp | comp.graphics | 970 | 25,804 |
| 963 | ||||
| comp.sys.ibm.pc.hardware | 979 | |||
| comp.sys.mac.hardware | 958 | |||
| rec | rec.autos | 987 | ||
| rec.motorcycles | 993 | |||
| rec.sport.baseball | 991 | |||
| rec.sport.hokey | 997 | |||
| sci | sci.crypt | 989 | ||
| sci.electronics | 984 | |||
| sci.med | 987 | |||
| sci.space | 985 | |||
| talk | talk.politics.guns | 909 | ||
| talk.politics.mideast | 940 | |||
| talk.politics.misc | 774 | |||
| talk.religion.misc | 627 |
Emotion analysis data set
| Domain | Comments | Training | Test | Positive sample ratio | Features |
|---|---|---|---|---|---|
| Books (B) | 6465 | 2000 | 4465 | 50% | 30,000 |
| DVDs (D) | 5586 | 2000 | 3586 | 50% | 30,000 |
| Electronics (E) | 7681 | 2000 | 5681 | 50% | 30,000 |
| Kitchen (K) | 7945 | 2000 | 5945 | 50% | 30,000 |
Spam data set
| Domain | Emails | Positive sample | Negative sample | Features |
|---|---|---|---|---|
| U1 | 4000 | 2000 | 2000 | 206,908 |
| U2 | 2500 | 1250 | 1250 | 206,908 |
| U3 | 2500 | 1250 | 1250 | 206,908 |
| U4 | 2500 | 1250 | 1250 | 206,908 |
The average classification accuracy (%) and standard deviation of the algorithm on 20Newsgroups
| Algorithms | r- > s | c- > s | t- > s | c- > r | s- > r | t- > r | s- > t | c- > t | r- > t |
|---|---|---|---|---|---|---|---|---|---|
| SVM | 73.81 (0.59) | 75.38 (0.65) | 75.47 (0.77) | 87.51 (0.89) | 71.39 (1.05) | 84.32 (0.92) | 76.82 (0.87) | 95.43 (0.97) | 83.26 (1.23) |
| STL-SVM | 91.52 (1.06) | 87.88 (1.12) | 91.23 (1.23) | 97.32 (1.18) | 81.28 (1.03) | 91.35 (1.18) | 92.57 (1.21) | 98.25 (1.01) | 97.06 (1.17) |
| STIL | 86.65 (1.26) | 81.76 (1.68) | 83.23 (1.57) | 89.57 (1.64) | 84.14 (1.57) | 88.65 (1.55) | 82.71 (1.62) | 93.56 (1.53) | 89.28 (1.75) |
| RankRE-TL | 90.28 (0.98) | 90.01 (1.05) | 91.57 (0.52) | 96.38 (0.28) | 89.15 (1.13) | 93.35 (0.64) | 92.71 (0.53) | 97.13 (0.29) | 92.25 (0.69) |
| HCMA | 81.27 (0.58) | 88.28 (0.65) | 87.29 (0.71) | 94.65 (0.75) | 87.32 (0.86) | 92.24 (0.54) | 88.97 (0.62) | 96.87 (0.47) | 90.34 (0.53) |
| {r,c,t}- > s | {r,s,t}- > r | {s,c,r}- > t | |||||||
| MultiDTNN | 96.21 (0.35) | 94.76 (0.41) | 95.88 (0.39) | 97.45 (0.67) | 92.76 (0.72) | 98.34 (0.88) | 95.46 (0.87) | 97.65 (0.72) | 98.86 (0.83) |
| FastDAM | 95.34 (0.57) | 92.98 (0.51) | 94.32 (0.58) | 94.36 (0.53) | 91.25 (0.49) | 97.02 (0.67) | 94.76 (0.58) | 96.75 (0.64) | 96.89 (0.69) |
| IMTL | 96.56 (0.43) | 95.84 (0.48) | 96.15 (0.51) | 98.87 (0.45) | 93.74 (0.52) | 99.25 (0.47) | 95.83 (0.65) | 98.87 (0.57) | 98.47 (0.55) |
| SSL-MSTL | 94.33 (0.38) | 93.21 (0.39) | 95.42 (0.46) | 98.75 (0.56) | 92.65 (0.65) | 98.82 (0.72) | 94.28 (0.78) | 97.98 (0.69) | 97.54 (0.63) |
| MultiFTLSVM | 97.65 (0.37) | 96.13 (0.42) | 97.26 (0.43) | 99.23 (0.45) | 95.23 (0.51) | 99.57 (0.49) | 96.75 (0.47) | 99.58 (0.54) | 99.85 (0.61) |
The average classification accuracy (%) and standard deviation of the algorithm on spam email and emotion analysis
| Algorithms | B- > K | D- > K | E- > K | U1- > U4 | U2- > U4 | U3- > U4 |
|---|---|---|---|---|---|---|
| SVM | 59.26 (2.78) | 60.83 (2.74) | 63.24 (2.73) | 79.54 (2.52) | 78.25 (2.51) | 80.75 (2.68) |
| STL-SVM | 64.21 (2.12) | 63.34 (2.25) | 68.12 (2.23) | 86.23 (2.19) | 84.63 (2.65) | 86.34 (2.21) |
| STIL | 65.11 (2.23) | 65.13 (2.12) | 67.28 (2.36) | 87.21 (1.27) | 85.87 (1.21) | 87.21 (1.29) |
| RankRE-TL | 63.78 (2.65) | 65.12 (2.64) | 69.57 (2.63) | 87.54 (2.21) | 86.12 (2.98) | 88.63 (2.78) |
| HCMA | 62.52 (2.67) | 64.15 (2.66) | 66.55 (2.64) | 86.45 (2.37) | 85.28 (2.12) | 86.54 (1.98) |
| {B,D,E}- > K | {U1,U2,U3}- > U4 | |||||
| MultiDTNN | 71.87 (2.13) | 74.67 (2.24) | 77.56 (2.11) | 94.87 (2.48) | 91.23 (2.23) | 94.98 (2.32) |
| FastDAM | 67.45 (2.31) | 69.17 (2.26) | 74.32 (2.19) | 92.35 (2.28) | 91.27 (2.35) | 94.56 (2.42) |
| IMTL | 70.45 (2.13) | 73.98 (2.11) | 76.15 (1.98) | 94.87 (2.14) | 93.23 (2.18) | 96.11 (2.13) |
| SSL-MSTL | 68.33 (2.28) | 71.26 (2.17) | 75.43 (2.21) | 93.47 (2.26) | 92.65 (2.28) | 95.32 (2.25) |
| MultiFTLSVM | (1.89) | 75.13 (1.95) | 78.23 (1.86) | 95.43 (2.11) | 94.32 (2.15) | 97.86 (2.10) |
Average score training time (s) and standard deviation of the algorithm on 20Newsgroups
| Algorithms | r- > s | c- > s | t- > s | c- > r | s- > r | t- > r | s- > t | c- > t | r- > t |
|---|---|---|---|---|---|---|---|---|---|
| SVM | 1.35 (0.14) | 1.31 (0.15) | 1.41 (0.16) | 1.28 (0.16) | 1.32 (0.15) | 1.25 (0.12) | 1.43 (0.15) | 1.47 (0.13) | 1.26 (0.11) |
| STL-SVM | 7.23 (0.73) | 8.23 (0.89) | 6.34 (0.98) | 5.34 (0.69) | 7.43 (0.72) | 6.12 (0.63) | 8.32 (0.77) | 9.13 (0.76) | 11.23 (0.86) |
| STIL | 9.12 (0.63) | 8.23 (0.61) | 7.26 (0.59) | 8.21 (0.58) | 8.11 (0.57) | 7.43 (0.72) | 9.21 (0.82) | 10.23 (0.83) | 12.34 (0.93) |
| RankRE-TL | 12.65 (0.98) | 11.13 (1.12) | 13.86 (1.13) | 10.15 (1.01) | 11.76 (1.17) | 9.43 (1.14) | 14.36 (1.21) | 15.67 (1.23) | 13.86 (1.19) |
| HCMA | 8.11 (0.43) | 7.32 (0.45) | 9.63 (0.48) | 7.98 (0.41) | 9.86 (0.54) | 8.23 (0.47) | 11.35 (0.63) | 12.56 (0.68) | 12.04 (0.64) |
| {r,c,t}- > s | {r,c,t}- > r | {r,c,t}- > t | |||||||
| MultiDTNN | 72.23 (3.23) | 65.23 (3.12) | 69.32 (3.23) | 55.67 (2.12) | 62.34 (2.24) | 52.34 (2.98) | 76.21 (4.76) | 73.87 (4.18) | 67.76 (4.65) |
| FastDAM | 119.56 (2.76) | 112.53 (2.55) | 116.38 (2.63) | 110.45 (1.98) | 111.38 (1.95) | 109.87 (1.92) | 122.26 (2.78) | 124.33 (2.81) | 123.15 (2.79) |
| IMTL | 18.23 (1.25) | 13.65 (1.18) | 15.63 (1.24) | 12.54 (1.19) | 13.24 (1.21) | 11.35 (1.16) | 19.12 (1.76) | 20.58 (1.79) | 19.77 (1.77) |
| SSL-MSTL | 1.34 (0.85) | 1.21 (0.84) | 1.29 (0.87) | 1.16 (0.96) | 1.28 (1.03) | 0.95 (0.82) | 1.36 (1.16) | 1.42 (1.18) | 1.41 (1.17) |
| MultiFTLSVM | 1.15 (0.68) | 0.98 (0.62) | 1.11 (0.65) | 0.88 (0.52) | 0.96 (0.55) | 0.93 (0.54) | 1.24 (0.72) | 1.28 (0.74) | 1.26 (1.72) |
The average training time (s) and standard deviation of the algorithm on the sentiment analysis data set and spam data set
| Algorithms | B- > K | D- > K | E- > K | U1- > U4 | U2- > U4 | U3- > U4 |
|---|---|---|---|---|---|---|
| SVM | 2.45 (0.15) | 2.56 (0.14) | 2.62 (0.16) | 1.83 (0.15) | 1.82 (0.14) | 1.81 (1.12) |
| STL-SVM | 4.56 (0.87) | 6.78 (0.92) | 5.98 (0.92) | 6.11 (0.87) | 4.98 (0.97) | 5.01 (0.89) |
| STIL | 10.23 (0.68) | 12.34 (0.82) | 13.76 (0.83) | 7.97 (0.84) | 6.83 (0.87) | 6.33 (0.98) |
| RankRE-TL | 13.78 (1.15) | 14.12 (1.13) | 16.57 (1.12) | 9.98 (0.97) | 9.69 (0.96) | 9.15 (0.95) |
| HCMA | 8.27 (0.73) | 8.38 (0.72) | 8.55 (0.71) | 7.45 (0.76) | 7.32 (0.77) | 7.29 (0.75) |
| {B,D,E}- > K | {U1,U2,U3}- > U4 | |||||
| MultiDTNN | 87.23 (4.36) | 76.34 (4.27) | 71.23 (5.12) | 65.83 (4.31) | 68.67 (4.21) | 53.87 (4.34) |
| FastDAM | 129.78 (2.65) | 132.86 (2.75) | 136.34 (2.87) | 121.43 (2.43) | 119.54 (2.42) | 117.24 (2.34) |
| IMTL | 20.45 (1.35) | 23.98 (1.51) | 26.15 (1.43) | 21.25 (1.23) | 19.86 (1.19) | 18.43 (1.18) |
| SSL-MSTL | 1.98 (0.49) | 2.32 (0.51) | 2.65 (0.52) | 2.28 (0.64) | 2.21 (0.63) | 2.19 (0.61) |
| MultiFTLSVM | 1.65 (0.59) | 1.58 (0.56) | 1.29 (0.55) | 1.12 (0.42) | 0.95 (0.41) | 0.92 (0.46) |
Fig. 4Sensitivity of Parameter C in MultiFTLSVM Algorithm in 20-Newsgroups, Emotion Analysis, and Spam Data Sets
Fig. 5Sensitivity of Parameter C in MultiFTLSVM Algorithm in 20-Newsgroups, Emotion Analysis, and Spam Data Sets
Fig. 6Sensitivity of Parameter λ in MultiFTLSVM Algorithm in 20-Newsgroups, Emotion Analysis, and Spam Data Sets