| Literature DB >> 35885189 |
Ying Lv1, Bofeng Zhang2,3, Guobing Zou1, Xiaodong Yue1, Zhikang Xu1, Haiyan Li3.
Abstract
Domain adaptation aims to learn a classifier for a target domain task by using related labeled data from the source domain. Because source domain data and target domain task may be mismatched, there is an uncertainty of source domain data with respect to the target domain task. Ignoring the uncertainty may lead to models with unreliable and suboptimal classification results for the target domain task. However, most previous works focus on reducing the gap in data distribution between the source and target domains. They do not consider the uncertainty of source domain data about the target domain task and cannot apply the uncertainty to learn an adaptive classifier. Aimed at this problem, we revisit the domain adaptation from source domain data uncertainty based on evidence theory and thereby devise an adaptive classifier with the uncertainty measure. Based on evidence theory, we first design an evidence net to estimate the uncertainty of source domain data about the target domain task. Second, we design a general loss function with the uncertainty measure for the adaptive classifier and extend the loss function to support vector machine. Finally, numerical experiments on simulation datasets and real-world applications are given to comprehensively demonstrate the effectiveness of the adaptive classifier with the uncertainty measure.Entities:
Keywords: domain adaptation; evidence theory; transfer learning; uncertainty measure
Year: 2022 PMID: 35885189 PMCID: PMC9317131 DOI: 10.3390/e24070966
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.738
Figure 1(a) Data distribution of source domain, (b) category distribution of source domain data in label space of the target domain.
Figure 2Evidence net architecture.
Figure 3Schematic diagram: (a) Classification hyperplane is generated by SVM. (b) Classification hyperplane is generated by SVM with uncertainty.
Cross-domain sentiment classification accuracies of Amazon product reviews generated by SVMU and baseline methods.
| Task | SVMU | TCA | CORAL | GFK | JDA | KMM | MTLF | SCA | EasyTL | WDGAL |
|---|---|---|---|---|---|---|---|---|---|---|
|
|
| 77.76 | 70.76 | 75.76 | 77.26 | 83.76 | 68.59 | 81.56 | 79.80 | 83.05 |
|
|
| 75.54 | 66.21 | 72.00 | 75.93 | 79.02 | 69.63 | 78.08 | 79.70 | 80.09 |
|
| 81.92 | 78.74 | 70.00 | 73.50 | 78.09 | 75.90 | 72.74 | 79.09 | 80.90 |
|
|
| 82.11 | 76.05 | 73.05 | 71.85 | 77.65 | 80.50 | 70.70 |
| 79.90 | 80.72 |
|
|
| 76.38 | 68.70 | 68.96 | 76.03 | 68.51 | 71.90 | 78.82 | 80.80 | 82.26 |
|
| 82.64 | 79.34 | 71.96 | 75.70 | 78.29 | 76.45 | 74.18 | 80.39 | 82.00 |
|
|
|
| 73.35 | 69.90 | 72.60 | 72.65 | 73.70 | 69.20 | 77.00 | 75.00 | 77.22 |
|
|
| 73.66 | 65.71 | 71.11 | 72.16 | 77.86 | 70.73 | 77.26 | 75.30 | 78.28 |
|
| 86.40 | 79.74 | 72.35 | 76.20 | 80.14 | 80.39 | 71.36 | 84.63 | 84.90 |
|
|
|
| 73.05 | 67.45 | 73.75 | 75.05 | 74.25 | 66.04 | 78.90 | 76.50 | 77.16 |
|
|
| 77.26 | 68.61 | 74.21 | 77.56 | 75.96 | 70.31 | 77.46 | 76.30 | 78.89 |
|
|
| 78.74 | 75.68 | 76.58 | 80.32 | 85.00 | 68.58 | 85.65 | 82.50 | 86.29 |
| Average |
| 76.63 | 70.03 | 73.52 | 76.76 | 77.61 | 70.33 | 80.10 | 79.47 | 81.90 |
Cross-domain classification accuracies on Office+Caltech image datasets (SURF features) generated by SVMU and baseline methods.
| Task | SVMU | TCA | CORAL | GFK | JDA | KMM | MTLF | SCA | EasyTL |
|---|---|---|---|---|---|---|---|---|---|
|
|
| 47.76 | 45.37 | 40.25 | 49.36 | 45.41 | 45.37 | 48.29 | 43.01 |
|
| 44.31 | 41.12 | 43.75 | 43.31 | 42.49 | 41.40 | 41.38 | 44.21 |
|
|
|
| 44.63 | 44.78 | 43.98 | 45.97 | 42.85 | 42.59 | 43.90 | 40.68 |
|
|
| 58.20 | 53.59 | 51.20 | 54.78 | 50.10 | 54.17 | 53.74 | 50.10 |
|
| 44.00 | 41.40 | 46.22 | 42.85 | 43.22 | 43.58 | 40.69 | 39.49 |
|
|
|
| 42.64 | 43.73 | 40.68 | 41.69 | 43.81 | 46.10 | 43.56 | 42.49 |
|
|
| 52.15 | 58.81 | 52.05 | 53.09 | 58.60 | 59.92 | 57.72 | 61.94 |
|
|
| 49.70 | 48.01 | 48.28 | 45.52 | 47.81 | 45.73 | 50.32 | 51.17 |
|
|
| 46.10 | 44.40 | 45.59 | 43.49 | 44.45 | 43.50 | 42.81 | 44.49 |
|
|
| 58.06 | 56.20 | 59.75 | 56.78 | 52.15 | 51.07 | 60.48 | 60.18 |
|
|
| 45.30 | 42.08 | 48.72 | 49.17 | 49.81 | 49.38 | 50.63 | 49.65 |
|
|
| 43.26 | 44.08 | 40.89 | 46.17 | 45.62 | 44.76 | 46.36 | 47.07 |
| Average |
| 47.52 | 47.58 | 46.46 | 47.64 | 47.13 | 47.05 | 48.45 | 48.75 |
Figure 4Results on synthetic data: (a) Classification hyperplane is generated by SVM. (b) Classification hyperplane is generated by SVM with uncertainty.
Figure 5Cross-domain sentiment classification accuracies on Amazon product reviews generated by SVM with and without uncertainty.
Figure 6The uncertainty of the category ’backpack’ and ’bike’ in source domain C about the target domain A classification task.