| Literature DB >> 27747505 |
Zengda Guan1, Ang Li2,3, Tingshao Zhu4,5.
Abstract
It is important to acquire web users' psychological characteristics. Recent studies have built computational models for predicting psychological characteristics by supervised learning. However, the generalization of built models might be limited due to the differences in distribution between the training and test dataset. To address this problem, we propose some local regression transfer learning methods. Specifically, k-nearest-neighbour and clustering reweighting methods are developed to estimate the importance of each training instance, and a weighted risk regression model is built for prediction. Adaptive parameter-setting method is also proposed to deal with the situation that the test dataset has no labels. We performed experiments on prediction of users' personality and depression based on users of different genders or different districts, and the results demonstrated that the methods could improve the generalization capability of learning models.Entities:
Keywords: Covariate shift; Local transfer learning; Psychological characteristics prediction
Year: 2015 PMID: 27747505 PMCID: PMC4883139 DOI: 10.1007/s40708-015-0017-z
Source DB: PubMed Journal: Brain Inform ISSN: 2198-4026
Local regression transfer learning results for predicting personality across different-gender datasets. MSE is used to measure the test results
| Condition | A | C | E | N | O |
|---|---|---|---|---|---|
| MARS | 34.8431 | 45.9335 | 34.0655 | 29.5776 | 32.6700 |
| KMM | 26.7654 | 30.8683 | 24.0116 | 27.9208 | 28.1425 |
| GkNN | 25.2125 | 31.5119 | 23.1247 | 27.6345 | 30.6127 |
| kNN | 24.3776 | 31.1357 | 23.1247 | 27.4160 | 28.2948 |
| TTkNN | 24.3149 | 31.0282 | 22.8547 | 27.8493 | 28.1424 |
| AkNN1 | 24.3913 | 31.2013 | 24.5649 | 27.4419 | 28.2027 |
| AkNN2 | 29.8956 | 31.0112 | 24.0063 | 27.8779 | 28.1899 |
| Clust | 27.3070 | 30.4555 | 23.9003 | 27.7718 | 28.1425 |
Fig. 1The impact of the number of nearest neighbours on the performance of k-NN transfer methods in trait A
Fig. 2The impact of cluster number in clustering regression transfer learning in trait A, C, E, N and O
Local regression transfer learning results for predicting personality across different-district datasets. MSE is used to measure the test results
| Condition | A | C | E | N | O |
|---|---|---|---|---|---|
| MARS | 43.6764 | 65.0172 | 44.3688 | 47.4115 | 229.8742 |
| KMM | 42.1194 | 48.9055 | 39.3781 | 47.4057 | 59.7330 |
| GkNN | 44.7136 | 45.8609 | 43.0928 | 49.2114 | 43.1696 |
| kNN | 43.2840 | 42.0574 | 38.8104 | 42.9135 | 43.1696 |
| TTkNN | 38.6335 | 40.8370 | 35.0338 | 41.8510 | 45.3623 |
| AkNN1 | 43.5722 | 41.3360 | 39.0398 | 41.4173 | 195.6540 |
| AkNN2 | 41.5186 | 62.8834 | 39.3683 | 52.2294 | 218.9917 |
| Clust | 39.1079 | 42.5235 | 37.7979 | 44.6171 | 113.6659 |
Local regression transfer learning results for predicting depression across different-gender dataset
| MARS | KMM | GkNN | kNN | TTkNN | AkNN1 | AkNN2 | Clust | |
|---|---|---|---|---|---|---|---|---|
| MSE | 126.9868 | 111.2089 | 113.5784 | 113.4111 | 113.1296 | 113.3482 | 113.3624 | 111.5430 |