| Literature DB >> 35615756 |
Ana Vukicevic1,2, Milan Vukicevic1, Sandro Radovanovic1, Boris Delibasic1.
Abstract
Crowdsourcing and crowd voting systems are being increasingly used in societal, industry, and academic problems (labeling, recommendations, social choice, etc.) due to their possibility to exploit "wisdom of crowd" and obtain good quality solutions, and/or voter satisfaction, with high cost-efficiency. However, the decisions based on crowd vote aggregation do not guarantee high-quality results due to crowd voter data quality. Additionally, such decisions often do not satisfy the majority of voters due to data heterogeneity (multimodal or uniform vote distributions) and/or outliers, which cause traditional aggregation procedures (e.g., central tendency measures) to propose decisions with low voter satisfaction. In this research, we propose a system for the integration of crowd and expert knowledge in a crowdsourcing setting with limited resources. The system addresses the problem of sparse voting data by using machine learning models (matrix factorization and regression) for the estimation of crowd and expert votes/grades. The problem of vote aggregation under multimodal or uniform vote distributions is addressed by the inclusion of expert votes and aggregation of crowd and expert votes based on optimization and bargaining models (Kalai-Smorodinsky and Nash) usually used in game theory. Experimental evaluation on real world and artificial problems showed that the bargaining-based aggregation outperforms the traditional methods in terms of cumulative satisfaction of experts and crowd. Additionally, the machine learning models showed satisfactory predictive performance and enabled cost reduction in the process of vote collection.Entities:
Keywords: Bargaining models; Crowd-voting; Expert knowledge; Machine learning; Matrix-factorisation
Year: 2022 PMID: 35615756 PMCID: PMC9123878 DOI: 10.1007/s10726-022-09783-0
Source DB: PubMed Journal: Group Decis Negot ISSN: 0926-2644
Fig. 1General data flow of BargCrEx
Defined level of agreement between groups
| Case | Expert (within) LOA | Crowd (within) LOA | Joint (total) LOA | Description |
|---|---|---|---|---|
| 1 | High | High | High | All participants have similar opinions ES: High, CS: High, OS: High |
| 2 | High | High | Low | Opinions within groups are similar but not between groups. The optimal grade cannot satisfy both groups. ES: High or Low CS: High or Low, OS: Medium |
| 3 | High | Low | High | Not possible (one group has a high within disagreement) |
| 4 | High | Low | Low | Expert opinion should be supported ES: High, CS: Low, OS: Medium |
| 5 | Low | High | High | Not possible (one group has a high within disagreement) |
| 6 | Low | High | Low | Crowd opinion should be supported: ES: Low, CS: High, OS: Medium |
| 7 | Low | Low | High | Not possible (both groups have high within disagreement cannot lead to high satisfaction) |
| 8 | Low | Low | Low | All opinions are different. No solution is trustworthy? ES: Low CS: Low, OS: Low |
Fig. 3Artificial “less extreme” cases
Hyperparameter grid search optimization of ALS
| Parameter | Hyperparameter range |
|---|---|
| Number of latent factors | [20, 30, 40, 50, 70, 100] |
| Regularizations | [0., 0.1, 0.3, 0.5, 0.7, 1., 10., 100.] |
Hyperparameter grid search optimization of regression algorithms
| Algorithm | Parameters | Values |
|---|---|---|
| Linear regression | – | – |
| Random forest | n_estimators | [1, 3, 5, 7, 10, 50, 100, 200, 500] |
| max_depth | [5, 10, 15] | |
| Gradient boosted regressor | n_estimators | [1, 3, 5, 7, 10, 50, 100, 200, 500] |
| max_depth | [1, 3, 5, 7, 10, 15, 30] | |
| llearning_rate | [0.01, 0.1, 0.05, 0.25, 0.5, 1] |
Description of benchmark aggregation models
| Majority (global) | Majority vote of all voters (experts and crowd) |
| Median (global) | Median vote of all voters (experts and crowd) |
| Mean (global) | Mean vote of all voters (experts and crowd) |
| Majority (expert) | Majority vote of expert voters (experts and crowd) |
| Median (expert) | Median vote of expert voters (experts and crowd) |
| Mean (expert) | Mean vote of expert voters (experts and crowd) |
| Majority (crowd) | Majority vote of expert voters (experts and crowd) |
| Median (crowd) | Median vote of expert voters (experts and crowd) |
| Mean (crowd) | Mean vote of expert voters (experts and crowd) |
| Weighted (crowd) | Weighted average sum of crowd votes |
| Weighted (crowd and experts) | Weighted average of crowd and expert votes |
Results of ALS algorithm for data embedding
| Data set | N_factors | Regularization | Train MSE | Test MSE |
|---|---|---|---|---|
| Journal | 50 | 0 | 0.65 | 0.89 |
| Science | 50 | 0 | 0.64 | 0.88 |
Estimation results for Science dataset
| DataSet | Model | best_params | Train MSE | Test MSE |
|---|---|---|---|---|
| Science | LinearRegression | {} | 0.764 | 0.952 |
| Science | RandomForest | {’max_depth’: 15, ’n_estimators’: 500} | 0.549 | |
| Science | GradientBoostingRegressor | {’learning_rate’: 0.05, ’max_depth’: 7, ’n_estimators’: 200} | 0.546 | 0.591 |
Minimal MSE on test set is showed in bold font
Estimation results for Journal dataset
| DataSet | Model | best_params | Train MSE | Test MSE |
|---|---|---|---|---|
| Journal | LinearRegression | {} | 0.760 | 0.958 |
| Journal | RandomForest | {’max_depth’: 15, ’n_estimators’: 200} | 0.542 | |
| Journal | GradientBoostingRegressor | {’learning_rate’: 0.05, ’max_depth’: 5, ’n_estimators’: 500} | 0.559 | 0.598 |
Minimal MSE on test set is showed in bold font
Fig. 2Artificial “extreme” cases
Fig. 4Results on Science dataset
Fig. 5Results on Journal dataset