| Literature DB >> 33071400 |
Mohamad Arafeh1, Paolo Ceravolo1, Azzam Mourad2, Ernesto Damiani3, Emanuele Bellini4.
Abstract
Online Social Network (OSN) is considered a key source of information for real-time decision making. However, several constraints lead to decreasing the amount of information that a researcher can have while increasing the time of social network mining procedures. In this context, this paper proposes a new framework for sampling Online Social Network (OSN). Domain knowledge is used to define tailored strategies that can decrease the budget and time required for mining while increasing the recall. An ontology supports our filtering layer in evaluating the relatedness of nodes. Our approach demonstrates that the same mechanism can be advanced to prompt recommendations to users. Our test cases and experimental results emphasize the importance of the strategy definition step in our social miner and the application of ontologies on the knowledge graph in the domain of recommendation analysis.Entities:
Keywords: Big data; Data analysis; Data miner; Data sampling; Ontology; Recommender system; Social network
Year: 2020 PMID: 33071400 PMCID: PMC7546693 DOI: 10.1016/j.future.2020.09.030
Source DB: PubMed Journal: Future Gener Comput Syst ISSN: 0167-739X Impact factor: 7.187
Fig. 2The architectural hierarchy.
Fig. 3The system workflow.
Fig. 1Relationships between nodes in a graph database.
Fig. 4Recommender system component workflow.
Fig. 5Academic ontology model.
Fig. 6Example of a labeled graph result.
Set 1 data overview.
| Seed | 1 | 2 | 3 | 4 |
|---|---|---|---|---|
| Italy | 62% | 27% | 50% | 25% |
Set 1 mining strategies.
| S1 | S2 | S3 | S4 | S5 | |
|---|---|---|---|---|---|
| Exploring algorithm | BF | RW | MH | RW | MH |
| Account fetched | 10% | 10% | 10% | 5% | 5% |
| Location filter | Italy | Italy | Italy | Italy | Italy |
| Iterations | – | 500 | 500 | 500 | 500 |
| Forward weight | – | 0.2 | 0.2 | 0.2 | 0.2 |
| Distribution | – | – | Normal | – | Normal |
Fig. 7Density of the Italian accounts.
Set 2 mining strategies.
| S1 | S2 | S3 | S4 | |
|---|---|---|---|---|
| Exploring algorithm | BF | BF | RW | MH |
| Account fetched | – | 50 | 50 | 50 |
| Max depth | 8 | 3 | – | – |
| Iterations | – | – | 500 | 500 |
| Forward weight | – | – | 0.8 | 0.8 |
| Distribution | – | – | – | Normal |
Community detected from Set 2.
| Strategy | Communities detected |
|---|---|
| S1 | 29 |
| S2 | 14 |
| S3 | 19 |
| S4 | 10 |
Set 2 execution time.
| S1 | S2 | S3 | S4 | |
|---|---|---|---|---|
| API Req. | 13 650 | 379 | 289 | 177 |
| Exec. time (s) | 56 456 | 1879 | 1400 | 970 |
Fig. 8Level 1 comparison.
Fig. 9Level 2 comparison.
Fig. 10MovieLens representative ontology.
Fig. 11Recommendations scores precision, recall.
Fig. 12Recommendations scores F1-test.
Fig. 13Recommendations scores MAE.