| Literature DB >> 34305252 |
Shaina Raza1, Chen Ding1.
Abstract
Nowadays, more and more news readers read news online where they have access to millions of news articles from multiple sources. In order to help users find the right and relevant content, news recommender systems (NRS) are developed to relieve the information overload problem and suggest news items that might be of interest for the news readers. In this paper, we highlight the major challenges faced by the NRS and identify the possible solutions from the state-of-the-art. Our discussion is divided into two parts. In the first part, we present an overview of the recommendation solutions, datasets, evaluation criteria beyond accuracy and recommendation platforms being used in the NRS. We also talk about two popular classes of models that have been successfully used in recent years. In the second part, we focus on the deep neural networks as solutions to build the NRS. Different from previous surveys, we study the effects of news recommendations on user behaviors and try to suggest possible remedies to mitigate those effects. By providing the state-of-the-art knowledge, this survey can help researchers and professional practitioners have a better understanding of the recent developments in news recommendation algorithms. In addition, this survey sheds light on the potential new directions. © Crown 2021.Entities:
Keywords: Beyond-accuracy; Datasets; Deep learning; Evaluation measures; News; Recommender system; User behavior
Year: 2021 PMID: 34305252 PMCID: PMC8294232 DOI: 10.1007/s10462-021-10043-x
Source DB: PubMed Journal: Artif Intell Rev ISSN: 0269-2821 Impact factor: 9.588
Challenges discussed in different NRS surveys
| Survey | Challenges discussed |
|---|---|
| Borges and Lorena ( | Accuracy |
| Karwa ( | Cold-start, data sparsity, recency, implicit user feedback, changing interests of users, scalability, unstructured content |
| Dwivedi and Arya ( | Data sparsity, changing users’ interests, news content, recommendation techniques |
| Karimi et al. ( | Recommendation paradigms, user modeling, cold start, data sparsity, recency, beyond-accuracy measures, scalability |
| Chakraborty et al. ( | Recency, relevance, diversity, accuracy, recommendation techniques |
| Li and Wang ( | Recency, popularity, massive and unstructured data |
| Qin and Lu ( | News content feature engineering |
| Feng et al. ( | Cold start, explicit user feedbacks, changing users’ interests |
Fig. 1Number of papers on NRS per year from mid-year 2012 till early year 2021
Fig. 2Mind map diagram of the survey paper
Fig. 3Algorithms used in NRS
Fig. 4Deep learning algorithms used in NRS from 2016 till early year 2021
Evaluation Metrics accuracy (acc): beyond-accuracy (beyond-acc)
| Metric | Description | Type |
|---|---|---|
| Accuracy | Number of correct predictions over total predictions | acc |
| Precision (prec) | Proportion of relevant items over total recommendations | acc |
| Recall (rec) | Proportion of relevant items over total relevant items | acc |
| F1-score (F1) | Weighted average of the precision and recall | acc |
| Mean Reciprocal Rank (MRR) | Multiplicative inverse of rank of the first correct item | ranking, acc |
| Mean Average Precision (MAP) | The average precisions across all relevant queries | ranking, acc |
| Percentile-ranking within the ordered list items | ranking, acc | |
| Cumulative Rating | Total relevance of all documents above a rank position in top recommended items | ranking, acc |
| Success | Current item that is in a set of recommended items | ranking, acc |
| Novelty (nov) | Ratio of unseen items over recommended items | beyond-acc |
| Serendipity (seren) | Measure of unexpectedness with relevance (relevance is 1 if recommended item is interacted with otherwise 0) | beyond-acc |
| Coverage (cov) | Percent of items that the model is able to recommend | beyond-acc |
| Diversity (div) | Degree of dissimilar recommendations either at system level (aggregate diversity) or for each user (individual diversity) | beyond-acc |
| Hit Rate (HR) | Ratio of hits in ranked items over the number of users | acc |
| Log-loss | Probability of a prediction input between 0 and 1 | acc |
| Average Reciprocal Hit Rate (ARHR) | A hit is inversely relative to its position in top recommendations | acc |
| Root-mean-square error (RMSE) | Difference between predicted and actual rating | acc |
| Click-through rate (CTR) | The likelihood of a news item that will be clicked | acc |
| Discounted Cumulative Gain (DCG) | Gain of an item as per to its position in recommendation list | acc |
| Area under curve (AUC) | ROC curve plots recall against fallout (false positive rate) | acc |
| Customer Satisfaction Index (CSI) | The satisfaction degree of user on the recommendations | beyond-acc |
| Personalized (NRS-specific) | Current item that is in a set of recommended list but is not among the popular items (Garcin et al. | relevancy, acc |
| Saliency (NRS-specific) | A function of entities’ frequency in news articles, with a decay factor (Cucchiarelli et al. | beyond-acc |
| Future-Impact (NRS-specific) | Evaluate a tradeoff between recency and relevancy (Chakraborty et al. | beyond-acc |
| Tradeoff (NRS-specific) | Evaluate a tradeoff between high accuracy and reasonable diversity (Raza and Ding | beyond-acc |
| Senti (NRS-specific) | Evaluate sentiment diversity of recommendations, motivated by MRR and hit ratio (Wu et al. | beyond-acc |
Fig. 5Distribution of accuracy and beyond-accuracy aspects in NRS
Fig. 6Distribution of evaluation approaches used in NRS
Fig. 7Distribution of datasets in NRS
Algorithms (alg.), challenge, solution, dataset, evaluation (eval) metric: accuracy (acc): beyond-accuracy (beyond-acc), and evaluation protocol (protocol) for NRS papers
| Paper | Alg | Challenge | Solution | Dataset | Eval. Metric | Protocol |
|---|---|---|---|---|---|---|
| Park et al. ( | CBF | Content quality | Bias detection | Private—Google news | F1-score (acc) | Offline, User Study |
| Xia et al. ( | CF | Timeliness | Time-decay | Private—Alibaba.com | CSI (beyond-acc), prec (acc) | Offline |
| De Francisci Morales et al. ( | CBF | Timelines User modeling | Time-decay Microblog | Private—Twitter, Yahoo | MRR (acc), cov (beyond-acc) | Offline |
| Agarwal et al. ( | CBF | User modeling | Ontology | Private—RSS news feed | prec, rec (acc) | Offline |
| Prawesh and Padmanabhan ( | CBF | Timeliness | Popularity | – | – | – |
| Boutet et al. ( | CF | User modeling Content quality | Feature-based Bias detection | Private—Arxiv,Digg | prec, rec (acc) | Offline, User Study |
| Ilievski and Roy ( | CBF | User modeling | Knowledge-based | Private—German news papers | Offline | |
| Li and Li ( | CBF, CF | User modeling | Feature-based | Private—not mentioned | prec, rec, F1-score, NDCG (acc), div (beyond-acc) | Offline |
| Jonnalagedda and Gauch ( | CBF, CF | Timeliness | Popularity-based | Private—Twitter, CNN and BBC | Accuracy (acc) | Offline, User Study |
| Garcin et al. ( | CBF | Timeliness | Graph-based | Private—Tribune de Geneve and 24heures.ch | Personalized (acc), nov (beyond-acc) | Offline |
| Gu et al. ( | CBF | User modeling | Microblog | Private—news.sina.com | F1, prec, rec (acc), div, nov (beyond-acc) | Offline |
| Asikin and Wörndl ( | CBF | User modeling | Stereotypical | Private | seren (beyond-acc) | User study |
| Trevisiol et al. ( | CBF | Timeliness | Graph-based | Private—Facebook, Twitter, Reddit | prec, MRR (acc) | Offline |
| Oh et al. ( | CBF | User modeling | Feature-based | Private—Korean news | HR (acc) | Offline |
| Muralidhar et al. ( | CF, CBF | Timeliness User modeling | Time-decay Feature-based | Private—Washington Post | HR (acc) | Offline |
| Xiao et al. ( | CF | Timeliness, User modeling | Sequential Collaborative | Private Chinese news | prec, rec, F1 (acc) | Offline |
| Maksai et al. ( | CBF | Timeliness | Popularity | Private—Swissinfo.ch, Yahoo FrontPage, lePoint.fr | div, nov, seren, cov (beyond-acc), RMSE, CTR (acc) | Online, Offline |
| Jenders et al. ( | CBF | Content quality | Feature-based | Private—New York Times | seren (beyond-acc) | User Study |
| Garrido et al. ( | CBF | User modeling | Stereotypical | Private—heraldo.es | – | Offline |
| Lu et al. ( | CBF, CF | User modeling | Feature-based, Collaborative | Private—Bing Now news | Accuracy | Offline |
| Doychev et al. ( | CBF, CF | Timeliness | Popularity | Plista | prec, CTR (acc) | Online, Offline |
| Ma et al. ( | CBF, CF | User modeling | Stereotypical, Collaborative | Private—Bing Now news | MAP, MRR, CTR (acc) | Offline |
| Jonnalagedda et al. ( | CBF | User modeling | Microblog | Private—Twitter | NDCG (acc) | Offline |
| Chakraborty et al. ( | CBF | User modeling | Clickbait removal | Private—BuzzFeed & Wikinews | prec, rec, F1 (acc) | Offline, Online |
| Viana and Soares ( | CBF, CF | User modeling | Feature-based | – | – | User Study, Simulation |
| Rizos et al. ( | CBF | User Modeling | Feature-based, Microblog | Private—Reddit | Accuracy | Offline |
| Okura et al. ( | CBF | Content quality | Duplicate detection | Private—Yahoo FrontPage | CTR (acc) | Online |
| Guan et al. ( | CBF, CF | Timeliness | Deep neural network | Private—news.sohu.com | prec, rec, F1 (acc) | Offline |
| Robindro et al. ( | CBF | User modeling | Stereotypical | BBC news | NDCG (acc) | Offline |
| Khattar et al. ( | CF | Timeliness User modeling | Session-based, Knowledge-based | Private—Veooz.com news | MAP, Hit rate, NDCG (acc) | Offline |
| Okura et al. ( | CBF | Content quality Timeliness | Duplicate detection, Time decay | Private—Yahoo FrontPage | AUC, MRR, NDCG, CTR (Online) (acc) | Online, Offline |
| Kumar et al. ( | CBF | Content quality | Knowledge-based | Plista | HR, NDCG (acc) | Online, Offline |
| Cucchiarelli et al. ( | CBF | User modeling | Microblogging | Private—Twitter, Wikipedia | Saliency, seren (beyond-acc), MAP (acc) | Online, User Study |
| Constantinides and Dowell ( | CBF, CF | User modeling | Stereotyping | Private—Habito.com News | Accuracy | Offline, User Study |
| Jugovac et al. ( | CF | User modeling Timeliness | Feature-based Session-based | Outbrain.com, Plista | MRR, F1-Score (acc) | Online, Offline |
| de Souza Pereira Moreira ( | CBF CF | Timeliness | Session-based | Adressa, Globo.com | Rec, NDCG (acc) | Offline |
| Sottocornola et al. ( | CBF CF | Timeliness | Time-decay, Session-based | Private—not mentioned | prec (acc) | Offline |
| Lin et al. ( | CBF | User Modeling | PMF | Private—not mentioned | Accuracy | Offline |
| Yan et al. ( | CBF | User Modeling | NMF | Tweet data. News data crawled from sogou.com | Accuracy | Offline |
| Xia et al. ( | CF | Content Quality | BPR | Private—not mentioned | Accuracy | Offline |
| Wang et al. ( | CBF | User Modeling | TF | Not mentioned | − | − |
| Shu et al. ( | CBF | User Modeling | NMF | BuzzFeed, PolitiFact | Accuracy | Offline |
| Gharahighehi and Vens ( | CBF | User Modeling | BPR | Private- not mentioned | Accuracy | Offline |
| Raza and Ding ( | CBF | User Modeling, Timeliness | MF (baseline predictors) | NYTimes—crawled | prec,rec (acc) | Offline |
| Raza and Ding ( | CBF | User Modeling | GLM with CF | NYTimes—crawled | prec, rec (acc), div, nov (beyond-acc) | Offline |
Algorithms (alg.), DL mechanism, dataset, evaluation (eval) metric: accuracy (acc): beyond-accuracy (beyond-acc), and evaluation protocol (protocol) for NRS papers
| Paper | Alg | DL mechanism | Dataset | Eval. Metric | Protocol |
|---|---|---|---|---|---|
| Song et al. ( | CBF | RNNs (LSTM) for timeliness in user modeling for short-term preferences. Sent2Vec (BoW) for news representation | Private—crawled, not mentioned | prec, rec, F1, AUC, MAP, MRR (acc)y | Offline |
| Okura et al. ( | CBF | Denoising AE for news representations and RNNs (LSTM) for timeliness in user modeling for short-term preferences | Private -Yahoo Japan news | AUC, MRR, DCG, CTR (acc) | Offline |
| Wang et al. ( | CBF + CF | CNNs for news representations; Attention for user modeling | Private—crawled | AUC, prec, rec, F1 (acc) | Offline, online |
| Kumar et al. ( | CF | Doc2Vec for news representation, RNNs (Bidirectional LSTM) for timeliness in user modeling for short-term preferences | Private—crawled | HR, NDCG (acc) | Offline |
| Lian et al. ( | CBF | CNNs for news representations, Attention for user modeling | Private—Bing | AUC, Logloss (acc) | Offline |
| Wang et al. ( | CBF | Knowledge graph with CNNs for news representations, Attention for user modeling | Private—Bing news logs | CTR (acc) | Offline |
| Zheng et al. ( | CBF | RL, Deep Q-Network for news and user representations | Private—not given | CTR, prec, NDCG, (acc), div (beyond-acc) | Offline |
| Yu et al. ( | CBF | CNNs for news text, AE for images representation, MLP to recommend | Private—frnews.ifeng.com.ca | Accuracy | Offline |
| de Souza Pereira Moreira ( | CBF, CF | Word2Vec + CNN for news representations, RNNs (GRUs) for timeliness in user modeling for short-term preferences | Globo.com, Plista, Adressa | HR, MRR, Accuracy (acc); cov, nov, div (beyond-acc) | Online and Offline |
| Zhang et al. ( | CBF | CNNs for news representations, RNNs (GRUs) for timeliness in user modeling for short-term preferences | Addressa | prec, rec, AUC (acc) | Offline |
| Zhu et al. ( | CBF | CNNs for news representations, RNNs (GRUs) for timeliness in user modeling for short-term preferences | Addressa | prec, rec, AUC (acc) | Offline |
| Cao et al. ( | CBF | Stacked AE | Movielens (not news) | prec, rec, AUC (acc) | Offline |
| An et al. ( | CBF, CF | CNNs for news representations, RNNs (GRUs) for timeliness in user modeling for short-term preferences | MIND | AUC, NDCG, MRR (acc) | Offline |
| Wu et al. ( | CBF | CNNs for news representations, Attention for user modeling | MIND | AUC, NDCG, MRR (acc) | Offline |
| Wu et al. ( | CBF, CF | CNNs for news representations, Attention for user modeling | MIND | AUC, NDCG, MRR (acc) | Offline |
| Wu et al. ( | CBF, CF | Transformer for news representations, GNNs for user modeling | MIND | AUC, NDCG, MRR (acc) | Offline |
| Wu et al. ( | CBF, CF | Attention for news and user representations, MSE loss in news module for diversity | MIND | AUC, NDCG, MRR (acc), Senti (beyond-acc) | Offline |
| Lee et al. ( | CBF, CF | GNNs for news and user representations | – | – | – |
| Yang et al. ( | CBF, CF | Knowledge Graphs + CNNs for news; Attention for user modeling | Private-Wikidata, Weibo | prec, rec, F1, AUC (acc) | Offline |
| Wu et al. ( | CBF, CF | BERT for news representations; Attention for user representation | MIND | AUC, NDCG, MRR (acc), Senti(beyond-acc) | Offline |
Post-algorithmic challenges in the NRS and the solutions
| Paper | Challenge | Algorithmic Cause | Effect on readers’ behavior | Solution |
|---|---|---|---|---|
| Dandekar et al. ( | Polarization | Narrowed readers’ exposure | Causes denial to others’ viewpoint | Degroot's graphical model of opinion |
| Resnick et al. ( | Filter bubble | Over personalization | Creates filter bubbles, polarization | Selective exposure, diversity, nudge theory |
| Beam ( | Counter-attitudinal | Over personalization | Affects readers’ acceptance to opposing viewpoints | Selective exposure |
| Li et al. ( | Filter bubble | Over personalization | Readers get bored of old news | Budgeted maximum coverage |
| Maksai et al. ( | Filter bubble | Over personalization | Readers get bored of similar news stories | Trade-off among various evaluations |
| Flaxman et al. ( | Filter bubble, Echo chambers | Ideological segregation | Affects voters and functioning democracies | Selective exposure |
| Allcott and Gentzkow ( | Filter bubble, Echo chambers | Algorithmic fake news | Separates readers from contradictory perspectives | Selective exposure |
| DiFranzo and Gloria-Garcia ( | Filter bubble, | |||
| Echo chambers | Social media spread fake news | Causes readers’ denial to others’ opinions | Diversity-aware methods and nudges | |
| Möller et al. ( | Filter bubble | Accuracy-centric algorithms | Extreme opinions, misinterpreted facts | Diversity in re-ranking |
| Helberger ( | Filter bubble | Over personalization | Affects negatively on the democracy | Exposure diversity, ban manipulative practices |
| Zheng et al. ( | Filter bubble | Over personalization | Readers get bored of similar news stories | Dueling Bandit Gradient Descent for diversity |
| Chakraborty et al. ( | Filter bubble | Over personalization | Readers tend to get bored of similar news stories | Future-impact metric |