Literature DB >> 30286123

An improved memory-based collaborative filtering method based on the TOPSIS technique.

Hael Al-Bashiri¹, Mansoor Abdullateef Abdulgabber¹, Awanis Romli¹, Hasan Kahtan¹.

Abstract

This paper describes an approach for improving the accuracy of memory-based collaborative filtering, based on the technique for order of preference by similarity to ideal solution (TOPSIS) method. Recommender systems are used to filter the huge amount of data available online based on user-defined preferences. Collaborative filtering (CF) is a commonly used recommendation approach that generates recommendations based on correlations among user preferences. Although several enhancements have increased the accuracy of memory-based CF through the development of improved similarity measures for finding successful neighbors, there has been less investigation into prediction score methods, in which rating/preference scores are assigned to items that have not yet been selected by a user. A TOPSIS solution for evaluating multiple alternatives based on more than one criterion is proposed as an alternative to prediction score methods for evaluating and ranking items based on the results from similar users. The recommendation accuracy of the proposed TOPSIS technique is evaluated by applying it to various common CF baseline methods, which are then used to analyze the MovieLens 100K and 1M benchmark datasets. The results show that CF based on the TOPSIS method is more accurate than baseline CF methods across a number of common evaluation metrics.

Entities: Chemical Disease Species

Mesh：

Year: 2018 PMID： 30286123 PMCID： PMC6171847 DOI： 10.1371/journal.pone.0204434

Source DB: PubMed Journal: PLoS One ISSN： 1932-6203 Impact factor: 3.240

Introduction

Traditional information outlets—including friends, newspapers, advertisements, and mass media—have been increasingly supplanted by the Internet as a source for advice and guidance in decision making. Although the Internet is a powerful resource, the vast quantity of data available online can make it difficult to obtain the information needed to make decisions efficiently [1]. Research on the problem of online information overload has led to the development of tools, such as recommendation systems (RSs), that assist users in effective decision making [2]. RSs make suggestions to users based on preferences inferred from prior selections [3-10], thus reducing the time and effort required to make online selections [11]. This process depends on users’ historical behavior and involves the construction of user profiles for comparison with those of other users to locate his/her nearest neighbors in terms of preference. This process results in a list of items that are predicted to be most preferred by the user [3, 6, 7, 12]. Moreover, RS studies are of significant interest to a variety of entities and have been the focus of intensive academic and commercial research [13, 14]. Many commercial and nonprofit websites, including Amazon, eBay, and Lazada, use RS to assist their customers in purchasing items by making suggestions based on their previous selections and those of the most-similar customers. Such recommendations have become an integral aspect of e-commerce platforms and are used to personalize the shopping experience [15]. In general, RSs can be classified as either content-based, collaborative filtering (CF)-based, or a hybrid of the two [6, 8, 12, 14, 16–21]. In content-based approaches, items are recommended to the target user by comparing the information content of his/her past item selections with the content of items in the database [6, 22–26]. In contrast, CF-based systems propose items based on an analysis of user feedback along with the preferences of similar users [3, 27–32]; this additional robustness makes CF the most widely used and successful RS method. CF approaches can be further classified into model- and memory-based techniques [1, 33–35]. Model-based approaches apply a pre-built model for predicting user preferences, whereas memory-based approaches (also known as neighbor-based models) access entire databases of user-provided ratings to find correlations between users/items. Memory-based recommendation algorithms can generally be further subdivided into user- and item-based approaches [1, 12, 32, 36]. This paper addresses the application of user-based algorithms. Such algorithms use two key processes—similarity computing and prediction. In the computing process, the system seeks to find relationships between users, and those who are strongly correlated are designated as the neighbors of the target user. Any items rated by these neighbors that have not yet been purchased or obtained by the target user are then assembled into a set of candidate items. In the second process, the system predicts a user score for each item in the candidate set and promotes the highest-rated items as recommendations. This process of evaluating and ranking candidate items is therefore quite significant to the performance accuracy of the CF algorithm. Thus, the essential problem in information filtering is calculating whether a specific item is likely to be of interest to a user. The outcome of this process can be either Boolean (yes or no) or a score representing the degree to which the item is of interest. Unfortunately, most studies on improving the accuracy of conventional CF systems have focused solely on enhancing the similarity measure [3, 5, 6, 11, 37–51]. In contrast, improving the prediction algorithm has been somewhat neglected, even though it is of similar importance in improving memory-based CF recommendations [33, 52]. Prediction algorithms produce user preference scores for items using common aggregation methods. In this paper, we propose a method for enhancing the accuracy of memory-based CF recommendations by replacing the conventional prediction algorithm with TOPSIS, which is one of the most frequently used techniques for the evaluation and ranking of multiple alternatives. As mentioned above, the majority of studies on enhancing the accuracy of CF have focused on improving the similarity measure, with relatively few investigating the prediction score models, even though these are of similar importance [53]. In this study, we investigated the use of TOPSIS as an alternative to prediction models for improving the accuracy of user-based CF. The proposed method applies TOPSIS in the evaluation and sorting of items rated by nearest-neighbor users to produce a set of Top-M ranked recommendations. The TOPSIS method can be described as a measurement technique based on the use of defined criteria to rank sets of alternatives, and is widely used as a tool in decision support problems. TOPSIS is useful in evaluating, sorting, and selecting from a variety of available options [54]. The remainder of this paper is organized as follows. The following section provides an overview of traditional collaborative RSs, discusses relevant memory-based CF methods, and outlines the TOPSIS method. The proposed TOPSIS-based recommendation method is then presented, before the experimental methodology and results are discussed. Finally, the conclusions to this study are provided with suggestions for future work.

Related work

Collaborative filtering techniques

The term “collaborative filtering” was first applied by Goldberg to the Tapestry recommender system [55], and CF has since become one of the most widely used techniques for providing service recommendations to users online [11, 56, 57]. As discussed in the Introduction, CF can be either model-based or memory-based. In model-based approaches, a pre-built model is used to predict user preferences [14]. The most widely used approaches, involving the use of Bayesian networks or cluster models, were proposed in [58, 59]; the use of latent factor models was subsequently proposed in [60, 61]. Whereas Memory-based approaches compute the correlations between users and items to produce a preference score that predicts the likelihood of a user acquiring an item in the future and provide corresponding recommendations. User- and item-based algorithms are the most common types of memory-based recommendation methods [1, 12, 36]. User-based methods generate recommendations according to the similarities between users [29], whereas item-based methods compute similarities within a space of items to find strong relationships with items that have already been rated by an active user [29, 62]. User-based CF, the first automated CF method to be developed [28], was initially applied in the Group Lens Usenet article recommender [3], and is currently used in the BellCor video and Ringo music recommenders [4, 5]. This technique essentially involves four stages: user-to-user correlations are applied to find the most similar users to a target user (the neighbors) [29]; after collecting items rated by neighbors, those that have already been obtained by the target user are removed, leaving a set of candidate items; a degree of preference score is generated to determine the likelihood of future purchase by the target user for each candidate item; based on their respective prediction scores, the items are ranked and a list of recommendations comprising the items with the highest ranks is generated. In item-based CF, which was first proposed by Karypis and Sarwar [29], similarities among items are calculated according to other users’ evaluations. Generally, item-based CF follows the same steps as the user-based method, except that relationships are calculated across the space of items. Common similarity measures and their limitations were discussed in [63, 64]. In the next section, we briefly present the most commonly used conventional memory-based CF methods. Pearson’s Correlation Coefficient

Baseline memory-based CF methods

Resnick and Iacovou [3] used Pearson’s correlation coefficient (PCC) to find correlations among users in an approach that has become popular in memory-based CF. However, the PCC method can be inaccurate when the data are sparse, as missed ratings make it difficult to find correlations between users. This leads to high/low similarities and, therefore, weak recommendations [11, 65, 66]. The relationship among users can be defined as: where S(x,y) is the similarity between users x and y, I represents the set of items that are rated by both x and y, and denote the average ratings by users x and y, respectively, and r denotes the rating value given to item i by user x. Constrained Pearson Correlation The RINGO recommender was developed to provide users with recommendations of music albums and artists. Under RINGO, users provide feedback on a nominal scale from one (“strong dislike”) to seven (“strong like”), with a neutral value (“neither like nor dislike”) in the middle of the scale. Based on the increasing number of RINGO users, Shraddhanand and Mae [5] proposed the constrained Pearson correlation (CPCC) approach to replace the average rating variables used by PCC approaches with the median value of a scale of positive and negative ratings. The correlation is calculated as: where r denotes the median value of the rating scale. Cosine method The cosine method is a vector-space model that applies a linear algebra approach to define the relationships between pairs of users [6] as vectors, with user similarities computed as the cosine distance between each pair of rating vectors. This correlation is defined as: Jaccard method Koutrika and Bercovitz [58] proposed the Jaccard method to compute the correlations between pairs of users. The Jaccard method only considers the number of co-ratings for each user pair to define their relationship. Two users will have a strong correlation if they have similar rating patterns, and vice versa. However, the Jaccard computation process does not consider the absolute values of ratings [44, 45]. Formally, the similarity between users x and y is given by: where |I ∪ I| represents the union set of items rated by users x and y. Sigmoid function-based PCC Jamali and Ester [46] used a sigmoid function to decrement the similarity values between items for which few users have rated both items. The sigmoid function-based PCC (SPCC) approach produces similarity values in the range [0, 1] using the following formulation: However, a pair of users with similar ratings can still have a low similarity under this approach. For example, two users with ratings vectors of u1 = (4,3,5,4) and u2 = (4,3,3,4) will have very similar ratings but an SPCC similarity of zero. Jaccard and mean squared difference measure Bobadilla and Serradilla [39] hybridized the Jaccard [67] method with a mean squared difference approach [5] to produce the JMSD measure, which is computed as follows: where The JMSD approach addresses the respective drawbacks of the Jaccard and mean squared difference approaches, but suffers from the cold-start problem, does not consider the credibility of common ratings, and is vulnerable to local information and the utilization of rating problems [45]. New heuristic similarity measure (proximity–significance–singularity) Liu and Hu [66] analyzed the drawbacks of Principles in Pattern approaches [38] and proposed an improved version called the new heuristic similarity measure (NHSM). The NHSM model considers three user rating factors—proximity, significance, and singularity (PSS)—and combines local context information on these ratings with the global preferences of user ratings to alleviate the cold-start problem [68]. However, NHSM only considers co-rated items in identifying relationships between users [44]. The measure is defined as: where PSS(r,r) is the PSS value of users x and y, which is calculated as: The individual aspects are given by: Liu and Hu further combined the PSS measures with the Jaccard measure to address the problem of small proportions of common ratings. This so-called JPSS measure is defined as: To account for cases arising when different rating preferences are provided by different users (i.e., high ratings provided by some and low ratings by others), they also developed a measure of user preference based on the rating mean and standard variance: where σ and μ are the mean rating and standard variance for user x, respectively, which are defined as: Their final formalization combined the JPSS and user rating preference metrics into the improved new heuristic similarity model, or improved NHSM, which is defined as: Note that prediction algorithms have not been mentioned in the above discussion, which instead has focused on improving the accuracy of memory-based CF through the development of similarity methods. In general, there are a number of mechanisms in the generation of recommendations that can predict the score for target user x with respect to item i. Many such methods involve aggregation (see Table 1) [69]. In this paper, we propose replacing such conventional methods with the TOPSIS approach to obtain improved recommendations.

Table 1

Aggregation methods.

Algorithm	Formula
Average method	Px,i=1/\|Gx,i\|∑y∈Gx,iry,i, where G_x,i ≠ ∅
Weighted sum method	Px,i=∑y∈Gx,is(x,y)*ry,i∑y∈Gx,is(x,y), where G_x,i ≠ ∅
Adjusted weighted method (Deviation-From-Mean)	Px,i=rx¯+∑y∈Gx,is(x,y)*(ry,i−ry¯)∑y∈Gx,is(x,y),where G_x,i ≠ ∅

In the formulations in Table 1, P represents a prediction in the form of a numeric score representing how interested target user x would be in a specific item i based on their similarities to and ratings by his/her K neighbors, represents a set of users who are neighbors of user x and have rated item i, and denotes the average rating of the users. In the next subsection, TOPSIS is introduced as a useful multi-attribute decision-making (MADM) technique for the ranking and selection of a number of alternatives based on several criteria.

Multi-attribute decision-making method

As mentioned in the preceding section, most studies on improving the accuracy of CF have focused on improving the similarity measure, even though the prediction score model is of similar importance [53]. In memory-based CF, after locating a target user’s neighbors, the system collects their items and predicts the rating scores that the target user would apply to them; the items are then ranked and recommended according to these predicted scores. Clearly, the prediction algorithm plays an important role in this process. As a replacement for prediction, we propose the use of TOPSIS as a useful MADM method for evaluating and ranking items. The numerous alternatives people face online can render the decision-making process difficult, particularly when called upon to rank or choose the best alternative from a set of available items. In general, multiple criteria are used to evaluate sets of alternatives. For example, the main criteria in purchasing a car include cost, safety, comfort, and fuel consumption. Multi-criteria decision making (MCDM), one of the better-known approaches for deciding among alternatives, can be applied when the decision maker’s preferences must be taken into account. The literature divides MCDM problems into two basic approaches [70]: multi-objective decision making (MODM) and multi-attribute decision making (MADM). MADM problems are distinguished from MODM problems by the number of predetermined decision alternatives. In MADM, decision problems are subjected to a number of decision criteria to produce rankings of multiple alternatives according to their attributes. This primarily involves gathering information and evaluating it against additional information provided by the decision maker, resulting in a decision matrix that is used to determine the final ranking of alternatives [71]. Hwang and Yoon [72] describe several MADM methods, including the TOPSIS. Originally presented by Yoon and Hwang [73], TOPSIS is a practical method for ranking and selecting several externally determined alternatives through the use of distance measures [74]. The primary advantages of TOPSIS include its ability to quickly identify the best alternative [75] and comparable or superior performance to that of simple additive weighting and analytic hierarchy processes, respectively [76]. A limited number of simple inputs (i.e., the weights associated with the respective criteria [76]) are required of decision-makers, and the output of the process is easy to understand. The underlying principle of TOPSIS is that the best alternative is that located closest to the ideal solution and furthest from the negative ideal solution [77]. The TOPSIS technique is implemented in several computational steps, which are outlined as follows: determine the decision alternatives; identify the criteria (attributes) that are related to the decision problem; construct a decision matrix containing m alternatives associated with n attributes (or criteria); normalize the raw scores to construct a precedence score, or normalized decision, matrix. The scores in the normalized matrix should be transformed into a normalized scale; construct a weighted normalized decision matrix in which each attribute is given a specific weight to reflect how important it is to the overall decision; identify the ideal and negative-ideal solutions; calculate the separation measure as an n-dimensional Euclidean distance between alternatives; calculate the relative closeness of each alternative to the ideal solution; create a ranking of alternatives based on the maximization of the relative closeness measures in the preceding step. These steps will be explained in more detail in the next section.

Proposed memory-based CF using TOPSIS

The proposed technique involves the application of TOPSIS to the recommendation of sets of items that might be of interest to a user. This is implemented over several main phases, as shown by the architecture in Fig 1.

Fig 1

Architecture of the proposed memory-based CF method.

The phases of the proposed method are summarized as follows: Build user profile: The system gathers feedback from a target user to build his/her preference profile. Such preferences are conventionally associated with a scale of values representing the degree of user preference for an item, e.g., one-to-five stars or one-to-ten points. A user x rating movie a with a “five” score and movie b with a “three” score could therefore be seen to prefer a over b. Construct user-item matrix: Data relating to users and items in the system are entered into a user-item matrix as a collection of numerical ratings. Compute similarity measures: The similarities among users are calculated using several common CF baseline methods (e.g., PCC, CPCC, SPCC, Cos, MSD, JMSD, NHSM). Following this, the top-K users with the strongest correlations in terms of similarity with the target user are used to form his/her neighborhood. Construct the decision matrix: The attributes of the K-nearest users are collected and used to populate a matrix of alternatives comprising items that have been rated by these users but not yet chosen by the target user. Items that have not been rated are assigned values based on a default vote [58]. Apply TOPSIS method: The TOPSIS method [71, 73] is then applied to evaluate and rank all of the alternative items. As discussed in the next section, TOPSIS identifies the best alternative as the one with the shortest and furthest distances from the ideal and negative-ideal solutions, respectively. TOPSIS allows the best alternative to be identified quickly [75], is easy to implement, requires only a limited number of inputs from decision-makers, and produces easily understandable output. The only input parameters are the weight values associated with the criteria [76]. This main phase of the process will be explained in detail in the following subsection. Generate recommendations: As described above, the output produced by TOPSIS is a list of sorted alternatives (candidate items) ranked according to an importance measurement based on several criteria (K-neighbors). In the final phase of the recommendation process, the Top-M items are selected and presented to the target user as a set of item suggestions.

TOPSIS technique

In the proposed method, the TOPSIS technique is used in place of prediction ratings to evaluate and rank candidate items and produce a sorted list of item recommendations in terms of their predicted preference. An essential input to this procedure is the list of K-neighbors and their items, ratings, and similarity weights with respect to the target user. TOPSIS converts this selection and ranking problem into a decision matrix X with m alternatives (rows) and n criteria (columns) corresponding to the candidate items and K-neighbors, respectively. In X, each entry x represents the numerical outcome of the j alternative with respect to the i criterion, i.e., the rating value applied by user i to item j. To avoid division by zero during execution, missing ratings are represented by an average for each user. Because the criteria cannot be assumed to have equal importance, a set of weighting parameters provided by the decision-maker is associated with the criteria. These weights are then compared to those of the decision-maker neighbors to obtain the set of K-neighbors. Before examining the functioning of TOPSIS in detail, we define the sets used in the analysis: A is the set of candidate items representing the alternatives A = {a1,a2,…,a,…,a,a}, where j = 1,2,…,m and m is the total number of candidate items. C is the set of neighbors representing the various criteria C = {c1,c2,…,c,…,c,c}, where i = 1,2,…,n and n denotes the number of criteria (K-neighbors). X is the set of ratings X = {x|j = 1,…,m; i = 1,…,n}, where x is the rating value of the j alternative/item with respect to the i criterion/neighbor user. W is the set of weights W = {w1,w2,…,w,…,w,w|i = 1,2,…,n}, where w is the weight of the i criterion/neighbor (i.e., the similarity value between the i neighbor and the target user). A decision matrix X containing m alternatives associated with n criteria is represented in Table 2.

Table 2

Conceptual decision matrix X.

			Neighbors
X =			c₁	c₂	⋯	_ci	⋯	c_n−1	c_n
	Candidate Items	a₁	x_1,1	x_1,2	⋯	x_1,i	⋯	x_1,n−1	x_1,n
		a₂	x_2,1	x_2,2	⋯	x_2,i	⋯	x_2,n−1	x_2,n
		⋮	⋮	⋮	⋱	⋮	⋱	⋮	⋮
		a_j	x_j,1	x_j,2	⋯	x_j,i	⋯	x_j,n−1	x_j,n
		⋮	⋮	⋮	⋱	⋮	⋱	⋮	⋮
		a_m−1	x_m−1,1	x_m−1,2	⋯	x_m−1,i	⋯	x_m−1,n−1	x_n−1,n
		a_m	x_m,1	x_m,2	⋯	x_m,i	x_m,1	x_m,n−1	x_n,n

The steps in the TOPSIS method are described as follows. Step 1: Construct a normalized decision matrix Some users prefer to provide high ratings, even for items they do not like very much, whereas others will give low ratings to items they like. To account and adjust for such rating disparities and irregularities, it is necessary to normalize the decision matrix. This can be done through distributive normalization, in which the rating values in each column are divided by the square root of the sum of each squared alternative in the column. The elements r of the normalized decision matrix R are therefore given by: The results of applying Eq (16) to matrix X to produce the normalized matrix R are presented in Table 3.

Table 3

Conceptual normalized decision matrix R.

			Neighbors
R =			c₁	C₂	⋯	c_i	⋯	c_n−1	c_n
	Candidate Items	a₁	r_1,1	r_1,2	⋯	r_1,i	⋯	r_1,n−1	r_1,n
		a₂	r_2,1	r_2,2	⋯	r_2,i	⋯	r_2,n−1	r_2,n
		⋮	⋮	⋮	⋱	⋮	⋱	⋮	⋮
		a_j	r_j,1	r_j,2	⋯	r_j,i	⋯	r_j,n−1	r_j,n
		⋮	⋮	⋮	⋱	⋮	⋱	⋮	⋮
		a_m−1	r_m−1,1	r_m−1,2	⋯	r_m−1,i	⋯	r_m−1,n−1	r_n−1,n
		a_m	r_m,1	r_m,2	⋯	r_m,i	r_m,1	r_m,n−1	r_n,n

Step 2: Construct the weighted normalized decision matrix To take the weights W provided by the decision-maker into account, a weighted normalized decision matrix V is given by multiplying the normalized values r by their corresponding weights w. In the proposed method, the similarity weights of the target user with respect to his/her neighbors are used to develop the user’s weight criteria. For example, for a target user u who has k neighbors (with n criteria), the similarity weights s = {s,s,…,s,…,s,s|,i = 1,2,…,n}, where s denotes the similarity value between u and the i neighbor, are used to populate the set of weights w. The weighted normalized decision matrix V is then obtained as follows: Table 4 presents a weighted normalized decision matrix V obtained by applying Eq (17) to the normalized decision matrix R.

Table 4

Conceptual weighted normalized decision matrix V.

			Neighbors
V =			c₁	c₂	⋯	c_i	⋯	c_n−1	c_n
	Candidate Items	a₁	v_1,1	v_1,2	⋯	v_1,i	⋯	v_1,n−1	v_1,n
		a₂	v_2,1	v_2,2	⋯	v_2,i	⋯	v_2,n−1	v_2,n
		⋮	⋮	⋮	⋱	⋮	⋱	⋮	⋮
		a_j	v_j,1	v_j,2	⋯	v_j,i	⋯	v_j,n−1	v_j,n
		⋮	⋮	⋮	⋱	⋮	⋱	⋮	⋮
		a_m−1	v_m−1,1	v_m−1,2	⋯	v_m−1,i	⋯	v_m−1,n−1	v_n−1,n
		a_m	v_m,1	v_m,2	⋯	v_m,i	v_m,1	v_m,n−1	v_n,n

Step 3: Determine positive and negative ideal solutions The best and worst evaluation alternatives for each criterion in the normalized decision matrix V are then identified and used to represent the ideal and negative-ideal solutions, respectively. For a set of positive attributes or criteria I1 associated with benefit (more is better) and a set of negative attributes or criteria I2 associated with cost (less is better), the positive- and negative-ideal solutions can be defined as follows: Ideal solution: Negative-ideal solution: The alternatives A* and A′ represent the most-favored (ideal solution) and least-favored (negative-ideal solution) options, respectively. Step 4: Calculate the separation measure The distance from each alternative to the ideal and negative-ideal solutions for all alternatives can be calculated using the Euclidean distance measurement. The distance of each alternative from the ideal is given by: Similarly, the distance of each alternative from the negative-ideal is given by: Table 5 gives an example of a separation matrix (V′).

Table 5

Conceptual separation matrix V′.

V′ =			S*	S′
	Candidate items	a₁	S₁*	S₁′
		a₂	S₂*	S₂′
		⋮	⋮	⋮
		a_j	S_j*	S_j′
		⋮	⋮	⋮
		a_m−1	S_m−1^*	S_m−1′
		a_m	S_m^*	S_m′

Step 5: Calculate the relative closeness to the ideal solution The degree of closeness of each alternative to the ideal solution A* is calculated as The relative closeness rating ranges between zero and one; these extremes represent, respectively, the least- and most-favored alternatives. To elaborate, if the distance of alternative a from the ideal solution A* is smaller than its distance from the negative-ideal A′, then C* will be closer to one than to zero, and vice versa, as shown in Fig 2.

Fig 2

Euclidean distances to the ideal and negative-ideal solutions.

Step 6: Ranking the alternatives in order according to C* To produce an outcome in the form of a sorted list of alternatives, TOPSIS determines a preference order by arranging the alternatives in descending order of closeness degree C*.

Experimental setup and results

Datasets

Experiments were performed using four widely used and publicly available datasets, namely MovieLens 100K and 1M, HetRec2011, and FilmTrust. The MovieLens [78] 100K and 1M datasets, collected by the GroupLens research group at the University of Minnesota (http://grouplens.org/datasets/MovieLens/), are often used by CF systems [11, 51]. For this study, they were used to evaluate the performance of the proposed technique in combination with several common memory-based CF methods. The MovieLens 100K dataset, which was initially released in April 1998, includes 100,000 ratings of 1,682 movies provided by 943 users. It only captures users who have rated 20 or more movies. The 1M MovieLens dataset, which was initially released in February 2003, contains 1,000,209 ratings of approximately 3,900 movies from 6,040 users. In both datasets, the ratings are given on a scale of one to five stars with a one-star granularity. The sparsity values of 100k and 1M are 93.7 and 95.8%, respectively. The HetRec2011-MovieLens dataset is an extension of a dataset published by GroupLens (http://grouplens.org/). This dataset was released in the framework of the 2nd International Workshop on Information Heterogeneity and Fusion in Recommender Systems [79]. HetRec2011-MovieLens consists of 855,598 ratings provided by 2,113 users on 10,197 movies and has 96.03% sparsity. The FilmTrust dataset (https://www.librec.net/datasets.html) contains 35,497 ratings provided by 1,986 users on 2,071 items and has 98.86% sparsity [80].

Experimental process

The experimental process to evaluate the proposed method was conducted as follows: Each dataset was partitioned into five equally sized sets to allow the cross-validation method to be applied [81]. In five separate trials, one subset was used as the test set (20%) and the other four were combined to form a training set (80%), with the test and training set roles rotated across trials. The average result across all trials was then computed. Based on the user-item rating matrix, the similarity between users was calculated using PCC, CPCC, SPCC, COS, MSD, JMSD, and NHSM. A set of K-nearest neighbors was then formed for the results produced by each similarity method. The items ranked by each K-nearest neighbor set were collected and any items that the active user had previously selected were removed to obtain sets of candidate items. Decision matrices were constructed and the TOPSIS technique was applied to them to obtain sets of ranked items. Finally, the top M items were identified as recommendations and presented to the target user. The accuracy of CF recommender systems is influenced by two parameters, namely the number K of neighbors and the size of the recommendation list. These two parameters should be fixed in the experimental process to ensure a fair comparison among algorithms [12, 82]. Hence, the experiments were executed with K values of 10, 20, 30, 40, and 50 and recommended list sizes of 10, 20, 30, 40, and 50. Fig 3 illustrates the experimental process with respect to input parameters, datasets, baseline CF methods with and without TOPSIS, and the measurements. A total of 25 experiments were conducted on each of the four datasets using all seven baseline methods with and without TOPSIS. Four metrics were selected to evaluate the proposed method. These metrics, which are described in the following subsection, are widely used to evaluate the accuracy of memory-based CF techniques.

Fig 3

Experimental process with respect to input parameters, datasets, methods with & without TOPSIS, and measurements.

Evaluation metrics

The TOPSIS technique was applied as an MADM approach in conjunction with various conventional memory-based CF methods, and the results were compared with those obtained without the use of TOPSIS. The recall, precision, and F-measure [69, 83], which are widely used to evaluate the accuracy of memory-based CF [15, 37, 84, 85], were used as performance metrics. These metrics measure the accuracy of a recommender system based on the items recommended to its users. The precision is the fraction of items rated by the users in the test set and recommended by the recommender system. The precision metric represents the ratio of the recommended items to the total number of items recommended by the system, and is given by Eq (23). The recall metric is the fraction of rated items recommended by a recommender system. The recall represents the ratio of the recommended items to all of the items rated by the users in the test set, and is defined by Eq (24). The F-measure metric is the weighted mean of the precision and recall, and is given by Eq (25). Thus, the F-measure is a combined metric of precision and recall. Table 6 illustrates the recommendation confusion matrix and its relation to these metrics.

Table 6

Recommendation confusion matrix.

	Rated	Unrated
Recommended	TP	FP
Not recommended	FN	TN

The terms in Table 6 are defined as follows: TP, true positive: number of test samples belonging to the user Interest that are Recommended. FN, false negative: number of test samples belonging to the user Interest that are not Recommended. TN, true negative: number of test samples not belonging to the user Interest that are not Recommended. FP, false positive: number of test samples not belonging to the user Interest that are Recommended. The precision, recall, and F-measure were computed using Eqs (23)–(25), respectively: The mean average precision (MAP) was also used to measure the accuracy of the ranking produced by each algorithm [2]. MAP computes the average of the precision scores over all recommendation sizes [86]. In this study, five recommendation sizes of 10, 20, 30, 40, and 50 were considered. Therefore, the Precision value of each specified index j (Precision@j) was computed separately. The MAP value was then normalized by dividing the sum of the Precision values for the specified indexes by the total number of specified indexes. The MAP value for L sets of specified indexes is calculated as: where Precision@j represents the precision of the j specified index in the recommendation list, j = 10, 20, 30, 40, and 50. L is a set of predefined indices and |L| represents the size of the specified indexes.

Results

To assess the accuracy of the proposed approach, the TOPSIS method was used to replace the prediction method in various memory-based CFs, which were then applied to the MovieLens 100K & 1M, HetRec20111, and FilmTrust datasets. Several trials were conducted using the cross-validation partitioning method and the results were assessed in terms of the recall, precision, F-measure, and MAP metrics. The results were used to construct bar graphs reflecting the accuracy over an averaged number of neighbors for K values of 10, 20, 30, 40, and 50. Figs 4–7 show the recall results produced by applying PCC, CPCC, SPCC, Cos, JMSD, and NHSM with and without TOPSIS. Figs 4 and 5 show the results obtained using the 100K and 1M datasets, respectively, whereas Figs 6 and 7 show the results under cross-validation partitioning using the HetRec2011 and FilmTrust datasets, respectively. The legend shows the recommendation size in each case (10, 20, 30, 40, or 50).

Fig 4

Recall measure by number of recommendations on 100K MovieLens.

Fig 7

Recall measure by number of recommendations on FilmTrust.

Fig 5

Recall measure by number of recommendations on 1M MovieLens.

Fig 6

Recall measure by number of recommendations on HetRec2011.

The results clearly show that the use of TOPSIS produces significant improvement in terms of recall, with the TOPSIS adaptation of the NHSM CF approach producing the best results across all cases. Conversely, the Cos and MSD CF methods produce the worst recall values. In general, the recall rises with the number of recommendations. Furthermore, the results in Figs 4–6 indicate that TOPSIS increases the accuracy by a factor two when applied to the PCC, CPCC, SPCC, MSD, and Cos methods; the same figures reveal more than three-fold enhancements to JMSD and NHSM using the 100K, HetRec2011, and FilmTrust datasets. Similarly, on the 1M dataset, there are three-fold enhancements for PCC, CPCC, SPCC, MSD, and Cos, and more than four- and six-fold enhancements for JMSD and NHSM, respectively. These results indicate that the application of TOPSIS to conventional methods can significantly improve the recall performance of memory-based CF. Figs 8–11 show the precision results obtained by PCC, CPCC, SPCC, Cos, MSD, JMSD, and NHSM with and without TOPSIS. Figs 8 and 9 show the results obtained using the 100K and 1M datasets, respectively, whereas Figs 10 and 11 show the results for the HetRec2011 and FilmTrust datasets, respectively. The legend shows the recommendation size of each case (10, 20, 30, 40, or 50).

Fig 8

Precision measure by number of recommendations on 100K MovieLens.

Fig 11

Precision measure by number of recommendations on FilmTrust.

Fig 9

Precision measure by number of recommendations on 1M MovieLens.

Fig 10

Precision measure by number of recommendations on HetRec2011.

The results indicate that the application of TOPSIS produces a significant improvement in precision across all cases. It is seen that TOPSIS-enhanced NHSM has the highest precision, although the Cos and MSD methods produce results that are nearly as good. The average result with respect to the number of recommended items increases from less than 0.05 under CF-NHSM to more than 0.2 under CF-TOPSIS-NHSM, representing a four-fold increase in precision for NHSM. Contrary to the recall results, the precision gradually decreases with the number of recommendations for all methods when TOPSIS is applied. Nevertheless, the results indicate that the application of TOPSIS significantly improves the precision accuracy of memory-based CF. Figs 12–15 compare the F-measures produced by PCC, CPCC, SPCC, Cos, JMSD, and NHSM with and without the TOPSIS method. Figs 12 and 13 show the results obtained using the 100K and 1M MovieLens datasets, respectively, whereas Figs 14 and 15 show the F-measure results using the HetRec2011 and FilmTrust datasets, respectively. The legend denotes the different recommendation sizes of 10, 20, 30, 40, or 50.

Fig 12

F-measure by number of recommendations on 100K MovieLens.

Fig 15

F-measure by number of recommendations on FilmTrust.

Fig 13

F-measure by number of recommendations on 1M MovieLens.

Fig 14

F-measure by number of recommendations on HetRec2011.

As with the other two metrics, all methods show a notable enhancement in their F-measure results with the application of TOPSIS. In general, the F-measure decreases slightly as the number of recommendations increases. It is again seen that the TOPSIS-enhanced NHSM method produces the best results across all cases, with an improvement of approximately 50% and 75% obtained through the application of TOPSIS to (PCC, CPCC, SPCC, Cos, and JMSD) and NHSM, respectively. These results reinforce the preceding results and indicate that the application of TOPSIS significantly improves both the precision and recall of memory-based CF. Fig 16 shows the MAP results with respect to the recommendation list size for the PCC, CPCC, SPCC, Cos, JMSD, and NHSM methods with and without TOPSIS. The legend denotes the different data sets (100K & 1M MovieLens, HetRec2011, and FilmTrust).

Fig 16

Comparison of MAP for all methods using all datasets.

The results clearly show that the use of TOPSIS produces a significant improvement in terms of MAP, with the TOPSIS adaptation of the NHSM CF approach producing the best results across all datasets. Conversely, the Cos and MSD CF methods produce the worst results. In terms of data sets, the MAP values using 100K MovieLens and HetRec2001 are better than with the other data sets (1M MovieLens and FilmTrust). The worst scores were obtained when applying the FilmTrust data set to all methods based on TOPSIS, except for SPCC, which performed worst using the HeRec2011 data set. Overall, the results indicate that the methods based on TOPSIS more than double the accuracy of PCC, CPCC, SPCC, MSD, and Cos, and produce three-fold enhancements in JMSD and NHSM. These results indicate that the application of TOPSIS to conventional methods can significantly improve the MAP accuracy of memory-based CF. Generally, the RS does not guarantee that the suggested items will be relevant to the preferences of the target user, but may encourage users to find useful or interesting items. Therefore, the accuracy of the RS is affected by the user’s subsequent selection from the list of recommendations. For instance, if the recommendation list contains 10 items and the user selects just four, then the accuracy will be negatively affected by the user disregarding the other six items. Thus, in this study, the experimental results above clearly show that the application of TOPSIS to the baseline methods results in better accuracy. Although the general accuracy of the proposed method is less than 0.5 in term of precision, the accuracy of all baseline methods is lower than that of the proposed method. For instance, the precision of the baseline methods does not exceed 0.1, except for NHSM, which scored around 0.12 using 100K MovieLens. In contrast, the maximum precision when TOPSIS was applied to NHSM reached 0.44 and 0.38 on the 100K and 1M MovieLens datasets, respectively. The low accuracy of the baseline methods in this case is related to the prediction algorithm. The prediction algorithm produces a predicted score for all candidate items within a given range of 1–5. Thus, there is a possibility that many items will have the same predicted score rating. Consequently, we do not know which (if either) of two items that have the same prediction score is actually more preferred by the user. This may lead to incorrect rankings and, in turn, low accuracy. However, the proposed method based on TOPSIS successfully minimizes the negative effect of the prediction algorithm in evaluating and ranking the candidate items. Thus, the application of TOPSIS significantly improves the accuracy of memory-based CF and produces more accurate results than the baseline methods.

Conclusions

This paper has presented a new memory-based CF in which the TOPSIS method is applied to improve the accuracy of recommendations. The proposed method applies TOPSIS as a substitute for the prediction methods used in conventional memory-based CF. The application of TOPSIS to several commonly used CF methods (PCC, CPCC, SPCC, Cos, MSD, JMSD, and NHSM) was shown to produce sharp improvements in terms of precision, recall, F-measure, and MAP results over the respective baseline methods. In particular, the recall and MAP improved by a factor of more than two under application to the PCC, CPCC, SPCC, MSD, and Cos methods and by factors of more than three and four under application to JMSD and NHSM, respectively. Although the improvement in precision was generally smaller across all cases, applying TOPSIS achieved a three-fold increase in precision in NHSM and a doubling of the precision in the other methods. The results conclusively underline the enhancements that can be achieved by using TOPSIS in place of prediction to improve the accuracy of memory-based CF methods. This improvement arises from the consideration by the TOPSIS-enhanced CF of the item ratings by all K-neighbors in constructing a decision matrix to weight the criteria applied by the target user, and the application of the TOPSIS technique to evaluate and rank candidate items. The key to successful memory-based CF is finding an appropriate set of neighbors. In future work, therefore, we will focus on improving the accuracy of recommendations by formulating a new similarity measure to locate sets of neighbors that produce better recommendations.

Dataset.

(ZIP) Click here for additional data file.

7 in total

1 in total

1. ARG-Mask RCNN: An Infrared Insulator Fault-Detection Network Based on Improved Mask RCNN.

Authors: Ming Zhou; Jue Wang; Bo Li
Journal: Sensors (Basel) Date: 2022-06-22 Impact factor: 3.847