| Literature DB >> 35910042 |
Abstract
Dining out is one of the biggest expenditures for travelers worldwide and is essential for tourism-dependent destinations. Market segmentation gives industries the potential to classify similar customers and categorize their preferred target markets to ensure marketing expenses' operative management. It has been a practical approach for business improvement in tourism and hospitality. Big data are fundamentally changing the management of the hospitality sector and the relationship between the customer and business by simplifying the decision-making process based on large amounts of data. The data provided in social media have played an important role in customer segmentation. In fact, the data provided by the customers in social media have been a valuable source for decision-makers to precisely discover the customers' satisfaction dimensions on their services. Therefore, there is a need for the development of data-driven approaches for social data analysis for customers segmentation. This research aims to develop a new data-driven approach to reveal customers' satisfaction in restaurants. Specifically, k-means and Artificial Neural Network (ANN) with the aid of the Particle Swarm Optimization (PSO) technique are, respectively, used in data clustering and prediction tasks. In this research, the data of customers on the service quality of restaurants are collected from the TripAdvisor platform. The results of the data analysis are provided. We evaluate the prediction model through a set of evaluation metrics, Mean Squared Error (MSE) and coefficient of determination (R 2), compared with the other prediction approaches. The results showed that k-means-PSO-ANN (MSE = 0.09847; R 2 = 0.98764) has outperformed other methods. The current study demonstrates that the use of online review data for customer segmentation can be an effective way in the restaurant industry in relation to the traditional data analysis approaches. © King Fahd University of Petroleum & Minerals 2022.Entities:
Keywords: Customer decision-making; Customer segmentation; Data-driven analysis; Neural Network; Optimization learning techniques; Restaurants
Year: 2022 PMID: 35910042 PMCID: PMC9315337 DOI: 10.1007/s13369-022-07091-y
Source DB: PubMed Journal: Arab J Sci Eng ISSN: 2191-4281 Impact factor: 2.807
List of abbreviations used in this manuscript
| Abbreviation | Description |
|---|---|
| ANFIS | Adaptive Network-Based Fuzzy Inference System |
| ANN | Artificial Neural Network |
| CART | Classification and Regression Tree |
| CHAID | Chi-Squared Automatic Interaction Detection |
| COVID-19 | Coronavirus Disease 2019 |
| CS | Customer Satisfaction |
| CSS | Customer-Related Social Stressors |
| XNN | Explainable Neural Network |
| eWOM | Electronic Word of Mouth |
| HOSVD | Higher-Order Singular Value Decomposition |
| KNN | K-Nearest Neighbors |
| LDA | Latent Dirichlet Analysis |
| MCDM | Multi-Criteria Decision-Making |
| MLR | Means-Multiple Linear Regression |
| MSE | Mean Squared Error |
| NOR | Negative Online Review |
| PF | Price Fairness |
| PFQ | Perceived Food Quality |
| PLTS | Probabilistic Linguistic Term Set |
| PV | Perceived Value |
| RFM | Recency, Frequency and Monetary |
| SOM | Self-Organizing Map |
| SVR | Support Vector Regression |
| TOPSIS | Technique for Order of Preference by Similarity to Ideal Solution |
| EM | Expectation Maximization |
A literature review of customer segmentation in restaurant
| References | Study objective | Method | Results |
|---|---|---|---|
| [ | Developing a model for spa hotel customers segmentation | Machine learning methods such as SOM, HOSVD, CART | The machine learning approach is a suitable method for spa hotel segmentation |
| [ | Investigating the relationship among the user-generated data and brand in the restaurant | Panel data analysis | Based on results that the financial effects of positive contents diminished with the existence of the brand |
| [ | Applying machine learning techniques for customer segmentation in vegetarian restaurants | Text mining, cluster analysis, and predictive learning technique | Based on findings customers' online reviews can be an effective tool to determine their preferences |
| [ | Using probabilistic linguistic term set (PLTS) to determine the cluster analysis for restaurant selection | K-means clustering | A novel restaurant recommendation method by cluster analysis is proposed |
| [ | Using preference variables to determine consumers loyalty | K-means clustering | By using clustering, managers can offer customized price discounts for users |
| [ | Profiling profitable hotel customers by Recency, Frequency, and Monetary (RFM) indicators | RFM analysis | RFM indicators can classify customers based on preferences |
| [ | Proposing a hybrid method for online reviews analysis | Cluster analysis, text mining, and Neuro-Fuzzy technique | The impact of green hotels on customer satisfaction is investigated for different |
| [ | Traveling groups | ||
| [ | Determining guest experience to increase hospitality services | Identifying pertinent attributes to determine customer satisfaction | The naive Bayesian method is a suitable method for review classification |
| [ | Determining eco-friendly hotels by machine learning methods | MCDM method | The proposed technique is useful for customers segmentation |
Online review study in restaurant
| References | Objective | Results |
|---|---|---|
| [ | The effect of COVID-19 on hospitality services by LDA, EM, and ANFIS | COVID-19 has a profound impact on hospitality performance and overall customers satisfaction level |
| [ | Sentiment-analysis method in Yelp reviews | Customers provided positive content than negative content in restaurant reviews |
| [ | The relationship between user experience online reviews posting time | There is a reverse association between user experience and content posting time |
| [ | MCDM for determining the satisfaction of travelers | Based on the results various users have different satisfaction levels and preferences |
| [ | Machine learning techniques in eco-friendly hotels review | The findings demonstrate that the hybrid methods are useful in preference analysis |
| [ | Combining social media data and historical sales data | The proposed method is useful for customer segmentation |
| [ | Determining consumers' preferences for tourism products | Customer segmentation is a valuable tool that aids users' decision-making |
| [ | Data mining model in TripAdvisor for hotel rating | Users' experience has a key role in hotel rating and creating positive word of mouth |
| [ | Evaluating the role of PFQ, PF, PV, and Customer Satisfaction on customers revisit | Based on the results PFQ has a positive effect on PF and PV |
| [ | Examining the influence of online review on restaurant performance | Based on the results, the number of online reviews has a positive effect on restaurant performance |
| [ | Examining the role of eWOM on the popularity of hospitality | Based on findings, quality of food, environment, and service have a positive impact on the popularity of restaurants |
| [ | Determining contributing factors on the hotel rating | Food, service, environment price, and ambiance are the main features determining star ratings |
| [ | Identifying main factors in positive eWOM | The results have shown that physical setting has a profound impact on positive word of mouth |
| [ | Investigating hotel loyalty programs in flyertalk.com members | The main clusters that were obtained were program experience, value, and process |
| [ | Using Yelp reviews for restaurants categorization | Ingredient, type of food, taste, price, and hygiene are important in restaurant segmentation |
| [ | Proposing a new method to determine comparative relations from online reviews | The proposed method enables the identification of top competitors using dissimilarity indicators |
| [ | Analyzing the effects of traditional face-to-face customer-related social stressors (CSSs), and negative online review (NOR) stressor | The proposed method demonstrates that, by considering the contribution of CSSs, receipt of NORs predicts anger, and anger moderates the association among NOR receipt and two indices of burnout |
| [ | Estimating the effect of different elements for online consumer review | The findings explain the impact of online consumer review elements on restaurant popularity |
| [ | Examining how various service experiences lead to different intentions to create eWOM | Findings reveal that expert users have more motivation to create eWOM after negative experiences |
| [ | Online restaurant reviews analysis in Yelp.com | There are different factors for certified green and non-certified restaurants rating |
| [ | Applying the bootstrap resampling method in the international hotel chain | A deep relationship was established on the most representative attributes of repeaters being traveled without children |
| [ | Analyzing the online reviews from different social media | Results demonstrated that the number of online reviews has a positive effect on hotel performance |
| [ | Examining information quality of three online review platforms | The findings show that there is vast inconsistency in the illustration of the hospitality industry on these tools |
| [ | Using machine learning models to analyzing reviews hotels of Tehran on TripAdvisor | KNN algorithm is the best approach for customer segmentation |
Fig. 1Proposed hybrid method
Fig. 2Constructing prediction models in ANN using PBP
Fig. 3PSO procedure
Fig. 4Restaurants registered in TripAdvisor
Fig. 5Cluster centroids
Restaurants criteria in each cluster
| Criteria | Ratings | Clusters | |||||
|---|---|---|---|---|---|---|---|
| Kmeans_1 | Kmeans_2 | Kmeans_3 | Kmeans_4 | Kmeans_5 | Kmeans_6 | ||
| Count | Count | Count | Count | Count | Count | ||
| Food | 1 | 0 | 155 | 291 | 75 | 148 | 116 |
| 2 | 0 | 186 | 226 | 96 | 134 | 160 | |
| 3 | 18 | 195 | 127 | 131 | 104 | 175 | |
| 4 | 218 | 131 | 0 | 131 | 75 | 160 | |
| 5 | 420 | 75 | 0 | 104 | 43 | 128 | |
| Service | 1 | 160 | 0 | 279 | 198 | 0 | 120 |
| 2 | 176 | 0 | 256 | 204 | 0 | 176 | |
| 3 | 184 | 89 | 104 | 116 | 63 | 179 | |
| 4 | 119 | 249 | 5 | 19 | 158 | 162 | |
| 5 | 17 | 404 | 0 | 0 | 283 | 102 | |
| Value | 1 | 61 | 304 | 116 | 228 | 0 | 0 |
| 2 | 142 | 286 | 154 | 220 | 0 | 0 | |
| 3 | 179 | 152 | 154 | 89 | 75 | 86 | |
| 4 | 143 | 0 | 133 | 0 | 211 | 305 | |
| 5 | 131 | 0 | 87 | 0 | 218 | 348 | |
| Atmosphere | 1 | 0 | 111 | 0 | 246 | 0 | 365 |
| 2 | 34 | 191 | 26 | 231 | 27 | 309 | |
| 3 | 157 | 222 | 146 | 58 | 120 | 65 | |
| 4 | 221 | 125 | 236 | 2 | 163 | 0 | |
| 5 | 244 | 93 | 236 | 0 | 194 | 0 | |
Customer satisfaction in each cluster
| Criteria | Ratings | Clusters | |||||
|---|---|---|---|---|---|---|---|
| Kmeans_1 | Kmeans_2 | Kmeans_3 | Kmeans_4 | Kmeans_5 | Kmeans_6 | ||
| Count | Count | Count | Count | Count | Count | ||
| Customer Satisfaction | 1 | 0 | 37 | 70 | 144 | 0 | 24 |
| 2 | 115 | 241 | 223 | 200 | 57 | 215 | |
| 3 | 195 | 255 | 237 | 169 | 147 | 287 | |
| 4 | 231 | 194 | 110 | 24 | 179 | 184 | |
| 5 | 115 | 15 | 4 | 0 | 121 | 29 | |
Fig. 6Clustering of data by k-means
Fig. 7MSE versus iteration in ANN for 6 clusters
Fig. 8Coefficient of determination in six clusters
Fig. 9Actual satisfaction versus predicted satisfaction in six clusters
Method comparisons
| Method | MSE | Coefficient of Determination ( |
|---|---|---|
| 0.09847 | 0.98764 | |
| ANN | 0.18936 | 0.86535 |
| SVR | 0.17563 | 0.91234 |
| 0.16134 | 0.92678 | |
| 0.18534 | 0.88342 | |
| ANFIS | 0.18829 | 0.87194 |
| ANN Ensembles | 0.15632 | 0.94352 |