| Literature DB >> 35923837 |
Ruichen Ge1, Hong Zhao1, Sha Zhang1.
Abstract
There is a trend that customers increasingly join the online brand community. However, evidence shows that there are nuances between different user segments, and only a small group of users are active. Thus, one key concern marketers face is identifying and targeting specific segments and decreasing user churn rates in an online environment. To this end, this study aims to propose a UGC-based segmentation of online brand community users, identify the characteristics of each segment, and consequently reduce online brand community users' churn rate. We used python to obtain users' post data from a well-known online brand community in China between July 2012 and December 2019, resulting in 912,452 posts and 20,493 users. We then use text mining and clustering methods to segment the users and compare the differences between the segments. Three groups-information-oriented users, entertainment-oriented users, and multi-motivation users-were emerged. Our results imply that entertainment-oriented users were the most active, yet, multi-directional users have the lowest probability of churn, with a churn rate of only 0.607 times than that of users who focus either on information or entertainment. Implications for marketing and future research opportunities are discussed.Entities:
Keywords: UGC; online brand community; text mining; user churn; user segmentation
Year: 2022 PMID: 35923837 PMCID: PMC9339712 DOI: 10.3389/frai.2022.900775
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
Figure 1The steps of data classification.
Measure and performance of SVM.
|
|
|
|
|---|---|---|
| Precision |
| 0.84 |
| Accuracy |
| 0.84 |
| Sensitivity |
| 0.83 |
| Specificity |
| 0.85 |
Centers of three clusters.
|
|
|
|
|
|---|---|---|---|
| Number of informational posts | 0.98 | 0.13 | 0.72 |
| Number of entertaining posts | 0.07 | 0.97 | 0.58 |
| Diversity | 0.03 | 0.04 | 0.24 |
Characteristics of each segment.
|
|
|
|
|
|---|---|---|---|
| Number of posts | 27.31 | 175.77 | 61.74 |
| Number of replies | 664.07 | 2219.33 | 1497.02 |
| Number of friends | 4.69 | 8.27 | 7.83 |
| Popularity | 620.11 | 2203.04 | 1218.90 |
| Prestige | 68.35 | 118.15 | 103.91 |
| Churn rate | 0.93 | 0.94 | 0.91 |
| Size of Cluster | 10487 | 4000 | 6006 |
Regression results.
|
|
|
| ||
|---|---|---|---|---|
|
|
|
|
| |
| Diversity | −0.248*** | 0.607 | ||
| No. of posts of Entertainment | −0.019 | 1.000 | 0.035 | 1.035 |
| No. of posts of Information | −0.103** | 0.999 | −0.087** | 0.999 |
| No. of friends | 0.004 | 1.000 | −0.006 | 1.006 |
| Popularity | 0.197* | 1.000 | 0.329* | 1.391 |
| Prestige | −0.061* | 0.999 | −0.068* | 0.934 |
| Year Dummy | - | - | ||
| Pseudo | 0.143 | 0.148 | ||
| Log Likelihood | −3979.87 | −3955.38 | ||
| No. of observations | 20,493 | 20,493 | ||
*p <0.1; **p <0.01; ***p <0.001. Coeff, Standarized Coefficients; OR, Odds Ratio.