| Literature DB >> 32455918 |
Lifeng He1, Dongmei Han1,2, Xiaohang Zhou1, Zheng Qu1.
Abstract
Many web-based pharmaceutical e-commerce platforms allow consumers to post open-ended textual reviews based on their purchase experiences. Understanding the true voice of consumers by analyzing such a large amount of user-generated content is of great significance to pharmaceutical manufacturers and e-commerce websites. The aim of this paper is to automatically extract hidden topics from web-based drug reviews using the structural topic model (STM) to examine consumers' concerns when they buy drugs online. The STM is a probabilistic extension of Latent Dirichlet Allocation (LDA), which allows the consolidation of document-level covariates. This innovation allows us to capture consumer dissatisfaction along with their dynamics over time. We extract 12 topics, and five of them are negative topics representing consumer dissatisfaction, whose appearances in the negative reviews are substantially higher than those in the positive reviews. We also come to the conclusion that the prevalence of these five negative topics has not decreased over time. Furthermore, our results reveal that the prevalence of price-related topics has decreased significantly in positive reviews, which indicates that low-price strategies are becoming less attractive to customers. To the best of our knowledge, our work is the first study using STM to analyze the unstructured textual data of drug reviews, which enhances the understanding of the aspects of drug consumer concerns and contributes to the research of pharmaceutical e-commerce literature.Entities:
Keywords: consumer concerns; online drug review; structural topic model; text mining
Year: 2020 PMID: 32455918 PMCID: PMC7277719 DOI: 10.3390/ijerph17103648
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Plate diagram of Latent Dirichlet Allocation (LDA).
Figure 2Plate diagram of structural topic model (STM).
Topic summary.
| # | Topic Labels | Topic Proportions | Top Words | Appeared in Literate |
|---|---|---|---|---|
| 1 | Package | 2.04% | box, product packaging, glass bottle, cases, careful, big, liquids | NO |
| 2 | After-sales service | 6.13% | consumer, solve, returns, business deception, quality, prescription drugs, advertising | NO |
| 3 | Price | 6.81% | cost-effective, low, cheap, convenience, coupons, small, free-shipping | YES |
| 4 | Side effects | 3.74% | instruction manual, side effects, diarrhea, poor effect, sore throat, garbage, outer packing | YES |
| 5 | Delivery | 9.65% | logistics, slow, cost, home delivery, boxes, pills, reasonable | YES |
| 6 | Curative effect | 11.30% | significant, desired effect, thumbs, breathability, elderly, fall, symptomatic | YES |
| 7 | Brand image | 10.84% | habituation, time-honored, bar code, quality, commodity, regular customers, trustworthy | YES |
| 8 | Expiration date | 12.60% | production date, shelf life, expiration date, expired, sent over, help, attitude | NO |
| 9 | Main functions | 21.46% | mouth ulcers, the trots, sneezing, aerosol, health products, laryngitis, packaging | YES |
| 10 | Only online purchase | 4.88% | physical stores, unavailable, online orders, essentials, necessities, bag, spray | YES |
| 11 | Mailing service | 3.97% | express fees, staff, parcel, consignment, waste, stores, angina | YES |
| 12 | Pre-sales consulting | 6.58% | customer service, consulting, waiter, thoughtful, unbearable, dependent, digestion | YES |
Figure 3Change in topic prevalence based on review extremity (positive vs. negative).
Figure 4Moderating effects of time.