| Literature DB >> 35013700 |
Anupam Singh1, Aldona Glińska-Neweś1.
Abstract
This study aims to identify the topics that users post on Twitter about organic foods and to analyze the emotion-based sentiment of those tweets. The study addresses a call for an application of big data and text mining in different fields of research, as well as proposes more objective research methods in studies on food consumption. There is a growing interest in understanding consumer choices for foods which are caused by the predominant contribution of the food industry to climate change. So far, customer attitudes towards organic food have been studied mostly with self-reported methods, such as questionnaires and interviews, which have many limitations. Therefore, in the present study, we used big data and text mining techniques as more objective methods to analyze the public attitude about organic foods. A total of 43,724 Twitter posts were extracted with streaming Application Programming Interface (API). Latent Dirichlet Allocation (LDA) algorithm was applied for topic modeling. A test of topic significance was performed to evaluate the quality of the topics. Public sentiment was analyzed based on the NRC emotion lexicon by utilizing Syuzhet package. Topic modeling results showed that people discuss on variety of themes related to organic foods such as plant-based diet, saving the planet, organic farming and standardization, authenticity, and food delivery, etc. Sentiment analysis results suggest that people view organic foods positively, though there are also people who are skeptical about the claims that organic foods are natural and free from chemicals and pesticides. The study contributes to the field of consumer behavior by implementing research methods grounded in text mining and big data. The study contributes also to the advancement of research in the field of sustainable food consumption by providing a fresh perspective on public attitude toward organic foods, filling the gaps in existing literature and research.Entities:
Keywords: Big data; Latent Dirichlet Allocation (LDA); Machine learning; Organic foods; Sentiment analysis; Text mining; Topic modeling
Year: 2022 PMID: 35013700 PMCID: PMC8733915 DOI: 10.1186/s40537-021-00551-6
Source DB: PubMed Journal: J Big Data ISSN: 2196-1115
Fig. 1The methodological architecture of the study
Fig. 2Graphical model representation of LDA
Validation perplexities under the different numbers of topics
| Number of topics | Validation perplexity | Number of topics | Validation perplexity | Number of topics | Validation perplexity |
|---|---|---|---|---|---|
| 1 | 290.5 | 6 | 247.9 | 11 | 244.2 |
| 2 | 269.1 | 7 | 241.1 | 12 | 256.9 |
| 3 | 276.4 | 8 | 237.5 | 13 | 251.9 |
| 4 | 253.7 | 9 | 239.4 | 14 | 270.7 |
| 5 | 245.6 | 10 | 243.3 | 15 | 267.2 |
Fig. 3Best topic by coherence score
Topics listing top 12 key terms extracted from LDA model
| Topic id | Terms | Label |
|---|---|---|
| Topic 1 | food, organic, jail, judge, Capitol, horns, get, hunger, riot, ordered, fur, strike | US politics (Attack on Capitol Hill) |
| Topic 2 | organic, food, get, jail, man, white, prison, guy, judge, horns, privilege, vacation | US politics (Capitol attacker’s demand for organic food) |
| Topic 3 | organic, food, people, consumption, health, vegan, grown, eating, fed, pesticides, chemicals | Authenticity |
| Topic 4 | food, organic, month, seasonal, three, spring, package, survival, vitality, bundle, chakra, better | Seasonality |
| Topic 5 | organic, food, plant-based, vegan, diet, ecological, refreshing, vegetables, health, future, good | Plant-based diet |
| Topic 6 | organic, food, plant, local, fresh, natural, farming, agriculture, save, planet, impact, world | Saving the planet |
| Topic 7 | farming, choose, harvest, latest, technology, crops, labels, standards, agriculture, supply-chain, organic, focus | Organic farming and standardization |
| Topic 8 | online, grocery, demand, local, store, orders, delivery, affordable, save, COVID19, planet, markets | Food delivery |
Fig. 4LDAvis visualization for Topic 3
Topic significance rank
| Topic id | Topic label | TSR | Rank | ||
|---|---|---|---|---|---|
| Topic 5 | Plant-based diet | 21.3 | 0.8 | 17.0 | 1 |
| Topic 3 | Authenticity | 20.7 | 0.8 | 16.6 | 2 |
| Topic 4 | Seasonality | 19.8 | 0.8 | 15.8 | 3 |
| Topic 7 | Organic farming and standardization | 18.5 | 0.8 | 14.8 | 4 |
| Topic 6 | Saving the planet | 16.3 | 0.8 | 13.0 | 5 |
| Topic 2 | US politics (Capitol attacker’s demand for organic food) | 15.7 | 0.8 | 12.6 | 6 |
| Topic 8 | Food delivery | 14.8 | 0.7 | 10.4 | 7 |
| Topic 1 | US politics (Attack on Capitol Hill) | 14.5 | 0.7 | 10.2 | 8 |
Fig. 5Public sentiments toward organic foods