Davide Marengo, Danny Azucar, Fabrizia Giannotta, Valerio Basile, Michele Settanni.
Abstract
Recent literature suggests that variations in both the form and the content of texts shared on social media tend to reflect user-level differences in demographic, psychosocial, and behavioral characteristics. In the present study, we examined associations between language use on Facebook and problematic alcohol use. We collected texts shared on Facebook by a sample of 296 adult social media users (66.9% female; mean age = 28.44 years, SD = 7.38). Texts were mined using a closed-vocabulary approach based on the Linguistic Inquiry and Word Count (LIWC) semantic dictionary, and an open-vocabulary approach performed via Latent Dirichlet Allocation (LDA). We then examined associations between the resulting textual features and alcohol-drinking scores as assessed using the AUDIT-C questionnaire. As a final aim, we employed the Random Forest machine-learning algorithm to determine and compare the accuracy of closed- and open-vocabulary features in predicting users' AUDIT-C scores. We found use of words about family, school, and positive feelings and emotions to be negatively associated with alcohol use and problematic drinking, while words suggesting interest in sporting events, politics and economics, and nightlife, as well as coarse language, were more frequent among problematic drinkers. The LIWC and LDA analyses yielded similar results, but LDA added information that could not be retrieved with LIWC alone. Furthermore, open-vocabulary features outperformed closed-vocabulary features in predicting participants' AUDIT-C scores (r = .46 vs. r = .28, respectively). These relationships between text features and offline behaviors may have important implications for alcohol screening in the online environment.
Keywords: Data mining; Digital footprints; Linguistics; Problem alcohol drinking; Psychology; Social media; Text analysis
Year: 2019; PMID: 31667380; PMCID: PMC6812202; DOI: 10.1016/j.heliyon.2019.e02523
Source DB: PubMed; Journal: Heliyon; ISSN: 2405-8440
LIWC categories showing significant correlations with AUDIT-C scores.
| LIWC categories | r | p |
|---|---|---|
| Family | -0.24 | p < .01 |
| Pronouns | -0.22 | p < .01 |
| Present (Time) | -0.21 | p < .01 |
| Self | -0.19 | p < .01 |
| Sex | -0.19 | p < .01 |
| Cognitive processes | -0.18 | p < .01 |
| Symptoms/sensations | -0.18 | p < .01 |
| Affective processes | -0.18 | p < .01 |
| Positive emotions | -0.18 | p < .01 |
| Time | -0.18 | p < .01 |
| Feelings | -0.17 | p < .01 |
| Social | -0.16 | p < .01 |
| 3rd person singular verbs (male) | -0.16 | p < .01 |
| Touching | -0.16 | p < .01 |
| Introspection | -0.16 | p < .01 |
| Future (Time) | -0.15 | p < .01 |
| 1st person singular | -0.15 | p < .05 |
| Have | -0.14 | p < .05 |
| 3rd person plural | -0.14 | p < .05 |
| Physical states/factors | -0.13 | p < .05 |
| Swear words | 0.13 | p < .05 |
| Negations | -0.12 | p < .05 |
| Optimism | -0.12 | p < .05 |
| Occupation | -0.11 | p < .05 |
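Each row above pairs one dictionary category with its correlation with AUDIT-C scores. A minimal sketch of that kind of closed-vocabulary analysis follows, assuming a category is scored as the share of a user's words that appear in its word list. The word list `FAMILY_WORDS`, the helper names, and the toy posts and scores are all hypothetical and invented for illustration; they are not the LIWC dictionary or the study data.

```python
import math

# Hypothetical mini word list standing in for one dictionary category.
FAMILY_WORDS = {"mom", "dad", "mother", "daughter", "grandma", "family"}


def category_share(text, category_words):
    """Return the fraction of whitespace-separated tokens found in the category."""
    tokens = text.lower().split()
    if not tokens:
        return 0.0
    hits = sum(1 for token in tokens if token in category_words)
    return hits / len(tokens)


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Invented toy posts and questionnaire scores (illustration only).
user_texts = [
    "christmas with mom and dad and grandma",
    "my daughter loves her grandma",
    "great game last night at the club",
    "long night out again",
]
audit_c_scores = [1, 2, 6, 7]

shares = [category_share(text, FAMILY_WORDS) for text in user_texts]
r = pearson_r(shares, audit_c_scores)
print(f"family share vs. score: r = {r:.2f}")
```

In this toy data the users who mention family words have the lower scores, so the coefficient is negative, which is the same direction as the Family row in the table.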
Fig. 1. Word clouds of LDA topics with top positive correlations with AUDIT-C scores.
Fig. 2. Word clouds of LDA topics with top negative correlations with AUDIT-C scores.
Results of prediction models for the AUDIT-C using the Random Forests algorithm.
| Features | R | MAE | RMSE | No. of trees |
|---|---|---|---|---|
| LIWC | .285 | 1.569 | 2.019 | 5000 |
| LDA-Topics | .462 | 1.493 | 1.960 | 500 |
| LIWC + LDA-Topics | .452 | 1.513 | 1.955 | 1000 |
Note. Prediction performed using 80/20 split cross-validation.
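The R, MAE, and RMSE columns can be read as three ways of comparing predicted and observed scores on held-out users. The sketch below shows how such metrics are typically computed, assuming R is the Pearson correlation between predictions and observations and that the 20% portion serves as a single hold-out set. The function names and the numbers are hypothetical; in practice the predictions would come from a model fitted on the remaining 80% of users.

```python
import math


def pearson_r(x, y):
    """Pearson correlation between observed and predicted values."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def mean_absolute_error(observed, predicted):
    """Average absolute difference between observed and predicted values."""
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


def root_mean_squared_error(observed, predicted):
    """Square root of the average squared difference."""
    squared = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    return math.sqrt(squared / len(observed))


# Invented held-out scores and model predictions (illustration only).
observed = [0, 2, 3, 5, 8]
predicted = [1.0, 2.5, 2.0, 4.0, 6.0]

r = pearson_r(observed, predicted)
mae = mean_absolute_error(observed, predicted)
rmse = root_mean_squared_error(observed, predicted)
print(f"R = {r:.3f}, MAE = {mae:.3f}, RMSE = {rmse:.3f}")
```

RMSE penalizes large errors more heavily than MAE, so it is never smaller than MAE on the same predictions; the table shows the same ordering in every row.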
Topics showing significant correlations with AUDIT-C scores (N = 296).
| r | p | Topic | Top Words |
|---|---|---|---|
| 0.308 | p<.001 | 74 | fuck life ass pussy shit twitter death tits guy fuck wanted poop porno miss shit |
| 0.234 | p<.001 | 175 | club fans staff arrive search boys beautiful should this miss enter events insert tag publish |
| 0.194 | p<.001 | 119 | social italy web digital web post google facebook blog online media internet site network marketing |
| 0.194 | p<.001 | 257 | germany brazil italy game world soccer argentina world cup goal holland german team goal neymar |
| 0.193 | 0.001 | 100 | renzi berlusconi reforms senate politics government president reformation grillo europe matteo republic camera silvio party |
| 0.172 | 0.003 | 200 | italy history country politics now world so today rome reason left times time to be journalists |
| 0.171 | 0.003 | 38 | done have to do well that being so maybe not even say that part milan |
| 0.156 | 0.007 | 75 | time xoxo next lignano evening alassio nut subscribe win information tell me indie pray rudeness |
| 0.145 | 0.013 | 123 | eli volta call usa known memory multiple mica sclerosis finally summer meantime beautiful cute internet |
| 0.144 | 0.013 | 81 | done earth ramazzotti eros world say you want I can born beautiful confirmation try understand |
| 0.144 | 0.013 | 141 | genova concordia liguria perugia century ship agi umbria lily lily agency port costa savona ligure |
| 0.14 | 0.016 | 22 | made cock ass shit life force ass you gotta balls cocks said you can jerk |
| 0.137 | 0.018 | 5 | done thanks so much to say it seems I think sense point case said problem |
| 0.135 | 0.020 | 221 | revenge today call cash step center I speak twitter opinion made death I'll work live dance pussy |
| 0.132 | 0.023 | 185 | italy politics grillo mafia country politicians voted party italians votes shame democracy say vote be |
| 0.129 | 0.026 | 101 | rome italy mafia milan euro mayor capital million arrested marine house case police video money |
| 0.122 | 0.036 | 173 | happened retweet occupied seats we sat big part friends twisted anonymous man seen strong words shit |
| 0.122 | 0.036 | 187 | life washing machine bice raffaele italian been edition memi photo passes niky inserted contest click |
| 0.122 | 0.036 | 213 | mery cuin sere miki ire kaety multifandom speleus alexia fiki odi last shalala hate |
| 0.121 | 0.037 | 264 | life history art culture exhibition today milan book film rome literature war books city world cinema |
| 0.12 | 0.039 | 167 | band mars for jared the gerard frank love day thank you leto letter shannon concert |
| 0.119 | 0.041 | 177 | reform government senate work law reforms italy room fees renzi workers costs euro employees public |
| 0.116 | 0.046 | 110 | instagram exchange made like reciprocate likes follow me we can call you want to reciprocate |
| 0.115 | 0.048 | 237 | work thanks problem case be know use copy serve mail use saw price site pay |
| -0.117 | 0.044 | 296 | night facts isa update eunhyuk donghae tagged alexiara location instagram twitter curti seconds pfv sister |
| -0.119 | 0.041 | 39 | life be strong man part world wants to give truth so many good words it |
| -0.119 | 0.041 | 48 | love being made to life like that man italy that wife rethinked to have time |
| -0.122 | 0.036 | 76 | sardinia cagliari Sardinian sea Sardinian bag Sardinian seas bombs zone island sassari luck work olbia |
| -0.125 | 0.032 | 144 | juve conte juventus rome vidal iturbe italy morata marotta pogba coach tevez evra player market |
| -0.129 | 0.026 | 122 | love wish wanna make you eyes person need be know vault will be can heart |
| -0.13 | 0.025 | 71 | thank you beautiful picture sun beautiful sea beautiful like beautiful so beautiful beauty day pleasure |
| -0.13 | 0.025 | 121 | darling kiss kisses night sweet love dreams hug goodnight heart I want good morning joy |
| -0.137 | 0.018 | 258 | love idol want thank you life dream hope smile I'll be world my dear idols miss |
| -0.14 | 0.016 | 35 | made go home days photo so can see tomorrow time day just kind week today |
| -0.142 | 0.014 | 87 | would like treport emis emi killa nick profile inactive you might like arrive |
| -0.148 | 0.011 | 114 | really want feel think would like to be told can today happy be hopeful yesterday I think beautiful |
| -0.156 | 0.007 | 239 | good day thanks evening good morning hello easter good night wishes goodnight afternoon week greetings |
| -0.164 | 0.005 | 65 | love life heart soul words happiness music emotions passion night thoughts moment thought pain woman |
| -0.164 | 0.005 | 216 | life love made person so beautiful you know how beautiful you can be rest be afraid |
| -0.18 | 0.002 | 249 | thank you heart congratulations family beautiful today wishes tonight good tomorrow beautiful we hope yesterday |
| -0.184 | 0.001 | 171 | school tomorrow day go start class days start today tasks want monday come back anxiety |
| -0.189 | 0.001 | 84 | thanks dear hello hug happy friends dear serene good night good day evening beautiful dearest |
| -0.192 | p<.001 | 206 | heart love life soul night eyes words world moon sweet smile sun sky sea stars |
| -0.219 | p<.001 | 58 | christmas made thank you tree dad gift house mom gifts wants mother beautiful grandma daughter |
| -0.22 | p<.001 | 292 | life be day so live person world you can all say love moment need heart |
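The p values in the table depend only on the size of each correlation and the sample size (N = 296). As a rough check, a correlation can be converted to a t statistic with N - 2 degrees of freedom, t = r * sqrt((N - 2) / (1 - r^2)). The sketch below does this and then approximates the two-tailed p value with the standard normal distribution, which is an assumption made here for simplicity (reasonable at this sample size, but not the exact t-distribution value). The function names are hypothetical.

```python
import math
from statistics import NormalDist


def t_from_r(r, n):
    """t statistic for a Pearson correlation r observed on n cases."""
    return r * math.sqrt((n - 2) / (1 - r ** 2))


def approx_two_tailed_p(r, n):
    """Two-tailed p value using a normal approximation to the t distribution."""
    t = t_from_r(r, n)
    return 2 * (1 - NormalDist().cdf(abs(t)))


N = 296  # sample size reported in the table caption
for r in (0.308, 0.115, -0.22):
    t = t_from_r(r, N)
    p = approx_two_tailed_p(r, N)
    print(f"r = {r:+.3f}  t = {t:+.2f}  approx. p = {p:.4f}")
```

With this approximation, the weakest positive correlation listed (r = 0.115) falls just under the .05 threshold, while the strongest (r = 0.308) is far below .001, in line with the p column.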