| Literature DB >> 34932014 |
Mehmed Kantardzic1, Rachel Vickers-Smith2, Jason Turner1.
Abstract
BACKGROUND: In the absence of official clinical trial information, data from social networks can be used by public health and medical researchers to assess public claims about loosely regulated substances such as cannabidiol (CBD). For example, this can be achieved by comparing the medical conditions targeted by those selling CBD against the medical conditions patients commonly treat with CBD.Entities:
Keywords: CBD; Twitter; cannabidiol; cannabis; drug regulation; public health; sentiment analysis; social media; social networks; text mining; unregulated substances
Mesh:
Substances:
Year: 2021 PMID: 34932014 PMCID: PMC8726039 DOI: 10.2196/27307
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1Workflow of Tweet collection and classification. API: application programming interface; CBD: cannabidiol.
Training set for personal cannabidiol (CBD) class counts.
| Class | Predownsampling, n | Postdownsampling, n |
| Personal CBD | 631 | 631 |
| Nonpersonal CBD | 4865 | 631 |
| Total | 5496 | 1262 |
Training set for commercial cannabidiol (CBD) class counts.
| Class | Predownsampling, n | Postdownsampling, n |
| Commercial CBD | 489 | 480 |
| Noncommercial CBD | 45,007 | 489 |
| Total | 5496 | 978 |
Parameters used in text classification tuning with the logistic regression model.
| Parameter | Range |
| Penalty | {none,.l1.l2} |
| Regularization parameter | xk=10a+(b-a)(k-1)/(n-1), k=1,…n; a=0; b=5; n=20 |
| Solver | {newton-cg, lbfgs, liblinear, sag, saga} |
Personal cannabidiol (CBD) logistic regression classifier performance metrics.
| Classification | Precision | Recall | F1 | Support | Accuracy | AUCa | |||
|
| 0.85 | 0.86 | |||||||
|
| Nonpersonal CBD | 0.93 | 0.79 | 0.85 | 138 |
|
| ||
|
| Personal CBD | 0.79 | 0.93 | 0.85 | 115 |
|
| ||
|
| 0.89 | 0.87 | |||||||
|
| Nonpersonal CBD | 0.94 | 0.91 | 0.93 | 367 |
|
| ||
|
| Personal CBD | 0.78 | 0.83 | 0.81 | 133 |
|
| ||
aAUC: area under the receiver operating characteristic curve.
Commercial cannabidiol (CBD) logistic regression classifier performance metrics.
| Classifier | Precision | Recall | F1 | Support | Accuracy | AUCa | |||||||||
|
| 0.89 | 0.89 | |||||||||||||
|
| Noncommercial CBD | 0.92 | 0.85 | 0.89 | 95 |
|
| ||||||||
|
| Commercial CBD | 0.87 | 0.93 | 0.90 | 101 |
|
| ||||||||
|
| 0.87 | 0.82 | |||||||||||||
|
| Noncommercial CBD | 0.90 | 0.93 | 0.91 | 367 |
|
| ||||||||
|
| Commercial CBD | 0.79 | 0.70 | 0.74 | 133 |
|
| ||||||||
aAUC: area under the receiver operating characteristic curve.
Figure 2Cannabis-related term frequency per class.
Figure 3Other substances term frequency per class.
Figure 4Medical/health/wellness-related term frequency per class.
Medical-related terms with nonneutral sentiment.
| Term | VADERa compound score |
| anxiety | –0.1779 |
| anxious | 0.25 |
| calm | 0.3182 |
| calming | 0.4019 |
| cancer | –0.6597 |
| depression | –0.5719 |
| energy | 0.2732 |
| pain | –0.5106 |
| pains | –0.4215 |
| stress | –0.4215 |
aVADER: Valence Aware Dictionary for Sentiment Reasoning.
Personal and commercial cannabidiol (CBD) sentiment categorical counts.
| Term | Personal tweets | Commercial tweets | ||||||||||||||||
|
| n | With term | Without term | n | With term | Without term | ||||||||||||
|
|
| posa | neub | negc | pos | neu | neg |
| pos | neu | neg | pos | neu | neg | ||||
| anxiety | 5353 | 2818 | 126 | 2409 | 3125 | 519 | 1718 | 2924 | 1564 | 44 | 1316 | 1726 | 352 | 846 | ||||
| anxious | 515 | 266 | 11 | 238 | 307 | 47 | 161 | 114 | 53 | 4 | 57 | 84 | 5 | 25 | ||||
| autism | 395 | 180 | 47 | 168 | 180 | 47 | 168 | 27 | 17 | 2 | 8 | 17 | 2 | 8 | ||||
| calm | 1224 | 1007 | 17 | 200 | 761 | 145 | 318 | 725 | 659 | 9 | 57 | 535 | 80 | 110 | ||||
| calming | 445 | 399 | 4 | 42 | 324 | 33 | 88 | 389 | 369 | 2 | 18 | 308 | 47 | 34 | ||||
| cancer | 986 | 230 | 19 | 737 | 530 | 111 | 345 | 246 | 76 | 0 | 170 | 122 | 44 | 80 | ||||
| depression | 568 | 164 | 17 | 387 | 307 | 31 | 230 | 326 | 69 | 8 | 249 | 178 | 19 | 129 | ||||
| energy | 507 | 416 | 9 | 82 | 334 | 57 | 116 | 444 | 421 | 7 | 16 | 357 | 52 | 35 | ||||
| fitness | 57 | 48 | 2 | 7 | 37 | 4 | 16 | 128 | 125 | 0 | 3 | 100 | 15 | 13 | ||||
| pain | 7432 | 2948 | 188 | 4296 | 4985 | 558 | 1889 | 6287 | 3262 | 113 | 2912 | 4956 | 591 | 740 | ||||
| pains | 394 | 157 | 11 | 226 | 225 | 19 | 150 | 311 | 168 | 9 | 134 | 219 | 11 | 81 | ||||
| ptsdd | 217 | 111 | 14 | 92 | 111 | 14 | 92 | 55 | 33 | 9 | 13 | 33 | 9 | 13 | ||||
| skin | 618 | 461 | 55 | 102 | 464 | 54 | 100 | 2516 | 2211 | 150 | 155 | 2229 | 136 | 151 | ||||
| sleep | 3761 | 2518 | 356 | 887 | 2517 | 356 | 888 | 2980 | 2129 | 322 | 529 | 2131 | 321 | 528 | ||||
| stress | 1012 | 560 | 18 | 434 | 713 | 45 | 254 | 1407 | 883 | 28 | 496 | 1100 | 36 | 271 | ||||
| weight loss | 8 | 5 | 2 | 1 | 5 | 2 | 1 | 24 | 18 | 3 | 3 | 18 | 3 | 3 | ||||
| wellness | 144 | 129 | 2 | 13 | 98 | 18 | 28 | 4216 | 4020 | 38 | 158 | 3106 | 814 | 296 | ||||
apos: positive sentiment.
bneu: neutral sentiment.
cneg: negative sentiment.
dptsd: posttraumatic stress disorder.
Personal and commercial cannabidiol (CBD) sentiment score t test results (with and without term).
| Term | Personal with vs without term | Commercial with vs without term | Commercial vs personal with term | Commercial vs personal without term | ||||||||
|
|
|
|
|
|
|
|
|
| ||||
| anxiety | –10.31 | 10,704 | <.001 | –8.51 | 5846 | <.001 | –3.33 | 8275 | .001 | –4.28 | 8275 | <.001 |
| anxious | –4.29 | 1028 | <.001 | –2.39 | 226 | .02 | 0.53 | 627 | .59 | –0.88 | 627 | .38 |
| autism | 0.00 | 788 | >.99 | 0.00 | 52 | >.99 | –1.74 | 420 | .08 | –1.74 | 420 | .08 |
| calm | 9.15 | 2446 | <.001 | 7.40 | 1448 | <.001 | –8.06 | 1947 | <.001 | –8.04 | 1947 | <.001 |
| calming | 6.40 | 888 | <.001 | 6.49 | 776 | <.001 | –3.37 | 832 | .001 | –3.09 | 832 | .002 |
| cancer | –20.71 | 1970 | <.001 | –8.65 | 490 | <.001 | –1.59 | 1230 | .11 | –0.22 | 1230 | .83 |
| depression | –11.67 | 1134 | <.001 | –11.49 | 650 | <.001 | 2.06 | 892 | .40 | 0.29 | 892 | .77 |
| energy | 4.45 | 1012 | <.001 | 5.25 | 886 | <.001 | –7.68 | 949 | <.001 | –7.02 | 949 | <.001 |
| fitness | 1.81 | 112 | .07 | 3.92 | 254 | <.001 | –3.69 | 183 | <.001 | –2.88 | 183 | .004 |
| pain | –41.39 | 14,862 | <.001 | –43.82 | 12,572 | <.001 | –18.37 | 13,717 | <.001 | –23.02 | 13,717 | <.001 |
| pains | –6.32 | 786 | <.001 | –5.43 | 620 | <.001 | –3.98 | 703 | <.001 | –4.01 | 703 | <.001 |
| ptsda | –0.01 | 432 | .99 | 0.00 | 108 | >.99 | –1.77 | 270 | .08 | –1.76 | 270 | .08 |
| skin | –0.24 | 1234 | .81 | –1.76 | 5030 | .08 | –7.25 | 3132 | <.001 | –7.89 | 3132 | <.001 |
| sleep | 0.01 | 7520 | .99 | –0.12 | 5958 | .90 | –6.97 | 6739 | <.001 | –7.10 | 6739 | <.001 |
| stress | –9.12 | 2022 | <.001 | –11.98 | 2812 | <.001 | –4.65 | 2417 | <.001 | –5.60 | 2417 | <.001 |
| weight loss | 0.00 | 14 | >.99 | 0.00 | 46 | >.99 | –0.71 | 30 | .48 | –0.71 | 30 | .48 |
| wellness | 3.94 | 286 | <.001 | 27.38 | 8430 | <.001 | –4.72 | 4358 | <.001 | –3.35 | 4358 | .001 |
aptsd: posttraumatic stress disorder.
Personal and commercial cannabidiol (CBD) sentiment score descriptive statistics (with and without the term).
| Term | Personal tweets | Commercial tweets | |||||
|
| n | With term, mean (SD) | Without term, mean (SD) | n | With term, mean (SD) | Without term, mean (SD) | |
| anxiety | 5353 | 0.074 (0.573) | 0.186 (0.557) | 2924 | 0.118 (0.568) | 0.241 (0.538) | |
| anxious | 515 | 0.048 (0.591) | 0.203 (0.566) | 114 | 0.08 0 (0.566) | 0.254 (0.529) | |
| autism | 395 | –0.001 (0.546) | –0.001 (0.546) | 27 | 0.188 (0.557) | 0.188 (0.557) | |
| calm | 1224 | 0.448 (0.484) | 0.258 (0.540) | 725 | 0.616 (0.374) | 0.452 (0.467) | |
| calming | 445 | 0.608 (0.408) | 0.410 (0.508) | 389 | 0.695 (0.334) | 0.513 (0.444) | |
| cancer | 986 | –0.369 (0.571) | 0.158 (0.559) | 246 | –0.303 (0.638) | 0.167 (0.564) | |
| depression | 568 | –0.275 (0.573) | 0.122 (0.571) | 326 | –0.353 (0.506) | 0.111 (0.525) | |
| energy | 507 | 0.469 (0.492) | 0.324 (0.541) | 444 | 0.681 (0.331) | 0.547 (0.421) | |
| fitness | 57 | 0.429 (0.463) | 0.263 (0.514) | 128 | 0.633 (0.283) | 0.464 (0.400) | |
| pain | 7432 | –0.099 (0.605) | 0.293 (0.547) | 6287 | 0.088 (0.580) | 0.490 (0.440) | |
| pains | 394 | –0.098 (0.610) | 0.169 (0.577) | 311 | 0.087 (0.615) | 0.342 (0.553) | |
| ptsda | 217 | 0.037 (0.626) | 0.037 (0.627) | 55 | 0.200 (0.563) | 0.200 (0.563) | |
| skin | 618 | 0.420 (0.501) | 0.427 (0.501) | 2516 | 0.550 (0.371) | 0.568 (0.371) | |
| sleep | 3761 | 0.305 (0.522) | 0.305 (0.523) | 2980 | 0.392 (0.493) | 0.394 (0.493) | |
| stress | 1012 | 0.116 (0.632) | 0.360 (0.567) | 1407 | 0.234 (0.596) | 0.481 (0.396) | |
| weight loss | 8 | 0.289 (0.344) | 0.289 (0.344) | 24 | 0.436 (0.549) | 0.436 (0.549) | |
| wellness | 144 | 0.606 (0.431) | 0.384 (0.524) | 4216 | 0.720 (0.279) | 0.505 (0.426) | |
aptsd: posttraumatic stress disorder.
Figure 5Distribution of sentiment scores of personal tweets referencing the term “pain.”.
Figure 6Distribution of sentiment scores of commercial tweets referencing the term “pain.”.
Figure 7Sentiment distribution of personal cannabidiol-related tweets referencing autism.