Ryan Rivas1, Moloud Shahbazi1, Renee Garett2, Vagelis Hristidis1, Sean Young3.
Abstract
BACKGROUND: There have been recurring reports of web-based harassment and abuse among adolescents and young adults through anonymous social networks.
Keywords: data analysis; social media; students; supervised machine learning; universities
Year: 2020 PMID: 32469317 PMCID: PMC7293060 DOI: 10.2196/17224
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Characteristics of universities included in the study.
| State and university | Public or private | Enrollment | Ranking |
| CAa | | | |
| California Polytechnic State University | Public | 19,226 | 221 |
| CSUb Chico | Public | 16,535 | 467 |
| CSU Los Angeles | Public | 20,353 | 700 |
| CSU San Bernardino | Public | 17,167 | 700 |
| University of California, Irvine | Public | 25,001 | 153 |
| FLc | | | |
| Florida International University | Public | 53,525 | 550 |
| Florida State University | Public | 36,575 | 226 |
| University of Central Florida | Public | 59,894 | 445 |
| University of Florida | Public | 36,731 | 56 |
| University of South Florida | Public | 35,035 | 396 |
| NYd | | | |
| Cornell University | Private | 14,706 | 9 |
| CUNYe Hunter College | Public | 20,582 | 350 |
| CUNY John Jay College of Criminal Justice | Public | 15,845 | 700 |
| SUNYf Buffalo State | Public | 10,665 | 700 |
| SUNY New Paltz | Public | 7756 | 423 |
| TXg | | | |
| Tarleton State University | Public | 11,008 | 800 |
| Texas Tech University | Public | 29,342 | 550 |
| University of Houston | Public | 36,128 | 388 |
| University of Texas, Rio Grande Valley | Public | 27,560h | N/Ai |
aCA: California.
bCSU: California State University.
cFL: Florida.
dNY: New York.
eCUNY: City University of New York.
fSUNY: State University of New York.
gTX: Texas.
hFall 2016 enrollment for the University of Texas Rio Grande Valley [24].
iN/A: not applicable.
Definitions of messaging behaviors included in the study.
| Messaging behavior | Definition | Examples | Cohen kappa (number of agreements) |
| Seeking help | Seeking social support (eg, emotional support and help with problems) from other users | “I like don't know what to do with myself. Literally I have no one to talk to” “What's the easiest class to fill art requirement? I'm terrible at art” | 0.48 (90) |
| Offering support | Giving social support to other users | “Hope everything gets resolved OP!” “You've got this!” | 0.56 (86) |
| Bullying | Intends harm, indicative of a power imbalance, and messages are repeatedly sent [ | “You people are disgusting” “In the words of DJ Khaled ‘congratulations you played yourself’ it's not hard to portray being a moron. It's quite sad actually” | 0.00 (95) |
| Humor | Intends to be funny without bullying | “I predict my day based on my morning poo” “Why get thinner when you can get more dinner?” | 0.48 (87) |
Characteristics of messages with each messaging behavior.
| Characteristic | Range | Mean (SD) | Median |
| | | | |
| | | | |
| Characters | 11-204 | 74.10 (47.82) | 61 |
| Words | 2-42 | 14.61 (9.60) | 12 |
| | | | |
| All posts | 0-50 | 4.14 (7.03) | 2 |
| Initial post | 0-50 | 5.47 (7.61) | 3 |
| | | | |
| AM | 12:01 AM-11:48 AM | 3:38 (2:56) | 2:43 |
| PM | 12:06 PM-11:57 PM | 7:33 (3:13) | 8:06 |
| | | | |
| | | | |
| Characters | 2-200 | 74.87 (58.39) | 58 |
| Words | 1-43 | 14.39 (11.25) | 11 |
| | | | |
| All posts | 0-17 | 0.04 (0.66) | 0 |
| Initial post | 0-17 | 4.57 (6.50) | 1 |
| | | | |
| AM | Midnight-11:57 AM | 3:27 (2:43) | 2:47 |
| PM | Noon-11:59 PM | 7:44 (3:01) | 8:25 |
| | | | |
| | | | |
| Characters | 3-230 | 63.26 (49.64) | 47 |
| Words | 1-40 | 11.92 (9.32) | 9 |
| | | | |
| All posts | 0-44 | 0.17 (2.42) | 0 |
| Initial post | 0-44 | 4.07 (11.53) | 1 |
| | | | |
| AM | Midnight-11:58 AM | 3:37 (2:32) | 3:12 |
| PM | 12:10 PM-11:58 PM | 8:38 (3:00) | 9:43 |
| | | | |
| | | | |
| Characters | 2-199 | 32.37 (43.96) | 36 |
| Words | 1-41 | 6.37 (8.43) | 7 |
| | | | |
| All posts | 0-9 | 0.28 (1.02) | 0 |
| Initial post | 0-9 | 1.83 (2.02) | 1 |
| | | | |
| AM | 12:02 AM-11:58 AM | 3:21 (2:49) | 2:40 |
| PM | 12:09 PM-11:59 PM | 7:17 (3:25) | 8:09 |
Cohen kappa for each topic (n=96).
| Statistic | Relationships and sex | College living | Politics | School and classes |
| Cohen kappa | 0.73 | 1.00 | Undefined | 0.77 |
| Number of agreements | 90 | 96 | 96 | 91 |
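The "Undefined" kappa for Politics, despite 96 of 96 agreements, is what the standard definition of Cohen kappa, (p_o - p_e) / (1 - p_e), produces when both annotators give every message the same single label: chance agreement p_e is then 1 and the denominator is 0. The sketch below illustrates this with hypothetical binary label vectors (not the study's annotations); `cohen_kappa` is an illustrative helper, not the authors' code.

```python
from collections import Counter
from fractions import Fraction


def cohen_kappa(labels_a, labels_b):
    """Return (kappa, number_of_agreements) for two annotators.

    kappa is None when chance agreement equals 1, i.e. the statistic
    is undefined (0 / 0).
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and equally long")
    n = len(labels_a)
    agreements = sum(a == b for a, b in zip(labels_a, labels_b))
    p_observed = Fraction(agreements, n)

    # Chance agreement: product of the two annotators' marginal label rates.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    p_expected = sum(
        Fraction(count_a[label], n) * Fraction(count_b[label], n)
        for label in set(count_a) | set(count_b)
    )
    if p_expected == 1:
        return None, agreements
    return float((p_observed - p_expected) / (1 - p_expected)), agreements


# Hypothetical case 1: neither annotator marks any of 96 messages.
print(cohen_kappa([0] * 96, [0] * 96))
# Hypothetical case 2: one annotator marks a single message, the other none.
print(cohen_kappa([1] + [0] * 95, [0] * 96))
```

The second case gives 95 agreements with a kappa of 0, the same pattern as the 0.00 (95) reported for bullying in the messaging behavior definitions: very rare labels can yield near-perfect raw agreement and still a kappa at or near zero.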
Characteristics of messages with each topic.
| Characteristic | Range | Mean (SD) | Median |
| | | | |
| | | | |
| Characters | 2-252 | 82.18 (52.32) | 70 |
| Words | 1-47 | 16.17 (10.32) | 14 |
| | | | |
| All posts | 0-50 | 0.96 (3.43) | 0 |
| Initial post | 0-50 | 4.60 (6.31) | 3 |
| | | | |
| AM | Midnight-11:58 AM | 3:27 (2:21) | 3:07 |
| PM | Noon-11:59 PM | 8:05 (3:16) | 8:55 |
| | | | |
| | | | |
| Characters | 3-200 | 74.56 (49.98) | 62 |
| Words | 1-42 | 14.36 (9.52) | 12 |
| | | | |
| All posts | 0-19 | 0.83 (2.15) | 0 |
| Initial post | 0-19 | 2.60 (3.14) | 2 |
| | | | |
| AM | Midnight-11:56 AM | 3:34 (2:38) | 2:57 |
| PM | Noon-11:59 PM | 6:57 (3:15) | 7:24 |
| | | | |
| | | | |
| Characters | 5-210 | 107.72 (58.43) | 99 |
| Words | 1-43 | 19.22 (10.65) | 17 |
| | | | |
| All posts | 0-53 | 0.83 (4.27) | 0 |
| Initial post | 0-53 | 7.13 (10.59) | 4 |
| | | | |
| AM | Midnight-11:47 AM | 3:26 (2:32) | 3:06 |
| PM | 12:08 PM-11:58 PM | 7:52 (3:11) | 7:30 |
| | | | |
| | | | |
| Characters | 3-202 | 71.41 (49.59) | 59 |
| Words | 1-42 | 13.67 (9.38) | 11 |
| | | | |
| All posts | 0-44 | 0.98 (3.33) | 0 |
| Initial post | 0-44 | 4.39 (5.90) | 3 |
| | | | |
| AM | Midnight-11:58 AM | 3:41 (2:58) | 2:46 |
| PM | 12:03 PM-11:59 PM | 6:58 (3:09) | 7:35 |
Classifier hyperparameter values evaluated in our experiments.
| Classifier and hyperparameter | Values |
| Random forest | |
| Maximum tree depth | 2, 4, 8, 16, 32, 64 |
| Number of trees | 10, 100, 1000 |
| SVMa | |
| Cb | 0.001, 0.01, 0.1, 1, 10 |
| Loss function | Hinge, squared hinge |
| CNNc | |
| Filter window sizes | (2, 3, 4), (3, 4, 5), (4, 5, 6) |
| Feature maps per filter window size | 100, 200, 300, 400, 500, 600 |
aSVM: support vector machine.
bC: SVM regularization parameter.
cCNN: convolutional neural network.
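To make the size of this search space concrete, the sketch below enumerates every combination of the listed values. It assumes an exhaustive grid and assumes the tree parameters belong to the random forest classifier; the table lists only the values, not how they were searched. The dictionary keys are hypothetical names chosen for illustration.

```python
from itertools import product

# Values copied from the hyperparameter table; key names are hypothetical.
SEARCH_SPACE = {
    "random_forest": {
        "max_tree_depth": [2, 4, 8, 16, 32, 64],
        "n_trees": [10, 100, 1000],
    },
    "svm": {
        "C": [0.001, 0.01, 0.1, 1, 10],
        "loss": ["hinge", "squared_hinge"],
    },
    "cnn": {
        "filter_window_sizes": [(2, 3, 4), (3, 4, 5), (4, 5, 6)],
        "feature_maps_per_window": [100, 200, 300, 400, 500, 600],
    },
}


def expand_grid(space):
    """Return one dict per combination of hyperparameter values."""
    names = list(space)
    return [
        dict(zip(names, combo))
        for combo in product(*(space[name] for name in names))
    ]


GRIDS = {clf: expand_grid(space) for clf, space in SEARCH_SPACE.items()}

for clf, configs in GRIDS.items():
    print(clf, len(configs), "configurations")
```

Under these assumptions the grid holds 18 random forest, 10 SVM, and 18 CNN configurations per classification task.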
Frequency of messaging behaviors by state.
| Messaging behavior | CAa (N=4496), n (%) | FLb (N=4694), n (%) | NYc (N=4273), n (%) | TXd (N=3503), n (%) | Total (N=16,966), n (%) | Bonferroni-corrected Fisher exact P value |
| Seeking help | 70 (1.56) | 94 (2.00) | 65 (1.52) | 70 (2.00) | 299 (1.76) | .20 |
| Offering support | 183 (4.07) | 381 (8.12) | 234 (5.48) | 88 (2.51) | 886 (5.22) | <.001 |
| Bullying | 61 (1.36) | 68 (1.45) | 98 (2.29) | 93 (2.65) | 320 (1.89) | <.001 |
| Humor | 140 (3.11) | 134 (2.85) | 144 (3.37) | 98 (2.80) | 516 (3.04) | .40 |
aCA: California.
bFL: Florida.
cNY: New York.
dTX: Texas.
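As a rough illustration of the test named in the last column, the sketch below computes a two-sided Fisher exact P value for a 2x2 table and applies a Bonferroni correction. This is a simplification: the table compares four states at once, which would need the r x c generalization of the test, and the number of tests in the correction family is not stated here, so `n_tests` is an assumption. Both function names are hypothetical helpers.

```python
from fractions import Fraction
from math import comb


def fisher_exact_2x2(a, b, c, d):
    """Two-sided Fisher exact P value for the table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed one.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)

    def prob(x):
        return Fraction(comb(row1, x) * comb(row2, col1 - x), total)

    lowest, highest = max(0, col1 - row2), min(row1, col1)
    observed = prob(a)
    return sum(p for p in map(prob, range(lowest, highest + 1)) if p <= observed)


def bonferroni(p_value, n_tests):
    """Bonferroni-corrected P value, capped at 1."""
    return min(1, p_value * n_tests)


# Small textbook-style table; exact arithmetic avoids rounding issues.
p = fisher_exact_2x2(8, 2, 1, 5)
print(float(p), float(bonferroni(p, 4)))
```

With the counts from the table, a pairwise comparison of bullying in Texas and California would be `fisher_exact_2x2(93, 3503 - 93, 61, 4496 - 61)`.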
Frequency of topics by state.
| Topics | CAa (N=4443), n (%) | FLb (N=4668), n (%) | NYc (N=4253), n (%) | TXd (N=3485), n (%) | Total (N=16,849), n (%) | Bonferroni-corrected Fisher exact P value |
| Relationships and sex | 730 (16.43) | 689 (14.76) | 562 (13.21) | 535 (15.35) | 2516 (14.93) | <.001 |
| College living | 224 (5.04) | 83 (1.78) | 157 (3.69) | 180 (5.16) | 644 (3.82) | <.001 |
| Politics | 133 (2.99) | 122 (2.61) | 317 (7.45) | 35 (1.00) | 607 (3.60) | <.001 |
| School and classes | 208 (4.68) | 114 (2.44) | 150 (3.53) | 198 (5.68) | 670 (3.98) | <.001 |
aCA: California.
bFL: Florida.
cNY: New York.
dTX: Texas.
Popularity of messaging behaviors by state.
| Messaging behavior | CAa: Meane (SE) | CAa: n | FLb: Mean (SE) | FLb: n | NYc: Mean (SE) | NYc: n | TXd: Mean (SE) | TXd: n | Total: Mean (SE) | Total: n |
| Seeking help | 1.04 (0.26) | 68 | 1.37 (0.21) | 92 | 0.78 (0.30) | 63 | 0.53 (0.27) | 70 | 0.97 (0.13) | 293 |
| Offering support | 1.00 (0.11) | 182 | 0.98 (0.08) | 380 | 1.22 (0.12) | 230 | 0.77 (0.16) | 88 | 1.03 (0.06) | 880 |
| Bullying | 0.40 (0.32) | 58 | 0.32 (0.17) | 68 | 0.59 (0.23) | 96 | 0.32 (0.18) | 92 | 0.42 (0.11) | 314 |
| Humor | 1.50 (0.20) | 124 | 1.71 (0.22) | 125 | 2.14 (0.27) | 130 | 1.27 (0.20) | 90 | 1.69 (0.12) | 469 |
aCA: California.
bFL: Florida.
cNY: New York.
dTX: Texas.
eMean: Mean message popularity scores are based on the aggregate number of upvotes (+1) and downvotes (−1) per message.
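The popularity score in footnote e can be sketched as upvotes minus downvotes per message, summarized by a mean and standard error. The code below is an illustration under the assumption that SE is the sample standard deviation divided by the square root of n; the function names are hypothetical. It also checks that the Total column for seeking help is consistent with the n-weighted mean of the four state means.

```python
from math import sqrt
from statistics import mean, stdev


def popularity(upvotes, downvotes):
    """Net score of one message: +1 per upvote, -1 per downvote."""
    return upvotes - downvotes


def mean_and_se(scores):
    """Mean and standard error (sample SD / sqrt(n)) of message scores."""
    return mean(scores), stdev(scores) / sqrt(len(scores))


def pooled_mean(groups):
    """n-weighted mean of per-group means; groups is a list of (mean, n)."""
    total_n = sum(n for _, n in groups)
    return sum(m * n for m, n in groups) / total_n


# (mean, n) for seeking help in CA, FL, NY, and TX, copied from the table.
seeking_help = [(1.04, 68), (1.37, 92), (0.78, 63), (0.53, 70)]
print(round(pooled_mean(seeking_help), 2))  # agrees with the Total column, 0.97
```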
Popularity of topics by state.
| Topic | CAa: Meane (SE) | CAa: n | FLb: Mean (SE) | FLb: n | NYc: Mean (SE) | NYc: n | TXd: Mean (SE) | TXd: n | Total: Mean (SE) | Total: n |
| Relationships and sex | 1.56 (0.09) | 700 | 1.03 (0.08) | 678 | 1.16 (0.10) | 548 | 0.96 (0.08) | 528 | 1.19 (0.05) | 2454 |
| College living | 1.31 (0.15) | 209 | 1.56 (0.26) | 78 | 1.70 (0.23) | 146 | 0.78 (0.14) | 175 | 1.28 (0.09) | 608 |
| Politics | 1.17 (0.21) | 129 | 1.46 (0.24) | 119 | 1.34 (0.14) | 314 | 1.49 (0.43) | 35 | 1.34 (0.10) | 597 |
| School and classes | 0.84 (0.12) | 197 | 1.09 (0.20) | 114 | 1.08 (0.18) | 145 | 0.43 (0.09) | 194 | 0.82 (0.07) | 650 |
aCA: California.
bFL: Florida.
cNY: New York.
dTX: Texas.
eMean: Mean message popularity scores are based on the aggregate number of upvotes (+1) and downvotes (−1) per message.
Intercorrelations at the school level.
| Variable | SHa | OSb | BUc | PHd | PSe | PBf | RSg | CLh | POi | SCj | ENk | RAl |
| SH | —m | 0.48 | −0.13 | −0.06 | 0.37 | 0.01 | −0.35 | 0.01 | −0.38 | 0.36 | 0.17 | −0.29 |
| OS | —n | —m | −0.33 | 0.16 | 0.00 | 0.05 | −0.66 | −0.30 | 0.07 | −0.08 | 0.20 | −0.62 |
| BU | —n | —n | —m | 0.52 | 0.37 | −0.35 | 0.36 | 0.01 | 0.46 | −0.07 | −0.07 | 0.10 |
| PH | —n | —n | —n | —m | 0.37 | −0.02 | 0.19 | −0.03 | 0.30 | −0.11 | 0.90 | −0.21 |
| PS | —n | —n | —n | —n | —m | −0.18 | 0.26 | 0.19 | 0.16 | 0.47 | −0.15 | −0.17 |
| PB | —n | —n | —n | —n | —n | —m | −0.20 | −0.11 | 0.13 | 0.03 | −0.21 | −0.08 |
| RS | —n | —n | —n | —n | —n | —n | —m | 0.09 | −0.09 | −0.02 | 0.09 | 0.29 |
| CL | —n | —n | —n | —n | —n | —n | —n | —m | −0.14 | 0.47 | −0.45 | 0.29 |
| PO | —n | —n | —n | —n | —n | —n | —n | —n | —m | −0.19 | −0.27 | −0.35 |
| SC | —n | —n | —n | —n | —n | —n | —n | —n | —n | —m | −0.26 | −0.01 |
| EN | —n | —n | —n | —n | —n | —n | —n | —n | —n | —n | —m | −0.33 |
| RA | —n | —n | —n | —n | —n | —n | —n | —n | —n | —n | —n | —m |
aSH: seeking help.
bOS: offering support.
cBU: bullying.
dPH: popularity of seeking help.
ePS: popularity of offering support.
fPB: popularity of bullying.
gRS: relationships and sex.
hCL: college living.
iPO: politics.
jSC: school and classes.
kEN: enrollment.
lRA: ranking.
mCells along the diagonal represent the same variable in both row and column, thus no correlation is reported.
nCells below the diagonal duplicate those above the diagonal and are left blank for clarity.
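A matrix of this shape can be produced by correlating each pair of school-level variables once and reporting only the cells above the diagonal. The sketch below assumes Pearson correlation, which the table does not name, and uses small made-up columns rather than the study's data; `pearson` and `upper_triangle` are hypothetical helper names.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def upper_triangle(columns):
    """Correlate every pair of variables once (cells above the diagonal)."""
    names = list(columns)
    return {
        (names[i], names[j]): round(pearson(columns[names[i]], columns[names[j]]), 2)
        for i in range(len(names))
        for j in range(i + 1, len(names))
    }


# Made-up values for three schools; not taken from the study.
toy = {"A": [1, 2, 3], "B": [2, 4, 6], "C": [3, 2, 1]}
print(upper_triangle(toy))
```

For the 12 variables in the table, this yields 12 x 11 / 2 = 66 distinct coefficients, matching the number of filled cells above the diagonal.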
Messaging behavior classification results.
| Metric and classifier | Seeking help | Offering support | Bullying | Humor |
| Accuracy | | | | |
| Random forest | | | | 0.6417 |
| SVMb | 0.6771 | 0.7501 | 0.9240 | |
| CNNc | 0.9098 | 0.6618 | 0.9146 | 0.7195 |
| Balanced accuracy | | | | |
| Random forest | | 0.7151 | 0.6763 | 0.6392 |
| SVM | 0.8007 | | | 0.6543 |
| CNN | 0.6557 | 0.7313 | 0.7702 | |
aThe highest accuracy and balanced accuracy achieved for each messaging behavior are italicized for emphasis.
bSVM: support vector machine.
cCNN: convolutional neural network.
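Accuracy and balanced accuracy can diverge sharply when a behavior is rare, as bullying is here (roughly 2% of messages in the frequency table). The sketch below uses the standard definition of balanced accuracy, the unweighted mean of per-class recall; the table does not spell out the formula, so treat this as an assumption. The labels are made up for illustration.

```python
def accuracy(y_true, y_pred):
    """Fraction of messages whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def balanced_accuracy(y_true, y_pred):
    """Unweighted mean of recall over the classes present in y_true."""
    recalls = []
    for label in sorted(set(y_true)):
        indices = [i for i, t in enumerate(y_true) if t == label]
        recalls.append(sum(y_pred[i] == label for i in indices) / len(indices))
    return sum(recalls) / len(recalls)


# Made-up data: 2 positive messages out of 100, classifier always says 0.
y_true = [1] * 2 + [0] * 98
always_negative = [0] * 100

print(accuracy(y_true, always_negative))           # 0.98
print(balanced_accuracy(y_true, always_negative))  # 0.5
```

A classifier that never predicts the rare class scores 0.98 on accuracy but only 0.5 on balanced accuracy, which is why the two metrics are reported separately.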
Topic classification results.
| Metric and classifier | Relationships and sex | College living | Politics | School and classes |
| Accuracy | | | | |
| Random forest | 0.8209 | | 0.8704 | 0.9387 |
| SVMb | | 0.8981 | | |
| CNNc | 0.7943 | 0.8533 | 0.9399 | 0.9010 |
| Balanced accuracy | | | | |
| Random forest | 0.7380 | 0.7323 | 0.7775 | 0.7899 |
| SVM | | 0.7842 | | |
| CNN | 0.7902 | | 0.8524 | 0.8147 |
aThe highest accuracy and balanced accuracy achieved for each topic are italicized for emphasis.
bSVM: support vector machine.
cCNN: convolutional neural network.