| Literature DB >> 35261872 |
Livio Bioglio1, Ruggero G Pensa1.
Abstract
User-generated contents often contain private information, even when they are shared publicly on social media and on the web in general. Although many filtering and natural language approaches for automatically detecting obscenities or hate speech have been proposed, determining whether a shared post contains sensitive information is still an open issue. The problem has been addressed by assuming, for instance, that sensitive contents are published anonymously, on anonymous social media platforms or with more restrictive privacy settings, but these assumptions are far from being realistic, since the authors of posts often underestimate or overlook their actual exposure to privacy risks. Hence, in this paper, we address the problem of content sensitivity analysis directly, by presenting and characterizing a new annotated corpus with around ten thousand posts, each one annotated as sensitive or non-sensitive by a pool of experts. We characterize our data with respect to the closely-related problem of self-disclosure, pointing out the main differences between the two tasks. We also present the results of several deep neural network models that outperform previous naive attempts of classifying social media posts according to their sensitivity, and show that state-of-the-art approaches based on anonymity and lexical analysis do not work in realistic application scenarios.Entities:
Keywords: Content analysis; Privacy; Text classification
Year: 2022 PMID: 35261872 PMCID: PMC8892403 DOI: 10.1140/epjds/s13688-022-00324-y
Source DB: PubMed Journal: EPJ Data Sci ISSN: 2193-1127 Impact factor: 3.184
Figure 1A potentially sensitive post. The post does not mention any sensitive term or topic, but discloses information about the author and his friend Alice Green, and contains hidden spatiotemporal references that are immediately clear from the context
Figure 2A non-sensitive post mentioning sensitive topics and terms.The post contains several sensitive terms (struggling, suffering, COVID-19) and topics (health, economic crisis), but no private information is disclosed about any specific person
Guidelines and examples for the annotations
| Category | Guidelines | Examples |
|---|---|---|
| Sensitive | A post is “sensitive” if the text is understandable, i.e., written in clear English, and the annotator is certain that it contains information that violates a person’s privacy, not necessarily of the author of the post. A text violates a person’s privacy if contains the following types of information (non-exhaustive list): • current or upcoming moves; • information on events in the private sphere; • information on health or mental status; • information about one’s habits; • information that can help geolocalize the author of the post or other people mentioned; • information on the sentimental status; • considerations that may hint at the political orientation or religious belief of a mentioned person. In general, given the subjectivity of the topic, a post can be sensitive if the person reading it feels discomfort due to the private content it contains (and not to other moral considerations). | “...heading to the gym with *PROPNAME*, *PROPNAME* and my sista!!” “is feeling uninspired and unmotivated. Can someone else please pay her bills and move her into her new apartment?” “is very sore and very tired...” “Just wanted to thank everyone for all the support (and great tips) yesterday, it meant a lot! made it through yesterday without smoking at all...and still going strong! :)” “Lazy day around the house after the family has left.” “ARGH. 2 whole years! Congratulations, *PROPNAME*! You’ve tolerated me for a total of 730 days! Plus ‘getting to know you’ time... hahaha!” “is shaking his head wondering when some of his conservative christian friends became so hate filled that they will join any anti-obama group on facebook.” |
| Non sensitive | A post is “non-sensitive” if the text is understandable, i.e., written in clear English, and the annotator is sure that it does not contain information that violates privacy, according to the indications of the “sensitive” category. | “Fabulous weekend :-)” “When we are no longer able to change a situation – we are challenged to change ourselves. Viktor E. Frankl” “loves summer evenings” |
| Unknown | A post is of “unknown sensitivity” if the text is understandable, i.e., written in clear English, but the annotator is unable to tell if it contains information that is sensitive for privacy, because (non-exhaustive motivations): • the context is not sufficient to understand the sensitivity of the message; • the post is incomplete, i.e., the text does not contain the whole post, and from the available portion one is unable to understand its sensitivity; • the post contains a reference to a media (an image, a link, a GIF) which is considered essential for understanding the message, if the text alone is not sufficient to understand its sensitivity. | “black” “Goodbye *PROPNAME*. :(“ “I know 6 sick people at the moment, and now I’m...” “Check out what I’ve got written for The Book of *PROPNAME*. [link]” |
| Unintelligible | A post can be marked as “unintelligible” when: • it is written with slang/abbreviations or a grammar that does not render it understandable from a lexical point of view; • the post is written in a language other than English. | “hooked on PBS” “fml” “wahhhh,. di na ko. hurot na jud ako kwarta aning AI. huhuhu” “Pas de mauvaise nouvelle pour l’instant! Je presume donc que c’est une bonne chose!” |
Agreement computed according to Fleiss’ κ
| Group | Fleiss’ | 2 agree | 3 agree |
|---|---|---|---|
| Group 1 | 0.34 | 94.44% | 45.00% |
| Group 2 | 0.23 | 93.75% | 35.14% |
| Group 3 | 0.22 | 90.96% | 35.18% |
| Group 4 | 0.42 | 96.49% | 56.56% |
| Mean | 0.31 | 93.91% | 42.97% |
Details of the annotations. The last column contains the number of posts receiving at least one annotation for each class
| Class | 1 annot. | 2 annot. | 3 annot. | Sum |
|---|---|---|---|---|
| Sensitive | 2490 | 1892 | 1444 | 5826 |
| Non-sensitive | 2494 | 2827 | 2602 | 7923 |
| Unknown | 1529 | 183 | 7 | 1719 |
| Unintelligible | 357 | 150 | 208 | 715 |
| Total | 6870 | 5052 | 4261 | – |
Details on the datasets used
| Dataset | # posts | # sens | # ns | Avg # words |
|---|---|---|---|---|
| SENS2 | 8765 | 3336 | 5429 | 15.11 ± 12.58 |
| SENS3 | 4046 | 1444 | 2602 | 15.40 ± 12.67 |
| OMC | 17,860 | 10,793 | 7067 | 15.58 ± 11.00 |
| WH+TW | 8765 | 3336 | 5429 | 13.08 ± 8.26 |
Most relevant words for each class in dataset SENS2
| Sensitive | Non-sensitive | ||||||
|---|---|---|---|---|---|---|---|
| Overall rank | Word | Overall count | Relative frequency | Overall rank | Word | Overall count | Relative frequency |
| 22 | home | 33.40 ± 5.74 | 88.05 ± 5.62 | 8 | love | 45.00 ± 7.77 | 59.86 ± 5.92 |
| 26 | tomorrow | 31.40 ± 4.58 | 80.36 ± 6.81 | 10 | one | 44.50 ± 7.46 | 56.72 ± 8.67 |
| 29 | tonight | 30.20 ± 5.49 | 75.78 ± 5.95 | 19 | need | 33.70 ± 3.71 | 56.52 ± 4.91 |
| 27 | week | 30.90 ± 3.00 | 75.17 ± 5.40 | 6 | like | 55.50 ± 7.79 | 55.10 ± 5.52 |
| 9 | back | 44.70 ± 7.01 | 74.95 ± 7.04 | 13 | new | 40.50 ± 5.32 | 52.94 ± 6.83 |
| 5 | work | 56.80 ± 6.03 | 74.46 ± 4.95 | 20 | make | 33.70 ± 7.45 | 52.60 ± 8.10 |
| 15 | night | 37.30 ± 6.86 | 71.56 ± 7.04 | 16 | think | 36.30 ± 5.87 | 52.57 ± 8.28 |
| 1 | go | 97.40 ± 9.75 | 67.61 ± 5.45 | 25 | cant | 31.60 ± 6.33 | 48.57 ± 6.53 |
| 12 | today | 42.30 ± 4.79 | 66.66 ± 5.14 | 7 | time | 51.40 ± 8.41 | 46.59 ± 7.41 |
| 0 | propnam | 123.10 ± 11.53 | 65.36 ± 4.62 | 14 | good | 40.00 ± 7.82 | 46.02 ± 7.43 |
| 4 | im | 58.20 ± 6.53 | 62.80 ± 6.88 | 28 | happi | 30.30 ± 5.68 | 45.41 ± 10.64 |
| 3 | day | 72.80 ± 10.27 | 62.52 ± 6.28 | 11 | want | 44.40 ± 4.53 | 45.02 ± 9.49 |
| 17 | feel | 34.00 ± 6.65 | 62.38 ± 5.86 | 24 | come | 32.40 ± 5.44 | 42.50 ± 6.18 |
| 18 | see | 33.90 ± 6.30 | 60.74 ± 5.49 | 23 | know | 33.40 ± 7.44 | 41.55 ± 10.81 |
| 2 | get | 80.50 ± 6.60 | 58.63 ± 2.28 | 21 | got | 33.60 ± 8.10 | 41.46 ± 8.29 |
| 21 | got | 33.60 ± 8.10 | 58.54 ± 8.29 | 2 | get | 80.50 ± 6.60 | 41.37 ± 2.28 |
| 23 | know | 33.40 ± 7.44 | 58.45 ± 10.81 | 18 | see | 33.90 ± 6.30 | 39.26 ± 5.49 |
| 24 | come | 32.40 ± 5.44 | 57.50 ± 6.18 | 17 | feel | 34.00 ± 6.65 | 37.63 ± 5.86 |
| 11 | want | 44.40 ± 4.53 | 54.98 ± 9.49 | 3 | day | 72.80 ± 10.27 | 37.48 ± 6.28 |
| 28 | happi | 30.30 ± 5.68 | 54.60 ± 10.64 | 4 | im | 58.20 ± 6.53 | 37.20 ± 6.88 |
Most relevant words for each class in dataset SENS3
| Sensitive | Non-sensitive | ||||||
|---|---|---|---|---|---|---|---|
| Overall rank | Word | Overall count | Relative frequency | Overall rank | Word | Overall count | Relative frequency |
| 13 | home | 43.50 ± 4.33 | 92.82 ± 4.94 | 24 | peopl | 33.00 ± 3.13 | 70.34 ± 7.09 |
| 15 | tomorrow | 38.30 ± 3.83 | 90.45 ± 4.21 | 9 | one | 48.60 ± 6.59 | 66.78 ± 7.59 |
| 29 | tonight | 31.20 ± 6.25 | 86.73 ± 3.92 | 11 | love | 45.60 ± 5.04 | 64.11 ± 5.28 |
| 30 | weekend | 30.50 ± 4.79 | 85.68 ± 4.12 | 19 | think | 34.60 ± 3.92 | 63.46 ± 5.11 |
| 4 | work | 56.90 ± 6.59 | 84.55 ± 5.39 | 22 | dont | 33.50 ± 5.76 | 61.47 ± 5.66 |
| 5 | back | 56.10 ± 5.69 | 80.52 ± 3.67 | 7 | like | 55.60 ± 8.86 | 61.36 ± 6.06 |
| 1 | go | 110.00 ± 10.58 | 73.62 ± 2.63 | 18 | make | 34.60 ± 4.01 | 58.45 ± 8.25 |
| 0 | propnam | 149.10 ± 12.51 | 73.12 ± 5.17 | 23 | happi | 33.30 ± 3.68 | 57.76 ± 8.16 |
| 26 | night | 32.10 ± 5.47 | 70.74 ± 11.19 | 27 | know | 32.10 ± 6.89 | 55.75 ± 6.38 |
| 10 | today | 45.70 ± 3.80 | 69.28 ± 4.51 | 14 | new | 40.00 ± 6.46 | 49.41 ± 6.27 |
| 20 | got | 34.50 ± 2.42 | 67.46 ± 7.07 | 16 | good | 37.90 ± 7.77 | 47.78 ± 4.71 |
| 21 | come | 33.80 ± 5.63 | 67.43 ± 6.09 | 17 | want | 36.10 ± 6.12 | 46.69 ± 6.84 |
| 6 | im | 55.90 ± 6.59 | 67.15 ± 4.28 | 28 | feel | 31.40 ± 5.76 | 42.22 ± 10.66 |
| 3 | day | 72.50 ± 6.75 | 67.09 ± 3.33 | 8 | time | 54.60 ± 7.09 | 41.16 ± 6.60 |
| 12 | see | 44.60 ± 7.00 | 63.17 ± 5.80 | 25 | cant | 32.60 ± 4.17 | 38.58 ± 5.07 |
| 2 | get | 87.20 ± 8.20 | 62.64 ± 4.44 | 2 | get | 87.20 ± 8.20 | 37.36 ± 4.44 |
| 25 | cant | 32.60 ± 4.17 | 61.42 ± 5.07 | 12 | see | 44.60 ± 7.00 | 36.83 ± 5.80 |
| 8 | time | 54.60 ± 7.09 | 58.84 ± 6.60 | 3 | day | 72.50 ± 6.75 | 32.92 ± 3.33 |
| 28 | feel | 31.40 ± 5.76 | 57.78 ± 10.66 | 6 | im | 55.90 ± 6.59 | 32.85 ± 4.28 |
| 17 | want | 36.10 ± 6.12 | 53.31 ± 6.84 | 21 | come | 33.80 ± 5.63 | 32.57 ± 6.09 |
Most relevant words for each class in dataset OMC
| Sensitive | Non-sensitive | ||||||
|---|---|---|---|---|---|---|---|
| Overall rank | Word | Overall count | Relative frequency | Overall rank | Word | Overall count | Relative frequency |
| 1 | im | 79.70 ± 7.07 | 71.51 ± 3.59 | 2 | dont | 75.30 ± 8.76 | 54.77 ± 7.02 |
| 29 | year | 30.10 ± 2.77 | 70.39 ± 12.45 | 17 | your | 39.20 ± 5.75 | 54.17 ± 7.17 |
| 20 | much | 34.30 ± 6.52 | 68.82 ± 6.39 | 25 | way | 31.40 ± 5.25 | 53.71 ± 7.79 |
| 26 | friend | 31.20 ± 7.15 | 67.46 ± 8.38 | 18 | good | 38.20 ± 4.92 | 53.52 ± 9.68 |
| 14 | realli | 43.50 ± 6.38 | 63.46 ± 7.29 | 27 | that | 30.50 ± 6.26 | 52.89 ± 6.82 |
| 23 | work | 33.10 ± 5.34 | 61.98 ± 13.41 | 24 | tri | 31.90 ± 6.81 | 50.95 ± 8.36 |
| 21 | even | 34.20 ± 4.59 | 61.95 ± 10.70 | 8 | peopl | 55.60 ± 5.93 | 49.88 ± 5.09 |
| 16 | life | 42.00 ± 6.94 | 61.80 ± 5.75 | 10 | think | 48.50 ± 4.09 | 48.99 ± 9.29 |
| 7 | go | 56.80 ± 6.61 | 59.78 ± 5.72 | 28 | person | 30.20 ± 6.32 | 48.53 ± 11.36 |
| 15 | would | 42.20 ± 7.96 | 59.32 ± 4.06 | 22 | need | 33.60 ± 4.81 | 47.45 ± 8.00 |
| 5 | know | 60.00 ± 4.74 | 58.69 ± 4.51 | 9 | thing | 55.40 ± 7.31 | 47.25 ± 5.91 |
| 4 | feel | 63.20 ± 7.15 | 57.82 ± 5.32 | 12 | make | 46.80 ± 5.73 | 46.65 ± 10.65 |
| 11 | want | 48.00 ± 8.10 | 57.72 ± 8.62 | 13 | one | 45.00 ± 9.49 | 44.14 ± 6.84 |
| 0 | like | 91.70 ± 8.15 | 57.22 ± 4.25 | 6 | time | 57.10 ± 6.87 | 43.91 ± 7.14 |
| 19 | love | 37.50 ± 6.02 | 56.39 ± 7.99 | 3 | get | 74.10 ± 7.32 | 43.85 ± 7.06 |
| 3 | get | 74.10 ± 7.32 | 56.16 ± 7.06 | 19 | love | 37.50 ± 6.02 | 43.61 ± 7.99 |
| 6 | time | 57.10 ± 6.87 | 56.09 ± 7.14 | 0 | like | 91.70 ± 8.15 | 42.78 ± 4.25 |
| 13 | one | 45.00 ± 9.49 | 55.86 ± 6.84 | 11 | want | 48.00 ± 8.10 | 42.28 ± 8.62 |
| 12 | make | 46.80 ± 5.73 | 53.35 ± 10.65 | 4 | feel | 63.20 ± 7.15 | 42.18 ± 5.32 |
| 9 | thing | 55.40 ± 7.31 | 52.75 ± 5.91 | 5 | know | 60.00 ± 4.74 | 41.31 ± 4.51 |
Most relevant words for each class in WH+TW
| Sensitive | Non-sensitive | ||||||
|---|---|---|---|---|---|---|---|
| Overall rank | Word | Overall count | Relative frequency | Overall rank | Word | Overall count | Relative frequency |
| 295 | lesbian | 41.70 ± 6.36 | 97.49 ± 2.75 | 190 | ni**a | 58.80 ± 8.23 | 100.00 ± 0.00 |
| 357 | bi | 33.20 ± 6.63 | 97.43 ± 2.41 | 269 | rt | 44.60 ± 9.19 | 99.78 ± 0.69 |
| 91 | chat | 101.10 ± 7.82 | 96.59 ± 1.49 | 194 | tweet | 57.80 ± 7.08 | 99.24 ± 1.36 |
| 281 | whisper | 43.20 ± 7.39 | 95.85 ± 2.63 | 219 | da | 52.90 ± 11.61 | 98.94 ± 1.53 |
| 73 | boyfriend | 117.30 ± 15.94 | 95.19 ± 1.96 | 376 | kno | 30.60 ± 5.17 | 98.60 ± 2.53 |
| 142 | male | 71.00 ± 7.54 | 95.09 ± 3.00 | 169 | 63.50 ± 6.59 | 98.18 ± 1.50 | |
| 182 | relationship | 60.70 ± 11.44 | 93.27 ± 2.87 | 349 | snow | 34.30 ± 5.31 | 97.63 ± 2.29 |
| 249 | 18 | 47.30 ± 6.38 | 92.92 ± 3.63 | 314 | wat | 38.70 ± 7.56 | 97.20 ± 2.80 |
| 218 | ex | 53.20 ± 6.36 | 92.50 ± 3.08 | 121 | lmao | 79.80 ± 6.88 | 97.03 ± 1.39 |
| 237 | girlfriend | 49.40 ± 6.20 | 92.17 ± 3.65 | 287 | jus | 42.40 ± 6.72 | 96.90 ± 2.79 |
| 62 | sex | 136.80 ± 13.17 | 91.40 ± 2.78 | 159 | wit | 66.40 ± 6.47 | 96.79 ± 2.14 |
| 381 | attract | 30.30 ± 7.53 | 91.30 ± 3.81 | 289 | yea | 42.20 ± 8.42 | 96.30 ± 3.01 |
| 113 | femal | 86.00 ± 8.96 | 90.98 ± 3.01 | 257 | smh | 46.40 ± 7.52 | 95.63 ± 3.38 |
| 364 | older | 32.00 ± 5.72 | 89.81 ± 4.24 | 174 | bout | 62.30 ± 6.38 | 94.98 ± 2.96 |
| 288 | f | 42.30 ± 7.09 | 88.64 ± 5.65 | 144 | ya | 70.90 ± 5.26 | 94.43 ± 3.17 |
| 157 | messag | 66.80 ± 8.04 | 87.81 ± 4.99 | 3 | u | 613.90 ± 31.07 | 93.99 ± 0.93 |
| 167 | gay | 64.40 ± 5.78 | 86.50 ± 5.51 | 66 | ur | 125.10 ± 17.70 | 93.37 ± 2.98 |
| 373 | bf | 30.80 ± 5.47 | 86.11 ± 6.56 | 185 | yall | 59.20 ± 4.32 | 93.30 ± 2.87 |
| 374 | cheat | 30.80 ± 9.10 | 84.84 ± 9.58 | 263 | lil | 45.60 ± 6.38 | 92.81 ± 3.51 |
| 327 | secret | 37.20 ± 7.45 | 84.56 ± 6.29 | 6 | lol | 520.20 ± 30.12 | 92.36 ± 1.08 |
Categories of the Privacy Dictionary [26]
| Category name | Examples of words | Ratio SENS2 | Ratio SENS3 | Ratio OMC | Ratio WH+TW |
|---|---|---|---|---|---|
| bully*, troubled, interfere | 0.55 | 0.43 | 1.06 | 1.67 | |
| block, hidden, quiet | 0.90 | 0.80 | 1.00 | 2.24 | |
| consent, respect, discrete | 0.24 | 0.05 | 1.02 | 7.75 | |
| freedom, separation, alone | 0.81 | 1.05 | 1.48 | 1.45 | |
| post, display, accessible | 0.56 | 0.40 | 0.83 | 1.54 | |
| secret, intimate, data | 0.43 | 0.53 | 0.95 | 2.24 | |
| family, friend, group | 1.24 | 1.30 | 1.51 | 3.99 | |
| criminal, illegal, offence | 1.96 | 4.25 | 1.00 | 0.89 |
Psychological categories of LIWC [45]
| Dataset | Relevant dictionaries |
|---|---|
| SENS2 | we, you, |
| SENS3 | |
| OMC | |
| WH+TW |
Classification results (macro averaged F1-score) using dictionary features. Results on WH+TW are averaged on ten samples
| Dataset | Class. | All dict. | Psych. dict. | Priv. Dict |
|---|---|---|---|---|
| SENS2 | LR | 0.64 | 0.65 | 0.38 |
| RF | 0.65 | 0.66 | 0.41 | |
| SVM | 0.64 | 0.65 | 0.38 | |
| SENS3 | LR | 0.72 | 0.72 | 0.39 |
| RF | 0.70 | 0.69 | 0.42 | |
| SVM | 0.70 | 0.72 | 0.39 | |
| OMC | LR | 0.63 | 0.63 | 0.38 |
| RF | 0.67 | 0.66 | 0.40 | |
| SVM | 0.62 | 0.63 | 0.38 | |
| WH+TW | LR | 0.78 ± 0.01 | 0.77 ± 0.01 | 0.46 ± 0.01 |
| RF | 0.78 ± 0.01 | 0.78 ± 0.01 | 0.49 ± 0.02 | |
| SVM | 0.77 ± 0.01 | 0.77 ± 0.01 | 0.46 ± 0.01 |
Detailed classification results (F1-score) using dictionary features with the best classifier. Results on WH+TW are averaged on ten samples
| Dataset | Best class. | F1(sens.) | F1(non-sens.) | F1(macro) |
|---|---|---|---|---|
| SENS2 | RF | 0.53 | 0.78 | 0.66 |
| SENS3 | LR | 0.62 | 0.82 | 0.72 |
| OMC | RF | 0.75 | 0.56 | 0.66 |
| WH+TW | RF | 0.70 ± 0.01 | 0.84 ± 0.00 | 0.78 ± 0.01 |
Top-20 relevant features and their coefficients computed by the logistic regression classifier for the sensitive class
| Dataset | Feture name (coefficient value) |
|---|---|
| SENS2 | Law (0.1075), family (0.0968), OutcomeState (0.0725), health (0.0697), i (0.0617), informal (0.0586), Restriction (0.0537), affect (0.0486), shehe (0.0479), home (0.0463), prep (0.0450), focusfuture (0.0431), ipron (0.0421), Intimacy (0.0408), NormsRequisites (0.0356), ppron (0.0289), work (0.0265), conj (0.0257), friend (0.0228), anx (0.0212) |
| SENS3 | Law (0.1928), family (0.1639), affect (0.1133), OutcomeState (0.1006), informal (0.0900), health (0.0865), home (0.0836), Restriction (0.0822), pronoun (0.0822), focusfuture (0.0812), prep (0.0665), i (0.0628), shehe (0.0543), conj (0.0502), money (0.0487), friend (0.0454), reward (0.0417), sad (0.0388), number (0.0303), differ (0.0283) |
| OMC | pronoun (0.1831), family (0.0552), OutcomeState (0.0461), i (0.0398), Intimacy (0.0286), negemo (0.0283), bio (0.0263), conj (0.0236), friend (0.0216), sexual (0.0203), feel (0.0189), relativ (0.0188), informal (0.0177), male (0.0169), prep (0.0148), number (0.0145), adj (0.0142), quant (0.0142), posemo (0.0134), female (0.0107) |
| WH+TW | sexual (0.1358 ± 0.0312), female (0.1033 ± 0.0103), PrivTtl (0.0978 ± 0.0401), i (0.0833 ± 0.0489), ipron (0.0806 ± 0.1230), male (0.0744 ± 0.0113), cogproc (0.0703 ± 0.0087), ppron (0.0654 ± 0.1355), feel (0.0547 ± 0.0209), social (0.0534 ± 0.0063), conj (0.0483 ± 0.0080), number (0.0446 ± 0.0046), see (0.0427 ± 0.0252), prep (0.0414 ± 0.0051), affect (0.0355 ± 0.0399), article (0.0306 ± 0.0079), body (0.0295 ± 0.0099), health (0.0256 ± 0.0135), quant (0.0242 ± 0.0059), affiliation (0.0242 ± 0.0156) |
Detailed composition (number of neurons) of the Convolutional Neural Networks
| Model name | Node type | Size of the emb. layer | Size of conv. layer 1 | Size of conv. layer 2 | Size of dense layer 1 | Size of dense layer 2 |
|---|---|---|---|---|---|---|
| CNN1 | 1D-CNN | 100 | 256 | – | 256 | – |
| CNN2 | 1D-CNN | 100 | 128 | – | 128 | – |
| CNN3 | 1D-CNN | 100 | 256 | 128 | 64 | 32 |
| CNN4 | 1D-CNN | 200 | 128 | 128 | 128 | 128 |
Detailed composition (number of neurons) of the Recurrent Neural Networks
| Model name | Node type | Size of the emb. layer | Size of rec. layer 1 | Size of rec. layer 2 | Size of dense layer 1 | Size of dense layer 2 |
|---|---|---|---|---|---|---|
| RNN1 | GRU | 100 | 128 | 128 | 128 | 128 |
| RNN2 | LSTM | 100 | 256 | 128 | 64 | 32 |
| RNN3 | GRU | 200 | 128 | – | 128 | – |
| RNN4 | LSTM | 200 | 128 | 128 | 128 | 128 |
Classification results (macro-averaged F1-scores and percentage gain w.r.t the best bag-of-word classifier)
| Dataset | Classifier | F1-score | Gain |
|---|---|---|---|
| SENS2 | BoW-LR | 0.68 | – |
| BoW-RF | 0.67 | – | |
| BoW-SVM | 0.68 | – | |
| CNN1 | 0.73 | 7.35% | |
| CNN2 | 0.73 | 7.35% | |
| CNN3 | 0.72 | 5.88% | |
| CNN4 | 0.71 | 4.41% | |
| RNN1 | 0.77 | 13.24% | |
| RNN2 | 0.77 | 13.24% | |
| BERT | 0.78 | ||
| SENS3 | BoW-LR | 0.73 | – |
| BoW-RF | 0.73 | – | |
| BoW-SVM | 0.78 | – | |
| CNN1 | 0.81 | 3.85% | |
| CNN2 | 0.81 | 3.85% | |
| CNN3 | 0.60 | −11.76% | |
| CNN4 | 0.81 | 3.85% | |
| RNN3 | 0.87 | 11.54% | |
| RNN4 | 0.86 | 10.26% | |
| BERT | 14.10% | ||
| OMC | BoW-LR | 0.60 | – |
| BoW-RF | 0.59 | – | |
| BoW-SVM | 0.60 | – | |
| CNN1 | 0.63 | 5.28% | |
| CNN2 | 0.64 | 5.95% | |
| CNN3 | 0.65 | 8.29% | |
| CNN4 | 0.65 | 8.07% | |
| RNN1 | 0.65 | 11.67% | |
| RNN2 | 0.65 | 11.67% | |
| RNN3 | 0.66 | 13.33% | |
| RNN4 | 0.67 | 14.39% | |
| BERT | 0.68 | 13.32% | |
| WH+TW | BoW-LR | 0.80 ± 0.01 | – |
| BoW-RF | 0.78 ± 0.01 | – | |
| BoW-SVM | 0.81 ± 0.01 | – | |
| CNN1 | 0.77 ± 0.03 | −3.62 ± 3.89 | |
| CNN2 | 0.78 ± 0.03 | −4.51 ± 4.41 | |
| CNN3 | 0.75 ± 0.07 | −7.32 ± 8.31 | |
| CNN4 | 0.68 ± 0.15 | −15.15 ± 18.92 | |
| RNN1 | 0.83 ± 0.02 | 2.59 ± 1.78% | |
| RNN2 | 0.83 ± 0.01 | 3.08 ± 1.48% | |
| RNN3 | 0.83 ± 0.01 | 3.28 ± 2.00% | |
| RNN4 | 0.84 ± 0.02 | 3.85 ± 1.85% | |
| BERT | 0.88 ± 0.01 | 8.80 ± 1.55% |
Detailed classification results (F1-score) using BERT. Results on WH+TW are averaged on ten samples
| Dataset | F1(sens.) | F1(non-sens.) | F1(macro) |
|---|---|---|---|
| SENS2 | 0.73 | 0.83 | 0.78 |
| SENS3 | 0.85 | 0.92 | 0.89 |
| OMC | 0.75 | 0.61 | 0.68 |
| WH+TW | 0.85 ± 0.01 | 0.91 ± 0.01 | 0.88 ± 0.01 |
Cross-classification results (macro-averaged F1-scores). Classifiers are trained on the datasets reported in the row, and tested on the datasets reported in the columns
| Dataset | Class. | SENS2 | SENS3 | OMC | WH+TW |
|---|---|---|---|---|---|
| SENS2 | DICT-RF | – | 0.66 | 0.38 | 0.46 ± 0.00 |
| BoW | – | 0.75 | 0.44 | 0.50 ± 0.00 | |
| BERT | – | 0.90 | 0.50 | 0.58 ± 0.00 | |
| SENS3 | DICT-RF | 0.63 | – | 0.37 | 0.44 ± 0.00 |
| BoW | 0.64 | – | 0.42 | 0.47 ± 0.00 | |
| BERT | 0.74 | – | 0.48 | 0.51 ± 0.00 | |
| OMC | DICT-RF | 0.33 | 0.21 | – | 0.52 ± 0.01 |
| BoW | 0.51 | 0.51 | – | 0.55 ± 0.00 | |
| BERT | 0.56 | 0.59 | – | 0.52 ± 0.01 | |
| WH+TW | DICT-RF | 0.29 ± 0.00 | 0.18 ± 0.00 | 0.54 ± 0.01 | – |
| BoW | 0.48 ± 0.01 | 0.47 ± 0.02 | 0.50 ± 0.01 | – | |
| BERT | 0.45 ± 0.01 | 0.44 ± 0.01 | 0.57 ± 0.01 | – |