| Literature DB >> 34545278 |
Kevin Roitero1, Michael Soprano1, Beatrice Portelli1, Massimiliano De Luise1, Damiano Spina2, Vincenzo Della Mea1, Giuseppe Serra1, Stefano Mizzaro1, Gianluca Demartini3.
Abstract
Recently, the misinformation problem has been addressed with a crowdsourcing-based approach: to assess the truthfulness of a statement, instead of relying on a few experts, a crowd of non-expert is exploited. We study whether crowdsourcing is an effective and reliable method to assess truthfulness during a pandemic, targeting statements related to COVID-19, thus addressing (mis)information that is both related to a sensitive and personal issue and very recent as compared to when the judgment is done. In our experiments, crowd workers are asked to assess the truthfulness of statements, and to provide evidence for the assessments. Besides showing that the crowd is able to accurately judge the truthfulness of the statements, we report results on workers' behavior, agreement among workers, effect of aggregation functions, of scales transformations, and of workers background and bias. We perform a longitudinal study by re-launching the task multiple times with both novice and experienced workers, deriving important insights on how the behavior and quality change over time. Our results show that workers are able to detect and objectively categorize online (mis)information related to COVID-19; both crowdsourced and expert judgments can be transformed and aggregated to improve quality; worker background and other signals (e.g., source of information, behavior) impact the quality of the data. The longitudinal study demonstrates that the time-span has a major effect on the quality of the judgments, for both novice and experienced workers. Finally, we provide an extensive failure analysis of the statements misjudged by the crowd-workers.Entities:
Keywords: COVID-19; Crowdsourcing; Information behavior; Misinformation
Year: 2021 PMID: 34545278 PMCID: PMC8444165 DOI: 10.1007/s00779-021-01604-6
Source DB: PubMed Journal: Pers Ubiquitous Comput ISSN: 1617-4909
Fig. 1The agreement between the PolitiFact experts and the crowd judgments. From left to right: C6 individual judgments; C6 aggregated with mean; C6 aggregated with median; C6 aggregated with majority vote
Fig. 2The agreement between the PolitiFact experts and the crowd judgments. From left to right: C6 aggregated with mean; C6 aggregated with median; C6 aggregated with majority vote. First row: E6 to E3; second row: E6 to E2. Compare with Fig. 1
Fig. 3Comparison with E6. C6 to C3 (first two plots) and to C2 (last two plots), then aggregated with the mean function. Best cut selected according to α (fist and third plot) and Φ (second and fourth plot). Compare with Fig. 1
Fig. 4C6 to C3 (first two plots) and to C2 (last two plots), then aggregated with the mean function. First two plots: E6 to E3. Last two plots: E6 to E2. Best cut selected according to α (first and third plots) and Φ (second and fourth plots). Compare with Figs. 1, 2, and 3
Statement position in the task versus: time elapsed, cumulative on each single statement (first row), CEMORD (second row), number of queries issued (third row), and number of times the statement has been used as a query (fourth row). The total and average number of queries is respectively 2095 and 262, while the total and average number of statements as query is respectively of 245 and 30.6
| Statement position | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
|---|---|---|---|---|---|---|---|---|
| Time (sec) | 299 | 282 | 218 | 216 | 223 | 181 | 190 | 180 |
| CEMORD | .63 | .618 | .657 | .611 | .614 | .569 | .639 | .655 |
| Number of queries | 352 (16.8%) | 280 (13.4%) | 259 (12.4%) | 255 (12.1%) | 242 (11.6%) | 238 (11.3%) | 230 (11.0%) | 230 (11.4%) |
| Statement as query | 22 (9%) | 32 (13%) | 31 (12.6%) | 33 (13.5%) | 34 (13.9%) | 30 (12.2%) | 29 (11.9%) | 34 (13.9%) |
Fig. 5On the left, distribution of the ranks of the URLs selected by workers, on the right, websites from which workers chose URLs to justify their judgments
Fig. 6Effect of the origin of a justification on the absolute value of the prediction error (top; cumulative distributions shown with thinner lines and empty markers) and the prediction error (bottom). Text copied/not copied from the selected URL
Experimental setting for the longitudinal study. All dates refer to 2020. Values reported are absolute numbers
| Number of workers | ||||||
|---|---|---|---|---|---|---|
| Date | Acronym | Batch1 | Batch2 | Batch3 | Batch4 | Total |
| May | Batch1 | 100 | – | – | – | 100 |
| June | Batch2 | – | 100 | – | – | 100 |
| Batch2from1 | 29 | – | – | – | 29 | |
| July | Batch3 | – | – | 100 | – | 100 |
| Batch3from1 | 22 | – | – | – | 22 | |
| Batch3from2 | – | 20 | – | – | 20 | |
| Batch3from1or2 | 22 | 20 | – | – | 42 | |
| August | Batch4 | – | – | – | 100 | 100 |
| Batch4from1 | 27 | – | – | – | 27 | |
| Batch4from2 | – | 11 | – | – | 11 | |
| Batch4from3 | – | – | 33 | – | 33 | |
| Batch4from1or2or3 | 27 | 11 | 33 | – | 71 | |
| Batchall | 100 | 100 | 100 | 100 | 400 | |
Abandonment data for each batch of the longitudinal study
| Number of workers | ||||
|---|---|---|---|---|
| Acronym | Complete | Abandon | Fail | Total |
| Batch1 | 100 (30%) | 188 (56%) | 46 (14%) | 334 |
| Batch2 | 100 (37%) | 129 (48%) | 40 (15%) | 269 |
| Batch3 | 100 (23%) | 220 (51%) | 116 (26%) | 436 |
| Batch4 | 100 (36%) | 124 (45%) | 54 (19%) | 278 |
| Average | 100 (31%) | 165 (50%) | 64 (19%) | 1317 |
Fig. 7Correlation values between the judgments (aggregated by the mean) across Batch1, Batch2, Batch3, and Batch4
Fig. 8Agreement between the PolitiFact experts and crowd judgments. From left to right, top to bottom: Batch1 (same as second boxplot in Fig. 1), Batch2, Batch3, Batch4, and Batchall
ρ (lower triangles) and τ (upper triangles) correlation values among batches for the aggregated scores of Fig. 8
| b1 | b2 | b3 | b4 | |
|---|---|---|---|---|
| pants-on-fire (0) | ||||
| b1 | – | 0.37 | 0.58 | 0.54 |
| b2 | 0.44 | – | 0.3 | 0.25 |
| b3 | 0.74 | 0.69 | – | 0.42 |
| b4 | 0.58 | 0.24 | 0.46 | – |
| false (1) | ||||
| b1 | – | 0.72 | 0.74 | 0.04 |
| b2 | 0.87 | – | 0.75 | 0.02 |
| b3 | 0.84 | 0.85 | – | -0.2 |
| b4 | -0.01 | -0.07 | -0.29 | – |
| mostly-false (2) | ||||
| b1 | – | 0.07 | 0.47 | 0.51 |
| b2 | 0.46 | – | 0.37 | 0.09 |
| b3 | 0.72 | 0.49 | – | 0.58 |
| b4 | 0.82 | 0.36 | 0.83 | – |
| half-true (3) | ||||
| b1 | – | 0.12 | 0.12 | 0 |
| b2 | -0.03 | – | 0.52 | 0.22 |
| b3 | 0.01 | 0.7 | – | 0.1 |
| b4 | 0.09 | 0.28 | 0.2 | – |
| mostly-true (4) | ||||
| b1 | – | 0.35 | 0.16 | 0.24 |
| b2 | 0.6 | – | -0.07 | 0.69 |
| b3 | 0.31 | 0.03 | – | -0.28 |
| b4 | 0.24 | 0.62 | -0.22 | – |
| true (5) | ||||
| b1 | – | 0.74 | 0.51 | 0.48 |
| b2 | 0.9 | – | 0.26 | 0.28 |
| b3 | 0.33 | 0.31 | – | 0.67 |
| b4 | 0.51 | 0.45 | 0.69 | – |
RBO bottom-heavy (lower triangles) and RBO top-heavy (upper triangles) correlation values among batches for the aggregated scores of Fig. 8. Document sorted by increasing aggregated score
| b1 | b2 | b3 | b4 | |
|---|---|---|---|---|
| pants-on-fire (0) | ||||
| b1 | – | 0.47 | 0.79 | 0.51 |
| b2 | 0.31 | – | 0.54 | 0.6 |
| b3 | 0.49 | 0.27 | – | 0.51 |
| b4 | 0.5 | 0.28 | 0.32 | – |
| false (1) | ||||
| b1 | – | 0.85 | 0.86 | 0.36 |
| b2 | 0.81 | – | 0.98 | 0.24 |
| b3 | 0.53 | 0.47 | – | 0.23 |
| b4 | 0.34 | 0.41 | 0.33 | – |
| mostly-false (2) | ||||
| b1 | – | 0.62 | 0.7 | 0.43 |
| b2 | 0.62 | – | 0.74 | 0.34 |
| b3 | 0.71 | 0.74 | – | 0.59 |
| b4 | 0.71 | 0.64 | 0.76 | – |
| half-true (3) | ||||
| b1 | – | 0.26 | 0.26 | 0.22 |
| b2 | 0.26 | – | 0.47 | 0.25 |
| b3 | 0.29 | 0.75 | – | 0.64 |
| b4 | 0.22 | 0.51 | 0.36 | – |
| mostly-true (4) | ||||
| b1 | – | 0.33 | 0.28 | 0.43 |
| b2 | 0.48 | – | 0.22 | 0.78 |
| b3 | 0.28 | 0.18 | – | 0.15 |
| b4 | 0.39 | 0.88 | 0.17 | – |
| true (5) | ||||
| b1 | – | 0.5 | 0.79 | 0.49 |
| b2 | 0.92 | – | 0.29 | 0.38 |
| b3 | 0.49 | 0.41 | – | 0.49 |
| b4 | 0.49 | 0.44 | 0.79 | – |
Correlation between α and Φ values; ρ in the lower triangles, τ in the upper triangles
| b1 | b2 | b3 | b4 | |
|---|---|---|---|---|
| b1 | – | 0.49 | 0.61 | 0.52 |
| b2 | 0.72 | – | 0.42 | 0.39 |
| b3 | 0.79 | 0.67 | – | 0.57 |
| b4 | 0.67 | 0.55 | 0.78 | – |
| b1 | – | 0.25 | 0.13 | -0.03 |
| b2 | 0.38 | – | 0.15 | 0.04 |
| b3 | 0.19 | 0.23 | – | 0.06 |
| b4 | -0.06 | 0.05 | 0.09 | – |
Fig. 9Distribution of the ranks of the URLs selected by workers for all the batches
Fig. 10Effect of the origin of a justification on the labelling error. Text copied/not copied from the selected URL
Fig. 11MAE and CEM for individual judgments for returning workers. Green indicates that returning workers (i.e., workers which saw the task for the second time) are better than returning workers, red indicates the opposite
Fig. 12Relative ordering of statements across batches according to MAE for each PolitiFact category. Statements are sorted according to decreasing MAE; rank 1 represents the highest MAE
Fig. 13MAE (aggregated by statement) against the number of days elapsed (from when the statement was made to when it was evaluated), for novice workers (top) and returning workers (bottom). Each point is the MAE of a single statement in a Batch. Dotted lines are the trend of MAE in time for the Batch, straight lines are the mean MAE for the Batch. The black dashed line is the global trend of MAE across all Batches
| Id | Claimant | Date | Ground | Statement |
|---|---|---|---|---|
| Truth | ||||
| S1 | Facebook User | 2020-25-03 | pants-on-fire | If your child gets this virus their going to hospital alone in a van with people they don’t know...to be with people they don’t know… you will be at home without them in their time of need. |
| S2 | Donald Trump | 2020-30-03 | pants-on-fire | We inherited a “broken test” for COVID-19. |
| S3 | Facebook User | 2020-19-03 | pants-on-fire | Says “there is no” COVID-19 virus. |
| S4 | Facebook User | 2020-25-03 | pants-on-fire | COVID literally stands for Chinese Originated Viral Infectious Disease. |
| S5 | Bloggers | 2020-26-02 | pants-on-fire | A post say “hair weave and lace fronts manufactured in China may contain the coronavirus.” |
| S6 | Youtube Video | 2020-29-02 | pants-on-fire | A video says that the Vatican confirmed that Pope Francis and two aides tested positive for coronavirus. |
| S7 | Ron Desantis | 2020-09-04 | pants-on-fire | This particular pandemic is one where I don’t think nationwide, there’s been a single fatality under 25. |
| S8 | Facebook User | 2020-16-03 | pants-on-fire | The government is closing businesses to stop the spread of coronavirus even though “the numbers are nothing compared to H1N1 or Ebola. Everyone needs to realize our government is up to something ...” |
| S9 | Facebook User | 2020-16-03 | pants-on-fire | the U.S. is developing an “antivirus” that includes a chip to track your movement. |
| S10 | Bloggers | 2020-31-03 | pants-on-fire | Italy arrested a doctor “for intentionally killing over 3,000 coronavirus patients.” |
| S11 | Facebook User | 2020-28-03 | false | Says COVID-19 remains in the air for eight hours and that everyone is now required to wear masks “everywhere.” |
| S12 | Facebook User | 2020-27-03 | false | Says to leave objects in the sun to avoid contracting the coronavirus. |
| S13 | Facebook User | 2020-23-03 | false | “Slices of lemon in a cup of hot water can save your life. The hot lemon can kill the proliferation of” the novel coronavirus. |
| S14 | Facebook User | 2020-23-03 | false | Says the CDC now says that the coronavirus can survive on surfaces for up to 17 days. |
| S15 | Viral Image | 2020-13-03 | false | Drinking “water a lot and gargling with warm water & salt or vinegar eliminates” the coronavirus. |
| S16 | Snapchat | 2020-23-03 | false | Says “special military helicopters will spray pesticide against the Corona virus in the skies all over the country.” |
| S17 | Bloggers | 2020-20-03 | false | Says COVID-19 came to the United States in 2019. |
| S18 | Bloggers | 2020-09-04 | false | Church services can’t resume until we’re all vaccinated, says Bill Gates. |
| S19 | Facebook User | 2020-10-04 | false | Mass vaccination for COVID-19 in Senegal was started yesterday (4/8) and the first 7 CHILDREN who received it “DIED on the spot.” |
| S20 | Facebook User | 2020-02-04 | false | Says video shows “the Chinese are destroying the 5G poles as they are aware that it is the thing triggering the corona symptoms.” |
| S21 | Turning Point Usa | 2020-25-03 | mostly-false | Says Nevada Governor Steve Sisolak “has banned the use of an anti-malaria drug that might help cure coronavirus.” |
| S22 | Marco Rubio | 2020-19-03 | mostly-false | For coronavirus cases “in the U.S. 38% of those hospitalized are under 35.” |
| S23 | Image | 2020-15-03 | mostly-false | COVID-19 started because we eat animals. |
| S24 | Facebook User | 2020-14-03 | mostly-false | Italy has decided not to treat their elderly for this virus. |
| S25 | Viral Image | 2020-13-03 | mostly-false | President Trump, COVID-19 coronavirus: U.S. cases 1,329; U.S. deaths, 38; panic level: mass hysteria. President Obama, H1N1 virus: U.S. cases, 60.8 million; U.S. deaths, 12,469; panic level: totally chill. Do you all see how the media can manipulate your life. |
| S26 | Facebook User | 2020-27-02 | mostly-false | Post says “the blood test for coronavirus costs $3,200.” |
| S27 | Deanna Lorraine | 2020-12-04 | mostly-false | Says of COVID-19 that Dr. Anthony Fauci ”was telling people on February 29th that there was nothing to worry about and it posed no threat to the US public at large. |
| S28 | Instagram Post | 2020-18-03 | mostly-false | Bill Gates and other globalists, in collaboration with pharmaceutical companies, are reportedly working to push tracking bracelets and ’invisible tattoos’ to monitor Americans during an impending lockdown. |
| S29 | Facebook User | 2020-28-03 | mostly-false | Says a “5G LAW PASSED while everyone was distracted” with the coronavirus pandemic and lists 20 symptoms associated with 5G exposure. |
| S30 | Facebook User | 2020-28-03 | mostly-false | Says for otherwise healthy people “experiencing mild to moderate respiratory symptoms with or without a COVID-19 diagnosis… only high temperatures kill a virus, so let your fever run high,” but not over 103 or 104 degrees. |
| S31 | Facebook User | 2020-31-03 | half-true | Ron Johnson said Americans should go back to work, because “death is an unavoidable part of life.” |
| S32 | Jeff Jackson | 2020-19-03 | half-true | North Carolina “hospital beds are typically 85% full across the state.” |
| S33 | Facebook User | 2020-15-03 | half-true | So Oscar Health, the company tapped by Trump to profit from covid tests, is a Kushner company. Imagine that, profits over national safety. |
| S34 | Brian Fitzpatrick | 2020-23-03 | half-true | We’ve got to give the American public a rough estimate of how long we think this is going to take, based mostly on the South Korean model, which seems to be the trajectory that we are on, thankfully, and not the Italian model. |
| S35 | Facebook User | 2020-10-03 | half-true | Harvard scientists say the coronavirus is ”spreading so fast that it will infect 70% of humanity this year.” |
| S36 | Drew Pinsky | 2020-03-03 | half-true | You’re more likely to die of influenza right now” than the 2019 coronavirus. |
| S37 | Michael Bloomberg | 2020-26-02 | half-true | Says of President Donald Trump’s actions on the coronavirus: “No. 1, he fired the pandemic team two years ago. No. 2, hes been defunding the Centers for Disease Control.” |
| S38 | Joe Biden | 2020-05-04 | half-true | 45 nations had already moved “to enforce travel restrictions with China” before the president moved. |
| S39 | Facebook User | 2020-07-04 | half-true | Says Donald Trump “himself has a financial stake in the French company that makes the brand-name version of hydroxychloroquine.” |
| S40 | Facebook User | 2020-01-04 | half-true | “Non-essential people get to file for unemployment and make two to three times more than normal,” but essential workers still on the job get no pay raise. |
| S41 | Facebook User | 2020-29-03 | mostly-true | Says a study projects Wisconsin’s coronavirus cases will peak on April 26, 2020. |
| S42 | Facebook User | 2020-20-03 | mostly-true | Says truck drivers are being turned away from fast-food restaurants during the COVID-19 pandemic. |
| S43 | Facebook User | 2020-18-03 | mostly-true | 2019 coronavirus can live for “up to 3 hours in the air, up to 4 hours on copper, up to 24 hours on cardboard up to 3 days on plastic and stainless steel.” |
| S44 | Facebook User | 2020-15-03 | mostly-true | Bill Gates told us about the coronavirus in 2015. |
| S45 | Chart | 2020-09-03 | mostly-true | Says 80% of novel coronavirus cases are “mild.” |
| S46 | Lou Dobbs | 2020-02-03 | mostly-true | The United States is “actually screening fewer people (for the coronavirus than other countries) because we don’t have appropriate testing.” |
| S47 | Charlie Kirk | 2020-24-02 | mostly-true | Three Chinese nationals were apprehended trying to cross our Southern border illegally. Each had flu-like symptoms. Border Patrol quickly quarantined them and assessed any threat of coronavirus. |
| S48 | Bernie Sanders | 2020-08-04 | mostly-true | It has been estimated that only 12% of workers in businesses that are likely to stay open during this crisis are receiving paid sick leave benefits as a result of the second coronavirus relief package. |
| S49 | Viral Image | 2020-08-04 | mostly-true | Says a California surfer was “alone, in the ocean,” when he was arrested for violating the state’s stay-at-home order. |
| S50 | Dan Patrick | 2020-31-03 | mostly-true | Says for the coronavirus, “the death rate in Texas, per capita of 29 million people, we’re one of the lowest in the country.” |
| S51 | Facebook User | 2020-02-04 | true | On February 7, the WHO warned about the limited stock of PPE. That same day, the Trump administration announced it was sending 18 tons of masks, gowns and respirators to China. |
| S52 | Pat Toomey | 2020-28-03 | true | My mask will keep someone else safe and their mask will keep me safe. |
| S53 | Andrew Cuomo | 2020-17-03 | true | No city in the state can quarantine itself without state approval. |
| S54 | Kelly Alexander | 2020-14-03 | true | Says “most” NC legislators are in the “high risk age group” for coronavirus |
| S55 | Viral Image | 2020-13-03 | true | Says Spectrum will provide free internet to students during coronavirus school closures. |
| S56 | Michael Dougherty | 2020-12-03 | true | Some states are only getting 50 tests per day, and the Utah Jazz got 58. |
| S57 | Blog Post | 2020-10-03 | true | Whole of Italy goes into quarantine |
| S58 | Dan Crenshaw | 2020-13-03 | true | Says longstanding Food and Drug Administration regulations “created barriers to the private industry creating a test quickly” for the coronavirus. |
| S59 | Viral Image | 2020-02-04 | true | Photo shows a crowded New York City subway train during stay-at-home order. |
| S60 | John Bel Edwards | 2020-05-04 | true | Says of the coronavirus threat, “there was not a single suggestion by anyone, a doctor, a scientist, a political figure, that we needed to cancel Mardi Gras.” |