Michele Miller, William Romine, Terry Oroszi.
Abstract
BACKGROUND: Social media allows researchers to study opinions and reactions to events in real time. One area needing more study is anthrax-related events. A computational framework that utilizes machine learning techniques was created to collect tweets discussing anthrax, further categorize them as relevant by the month of data collection, and detect discussions on anthrax-related events.
Keywords: Federal Bureau of Investigation; Twitter; anthrax; big data; biological weapon; digital health; infodemiology; infoveillance; internet; machine learning; public health threat; social listening; terrorism
Year: 2021 PMID: 34142975 PMCID: PMC8277308 DOI: 10.2196/27976
Source DB: PubMed Journal: JMIR Public Health Surveill ISSN: 2369-2960
Figure 1. Methods for a hierarchical supervised classification technique. Large black boxes indicate where supervised machine learning algorithms were trained, where supervised machine learning algorithms were tested, and where unsupervised machine learning algorithms were used. LDA: latent Dirichlet allocation.
Figure 2. Line graph showing the number of tweets collected during each day of data collection (September 25, 2017, to August 15, 2018). Vertical lines indicate when news was first published about one of the detected anthrax-related discussions.
Time of the news report or the first tweet concerning a detected discussion, time of the first tweet discussing the news article or the first retweet, and the time between the event and its detection.
| Event | Time of report | Time of detection | Time between report and detection |
| North Korea threatens a third World War | October 6, 2017, at 1:29 PM | October 6, 2017, at 5:35 PM | 4 hours 6 minutes |
| The Mueller investigation | November 24, 2017, at 3 AM | November 25, 2017, at 5:02 PM | ~1 day 15 hours |
| Brian Ross suspended | December 1, 2017 (clock time unknown) | December 1, 2017, at 4:14 PM | <24 hours |
| North Korea tests anthrax-mounted intercontinental ballistic missiles | December 19, 2017, at 7:32 PM | December 20, 2017, at 12:32 AM | 5 hours |
| North Korean defector has anthrax antibodies | December 26, 2017, at 9:51 AM | December 26, 2017, at 2:53 PM | 5 hours 2 minutes |
| Anthrax band announces a concert | January 11, 2018, at 3:47 AM | January 12, 2018, at 3:03 PM | ~12 hours |
| Seth Meyers tweets about an anthrax experience | January 26, 2018, at 3:27 AM | January 26, 2018, at 3:28 AM | 1 minute |
| #OnThisDay Colin Powell brought “anthrax” to the United Nations | February 5, 2018, at 9:46 AM | February 5, 2018, at 9:57 AM | 11 minutes |
| Anthrax band is a member of the Big 4 | February 8, 2018, at 2:40 AM | February 8, 2018, at 2:41 AM | 1 minute |
| Vanessa Trump anthrax scare | February 12, 2018, at 10 AM | February 12, 2018, at 6:14 PM | ~8 hours |
| Prince Harry anthrax scare | February 22, 2018, at 5:59 AM | February 22, 2018, at 10:58 AM | 4 hours 59 minutes |
| The Mueller investigation | March 18, 2018, at 2:20 AM | March 18, 2018, at 2:20 AM | <1 minute |
| The Mueller investigation | February 8, 2018 (clock time unknown) | April 10, 2018, at 4:36 AM | ~2 months |
| The Mueller investigation | May 3, 2018, at 9:59 PM | May 4, 2018, at 1:59 AM | 4 hours |
| Anthrax band’s European tour | May 11, 2018, at 4 AM | May 11, 2018, at 8 AM | 4 hours |
| Tweet about being a parent | May 25, 2018, at 11:34 AM | May 25, 2018, at 3:34 PM | 4 hours |
| Culling of hippopotamuses owing to anthrax | July 1, 2018, at 9 AM | July 1, 2018, at 12:59 PM | 3 hours 59 minutes |
| Culling of hippopotamuses owing to anthrax | July 18, 2018, at 3:09 AM | July 18, 2018, at 7:09 AM | 4 hours |
| Maxine Waters anthrax scare | July 24, 2018, at 3:22 PM | July 24, 2018, at 10:26 PM | 7 hours 4 minutes |
| The Mueller investigation | August 1, 2018, at 1:47 PM | August 1, 2018, at 1:47 PM | <1 minute |
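The last column of the table above is the difference between the two timestamps. A minimal sketch of that arithmetic follows, assuming both entries use the full "Month D, YYYY, at H:MM AM/PM" form with English month names; the function names are illustrative and not from the study. Rows with a missing clock time or an hour-only time (for example, "at 3 AM") would need separate handling.

```python
from datetime import datetime, timedelta

# Assumed timestamp layout, e.g. "October 6, 2017, at 1:29 PM".
FMT = "%B %d, %Y, at %I:%M %p"


def detection_lag(report: str, detection: str) -> timedelta:
    """Return the time elapsed between the report and its detection."""
    return datetime.strptime(detection, FMT) - datetime.strptime(report, FMT)


def format_lag(lag: timedelta) -> str:
    """Render a timedelta as whole hours and minutes."""
    hours, remainder = divmod(int(lag.total_seconds()), 3600)
    return f"{hours} hours {remainder // 60} minutes"


# First row of the table: North Korea threatens a third World War.
lag = detection_lag("October 6, 2017, at 1:29 PM", "October 6, 2017, at 5:35 PM")
print(format_lag(lag))  # 4 hours 6 minutes
```

Working with `timedelta` values rather than formatted strings also makes it straightforward to sort events by lag or to compute a median across rows.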
Precision, recall, and F1-score for the relevance machine learning algorithms. Logistic regression performed best on all three measures.
| Machine learning algorithm | F1-score | Precision | Recall |
| Support vector machine | 0.72 | 0.75 | 0.75 |
| Random forest | 0.78 | 0.78 | 0.79 |
| Naïve Bayes classifier | 0.79 | 0.79 | 0.79 |
| Logistic regression | 0.80 | 0.81 | 0.81 |
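For a single class, the F1-score is the harmonic mean of precision and recall. In the table above, some F1 values are lower than both the precision and the recall in the same row, which can happen when the scores are averaged over classes. The source does not state the averaging method, so the macro-averaging in this sketch is an assumption, and the per-class numbers are hypothetical values chosen only to illustrate the effect.

```python
def f1(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical per-class (precision, recall) pairs, NOT taken from the study.
per_class = {"relevant": (0.9, 0.6), "not relevant": (0.6, 0.9)}

n = len(per_class)
macro_precision = sum(p for p, _ in per_class.values()) / n
macro_recall = sum(r for _, r in per_class.values()) / n
# Macro F1 averages the per-class F1 values.
macro_f1 = sum(f1(p, r) for p, r in per_class.values()) / n

print(round(macro_precision, 2), round(macro_recall, 2))
print(round(macro_f1, 2))
print(round(f1(macro_precision, macro_recall), 2))
```

With these illustrative inputs, the averaged precision and recall are both 0.75, while the averaged F1 is 0.72. The harmonic mean of the two averages would be 0.75, so the two quantities should not be expected to agree.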
Results of topic modeling for each month of data collection (September 25, 2017, to August 15, 2018) (N=26 topics).
| Month | Topic |
| September and October | (#1) Threats from North Korea (#2) Responsible (#3) Culling of hippopotamuses |
| November | (#1) Vaccine (#2) Angela Merkel |
| December | (#1) Threats from North Korea (#2) India |
| January | (#1) Seth Meyers (#2) The Mueller investigation |
| February | (#1) New York Post (#2) Anthrax scare (#3) Anthrax scare (#4) Korean War |
| March | (#1) The Mueller investigation (#2) Travis Air Force Base |
| April | (#1) Abortion (#2) The Mueller investigation |
| May | (#1) The Mueller investigation (#2) Culling of hippopotamuses (#3) Being a parent |
| June | (#1) Cattle (#2) The Mueller investigation (#3) Abortion |
| July | (#1) Anthrax scare (#2) The Mueller investigation |
| August | (#1) The Mueller investigation |
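Topic modeling per month requires the tweets to be split into monthly groups first. The sketch below shows one way to do that grouping, merging September and October into a single group as in the table. The function names and sample tweets are hypothetical placeholders, and the topic model itself (Figure 1 mentions latent Dirichlet allocation) is not reproduced here.

```python
from collections import defaultdict
from datetime import date


def month_bucket(day: date) -> str:
    """Map a collection date to the month label used in the topic table."""
    # September and October share one row in the table.
    if day.month in (9, 10):
        return "September and October"
    return day.strftime("%B")


def group_by_month(tweets):
    """Group (date, text) pairs into lists of texts keyed by month label."""
    groups = defaultdict(list)
    for day, text in tweets:
        groups[month_bucket(day)].append(text)
    return dict(groups)


# Placeholder tweets for illustration only, not data from the study.
sample = [
    (date(2017, 9, 25), "placeholder tweet a"),
    (date(2017, 10, 6), "placeholder tweet b"),
    (date(2018, 2, 12), "placeholder tweet c"),
]
grouped = group_by_month(sample)
for label, texts in grouped.items():
    print(label, len(texts))
```

Each resulting list could then be passed to a topic model separately. Month names alone are unambiguous here because the collection window spans less than one year.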