| Literature DB >> 34784364 |
Pouria Babvey1, Gabriela Gongora-Svartzman2, Carlo Lipizzi1, Jose E Ramirez-Marquez1.
Abstract
Disasters strike communities around the world, with a reduced time-frame for warning and action leaving behind high rates of damage, mortality, and years in rebuilding efforts. For the past decade, social media has indicated a positive role in communicating before, during, and after disasters. One important question that remained un-investigated is that whether social media efficiently connect affected individuals to disaster relief agencies, and if not, how AI models can use historical data from previous disasters to facilitate information exchange between the two groups. In this study, the BERT model is first fine-tuned using historical data and then it is used to classify the tweets associated with hurricanes Dorian and Harvey based on the type of information provided; and alongside, the network between users is constructed based on the retweets and replies on Twitter. Afterwards, some network metrics are used to measure the diffusion rate of each type of disaster-motivated information. The results show that the messages by disaster eyewitnesses get the least spread while the posts by governments and media have the highest diffusion rates through the network. Additionally, the "cautions and advice" messages get the most spread among other information types while "infrastructure and utilities" and "affected individuals" messages get the least diffusion even compared with "sympathy and support". The analysis suggests that facilitating the propagation of information provided by affected individuals, using AI models, will be a valuable strategy to pursue in order to accelerate communication between affected individuals and survival groups during the disaster and aftermath.Entities:
Mesh:
Year: 2021 PMID: 34784364 PMCID: PMC8594803 DOI: 10.1371/journal.pone.0259342
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Methodology overview.
The process includes fine-tuning the BERT model using the datasets in the literature, collecting new datasets to validate the fine-tuned model, as well as creating the network between users in the disaster-motivated network and evaluating the spread of each type of information in the network.
Fig 2Overview description of fine-tuning process.
According to the results from the first experiment the fine-tuned model shows a significant improvement compared to the previous methods. In the second experiment, confusion matrix is used to evaluate the model performance.
Performance comparison of BERT against baseline results.
As can be seen the performance is significantly improved.
| Baseline | BERT | |||||
|---|---|---|---|---|---|---|
| Label | Precision | Recall | F-1 | Precision | Recall | F-1 |
|
| 0.859 | 0.765 | 0.809 | 0.925 | 0.955 | 0.940 |
|
| 0.726 | 0.716 | 0.721 | 0.976 | 0.951 | 0.963 |
|
| 0.526 | 0.652 | 0.583 | 0.666 | 0.787 | 0.721 |
|
| 0.545 | 0.581 | 0.562 | 0.859 | 0.878 | 0.868 |
Most frequent words of different information types.
| Information type | Most frequent words and bi-grams |
|---|---|
|
| injured people, dead, killed, fire, death toll, rescue |
|
| building collapse, train derailment, plant explosion, helicopter crash, closed, water, fire, flood, homes, Refinería, oil, roof damage |
|
| raise funds, emergency, evacuation centers, help, relief efforts, support, food, water, victims, accepting donations, aid, rescue, shelter |
|
| breaking news, typhoon, tropical storm, weather bulletin, rains, metro, hits, alert, update, warning, stay at, emergency, flooding, tsunami, earthquake |
|
| please pray, keep safe, god bless, affected, hope, people, families, tragedy, sad |
|
| photo of, press conference, report, meteor, train |
Fig 3Confusion matrix of BERT model for the type of actor classification.
The model shows a better performance in detecting the posts from Media, Outsiders, Eyewitnesses, and Government, while the performance is lower for messages from NGOs and Business.
Fig 4Average PageRank score of different information types.
“Caution and advice” gets the highest diffusion rate in the network, followed by “Donations and volunteering” while the messages by “Affected individuals” do not get enough spread. In hurricane Harvey however, the diffusion rate of the messages by “Affected individual” and “Infrastructure and utilities” get more spread compared to hurricane Dorian.
Fig 5Average PageRank score of different user types.
The messages by “Government” get the highest spread in the network, followed by messages “Media” while the messages by “Eyewitnesses” get the lowest spread among others. In hurricane Harvey however, the spread of the “Eyewitnesses” messages get more spread compared to hurricane Dorian.
Fig 6The network between users for A: hurricane Dorian B: hurricane Harvey. The size of the nodes are proportional with the PageRank score of them.