| Literature DB >> 35095194 |
Lida Huang1, Panpan Shi2, Haichao Zhu2, Tao Chen1.
Abstract
Emergency events require early detection, quick response, and accurate recovery. In the era of big data, social media users can be seen as social sensors to monitor real-time emergency events. This paper proposed an integrated approach to detect all four kinds of emergency events early, including natural disasters, man-made accidents, public health events, and social security events. First, the BERT-Att-BiLSTM model is used to detect emergency-related posts from massive and irrelevant data. Then, the 3 W attribute information (what, where, and when) of the emergency event is extracted. With the 3 W attribute information, we create an unsupervised dynamical event clustering algorithm based on text similarity and combine it with the supervised logistical regression model to cluster posts into different events. Experiments on Sina Weibo data demonstrate the superiority of the proposed framework. Case studies on some real emergency events show that the proposed framework has good performance and high timeliness. Practical applications of the framework are also discussed, followed by future directions for improvement.Entities:
Keywords: BERT; Bi-LSTM; Early detection; Emergency event; Social media; Text clustering
Year: 2022 PMID: 35095194 PMCID: PMC8782712 DOI: 10.1007/s11069-021-05081-1
Source DB: PubMed Journal: Nat Hazards (Dordr) ISSN: 0921-030X
Fig. 1The process flow of the EDEE framework
Emergency events considered in this paper and their seed words
| Main type | Subtype | Seed words and their weights |
|---|---|---|
| Natural disaster | Flood | 爆发山洪/洪水 0.5, 洪水 0.4, 山洪 0.4, 内涝 0.4 |
| Typhoon | 台风登陆/路径 0.8, 台风 0.6, 风力 0.6 | |
| Tornado | 龙卷风 0.5, 龙卷 0.3, 龙吸水 0.3 | |
| Rainstorm | 暴雨 0.6, 强降雨 0.4, 雨 0.3, 降雨 0.3, 降水 0.3, 大雨 0.2 | |
| Snowstorm | 雪灾 0.7, 暴雪 0.6, 大雪 0.6, 雪 0.4, 降雪 0.1 | |
| Earthquake | 地震 0.4, 震源 0.2, 震中 0.2, 震感 0.2, 强震 0.2 | |
| Landslide | 滑坡 0.5, 泥石流 0.5 | |
| Collapse | 崩塌 0.6, 山体 0.2 | |
| Forest fire | 山林火灾 0.8, 林火 0.7, 森林大火 0.7, 森林火灾 0.7, 山火 0.7 | |
| Accident | Mine accident | 矿难 0.7, 冒顶 0.7, 片帮 0.7 |
| Building collapse | 坍塌 0.6, 倒塌 0.5, 建筑 0.1, 房屋 0.1, 脚手架 0.1, 塔吊 0.1 | |
| Fire accident | 火灾 0.6, 大火 0.6, 火情 0.2, 火警 0.2, 着火 0.2, 失火 0.2, 起火 0.2 | |
| Explosion | 爆炸 0.4, 爆燃 0.3, 巨响 0.2 | |
| Leakage accident | 泄漏 0.6, 燃气 0.1, 毒气 0.1 | |
| Road accident | 交通事故 0.4, 车祸 0.4, 撞死 0.3, 撞人 0.3, 撞车 0.3, 相撞 0.3, 追尾 0.3, 翻车 0.3, 肇事 0.1 | |
| Drowning | 溺亡 0.4, 溺水 0.3 | |
| Capsize accident | 翻船 1, 沉船 0.8 | |
| Railway accident | 脱轨 0.6, 火车事故 0.4 | |
| Aviation accident | 坠机 0.6, 空难 0.5, 坠毁 0.4, 坠落 0.2 | |
| Falling accident | 坠楼 0.6, 坠落 0.2 | |
| Stampede event | 踩踏 1 | |
| Other accident | 事故 0.5, 死亡 0.2, 身亡 0.1, 意外 0.1 | |
| Public health event | Avian influenza | 禽流感 0.6 |
| Plague | 鼠疫 0.6, 肺鼠疫 0.6 | |
| COVID-19 | 新冠 0.4, 新型冠状病毒 0.4, 冠状病毒 0.2, 肺炎 0.1 | |
| Swine fever | 猪瘟 0.4, 猪瘟疫 0.3 | |
| Food poisoning | 中毒 0.4, 食物 0.1 | |
| Other public health events | 感染 0.1, 病毒 0.1, 疫情 0.1, 疫苗 0.1 | |
| Social security event | Major criminal case | 刑事案件 0.7, 刑事拘留 0.7, 杀人/害 0.4, 猥亵 0.3, 暴打 0.3, 遇/被害 0.3, 嫌疑 0.3, 凶手 0.2, 砍死 0.2, 殴打 0.2, 斗殴 0.2 |
| Mass protest | 群体性事件 0.8, 罢工 0.3, 游行 0.3, 非法集结 0.3, 暴乱 0.2, 暴动 0.2, 暴徒 0.2, 示威 0.2, 集会 0.2, 聚众 0.1 |
Fig. 2The BERT-Att-BiLSTM model architecture
Assignment rules of s, s, and s
| Parameter | Value |
|---|---|
If the event types are the same, Else, | |
If locations are the same at the village/town level, Else if the locations are the same at the county/district level and at least one post lacks the village/street information, Else if the locations are the same at the city level and at least one post lacks the county/district information, Else if the locations are the same at the province level and at least one post lacks the city information, Else, | |
If the time difference is less than 1 min, Else if the time difference is less than 1 h, Else if the time difference is less than 1 day, Else if the time difference is less than 3 days, Else, |
Fig. 3The number of posts related to different types of emergencies
Classification performance of different models
| Model | Precision | Recall | F1 measure |
|---|---|---|---|
| Word2Vec-Att-BiLSTM | 0.76 | 0.79 | 0.77 |
| BERT | 0.84 | 0.89 | 0.86 |
| BERT-Att-BiLSTM | 0.85 | 0.93 | 0.89 |
Fig. 4Classification performance of different models for different types of events
Fig. 5Publishing time distribution of posts related to the Xiangshui Explosion from 14:00 to 23:59 on 21 March
Fig. 6High-frequency word evolution and visualization of detected events from 14:00 to 23:59 on 21 March
Fig. 7Publishing time distribution of posts related to COVID-19 in Wuhan
Fig. 8The post was first detected (a) and the post with maximum forwarding times (b) for COVID-19
Fig. 9Word clouds of posts about COVID-19 from 30 December 2019, 6 to January 2020
Fig. 10Publishing time distribution of posts related to COVID-19 in other cities and provinces
Fig. 11The system interface
Fig. 12Statistical chart of detected emergencies from June to November 2020
Fig. 13Location distribution map of the detected emergencies
Fig. 14Streamgraph of detected emergencies of different types
Fig. 15The interval between the time of the emergency being detected and the time of the emergency occurring