| Literature DB >> 32994750 |
Sreenivasulu Madichetty1, Sridevi M1.
Abstract
Social media platform like Twitter is one of the primary sources for sharing real-time information at the time of events such as disasters, political events, etc. Detecting the resource tweets during a disaster is an essential task because tweets contain different types of information such as infrastructure damage, resources, opinions and sympathies of disaster events, etc. Tweets are posted related to Need and Availability of Resources (NAR) by humanitarian organizations and victims. Hence, reliable methodologies are required for detecting the NAR tweets during a disaster. The existing works don't focus well on NAR tweets detection and also had poor performance. Hence, this paper focus on detection of NAR tweets during a disaster. Existing works often use features and appropriate machine learning algorithms on several Natural Language Processing (NLP) tasks. Recently, there is a wide use of Convolutional Neural Networks (CNN) in text classification problems. However, it requires a large amount of manual labeled data. There is no such large labeled data is available for NAR tweets during a disaster. To overcome this problem, stacking of Convolutional Neural Networks with traditional feature based classifiers is proposed for detecting the NAR tweets. In our approach, we propose several informative features such as aid, need, food, packets, earthquake, etc. are used in the classifier and CNN. The learned features (output of CNN and classifier with informative features) are utilized in another classifier (meta-classifier) for detection of NAR tweets. The classifiers such as SVM, KNN, Decision tree, and Naive Bayes are used in the proposed model. From the experiments, we found that the usage of KNN (base classifier) and SVM (meta classifier) with the combination of CNN in the proposed model outperform the other algorithms. This paper uses 2015 and 2016 Nepal and Italy earthquake datasets for experimentation. The experimental results proved that the proposed model achieves the best accuracy compared to baseline methods. © Springer Science+Business Media, LLC, part of Springer Nature 2020.Entities:
Keywords: CNN; Disaster; NAR; Stacking
Year: 2020 PMID: 32994750 PMCID: PMC7517055 DOI: 10.1007/s11042-020-09873-8
Source DB: PubMed Journal: Multimed Tools Appl ISSN: 1380-7501 Impact factor: 2.757
Examples of need and availability of resource tweet
| Tweet no | Need of Resource tweets |
|---|---|
| 1. | Italy Earthquake Emphasizes The Need For Mobile Hospitals [URL] |
| 2. | New post: People urged to remove password from WiFi to help Italian earthquake relief [URL] |
| 3. | #Terremoto Numeri Protezione Civile: 840840840/803555 Serve sangue.Civil Protection Nos. Urgent need for blood donation #Italy #earthquake”, |
| 4. | Italy needs help! The earthquake left many people without home, hospitals need ambulances [URL] |
| Availability of Resource tweets | |
| 5. | Death toll rises to 84 in Italy - The Italian army has been mobilized to aid earthquake-affected areas [URL]. |
| 6. | 2 tent cities totaling about 50 tents being set up for quake victims in Italy’s central Marche region - ANSA |
| 7. | I just donated some $ to the Italian Red Cross Ïtaly’s Earthquake Victims Need Aid |
| Support[¨URL] via @bustle”. | |
| 8. | Search and rescue dogs |
| 20 ambulances on the ground in #Perugia following #earthquake volunteers from @crocerossa on the scene. |
Fig. 1The overview of the proposed stacked convolutional neural network
Proposed domain-specific features for identifying the NAR tweets
| S.No. | Features (Informative words) | Information category |
|---|---|---|
| 1). | Earthquake | Calamity |
| 2). | Food, Meals, Packets and Water | Nutriment |
| 3). | Urgent and Urgently | Emergency |
| 4). | Victims | Fatality |
| 5). | Aid, Need, Needed, Needs | Availability and Need |
| Tents, Medical, Help and Shelter | ||
| 6). | Delhi and Temple | Spot |
Details of Nepal and Italy earthquake datasets
| Dataset | Total no. of tweets | Distinct tweets | Availability tweets | Need tweets |
|---|---|---|---|---|
| Nepal Earthquake | 100k | 50,018 | 1,333 | 495 |
| Italy Earthquake | 180k | 70,487 | 233 | 177 |
Accuracy of CNN by varying the batch sizes
| S.No | Mini-batch size | Accuracy |
|---|---|---|
| 1. | 32 | 75.89 |
| 2. | ||
| 3. | 128 | 68.79 |
Inscriptions
| S.No | Methods | Abbreviations |
|---|---|---|
| 1. | CSS | CNN+SVM → SVM |
| 2. | CSK | CNN+SVM → KNN |
| 3. | CSNB | CNN+SVM → Naive Bayes |
| 4. | CSD | CNN+SVM → Decision tree |
| 5. | CDS | CNN+Decision tree → SVM |
| 6. | CDK | CNN+Decision tree→ KNN |
| 7. | CDN | CNN+Decision tree → Naive Bayes |
| 8. | CDD | CNN+Decision tree → Decision tree |
| 9. | CNBS | CNN+ Naive bayes classifier → SVM |
| 10. | CNBK | CNN+ Naive bayes → KNN |
| 11. | CNBNB | CNN+ Naive bayes → Naive Bayes |
| 12. | CNBD | CNN+ Naive bayes → Decision tree |
| 13. | CKS | CNN+ KNN → SVM |
| 14. | CKK | CNN+ KNN→ KNN |
| 15. | CKNB | CNN+KNN → Naive Bayes |
| 16. | CKD | CNN+ KNN→ Decision tree |
| 17. | SPROP | SVM with proposed features |
| 18. | Baseline | SVM with BoW model |
Comparison of SPROP with baseline model
| Dataset | Precision | F1-Score | Accuracy | |||
|---|---|---|---|---|---|---|
| Baseline | SPROP | Baseline | SPROP | Baseline | SPROP | |
| Nepal Earthquake | 26.5 | 34.5 | 59.5 | |||
Comparison of proposed models (CNN+SVM → classifier) on Nepal and Italy earthquake datasets
| Dataset | Model | Precision | F1-score | Accuracy | Execution time (ms) |
|---|---|---|---|---|---|
| Nepal Earthquake | CSS | 73.50 | 75.50 | 77.48 | 1.84 |
| CSK | 73.50 | 76.00 | 1.91 | ||
| CSNB | 76.00 | 74.00 | 77.30 | 1.86 | |
| CSD | 72.00 | 70.50 | 73.40 | 1.79 | |
| Italy Earthquake | CSS | 77.50 | 74.00 | 1.77 | |
| CSK | 73.00 | 70.00 | 72.57 | 2.39 | |
| CSNB | 79.50 | 71.50 | 73.45 | 1.59 | |
| CSD | 66.50 | 66.50 | 69.91 | 1.59 | |
| Average of Nepal and Italy Earthquake | CSS | 75.50 | 74.75 | 75.91 | 1.80 |
| CSK | 73.25 | 73.00 | 75.11 | 2.15 | |
| CSNB | 77.75 | 72.75 | 75.37 | 1.73 | |
| CSD | 69.25 | 68.50 | 71.65 | 1.69 |
Comparison of proposed models (CNN+ Decision tree → classifier) on Nepal and Italy earthquake datasets
| Dataset | Model | Precision | F1-score | Accuracy | Execution time (ms) |
|---|---|---|---|---|---|
| Nepal Earthquake | CDS | 72.50 | 74.00 | 76.50 | |
| CDK | 72.50 | 74.50 | 1.81 | ||
| CDNB | 73.00 | 72.50 | 76.40 | 1.83 | |
| CDD | 72.50 | 73.00 | 74.80 | 1.84 | |
| Italy Earthquake | CDS | 80.00 | 73.50 | 73.45 | 1.59 |
| CDK | 77.50 | 74.00 | 75.22 | 2.04 | |
| CDNB | 79.50 | 72.00 | 75.22 | 1.59 | |
| CDD | 68.50 | 68.00 | 69.03 | 2.04 | |
| Average of Nepal and | CDS | 76.25 | 73.75 | 74.97 | 1.65 |
| Italy Earthquake | CDK | 75.00 | 74.25 | 75.96 | 1.93 |
| CDNB | 76.25 | 72.25 | 75.81 | 1.71 | |
| CDD | 70.50 | 70.50 | 71.91 | 1.94 |
Comparison of proposed models (CNN+ Naive Bayes classifier → classifier) with variations on Nepal and Italy earthquake datasets
| Dataset | Model | Precision | F1-score | Accuracy | Execution time (ms) |
|---|---|---|---|---|---|
| Nepal Earthquake | CNBS | 72.00 | 74.00 | 76.50 | 1.75 |
| CNBK | 71.50 | 73.50 | 76.20 | ||
| CNBNB | 74.50 | 72.50 | 1.80 | ||
| CNBD | 74.50 | 71.00 | 74.60 | 1.77 | |
| Italy Earthquake | CNBS | 73.00 | 71.00 | 72.57 | 1.68 |
| CNBK | 73.00 | 66.50 | 70.80 | 2.12 | |
| CNBNB | 79.50 | 69.50 | 72.57 | 2.12 | |
| CNBD | 70.50 | 67.50 | 71.68 | 1.95 | |
| Average of Nepal and | CNBS | 72.50 | 72.50 | 74.53 | 1.71 |
| Italy Earthquake | CNBK | 72.25 | 70.00 | 73.50 | 1.92 |
| CNBNB | 77.00 | 71.00 | 74.76 | 1.96 | |
| CNBD | 72.50 | 69.25 | 73.14 | 1.86 |
Comparison of proposed models (CNN+ KNN → classifier) with variations on Nepal and Italy earthquake datasets
| Dataset | Model | Precision | F1-score | Accuracy | Execution time (sec) |
|---|---|---|---|---|---|
| Nepal Earthquake | CKS | 73.50 | 75.50 | ||
| CKK | 74.00 | 75.50 | 76.95 | ||
| CKNB | 77.00 | 72.50 | 76.77 | 1.80 | |
| CKD | 72.50 | 71.00 | 73.93 | 1.95 | |
| Italy Earthquake | CKS | 82.50 | 76.50 | 76.99 | 1.68 |
| CKK | 76.50 | 73.00 | 75.22 | 1.68 | |
| CKNB | 80.50 | 72.00 | 75.22 | 1.76 | |
| CKD | 69.00 | 69.00 | 69.91 | 2.47 | |
| Average of Nepal and | CKS | 78.00 | 76.00 | 77.25 | 1.73 |
| Italy Earthquake | CKK | 75.25 | 74.25 | 76.09 | 1.73 |
| CKNB | 78.75 | 72.25 | 75.99 | 1.78 | |
| CKD | 70.75 | 70.00 | 71.92 | 2.21 |
Accuracy of ablation experiments on Nepal and Italy earthquakes
| Classifier | Nepal Earthquake (Execution time) | Italy Earthquake (Execution time) |
|---|---|---|
| Proposed method (CKS) | 77.50 (1.79ms) | 76.99 (1.64ms) |
| CNN→SVM | 76.60 (1.76ms) | 71.68 (1.54ms) |
| KNN→SVM | 50.89 (1.78ms) | 56.64 (1.60ms) |
| KNN | 50.88 (0.47ms) | 56.63(0.294ms) |
| CNN | 76.77 (1.75ms) | 70.79 (1.57ms) |
Comparison of proposed model with existing methods on average of Nepal and Italy earthquake datasets using Accuracy parameter
| Models | Nepal Earthquake (Execution time in ms) | Italy Earthquake (Execution time in ms) | Average of both Nepal and Italy Earthquake | Year |
|---|---|---|---|---|
| Baseline [ | 59.50 (13.67) | 67.20 (1.23) | 63.35 | 2014 |
| Kims et al. [ | 76.77 (1.75) | 70.79 (1.59) | 73.88 | 2014 |
| Koustav et al. [ | 47.10 ( | 48.83 ( | 47.96 | 2018 |
| Sreenivasulu M et al. [ | 53.36 (0.40) | 67.25 (0.35) | 60.30 | 2017 |
| Proposed method | Present |
Contigency table
| Number of categories | Correct by model A | Incorrect by model A |
|---|---|---|
| Correct by Model B | ||
| Incorrect by Model B |
Results of the MCNemar-tests on variants of the proposed methods for both Nepal and Italy Earthquake datasets
| Methods | Method-1 | Method-2 | Method-3 |
|---|---|---|---|
| CSS | CSK (↑) | CSNB( | CSD (↑↑) |
| CDK | CDS (↑) | CDNB ( | CDD (↑↑) |
| CNBNB | CNBS ( | CNBK (↑↑) | CNBD (↑↑) |
| CKS | CKK (↑↑) | CKNB (↑↑) | CKD (↑↑) |
| CKS | CSS (↑↑) | CDK (↑↑) | CNBNB (↑↑) |
Results of the MCNemar-tests for ablation methods and the existing methods for both Nepal and Italy Earthquake datasets
| Proposed methods | Method-1 | Method-2 | Method-3 | Method-4 |
|---|---|---|---|---|
| CKS | CNN→SVM (↑↑) | KNN→SVM (↑↑) | KNN (↑↑) | CNN (↑↑) |
| Proposed Method (CKS) | Baseline [ | Kims et al. [ | Koustav et al. [ | Sreenivasulu M et al. [ |