| Literature DB >> 35857776 |
Samrat Kumar Dey1, Md Mahbubur Rahman2, Arpita Howlader3, Umme Raihan Siddiqi4, Khandaker Mohammad Mohi Uddin5, Rownak Borhan5, Elias Ur Rahman5.
Abstract
Dengue fever is a severe disease spread by Aedes mosquito-borne dengue viruses (DENVs) in tropical areas such as Bangladesh. Since its breakout in the 1960s, dengue fever has been endemic in Bangladesh, with the highest concentration of infections in the capital, Dhaka. This study aims to develop a machine learning model that can use relevant information about the factors that cause Dengue outbreaks within a geographic region. To predict dengue cases in 11 different districts of Bangladesh, we created a DengueBD dataset and employed two machine learning algorithms, Multiple Linear Regression (MLR) and Support Vector Regression (SVR). This research also explores the correlation among environmental factors like temperature, rainfall, and humidity with the rise and decline trend of Dengue cases in different cities of Bangladesh. The entire dataset was divided into an 80:20 ratio, with 80 percent used for training and 20% used for testing. The research findings imply that, for both the MLR with 67% accuracy along with Mean Absolute Error (MAE) of 4.57 and SVR models with 75% accuracy along with Mean Absolute Error (MAE) of 4.95, the number of dengue cases reduces throughout the winter season in the country and increases mainly during the rainy season in the next ten months, from August 2021 to May 2022. Importantly, Dhaka, Bangladesh's capital, will see the maximum number of dengue patients during this period. Overall, the results of this data-driven analysis show that machine learning algorithms have enormous potential for predicting dengue epidemics.Entities:
Mesh:
Year: 2022 PMID: 35857776 PMCID: PMC9299345 DOI: 10.1371/journal.pone.0270933
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Description of the DengueBD dataset that is created and used in this research.
| DengueBD Dataset | |
|---|---|
| Attributes | Description |
| District | Highest affected Major cities of Bangladesh |
| Month | Exact month and time of admission of the patient |
| Rainfall | Amount of rainfall in a particular month |
| Maximum Temperature | The maximum temperature recorded in a particular month |
| Humidity | Average humidity recorded in a month |
| Patients | Total number of patients admitted |
System specifications.
| System Specification for the Experiment | |
|---|---|
| RAM | 8 GB |
| CPU | 1× single core hyper threaded i.e. (1 core, 2 threads) Xeon Processors @2.3Ghz |
| Cache | 46 MB |
| GPU | NVidia K80 GPU |
| GPU Memory | 16 GB |
| Session Limit | 9 hours |
| Disk Space | 100GB |
Comparative analysis of state-of-the-art research work in predicting dengue incidents globally using different data sources.
| Reference | Year | Dataset source and time period | Approach | Research Methods | Performance/Findings | Study Location |
|---|---|---|---|---|---|---|
| Paul et al. [ | 2021 | Data source: Climate data from the Bangladesh Meteorological Department. | Bias-adjusted GCM data for two RCPs to estimate the VC for A. aegypti mosquitoes | Variable cost (VC) mathematical approach was used | Dengue transmission season could eventually extend to all-year-round | Bangladesh |
| Hossain et al. [ | 2022 | Data source: Directorate General of Health Services (DGHS), Bangladesh and Bangladesh Meteorological Department (BMD) | Prediction of dengue incidents using seasonal climate data | Quasi poisson model and corrected quasi Akaike information criteria (QAICc) | Successfully predicted the largest outbreak of dengue cases in 2018 in Bangladesh | Bangladesh |
| Dourjoy et al. [ | 2021 | Data source: 600 dengue patients survey data and internet; 1047 data have been used | Prediction of dengue fever using ML approach | SVM and Random Forest | Accuracy: 69% (SVM) and 68% (RF) | Bangladesh |
| Martheswaran et al. [ | 2022 | Data source: Ministry of Health Infectious Disease Bulletin of Singapore, and PAHO/ WHO data of Honduras | Prediction of dengue fever outbreaks using climate variability and Markov chain Monte Carlo techniques | Random‑sampling‑based SIR and Bayesian Markov Chain Monte Carlo (MCMC) technique | 1. Seasonal model accounted for 98.5% and 92.8% of the variance in case count in the 2020 Singapore and 2019 Honduras outbreaks | Singapore and Honduras |
| McGough et al. [ | 2021 | Data source: Brazilian Ministry of Health, Brazil, and GMAO-NASA | Ensemble learning approach to forecast dengue fever epidemic years in Brazil | Support Vector Machine (SVM) and ensemble learning model | Accuracy: 75% | Brazil |
| Sarma et al. [ | 2020 | Data source: 209 dengue patients data collected from | ML algorithm-based dengue prediction | Decision Tree (DT) and Random Forest (RF) | Precision: 79% (DT), 67% (RF) | Bangladesh |
| Salim et al. [ | 2021 | Data source: private clinics, public clinics, and hospitals of Malaysia, and the source reported to the Ministry of Health through eNotifkasi, a real-time surveillance system | Prediction of dengue outbreak in using ML techniques | CART, SVM (linear, polynomial, rbf), Naïve Bayes, ANN | Accuracy:70%, Sensitivity: 14% Specificity:95% | Malaysia |
| Salami et al. [ | 2020 | Data source: European Centre for Disease Prevention and Control (ECDC) and Air passenger data from the international air transport association (IATA) | Prediction of dengue importation using ML algorithms | Random Forest (RF), XGBoost | AUC: 94% (RF, XGBoost) | 21 European Union countries |
| Guo et al. [ | 2017 | Data source: Guangdong provincial CDC, China National Notifiable Disease Surveillance System, China Meteorological data sharing service, Dengue search index from Baidu index website | A dengue forecast model using ML | Support vector regressor (SVR), Gradient Boosted regressor model (GBM), Negative binomial regressor (NBM), Least absolute shrinkage and selection operator (LASSO), and generalized additive model (GAM) | The SVR model accurately forecasted the 2014 large outbreak | China |
| Our proposed model (DengueBD dataset and multiple regression model) | 2022 | Data source: Directorate General of Health Services (DGHS), Bangladesh Bureau of Statistics, and Bangladesh Metrological Department | Prediction of dengue incidents using the regression model | Support vector regression (SVR) and Multiple linear regression (MLR) | Accuracy: 75% (SVR), 67% (MLR). Successfully estimated the dengue cases for the next 10 months in 11 different cities | Bangladesh |