| Literature DB >> 35993085 |
Misbah Ahmad1, Imran Ahmed2, Gwanggil Jeon3.
Abstract
The idea of sustainability aims to provide a protected operating environment that supports without risking the capacity of coming generations and to satisfy their demands in the future. With the advent of artificial intelligence, big data, and the Internet of Things, there is a tremendous paradigm transformation in how environmental data are managed and handled for sustainable applications in smart cities and societies. The ongoing COVID-19 (Coronavirus Disease) pandemic maintains a mortifying impact on the world population's health. A continuous rise in the number of positive cases produced much stress on governing organizations worldwide, and they are finding it challenging to handle the situation. Artificial Intelligence methods can be extended quite efficiently to monitor the disease, predict the pandemic's growth, and outline policies and strategies to control its transmission or spread. The combination of healthcare, along with big data, and machine learning methods, can improve the quality of life by providing better care services and creating cost-effective systems. Researchers have been using these techniques to fight against the COVID-19 pandemic. This paper emphasizes on the analysis of different factors and symptoms and presents a sustainable framework to predict and detect COVID-19. Firstly, we have collected a data set having different symptoms information of COVID-19. Then, we have explored various machine learning algorithms or methods: including Logistic Regression, Naive Bayes, Decision Tree, Random Forest Classifier, Extreme Gradient Boost, K-Nearest Neighbour, and Support Vector Machine to predict and detect COVID-19 lab results, using different symptoms information. The model might help to predict and detect the long-term spread of a pandemic and implement advanced proactive measures. The findings show that the Logistic Regression and Support Vector Machine outperformed from other machine learning algorithms in terms of accuracy; algorithms exhibit 97.66% and 98% results, respectively.Entities:
Keywords: Artificial intelligence; Big data; COVID-19; Internet of Things; Machine learning; Sustainable healthcare
Year: 2022 PMID: 35993085 PMCID: PMC9379242 DOI: 10.1007/s10668-022-02584-0
Source DB: PubMed Journal: Environ Dev Sustain ISSN: 1387-585X Impact factor: 4.080
Fig. 1Confirmed COVID-19 positive cases and recorded deaths per million peoples
Fig. 2Proposed intelligent machine learning-based COVID-19 detection system. The collected raw clinical data set is pre-processed for missing values; the data attributes are then converted into final attributes and binary form, the data set is splited into training and testing samples. The training data are utilized for the training of different algorithms; the trained model is used with testing data for the detection of COVID-19 virus; the final patient status is determined based on predicted lab results. Finally, the performance of the algorithms are evaluated using different parameters
Detailed description of different attributes of the clinical data set used for experimentation
| S. no. | Attribute | Attribute code | Attribute description |
|---|---|---|---|
| 1 | Patient id | Patient number | In numbers |
| 2 | Age | Age | Age in years |
| 3 | Gender | Gender | Male = 0, Female = 1 |
| 4 | Is patient symptomatic? | Isptsymptamatic | Yes = 1, No = 0 |
| 5 | Flu | Flu | Yes = 1, No = 0 |
| 6 | Fever | Fever | Yes = 1, No = 0 |
| 7 | Sore throat | SoreThroat | Yes = 1, No = 0 |
| 8 | Cough | Cough | Yes = 1, No = 0 |
| 9 | Breathing issue | Breathingissue | Yes = 1, No = 0 |
| 10 | Headache | Headache | Yes = 1, No = 0 |
| 11 | Cardiovascular & hypertension | Cardiovascular and hypertension | Yes = 1, No = 0 |
| 12 | Chronic lung disease | chroniclung | Yes = 1, No = 0 |
| 13 | Foreign travel history | ForeignTravel History | Yes = 1, No = 0 |
| 14 | Test specimen information | Specimen information | Nasopharyngeal swab = 0, Oropharyngeal swab=1 |
| 15 | Lab results | LabResults | Positive = 1, Negative = 0 |
| 16 | Patient final status | PatientFinal Status | Expired = 0, Active = 1, Recovered = 2 |
Fig. 3ROC curve plotted using TPR versus FPR
Fig. 4Precision, Recall and F1-Score, of different machine learning algorithms
Fig. 5Accuracy comparison of different machine learning algorithms
Comparison results of different machine learning algorithms
| S. no. | Algorithm | Accuracy (%) | Recall (%) | Precision (%) | F1-Score (%) |
|---|---|---|---|---|---|
| 1 | Logistic Regression | 97.66 | 98 | 97 | 98 |
| 2 | Naive Bayes | 97 | 96 | 95 | 96 |
| 3 | Decision Tree | 97 | 96 | 95 | 96 |
| 4 | Extreme Gradient Boost | 94.91 | 93 | 94 | 93 |
| 5 | K-Nearest Neighbour | 97 | 96 | 94 | 96 |
| 6 | Random Forest | 97.50 | 95 | 95 | 95 |
| 7 | Support Vector Machine | 98 | 98 | 97 | 98 |