| Literature DB >> 34566404 |
Shivam Chahar1, Pradeep Kumar Roy2.
Abstract
Coronavirus disease is communicable and inhibits the infected person's immune system. It belongs to the Coronaviridae family and has affected 213 nations and territories so far. Many kinds of studies are being carried out to filter advice and provide oversight to monitor this outbreak. A comparative and brief review was carried out in this paper on research concerning the early identification of symptoms, estimation of the end of the pandemic, and examination of user-generated conversations. Chest X-ray images, abdominal computed tomography scan, tweets shared on social media are several of the datasets used by researchers. Using machine learning and deep learning methods such as K-means clustering, Random Forest, Convolutional Neural Network, Long Short-Term Memory, Auto-Encoder, and Regression approaches, the above-mentioned datasets are processed. The studies on COVID-19 with machine learning and deep learning models with their results and limitations are outlined in this article. The challenges with open future research directions are discussed at the end. © CIMNE, Barcelona, Spain 2021.Entities:
Year: 2021 PMID: 34566404 PMCID: PMC8449694 DOI: 10.1007/s11831-021-09641-3
Source DB: PubMed Journal: Arch Comput Methods Eng ISSN: 1134-3060 Impact factor: 8.171
Fig. 1Top ten impacted countries in the world as of July 05,2021
Fig. 2Status of covid-19 in India with respect to the world till July 05, 2021
Fig. 3The number of new COVID -19 cases and deaths in India
Fig. 4Multiple waves of covid-19 outbreak in different countries
Fig. 5Covid-19 waves in India by July 05, 2021
Fig. 6Steps to extract the relevant articles
Research findings and limitations using Deep Learning frameworks
| Objective | Dataset | Time Frame | Source of Dataset | Methods | Model performance | Limitation |
|---|---|---|---|---|---|---|
| To forecast of the future of covid-19 cases [ | Total 346 people from five countries | January 20, 2020 to April 4, 2020 | Not available | i) ARIIMA and wavelet-based technique ii) Regression Tree | Predictive model | – |
| To Predict the trends and stopping time of covid-19 outbreak [ | Not available | Till march 31,2020 | Two universities | LSTM networks | End by June 2020 | – |
| Predicting trend of covid-19 in China [ | Current Covid-19 and 2003 SARS epidemic data | Not available | National Health Commision of China | SEIR and LSTM | Peak of 4,000 daily infections between February 4 to 7, 2020 in China | Limited number of factors are considered |
| Pandemic prediction [ | Confirmed covid-19 cases | Not available | (i) Johns Hopkins University, (ii) WHO, (iii) Ding Xiang Yuana | SEIR | Reach peak in late May 2020 | – |
| Forecasting of covid-19 in China [ | Confirmed covid-19 cases | (i) News Networks (ii) WHO | Modified stacked auto-encoder | Predictive model | – | |
| Detection of covid-19 cases using chest X-ray images [ | 13,975 CXR images | Not available | COVIDx dataset | COVID-Net | Predictive model | Not a production ready solution |
| To determine the uncertainity and interpretability[ | 5,941 PA chest radiography images | Not available | (i) Dr. Joseph Cohen’s Github repository and (ii) Kaggle’s Chest X-Ray Images | Bayesian Deep Learning | Predictive model | Study is only limited to estimating uncertainty in already developed models |
| Detection of coronavirus disease using X-ray images [ | 100 chest X-ray images | Not available | GitHub | Convolutional neural networks | Accuracy : 98% | Small dataset |
| covid-19 outbreak prediction in India [ | Covid-19 cases in India | January 30th 2020 to March 30th 2020 | Johns Hopkins University | SEIR and Regression Model | RMSLE : 1.75 | Limited data |
| Prediction of country wise risk of covid-19 [ | Confirmed covid-19 cases | January 22 2020 to March 10 2020 | Not available | LSTM networks | Accuracy: 78% | Model achieved less accuracy |
| Forecasting of covid-19 pandemic [ | Not available | January 21, 2020 to April 02, 2020 | Sourceb | Machine Learning | 97% confidence interval | Small Dataset was taken for Senegal |
| Screening for covid-19 disease using CT images [ | 453 CT images | Not available | Not available | Deep Learning | AUC: 0.90 | Training dataset is small |
| Detection for covid-19 from chest CT using weak label [ | 630 CT images | December 13, 2019 to February 6, 2020 | Not available | DeCoVNet | AUC: 0.959 | Dataset from single hospital |
| Automatic detection of covid-19 from X-ray images [ | 2,870 X-ray images | Not available | Public medical repositories | Deep Learning, Transfer Learning | Accuracy : 96.78% | Small sample of positive cases |
| Prediction of the pandemic trend of covid-19 in Italy [ | Daily reports of covid-19 | Jan 22,2020 to Apr 02,2020 | Johns Hopkins University | Extended susceptible-infected-removed | 95% CI | Asymptomatic and unconfirmed cases may be ignored. |
| Identification of covid-19 through mobile phone based survey [ | Not available | Not available | Not available | AI algorithms | Difficult to collect data that is used for this model | |
| Detection of covid-19 using Artificial Intelligence [ | 260 chest X-ray images | Not available | (i) GitHub, and (ii) Kaggle | Convolutional neural network | Accuracy: 100% | Small dataset was taken to validate model |
| Predicting outbreak trend of coronavirus disease in India [ | China’s covid-19 cases | Jan 22, 2020 to April 3, 2020 | Kaggle | Machine Learning | Forecasting prediction for India | Limited to Indian context |
| Predicting the trends in covid-19 outbreak in Iran [ | Iran’s covid-19 data | Feb 15, 2020 to Mar 18, 2020 | (i) WorldOmeter website, and (ii) Google trends | LSTM and Linear Regression | RMSE : 7.562 | Limited Google Search data |
| Analysis of confirmed cases using AI [ | Not available | Not available | Binary Classification and Regression model | Accuracy : 95.7% | Study only takes weather parameters in consideration | |
| AI model to distinguish covid-19 cases [ | 4,356 3D chest CT images | Not available | Six medical centers | Transfer Learning | AUC : 0.96 | Train and Test from same dataset. |
| covid-19 case detection [ | 127 X-ray | Not available | ChestX-ray8 database | Deep Learning | Accuracy : 98.09% | Limited number of covid-19 X-ray images. |
| AI System for covid-19 [ | 6,752 CT scans images | Not available | CC-CCII | Deep Learning | Accuracy : 92.49% | |
| Diagnosis of coronavirus disease from X-ray images [ | 5,949 posteroanterior chest radiography images | Not available | COVIDx | COVIDiagnosis-Net | Accuracy : 98.3% | Model require Fine-tuned hyperparameters for accurate predictions |
| Accurate model for covid-19 prediction [ | 605 real-world data | Not available | Not available | Deep Learning | Accuracy : 94.5% | Limited dataset |
| Time dependent SIR model for covid-19 [ | Not available | Jan 15,2020 to Mar 2,2020 | NHC | SIR model | Day-wise prediction | Limited domain |
| Lung infection quantification [ | 549 CT images | Not available | Not available | Deep Learning | Similarity coefficients : 91.6% | Validation was conducted on same dataset |
| Large scale screening method [ | 2,685 CT images | Not available | Three medical source from China | Random Forest | Accuracy :87.9% | only baseline CT findings of covid-19 patients were included in study |
| To diagnose COIVD-19 patient [ | 50 chest X-ray images | Not available | Not available | Transfer Learning | F1-score : 0.89 | Small dataset |
| Pneumonia screening [ | 43,583 chest CT images | Not available | (i) X-VIRAL, and (ii) X-COVID | Deep Learning | AUC : 0.836 | High false negative rate |
| Classification of covid-19 cases [ | 196 samples of CXRs | Not available | Japanese Society of Radiological Technology | Deep Learning | Accuracy : 95.12% | Limted training data. |
| Diagnosis of covid-19 [ | 349 covid-19 CT images | Not available | COVID-CT-Dataset | Multi-task learning | F1-score : 0.90 | Limited number of CT images |
| Classification model [ | 5,856 CT images | Not available | Not available | Transfer Learning | Accuracy :96% | |
| covid-19 diagnosis method using X-ray [ | 170 X-ray images and 361 CT images | Not available | Multiple Sources | Deep Learning | Accuracy : 98% | Small dataset. |
| Detection of covid-19 cases [ | 5,863 X-ray images | Not available | Not available | Transfer Learning | Accuracy : 99% | Used only 624 images |
| Identification of abnormal CT patterns [ | 9,749 chest CT images | Not available | Not available | Deep Learning | Pearson correlation coefficient : 0.92 | Model was trained only with specific abnormalities data |
| Identification of covid-19 using chest X-ray [ | 455 chest X-ray images | Not available | Multiple Sources | Transfer Learning | Accuracy : 91.24% | Small number of cases are considered |
| covid-19 patient detection using CT images [ | 109 confirmed cases from China | Not available | Not available | Transfer Learning | AUC : 0.948 | Small dataset |
| Identification of covid-19 cases from X-ray images [ | 94,323 frontal view chest X-ray images | Not available | NIH CXR dataset | Deep Learning | Accuracy : 95.7% | Pre-training is required for the model to achieve high accuracy |
| Estimation of global covid-19 spread [ | Not available | Not available | Chinese National Health Commission | Neural network | Analysis based model | |
| To evaluate pneumonia cases during the covid-19 [ | 5,863 chest X-ray images | Not available | Not available | Transfer Learning | False negatives : 0.7% | low number of covid-19 chest X-ray images in dataset |
| covid-19 diagnosis based on chest X-ray [ | 15,959 CXR images | Not available | Multiple Sources | Transfer Learning | F1-score : 0.945 | Limited amount of CXR images |
| covid-19 patterns detection through X-ray images [ | 13,800 X-ray images | Not available | Multiple Sources | Deep artificial neural networks | Accuracy : 93.9% | Less heterogeneous dataset was used |
| covid-19 infection detection [ | 60 3D CT lung scans | Not available | TCIA dataset | Convolutional neural network | Accuracy : 96.2% | Small dataset |
| To develop covid-19 diagnosis system [ | 144,167 CT images | Not available | COVID-CS | Transfer Learning | Sensitivity : 95.0% | – |
| covid-19 Detection using CXRs [ | Not available | Not available | Multiple Sources | Transfer Learning | Accuracy : 99.01% | Limited number of covid-19 pneumonia CXR data |
| To predict covid-19 from chest X-ray images [ | 5,071 chest X-ray images | Not available | (i) Chestxray-Dataset, and (ii) ChexPert dataset | Transfer Learning | Specificity : 97.8% | Dataset contained less than 100 covid-19 X-ray images |
| covid-19 detection [ | 100 axial CT images | Not available | covid-19 CT segmentation dataset | Deep Learning | Analysis based model | Small dataset |
| Diagnosis of covid-19 with chest X-ray images | Transfer Learning [ | 1,076 posteroanterior CXR (PCXR) images | Not available | i) covid-19 Image, and ii) RSNA dataset | Accuracy : 96% | – |
| Estimating covid-19 trend in Spain [ | Not available | Not available | Sourcec | Bayesian Poison-Gamma model | Predictive model | – |
| covid-19 pneumonia severity predicting model [ | 94 posteroanterior CXR images | Not available | Multiple Sources | DenseNet model | MAE: 0.78 | Small number of samples |
| Diagnosis of coronavirus using CT images [ | 88 chest CT scans images | Not available | Not available | Details Relation Extraction neural network | AUC : 0.99 | Small dataset |
| AI system for covid-19 diagnosis [ | 10,250 CT images | Not available | Multiple Sources | Deep Learning based AI system | AUC : 0.971 | Some limitations arises when unbalanced dataset is used |
| covid-19 diagnosis based on CT scans [ | 349 CT scan images | Jan 19th to Mar 25th | (i) medRxiv, and (ii) bioRxiv | Transfer Learning | F1-score : 0.85 | Not robust |
| CT imaging classification and segmentation for covid-19 [ | 1,044 CT images | Not available | (i) COVID-CT data, and (ii) Sourced | MTL architecture | AUC : 0.93 | Limited covid-19 images |
| covid-19 lung infection segmentation model using CT images [ | 100 axial CT images | Not available | CT Segmentation dataset | Transfer Learning | Sensitivity : 0.870 | Limited dataset |
| covid-19 diagnostic and prognostic analysis [ | 5,372 CT images | Not available | Not available | Deep Learning | AUC : 0.90 | Many factors are not considered |
| CT images feature analysis to screen covid-19 [ | 51 CT images | Not available | Kaggle database | Naive Bayes | Accuracy : 96.07% | Small dataset |
| covid-19 predicitons using X-ray Images [ | 3,905 X-ray images | Not available | Not available | Convolutional neural network | Accuracy : 99.18% | Used limited dataset |
aChinese government authorized website
bhttps://www.tableau.com/covid-19-coronavirus-data-resources
chttps://covid19.isciii.es/resources/serie_historica_acumulados.csv
dhttp://medicalsegmentation.com/covid19/
Research findings and limitations using Machine Learning frameworks
| Objective | Dataset | Time Frame | Source of Dataset | Model used | Performance | Limitation |
|---|---|---|---|---|---|---|
| Short-term forecasting of covid-19 in Brazil [ | Dataset of confirmed cases of covid-19 from 10 states in Brazil | April 18, 2020 to April 19, 2020 | Not Available | ML based models | SVR and stacking performed the best | For one-day-ahead the applications are limited |
| Predicting the growth and trend of covid-19 outbreak [ | Our World in Data by Hannah Ritchie | Gaussian model | Predictive model | May lead to worse of public health situation | ||
| Prediction of the spread of covid-19 in India [ | covid-19 data of India | 30th January 2020 to 4th April 2020 | Not Available | LSTM and curve fitting method | High accuracy | Only accurate within a range |
| Prediction of covid-19 disease progression in India [ | Not Available | Not Available | (i) Johns Hopkins University, (ii) Covid19India website, (iii) Kaggle-Covid-19 | Statistical machine learning model | R0 =2.75 | Prediction was not in sync with actual desease progression |
| Real time forecasting of covid-19 [ | Not Available | Not Available | Not Available | Clustering techniques | Predictive model | More miss classification rate |
| Predicting the survival for severe covid-19 patients [ | 375 patients data | Not Available | Tongi Hospital, Wuhan,China | XGBoost classifier | F1-score 0.92 | Sample size is relatively limited |
| Confirmation of covid-19 cases [ | Not Available | 20th January 2020 to 12th March 2020 | Kagglea | LSTM-GRU based model | Accuracy of 87% | Accuracy for released case was only 40.5% |
| To predict mortality risk in patients [ | 117,000 confirmed covid-19 patients data from 76 countries | Not Available | Not Available | ML based models | Accuracy: 93.75% | – |
| Analysis of covid-19 epidemic [ | Daily reported cases | January 22, 2020 to April 1,2020 | Johns Hopkins University | ML and DL based models | Polynomial regression yielded minimum RMSE | Models were validated against the training dataset |
| Forecasting future of covid-19 [ | Not Available | Not Available | Multiple sources | Regression model | Exponential smoothing (ES) performed best | Difficult to put an accurate hyperplane |
| To identify an intrinsic covid-19 virus [ | 5000 unique viral genomic sequences | Jan 27, 2020 | Not Available | SVM and KNN | Accuracy:100% | Availability of coronavirus genomic signatures are limited |
| Pandemic prediction for Hungary [ | Daily cases and number of deaths in Hungary | 4 March 2020 to 28 April 2020 | Not Available | Hybrid machine learning model | MLP-ICA and ANFIS gave promising results | Model is susceptible to major interruptions |
| covid-19 outbreak prediction [ | Not Available | Not Available | WorldOmeter website | Machine learning algorithms | MLP and ANFIS achieved high generalization | Study provides only the initial benchmarking |
| Prediction of coronavirus clinical severity with AI [ | 53 hospitalized patients with confirmed covid-19 | Not Available | Not Available | ML based classifiers | Accuracy: 80% | Limited size of the dataset with some incomplete data |
| Classification of covid-19 using CT images [ | Not Available | Not Available | Moroccan Ministry to Healthb | ML based classifiers | Good predictive model | Localized dataset was taken to validate the model |
| Classification of covid-19 using CT images [ | 150 CT abdominal images | Not Available | Societa Italina di Radiologia Medica e Interventistica | Mixed models | Accuracy : 99.68% | Performance was not validated with other CT image dataset |
| Forecasting of covid-19 outbreak in India [ | Not Available | 22nd January 2020 to 10th April 2020 | Kaggle c | Multilayer perceptron | CI: 95.0% | Less number of attributes |
| Diagnosis of covid-19 using AI [ | 1838 cough sounds and 3597 non-cough environmental sounds | Not Available | ESC-50 database | ML and DL based models | Accuracy: 95.60% | Limited quantity of the training and testing data |
ahttps://www.kaggle.com/kimjihoo/coronavirusdataset-old
bwww.covidmaroc.ma
chttps://www.kaggle.com/imdevskp/corona-virus-report/data
List of datasets used for covid-19 research
| Type | Source | Status |
|---|---|---|
| CT Scan | Working [June 2021] | |
| CT Scan | Working [June 2021] | |
| CT Scan | Working [June 2021] | |
| CT Scan | Working [June 2021] | |
| CT Scan | Working [June 2021] | |
| CT Scan | Working [June 2021] | |
| CT Scan | Working [June 2021] | |
| CXR | Working [June 2021] | |
| CXR | Working [June 2021] | |
| CXR | Working [June 2021] | |
| CXR | Working [June 2021] | |
| CXR | Working [June 2021] | |
| CXR | Working [June 2021] | |
| CXR | Working [June 2021] | |
| CXR | Working [June 2021] | |
| COVID Cases | Working [June 2021] | |
| COVID Cases | Working [June 2021] | |
| COVID Cases | Working [June 2021] | |
| COVID Cases | Working [June 2021] | |
| COVID Cases | Working [June 2021] | |
| COVID Cases | Working [June 2021] | |
| COVID Cases | Working [June 2021] |