| Literature DB >> 34777970 |
Onur Dogan1,2, Sanju Tiwari3, M A Jabbar4, Shankru Guggari5.
Abstract
A pandemic disease, COVID-19, has caused trouble worldwide by infecting millions of people. The studies that apply artificial intelligence (AI) and machine learning (ML) methods for various purposes against the COVID-19 outbreak have increased because of their significant advantages. Although AI/ML applications provide satisfactory solutions to COVID-19 disease, these solutions can have a wide diversity. This increase in the number of AI/ML studies and diversity in solutions can confuse deciding which AI/ML technique is suitable for which COVID-19 purposes. Because there is no comprehensive review study, this study systematically analyzes and summarizes related studies. A research methodology has been proposed to conduct the systematic literature review for framing the research questions, searching criteria and relevant data extraction. Finally, 264 studies were taken into account after following inclusion and exclusion criteria. This research can be regarded as a key element for epidemic and transmission prediction, diagnosis and detection, and drug/vaccine development. Six research questions are explored with 50 AI/ML approaches in COVID-19, 8 AI/ML methods for patient outcome prediction, 14 AI/ML techniques in disease predictions, along with five AI/ML methods for risk assessment of COVID-19. It also covers AI/ML method in drug development, vaccines for COVID-19, models in COVID-19, datasets and their usage and dataset applications with AI/ML.Entities:
Keywords: Artificial intelligence; COVID-19; Machine learning; Pandemic; Research analysis; Systematic review
Year: 2021 PMID: 34777970 PMCID: PMC8256231 DOI: 10.1007/s40747-021-00424-8
Source DB: PubMed Journal: Complex Intell Systems ISSN: 2199-4536
Fig. 1Systematic literature review flowchart
Fig. 2Result of the study selection process
Fig. 3AI/ML approaches in COVID-19
Fig. 4Objectives of AI/ML approaches in COVID-19
AI/ML methods for prediction of patient outcome
| Study | Objective | AI/ML approach |
|---|---|---|
| [ | Identify the monocyte ratio and blood pressure in human body | RF |
| [ | Predicting hospitalization | RF and Reg |
| [ | Severity assessment | RF and Reg |
| [ | Severity assessment | Reg |
| [ | Identify the high-risk and low-risk patients | Reg |
| [ | Identify the mortality risk, | XGBoost |
| [ | Patient risk stratification | CNN |
| [ | Confirmation of covı cases | LSTM |
XGBoost extreme gradient boosting
AI/ML techniques in disease predictions
| Study | Disease | AI/ML method | Country |
|---|---|---|---|
| [ | Dengue fever | CTree | Bangladesh |
| [ | Oyster norovirus | GP | USA |
| [ | Dengue fever | Reg, NB | India |
| [ | H1N1 Flu | NN | Japan |
| [ | Influenza | RF | Iran |
| [ | Dengue fever | NN | Japan |
| [ | Swine Fever | RF | China |
| [ | Asthma exacerbations | NB, SVM | USA |
| [ | Dementia prediction | SVM | Italy |
| [ | Diabetes classification | Reg, NN, NB, KNN, RF | Brazil |
| [ | Hepatic fibrosis | NB, RF, KNN, SVM, NN | N/A |
| [ | Course of depression | Reg | N/A |
CTree classification tree, GP genetic programming, KNN K-nearest neighbors, NB Naive Bayes, NN neural network
AI/ML methods for risk assessment of COVID-19
| Study | Objective | AI/ML technique |
|---|---|---|
| [ | Predict the duration of the disease | LSTM |
| [ | Transmission prediction | LSTM, RNN |
| [ | Community-level risk assessment | GAN |
| [ | Transmission prediction | TL |
| [ | Disease monitoring | CNN |
GAN generative adversarial network, RNN recurrent neural networks, TL transfer learning
Measurement types of study success
| Measurement | Percentage | Min (%) | Max (%) | Measurement | Percentage | Min | Max (%) |
|---|---|---|---|---|---|---|---|
| Accuracy | 31 | 50 | 100 | Precision | 6 | 79% | 99.29 |
| AUC | 12 | 85 | 99.6 | R squared | 3 | 98% | 99.7 |
| Explained variance | 2 | 99 | 99.7 | RMSE | 1 | 136.547 | |
| Sensitivity | 20 | 0.01% | 99.62 | ||||
| F1-score | 7 | 79 | 98.46 | Specificity | 18 | 70.7% | 99.99 |
AUC area under the curve, RMSE root mean square error
Data types used in the COVID studies
| Data type | Percentage | Min | Max |
|---|---|---|---|
| CT | 49 | 106 images | 16,756 images |
| X-ray | 35 | 50 images | 15,085 images |
| Case data | 16 | 14 days | 77 days |
AI/ML method in drug development
| Study | Drug type | AI method | AI/ML objective | Potential drugs |
|---|---|---|---|---|
| [ | SARS-CoV-2 inhibitors | ChemAI | Predict inhibitory effects of molecules | 30,000 top-ranked compounds |
| [ | Antiviral drugs | MT-DTI | Predict commercially available antiviral drugs | Atazanavir, Remdesivir, and Efavirenz |
| [ | Antiviral drugs | MT-DTI | Predict binding affinity between drugs and protein target | Remdesivir, Atazanavir, Efavirenz, Ritonavir, Dolutegravir, Kaletra |
| [ | Anti-COVID-19 drugs | CNN, LSTM, MLP | Generate SMILES strings and molecules | 110 drugs |
| [ | Targeted proteins of SARS-CoV-2 | DL | Predict binding between drugs and protein | 10 drugs |
| [ | SARS-CoV-2 drug | NN, NB | Construct drug likelihood prediction model | 3 drugs |
| [ | 2019-nCoV | DL | Generate new molecular structures for 3CLpro | 100 molecules |
The viral main proteinase of coronavirus
Fig. 5Drugs and vaccines for COVID-19
Models in COVID-19 with software platform
| Software | Study | Model | Data source |
|---|---|---|---|
| Python | [ | SIR, SDM, PA | Worldometers |
| [ | Regression model | MoHFW, covid19india.org | |
| [ | Pre-trained CNN | GitHub, Kaggle, Open-I repository | |
| [ | CT radiomics | GitHub | |
| [ | Regression model | covid19india.org, WHO | |
| R | [ | SIRD and SVM | Worldometers |
| [ | ARIMA, SIR | Johns Hopkins U. | |
| [ | Regression model | Worldometers | |
| [ | SIR | Johns Hopkins U. | |
| [ | Regression model | Worldometer, covid19India.org | |
| [ | Hybrid model approach | Worldometers, ourworldindata.org | |
| [ | Regression model | MoHFW, John Hopkins U | |
| [ | Regression model | WHO, Historical weather | |
| Not Given | [ | Regression model, MLP | Kaggle |
| [ | ARIMA, SVM | WHO | |
| [ | Fractional mathematical model | N/A | |
| [ | AP, TB | WHO, Worldometers | |
| [ | Exponential growth model | MoHFW, WHO, covid19india.org | |
| [ | SIR, Network model | COVID19USA | |
| [ | Regression model | John Hopkins U |
AP arithmetic projection, ARIMA autoregressive integrated moving average, MoHFW Ministry of Health and Family Welfare, Government of India, MLP Multilayer perceptron, PA propagation analysis, SDM social distancing matrix, TB tree-based model
Datasets and their details
| Textual data sets | Medical datasets | ||
|---|---|---|---|
| Data sets | Explanation | Data sets | Explanation |
| T1 [ | Datahub repository | M1 [ | COVID-19 CT scans of Chinese hospitals with an online repository |
| T2 [ | Github repository of the data | M2 [ | Dataset consists of 20 COVID-19 CT scans |
| T3 [ | Medical community | M3 [ | Segmentation benchmark |
| T4 [ | Real-time interactive dashboard | M4 [ | COVID-19 CT segmentation dataset |
| T5 [ | Open source datasets | M5 [ | Images from a repository |
| T6 [ | crowd-sourced list of open access COVID-19 projects | M6 [ | 3D CT scans of confirmed cases |
| T7 [ | Country specific case reports and articles | M7 [ | COVID-19 positive and suspected patients |
| T8 [ | Demographic database | M8 [ | Analyzing radiographical images |
| T9 [ | Real-time and historical mobility data from Wuhan | M9 [ | Repository for COVID-19 radiographic images |
| T10 [ | Real-time data | Speech and audio datasets | |
| T11 [ | Data sets of Twitter posts | Data sets | Explanation |
| T12 [ | Data sets of Twitter posts | S1 [ | Web application for data collection |
| T13 [ | Web search portal for dataset of scholarly articles | S2 [ | Open source voice dataset |
| T14 [ | Google mobility reports | S3 [ | Collection of the cough data |
| T15 [ | Data set available on mobility based on user requests to location services | S4 [ | Collection of the cough data |
| T16 [ | Web application identifying mobility patterns across the U.S | S5 [ | Collection of the cough data |
| T17 [ | Mobility data from Baidu location services | S6 [ | Data collection for cough data |
| T18 [ | Google location services | S7 [ | Repository for the cough data |
Dataset applications with AI/ML
| Study | Application | Methods | Database |
|---|---|---|---|
| [ | COVID-19 diagnosis | DenseNet, TL | Medical |
| [ | COVID-19 diagnosis | Deep CNN | Medical |
| [ | COVID-19 diagnosis | Deep learning | Medical |
| [ | COVID-19 diagnosis | CNN, TL | Medical |
| [ | COVID-19 diagnosis | CNN | Medical |
| [ | COVID-19 diagnosis | CNN | Medical |
| [ | Cases exported from China | Statistical | Medical |
| [ | Correcting under reported cases | Statistical | Textual |
| [ | International travel control analysis | Statistical | Textual |
| [ | COVID-19 transmission control | Regression analysis | Textual |
| [ | Community transmission | Expectation maximization | Textual |
| [ | Community transmission | Bayesian approach | Textual |
| [ | Social dynamics data | Statistical analysis | Textual |
| [ | Perception and policies | Proposed NLP | Textual |
| [ | COVID-19 symptom identification | Data mining | Textual |
| [ | COVID-19 diagnosis | Boosting Trees, SVM | Speech |
| [ | COVID-19 diagnosis | N/A | Speech |
| [ | COVID-19 speech analysis | SVM with linear kernel | Speech |
| [ | Government and Media Tweets | N/A | Textual |
| [ | Conversation dynamics | N/A | Textual |
Abbreviations used in this study
| Abbr. | Explanation | Abbr. | Explanation |
|---|---|---|---|
| AI | Artificial intelligence | NN | Neural network |
| AP | Arithmetic projection | NPI | Non-pharmaceutical interventions |
| ARIMA | Autoregressive integrated moving average | PA | Propagation analysis |
| AUC | Area under curve | Reg | Regression models |
| CNN | Convolutional neural network | RF | Random forest |
| COVID-19 | Coronavirus disease 2019 | RMSE | Root mean square error |
| CT | Computational tomography | RNN | Recurrent neural networks |
| CTree | Classification tree | RQ | Research questions |
| DL | Deep learning | RT-PCR | Reverse transcription polymerase chain reaction |
| GAN | Generative adversarial network | SDM | Social distancing matrix |
| GP | Genetic programming | SEIR | Susceptible, exposed, infectious, recovered |
| KNN | K-Nearest Neighbor | SIR | Susceptible, infectious, recovered models |
| LSTM | Long short-term memory | SIRD | Susceptible, infectious, recovered, deceased |
| ML | Machine learning | SVM | Support vector machine |
| MLP | Multilayer perceptron | TB | Tree-based |
| MRI | Magnetic resonance imaging | TL | Transfer learning |
| MT-DTI | Molecule transformer drug target interaction | WHO | World Health Organization |
| NB | Naive Bayes | XGBoost | Extreme gradient boosting |