| Literature DB >> 35559010 |
Haishuai Wang1, Shangru Jia2, Zhao Li3, Yucong Duan4, Guangyu Tao5, Ziping Zhao2.
Abstract
The unprecedented outbreak of the Corona Virus Disease 2019 (COVID-19) pandemic has seriously affected numerous countries in the world from various aspects such as education, economy, social security, public health, etc. Most governments have made great efforts to control the spread of COVID-19, e.g., locking down hard-hit cities and advocating masks for the population. However, some countries and regions have relatively poor medical conditions in terms of insufficient medical equipment, hospital capacity overload, personnel shortage, and other problems, resulting in the large-scale spread of the epidemic. With the unique advantages of Artificial Intelligence (AI), it plays an extremely important role in medical imaging, clinical data, drug development, epidemic prediction, and telemedicine. Therefore, AI is a powerful tool that can help humans solve complex problems, especially in the fight against COVID-19. This study aims to analyze past research results and interpret the role of Artificial Intelligence in the prevention and treatment of COVID-19 from five aspects. In this paper, we also discuss the future development directions in different fields and prove the validity of the models through experiments, which will help researchers develop more efficient models to control the spread of COVID-19.Entities:
Keywords: Artificial Intelligence; COVID-19; COVID-19 review; Pandemic Prediction; clinical diagnosis; medical imaging; pandemic; telemedicine
Year: 2022 PMID: 35559010 PMCID: PMC9086537 DOI: 10.3389/fgene.2022.845305
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1The number of new confirmed cases worldwide every day. The abscissa is the timeline and the ordinate is the number of COVID-19 confirmed cases. The number of COVID-19 cases is increasing in the first 300 days, and there is a wavy line in the second 300 days.
FIGURE 2Top 15 countries with cumulative confirmed cases and deaths. Confirmed cases are shown above, and deaths are shown below. The United States, India and Brazil are the top three, with the United States having the most cumulative confirmed cases and deaths.
Main methods of Medical Imaging for COVID-19.
| Classifier | Data set | Accuracy | Data availability | References |
|---|---|---|---|---|
| CNN | 2000 x-rays images (162 COVID-19 positive, 4280 common pneumonia positive, 400 TB positive) | 99.92% |
|
|
| CNN + PCA | 500 X-ray images (250 COVID-19 positive cases and 250 normal healthy cases.) | 97.6–100% |
|
|
| CNN + ACGAN | 1124 X-ray images (403 images of COVID-19 and 721 normal images) | 95% |
|
|
| Based on deep convolutional neural network CovXNet | 1583 normal X-ray images, (1493 COVID-19 pneumonia X-ray images and 2780 bacterial pneumonia X-ray images) | 97.4% (Second category) |
|
|
| 90.2% (Multiple categories) | ||||
| Deep CNN transfer learning method | 423 COVID-19, 1485 viral pneumonia and 1579 normal chest X-ray images | 99.7% (Second category) |
|
|
| 97.9% (Three categories) | ||||
| Deep CNN model CoroNet | X-ray images of 1203 normal cases, 1591 viral pneumonia cases | 95% (Three categories) |
|
|
| 93% (Four categories) | ||||
| COVID-Net | COVID X Open access to the benchmark data set (13,975 CXR images, 358 COVID-19 CXR images.) | 98.9% |
|
|
| nCOVnet | 142 COVID-19 X-ray images 5863 non-COVID-19 X-ray images | 97% |
|
|
| DenseNet121 | 2724 C T images (1029 COVID-19 images,) | 90.8% |
|
|
| DarkNet model based on Deep Learning | 500 normal and 500 COVID-19 images | 98.08% (Second category) |
|
|
| 87.02% (Multiple categories) | ||||
| Deep transfer learning (DTL) model with DenseNet201 | 1,262 COVID-19 positive images, 1,230 negative images | 99.82% |
|
|
| An automated COVID-19 screening (ACoS) | 696 normal, 696 pneumonia and 696 COVID-19 X-ray images | 98.062% |
|
|
| Based on deep Bayes-Extrusion Network-COVID Diagnosis-Net | X-ray images (1583 normal persons, 4290 cases of common pneumonia, and 76 cases of COVID-19 infection) | 100% (Second category) |
|
|
| 98.3% (Three categories) | ||||
| Deep learning model and transfer learning based on VGG-16 | 250 COVID-19 images, 2753 other lung diseases images, and 3520 health images | 98% |
|
|
| A weakly supervised deep learning framework | TCIA Open data set (150 3D volumetric chest CT exams of COVID-19, CAP and NP patients) | 92.3% |
|
|
| A technique based on a deep residual network | 1345 viral pneumonia cases, 10,200 normal cases and 3616 COVID-19 cases | 92.1% (Four categories) |
|
|
| Transfer learning 29 different types of AI-based models | 352 chest X-ray images (51 COVID-19, 21 non-COVID-19,160 pneumonia,54 TB, and 66 normal images) | 93.8% (Validation accuracy) |
|
|
| A multi-view feature learning method | 1092 X-ray images (364 COVID-19, 364 normal, and 364 pneumonia) | 99.82% (Three categories) |
|
|
FIGURE 3Classification results of Resnet model. If the classification is correct, a green label will appear, otherwise a red label will appear. The Resnet model can correctly classify normal, viral pneumonia, and COVID-19 after training.
FIGURE 4Resnet model framework structure. The characteristic of Resnet is that it is easy to optimize and can improve accuracy by increasing depth. The internal residual block uses jump connection to alleviate the problem of gradient disappearance.
Modeling method of EHR data.
| Model | Data set | Result | Important features | Availability | References |
|---|---|---|---|---|---|
| Three models, clinical feature model (C model), radiological semantic feature model (R model), and clinical and radiological semantic feature combination model (CR model) | CT images and clinical data from 70 COVID-19 and 66 non-COVID-19 pneumonia patients | The CR model has the highest accuracy and specificity with a maximum AUC of 0.98 | GGO with consolidation, tree-in-bud, offending vessel augmentation in lesions, temperature, heart ratio, etc. |
|
|
| Four models (Logistic Regression, Random Forest, Light-GBM, and a collection of these three models) | Clinical data and EHR data of 3345 retrospective and 474 prospective Inpatients | The Light-GBM model achieved the best performance on the validation set | Age, Sex, Race, Neutrophils Percent, Lymphocytes Percent, Eosinophils Percent, C-Reactive Protein, C-Reactive Protein, etc. |
|
|
| Recursive Feature Elimination method, Logistic Regression, Support Vector Machine, Random Forest and Extreme Gradient Enhancement (XGBoost) algorithm for prediction. | In 3841 patients at Mount Sinai Health System, 961 retrospective and 249 prospective patients | XGBoost algorithm can accurately classify patients as likely to live or die. | Age, minimum oxygen saturation, and type of patient encounter, etc. |
|
|
| χ2 test or Kruskal-Wallis test, Multivariate Regression analysis | 1,951 charts of confirmed cases in 26 hospitals in Italy. | mortality is predicted by age and the presence of comorbidities. | Age, diabetes, chronic obstructive pulmonary disease (COPD) and chronic kidney disease, etc. |
|
|
| Mann-whitney U, χ2 test, Univariate Cox Analysis | Clinical data from 69 patients | The risk of death in elderly patients may be independent of age, and the presence of severe dementia is a risk factor for this population. | Lactate dehydrogenase and blood oxygen saturation, etc. |
|
|
| Minimum absolute contraction selection operator (LASSO) and Logistic Regression | COVID-19 patients from 575 hospitals in 31 provincial-level regions in China | The predictive variables were extracted and the severity of the patients was calculated successfully | Age, Dyspnea, Cancer history, COPD, Comorbidity, X-ray abnormality, etc. |
|
|
| Multivariate COX Regression | Clinical data of 208 patients | The CALL scoring model was established, and the area under ROC curve was 0.91 | Age, Comorbidity, Lymphocyte, D-dimer, LDH, Lymphocyte, etc. |
|
|
| Machine learning variable selection algorithm for Minimum Absolute Contraction and Selection Operator (LASSO), Combined with Cox deep learning model | 1590 patients at 575 medical centers | Deep learning survival Cox model is better than traditional Cox model | Age, hemoptysis, dyspnea, unconsciousness, number of comorbidities, cancer history, neutrophil-to-lymphocyte ratio, etc. |
|
|
| Models Based on Whole Clinical Parameters | A publicly available dataset consisting of clinical parameters and protein profile data | The best classification model based on clinical parameters achieved a maximum accuracy of 89.47% | Serum creatinine, age, absolute lymphocyte count, and D-dimer and proteins. |
|
|
| Unsupervised hierarchical clustering and principal component analysis. | Patients. Rotterdam cohort samples | An immune-type based scheme to stratify COVID-19 patients at hospital admittance into high and low risk clinical categories | Serum pro-inflammatory, anti-inflammatory and anti-viral cytokine and anti-SARS-CoV-2 antibody measurements |
|
|
FIGURE 5Model comparison of EHR data. Using a box plot to reflect the classification characteristics of different models. The NN,RF,SVM, NB, LR and LDA are better than other models.
FIGURE 6SARS-COV-2 genome. The number line represents the number of amino acids, and different colors represent different Protein Fragments.
FIGURE 7Mathematical model. The picture shows four classic models of infectious diseases, Susceptible-Infected model (SI), Susceptible-Infected- Susceptible model (SIS), Susceptible-Infected-Recovered model (SIR), Susceptible-Exposed-Infected-Recovered-Dead model (SEIRD), with each letter representing a state. For example, SEIRD model represents susceptible, infected, exposed, recovered, and dead.
FIGURE 8Prediction results of our model. The vertical axis represents the number of global confirmed cases. The red lines are fitted trends, the purple lines are actual cases, and the orange lines are predicted trends over the next month.
FIGURE 9Transformer model structure diagram. Transformer models have Input, Output, Attention mechanisms and Encoder-Decoder architecture. In our T-SIRGAN prediction module, the encoder models the relationships among orders in the sequence, and the decoder learns the variable representation vector.