| Literature DB >> 35070385 |
Zhoulin Chang1, Zhiqing Zhan2, Zifan Zhao3, Zhixuan You3, Yang Liu4, Zhihong Yan5, Yong Fu5, Wenhua Liang6, Lei Zhao7.
Abstract
BACKGROUND: Coronavirus disease 2019 (COVID-19) has caused a large-scale global epidemic, impacting international politics and the economy. At present, there is no particularly effective medicine and treatment plan. Therefore, it is urgent and significant to find new technologies to diagnose early, isolate early, and treat early. Multimodal data drove artificial intelligence (AI) can potentially be the option. During the COVID-19 Pandemic, AI provided cutting-edge applications in disease, medicine, treatment, and target recognition. This paper reviewed the literature on the intersection of AI and medicine to analyze and compare different AI model applications in the COVID-19 Pandemic, evaluate their effectiveness, show their advantages and differences, and introduce the main models and their characteristics.Entities:
Keywords: Artificial intelligence (AI); coronavirus disease 2019 (COVID-19); deep learning; machine learning; severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)
Year: 2021 PMID: 35070385 PMCID: PMC8743418 DOI: 10.21037/jtd-21-747
Source DB: PubMed Journal: J Thorac Dis ISSN: 2072-1439 Impact factor: 3.005
Applications of AI in COVID-19 epidemiology
| First author [year] (reference) | Country (region) | Modality | Model | Data source | Application area | Result |
|---|---|---|---|---|---|---|
| Aslan | China/Turkey | Demographic data | SEIQR model | Training dataset: the cumulative number of infected death cases in Hubei, China from January 20, 2020 to March 23, 2020; testing dataset: the cumulative number of infected deaths in Turkey from March 10, 2020 to April 10, 2020 | COVID-19 spread prediction and the severity evaluation | Develop accurate local prediction models |
| Yang | China | Demographic data | SEIR model/RNN using LSTM | Training dataset: SARS epidemic data between April and June 2003 across the whole of China retrieved from an archived news-site (SOHU); testing dataset: migration and epidemiological data before and after January 23 2020 | COVID-19 spread prediction and the severity evaluation | Found control measures to reduce the eventual COVID-19 epidemic size |
| Chimmula | Canada | Demographic data | RNN using LSTM | Training dataset: 80% number of confirmed cases, fatalities and recovered patients until March 31, 2020 provided by Johns Hopkins University and Canadian Health authority; testing dataset: 20% number of confirmed cases, fatalities and recovered patients until March 31, 2020 provided by Johns Hopkins University and Canadian Health authority | COVID-19 spread prediction and the severity evaluation | Short term predictions: RMSE error 34.83/accuracy 93.4%; long term predictions: RMSE error 45.70/accuracy 92.67% |
| Kolozsvari | Globe | Demographic data | RNN using LSTM | The publicly available datasets from the WHO and Johns Hopkins University for the following countries: Austria, Belgium, China (Hubei), Czech Republic, France, Germany, Hungary, Iran, Italy, Netherlands, Norway, Portugal, Slovenia, Spain, Switzerland, UK and the USA until April 10, 2020 | COVID-19 spread prediction and the severity evaluation | Mean of RMSLE of prediction 1: Hungary 0.06/UK 0.234/Italy 0.114/Spain 0.266/Germany 0.147/France 0.513/USA 0.216; mean of RMSLE of prediction 2: Hungary 0.107/UK 0.455/Italy 0.155/Spain 0.181/Germany 0.108/France 0.307/USA 0.528 |
| Hu | Mainland China/three other regions (Hong Kong, Macau and Taiwan) in China | Demographic data | CNN/MAE | The total numbers of the accumulated and new confirmed cases in all of China from January 11 to February 27, 2020 | COVID-19 spread prediction and the severity evaluation | Error rate: 0.73% |
| Fong | China | Demographic data | CNN/PNN + cf | COVID-19 case in China between 21 Jan to 3 Feb 2020 | COVID-19 spread prediction and the severity evaluation | Offering the highest possible level of prediction accuracy under the constraints of low data availability and knowledge |
| Mao | China | Graph data/demographic data | Graph database model | (I) Resident trajectory information database from Hubei Province(Health Committee); (II) confirmed Person Information Database (Public Security Department; Health Committee); (III) information database of high-risk groups and close contacts (Epidemic Prevention and Control Headquarters); (IV) hospital fever information database (Epidemic Prevention and Control Headquarters); (V) mobile phone signaling database (communications operator branch) from January 21 to February 9, 2020 | Trajectory tracking and infectious rate control | Identify closest contacts and a number of public places with high risk of infection |
| Pramanik | Globe | Climatic data/demographic data | BRT model | The average monthly temperature and the average relative humidity data for January to April 2020 in 230 cities; the maximum number of recorded cases for January to April 2020 in 230 cities | Uncovering climatic factors of COVID-19 spread | AUC: 0.8675 |
| Pasayat | India | Demographic data | EG model/LR model | The publicly available dataset of India from Humanitarian | Uncovering social factors of COVID-19 spread | Accuracy (the exponential growth model): 90.78%; accuracy (the linear regression model): 99.88% |
AI, artificial intelligence; COVID-19, coronavirus disease 2019; SEIQR model, susceptible-exposed-susceptible in quarantine (isolated class)-infected (asymptomatic or having mild symptoms)-reported (infected) cases (hospitalized if get severe symptoms or quarantined if get mild symptoms)-recovered; SEIR, susceptible-exposed-infectious-removed; RNN, recurrent neural network; LSTM, long short-term memory model; CNN, convolutional neural networks; MAE, modified auto-encoders; PNN + cf, polynomial neural network with corrective feedback; BRT model, boosted regression tree model; EG model, exponential growth model; LR model, linear regression model; SARS, severe acute respiratory syndrome; RMSE error, root mean squared errors; RMSLE, root mean squared logarithmic errors; AUC, area under the curve.
Applications of AI in COVID-19 diagnosis
| First author [year] (reference) | Country (region) | Modality | Model | Data source | Sample size | Application area | Result |
|---|---|---|---|---|---|---|---|
| Gomes | Brazil | DNA sequences | RF/NBC/IBL/MLP/SVM | NIAID Virus Pathogen Database/Analysis Resource (ViPR)/Genome Reference Consortium | Twenty-five different viruses from NIAID Virus Pathogen Database/Analysis Resource (ViPR): 347,363; viruses from Genome Reference Consortium: 103,959 | Laboratory-based diagnosis: RT-PCR method | RF (results from dataset with 30% overlap) sensitivity: 0.822222±0.05613; specificity: 0.99974±0.00001; AUC: 0.99884± 0.0025; MLP (results from dataset with 30% overlap) sensitivity: 0.97386±0.03052; specificity: 0.96151±0.00246; AUC: 0.97353±0.01863; MLP (results from dataset with 50% overlap) sensitivity: 0.98824±0.01198; specificity: 0.99860±0.00020; AUC: 0.99947±0.00056 |
| Cady | USA | Blood sample | SVM | Dataset: COVID-19 negative samples plus COVID-19 positive samples | Negative samples: obtained from the Lyme Disease Biobank prior to the COVID-19 pandemic; positive samples: obtained from donors within New York state or from the Wadsworth Center, New York State Department of Health | Laboratory-based diagnosis: antibody response | Selectivity of human blood serum: 100%, sensitivity of dried blood spot samples: 86.7% |
| Kukar | Switzerland | Blood sample | CRISP-DM based machine learning/DNN/RF/XGBoost | Dataset: COVID-19 negative cases plus COVID-19 positive cases | Healthy cases: 5,108 from March, 2012 to April, 2019; COVID-19 cases: 160 from March/April 2020; other cases (different bacterial and viral infections): 225 from March, 2012 to April, 2019 | Laboratory-based diagnosis: routine blood tests | Sensitivity: 81.9%, specificity: 97.9%, AUC: 0.97 |
| Wang | China | CT image | 3D U-Net++/ | Training dataset: 1,136 (723 were positive); testing dataset: 282 cases (154 were positive) | Healthy cases: 70; COVID-19 cases: 723; other cases (inflammation or tumors): 343 | Medical images diagnosis: CT image | Sensitivity: 0.974, specificity: 0.922, AUC: 0.991 |
| Xu | China | CT image | 3D CNN segmentation model/residual network (ResNet-18) | Training dataset: 528 CT samples (COVID-19: 189; IAVP: 194; healthy: 145); testing dataset: 90 CT samples (COVID-19: 30; IAVP: 30; healthy: 30) | Healthy cases/CT samples: 175/175; COVID-19 cases/CT samples: 110/219; other cases IAVP/CT samples: 224/224 | Medical images diagnosis: CT image | Accuracy: 86.7% |
| Narin | Globe | Chest X-ray radiographs | CNN based models (ResNet50, ResNet101, ResNet152, InceptionV3 and Inception-ResNetV2) | Dataset-1: 341 (COVID-19)/2,800 (healthy); dataset-2: 341 (COVID-19)/1,493 (viral); dataset-3: 341 (COVID-19)/2,772 (bacterial) | Healthy X-ray samples: 2,800; COVID-19 X-ray samples: 341; other viral pneumonia X-ray samples: 1,493; other bacterial pneumonia X-ray samples: 2,772 | Medical images diagnosis: chest X-ray radiographs | Accuracy for dataset-1: 96.1%, accuracy for dataset-2: 99.5%, accuracy for dataset-3: 99.7% |
| Zhang | Globe | Chest X-ray radiographs | CAAD model | Dataset-1 cases/X-ray samples: 33/50 (COVID-19); 492/714 (other pneumonia); dataset-2 cases/X-ray samples: 37/50 (COVID-19); 516/717 (other pneumonia) | COVID-19 cases/X-ray samples: 70/100; other pneumonia cases/X-ray samples: 1,008/1,431 | Medical images diagnosis: chest X-ray radiographs | Sensitivity: 96.0%, specificity: 70.7% AUC: 0.952 |
| Arpaci | China | Clinical features | BayesNet/Logistic/IBk/CR/PART/J48 | 114 subjects from the Taizhou hospital of Zhejiang Province in China from January 17, 2020 to February 1, 2020 | COVID-19 cases: 32; non COVID-19 cases: 82 | Respiratory pattern and symptoms diagnosis | Accuracy of BayesNet: 71.93%; accuracy of Logistic: 80.7%; accuracy of IBk: 72.81%; accuracy of CR: 84.21%; accuracy of PART: 76.32%; accuracy of J48: 73.68% |
AI, artificial intelligence; COVID-19, coronavirus disease 2019; RF, random forests; NBC, naive Bayes classifier; IBL, instance-based learner; MLP, multilayer perceptron; SVM, support vector machine; DNN, deep neural networks; XGBoost, extreme gradient boosting machine; CAAD, confidence-aware anomaly detection model; BayesNet, Bayes classifier; Logistic, logistic-regression; IBk, lazy-classifier; CR, classification via regression; PART, rule-learner; J48, decision-tree; IAVP, influenza-A viral pneumonia; RT-PCR, reverse‐transcriptase polymerase chain reaction; AUC, area under the curve.
Applications of AI in COVID-19 progression
| First author [year] (reference) | Country (region) | Modality | Model | Data source | Sample size | Result |
|---|---|---|---|---|---|---|
| Li | China | CT image | U-Net | COVID-19 patients in Shanghai Jiao Tong University Affiliated Sixth People’s Hospital from February 10, 2020 to April 9, 2020 | COVID-19 cases classified as non-severe group on admission: 123 | (CT-SS) AUC 0.66; accuracy 62.6%; sensitivity 58.97%; specificity: 64.29% (GGO volume cm3) AUC 0.639; accuracy 43.9%; sensitivity 79.49%; specificity: 45.24% (GGO volume percentage): AUC 0.694; accuracy 62.6%; sensitivity 64.1%; specificity: 69.05%; (consolidation volume cm3): AUC 0.796; accuracy 78.05%; sensitivity 71.79%; specificity: 80.95%; (consolidation volume percentage): AUC 0.79; accuracy 78.86%; sensitivity 79.49%; specificity: 78.57% |
| Yang | China | CT image | CT-SS | COVID-19 patients in Chongqing Three Gorges Central Hospital from January 21, 2020 to February 5, 2020 | COVID-19 cases: 102 | AUC: 0.892; sensitivity: 83.3%; specificity: 94% |
AI, artificial intelligence; COVID-19, coronavirus disease 2019; CT-SS, CT severity score; AUC, area under the curve; GGO, ground-glass opacity.
Applications of AI in COVID-19 treatment
| First author [year] (reference) | Country (region) | Modality | Model | Data source | Application area | Result |
|---|---|---|---|---|---|---|
| Pfab | USA | Multichain protein complex structure | CNN/U-Net | EM Data Resource/Protein Data Bank | Drug designing | The average percentage of matched model residues: 84%; the sequence matching percentage: 63.08% |
| Magar | USA | Amino acid sequences | XGBoost/RF/MLP/SVM/LR | Virus Net dataset | Drug designing | Accuracy: XGBoost (90.57%)/RF (89.18%)/LR (81.17%)/MLP (78.23%)/SVM (75.49%) |
| Beck | Korea | Amino acid sequences | MT-DTI/NLP | NCBI database/DTC database/ Binding DB database | Drug repurposing | Predict drug-target interactions accurately and find valuable drugs for COVID-19 |
| Zeng | China | PubMed publications | KG-DML | Global Network of Biomedical Relationships/Drug Bank/transcriptome datasets | Drug repurposing | AUC: 0.85 |
| Erlina | Indonesia | Multichain protein complex structure | SVM/RF/MLP | Super Target web resources/Herbal database | Herbal drug | Accuracy: SVM (99.91%)/RF (98.67%)/MLP (98.32%); AUC: SVM (0.99)/RF (0.98)/MLP (0.98); precision: SVM (99.84%)/RF (97.25%)/MLP (96.63%) |
| Ong | USA | PubMed publications/multichain protein complex structure/DNA sequences | RF/SVM | PubMed database/ClinicalTrials.gov database | Vaccines development | Six proteins, including the S protein and five non-structural proteins (Nsp3, 3CL-pro, and Nsp8-10), were predicted to be adhesins, which are crucial to the viral adhering and host invasion |
AI, artificial intelligence; COVID-19, coronavirus disease 2019; CNN, convolutional neural networks; XGBoost, extreme gradient boosting machine; RF, random forests; MLP, multilayer perceptron; SVM, support vector machine; LR, logistic regression; MT-DTI, molecule transformer-drug target interaction; NLP, natural language processing; KG-DML, knowledge-graph-based deep-learning model; NCBI, National Center for Biotechnology Information; DTC, Drug Target Common.
Applications of AI in COVID-19 psychological effects and data security
| First author [year] (reference) | Country (region) | Modality | Model | Data source | Application area | Result |
|---|---|---|---|---|---|---|
| Choi | USA | Sociodemographic questionnaire | ANN | Korean immigrants above the age of 18 residing in the U.S. were invited to respond to a survey by e-mails and posting on Korean immigrants’ online communities from 24 May 2020 to 14 June 2020 | Psychological effects | AUC: 0.806 |
| Wang | China | Sociodemographic questionnaire | XGBoost | 3,800 non-graduating college students from a top multidisciplinary and research-oriented university directly under the jurisdiction of the Ministry of Education in North China were invited to attend the studies during February 15 to March 17, 2020 | Psychological effects | Accuracy of Model 1: 79.26%; |
| Jha | USA | Sociodemographic questionnaire | PGM | 17,764 adults in the USA at different age groups, genders, and socioeconomic statuses | Psychological effects | Accuracy of high risk of depression group: 0.80; accuracy of low risk of depression group: 0.64 |
| Kang | Korea | Pathological image data | PAIP | 3,100 images acquired by the Department of Pathology at Seoul National University Hospital, Seoul National University Bundang Hospital, and SMG-SNU Boramae Medical Center | Data security | Accuracy of liver cancer: 83%; |
AI, artificial intelligence; COVID-19, coronavirus disease 2019; ANN, artificial neural network; XGBoost, extreme gradient boosting machine; PGM, Bayesian probabilistic graphical model; PAIP, pathology artificial intelligence platform.