
Artificial intelligence in clinical care amidst COVID-19 pandemic: A systematic review.

Eleni S. Adamidi, Konstantinos Mitsis, Konstantina S. Nikita

Abstract

The worldwide health crisis caused by the SARS-CoV-2 virus has resulted in more than 3 million deaths so far. Improving early screening, diagnosis and prognosis of the disease are critical steps in assisting healthcare professionals to save lives during this pandemic. Since the WHO declared the COVID-19 outbreak a pandemic, several studies have been conducted using Artificial Intelligence techniques to optimize these steps in clinical settings in terms of quality, accuracy and, most importantly, time. The objective of this study is to conduct a systematic literature review on published and preprint reports of Artificial Intelligence models developed and validated for screening, diagnosis and prognosis of the coronavirus disease 2019. We included 101 studies, published from January 1st, 2020 to December 30th, 2020, that developed AI prediction models which can be applied in the clinical setting. We identified in total 14 models for screening, 38 diagnostic models for detecting COVID-19 and 50 prognostic models for predicting ICU need, ventilator need, mortality risk, severity assessment or hospital length of stay. Moreover, 43 studies were based on medical imaging and 58 studies on the use of clinical parameters, laboratory results or demographic features. Several heterogeneous predictors derived from multimodal data were identified, and the prominence of these multimodal data, captured from various sources, was analyzed for each category of the included studies. Finally, a Risk of Bias (RoB) analysis was conducted to examine the applicability of the included studies in the clinical setting and to assist healthcare providers, guideline developers, and policymakers.
© 2021 The Author(s).


Keywords:  ABG, Arterial Blood Gas; ADA, Adenosine Deaminase; AI, Artificial Intelligence; ANN, Artificial Neural Networks; APTT, Activated Partial Thromboplastin Time; ARMED, Attribute Reduction with Multi-objective Decomposition Ensemble optimizer; AUC, Area Under the Curve; Acc, Accuracy; Adaboost, Adaptive Boosting; Apol AI, Apolipoprotein AI; Apol B, Apolipoprotein B; Artificial intelligence; BNB, Bernoulli Naïve Bayes; BUN, Blood Urea Nitrogen; CI, Confidence Interval; CK-MB, Creatine Kinase isoenzyme; CNN, Convolutional Neural Networks; COVID-19; CPP, COVID-19 Positive Patients; CRP, C-Reactive Protein; CRT, Classification and Regression Decision Tree; CoxPH, Cox Proportional Hazards; DCNN, Deep Convolutional Neural Networks; DL, Deep Learning; DLC, Density Lipoprotein Cholesterol; DNN, Deep Neural Networks; DT, Decision Tree; Diagnosis; ED, Emergency Department; ESR, Erythrocyte Sedimentation Rate; ET, Extra Trees; FCV, Fold Cross Validation; FL, Federated Learning; FiO2, Fraction of Inspiration O2; GBDT, Gradient Boost Decision Tree; GBM light, Gradient Boosting Machine light; GDCNN, Genetic Deep Learning Convolutional Neural Network; GFR, Glomerular Filtration Rate; GFS, Gradient boosted feature selection; GGT, Glutamyl Transpeptidase; GNB, Gaussian Naïve Bayes; HDLC, High Density Lipoprotein Cholesterol; INR, International Normalized Ratio; Inception Resnet, Inception Residual Neural Network; L1LR, L1 Regularized Logistic Regression; LASSO, Least Absolute Shrinkage and Selection Operator; LDA, Linear Discriminant Analysis; LDH, Lactate Dehydrogenase; LDLC, Low Density Lipoprotein Cholesterol; LR, Logistic Regression; LSTM, Long-Short Term Memory; MCHC, Mean Corpuscular Hemoglobin Concentration; MCV, Mean corpuscular volume; ML, Machine Learning; MLP, MultiLayer Perceptron; MPV, Mean Platelet Volume; MRMR, Maximum Relevance Minimum Redundancy; Multimodal data; NB, Naïve Bayes; NLP, Natural Language Processing; NPV, Negative Predictive Values; Nadam optimizer, Nesterov Accelerated Adaptive Moment optimizer; OB, Occult Blood test; PCT, Thrombocytocrit; PPV, Positive Predictive Values; PWD, Platelet Distribution Width; PaO2, Arterial Oxygen Tension; Paco2, Arterial Carbondioxide Tension; Prognosis; RBC, Red Blood Cell; RBF, Radial Basis Function; RBP, Retinol Binding Protein; RDW, Red blood cell Distribution Width; RF, Random Forest; RFE, Recursive Feature Elimination; RSV, Respiratory Syncytial Virus; SEN, Sensitivity; SG, Specific Gravity; SMOTE, Synthetic Minority Oversampling Technique; SPE, Specificity; SRLSR, Sparse Rescaled Linear Square Regression; SVM, Support Vector Machine; SaO2, Arterial Oxygen saturation; Screening; TBA, Total Bile Acid; TTS, Training Test Split; WBC, White Blood Cell count; XGB, eXtreme Gradient Boost; k-NN, K-Nearest Neighbor

Year:  2021        PMID: 34025952      PMCID: PMC8123783          DOI: 10.1016/j.csbj.2021.05.010

Source DB:  PubMed          Journal:  Comput Struct Biotechnol J        ISSN: 2001-0370            Impact factor:   7.271


Introduction

The World Health Organization (WHO) declared the COVID-19 outbreak, which emerged in December 2019 in Wuhan, China [1], a pandemic on March 11th, 2020; at the time of writing, it had resulted in more than 3 million deaths and 150 million cases worldwide. The most critical steps in assisting healthcare professionals to save lives during this pandemic are early screening, diagnosis and prognosis of the disease. Several studies have been conducted using Artificial Intelligence (AI) techniques to optimize these steps in clinical settings in terms of quality, accuracy and time. AI techniques employing Deep Learning (DL) methods have demonstrated great success in the medical imaging domain due to DL’s advanced capability for feature extraction [2]. Apart from the medical imaging domain, AI techniques are widely used to screen, diagnose and predict the prognosis of COVID-19 based on clinical, laboratory and demographic data. The early clinical course of SARS-CoV-2 infection can be difficult to distinguish from other undifferentiated medical presentations to hospital, and SARS-CoV-2 PCR testing can take up to 48 h for operational reasons. Limitations of the gold-standard PCR test for COVID-19 have challenged healthcare systems across the world due to shortages of specialist equipment and operators, relatively low test sensitivity and prolonged turnaround times [3]. Hence, rapid identification of COVID-19 is important for delivering care, aiding proper triage among patients admitted to hospitals, accelerating proper treatment, and minimizing both the risk of infection during presentation and the waiting time for hospital admission. Several studies have been conducted to address the need for early screening by using AI methods [4], [5]. 
Challenges in COVID-19 diagnosis are also present due to the difficulty of differentiating chest X-ray radiographs (CXRs) showing COVID-19 pneumonia from those showing common pneumonia, and to the insufficient empirical understanding of the radiological morphology of this new type of pneumonia in CT scans, among others. Moreover, CXR- or CT-based diagnosis may need laboratory confirmation. Therefore, there is an imperative demand for accurate methods to assist the clinical diagnosis of COVID-19. Multiple studies using AI techniques have been conducted in this direction, to extract valuable features from CXRs or CTs [6], [7], to use clinical data and laboratory exams [8], [9], or even to combine imaging quantitative features and clinical data to arrive at an accurate diagnosis [10], [11]. Finally, prognosis is an essential step towards assisting healthcare professionals to predict ICU need, mechanical ventilator need, hospitalization time, mortality risk or severity of the disease. The objective of our study was to conduct a systematic literature review on published and preprint reports of Artificial Intelligence techniques developed and validated for screening, diagnosis, and prognosis of the coronavirus disease 2019. Studies that developed AI prediction models for screening, diagnosis or prognosis that can be applied in the clinical setting were included (see Fig. 1). Screening studies describe prediction models developed for early identification of COVID-19 infection, whereas diagnostic studies propose prediction models developed to establish a diagnosis of the disease. In these studies, several predictors were recognized, including clinical parameters (e.g., comorbidities, symptoms), laboratory results (e.g., hematological, biochemical tests), demographic features (e.g., age, sex, province, country, travel history) and imaging features extracted from CT scans or CXRs. Identification of the most prominent predictors was also part of our analysis. 
Furthermore, novel technologies incorporated in AI techniques were investigated to determine the current state of research in developing AI prediction models. Additionally, the advantages of using imaging, clinical and laboratory data or the combination of those were analyzed. To achieve this objective, each study was analyzed in terms of COVID-19 positive patients included in the primary datasets, AI methods employed, predictors identified, validation methods applied, and performance metrics used. Finally, a Risk of Bias (RoB) analysis was conducted to examine the applicability of the included studies in the clinical setting and support decisions made by healthcare providers, guideline developers, and policymakers.
Fig. 1

AI-based clinical prediction models.

The paper is organized as follows. In Section 2, we describe the review approach and protocols, including the AI algorithms, the performance metrics, and the inclusion–exclusion criteria of the reviewed studies. In Section 3, results are presented on the primary datasets, AI algorithms, validation methods, and the prediction models developed for screening, diagnostic and prognostic purposes; this section also provides results on the most prominent predictors for each category of the included prediction models, as well as the results of the RoB assessment. In Section 4, we discuss the results and the limitations related to the applicability of the developed prediction models, and we identify possible future directions aiming at enhancing the adoption of AI-based prediction models in clinical practice.

Methods

Review approach and protocols

In this systematic literature review, we followed the guidelines of the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) protocol to ensure transparent and complete reporting (see Fig. 2) [12]. This study focused on peer-reviewed publications, as well as preprints published in English, that applied AI techniques to develop prediction models for diagnosis or prognosis of COVID-19. A systematic literature search was conducted to collect research articles available from January 2020 through December 2020, using the online databases PubMed, Nature, Science Direct, IEEE Xplore, arXiv and medRxiv. By combining appropriate keywords with Boolean operators, the following search expression was formed:
Fig. 2

PRISMA (preferred reporting items for systematic reviews and meta-analyses) flowchart.

[(“artificial intelligence” OR “AI”) OR (“machine learning” OR “ML”) OR (“deep learning” OR “DL”)] AND (“hospital” OR “clinical” OR “healthcare system”) AND (“triage” OR “early screening” OR “diagnosis” OR “mortality prediction” OR “severity assessment”) AND (“covid-19” OR “sars-cov-2” OR “Coronavirus” OR “pandemic”) AND (“prediction models”)

Title and abstract screening, full-text review, data extraction and Risk of Bias analysis were conducted by two independent reviewers using Covidence [13], a software platform for systematic review management. Data were extracted with the Covidence software using a customized extraction form. The extraction form included the following fields for each included study: Covidence Study ID, Lead Author, Title, Database source, Country (the country of dataset origin), Hospital name, No. of hospitals, Start date, End date, Outcome, No. of days for mortality prediction, Type of AI model, AI methods used, Type of input data, Source of input data, Sample size of input data, Predictors, Study design, Number of participants for model development (with outcomes), Total number of COVID-19 positive patients, Population description, Validation method, Number of participants for model validation (with outcomes), Performance (Area Under the Curve (AUC%), Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), Positive/Negative Predictive Values (PPV/NPV) (%), (95% CI)), Code availability, Limitations, Ethical considerations, Risk of Bias for participants/predictors/outcome/analysis, and overall risk of bias. The performance of each AI model was reported in terms of metrics defined using the numbers of True Positives (TP), True Negatives (TN), False Positives (FP) and False Negatives (FN) [14], as follows: AUC is the area under the Receiver Operating Characteristic (ROC) curve, which plots the true positive rate against the false positive rate. 
This metric is a standard method for evaluating medical tests and risk models [15]. Accuracy is the percentage of cases correctly identified, calculated by:

Acc = (TP + TN) / (TP + TN + FP + FN)

Sensitivity is the rate of true positives; it measures the proportion of actual positives that the model correctly predicts as positive [16], expressed by:

SEN = TP / (TP + FN)

Specificity is the rate of true negatives; it measures the proportion of actual negatives that the model correctly predicts as negative [16], calculated by:

SPE = TN / (TN + FP)

The positive predictive value (PPV) can be expressed as the ratio of the true positives to the sum of the true positives and false positives, and the NPV is defined as the ratio of the true negatives to the predicted negatives [17]:

PPV = TP / (TP + FP), NPV = TN / (TN + FN)

In this systematic review, we used PROBAST (Prediction model Risk Of Bias ASsessment Tool) [18] to assess the risk of bias and applicability of the included studies with a focused and transparent approach. The PROBAST protocol is organized into the following four domains: participants, predictors, outcome, and analysis. These domains contain a total of 20 signaling questions to facilitate a structured judgment of RoB, which was defined to occur when shortcomings in study design, conduct, or analysis led to systematically distorted estimates of model predictive performance.

Inclusion–Exclusion criteria

Studies that reported the use of AI techniques, including but not limited to techniques from the AI subfield of Machine Learning (ML) and techniques from the ML subfield of DL, for developing prediction models for triage, diagnosis and prognosis (such as disease progression, mortality prediction, severity assessment) were included in this systematic review. Cohorts, retrospective cohorts, randomized controlled trials, diagnostic test accuracy studies, and single-centered or multicentered retrospective studies were selected for further analysis. Restrictions were applied concerning the setting of the studies. 
If the outcome of a study could not be applied in a clinical setting, that study was excluded. Concerning the type of participants, studies that did not include COVID-19 patient data were excluded, as were studies focusing on mental health. Finally, studies that did not exclusively use AI, ML, or DL to develop these types of prediction models were also excluded.
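The performance metrics defined above can be computed directly from a model's predictions and scores. The following dependency-free sketch is purely illustrative (it is not code from any reviewed study); AUC is computed via the equivalent rank-sum (Mann-Whitney) formulation:

```python
def confusion_counts(y_true, y_pred):
    """Count TP, TN, FP, FN for binary labels (1 = COVID-19 positive)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn

def metrics(y_true, y_pred):
    """Acc, SEN, SPE, PPV, NPV as defined in the Methods section."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    return {
        "Acc": (tp + tn) / (tp + tn + fp + fn),
        "SEN": tp / (tp + fn),   # true positive rate (sensitivity)
        "SPE": tn / (tn + fp),   # true negative rate (specificity)
        "PPV": tp / (tp + fp),
        "NPV": tn / (tn + fn),
    }

def auc(y_true, y_score):
    """AUC as the probability that a random positive case is scored
    above a random negative case (ties count as half)."""
    pos = [s for t, s in zip(y_true, y_score) if t == 1]
    neg = [s for t, s in zip(y_true, y_score) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

For example, with labels [1, 1, 1, 0, 0, 0, 0, 1] and predictions [1, 0, 1, 0, 0, 1, 0, 1], all five confusion-matrix metrics equal 0.75.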

Results

In this review, 879 titles were screened, and 101 studies presenting 101 AI-based models for screening, diagnosis and prognosis of COVID-19 were included after full-text review. A significant increase in the number of studies published in the 3rd and 4th quarters of the year was observed (see Fig. 3). We identified in total 14 models for screening (5 based on medical imaging), 38 diagnostic models for detecting COVID-19 (31 based on medical imaging) and 50 prognostic models (7 based on medical imaging) for predicting ICU need, ventilator need, mortality risk, severity assessment or hospital length of stay (see Fig. 4). The results are presented in Tables 1–7, including the lead author of each included study, country (in case datasets from specific hospitals were used), outcome of the study, number of COVID-19 Positive Patients (CPP) included in the development of the model, AI methods, validation methods and performance of the developed prediction models. Missing values of CPP were mainly found in imaging studies that did not specify the number of CT scans or CXR images corresponding to each COVID-19 positive patient. There are in total 11 studies [9], [15], [19], [20], [21], [22], [23], [24], [25], [26], [27] with unclear reporting of the number of CPP included.
Fig. 3

Number of studies per quarter of 2020.

Fig. 4

Included AI-based prediction models.

Table 1

Results for screening models.

Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall
Yang et al. [4], USA, Early and rapid identification of high-risk SARS-CoV-2 infected patients | 1,898 | LR, DT, RF, GBDT | age, gender, race and 27 routine laboratory tests | 5-FCV | AUC 0.854 (95% CI: 0.829–0.878) | LUHHH
Li et al. [63], China, Screening based on ocular surface features | 104 | DL | Imaging features | 5-FCV | AUC 0.999 (95% CI: 0.997–1.000), SEN 98.2, SPE 97.8 | UUUHH
AS Soltan et al. [3], UK, Early detection, screening | 437 | multivariate LR, RF, XGBoost | Presentation laboratory tests and vital signs | TTS, 10-FCV | ED model: AUC 0.939, SEN 77.4, SPE 95.7; Admissions model: AUC 0.940, SEN 77.4, SPE 94.8; both models achieve high NPV (>99) | HHHHH
Nan et al. [57], China, Early screening | 293 | DL, LR, SVM, DT, RF | 4 epidemiological features, 6 clinical manifestations (muscle soreness, dyspnea, fatigue, lymphocyte count, WBC, imaging features) | TTS | AUC 0.971, Acc 90, SPE 0.95 (LR optimal screening model) | HUHHH
Soares et al. [58], Brazil, Screening of suspect COVID-19 patients | 81 | ML, SVM, SMOTE Boost, ensembling, k-NN | Hemogram: red blood cells, MCV, MCHC, MCH, RDW, leukocytes, basophils, monocytes, lymphocytes, platelets, mean platelet volume, creatinine, potassium, sodium, CRP, age | unspecified | AUC 86.78 (95% CI: 85.65–87.90), SEN 70.25 (95% CI: 66.57–73.12), SPE 85.98 (95% CI: 84.94–86.84), NPV 94.92 (95% CI: 94.37–95.37), PPV 44.96 (95% CI: 43.15–46.87) | LUHHH
Feng et al. [59], China, Early identification of suspected COVID-19 pneumonia on admission | 32 | ML, LR (LASSO), DT, Adaboost | lymphopenia, elevated CRP and elevated IL-6 on admission | 10-FCV | AUC 0.841, SPE 72.7 | HHHHH
Wu et al. [60], China, Early detection | 27 | RF | 11 key blood indices: TP, GLU, Ca, CK-MB, Mg, BA, TBIL, CREA, LDH, K, PDW | 10-FCV, ext. val. | Acc 95.95, SEN 95.12, SPE 96.97 | LLLHH
Banerjee et al. [61], Brazil, Initial screening | 81 | RF, ANN | platelets, leukocytes, eosinophils, basophils, lymphocytes, monocytes | 10-FCV | AUC 0.95 | HHHHH
Peng et al. [62], China, Quick and accurate diagnosis | 32 | SRLSR, non-dominated radial slots-based algorithm, ARMED, GFS, RFE | 18 diagnostic factors: WBC, eosinophil count, eosinophil ratio, 2019-nCoV RNA, Amyloid-A, neutrophil ratio, basophil ratio, platelet, thrombocytocrit, monocyte count, procalcitonin, neutrophil count, lymphocyte ratio, lymphocyte count, monocyte ratio, MCHC, urine SG | not performed | not performed | LLUHH

*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.

**L: Low, H: High, U: Unclear

Table 2

Results for screening imaging models.

Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall
Abdani et al. [30], Fast screening | 219 | DL, CNN | Imaging features | 5-FCV | Acc 94 | HUHHH
Ahammed et al. [5], Early detection | 285 | ML, DL, CNN, SVM, RF, k-NN, LR, GNB, BNB, DT, XGB, MLP, NC, perceptron | Imaging features | 10-FCV | AUC 95.52, Acc 94.03, SEN 94.03, SPE 97.01 | HHHHH
Barstugan et al. [31], Early detection | 53 | ML, SVM | Imaging features | 10-FCV | Acc 99.68, SEN 93, SPE 100 | UUUHH
Wu et al. [55], China, Fast and accurate identification | 368 | DL | Imaging features | TTS | AUC 0.905, Acc 83.3, SEN 82.3 | LUUHH
Wang et al. [56], China, Triage | 1647 | DL | Imaging features | Ext. val. | AUC 0.953 (95% CI: 0.949–0.959), SEN 92.3 (95% CI: 91.4–93.2), SPE 85.1 (84.2–86.0), PPV 79 (77.7–80.3), NPV 94.8 (94.1–95.4) | LUUHH

*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.

**L: Low, H: High, U: Unclear

Table 3

Results for diagnostic models.

Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall
Diagnostic
Cabitza et al. [64], Italy, Fast identification | 845 | ML | LDH, AST, CRP, calcium, WBC, age | Int.-ext. val. | AUC 0.83–0.90 | LHLLH
Batista et al. [8], Brazil, Diagnosis | 102 | ML, NN, RF, GB trees, LR, SVM | lymphocytes, leukocytes, eosinophils | 10-FCV | AUC 0.85, SEN 68, SPE 85, PPV 78, NPV 77 | HLHHH
Cai et al. [10], China, Predict RT-PCR negativity during clinical treatment | 81 | DL | 9 CT quantitative features and radiomic features | TTS | AUC 0.811–0.812, SEN 76.5, SPE 62.5 | HHHHH
Mei et al. [65], China, Diagnosis | 419 | DCNN | Imaging features, age, exposure to SARS-CoV-2, fever, cough, cough with sputum, WBC | TTS | AUC 0.92, SEN 84.3 | HHHLH
Ren et al. [11], China, Diagnosis | 58 | AI | unclear | unspecified | AUC 0.740, SEN 91.2, SPE 58.8 | LUUHH

*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.

**L: Low, H: High, U: Unclear

Table 4

Results for diagnostic imaging models – part 1.

Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall
Chen et al. [92], China, Diagnosis | 51 | DL | Imaging features | TTS | Acc 95.24, SEN 100, SPE 93.55, PPV 84.62, NPV 100 | HULHH
Rahimzadeh et al. [33], Diagnosis | 118 | DNN, Nadam optimizer | Imaging features | TTS | Acc 99.50 | LULHH
Roy et al. [72], Italy, Diagnosis | 17 | DL | Imaging biomarkers | 5-FCV | F1-score 65.9 | HUHHH
Zhou et al. [73], China, Diagnosis | 35 | DL | Imaging features | Ext. val. | AUC > 0.93 | HUUHH
Ter-Sarkisov et al. [93], China, Diagnosis | 150 | DL, CNN | Imaging features | TTS | Acc 91.66, SEN 90.80 | HUUHH
Qjidaa et al. [19], Early detection | unclear | DL, CNN | Imaging features | Int.-ext. val. | Acc 92.5, SEN 92 | UUUHH
Babukarthik et al. [74], Early diagnosis | 102 | GDCNN | Imaging features | unclear | Acc 98.84, SEN 100, SPE 97.0 | HHHHH
Minaee et al. [20], Diagnosis | unclear | CNN | Raw images without feature extraction | TTS | SEN 98, SPE 92 | UUUHH
Yan et al. [94], China, Diagnosis | 206 | CNN | Imaging features | TTS | SEN 99.5 (95% CI: 99.3–99.7), SPE 95.6 (95% CI: 94.9–96.2) | LHHHH
Lokwani et al. [95], India, Diagnosis | 55 | NN | Imaging features | TTS | SEN 96.4 (95% CI: 88–100), SPE 88.4 (95% CI: 82–94) | UUHHH
Jin et al. [96], China, Screening (early detection) | 751 | DL, DNN | Imaging features | TTS | AUC 0.97, SEN 90.19, SPE 95.76 | HUUHH
Ko et al. [29], South Korea, Diagnosis | 202 | 2D DL | Imaging features | TTS, ext. val. | Acc 99.87, SEN 99.58, SPE 100.00 | UUUHH
Ezzat et al. [34], Diagnostic imaging | 99 | Hybrid CNN | Not applicable | TTS | Acc 98 | HUHLH
Ouchicha et al. [40], Diagnosis | 43 | DCNN | Imaging features | 5-FCV | Acc 97.20 | UULHH
Xiong et al. [75], China, Diagnosis | 521 | DL, CNN | Imaging features | TTS, ext. val. | AUC 0.95, Acc 96 (95% CI: 90–98), SEN 95 (95% CI: 83–100), SPE 96 (95% CI: 88–99) | HUHHH
Li et al. [2], China, Diagnosis | 468 | DL, CNN | Imaging features | TTS | AUC 0.96, SEN 90 (95% CI: 83–94), SPE 96 (95% CI: 93–98) | LUHHH
Mahmud et al. [25], China, Diagnosis | unclear | DCNN | Imaging features | 5-FCV | Acc 97.4 | UUUHH
Li et al. [97], China, Diagnosis | 305 | NN | Imaging features | TTS | Precision 93% | UULHH
Sun et al. [98], China, Diagnosis | 1495 | LR, SVM, RF, NN | 30 imaging features: volume features, infected lesion number, histogram distribution, surface area, radiomics features | 5-FCV | Acc 91.79, SEN 93.05, SPE 89.95 | LUHHH

*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.

**L: Low, H: High, U: Unclear

Table 5

Results for prognostic models – part 1.

Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall
Muhammad et al. [9], South Korea, Recovery prediction, disease progression | unclear | DT, SVM, NB, LR, RF, k-NN | unclear | 5-FCV | Acc 99.85 (Decision Tree) | HUUUH
Cheng et al. [78], United States, Severity assessment (risk prioritization tool that predicts ICU transfer within 24 h) | 1987 | RF | respiratory failure, shock, inflammation, renal failure | TTS, 10-FCV | AUC 79.9 (95% CI: 75.2–84.6), Acc 76.2 (95% CI: 74.6–77.7), SEN 72.8 (95% CI: 63.2–81.1), SPE 76.3 (95% CI: 74.7–77.9) | HHHHH
Kim et al. [79], South Korea, ICU need prediction | 4787 | 55 ML models developed (XGBoost model revealed the highest discrimination performance) | age, sex, smoking history, body temperature, underlying comorbidities, activities of daily living (ADL), symptoms | TTS | AUC 0.897 (95% CI: 0.877–0.917) | HUHUH
Yadaw et al. [101], United States, Mortality prediction | 4802 | ML, RF, LR, SVM, XGBoost | age, minimum oxygen saturation over the course of their medical encounter, type of patient encounter (inpatient vs outpatient and telehealth visits) | TTS | AUC 91 | LHHHH
Klann et al. [102], USA, France, Italy, Germany, Singapore, Severity assessment | 4227 | ML | PaCO2, PaO2, ARDS, sedatives, d-dimer, immature granulocytes, albumin, chlorhexidine, glycopyrrolate, palliative care encounter | 5-FCV, TTS | AUC 0.956 (95% CI: 0.952–0.959) | UUUHH
Navlakha et al. [103], United States, Severity assessment in cancer patients (predicting severity occurring after 3 days) | 354 | ML, RF, DT | 40 out of 267 clinical variables (3 most important individual lab variables: platelets, ferritin, and AST (aspartate aminotransferase)) | 10-FCV | AUC 70–85 | LHHHH
Shashikumar et al. [104], United States, Mechanical ventilation need prediction (24 h in advance) | 777 | DL | vital signs, laboratory values, Sequential Organ Failure Assessment (SOFA) scores, Charlson Comorbidity Index (CCI) scores, demographics, length of stay, outcomes | Ext. val., 10-FCV | AUC 0.918 | LHHLH
Bertsimas et al. [80], Greece, Italy, Spain, United States, Mortality risk | 3,927 | XGBoost | increased age, decreased oxygen saturation (<93%), elevated levels of CRP (>130 mg/L), blood urea nitrogen, blood creatinine | Cross-validation | AUC 0.90 (95% CI: 0.87–0.94) | HHULH

*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.

**L: Low, H: High, U: Unclear

Table 6

Results for Prognostic Imaging models.

Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall
Fakhfakh et al. [42], Prognosis | 42 | RNN, CNN | unclear | unspecified | Acc 92 | HUULH
Zhu et al. [66], China, Disease progression prediction | 408 | SVM, LR | Imaging features | 5-FCV | Acc 85.91 | LUUHH
Qi et al. [67], China, Hospital stay prediction (short-term (<10 days), long-term (>10 days)) | 31 | LR, RF | Imaging features (CT radiomics) | 5-FCV | AUC 0.97 (95% CI: 0.83–1.0), SEN 100, SPE 89 | ULLHH
Xiao et al. [68], China, Severity assessment, disease progression | 408 | DL, CNN, ResNet34 (RNN) | Imaging features | 5-FCV | AUC 0.987 (95% CI: 0.968–1.00), Acc 97.4 | LUUHH
Cohen et al. [36], Severity assessment for COVID-19 pneumonia | 80 | NN | CXR features | not performed | — | UUUHH
Salvatore et al. [69], Italy, Prognosis prediction (discharge home, hospitalization in stable condition, hospitalization in critical condition, death) | 98 | LR | Imaging features | not performed | Acc 81, SEN 88, SPE 78 | HUUHH
Liu et al. [44], China, Severity assessment | 134 | CNN | Imaging features (APACHE-II, NLR, d-dimer level) | Ext. val. | AUC 0.93 (95% CI: 0.87–0.99) | LUUUU

*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.

**L: Low, H: High, U: Unclear

Table 7

Results for diagnostic and prognostic Imaging models.

Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall
Chassagnon et al. [45], France, Quantification, staging and prognosis of COVID-19 pneumonia | 693 | DL, 2D-3D CNN, RBF SVM, Linear SVM, AdaBoost, RF, DT, XGBoost | 15 radiomics features: imaging from the disease regions (5 features), lung regions (5 features) and heart (5 features); biological and clinical data (6 features: age, sex, high blood pressure (HBP), diabetes, lymphocyte count, CRP level); image indexes (2 features: disease extent and fat ratio) | TTS | Acc 70, SEN 64, SPE 77 (Holistic Multi-Omics Profiling & Staging); Acc 71, SEN 74, SPE 82 (AI prognosis model) | LULLU

*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.

**L: Low, H: High, U: Unclear

Primary multimodal datasets

Several datasets consisting of multimodal data (e.g., demographic, clinical, imaging) were identified among the included studies. In total, 70 studies used COVID-19 positive patient data derived from hospitals in various countries (41 in China, 12 in the United States, 6 in Italy, 4 in Brazil, 3 in South Korea, 3 in the UK, and one study for each of the following countries: France, Spain, Germany, Singapore, Greece, Denmark, Mexico and Israel). 
Apart from patient datasets derived from hospitals in the above countries, various publicly available online databases of COVID-19 and pneumonia CXRs ad CT scans were used in the included studies such as: Italian Society of Medical and Interventional Radiology (SIRM) COVID-19 database [28]: SIRM COVID-19 database reports 384 COVID-19 positive radiographic images (CXR and CT) with varying resolution. Out of 384 radiographic images, 94 images are chest X-ray images and 290 images are lung CT images. This database is updated in a random manner and until 9th December 2020, 115 confirmed COVID-19 cases were reported in this database. This dataset was used by three included studies [29], [30], [31]. Novel Corona Virus 2019 Dataset [32]: This is a public database in GitHub by collecting 319 radiographic images of COVID-19, Middle East respiratory syndrome (MERS), Severe acute respiratory syndrome (SARS) and ARDS from the published articles and online resources. In this database, they have collected 250 COVID-19 positive chest X-ray images and 25 COVID-19 positive lung CT images with varying image resolutions. This dataset was used by twelve studies [5], [30], [33], [19], [20], [34], [35], [36], [37], [21], [22], [23]. J. Kaggle chest X-ray database [38]: This is a very popular database, which has 5,863 images chest X-ray images of normal, viral and bacterial pneumonia with resolution varying from 400p to 2000p. Two included studies [30], [22] used this dataset. K. COVID-19 Radiography Database [39]: This database was created for three different types of images classified as chest x-ray images belonging to patients infected with COVID-19, chest x-ray images of cases with viral pneumonia and Chest x-ray images of healthy persons. There are currently 1200 COVID-19 positive images, 1341 normal images, and 1345 viral pneumonia images. This database was used by [40], [24]. L. 
COVID-19 cases open database [41]: This database contains temporal acquisitions for 42 patients, with up to 5 X-ray images per patient and a ground-truth annotation of the therapeutic outcome for each patient: death or survival. This ground-truth annotation can aid in developing prognostic models such as the one presented by Fakhfakh et al. [42], which classifies multi-temporal chest X-ray images and predicts the evolution of the observed lung pathology based on a combination of convolutional and recurrent neural networks. Moreover, regarding the type of input data employed in the developed models, several studies used demographic, clinical and imaging data, or even a combination of those multimodal data types. The study by Fang et al. [43] is reported to be among the first attempts to fuse clinical data and sequential CT scans to improve the performance of predicting COVID-19 malignant progression in an end-to-end manner. Liu et al. [44] also combined quantitative CT features of pneumonia lesions with traditional clinical biomarkers to predict progression to severe illness in the early stages of COVID-19. Additionally, Chassagnon et al. [45] reported an AI solution for performing automatic screening and prognosis based on imaging, clinical, comorbidity and biological data.

AI algorithms

The most frequently used AI algorithms for classification purposes across all studies were Random Forests (RF), Logistic Regression (LR), Support Vector Machines (SVM), Convolutional Neural Networks (CNN), Decision Trees (DT) and XGBoost (XGB), six widely used classification methods. Random Forest is a tree-based learning algorithm that aggregates decision trees built from randomly selected subsets of the training data to solve a classification problem [16]. A logistic regression model predicts the probability of a categorical dependent variable occurring [16]. The SVM model seeks the hyperplane with maximal distance (margin) between two classes [16].
CNN is one of the most widely used deep neural network architectures, with multiple layers including convolutional, non-linearity, pooling, and fully connected layers. CNNs show excellent performance in machine learning problems, especially in imaging studies [46]. DTs are one of the most popular approaches for representing classifiers, expressed as a recursive partition of the instance space [47]. XGB generates a series of decision trees in sequential order; each decision tree is fitted to the residual between the prediction of the previous decision tree and the target value, and this is repeated until a predetermined number of trees or a convergence criterion is reached [48]. AI techniques based on the above algorithms, or on other algorithmic approaches for classification purposes, that were identified among the reviewed studies included GDCNN (Genetic Deep Learning Convolutional Neural Network) [74], CRT (Classification and Regression Decision Tree) [116], ET (Extra Trees) [15], GBDT (Gradient Boost Decision Tree) [4], [71], GBM light (Gradient Boosting Machine light) [100], [115], Adaboost (Adaptive Boosting) [27], Boost Ensembling [58], [45], [59], k-NN (K-Nearest Neighbor) [5], [9], [58], [70], [88], [106], [112], NB (Naïve Bayes) [9], BNB (Bernoulli Naïve Bayes) [5], GNB (Gaussian Naïve Bayes) [5], [27], Inception Resnet (Inception Residual Neural Network) [117], LDA (Linear Discriminant Analysis) [43], RBF (Radial Basis Function) [45], and LSTM (Long-Short Term Memory) [43]. Moreover, the use of AI techniques for data preprocessing purposes was reported in the reviewed studies. In particular, AI techniques including ARMED (Attribute Reduction with Multi-objective Decomposition Ensemble optimizer) [62], GFS (Gradient boosted feature selection) [62], MRMR (Maximum Relevance Minimum Redundancy) [84], and RFE (Recursive Feature Elimination) [62] were used for feature selection.
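The residual-fitting loop described above for XGB can be illustrated with a minimal, self-contained sketch using one-dimensional regression stumps. This is a toy example provided for clarity, not code from any included study; real XGBoost implementations add regularization, shrinkage schedules and second-order gradients.

```python
# Toy gradient boosting: each stump is fitted to the residual left by
# the ensemble so far, mirroring the sequential scheme described above.

def fit_stump(x, residual):
    """Find the 1-D split threshold minimizing the squared error of a stump."""
    best = None
    for t in sorted(set(x)):
        left = [r for xi, r in zip(x, residual) if xi <= t]
        right = [r for xi, r in zip(x, residual) if xi > t]
        if not left or not right:
            continue
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        err = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
        if best is None or err < best[0]:
            best = (err, t, lm, rm)
    _, t, lm, rm = best
    return lambda xi, t=t, lm=lm, rm=rm: lm if xi <= t else rm

def gradient_boost(x, y, n_rounds=20, lr=0.5):
    """Sequentially fit stumps to residuals; return the ensemble predictor."""
    base = sum(y) / len(y)                      # start from the mean
    pred = [base] * len(x)
    stumps = []
    for _ in range(n_rounds):
        residual = [yi - pi for yi, pi in zip(y, pred)]
        stump = fit_stump(x, residual)
        stumps.append(stump)
        pred = [pi + lr * stump(xi) for pi, xi in zip(pred, x)]
    return lambda xi: base + sum(lr * s(xi) for s in stumps)

x = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
y = [0.0, 0.0, 0.0, 1.0, 1.0, 1.0]             # a simple step function
model = gradient_boost(x, y)
```

With each round the residuals shrink geometrically, so after 20 rounds the ensemble reproduces the step function almost exactly.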
Regression models, including L1LR (L1 Regularized Logistic Regression) [17], LASSO (Least Absolute Shrinkage and Selection Operator) [50], [59], [84], [89], [112], CoxPH (Cox Proportional Hazards) [100], and SRLSR (Sparse Rescaled Linear Square Regression) [62], were also employed for feature selection. The use of the Nadam optimizer (Nesterov-Accelerated Adaptive Moment optimizer) for model optimization was reported in two studies [33], [117], while application of SMOTE (Synthetic Minority Oversampling TEchnique) for data augmentation was reported in one study [58]. Finally, NLP (Natural Language Processing) was employed in three studies [7], [48], [86] for data mining purposes. Regarding novel technologies used in the included studies, explainable AI (XAI) methods and Federated Learning (FL) were investigated. XAI refers to the ability to explain to a domain expert the reasoning that enables the algorithm to produce its results, and is deemed increasingly important in health AI applications [49]. FL, used in [24], [50], is a nascent field for data-private multi-institutional collaborations, where model learning leverages all available data without sharing data between hospitals, by distributing the model training to the data owners and aggregating their results [51].

Validation methods

The most frequent validation methods used in the included studies were the training-test split (TTS) (34 studies), 5-fold cross-validation (FCV) (23 studies) and 10-fold cross-validation (19 studies). Other studies performed internal and external validation. Internal validation techniques, in which no data other than the study sample are used, are advocated to estimate the potential for overfitting in the performance of the developed model [52]. External validation is used to adjust or update the model on data other than the study sample [52]. The TTS validation method splits the data into training and testing datasets based on a predefined threshold.
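To make the two resampling schemes concrete, the following minimal sketch (illustrative only; the included studies relied on library implementations) generates the index sets for a TTS split and for k-fold cross-validation:

```python
import random

def train_test_split_idx(n, test_frac=0.2, seed=0):
    """TTS: shuffle indices once and cut at a predefined threshold."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    cut = int(n * (1 - test_frac))
    return idx[:cut], idx[cut:]

def k_fold_idx(n, k=5, seed=0):
    """k-FCV: each sample appears in exactly one validation fold."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    return [(sorted(set(idx) - set(f)), sorted(f)) for f in folds]

train_idx, test_idx = train_test_split_idx(100)   # 80/20 split
splits = k_fold_idx(100, k=5)                     # five (train, val) pairs
```

TTS evaluates the model on a single held-out partition, whereas k-FCV rotates the held-out fold so that every sample contributes once to validation.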
FCV is the traditional method for estimating the future error rate of a prediction rule constructed from a training set of data [53]. External validation uses new participant-level data, external to those used for model development, to examine whether the model's predictions are reliable and adequately accurate in individuals from the potential population for clinical use [54]. All validation methods used in the included studies are reported in Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7.

Models developed for screening purposes

We identified 14 models for screening COVID-19 (see Table 1, Table 2). Five screening models were based on medical imaging [5], [30], [31], [55], [56], using CXRs and CT scans from public databases or hospitals in China. Wang et al. [56] were the first to develop and validate a deep learning algorithm on the basis of chest CT scans of 1,647 COVID-19 positive patients acquired from fever clinics of five hospitals in Wuhan, China, for rapid triaging, achieving AUC 0.953 (95% CI 0.949–0.959), SEN 0.923 (95% CI 0.914–0.932), SPE 0.851 (0.842–0.860), PPV 0.790 (0.777–0.803) and NPV 0.948 (0.941–0.954). The rest of the screening studies [4], [3], [57], [58], [59], [60], [61], [62] used as input data, among others, demographic data, comorbidities, epidemiological history of exposure to COVID-19, vital signs, blood test values, clinical symptoms, infection-related biomarkers, and days from illness onset to first admission. Only one study [63] used ocular surface photographs (eye-region images) as input data, demonstrating that asymptomatic and mild COVID-19 patients have ocular features that distinguish them from others.
In screening studies, the most prominent predictors were age, platelets, leukocytes, monocytes, eosinophils, lymphocytes, CRP and WBC from routine blood tests, and imaging features from CXR and CT images. Only one study [57] reported as predictors 4 epidemiological features (relationship with a cluster outbreak; travel or residence history in Wuhan over the past 14 days; exposure over the past 14 days to patients with fever or respiratory symptoms who had a travel or residence history in Wuhan; exposure over the past 14 days to patients with fever or respiratory symptoms who had a travel or residence history in other areas with persistent local transmission, or in a community with definite cases) and 6 clinical manifestations (muscle soreness, dyspnea, fatigue, lymphocyte count, white blood cell count, imaging changes on chest X-ray or CT).

Diagnostic prediction models

Thirty-eight diagnostic prediction models for detecting COVID-19 were identified, out of which 32 were based on medical imaging using CXRs or CT scans as input data. Results are presented in Table 3, Table 4. The rest of the diagnostic studies [64], [8], [9], [10], [65], [11], [45] used as input, among other data, age, gender, demographics, symptoms, routine blood exam results, and clinical characteristics. The largest dataset of COVID-19 positive patients (CPP) used in the included diagnostic imaging studies comprised 2,060 patients (5,806 CXRs; mean age 62 ± 16; 1,059 men) [7]. In this retrospective study, a deep neural network, CV19-Net, was trained, validated, and tested on CXRs to differentiate COVID-19 related pneumonia from other types of pneumonia, achieving AUC 0.92 (95% confidence interval [CI]: 0.91, 0.93), SEN 88% (95% CI: 87%, 89%) and SPE 79% (95% CI: 77%, 80%). In the non-imaging diagnostic studies, the largest number of CPPs was 845 patients admitted to an Italian hospital from February to May 2020 [64].
Routine blood tests of 1,624 patients were exploited in this study to develop machine learning models for diagnosing COVID-19, achieving AUCs ranging from 0.83 to 0.90. The most prominent predictors were age, WBC, LDH, AST, CRP and calcium [64]. The predictors most frequently reported in the included studies for identification or diagnosis of COVID-19 cases were age, lymphocytes, WBC, and quantitative and radiomic features derived from CXR and CT images.

Prognostic prediction models

We identified 50 prognostic models (7 based on medical imaging) [36], [42], [66], [67], [68], [69], [44] for predicting hospitalization need (8 studies), ICU need (10 studies), ventilator need (8 studies), mortality risk (17 studies), severity assessment (16 studies), recovery prediction or disease progression (9 studies) or hospital length of stay (3 studies). The results are presented in Table 5 and Table 6. Table 7 presents the results for one combined diagnostic and prognostic model. The first study to jointly predict disease progression and conversion time, which could help clinicians deal with potentially severe cases in time or even save patients' lives, used Support Vector Machine (SVM) and Linear Regression (LR) methods on 408 chest CT scans from COVID-19 positive patients [66]. The largest dataset of COVID-19 positive patients included in the prognostic studies comprised 117,000 patients worldwide. In this study, Support Vector Machine, Artificial Neural Network, Random Forest, Decision Tree, Logistic Regression, and K-Nearest Neighbor models were used to predict the mortality rate based on the patients' physiological conditions, symptoms, and demographic information [70]. Most of the prognostic prediction models did not report how many days in advance they can produce predictions. Only fourteen studies reported predictions with a time range varying from 12 h to 20 days before the outcome. The longest prediction horizon prior to the outcome was reported by Gao et al.
[71], who presented a mortality risk prediction model for COVID-19 (MRPMC) that uses patients' clinical data on admission to stratify them by mortality risk, enabling prediction of physiological deterioration and death up to 20 days in advance. The predictors most frequently reported for prognosis of COVID-19 cases were age, CRP, lymphocytes, LDH and imaging features derived from CXR and CT images.

Risk of Bias assessment

The results of the RoB analysis (H: High, L: Low, U: Unclear) are provided for each included study in Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7. In total, 98 prediction models were at high overall risk of bias and three studies [44], [45], [48] were at unclear risk of bias, indicating that further research needs to be conducted before they can be fully applied in clinical practice. In the participants domain, 30 studies had high risk of bias and 21 unclear. Sources of bias in the participants domain varied from small or incomplete datasets to exclusion criteria, indicating the need for further data collection to test the generalizability of the developed AI models to other patient populations [2], [7], [15], [16], [17], [35], [37], [43], [48], [57], [59], [65], [69], [71], [72], [73], [74], [75], [76], [77], [78], [79], [80], [81], [82], [83], [84], [85], [86], [87], [88], [89], [90], [91]. Thirty studies had high RoB in the predictors domain, related to inconsistent definitions and assessment of predictors across participants or to predictor availability. Finally, RoB in the analysis domain was high for 99 studies and unclear for two studies [44], [48]. The most frequent reasons for a high rating in the analysis domain under the PROBAST protocol were the number of participants, missing data on predictors and outcomes, and exclusion criteria for participants, reported as limitations in the included studies.

Discussion

In the present systematic review, we included 101 studies, published from January 1st, 2020 to December 30th, 2020, that developed or validated screening, diagnostic and prognostic prediction models applicable in clinical practice. Even though most of the studies reported high-performing algorithms, the results of the RoB analysis conducted in the present review indicate that application in clinical practice may be problematic. Limitations related to the applicability of the developed prediction models were reported by several studies. The most prominent limitation reported was the use of a single data source (one hospital from one geographical area) for the algorithm's training [4], [10], [11], [15], [17], [30], [56], [58], [59], [62], [63], [64], [69], [44], [76], [83], [84], [91], [92], [78], [103], [107], [106], [108], [109], [112], [114], [115], [116], [118]. The generalizability of the trained models can be enhanced by adding multiple data sources in future studies. Concerning the results of the present review, a clear distinction between prediction models that relied on imaging features and models that relied on clinical or laboratory data was evident among the included studies. In particular, medical imaging studies were most prominent for diagnostic purposes (31 out of 37) and least prominent for prognostic purposes (7 out of 50). Models developed for screening purposes were rather few (14) compared to the other two categories, with no clear dominance of medical images over clinical or laboratory data (5 and 9 studies, respectively). Analysis regarding the handling of unbalanced data and the application of appropriate performance metrics across the developed models was not feasible because different validation methods were applied. Student's t-test was applied to investigate differences in dataset sizes among the included studies, in terms of total number of participants and of COVID-19 Positive Patients.
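The dataset-size comparison relies on the two-sample t statistic, which, leaving aside the p-value lookup, can be sketched as follows (a generic illustration, not the authors' analysis code):

```python
from math import sqrt

def students_t(a, b):
    """Two-sample Student's t statistic with pooled variance."""
    na, nb = len(a), len(b)
    ma, mb = sum(a) / na, sum(b) / nb
    va = sum((x - ma) ** 2 for x in a) / (na - 1)   # sample variances
    vb = sum((x - mb) ** 2 for x in b) / (nb - 1)
    sp2 = ((na - 1) * va + (nb - 1) * vb) / (na + nb - 2)  # pooled variance
    return (ma - mb) / sqrt(sp2 * (1 / na + 1 / nb))

t_stat = students_t([1.0, 2.0, 3.0], [2.0, 3.0, 4.0])
```

The statistic is then compared against the t distribution with na + nb - 2 degrees of freedom to obtain the p-value reported in the review.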
Such comparison between the datasets employed in screening, diagnostic and prognostic models revealed no statistically significant differences (p > 0.05). The average number of total participants in the datasets of non-imaging medical studies proved to be significantly higher than in those of medical imaging studies (11,033.45 versus 1,528.27, p = 0.04). No such difference was identified for the number of COVID-19 Positive Patients between these two categories (p > 0.05). Regarding the performance criteria applied for evaluation purposes in the included studies, some dissimilarities were observed between models developed for different purposes. In particular, the AUC score was the most prevalent metric in models developed for prognostic (69.38%) and screening (71.14%) purposes, but not for diagnostic (35.12%) purposes. The most widely used metric in models for diagnostic purposes was SEN (70.27%), followed by SPE (59.45%) and Acc (54.05%). Dissimilarities among performance criteria were also evident between models in medical imaging studies, with Acc (63.63%) and SEN (63.63%) being the most prevalent, and models in non-imaging medical studies, with the AUC score (76.78%) being the most common. Comparison between different types of prediction models that employ heterogeneous predictors is exceedingly difficult, given that the performance criteria applied were not similar. Additionally, since most prediction models were trained on a specific localized dataset, the evaluation of the AI techniques used and the importance of predictors cannot be discerned through meta-analysis of the results presented in each study. This underscores the need for global collaboration and data sharing, to enable the development of validated benchmarks for the evaluation of newly introduced AI techniques. This would expedite the application of new prediction models in clinical practice, especially in times of extreme urgency such as the COVID-19 pandemic.
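For reference, the performance metrics compared above (SEN, SPE, Acc and AUC) can be computed from binary labels, hard predictions and continuous scores as in the following minimal sketch; AUC is expressed here via the equivalent positive/negative pair-ranking formulation:

```python
def sen_spe_acc(y_true, y_pred):
    """Sensitivity, specificity and accuracy from hard 0/1 predictions."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp / (tp + fn), tn / (tn + fp), (tp + tn) / len(y_true)

def auc(y_true, scores):
    """AUC as the fraction of positive-negative pairs ranked correctly."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

y = [1, 1, 0, 0]
sen, spe, acc = sen_spe_acc(y, [1, 0, 0, 0])
roc_auc = auc(y, [0.9, 0.8, 0.4, 0.3])
```

Unlike SEN, SPE and Acc, which depend on a chosen decision threshold, AUC summarizes ranking quality across all thresholds, which is one reason studies differ in which metric they report.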
Concerning the type of multimodal input data used in the developed models, three studies [43], [45], [44] demonstrated the advantages of combining clinical features and image features (CT scans, CXR images), indicating that both CT scans and clinical data are of paramount importance to the diagnosis and prognosis of COVID-19. Moreover, developing AI diagnostic and prognostic models in an end-to-end manner enables the use of raw data without the need for manual design of feature patterns or interference by clinicians. Therefore, future AI studies can explore more methods of fusing clinical and image features, as well as developing end-to-end models for use in clinical practice. Regarding novel technologies, we investigated the use of explainable AI and Federated Learning in the developed prediction models. Evaluation of the AI methods presented in this review in terms of explainability proved to be difficult, due to the lack of uniform adoption of interpretability assessment criteria across the research community [119]. A multidisciplinary approach combining medical expertise and data science engineering in future studies might be necessary to overcome this difficulty [119]. The integration of explainability modalities in the developed models can enhance human understanding of the reasoning process, maximize transparency and bolster trust towards the models' use in clinical practice [120]. Therefore, XAI techniques could prove critical in tackling volatile crises like the COVID-19 pandemic, and as such, XAI should be taken into consideration in future work. Moreover, clinical adoption of FL is expected to lead to models trained on datasets of unprecedented size, thus having a catalytic impact towards precision and personalized medicine [51].
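The FL aggregation step can be sketched as a weighted average of locally trained parameters, in the spirit of the FedAvg algorithm (an illustrative assumption on our part, not the exact protocol of the cited studies), with only model weights, never patient records, leaving each hospital:

```python
def federated_average(local_weights, local_sizes):
    """Aggregate per-hospital parameter vectors, weighted by local dataset size."""
    total = sum(local_sizes)
    dim = len(local_weights[0])
    return [sum(w[i] * n for w, n in zip(local_weights, local_sizes)) / total
            for i in range(dim)]

# Three hypothetical hospitals train locally and share only their parameters.
hospital_weights = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
hospital_sizes = [100, 100, 200]                 # local dataset sizes
global_weights = federated_average(hospital_weights, hospital_sizes)
```

In practice this aggregation is repeated over many communication rounds, with the averaged model redistributed to the hospitals for further local training.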
FL shows promise in handling new coronavirus electronic health record data to develop robust predictive models without compromising patient privacy [50], and can inspire more research on future COVID-19 applications while boosting global data sharing. One limitation of the present review is the rapid growth of new COVID-19 related AI models in the literature, which makes it challenging to compile a complete list of available studies. Another limitation is that 31 of the included studies were preprints, which may differ from the final versions once accepted for official publication. A follow-up of the included studies at the time of writing indicated that 7 of the included preprints had already been peer-reviewed [34], [59], [45], [70], [76], [81], [102]. Based on the results of the Risk of Bias analysis, further research needs to be conducted to decrease the sources of bias in the included studies. Future studies can investigate the role of the most prominent predictors, such as age, platelets, leukocytes, monocytes, eosinophils, lymphocytes, CRP, LDH and WBC from routine blood tests, and imaging features from CXR and CT images. The most prominent AI methods (RF, LR, SVM, CNN, DT and XGBoost), along with the aforementioned predictors, can serve as a leading approach in developing and validating future screening, diagnostic and prognostic prediction models applicable in clinical practice.

Conclusion

Artificial Intelligence methods are critical tools for utilizing the rapidly growing body of COVID-19 positive patient datasets, and have contributed substantially to the fight against this pandemic. These multimodal datasets may include collected vitals, laboratory tests, comorbidities, CT scans or CXRs. In the present systematic review, we discussed the applicability of, and provided an overview of, the AI-based prediction models developed in the rapidly growing literature, which can be used for screening, diagnosis or prognosis of COVID-19 in the clinical setting. Limitations and considerations regarding the design and development of these prediction models were identified, and future directions were proposed. Moreover, novel technologies such as explainable AI and Federated Learning could prove critical in tackling volatile crises like the COVID-19 pandemic. Increased collaboration in the development of AI prediction models can enhance their applicability in clinical practice and assist healthcare providers and developers in the fight against this pandemic and other public health crises.

CRediT authorship contribution statement

Eleni S. Adamidi: Conceptualization, Methodology, Formal analysis, Investigation, Data curation, Visualization, Writing - original draft, Writing - review & editing. Konstantinos Mitsis: Conceptualization, Methodology, Formal analysis, Investigation, Data curation, Visualization, Writing - original draft. Konstantina S. Nikita: Conceptualization, Methodology, Supervision, Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Cited by (9 in total)

1.  Uncovering Clinical Risk Factors and Predicting Severe COVID-19 Cases Using UK Biobank Data: Machine Learning Approach.

Authors:  Kenneth Chi-Yin Wong; Yong Xiang; Liangying Yin; Hon-Cheong So
Journal:  JMIR Public Health Surveill       Date:  2021-09-30

2.  Validation of the Performance of A1HPV6, a Triage Blood Test for the Early Diagnosis and Prognosis of SARS-CoV-2 Infection.

Authors:  Pauline Maisonnasse; Thierry Poynard; Mehdi Sakka; Sepideh Akhavan; Romain Marlin; Valentina Peta; Olivier Deckmyn; Nesrine Braham Ghedira; Yen Ngo; Marika Rudler; Sylvie van der Werf; Stephane Marot; Dominique Thabut; Harry Sokol; Chantal Housset; Alain Combes; Roger Le Grand; Patrice Cacoub
Journal:  Gastro Hep Adv       Date:  2022-02-07

3. (Review) COVID-19, Cation Dysmetabolism, Sialic Acid, CD147, ACE2, Viroporins, Hepcidin and Ferroptosis: A Possible Unifying Hypothesis.

Authors:  Attilio Cavezzi; Roberto Menicagli; Emidio Troiani; Salvatore Corrao
Journal:  F1000Res       Date:  2022-01-27

4.  Machine learning model for predicting the length of stay in the intensive care unit for Covid-19 patients in the eastern province of Saudi Arabia.

Authors:  Dina A Alabbad; Abdullah M Almuhaideb; Shikah J Alsunaidi; Kawther S Alqudaihi; Fatimah A Alamoudi; Maha K Alhobaishi; Naimah A Alaqeel; Mohammed S Alshahrani
Journal:  Inform Med Unlocked       Date:  2022-04-14

5.  A process mining- deep learning approach to predict survival in a cohort of hospitalized COVID-19 patients.

Authors:  M Pishgar; S Harford; J Theis; W Galanter; J M Rodríguez-Fernández; L H Chaisson; Y Zhang; A Trotter; K M Kochendorfer; A Boppana; H Darabi
Journal:  BMC Med Inform Decis Mak       Date:  2022-07-25       Impact factor: 3.298

6.  Application of Machine Learning in Hospitalized Patients with Severe COVID-19 Treated with Tocilizumab.

Authors:  Antonio Ramón; Marta Zaragozá; Ana María Torres; Joaquín Cascón; Pilar Blasco; Javier Milara; Jorge Mateo
Journal:  J Clin Med       Date:  2022-08-12       Impact factor: 4.964

7. (Review) Artificial intelligence and its impact on the domains of universal health coverage, health emergencies and health promotion: An overview of systematic reviews.

Authors:  Antonio Martinez-Millana; Aida Saez-Saez; Roberto Tornero-Costa; Natasha Azzopardi-Muscat; Vicente Traver; David Novillo-Ortiz
Journal:  Int J Med Inform       Date:  2022-08-17       Impact factor: 4.730

8.  Triage in the time of COVID-19.

Authors:  Allison Gilbert; Alexandre Ghuysen
Journal:  Lancet Digit Health       Date:  2022-03-09

9. (Review) Artificial Intelligence in Infection Management in the ICU.

Authors:  Thomas De Corte; Sofie Van Hoecke; Jan De Waele
Journal:  Crit Care       Date:  2022-03-22       Impact factor: 9.097

