| Literature DB >> 35456357 |
Ștefan Busnatu1, Adelina-Gabriela Niculescu2, Alexandra Bolocan1, George E D Petrescu1, Dan Nicolae Păduraru1, Iulian Năstasă1, Mircea Lupușoru1, Marius Geantă3, Octavian Andronic1, Alexandru Mihai Grumezescu2,4,5, Henrique Martins6.
Abstract
Artificial intelligence has the potential to revolutionize modern society in all its aspects. Encouraged by the variety and vast amount of data that can be gathered from patients (e.g., medical images, text, and electronic health records), researchers have recently increased their interest in developing AI solutions for clinical care. Moreover, a diverse repertoire of methods can be chosen towards creating performant models for use in medical applications, ranging from disease prediction, diagnosis, and prognosis to opting for the most appropriate treatment for an individual patient. In this respect, the present paper aims to review the advancements reported at the convergence of AI and clinical care. Thus, this work presents AI clinical applications in a comprehensive manner, discussing the recent literature studies classified according to medical specialties. In addition, the challenges and limitations hindering AI integration in the clinical setting are further pointed out.Entities:
Keywords: artificial intelligence; clinical applications; deep learning; machine learning; personalized medicine; precision medicine
Year: 2022 PMID: 35456357 PMCID: PMC9031863 DOI: 10.3390/jcm11082265
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.964
Figure 1Relationship between AI, ML, and DL. Created based on information from [4,8,9,10].
Important AI-related terms and definitions.
| Term | Description | References |
|---|---|---|
| Machine learning (ML) | Process by which an algorithm encodes statistical regularities from a database of examples into parameter weights for future predictions | [ |
| Deep learning (DL) | Multilayered complex ML platform comprised of numerous computational layers able to make accurate predictions | [ |
| Supervised learning | Training an ML algorithm using previously labeled training examples, consisting of inputs and desired outputs provided by an expert | [ |
| Unsupervised learning | When an ML algorithm discovers hidden patterns or data groupings without the need for human intervention | [ |
| Reinforcement learning | Learning strategies towards acting optimally in certain situations with respect to a given criterion; such an algorithm obtains feedback on its performance by comparison with this criterion through reward values during training | [ |
| Model | A trained ML algorithm that can make predictions from unseen data | [ |
| Training | Feeding an ML algorithm with examples from a training dataset towards deriving useful parameters for future predictions | [ |
| Features | Components of a dataset describing the characteristics of the studied observations | [ |
| Decision tree | Nonparametric supervised learning method visualized as a graph representing the choices and their outcomes in the form of a tree; each tree consists of nodes (attributes in the group to be classified) and branches (values that a node can take) | [ |
| Random forest | Ensemble classification technique that uses “parallel ensembling”, fitting several decision tree classifiers in parallel on dataset subsamples | [ |
| Naïve Bayes (NB) | Classification technique assuming independence among predictors (i.e., assumes that the presence of a feature in the class is unrelated to the presence of any other feature) | [ |
| Logistic regression | Algorithm using a logistic function to estimate probabilities that can overfit high-dimensional datasets, being suitable for datasets that can be linearly separated | [ |
| K-nearest neighbors (KNN) | “Instance-based learning” or a non-generalizing learning algorithm that does not focus on constructing a general internal model but, rather, stores all instances corresponding to the training data in an | [ |
| Support vector machine (SVM) | Supervised learning model that can efficiently perform linear and nonlinear classifications, implicitly mapping their inputs into high-dimensional feature spaces | [ |
| Boosting | Family of algorithms converting weak learners (i.e., classifiers) to strong learners (i.e., classifiers that are arbitrarily well-correlated with the true classification) towards decreasing the bias and variance | [ |
| Artificial neural network (ANN) | An ML technique that processes information in an architecture comprising many layers (“neurons”), each inter-neuronal connection extracting the desired parameters incrementally from the training data | [ |
| Deep neural network (DNN) | A DL architecture with multiple layers between the input and output layers | [ |
| Convolutional neural network (CNN) | A class of DNN displaying connectivity patterns similar to the connectivity patterns and image processing in the visual cortex | [ |
Figure 2Examples of AI potential applications in clinical care. Reproduced from [6].
Summary of the recent AI studies in cardiology.
| Task/Objective | AI Tool(s) | Data/Validation | Performance | Ref. |
|---|---|---|---|---|
| Prediction of incident essential hypertension within the following year | XGBoost | EHR from Maine Health Information Exchange network: | Predictive accuracy: | [ |
| Detection of AF using smartwatch data | DNN | 9750 participants from Health eHeart Study and 51 patients undergoing cardioversion at the University of California, San Francisco | External validation: | [ |
| Detection of VF | DL based on 1D-CNN and LSTM network | Public repositories of arrhythmia ( | For 4-s ECG segments | [ |
| Classification of AS based on cardio-mechanical signals from noninvasive wearable inertial sensors | Elastic Net (for reducing the features generated by CWT) | 21 AS patients and 13 non-AS subjects | After the reduction of features by 95.47%, the following accuracies were reported: | [ |
| Feasibility and potential clinical role of FFRCT in patients presenting to the emergency department with acute chest pain who underwent CPCT | ML-based software | 56 patients with acute chest pain who underwent CPCT and who had at least a mild (≥25% diameter) coronary artery stenosis | Feasibility—68% | [ |
| SCA detection on ECG signal | CNN (for feature extraction) | 57 records from Creighton University Ventricular Tachyarrhythmia Database and MIT-BIH Malignant Ventricular Arrhythmia Database, where each record corresponds to an individual patient | Validated accuracy—99.26% | [ |
| Clinical measurement of RV and LV volume and function across cardiac MR images obtained for various clinical indications and pathologies | DL algorithm | First 200 noncongenitally clinical cardiac MRI examinations from June 2015 to June 2017 for which volumetry was available | Correlation between automated measurements and manual measurements | [ |
| Classification of various arrythmias from single-lead ECGs | DNN | 91,232 single-lead ECGs from 53,549 patients who used a single-lead ambulatory ECG monitoring device | Specificity | Sensitivity for: | [ |
1D—one-dimensional; 2D—two-dimensional; AVB—atrioventricular block; CWT—continuous wavelet transform; CNN—convolutional neural network; DL—deep learning; DNN—deep neural network; EAR—ectopic atrial rhythm; IVR—idioventricular rhythm; LSTM—long short-term memory; LV—left ventricle; OHCA—out-of-hospital cardiac arrest; RV—right ventricle; SVT—supraventricular tachycardia.
Summary of the recent AI studies in neurology.
| Task/Objective | AI Tool(s) | Data/Validation | Performance | Ref. |
|---|---|---|---|---|
| Prediction of ischemic stroke recurrence | Several ML algorithms | Geisinger EHR of 2091 ischemic stroke patients | Recurrence within 1-year prediction using random forest with up-sampling the training dataset: | [ |
| Automated lesion detection | ANN classifier | 61 patients with pharmacoresistant epilepsy and histologically proven FCD type II from three different epilepsy centers | Sensitivity—73.7% | [ |
| Early prediction of epileptic seizures | DCNN | Long-term scalp EEG data for 22 pediatric subjects with intractable seizures from Children’s Hospital Boston | Accuracy—99.66% | [ |
| Prediction of Alzheimer disease status | DL framework linking an FCN to a traditional MLP | Four distinct datasets: | ADNI test: | [ |
| Diagnosis of Parkinson disease | CNN | 45 patients with Parkinson’s disease, 20 patients with atypical parkinsonian syndromes, 35 healthy controls from the general outpatient clinic and movement disorder services at the Department of Neurology, National Institute of Mental Health and Neurosciences, Bangalore, India | Parkinson’s disease vs. healthy controls: | [ |
| Assessment of collateral flow of patients with AIS | CNN | 200 patients with AIS who presented at the comprehensive stroke center with stroke-like symptoms between March 2019 and January 2020 | Dichotomized classification: | [ |
| Detection of intracranial IED | Template-matching algorithm | 1000 intracranial EEG epochs extracted randomly from 307 subjects with refractory epilepsy enrolled in the Defense Advanced Research Projects Agency (DARPA) Restoring Active Memory (RAM) collaborative agreement | Accuracy for classifying an IED—91% | [ |
| Prediction of the safe clipping time of temporary artery occlusion (TAO) during intracranial aneurysm surgery | ANN | 125 patients: 105 patients from a retrospective cohort for training the model and 20 patients from a prospective cohort for validating the model | Accuracy—88% | [ |
ADNI—Alzheimer’s Disease Neuroimaging Initiative; AIBL—Australian Imaging, Biomarker and Lifestyle Flagship Study of Ageing; ANN—artificial neural network; CNN—convolutional neural network; DCNN—deep convolutional neural network; FCN—fully convolutional network; FHS—Framingham Heart Study; IED—interictal epileptiform discharges; ML—machine learning; NACC—National Alzheimer’s Coordinating Center.
Summary of the recent AI studies in oncology.
| Task/Objective | AI Tool(s) | Data/Validation | Performance | Ref. |
|---|---|---|---|---|
| Prediction of liver metastasis presence when still undetectable using the standard protocols | FM | CT scan data of 30 patients collected between January 2013 and June 2021 at the Pineta Grande Hospital Castel Volturno, Caserta, Italy | Precision rate—100% | [ |
| Recognition of colorectal cancer tumor sprouting | Faster RCNN | Retrospectively collected 100 surgical pathological sections of colorectal cancer from January 2019 to October 2019; | Precision rate—85.5% | [ |
| Detection, grading, and evaluation of clinically relevant findings in digitized slides of prostate core needle biopsies | Multilayered CNN | 1,357,480 image patches from 549 H&E-stained slides for training; 2501 H&E-stained slides for internal test; external dataset of 100 consecutive cases (1627 H&E-stained slides) | Correlation between cancer percentages calculated by the algorithm and pathologists: | [ |
| Automated voice signals analysis for differentiating subjects with laryngeal cancer from healthy individuals | Several ML algorithms | Preoperative medical records from a single university center from July 2015 to June 2019 of patients who underwent voice assessments at the time of laryngeal cancer diagnosis; normal voice samples acquired from otherwise healthy subjects who underwent voice assessments prior to general anesthesia for surgical procedures involving sites other than the head and neck region | Accuracy | Sensitivity | Specificity of: | [ |
| Prediction of radiation doses to subsites of the mandible before planning of radiation therapy for oropharyngeal cancer | ML-based clinical decision support | 86 previously delivered RT treatment plans (for the training set) and 20 patients whose cases were chronologically subsequent to the training dataset (for the test dataset) | Positive predictive value—95% | [ |
1D—one-dimensional; 2D—two-dimensional; ANN—artificial neural network; ASAP—atypical small acinar proliferation; CNN—convolutional neural network; FM—Formal Methods; MFCCs—Mel-frequency cepstral coefficients; ML—machine learning; RCNN—region convolutional neural network; STFT—short-time Fourier transform; SVM—support vector machine.
Summary of the recent AI studies in hematology.
| Task/Objective | AI Tool(s) | Data/Validation | Performance | Ref. |
|---|---|---|---|---|
| Automation and enhancement of PCs delineation | CNN | Manually annotated 25, 28, and 21 regions of interest encompassing small round PCs and confluent/expanded PCs of 10 CLL, 12 aCLL, and 8 RT digitized H&E-stained slides, respectively | Accuracy using data from: | [ |
| Prediction of overall survival and best treatment for acute myeloid leukemia | Several ML algorithms | 3687 consecutive adult AML patients included in the DATAML registry between 2000 and 2019 (3030 receiving IC, 657 receiving AZA) | Overall survival prediction accuracy for: | [ |
| Prediction of diagnosis of acute leukemia using blood cell images | ALNet (a DL model) | A set of 731 blood smears containing 16,450 single-cell images from 100 healthy controls, 191 patients with viral infections and 148 with acute leukemia | Overall accuracy—94.2% | [ |
| Automatic detection of β-thalassemia carriers | CRISP-DM | Blood parameters of apparently healthy 45,498 individuals who were referred to the Thalassemia and Hemophilia center, Palestine Avenir Foundation in from 2012 to 2016 to be screened for the premarital tests; | Sensitivity—98.81% | [ |
| Differential screening of hereditary anemias from a fraction of blood drop | Hierarchical ML decider | 8 patients with clinical and molecular diagnosis of CDA type I, CDA type II, HS, DHS1, IRIDA, and α-thalassemia and 7 healthy donors; | Overall accuracy of cubic SVM for: | [ |
AML—Acute Myeloid Leukemia; AZA—azacitidine; CDA—congenital dyserythropoietic anemia; CNN—convolutional neural network; CRISP-DM—cross-industry standard process for data mining; DHS—dehydrated hereditary stomatocytosis; HS—hereditary spherocytosis; IC—intensive chemotherapy; IRIDA—iron-refractory iron-deficiency anemia; SVM—support vector machine.
Summary of the recent AI studies in nephrology.
| Task/Objective | AI Tool(s) | Data/Validation | Performance | Ref. |
|---|---|---|---|---|
| Prediction of future acute kidney injury | DL model | Dataset consisting of all eligible patients during a five-year period across the entire Veterans Affairs healthcare system in the USA | Prediction with a lead time of up to 48 h and a ratio of 2 false alerts for every true alert: | [ |
| Early detection and prediction of acute kidney injury | XGBoost | Patients whose hospital stays lasted between 5 and 1000 h and who had at least one documented measurement of heart rate, respiratory rate, temperature, serum creatinine (SCr), and Glasgow Coma Scale (GCS) (48,582 patients from BIDMC and 19,737 patients from Stanford Medical Center) | Accuracy | Sensitivity | Specificity of prediction for stage 2 or stage 3 acute kidney injury in the BIDMC dataset: | [ |
| Postoperative acute kidney injury prediction | IDEA (ML algorithm) | Retrospective single-center cohort of 2911 adults who underwent surgery at the University of Florida Health between 2000 and 2010 | Preoperative model: | [ |
| Mortality prediction for acute kidney injury patients in the intensive care unit | Several ML algorithms | Medical information mart for intensive care (MIMIC) III database from 19,044 patients with acute kidney injury among which 2586 died | With the prediction sensitivity fixed at 85%, the following accuracies were reported: | [ |
| Prediction of diabetic kidney disease progression | CAE | EHR of 64,059 type II diabetes patients | Accuracy—71% | [ |
| Automatic determination of the eGFR and chronic kidney disease status | ResNet | 4505 kidney ultrasound images labeled using eGFRs derived from serum creatinine concentrations | Accuracy—85.6% | [ |
| GFR estimation | ANN | 1959 chronic kidney disease patients (development dataset: 1075 participants from January 2012 to December 2014; validation dataset: 877 participants from January 2015 to June 2016) | Accuracy—75.8% | [ |
| Early detection of acute renal transplant rejection | CNN | Diffusion-weighted MRI dataset of 56 individuals (with associated clinical biomarkers), who had renal transplantation | Accuracy—92.9% | [ |
| Multiclass segmentation of kidney tissue in sections stained by PAS | CNN | Blouin-fixed, paraffin-embedded needle-core biopsies from 101 patients who underwent a kidney transplantation between 2008 and 2012 in the Radboud University Medical Center, Nijmegen, The Netherlands (Radboudumc); 132 PAS-stained slides from Radboudumc pathology archives | Correlation between glomerular counting performed by pathologists vs. AI—0.94 | [ |
ANN—artificial neural network; BIDMC—Beth Israel Deaconess Medical Center; CAE—convolutional autoencoder; CNN—convolutional neural network; DL—deep learning; IDEA—Intraoperative Data Embedded Analytics; ML—machine learning; SVM—support vector machine; XGBoost—extreme gradient boosting.
Summary of the recent AI studies in gastroenterology and hepatology.
| Task/Objective | AI Tool(s) | Data/Validation | Performance | Ref. |
|---|---|---|---|---|
| Prediction of long-term health-related quality of life and comorbidity after bariatric surgery | DBN | 6542 patients registered in the Scandinavian Obesity Surgery Registry between 2008 and 2012 operated on with primary Roux-en-Y gastric bypass | Accuracy | Sensitivity | Specificity of DBN for predicting 5-year comorbidities: | [ |
| Assessment of bowel preparation | ENDOANGEL | 5583 clear and unambiguous colonoscopy images retrospectively collected from over 2000 patients (for training dataset) | Accuracy: | [ |
| Automatic assessment and classification of the EmA test for celiac disease | SVM | 2597 high-quality IgA class EmA images collected in 2017–2018 in the celiac disease service laboratory at the Tampere University, Tampere, Finland | Accuracy—96.80% | [ |
| Identification of immunogenic epitopes of the tTG-DGP complex for use in detection and monitoring patients with celiac disease | SVM | Serum samples from 90 patients with biopsy-proven celiac disease and 79 healthy individuals for the training dataset and 82 patients with newly diagnosed CeD and 217 controls for the validation dataset | Identification of patients with celiac disease: | [ |
| Detection of pathologic morphological features in diseased vs. healthy duodenal tissue | CNN | 3118 segmented images from 121 H&E-stained duodenal biopsy glass slides from 102 patients collected between November 2017 and February 2018 | Accuracy—93.4% | [ |
| Prediction of liver disease | Several ML algorithms | 615 patients (blood donors and non-blood donors with Hepatitis C) data collected from the University of California Irvine Machine Learning Repository | ANN | [ |
| Quantification of steatosis, inflammation, ballooning, and fibrosis in biopsies from patients with NAFLD | ML algorithm | Data from 246 consecutive patients with biopsy-proven NAFLD and followed up in London from January 2010 to December 2016; biopsy specimens from the first 100 patients were used for training, while the other 146 were used for validation | Correlation between manual annotation and software results: | [ |
| Detection and quantification of hepatic fibrosis and assessment of its architectural pattern in NAFDL biopsies | Supervised ML models | A set of digital images of trichrome stained slides of 18 unique liver biopsies | Precision of fibrosis patterns: | [ |
| Automatic objective quantification of macrovesicular steatosis in histopathological liver section slides stained with Sudan | Several ML and DL algorithms | Eight micrometer-thick sections obtained from 20 donor liver samples | Accuracy | Sensitivity | Specificity of | [ |
ANN—artificial neural network; CNN—convolutional neural network; DBN—discrete Bayesian network; DCNN—deep convolutional neural network; KNN—K-nearest neighbors; ML—machine learning; MLR—multivariable logistic regression; NB—Naïve Bayes; NN—neural network; SVM—support vector machine.
Summaries of the recent AI studies in orthopedics and rheumatology.
| Task/Objective | AI Tool(s) | Data/Validation | Performance | Ref. |
|---|---|---|---|---|
| Automatic knee meniscus tear detection and orientation classification | RCNN | A total of 1128 images, with an imbalanced number of horizontal posterior tears, vertical posterior tears, horizontal anterior tears, and vertical anterior tears | Accuracy—83% | [ |
| Assessment of the risk of hip dislocation based on postoperative anteroposterior pelvis radiographs | YOLO-V3 | Retrospective radiographs of 13,970 primary THAs with 374 dislocations after 5 years of follow-up, accounting for 1490 radiographs from dislocated and 91,094 from non-dislocated THAs | Accuracy—49.55% | [ |
| Prediction of PsA among psoriasis patients | Several ML algorithms | Data from six cohorts with more than 7000 genotyped PsA and PsC patients | For the top 5% of patients predicted as having PsA: | [ |
| Differential diagnosis of rheumatoid arthritis and osteoarthritis | Several ML algorithms | Affymetrix and Illumina microarrays on gene expression in rheumatoid arthritis and osteoarthritis healthy control synovial tissues curated from Gene Expression Omnibus | Rheumatoid arthritis: | [ |
CNN—convolutional neural network; ML—machine learning; RCNN—region convolutional neural network; PsA—psoriatic arthritis; PsC—cutaneous-only psoriasis; THA—total hip arthroplasty.
Figure 3ML in drug discovery. Reproduced from [14], Elsevier B.V. 2021.
Figure 4Applications of AI in drug discovery. Adapted from [14].
Figure 5The main challenges in AI clinical integration.