| Literature DB >> 35616768 |
Valentina Bellini1, Marina Valente2, Melania Turetti1, Paolo Del Rio2, Francesco Saturno1, Massimo Maffezzoni1, Elena Bignami1.
Abstract
The application of artificial intelligence technologies is growing in several fields of healthcare settings. The aim of this article is to review the current applications of artificial intelligence in bariatric surgery. We performed a review of the literature on Scopus, PubMed and Cochrane databases, screening all relevant studies published until September 2021, and finally including 36 articles. The use of machine learning algorithms in bariatric surgery is explored in all steps of the clinical pathway, from presurgical risk-assessment and intraoperative management to complications and outcomes prediction. The models showed remarkable results helping physicians in the decision-making process, thus improving the quality of care, and contributing to precision medicine. Several legal and ethical hurdles should be overcome before these methods can be used in common practice.Entities:
Keywords: Artificial intelligence; Bariatric surgery; Machine learning; Perioperative medicine; Postoperative complications
Mesh:
Year: 2022 PMID: 35616768 PMCID: PMC9273529 DOI: 10.1007/s11695-022-06100-1
Source DB: PubMed Journal: Obes Surg ISSN: 0960-8923 Impact factor: 3.479
Fig. 1Article selection flow diagram
Fig. 2Temporal distribution of the articles included in our analysis according to the year of publication
Fig. 3Pie chart describing the proportion of the studied involved in the review related to the specific phase of the perioperative pathway
Fig. 4Role of artificial intelligence (AI) in bariatric surgery. AI can be used in every aspect of the perioperative path, from the presurgical assessment to the intraoperative phase, up to the postoperative management
Overview of papers about preoperative assessment included in our analysis
| Author, years | Study design | Objective | Final cohort | Outcomes | Type of ML | Prediction performance |
|---|---|---|---|---|---|---|
| Zhou CM 2021 | Retrospective single center | Prediction of difficult tracheal intubation in obese patients using six approaches from various ML fields | 1015 | Prediction of difficult tracheal intubation | LR, TR, RF, Gbdt, Xgbc, Gbm | Training vs testing group: LR AUC 0,68–0,70; TR AUC 0,71–0,60; RF AUC 0,92–0,58; Gbdt AUC 0,78–0,71; Xgbc AUC 0,73–0,71; Gbm AUC 0,81–0,66 |
| Mencar C 2020 | Observational multicentric | Efficacy and clinical applicability of different ML methods based on demographic information and questionnaire data to predict OSA severity | 313 | Prediction of obstructive sleep apnea syndrome severity | MV, NB, k-NN, Ct, RF, SVM AdaBoost-SVM, CN2 rule induction, ML, LR, k-NN, RT, SVR, AdaBoost-SVR | SVM AUC 0,65–0,61 RF AUC 0,63 |
| Pépin JL 2020 | Prospective observational single center | Evaluation of mandibular movement monitoring during sleep coupled with an automated analysis by ML for OSA diagnosis | 376 | OSA diagnosis | Sr RDI | Sr-RDI ≥ 5 events/h AUC 0,95; PSG-RDI ≥ 15 events/h AUC 0,93 |
| Keshavarz Z 2020 | Retrospective, single center | Development of a model for predicting OSA to select the best model to determine and screen high-risk OSA patients | 231 | OSA diagnosis | NN, NB, LR, KNN, SVM, RF | NN AUC 0.75; NB AUC 0,76; LR AUC 0,76; KNN AUC 0.65; SVM AUC 0.72; RF AUC 0.75 |
| Gao WD 2019 | Retrospective | Detection of OSA extracting the features of the heartbeat interval signal and the respiratory signal | N/A | OSA diagnosis | Model fusion (LR-SVM) | Sensitivity 74%, specificity 75%, accuracy 75% |
| Tiron R 2020 | Prospective, single center | Determining of sleep and breathing patterns, and then analyzing results to track sleep-related health risks associated with sleep apnea | 248 | Performance of the Firefly technology as a screener for a clinical threshold of apnea hypopnea index ≥ 15 | Firefly technology | ROC AUC (training 0.95, test 0.92); PR AUC (training 0.87, test 0.89) |
| Cheng Q 2017 | Prospective, single center | Predicting pulmonary function by improved classification models with sole inputs being motion sensors from carried phones | 35 | To categorize patients into the correct GOLD stage | SVM | Accuracy 99% |
| Viswanath V 2018 | Prospective, multicenter | Performing a spirometry test using only the audio data from the microphone of a standard smartphone providing automatic feedback | 20505 | Pulmonary function | NB, k-NN, Log Reg (L1) Log Reg (L2), RF, Gradient Boosting VGG CNN Gated-CRNN | Mel spectogram Naive Bayes precision 0.80; Mel spectogram k-NN precision 0.94; Mel spectogram Log Reg (L1) precision 0.94; Mel spectogram Log Reg (L2) precision 0.93; Mel spectogram RF precision 0.96; Mel spectogram Gradient Boosting precision 0.96; Mel spectogram VGG CNN precision 0.97; Mel spectogram Gated-CRNN precision 0.98 |
| Assaf D 2021 | Retrospective, single center | To improve preoperative diagnosis of hiatal hernia in patients candidates for BS | 2482 | Diagnosis of hiatal hernia | ML decision tree model | Achieving 38.5% sensitivity and 92.9% specificity, ML models increased sensitivity up to 60.2% compared to swallow study prediction |
LR logistic regression, TR decision tree, RF random forest, Gbdt gradient boosting decision tree, Xgbc extreme gradient boosting, Gbm light GBM, MV majority vote, NB Naive Bayes, k-NN k-nearest neighbor, Ct classification tree, SVM support vector machine, AdaBoost-SVM adaptive boosting SVM, ML machine learning, RT regression tree, SVR support vector regression, AdaBoost-SVR adaptive boosting SVR, Sr RDI sunrise system-derived respiratory disturbance index, OSA obstructive sleep apnea, GOLD Global Initiative for Chronic Obstructive Lung Disease, NN neural network, NB Naïve Bayes, ROC receiver operating characteristics, PR precision recall, ANNs artificial neural networks, LDA linear discriminant analysis, QDA quadratic discriminant analysis, MLP multilayer perceptron, AdaBoost LR adaptive boosting LR, CNN convolutional neural network, RNN recurrent neural network, XGBs gradient boosting machines, OSA obstructive sleep apnea, BS bariatric surgery
Overview of papers about intraoperative phase included in our analysis
| Author, years | Study design | Objective | Final cohort | Outcomes | Type of ML | Prediction performance |
|---|---|---|---|---|---|---|
| Ingrande J 2020 | Prospective, single center | Modeling inductionphase kinetics using a high-resolution pharmacokinetic dataset | 30 | Drug concentrations | 4-compartment model, recirculatory model, gated recurrent unit neural network | Direct comparison of observed versus predicted concentrations |
| Twinanda AP 2019 | Retrospective, single center | Intraoperative accurate surgery duration estimation by using only visual information from laparoscopic videos | 290 | Remaining surgery duration estimation | Deep Learning- a convolutional neural network and a long-short term memory network | The proposed network significantly outperforms a traditional method of estimating surgery duration without utilizing manual annotation |
| Hashimoto DA 2019 | Retrospective, single center | To identify operative steps in laparoscopic sleeve gastrectomy | 88 | Automatic extraction of quantitative surgical data from operative video of laparoscopic sleeve gastrectomy | Deep Learning | SleeveNet demonstrated a mean classification accuracy of 82% ± 4% with a minimum classification accuracy of 73% and a maximum classification accuracy of 85.6% |
Overview of papers about postoperative management and complications included in our analysis
| Author, years | Study design | Objective | Final cohort | Outcomes | Type of ML | Prediction performance |
|---|---|---|---|---|---|---|
| Sheikhtaheri A 2019 | Retrospective, multicenter | Predicting the early complications of one-anastomosis gastric bypass | 1509 | Complications incidence | ANNs | Accuracy, specificity, sensitivity: 10-day prediction system 98.4%, 98.6%, 98.3%;1-month system 96%, 93%, and 98.4%; 3-month system 89.3%, 86.6%, 91.5% |
| Cao Y 2019 | Retrospective, multicenter | Predicting the risk for severe complication after BS | 37811 | Complications incidence | LR, LDA, QDA, TR, KNN, SVM, MLP, NN, AdaBoost LR, bagging LDA, bagging QDA, RF, extremely randomized trees, AdaBoost Extra trees, gradient RT, AdaBoost Gradient trees, bagging KNN, AdaBoost SVM, bagging MLP | Best gradient RT and bagging MLP AUC 0.58 |
| Cao Y 2020 | Retrospective, multicenter | Exploring whether serious postoperative complications of bariatric surgery recorded in a national quality registry can be predicted preoperatively using deep learning methods | 44061 | Complications incidence | MLP, CNN, RNN | AUC ≤ 0.6 |
| Nudel J 2021 | Retrospective, multicenter | Predicting leak and VTE after BS | 436807 | Leak and VTE incidence | ANN, XGBs | ANN AUC 0.75; XGBs AUC 0.70 |
| Wise ES 2020 | Retrospective, multicenter | To optimize the prediction of the composite endpoint of 30-day readmission, reoperation, reintervention, or mortality, after laparoscopic sleeve gastrectomy | 101721 | 30-day morbidity and mortality prediction after bariatric surgery | LR and ANN | ANN AUROC = 0.581 compared to LR AUROC = 0.572 in the training set |
| Razzaghi T 2019 | Retrospective, multicenter | To identify risks/outcomes associated with BS | 11636 | Risk-prediction | NB, Radial Basis Function Neural Network, k-NN, SVM, and LR | The combination of a suitable feature selection method with ensemble learning methods equipped with Oversampling (SMOTE) method can achieve higher performance metrics |
| Cruz MR 2014 | Retrospective, single center | To validate a computerized intelligent decision support system that suggests nutritional diagnoses of patients submitted to BS | 60 | Nutritional monitoring of patients undergoing BS | Bayesian network | The system sensibility and specificity were 95.0% |
| Liew PL 2007 | Retrospective, single center | To compare the predictive accuracy of LR and ANN with respect to the clinicopathologic features of gallbladder disease among obese patients | 117 | Prediction of gallbladder disease | LR and ANN | The average correct classification rate of ANNs was higher than that of the traditional logistic regression approach (97.14% versus 88.2%). Besides, ANNs also had a lower Type II error when compared with logistic regression |
LR logistic regression, TR decision tree, RF random forest, Gbdt gradient boosting decision tree, Xgbc extreme gradient boosting, Gbm light GBM, MV majority vote, NB Naive Bayes, k-NN k-nearest neighbor, Ct classification tree, SVM support vector machine, AdaBoost-SVM adaptive boosting SVM, ML machine learning, RT regression tree, SVR support vector regression, AdaBoost-SVR adaptive boosting SVR, Sr RDI sunrise system-derived respiratory disturbance index, OSA obstructive sleep apnea, GOLD Global Initiative for Chronic Obstructive Lung Disease, NN neural network, NB Naïve Bayes, ROC receiver operating characteristics, PR precision recall, ANNs artificial neural networks, LDA linear discriminant analysis, QDA quadratic discriminant analysis, MLP multilayer perceptron, AdaBoost LR adaptive boosting LR, CNN convolutional nearal network, RNN recurrent neural network, XGBs gradient boosting machines, BS bariatric surgery, VTE venous thromboembolism
Overview of papers about clinical outcomes included in our analysis
| Author, years | Study design | Objective | Final cohort | Outcomes | Type of ML | Prediction performance |
|---|---|---|---|---|---|---|
| Zhang W 2021 | Prospective, single center | To predict optimal weight loss 6 months after BS | 37 | Classification of patients with optimal and suboptimal weight loss at 6 months post BS | Siamese-kNN, LR, SVM | The Siamese-KNN achieved an accuracy of 83.78% and AUC of 0.84 |
| Modaresnezhad M 2019 | Retrospective, multicenter | To enable a large reduction in dimensionality of the data and to allow for fast and efficient application of data mining techniques to large clinical datasets | 120,000 | Prediction of BS outcomes | TR, regression, and NN | The rule-based semantic approach for reducing data dimensionality was highly effective in reducing the volume of the data and the time needed to run the analysis. The reduced model performs as well as the full model |
| Celik S 2020 | Retrospective, single center | To verify the dependence of weight loss on sleeve coefficients and to forecast the weight loss | 63 | Prediction of weight loss after laparoscopic sleeve gastrectomy | SVM, neural network Bayesian regularization | Levenberg–Marquardt and Bayesian regularization are the most suitable algorithms. Error intervals were smaller for Bayesian regularization algorithm and are broader for Levenberg-Marquardt algorithm |
| Wise ES 2016 | Retrospective, single center | To devise a web-based tool to predict excess BMI loss after laparoscopic RYGB by identification of independent preoperative predictors | 647 | Prediction of excess weight loss after laparoscopic RYGB | ANN | AUC of ANN for the training set and validation set were 0.78 ± 0.03 and 0.83 ± 0.04, respectively |
| Piaggi P 2010 | Prospective, single center | To build a statistical model based on psychological and physical data to predict weight loss in patients treated by LAGB | 172 | Weight loss prediction in obese candidates to LAGB | ANN | Nonlinear model resulted to be better at data fitting (36% vs. 10% variance explained, respectively) and provided more reliable parameters for accuracy and mis-classification rates (70% and 30% vs. 66% and 34%, respectively) |
| Lee YC 2007 | Prospective, single center | To evaluate weight reduction after BS using information available during the initial preoperative assessment | 249 | Prediction of weight reduction | LR and ANN | The overall predictive accuracy of ANN is higher than logistic regression in the prediction of successful weight reduction |
| Dimeglio C 2020 | Retrospective, single center | To analyze the postoperative weight trajectories and to identify “curve families” for early prediction of weight regain | 795 | Prediction of weight evolution | Hierarchical cluster analysis | Classification with reference trajectories produced an overall rate of correct classification of more than 93% |
| van Loon SLM 2020 | Retrospective, single center | To objectively quantify the metabolic health status of patients after BS | 1595 | The Metabolic Health Index can quantify the improvement in the metabolic health status of treated bariatric patients | LR | The index reflects severity of comorbidity, enabling objective assessment of a bariatric patient’s metabolic health state, regardless day of sampling and surgery type |
| Johnston SS 2019 | Retrospective, multicenter | To develop a predictive model of antihyperglycemic medication cessation after metabolic surgery | 16527 | No antihyperglycemic medication treatment from 365 to 730 days after metabolic surgery | LR | The model possessed good internal discriminative accuracy (AUC = 0.778) and transportability (external AUC = 0.759) |
| Lee WJ 2012 | Retrospective, single center | To examine the efficacy of surgically induced weight loss on diabetes remission | 88 | Prediction of diabetes remission | LR and ANN | The average correct classification rate of logistic regression was 85.9%, The average correct classification rate of the ANN model was 90.4% |
| Aminian A 2020 | Retrospective, single center | Constructing and internally validating prediction models to estimate the risk of long-term end-organ complications and mortality in patients with type 2 diabetes and obesity | 2287 | End organ complication detection | Regression, RF | Surgery versus usual care: all-cause mortality (AUC 0.79 and 0.81), coronary artery events (AUC 0.66 and 0.67), heart failure (AUC 0.73 and 0.75), and nephropathy (AUC 0.73 and 0.76) |
| Aron-Wisnewsky J 2017 | Retrospective, single center | To develop an improved scoring system for predicting diabetes remission following RYGB | 352 | Prediction of diabetes remission 1 year post BS | Multivariate logistic regression | Ad-DiaRem displayed improved AUROC and predictive accuracy compared with DiaRem (0.911 vs 0.856 and 0.841 vs 0.789, respectively; |
| Debédat J 2018 | Retrospective, single center | To develop an improved scoring system for predicting long-term diabetes remission following RYGB | 175 | Prediction of long-term diabetes remission | Fully corrective binning | The score was accurate AUROC = 90%; accuracy = 85% at predicting 5-years diabetes remission |
| Pedersen HK 2016 | Retrospective, multicenter | To stratify individuals based on clinical and genomic factors that determine their diabetic response to surgery, and to identify factors that have an important role in this response | 457 | Discrimination between patients with and without surgery-induced diabetes remission | ANN | Accuracy = 74%, AUC = 0.81 |
| Cao Y 2019 | Retrospective, multicenter | To predict 5-year health-related quality of life after bariatric surgery based on the available preoperative information | 6687 | Long-term quality of life prediction in patients after BS | CNN | The CNN model showed an overwhelming advantage in predicting all the health-related quality of life measures |
| Cao Y 2020 | Retrospective, multicenter | To find better methods for predicting prognosis and provide evidence for patient management after BS | 6542 | Long-term outcome prediction in patients after BS | BN, CNN, Multivariate LR | BN showed excellent predictive ability for 5-year type 2 diabetes and dyslipidemia (AUC = 0.942 and 0.917, respectively), good ability for 5-year hypertension and sleep apnea syndrome (AUC = 0.891 and 0.834, respectively), and fair ability for 5-year depression (AUC = 0.750) |
LR logistic regression, TR decision tree, RF random forest, Gbdt gradient boosting decision tree, Xgbc extreme gradient boosting, Gbm light GBM, MV majority vote, NB Naive Bayes, k-NN k-nearest neighbor, Ct classification tree, SVM support vector machine, AdaBoost-SVM adaptive boosting SVM, ML machine learning, RT regression tree, SVR support vector regression, AdaBoost-SVR adaptive boosting SVR, Sr RDI sunrise system-derived respiratory disturbance index, OSA obstructive sleep apnea, GOLD Global Initiative for Chronic Obstructive Lung Disease, NN neural network, NB Naïve Bayes, ROC receiver operating characteristics, PR precision recall, ANNs artificial neural networks, LDA linear discriminant analysis, QDA quadratic discriminant analysis, MLP multilayer perceptron, AdaBoost LR adaptive boosting LR, CNN convolutional neural network, RNN recurrent neural network, XGBs gradient boosting machines, BS bariatric surgery, BMI body mass index, RYGB Roux-en-Y gastric bypass, LAGB = laparoscopic adjustable gastric banding