Yash R Somnay1, Mark Craven2, Kelly L McCoy3, Sally E Carty3, Tracy S Wang4, Caprice C Greenberg5, David F Schneider6. 1. Section of Endocrine Surgery, Department of Surgery, University of Wisconsin, Madison, WI. 2. Department of Biostatistics and Medical Informatics, and the Department Computer Science, University of Wisconsin, Madison, WI. 3. Division of Endocrine Surgery, Department of Surgery, University of Pittsburgh, Pittsburgh, PA. 4. Division of Surgical Oncology, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI. 5. Wisconsin Surgical Outcomes Research Program, Department of Surgery, University of Wisconsin, Madison, WI. 6. Section of Endocrine Surgery, Department of Surgery, University of Wisconsin, Madison, WI. Electronic address: schneiderd@surgery.wisc.edu.
Abstract
BACKGROUND: Parathyroidectomy offers the only cure for primary hyperparathyroidism, but today only 50% of primary hyperparathyroidism patients are referred for operation, in large part, because the condition is widely under-recognized. The diagnosis of primary hyperparathyroidism can be especially challenging with mild biochemical indices. Machine learning is a collection of methods in which computers build predictive algorithms based on labeled examples. With the aim of facilitating diagnosis, we tested the ability of machine learning to distinguish primary hyperparathyroidism from normal physiology using clinical and laboratory data. METHODS: This retrospective cohort study used a labeled training set and 10-fold cross-validation to evaluate accuracy of the algorithm. Measures of accuracy included area under the receiver operating characteristic curve, precision (sensitivity), and positive and negative predictive value. Several different algorithms and ensembles of algorithms were tested using the Weka platform. Among 11,830 patients managed operatively at 3 high-volume endocrine surgery programs from March 2001 to August 2013, 6,777 underwent parathyroidectomy for confirmed primary hyperparathyroidism, and 5,053 control patients without primary hyperparathyroidism underwent thyroidectomy. Test-set accuracies for machine learning models were determined using 10-fold cross-validation. Age, sex, and serum levels of preoperative calcium, phosphate, parathyroid hormone, vitamin D, and creatinine were defined as potential predictors of primary hyperparathyroidism. Mild primary hyperparathyroidism was defined as primary hyperparathyroidism with normal preoperative calcium or parathyroid hormone levels. RESULTS: After testing a variety of machine learning algorithms, Bayesian network models proved most accurate, classifying correctly 95.2% of all primary hyperparathyroidism patients (area under receiver operating characteristic = 0.989). Omitting parathyroid hormone from the model did not decrease the accuracy significantly (area under receiver operating characteristic = 0.985). In mild disease cases, however, the Bayesian network model classified correctly 71.1% of patients with normal calcium and 92.1% with normal parathyroid hormone levels preoperatively. Bayesian networking and AdaBoost improved the accuracy of all parathyroid hormone patients to 97.2% cases (area under receiver operating characteristic = 0.994), and 91.9% of primary hyperparathyroidism patients with mild disease. This was significantly improved relative to Bayesian networking alone (P < .0001). CONCLUSION: Machine learning can diagnose accurately primary hyperparathyroidism without human input even in mild disease. Incorporation of this tool into electronic medical record systems may aid in recognition of this under-diagnosed disorder.
BACKGROUND: Parathyroidectomy offers the only cure for primary hyperparathyroidism, but today only 50% of primary hyperparathyroidismpatients are referred for operation, in large part, because the condition is widely under-recognized. The diagnosis of primary hyperparathyroidism can be especially challenging with mild biochemical indices. Machine learning is a collection of methods in which computers build predictive algorithms based on labeled examples. With the aim of facilitating diagnosis, we tested the ability of machine learning to distinguish primary hyperparathyroidism from normal physiology using clinical and laboratory data. METHODS: This retrospective cohort study used a labeled training set and 10-fold cross-validation to evaluate accuracy of the algorithm. Measures of accuracy included area under the receiver operating characteristic curve, precision (sensitivity), and positive and negative predictive value. Several different algorithms and ensembles of algorithms were tested using the Weka platform. Among 11,830 patients managed operatively at 3 high-volume endocrine surgery programs from March 2001 to August 2013, 6,777 underwent parathyroidectomy for confirmed primary hyperparathyroidism, and 5,053 control patients without primary hyperparathyroidism underwent thyroidectomy. Test-set accuracies for machine learning models were determined using 10-fold cross-validation. Age, sex, and serum levels of preoperative calcium, phosphate, parathyroid hormone, vitamin D, and creatinine were defined as potential predictors of primary hyperparathyroidism. Mild primary hyperparathyroidism was defined as primary hyperparathyroidism with normal preoperative calcium or parathyroid hormone levels. RESULTS: After testing a variety of machine learning algorithms, Bayesian network models proved most accurate, classifying correctly 95.2% of all primary hyperparathyroidismpatients (area under receiver operating characteristic = 0.989). Omitting parathyroid hormone from the model did not decrease the accuracy significantly (area under receiver operating characteristic = 0.985). In mild disease cases, however, the Bayesian network model classified correctly 71.1% of patients with normal calcium and 92.1% with normal parathyroid hormone levels preoperatively. Bayesian networking and AdaBoost improved the accuracy of all parathyroid hormonepatients to 97.2% cases (area under receiver operating characteristic = 0.994), and 91.9% of primary hyperparathyroidismpatients with mild disease. This was significantly improved relative to Bayesian networking alone (P < .0001). CONCLUSION: Machine learning can diagnose accurately primary hyperparathyroidism without human input even in mild disease. Incorporation of this tool into electronic medical record systems may aid in recognition of this under-diagnosed disorder.
Authors: Scott M Wilhelm; Tracy S Wang; Daniel T Ruan; James A Lee; Sylvia L Asa; Quan-Yang Duh; Gerard M Doherty; Miguel F Herrera; Janice L Pasieka; Nancy D Perrier; Shonni J Silverberg; Carmen C Solórzano; Cord Sturgeon; Mitchell E Tublin; Robert Udelsman; Sally E Carty Journal: JAMA Surg Date: 2016-10-01 Impact factor: 14.766
Authors: Parthiv J Mahadevia; Julie Ann Sosa; Michael A Levine; Martha A Zeiger; Neil R Powe Journal: Endocr Pract Date: 2003 Nov-Dec Impact factor: 3.443
Authors: Ammar Asban; Alex Dombrowsky; Reema Mallick; Rongbing Xie; James K Kirklin; Raymon H Grogan; David F Schneider; Herbert Chen; Courtney J Balentine Journal: Oncologist Date: 2019-04-24
Authors: Nick A Zaborek; Andy Cheng; Joseph R Imbus; Kristin L Long; Susan C Pitt; Rebecca S Sippel; David F Schneider Journal: Surgery Date: 2018-11-06 Impact factor: 3.982
Authors: Luis Serviá; Neus Montserrat; Mariona Badia; Juan Antonio Llompart-Pou; Jesús Abelardo Barea-Mendoza; Mario Chico-Fernández; Marcelino Sánchez-Casado; José Manuel Jiménez; Dolores María Mayor; Javier Trujillano Journal: BMC Med Res Methodol Date: 2020-10-20 Impact factor: 4.615