Emma L Barber1, Ravi Garg2, Christianne Persenaire3, Melissa Simon4. 1. Northwestern University Feinberg School of Medicine, Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Chicago, IL, USA; Robert H Lurie Comprehensive Cancer Center, Northwestern University, Chicago, IL, United States of America; Center for Health Equity Transformation, Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America; Institute of Public Health and Medicine Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America. Electronic address: emma.barber@northwestern.edu. 2. Institute of Public Health and Medicine Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America. 3. Northwestern University Feinberg School of Medicine, Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Chicago, IL, USA. 4. Northwestern University Feinberg School of Medicine, Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Chicago, IL, USA; Robert H Lurie Comprehensive Cancer Center, Northwestern University, Chicago, IL, United States of America; Center for Health Equity Transformation, Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America; Institute of Public Health and Medicine Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America.
Abstract
OBJECTIVE: To determine if natural language processing (NLP) with machine learning of unstructured full text documents (a preoperative CT scan) improves the ability to predict postoperative complication and hospital readmission among women with ovarian cancer undergoing surgery when compared with discrete data predictors alone. METHODS: Medical records from two institutions were queried to identify women with ovarian cancer and available preoperative CT scan reports who underwent debulking surgery. Machine learning methods using both discrete data predictors (age, comorbidities, preoperative laboratory values) and natural language processing of full text reports (preoperative CT scans) were used to predict postoperative complication and hospital readmission within 30 days of surgery. Discrimination was measured using the area under the receiver operating characteristic curve (AUC). RESULTS: We identified 291 women who underwent debulking surgery for ovarian cancer. Mean age was 59, mean preoperative CA125 value was 610 U/ml and albumin was 3.9 g/dl. There were 25 patients (8.6%) who were readmitted and 45 patients (15.5%) who developed postoperative complications within 30 days. Using discrete features alone, we were able to predict postoperative readmission with an AUC of 0.56 (0.54-0.58, 95% CI); this improved to 0.70 (0.68-0.73, 95% CI) (p < 0.001) with the addition of NLP of preoperative CT scans. CONCLUSIONS: Natural language processing with machine learning improved the ability to predict postoperative complication and hospital readmission among women with ovarian cancer undergoing surgery.
OBJECTIVE: To determine if natural language processing (NLP) with machine learning of unstructured full text documents (a preoperative CT scan) improves the ability to predict postoperative complication and hospital readmission among women with ovarian cancer undergoing surgery when compared with discrete data predictors alone. METHODS: Medical records from two institutions were queried to identify women with ovarian cancer and available preoperative CT scan reports who underwent debulking surgery. Machine learning methods using both discrete data predictors (age, comorbidities, preoperative laboratory values) and natural language processing of full text reports (preoperative CT scans) were used to predict postoperative complication and hospital readmission within 30 days of surgery. Discrimination was measured using the area under the receiver operating characteristic curve (AUC). RESULTS: We identified 291 women who underwent debulking surgery for ovarian cancer. Mean age was 59, mean preoperative CA125 value was 610 U/ml and albumin was 3.9 g/dl. There were 25 patients (8.6%) who were readmitted and 45 patients (15.5%) who developed postoperative complications within 30 days. Using discrete features alone, we were able to predict postoperative readmission with an AUC of 0.56 (0.54-0.58, 95% CI); this improved to 0.70 (0.68-0.73, 95% CI) (p < 0.001) with the addition of NLP of preoperative CT scans. CONCLUSIONS: Natural language processing with machine learning improved the ability to predict postoperative complication and hospital readmission among women with ovarian cancer undergoing surgery.
Authors: Lloyd A Courtenay; Diego González-Aguilera; Susana Lagüela; Susana Del Pozo; Camilo Ruiz; Inés Barbero-García; Concepción Román-Curto; Javier Cañueto; Carlos Santos-Durán; María Esther Cardeñoso-Álvarez; Mónica Roncero-Riesco; David Hernández-López; Diego Guerrero-Sevilla; Pablo Rodríguez-Gonzalvez Journal: J Clin Med Date: 2022-04-21 Impact factor: 4.964