OBJECTIVES: Acute respiratory distress syndrome is frequently under recognized and associated with increased mortality. Previously, we developed a model that used machine learning and natural language processing of text from radiology reports to identify acute respiratory distress syndrome. The model showed improved performance in diagnosing acute respiratory distress syndrome when compared to a rule-based method. In this study, our objective was to externally validate the natural language processing model in patients from an independent hospital setting. DESIGN: Secondary analysis of data across five prospective clinical studies. SETTING: An urban, tertiary care, academic hospital. PATIENTS: Adult patients admitted to the medical ICU and at-risk for acute respiratory distress syndrome. INTERVENTIONS: None. MEASUREMENTS AND MAIN RESULTS: The natural language processing model was previously derived and internally validated in burn, trauma, and medical patients at Loyola University Medical Center. Two machine learning models were examined with the following text features from qualifying radiology reports: 1) word representations (n-grams) and 2) standardized clinical named entity mentions mapped from the National Library of Medicine Unified Medical Language System. The models were externally validated in a cohort of 235 patients at the University of Chicago Medicine, among which 110 (47%) were diagnosed with acute respiratory distress syndrome by expert annotation. During external validation, the n-gram model demonstrated good discrimination between acute respiratory distress syndrome and nonacute respiratory distress syndrome patients (C-statistic, 0.78; 95% CI, 0.72-0.84). The n-gram model had a higher discrimination for acute respiratory distress syndrome when compared with the standardized named entity model, although not statistically significant (C-statistic 0.78 vs 0.72; p = 0.09). The most important features in the model had good face validity for acute respiratory distress syndrome characteristics but differences in frequencies did occur between hospital settings. CONCLUSIONS: Our computable phenotype for acute respiratory distress syndrome had good discrimination in external validation and may be used by other health systems for case-identification. Discrepancies in feature representation are likely due to differences in characteristics of the patient cohorts.
OBJECTIVES: Acute respiratory distress syndrome is frequently under recognized and associated with increased mortality. Previously, we developed a model that used machine learning and natural language processing of text from radiology reports to identify acute respiratory distress syndrome. The model showed improved performance in diagnosing acute respiratory distress syndrome when compared to a rule-based method. In this study, our objective was to externally validate the natural language processing model in patients from an independent hospital setting. DESIGN: Secondary analysis of data across five prospective clinical studies. SETTING: An urban, tertiary care, academic hospital. PATIENTS: Adult patients admitted to the medical ICU and at-risk for acute respiratory distress syndrome. INTERVENTIONS: None. MEASUREMENTS AND MAIN RESULTS: The natural language processing model was previously derived and internally validated in burn, trauma, and medical patients at Loyola University Medical Center. Two machine learning models were examined with the following text features from qualifying radiology reports: 1) word representations (n-grams) and 2) standardized clinical named entity mentions mapped from the National Library of Medicine Unified Medical Language System. The models were externally validated in a cohort of 235 patients at the University of Chicago Medicine, among which 110 (47%) were diagnosed with acute respiratory distress syndrome by expert annotation. During external validation, the n-gram model demonstrated good discrimination between acute respiratory distress syndrome and nonacute respiratory distress syndrome patients (C-statistic, 0.78; 95% CI, 0.72-0.84). The n-gram model had a higher discrimination for acute respiratory distress syndrome when compared with the standardized named entity model, although not statistically significant (C-statistic 0.78 vs 0.72; p = 0.09). The most important features in the model had good face validity for acute respiratory distress syndrome characteristics but differences in frequencies did occur between hospital settings. CONCLUSIONS: Our computable phenotype for acute respiratory distress syndrome had good discrimination in external validation and may be used by other health systems for case-identification. Discrepancies in feature representation are likely due to differences in characteristics of the patient cohorts.
Authors: Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute Journal: J Am Med Inform Assoc Date: 2010 Sep-Oct Impact factor: 4.497
Authors: Giacomo Bellani; John G Laffey; Tài Pham; Eddy Fan; Laurent Brochard; Andres Esteban; Luciano Gattinoni; Frank van Haren; Anders Larsson; Daniel F McAuley; Marco Ranieri; Gordon Rubenfeld; B Taylor Thompson; Hermann Wrigge; Arthur S Slutsky; Antonio Pesenti Journal: JAMA Date: 2016-02-23 Impact factor: 56.272
Authors: Carolyn S Calfee; David R Janz; Gordon R Bernard; Addison K May; Kirsten N Kangelaris; Michael A Matthay; Lorraine B Ware Journal: Chest Date: 2015-06 Impact factor: 9.410
Authors: William D Schweickert; Mark C Pohlman; Anne S Pohlman; Celerina Nigos; Amy J Pawlik; Cheryl L Esbrook; Linda Spears; Megan Miller; Mietka Franczyk; Deanna Deprizio; Gregory A Schmidt; Amy Bowman; Rhonda Barr; Kathryn E McCallister; Jesse B Hall; John P Kress Journal: Lancet Date: 2009-05-14 Impact factor: 79.321
Authors: April E Howard; Carrie Courtney-Shapiro; Lynn A Kelso; Michele Goltz; Peter E Morris Journal: Am J Crit Care Date: 2004-01 Impact factor: 2.228
Authors: Helen C Azzam; Satjeet S Khalsa; Richard Urbani; Chirag V Shah; Jason D Christie; Paul N Lanken; Barry D Fuchs Journal: J Am Med Inform Assoc Date: 2009-04-23 Impact factor: 4.497
Authors: Daniel B Knox; Alex Brunhoeber; Ithan D Peltan; Samuel M Brown; Michael J Lanspa Journal: Intensive Care Med Date: 2022-08-05 Impact factor: 41.787