Literature DB >> 24384230

Combining structured and unstructured data to identify a cohort of ICU patients who received dialysis.

Swapna Abhyankar1, Dina Demner-Fushman1, Fiona M Callaghan1, Clement J McDonald1.   

Abstract

OBJECTIVE: To develop a generalizable method for identifying patient cohorts from electronic health record (EHR) data-in this case, patients having dialysis-that uses simple information retrieval (IR) tools.
METHODS: We used the coded data and clinical notes from the 24,506 adult patients in the Multiparameter Intelligent Monitoring in Intensive Care database to identify patients who had dialysis. We used SQL queries to search the procedure, diagnosis, and coded nursing observations tables based on ICD-9 and local codes. We used a domain-specific search engine to find clinical notes containing terms related to dialysis. We manually validated the available records for a 10% random sample of patients who potentially had dialysis and a random sample of 200 patients who were not identified as having dialysis based on any of the sources.
RESULTS: We identified 1844 patients that potentially had dialysis: 1481 from the three coded sources and 1624 from the clinical notes. Precision for identifying dialysis patients based on available data was estimated to be 78.4% (95% CI 71.9% to 84.2%) and recall was 100% (95% CI 86% to 100%).
CONCLUSIONS: Combining structured EHR data with information from clinical notes using simple queries increases the utility of both types of data for cohort identification. Patients identified by more than one source are more likely to meet the inclusion criteria; however, including patients found in any of the sources increases recall. This method is attractive because it is available to researchers with access to EHR data and off-the-shelf IR tools. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

Entities:  

Mesh:

Year:  2014        PMID: 24384230      PMCID: PMC4147606          DOI: 10.1136/amiajnl-2013-001915

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  22 in total

1.  Design and implementation of a national clinical trials registry.

Authors:  A T McCray; N C Ide
Journal:  J Am Med Inform Assoc       Date:  2000 May-Jun       Impact factor: 4.497

2.  The usefulness of administrative databases for identifying disease cohorts is increased with a multivariate model.

Authors:  Carl van Walraven; Peter C Austin; Douglas Manuel; Greg Knoll; Allison Jennings; Alan J Forster
Journal:  J Clin Epidemiol       Date:  2010-05-08       Impact factor: 6.437

3.  Automated encoding of clinical documents based on natural language processing.

Authors:  Carol Friedman; Lyudmila Shagina; Yves Lussier; George Hripcsak
Journal:  J Am Med Inform Assoc       Date:  2004-06-07       Impact factor: 4.497

4.  Essie: a concept-based search engine for structured biomedical text.

Authors:  Nicholas C Ide; Russell F Loane; Dina Demner-Fushman
Journal:  J Am Med Inform Assoc       Date:  2007-02-28       Impact factor: 4.497

5.  Identification of hospital-acquired catheter-associated urinary tract infections from Medicare claims: sensitivity and positive predictive value.

Authors:  Chunliu Zhan; Anne Elixhauser; Chesley L Richards; Yun Wang; William B Baine; Michael Pineau; Nancy Verzier; Rebecca Kliman; David Hunt
Journal:  Med Care       Date:  2009-03       Impact factor: 2.983

6.  Utility of administrative claims data for the study of brain metastases: a validation study.

Authors:  April F Eichler; Elizabeth B Lamont
Journal:  J Neurooncol       Date:  2009-06-27       Impact factor: 4.130

7.  The Unified Medical Language System.

Authors:  D A Lindberg; B L Humphreys; A T McCray
Journal:  Methods Inf Med       Date:  1993-08       Impact factor: 2.176

8.  MIMIC II: a massive temporal ICU patient database to support research in intelligent patient monitoring.

Authors:  M Saeed; C Lieu; G Raber; R G Mark
Journal:  Comput Cardiol       Date:  2002

9.  Failure of ICD-9-CM codes to identify patients with comorbid chronic kidney disease in diabetes.

Authors:  Elizabeth F O Kern; Miriam Maney; Donald R Miller; Chin-Lin Tseng; Anjali Tiwari; Mangala Rajan; David Aron; Leonard Pogach
Journal:  Health Serv Res       Date:  2006-04       Impact factor: 3.402

10.  Automated de-identification of free-text medical records.

Authors:  Ishna Neamatullah; Margaret M Douglass; Li-wei H Lehman; Andrew Reisner; Mauricio Villarroel; William J Long; Peter Szolovits; George B Moody; Roger G Mark; Gari D Clifford
Journal:  BMC Med Inform Decis Mak       Date:  2008-07-24       Impact factor: 2.796

View more
  20 in total

1.  Data integration of structured and unstructured sources for assigning clinical codes to patient stays.

Authors:  Elyne Scheurwegs; Kim Luyckx; Léon Luyten; Walter Daelemans; Tim Van den Bulcke
Journal:  J Am Med Inform Assoc       Date:  2015-08-27       Impact factor: 4.497

2.  Clinical Research Informatics: Recent Advances and Future Directions.

Authors:  M Dugas
Journal:  Yearb Med Inform       Date:  2015-08-13

3.  Trends in biomedical informatics: automated topic analysis of JAMIA articles.

Authors:  Dong Han; Shuang Wang; Chao Jiang; Xiaoqian Jiang; Hyeon-Eui Kim; Jimeng Sun; Lucila Ohno-Machado
Journal:  J Am Med Inform Assoc       Date:  2015-11       Impact factor: 4.497

4.  Inclusion of Unstructured Clinical Text Improves Early Prediction of Death or Prolonged ICU Stay.

Authors:  Gary E Weissman; Rebecca A Hubbard; Lyle H Ungar; Michael O Harhay; Casey S Greene; Blanca E Himes; Scott D Halpern
Journal:  Crit Care Med       Date:  2018-07       Impact factor: 7.598

5.  Using Arden Syntax to identify registry-eligible very low birth weight neonates from the Electronic Health Record.

Authors:  Indra Neil Sarkar; Elizabeth S Chen; Paul T Rosenau; Matthew B Storer; Beth Anderson; Jeffrey D Horbar
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

Review 6.  Aspiring to Unintended Consequences of Natural Language Processing: A Review of Recent Developments in Clinical and Consumer-Generated Text Processing.

Authors:  D Demner-Fushman; N Elhadad
Journal:  Yearb Med Inform       Date:  2016-11-10

7.  Use of Electronic Health Record Data to Evaluate the Impact of Race on 30-Day Mortality in Patients Admitted to the Intensive Care Unit.

Authors:  Mallika L Mundkur; Fiona M Callaghan; Swapna Abhyankar; Clement J McDonald
Journal:  J Racial Ethn Health Disparities       Date:  2016-06-20

8.  Clinical Informatics Researcher's Desiderata for the Data Content of the Next Generation Electronic Health Record.

Authors:  Timothy I Kennell; James H Willig; James J Cimino
Journal:  Appl Clin Inform       Date:  2017-12-21       Impact factor: 2.342

9.  Construct validity of six sentiment analysis methods in the text of encounter notes of patients with critical illness.

Authors:  Gary E Weissman; Lyle H Ungar; Michael O Harhay; Katherine R Courtright; Scott D Halpern
Journal:  J Biomed Inform       Date:  2018-12-14       Impact factor: 6.317

10.  Natural Language Processing to Assess Documentation of Features of Critical Illness in Discharge Documents of Acute Respiratory Distress Syndrome Survivors.

Authors:  Gary E Weissman; Michael O Harhay; Ricardo M Lugo; Barry D Fuchs; Scott D Halpern; Mark E Mikkelsen
Journal:  Ann Am Thorac Soc       Date:  2016-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.