Literature DB >> 26171080

Interactive Cohort Identification of Sleep Disorder Patients Using Natural Language Processing and i2b2.

W Chen1, R Kowatch2, S Lin1, M Splaingard3, Y Huang1.   

Abstract

UNLABELLED: Nationwide Children's Hospital established an i2b2 (Informatics for Integrating Biology & the Bedside) application for sleep disorder cohort identification. Discrete data were gleaned from semistructured sleep study reports. The system showed to work more efficiently than the traditional manual chart review method, and it also enabled searching capabilities that were previously not possible.
OBJECTIVE: We report on the development and implementation of the sleep disorder i2b2 cohort identification system using natural language processing of semi-structured documents.
METHODS: We developed a natural language processing approach to automatically parse concepts and their values from semi-structured sleep study documents. Two parsers were developed: a regular expression parser for extracting numeric concepts and a NLP based tree parser for extracting textual concepts. Concepts were further organized into i2b2 ontologies based on document structures and in-domain knowledge.
RESULTS: 26,550 concepts were extracted with 99% being textual concepts. 1.01 million facts were extracted from sleep study documents such as demographic information, sleep study lab results, medications, procedures, diagnoses, among others. The average accuracy of terminology parsing was over 83% when comparing against those by experts. The system is capable of capturing both standard and non-standard terminologies. The time for cohort identification has been reduced significantly from a few weeks to a few seconds.
CONCLUSION: Natural language processing was shown to be powerful for quickly converting large amount of semi-structured or unstructured clinical data into discrete concepts, which in combination of intuitive domain specific ontologies, allows fast and effective interactive cohort identification through the i2b2 platform for research and clinical use.

Entities:  

Keywords:  Sleep disorder; clinical ontology; cohort identification; i2b2; natural language processing (NLP)

Mesh:

Year:  2015        PMID: 26171080      PMCID: PMC4493335          DOI: 10.4338/ACI-2014-11-RA-0106

Source DB:  PubMed          Journal:  Appl Clin Inform        ISSN: 1869-0327            Impact factor:   2.342


  31 in total

1.  Automated extraction of ejection fraction for quality measurement using regular expressions in Unstructured Information Management Architecture (UIMA) for heart failure.

Authors:  Jennifer H Garvin; Scott L DuVall; Brett R South; Bruce E Bray; Daniel Bolton; Julia Heavirland; Steve Pickard; Paul Heidenreich; Shuying Shen; Charlene Weir; Matthew Samore; Mary K Goldstein
Journal:  J Am Med Inform Assoc       Date:  2012-03-21       Impact factor: 4.497

2.  A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.

Authors:  Min Jiang; Yukun Chen; Mei Liu; S Trent Rosenbloom; Subramani Mani; Joshua C Denny; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2011-04-20       Impact factor: 4.497

3.  A comparative study of current Clinical Natural Language Processing systems on handling abbreviations in discharge summaries.

Authors:  Yonghui Wu; Joshua C Denny; S Trent Rosenbloom; Randolph A Miller; Dario A Giuse; Hua Xu
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

Review 4.  Natural language processing in biomedicine: a unified system architecture overview.

Authors:  Son Doan; Mike Conway; Tu Minh Phuong; Lucila Ohno-Machado
Journal:  Methods Mol Biol       Date:  2014

5.  A randomized trial of adenotonsillectomy for childhood sleep apnea.

Authors:  Carole L Marcus; Reneé H Moore; Carol L Rosen; Bruno Giordani; Susan L Garetz; H Gerry Taylor; Ron B Mitchell; Raouf Amin; Eliot S Katz; Raanan Arens; Shalini Paruthi; Hiren Muzumdar; David Gozal; Nina Hattiangadi Thomas; Janice Ware; Dean Beebe; Karen Snyder; Lisa Elden; Robert C Sprecher; Paul Willging; Dwight Jones; John P Bent; Timothy Hoban; Ronald D Chervin; Susan S Ellenberg; Susan Redline
Journal:  N Engl J Med       Date:  2013-05-21       Impact factor: 91.245

6.  An i2b2-based, generalizable, open source, self-scaling chronic disease registry.

Authors:  Marc D Natter; Justin Quan; David M Ortiz; Athos Bousvaros; Norman T Ilowite; Christi J Inman; Keith Marsolo; Andrew J McMurry; Christy I Sandborg; Laura E Schanberg; Carol A Wallace; Robert W Warren; Griffin M Weber; Kenneth D Mandl
Journal:  J Am Med Inform Assoc       Date:  2012-06-25       Impact factor: 4.497

7.  Using natural language processing to enable in-depth analysis of clinical messages posted to an Internet mailing list: a feasibility study.

Authors:  Tanja Bekhuis; Marcos Kreinacke; Heiko Spallek; Mei Song; Jean A O'Donnell
Journal:  J Med Internet Res       Date:  2011-11-23       Impact factor: 5.428

8.  Unified Medical Language System term occurrences in clinical notes: a large-scale corpus analysis.

Authors:  Stephen T Wu; Hongfang Liu; Dingcheng Li; Cui Tao; Mark A Musen; Christopher G Chute; Nigam H Shah
Journal:  J Am Med Inform Assoc       Date:  2012-04-04       Impact factor: 4.497

9.  Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system.

Authors:  Qing T Zeng; Sergey Goryachev; Scott Weiss; Margarita Sordo; Shawn N Murphy; Ross Lazarus
Journal:  BMC Med Inform Decis Mak       Date:  2006-07-26       Impact factor: 2.796

10.  Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features.

Authors:  Buzhou Tang; Hongxin Cao; Yonghui Wu; Min Jiang; Hua Xu
Journal:  BMC Med Inform Decis Mak       Date:  2013-04-05       Impact factor: 2.796

View more
  4 in total

1.  Computerized "Learn-As-You-Go" classification of traumatic brain injuries using NEISS narrative data.

Authors:  Wei Chen; Krista K Wheeler; Simon Lin; Yungui Huang; Huiyun Xiang
Journal:  Accid Anal Prev       Date:  2016-02-03

2.  Performance of a rule-based semi-automated method to optimize chart abstraction for surveillance imaging among patients treated for non-small cell lung cancer.

Authors:  Catherine Byrd; Ureka Ajawara; Ryan Laundry; John Radin; Prasha Bhandari; Ann Leung; Summer Han; Stephen M Asch; Steven Zeliadt; Alex H S Harris; Leah Backhus
Journal:  BMC Med Inform Decis Mak       Date:  2022-06-03       Impact factor: 3.298

Review 3.  Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review.

Authors:  Kory Kreimeyer; Matthew Foster; Abhishek Pandey; Nina Arya; Gwendolyn Halford; Sandra F Jones; Richard Forshee; Mark Walderhaug; Taxiarchis Botsis
Journal:  J Biomed Inform       Date:  2017-07-17       Impact factor: 6.317

4.  Cohort Selection for Clinical Trials From Longitudinal Patient Records: Text Mining Approach.

Authors:  Irena Spasic; Dominik Krzeminski; Padraig Corcoran; Alexander Balinsky
Journal:  JMIR Med Inform       Date:  2019-10-31
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.