
Extracting and integrating data from entire electronic health records for detecting colorectal cancer cases.

Hua Xu, Zhenming Fu, Anushi Shah, Yukun Chen, Neeraja B Peterson, Qingxia Chen, Subramani Mani, Mia A Levy, Qi Dai, Josh C Denny.

Abstract

Identifying a cohort of patients with a specific disease is an important step in clinical research based on electronic health records (EHRs). Informatics approaches that combine structured EHR data, such as billing records, with narrative text data have demonstrated utility for such tasks. This paper describes an algorithm that combines machine learning and natural language processing to detect patients with colorectal cancer (CRC) from entire EHRs at Vanderbilt University Hospital. We developed a general case detection method that consists of two steps: 1) extraction of positive CRC concepts from all clinical notes (document-level concept identification); and 2) determination of CRC cases using aggregated information from both clinical narratives and structured billing data (patient-level case determination). For each step, we compared the performance of rule-based and machine-learning-based approaches. Using a manually reviewed data set of 300 possible CRC patients (150 for training and 150 for testing), we showed that our method achieved F-measures of 0.996 for document-level concept identification and 0.93 for patient-level case determination.
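The two-step design described in the abstract can be illustrated with a minimal rule-based sketch. It is not the authors' implementation: the term list, the negation cues, the decision thresholds, and the names `note_has_positive_crc`, `is_crc_case`, and `f_measure` are all assumptions made for illustration, and the ICD-9-CM codes (153.x for colon, 154.0 and 154.1 for rectosigmoid junction and rectum) are one plausible choice of billing codes rather than the set used in the paper. The machine-learning variants that the paper compares against are not shown.

```python
import re
from dataclasses import dataclass
from typing import List

# Hypothetical lexicon and negation cues; the abstract does not list the real ones.
CRC_TERMS = re.compile(r"\b(colorectal cancer|colon cancer|rectal cancer)\b", re.I)
NEGATION = re.compile(r"\b(no|denies|negative for|without|rule out)\b[^.]{0,40}$", re.I)


def note_has_positive_crc(text: str) -> bool:
    """Step 1 (document level): True if some CRC mention in the note
    is not preceded by a negation cue within the same sentence."""
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        for match in CRC_TERMS.finditer(sentence):
            if not NEGATION.search(sentence[:match.start()]):
                return True
    return False


@dataclass
class Patient:
    notes: List[str]       # narrative clinical notes
    icd9_codes: List[str]  # structured billing codes


def is_crc_billing_code(code: str) -> bool:
    # Assumed code set: ICD-9-CM 153.x, 154.0, 154.1.
    return code.startswith("153") or code in {"154.0", "154.1"}


def is_crc_case(patient: Patient, min_notes: int = 2) -> bool:
    """Step 2 (patient level): aggregate note-level and billing evidence.
    The thresholds are illustrative, not taken from the paper."""
    positive_notes = sum(note_has_positive_crc(n) for n in patient.notes)
    billing_hits = sum(is_crc_billing_code(c) for c in patient.icd9_codes)
    return positive_notes >= min_notes or (positive_notes >= 1 and billing_hits >= 1)


def f_measure(predicted: List[bool], actual: List[bool]) -> float:
    """Balanced F-measure: harmonic mean of precision and recall."""
    tp = sum(p and a for p, a in zip(predicted, actual))
    fp = sum(p and not a for p, a in zip(predicted, actual))
    fn = sum(a and not p for p, a in zip(predicted, actual))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```

In this sketch a billing code alone never makes a patient a case; it only confirms a single positive note. That choice reflects the abstract's premise that billing data and narrative text are combined, but the actual combination rule in the paper may differ.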

Year: 2011   PMID: 22195222   PMCID: PMC3243156

Source DB: PubMed   Journal: AMIA Annu Symp Proc   ISSN: 1559-4076


