Secondary use of electronic health records for building cohort studies through top-down information extraction.

Markus Kreuzthaler, Stefan Schulz, Andrea Berghold.

Abstract

Controlled clinical trials are usually supported with an in-front data aggregation system, which supports the storage of relevant information according to the trial context within a highly structured environment. In contrast to the documentation of clinical trials, daily routine documentation has many characteristics that influence data quality. One such characteristic is the use of non-standardized text, which is an indispensable part of information representation in clinical information systems. Based on a cohort study we highlight challenges for mining electronic health records targeting free text entry fields within semi-structured data sources. Our prototypical information extraction system achieved an F-measure of 0.91 (precision=0.90, recall=0.93) for the training set and an F-measure of 0.90 (precision=0.89, recall=0.92) for the test set. We analyze the obtained results in detail and highlight challenges and future directions for the secondary use of routine data in general.
Copyright © 2014 Elsevier Inc. All rights reserved.
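
The reported F-measures can be checked against the stated precision and recall. The sketch below assumes the abstract means the balanced F1 score (the harmonic mean of precision and recall), which the abstract does not say explicitly. The function name `f_measure` and the `reported` dictionary are illustrative and not taken from the paper. The inputs are the rounded values from the abstract, so the check only holds to two decimal places.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Return the F-beta score; beta=1 gives the balanced F1 (harmonic mean)."""
    if precision < 0 or recall < 0:
        raise ValueError("precision and recall must be non-negative")
    denominator = beta ** 2 * precision + recall
    if denominator == 0:
        # Both precision and recall are zero: define the score as 0.
        return 0.0
    return (1 + beta ** 2) * precision * recall / denominator


# Rounded (precision, recall) pairs as stated in the abstract.
reported = {
    "training": (0.90, 0.93),
    "test": (0.89, 0.92),
}

for name, (p, r) in reported.items():
    print(f"{name}: F1 = {f_measure(p, r):.2f}")
```

Under the F1 assumption, the recomputed values agree with the abstract's 0.91 (training) and 0.90 (test) after rounding.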

Keywords:  Clinical narrative; Information extraction; Secondary use

Year:  2014        PMID: 25451102     DOI: 10.1016/j.jbi.2014.10.010

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  11 in total

Review 1.  Clinical Data Reuse or Secondary Use: Current Status and Potential Future Progress.

Authors:  S M Meystre; C Lovis; T Bürkle; G Tognola; A Budrionis; C U Lehmann
Journal:  Yearb Med Inform       Date:  2017-09-11

Review 2.  Aspiring to Unintended Consequences of Natural Language Processing: A Review of Recent Developments in Clinical and Consumer-Generated Text Processing.

Authors:  D Demner-Fushman; N Elhadad
Journal:  Yearb Med Inform       Date:  2016-11-10

3.  Clinical Research Informatics Contributions from 2015.

Authors:  C Daniel; R Choquet
Journal:  Yearb Med Inform       Date:  2016-11-10

4.  Catch Me if You Can: Acute Events Hidden in Structured Chronic Disease Diagnosis Descriptions Show Detectable Recording Patterns in EHR.

Authors:  Franck Diaz-Garelli; Kristin M Lenoir; Brian J Wells
Journal:  AMIA Annu Symp Proc       Date:  2021-01-25

Review 5.  Update on Data Reuse in Health Care.

Authors:  C Safran
Journal:  Yearb Med Inform       Date:  2017-09-11

Review 6.  Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review.

Authors:  Kory Kreimeyer; Matthew Foster; Abhishek Pandey; Nina Arya; Gwendolyn Halford; Sandra F Jones; Richard Forshee; Mark Walderhaug; Taxiarchis Botsis
Journal:  J Biomed Inform       Date:  2017-07-17       Impact factor: 6.317

7.  Estimating Marginal Healthcare Costs Using Genetic Variants as Instrumental Variables: Mendelian Randomization in Economic Evaluation.

Authors:  Padraig Dixon; George Davey Smith; Stephanie von Hinke; Neil M Davies; William Hollingworth
Journal:  Pharmacoeconomics       Date:  2016-11       Impact factor: 4.981

8.  Improving a Secondary Use Health Data Warehouse: Proposing a Multi-Level Data Quality Framework.

Authors:  Sandra Henley-Smith; Douglas Boyle; Kathleen Gray
Journal:  EGEMS (Wash DC)       Date:  2019-08-02

Review 9.  Clinical Natural Language Processing in languages other than English: opportunities and challenges.

Authors:  Aurélie Névéol; Hercules Dalianis; Sumithra Velupillai; Guergana Savova; Pierre Zweigenbaum
Journal:  J Biomed Semantics       Date:  2018-03-30

10.  Cohort Selection for Clinical Trials From Longitudinal Patient Records: Text Mining Approach.

Authors:  Irena Spasic; Dominik Krzeminski; Padraig Corcoran; Alexander Balinsky
Journal:  JMIR Med Inform       Date:  2019-10-31