Literature DB >> 17600096

Rapidly retargetable approaches to de-identification in medical records.

Ben Wellner1, Matt Huyck, Scott Mardis, John Aberdeen, Alex Morgan, Leonid Peshkin, Alex Yeh, Janet Hitzeman, Lynette Hirschman.   

Abstract

OBJECTIVE: This paper describes a successful approach to de-identification that was developed to participate in a recent AMIA-sponsored challenge evaluation.
METHOD: Our approach focused on rapid adaptation of existing toolkits for named entity recognition using two existing toolkits, Carafe and LingPipe.
RESULTS: The "out of the box" Carafe system achieved a very good score (phrase F-measure of 0.9664) with only four hours of work to adapt it to the de-identification task. With further tuning, we were able to reduce the token-level error term by over 36% through task-specific feature engineering and the introduction of a lexicon, achieving a phrase F-measure of 0.9736.
CONCLUSIONS: We were able to achieve good performance on the de-identification task by the rapid retargeting of existing toolkits. For the Carafe system, we developed a method for tuning the balance of recall vs. precision, as well as a confidence score that correlated well with the measured F-score.

Mesh:

Year:  2007        PMID: 17600096      PMCID: PMC1975794          DOI: 10.1197/jamia.M2435

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  4 in total

1.  Gene name identification and normalization using a model organism database.

Authors:  Alexander A Morgan; Lynette Hirschman; Marc Colosimo; Alexander S Yeh; Jeff B Colombe
Journal:  J Biomed Inform       Date:  2004-12       Impact factor: 6.317

2.  Automating the assignment of diagnosis codes to patient encounters using example-based and machine learning techniques.

Authors:  Serguei V S Pakhomov; James D Buntrock; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2006-06-23       Impact factor: 4.497

3.  Evaluating the state-of-the-art in automatic de-identification.

Authors:  Ozlem Uzuner; Yuan Luo; Peter Szolovits
Journal:  J Am Med Inform Assoc       Date:  2007-06-28       Impact factor: 4.497

4.  Identifying gene and protein mentions in text using conditional random fields.

Authors:  Ryan McDonald; Fernando Pereira
Journal:  BMC Bioinformatics       Date:  2005-05-24       Impact factor: 3.169

  4 in total
  48 in total

1.  MITRE system for clinical assertion status classification.

Authors:  Cheryl Clark; John Aberdeen; Matt Coarr; David Tresner-Kirsch; Ben Wellner; Alexander Yeh; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2011-04-22       Impact factor: 4.497

2.  Voice-dictated versus typed-in clinician notes: linguistic properties and the potential implications on natural language processing.

Authors:  Kai Zheng; Qiaozhu Mei; Lei Yang; Frank J Manion; Ulysses J Balis; David A Hanauer
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

Review 3.  Strategies for de-identification and anonymization of electronic health record data for use in multicenter research studies.

Authors:  Clete A Kushida; Deborah A Nichols; Rik Jadrnicek; Ric Miller; James K Walsh; Kara Griffin
Journal:  Med Care       Date:  2012-07       Impact factor: 2.983

4.  Hiding in plain sight: use of realistic surrogates to reduce exposure of protected health information in clinical text.

Authors:  David Carrell; Bradley Malin; John Aberdeen; Samuel Bayer; Cheryl Clark; Ben Wellner; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2012-07-06       Impact factor: 4.497

5.  Improving textual medication extraction using combined conditional random fields and rule-based systems.

Authors:  Domonkos Tikk; Illés Solt
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

6.  Using a pipeline to improve de-identification performance.

Authors:  Frances P Morrison; Soumitra Sengupta; George Hripcsak
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

7.  Embedding a hiding function in a portable electronic health record for privacy preservation.

Authors:  Lu-Chou Huang; Huei-Chung Chu; Chung-Yueh Lien; Chia-Hung Hsiao; Tsair Kao
Journal:  J Med Syst       Date:  2010-06       Impact factor: 4.460

8.  Evaluation of a generalizable approach to clinical information retrieval using the automated retrieval console (ARC).

Authors:  Leonard W D'Avolio; Thien M Nguyen; Wildon R Farwell; Yongming Chen; Felicia Fitzmeyer; Owen M Harris; Louis D Fiore
Journal:  J Am Med Inform Assoc       Date:  2010 Jul-Aug       Impact factor: 4.497

9.  The bird's-eye view: A data-driven approach to understanding patient journeys from claims data.

Authors:  Katherine Bobroske; Christine Larish; Anita Cattrell; Margrét V Bjarnadóttir; Lawrence Huan
Journal:  J Am Med Inform Assoc       Date:  2020-07-01       Impact factor: 4.497

10.  CRFs based de-identification of medical records.

Authors:  Bin He; Yi Guan; Jianyi Cheng; Keting Cen; Wenlan Hua
Journal:  J Biomed Inform       Date:  2015-08-24       Impact factor: 6.317

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.