Literature DB >> 20351891

Inductive creation of an annotation schema and a reference standard for de-identification of VA electronic clinical notes.

Jeanmarie Mayer1, Shuying Shen, Brett R South, Stephane Meystre, F Jeff Friedlin, William R Ray, Matthew Samore.   

Abstract

Accessing both structured and unstructured clinical data is a high priority for research efforts. However, HIPAA requires that data meet or exceed a deidentification standard to assure that protected health information (PHI) is removed. This is a particularly difficult problem in the case of unstructured clinical free text and natural language processing (NLP) systems can be trained to automatically de-identify clinical text. Moreover, manual human annotation of clinical note documents for the purpose of building reference standards to evaluate NLP systems is a costly and time consuming process. Annotation schema must be created that can be used to build reliable and valid reference standards to evaluate NLP systems for the deidentification task. We describe the inductive creation of an annotation schema and subsequent reference standard. We also provide estimates of the accuracy of human annotators for this particular task.

Entities:  

Mesh:

Year:  2009        PMID: 20351891      PMCID: PMC2815367     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  7 in total

Review 1.  Measuring agreement in medical informatics reliability studies.

Authors:  George Hripcsak; Daniel F Heitjan
Journal:  J Biomed Inform       Date:  2002-04       Impact factor: 6.317

2.  Standards for privacy of individually identifiable health information. Final rule.

Authors: 
Journal:  Fed Regist       Date:  2002-08-14

3.  Assessing the difficulty and time cost of de-identification in clinical narratives.

Authors:  D A Dorr; W F Phillips; S Phansalkar; S A Sims; J F Hurdle
Journal:  Methods Inf Med       Date:  2006       Impact factor: 2.176

4.  Evaluating the state-of-the-art in automatic de-identification.

Authors:  Ozlem Uzuner; Yuan Luo; Peter Szolovits
Journal:  J Am Med Inform Assoc       Date:  2007-06-28       Impact factor: 4.497

5.  Replacing personally-identifying information in medical records, the Scrub system.

Authors:  L Sweeney
Journal:  Proc AMIA Annu Fall Symp       Date:  1996

6.  The CLEF corpus: semantic annotation of clinical text.

Authors:  Angus Roberts; Robert Gaizauskas; Mark Hepple; Neil Davis; George Demetriou; Yikun Guo; Jay Kola; Ian Roberts; Andrea Setzer; Archana Tapuria; Bill Wheeldin
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11

7.  Evaluation of a deidentification (De-Id) software engine to share pathology reports and clinical documents for research.

Authors:  Dilip Gupta; Melissa Saul; John Gilbertson
Journal:  Am J Clin Pathol       Date:  2004-02       Impact factor: 2.493

  7 in total
  9 in total

Review 1.  Strategies for de-identification and anonymization of electronic health record data for use in multicenter research studies.

Authors:  Clete A Kushida; Deborah A Nichols; Rik Jadrnicek; Ric Miller; James K Walsh; Kara Griffin
Journal:  Med Care       Date:  2012-07       Impact factor: 2.983

2.  Hiding in plain sight: use of realistic surrogates to reduce exposure of protected health information in clinical text.

Authors:  David Carrell; Bradley Malin; John Aberdeen; Samuel Bayer; Cheryl Clark; Ben Wellner; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2012-07-06       Impact factor: 4.497

3.  The machine giveth and the machine taketh away: a parrot attack on clinical text deidentified with hiding in plain sight.

Authors:  David S Carrell; David J Cronkite; Muqun Rachel Li; Steve Nyemba; Bradley A Malin; John S Aberdeen; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2019-12-01       Impact factor: 4.497

4.  Is the Juice Worth the Squeeze? Costs and Benefits of Multiple Human Annotators for Clinical Text De-identification.

Authors:  David S Carrell; David J Cronkite; Bradley A Malin; John S Aberdeen; Lynette Hirschman
Journal:  Methods Inf Med       Date:  2016-07-13       Impact factor: 2.176

5.  Informatics can identify systemic sclerosis (SSc) patients at risk for scleroderma renal crisis.

Authors:  Doug Redd; Tracy M Frech; Maureen A Murtaugh; Julia Rhiannon; Qing T Zeng
Journal:  Comput Biol Med       Date:  2014-08-07       Impact factor: 4.589

6.  Classifying Cyber-Risky Clinical Notes by Employing Natural Language Processing.

Authors:  Suzanna Schmeelk; Martins Samuel Dogo; Yifan Peng; Braja Gopal Patra
Journal:  Proc Annu Hawaii Int Conf Syst Sci       Date:  2022-01-04

7.  Resilience of clinical text de-identified with "hiding in plain sight" to hostile reidentification attacks by human readers.

Authors:  David S Carrell; Bradley A Malin; David J Cronkite; John S Aberdeen; Cheryl Clark; Muqun Rachel Li; Dikshya Bastakoty; Steve Nyemba; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2020-07-01       Impact factor: 4.497

Review 8.  Clinical concept extraction: A methodology review.

Authors:  Sunyang Fu; David Chen; Huan He; Sijia Liu; Sungrim Moon; Kevin J Peterson; Feichen Shen; Liwei Wang; Yanshan Wang; Andrew Wen; Yiqing Zhao; Sunghwan Sohn; Hongfang Liu
Journal:  J Biomed Inform       Date:  2020-08-06       Impact factor: 6.317

9.  Evaluating the effects of machine pre-annotation and an interactive annotation interface on manual de-identification of clinical text.

Authors:  Brett R South; Danielle Mowery; Ying Suo; Jianwei Leng; Óscar Ferrández; Stephane M Meystre; Wendy W Chapman
Journal:  J Biomed Inform       Date:  2014-05-20       Impact factor: 6.317

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.