| Literature DB >> 26958206 |
Mehmet Kayaalp1, Allen C Browne1, Pamela Sagan1, Tyne McGee1, Clement J McDonald1.
Abstract
The Privacy Rule of Health Insurance Portability and Accountability Act (HIPAA) requires that clinical documents be stripped of personally identifying information before they can be released to researchers and others. We have been manually annotating clinical text since 2008 in order to test and evaluate an algorithmic clinical text de-identification tool, NLM Scrubber, which we have been developing in parallel. Although HIPAA provides some guidance about what must be de-identified, translating those guidelines into practice is not as straightforward, especially when one deals with free text. As a result we have changed our manual annotation labels and methods six times. This paper explains why we have made those annotation choices, which have been evolved throughout seven years of practice on this field. The aim of this paper is to start a community discussion towards developing standards for clinical text annotation with the end goal of studying and comparing clinical text de-identification systems more accurately.Mesh:
Year: 2015 PMID: 26958206 PMCID: PMC4765667
Source DB: PubMed Journal: AMIA Annu Symp Proc ISSN: 1559-4076