Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Automatic detection of protected health information from clinic narratives.

Literature DB >> 26231070

Automatic detection of protected health information from clinic narratives.

Abstract

This paper presents a natural language processing (NLP) system that was designed to participate in the 2014 i2b2 de-identification challenge. The challenge task aims to identify and classify seven main Protected Health Information (PHI) categories and 25 associated sub-categories. A hybrid model was proposed which combines machine learning techniques with keyword-based and rule-based approaches to deal with the complexity inherent in PHI categories. Our proposed approaches exploit a rich set of linguistic features, both syntactic and word surface-oriented, which are further enriched by task-specific features and regular expression template patterns to characterize the semantics of various PHI categories. Our system achieved promising accuracy on the challenge test data with an overall micro-averaged F-measure of 93.6%, which was the winner of this de-identification challenge.

Entities: Chemical

Keywords: Clinical text mining; De-identification; Hybrid model; Natural language processing; Protected Health Information (PHI)

Mesh：

Year: 2015 PMID： 26231070 PMCID： PMC4989090 DOI： 10.1016/j.jbi.2015.06.015

Source DB: PubMed Journal: J Biomed Inform ISSN： 1532-0464 Impact factor: 6.317

17 in total

Automatic detection of protected health information from clinic narratives.

1. Identification of patient name references within medical documents using semantic selectional restrictions.

2. A de-identifier for medical discharge summaries.

3. Rapidly retargetable approaches to de-identification in medical records.

4. A system for de-identifying medical message board text.

5. BoB, a best-of-breed automated text de-identification system for VHA clinical documents.

6. The MITRE Identification Scrubber Toolkit: design, training, and assessment.

7. Repurposing the clinical record: can an existing natural language processing system de-identify clinical notes?

8. Large-scale evaluation of automated clinical note de-identification and its impact on information extraction.

9. Development and evaluation of an open source software tool for deidentification of pathology reports.

10. Improved de-identification of physician notes through integrative modeling of both public and private medical text.

1. Leveraging existing corpora for de-identification of psychiatric notes using domain adaptation.

2. Ensemble-based Methods to Improve De-identification of Electronic Health Record Narratives.

3. Comparative Study of Various Approaches for Ensemble-based De-identification of Electronic Health Record Narratives.

4. Healthcare Data Breaches: Implications for Digital Forensic Readiness.

5. Automatic prediction of coronary artery disease from clinical narratives.

6. Practical applications for natural language processing in clinical research: The 2014 i2b2/UTHealth shared tasks.

7. Scalable Iterative Classification for Sanitizing Large-Scale Datasets.

8. The UAB Informatics Institute and 2016 CEGS N-GRID de-identification shared task challenge.

9. De-identification of medical records using conditional random fields and long short-term memory networks.

Review 10. Clinical concept extraction: A methodology review.