Literature DB >> 29569650

Anonymizing and Sharing Medical Text Records.

Xiao-Bai Li1, Jialun Qin1.   

Abstract

Health information technology has increased accessibility of health and medical data and benefited medical research and healthcare management. However, there are rising concerns about patient privacy in sharing medical and healthcare data. A large amount of these data are in free text form. Existing techniques for privacy-preserving data sharing deal largely with structured data. Current privacy approaches for medical text data focus on detection and removal of patient identifiers from the data, which may be inadequate for protecting privacy or preserving data quality. We propose a new systematic approach to extract, cluster, and anonymize medical text records. Our approach integrates methods developed in both data privacy and health informatics fields. The key novel elements of our approach include a recursive partitioning method to cluster medical text records based on the similarity of the health and medical information and a value-enumeration method to anonymize potentially identifying information in the text data. An experimental study is conducted using real-world medical documents. The results of the experiments demonstrate the effectiveness of the proposed approach.

Entities:  

Keywords:  anonymization; data analytics; document clustering; information extraction; privacy

Year:  2017        PMID: 29569650      PMCID: PMC5858761          DOI: 10.1287/isre.2016.0676

Source DB:  PubMed          Journal:  Inf Syst Res        ISSN: 1047-7047


  15 in total

1.  Learning the parts of objects by non-negative matrix factorization.

Authors:  D D Lee; H S Seung
Journal:  Nature       Date:  1999-10-21       Impact factor: 49.962

2.  Biomedical databases: protecting privacy and promoting research.

Authors:  Jean E Wylie; Geraldine P Mineau
Journal:  Trends Biotechnol       Date:  2003-03       Impact factor: 19.536

3.  Strategies for maintaining patient privacy in i2b2.

Authors:  Shawn N Murphy; Vivian Gainer; Michael Mendis; Susanne Churchill; Isaac Kohane
Journal:  J Am Med Inform Assoc       Date:  2011-10-07       Impact factor: 4.497

4.  Toward a national framework for the secondary use of health data: an American Medical Informatics Association White Paper.

Authors:  Charles Safran; Meryl Bloomrosen; W Edward Hammond; Steven Labkoff; Suzanne Markel-Fox; Paul C Tang; Don E Detmer
Journal:  J Am Med Inform Assoc       Date:  2006-10-31       Impact factor: 4.497

5.  Rapidly retargetable approaches to de-identification in medical records.

Authors:  Ben Wellner; Matt Huyck; Scott Mardis; John Aberdeen; Alex Morgan; Leonid Peshkin; Alex Yeh; Janet Hitzeman; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2007-06-28       Impact factor: 4.497

Review 6.  Mining electronic health records: towards better research applications and clinical care.

Authors:  Peter B Jensen; Lars J Jensen; Søren Brunak
Journal:  Nat Rev Genet       Date:  2012-05-02       Impact factor: 53.242

7.  Ensemble machine learning on gene expression data for cancer classification.

Authors:  Aik Choon Tan; David Gilbert
Journal:  Appl Bioinformatics       Date:  2003

8.  A software tool for removing patient identifying information from clinical documents.

Authors:  F Jeff Friedlin; Clement J McDonald
Journal:  J Am Med Inform Assoc       Date:  2008-06-25       Impact factor: 4.497

9.  Class Restricted Clustering and Micro-Perturbation for Data Privacy.

Authors:  Xiao-Bai Li; Sumit Sarkar
Journal:  Manage Sci       Date:  2013-04-01       Impact factor: 4.883

Review 10.  Automatic de-identification of textual documents in the electronic health record: a review of recent research.

Authors:  Stephane M Meystre; F Jeffrey Friedlin; Brett R South; Shuying Shen; Matthew H Samore
Journal:  BMC Med Res Methodol       Date:  2010-08-02       Impact factor: 4.615

View more
  2 in total

1.  Blockchain-Based Medical Records Secure Storage and Medical Service Framework.

Authors:  Yi Chen; Shuai Ding; Zheng Xu; Handong Zheng; Shanlin Yang
Journal:  J Med Syst       Date:  2018-11-22       Impact factor: 4.460

Review 2.  Privacy Protection and Secondary Use of Health Data: Strategies and Methods.

Authors:  Dingyi Xiang; Wei Cai
Journal:  Biomed Res Int       Date:  2021-10-07       Impact factor: 3.411

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.