Literature DB >> 28748228

Supervised Extraction of Diagnosis Codes from EMRs: Role of Feature Selection, Data Selection, and Probabilistic Thresholding.

Anthony Rios1, Ramakanth Kavuluru2.   

Abstract

Extracting diagnosis codes from medical records is a complex task carried out by trained coders by reading all the documents associated with a patient's visit. With the popularity of electronic medical records (EMRs), computational approaches to code extraction have been proposed in the recent years. Machine learning approaches to multi-label text classification provide an important methodology in this task given each EMR can be associated with multiple codes. In this paper, we study the the role of feature selection, training data selection, and probabilistic threshold optimization in improving different multi-label classification approaches. We conduct experiments based on two different datasets: a recent gold standard dataset used for this task and a second larger and more complex EMR dataset we curated from the University of Kentucky Medical Center. While conventional approaches achieve results comparable to the state-of-the-art on the gold standard dataset, on our complex in-house dataset, we show that feature selection, training data selection, and probabilistic thresholding provide significant gains in performance.

Entities:  

Year:  2013        PMID: 28748228      PMCID: PMC5524216          DOI: 10.1109/ICHI.2013.15

Source DB:  PubMed          Journal:  IEEE Int Conf Healthc Inform        ISSN: 2575-2626


  8 in total

1.  The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text.

Authors:  Thomas C Rindflesch; Marcelo Fiszman
Journal:  J Biomed Inform       Date:  2003-12       Impact factor: 6.317

2.  An overview of MetaMap: historical perspective and recent advances.

Authors:  Alan R Aronson; François-Michel Lang
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

3.  Three approaches to automatic assignment of ICD-9-CM codes to radiology reports.

Authors:  Ira Goldstein; Anna Arzrumtsyan; Ozlem Uzuner
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11

4.  Optimal training sets for Bayesian prediction of MeSH assignment.

Authors:  Sunghwan Sohn; Won Kim; Donald C Comeau; W John Wilbur
Journal:  J Am Med Inform Assoc       Date:  2008-04-24       Impact factor: 4.497

5.  Beyond synonymy: exploiting the UMLS semantics in mapping vocabularies.

Authors:  O Bodenreider; S J Nelson; W T Hole; H F Chang
Journal:  Proc AMIA Symp       Date:  1998

6.  Development and evaluation of a computerized admission diagnoses encoding system.

Authors:  M L Gundersen; P J Haug; T A Pryor; R van Bree; S Koehler; K Bauer; B Clemons
Journal:  Comput Biomed Res       Date:  1996-10

7.  A recent advance in the automatic indexing of the biomedical literature.

Authors:  Aurélie Névéol; Sonya E Shooshan; Susanne M Humphrey; James G Mork; Alan R Aronson
Journal:  J Biomed Inform       Date:  2008-12-30       Impact factor: 6.317

8.  Automatic construction of rule-based ICD-9-CM coding systems.

Authors:  Richárd Farkas; György Szarvas
Journal:  BMC Bioinformatics       Date:  2008-04-11       Impact factor: 3.169

  8 in total
  4 in total

1.  Interpretable Predictions of Clinical Outcomes with An Attention-based Recurrent Neural Network.

Authors:  Ying Sha; May D Wang
Journal:  ACM BCB       Date:  2017-08

2.  Predicting mental conditions based on "history of present illness" in psychiatric notes with deep neural networks.

Authors:  Tung Tran; Ramakanth Kavuluru
Journal:  J Biomed Inform       Date:  2017-06-10       Impact factor: 6.317

3.  Natural Language Processing and Machine Learning to Identify People Who Inject Drugs in Electronic Health Records.

Authors:  David Goodman-Meza; Amber Tang; Babak Aryanfar; Sergio Vazquez; Adam J Gordon; Michihiko Goto; Matthew Bidwell Goetz; Steven Shoptaw; Alex A T Bui
Journal:  Open Forum Infect Dis       Date:  2022-09-12       Impact factor: 4.423

4.  EMR Coding with Semi-Parametric Multi-Head Matching Networks.

Authors:  Anthony Rios; Ramakanth Kavuluru
Journal:  Proc Conf       Date:  2018-06
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.