Data integration of structured and unstructured sources for assigning clinical codes to patient stays.

Elyne Scheurwegs, Kim Luyckx, Léon Luyten, Walter Daelemans, Tim Van den Bulcke.

Abstract

OBJECTIVE: Enormous amounts of healthcare data are becoming increasingly accessible through the large-scale adoption of electronic health records. In this work, structured and unstructured (textual) data are combined to assign clinical diagnostic and procedural codes (specifically ICD-9-CM) to patient stays. We investigate whether integrating these heterogeneous data types improves prediction strength compared to using the data types in isolation.
METHODS: Two data integration approaches were evaluated. Early data integration combines the features of several sources within a single model, whereas late data integration learns a separate model per data source and combines their predictions with a meta-learner. Both approaches are evaluated on data sources and clinical codes from a broad set of medical specialties.
RESULTS: When compared with the best individual prediction source, late data integration leads to improvements in predictive power (e.g., overall F-measure increased from 30.6% to 38.3% for International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM) diagnostic codes), while early data integration is less consistent. Predictive strength differs strongly between medical specialties, both for ICD-9-CM diagnostic and procedural codes.
DISCUSSION: Structured data provide complementary information to unstructured data (and vice versa) for predicting ICD-9-CM codes. This complementarity can be captured most effectively by the proposed late data integration approach.
CONCLUSIONS: We demonstrated that models using multiple electronic health record data sources systematically outperform models using data sources in isolation in the task of predicting ICD-9-CM codes over a broad range of medical specialties.
© The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
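
The two integration strategies named in METHODS can be illustrated with a minimal sketch. It is not the authors' implementation: the toy feature values, the tiny logistic-regression learner, and all function names (`fit_logreg`, `early_integration`, `late_integration`, `late_predict`) are assumptions made purely for illustration, and the abstract does not state which learners or meta-learner were actually used.

```python
import math

def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def predict_proba(w, x):
    # w holds one weight per feature, with the bias stored last.
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + w[-1])

def fit_logreg(X, y, lr=0.5, epochs=200):
    # Tiny logistic regression trained by batch gradient descent (illustrative only).
    w = [0.0] * (len(X[0]) + 1)
    n = len(X)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for xi, yi in zip(X, y):
            err = predict_proba(w, xi) - yi
            for j, v in enumerate(xi):
                grad[j] += err * v
            grad[-1] += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad)]
    return w

def early_integration(sources, y):
    # Early integration: concatenate the features of all sources, fit ONE model.
    X = [sum((list(src[i]) for src in sources), []) for i in range(len(y))]
    return fit_logreg(X, y), X

def late_integration(sources, y):
    # Late integration: fit one model per source, then a meta-learner
    # on the per-source predicted probabilities.
    # Assumption: for brevity the meta-learner sees in-sample predictions;
    # a real setup would normally use held-out (cross-validated) predictions.
    base = [fit_logreg(src, y) for src in sources]
    meta_X = [[predict_proba(w, src[i]) for w, src in zip(base, sources)]
              for i in range(len(y))]
    return base, fit_logreg(meta_X, y)

def late_predict(base, meta, stay_sources):
    # stay_sources: one feature vector per data source for a single patient stay.
    meta_x = [predict_proba(w, x) for w, x in zip(base, stay_sources)]
    return predict_proba(meta, meta_x)

# Hypothetical toy data: six patient stays, one binary target code.
structured = [[1, 0], [1, 1], [0, 0], [0, 1], [1, 0], [0, 0]]   # e.g. coded flags
text = [[2, 0, 1], [1, 0, 0], [0, 1, 0], [0, 2, 1], [1, 1, 0], [0, 0, 0]]  # e.g. term counts
labels = [1, 1, 0, 0, 1, 0]

w_early, X_early = early_integration([structured, text], labels)
base_models, meta_model = late_integration([structured, text], labels)
p_late = late_predict(base_models, meta_model, [structured[0], text[0]])
```

In this sketch each clinical code would be treated as its own binary problem; whether the study framed the task that way is not stated in the abstract.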

Keywords:  clinical coding; data integration; data mining; electronic health records; international classification of diseases

Year: 2015; PMID: 26316458; PMCID: PMC4954635; DOI: 10.1093/jamia/ocv115

Source DB: PubMed; Journal: J Am Med Inform Assoc; ISSN: 1067-5027; Impact factor: 4.497


