Data integration of structured and unstructured sources for assigning clinical codes to patient stays.

Elyne Scheurwegs, Kim Luyckx, Léon Luyten, Walter Daelemans, Tim Van den Bulcke.

Abstract

OBJECTIVE: Enormous amounts of healthcare data are becoming increasingly accessible through the large-scale adoption of electronic health records. In this work, structured and unstructured (textual) data are combined to assign clinical diagnostic and procedural codes (specifically ICD-9-CM) to patient stays. We investigate whether integrating these heterogeneous data types improves prediction strength compared to using the data types in isolation.
METHODS: Two separate data integration approaches were evaluated. Early data integration combines features of several sources within a single model, and late data integration learns a separate model per data source and combines these predictions with a meta-learner. This is evaluated on data sources and clinical codes from a broad set of medical specialties.
RESULTS: When compared with the best individual prediction source, late data integration leads to improvements in predictive power (eg, overall F-measure increased from 30.6% to 38.3% for International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM) diagnostic codes), while early data integration is less consistent. The predictive strength strongly differs between medical specialties, both for ICD-9-CM diagnostic and procedural codes.
DISCUSSION: Structured data provides complementary information to unstructured data (and vice versa) for predicting ICD-9-CM codes. This can be captured most effectively by the proposed late data integration approach.
CONCLUSIONS: We demonstrated that models using multiple electronic health record data sources systematically outperform models using data sources in isolation in the task of predicting ICD-9-CM codes over a broad range of medical specialties.
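
The two integration strategies described under METHODS can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the learners used, so a tiny logistic regression stands in for both the per-source models and the meta-learner, the task is reduced to a single binary code, and the toy "structured" and "text" features are synthetic. All function names are hypothetical.

```python
import math
import random

def sigmoid(z):
    z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
    return 1.0 / (1.0 + math.exp(-z))

def predict(w, x):
    """Probability of the positive class; the last weight is the bias."""
    return sigmoid(w[-1] + sum(wj * xj for wj, xj in zip(w, x)))

def fit(X, y, epochs=300, lr=0.5):
    """Tiny logistic regression trained by batch gradient descent (stand-in learner)."""
    w = [0.0] * (len(X[0]) + 1)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for x, label in zip(X, y):
            err = predict(w, x) - label
            for j, xj in enumerate(x):
                grad[j] += err * xj
            grad[-1] += err
        w = [wj - lr * gj / len(X) for wj, gj in zip(w, grad)]
    return w

def concat(sources):
    """Join the feature rows of all sources, sample by sample."""
    return [sum(rows, []) for rows in zip(*sources)]

def early_integration(sources, y):
    """Early integration: a single model over the concatenated features."""
    return fit(concat(sources), y)

def late_integration(sources, y, split):
    """Late integration: one model per source, combined by a meta-learner.

    Base models are fit on samples [:split]; the meta-learner is fit on the
    base models' predictions for samples [split:], i.e. on held-out outputs.
    """
    base = [fit(X[:split], y[:split]) for X in sources]
    meta_X = [[predict(w, X[i]) for w, X in zip(base, sources)]
              for i in range(split, len(y))]
    meta = fit(meta_X, y[split:])
    return base, meta

def predict_late(base, meta, rows):
    """rows holds one feature row per source for a single patient stay."""
    return predict(meta, [predict(w, x) for w, x in zip(base, rows)])

# Synthetic toy data (assumption): label 1 means the code applies to the stay.
rng = random.Random(0)
y = [i % 2 for i in range(200)]
structured = [[label + rng.gauss(0, 0.7)] for label in y]
text = [[label + rng.gauss(0, 0.7), rng.gauss(0, 1)] for label in y]

early_w = early_integration([structured, text], y)
base, meta = late_integration([structured, text], y, split=100)

p_early = predict(early_w, structured[0] + text[0])
p_late = predict_late(base, meta, [structured[0], text[0]])
print(round(p_early, 3), round(p_late, 3))
```

In this sketch the meta-learner sees only one probability per source, so a weak source can be down-weighted as a whole rather than feature by feature. Whether that accounts for the more consistent behavior reported for late integration is not something the sketch can show.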

Keywords: clinical coding; data integration; data mining; electronic health records; international classification of diseases

Year: 2015 PMID: 26316458 PMCID: PMC4954635 DOI: 10.1093/jamia/ocv115

Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
