Literature DB >> 26338219

Combining billing codes, clinical notes, and medications from electronic health records provides superior phenotyping performance.

Wei-Qi Wei1, Pedro L Teixeira1, Huan Mo1, Robert M Cronin2, Jeremy L Warner2, Joshua C Denny3.   

Abstract

OBJECTIVE: To evaluate the phenotyping performance of three major electronic health record (EHR) components: International Classification of Disease (ICD) diagnosis codes, primary notes, and specific medications.
MATERIALS AND METHODS: We conducted the evaluation using de-identified Vanderbilt EHR data. We preselected ten diseases: atrial fibrillation, Alzheimer's disease, breast cancer, gout, human immunodeficiency virus infection, multiple sclerosis, Parkinson's disease, rheumatoid arthritis, and types 1 and 2 diabetes mellitus. For each disease, patients were classified into seven categories based on the presence of evidence in diagnosis codes, primary notes, and specific medications. Twenty-five patients per disease category (a total number of 175 patients for each disease, 1750 patients for all ten diseases) were randomly selected for manual chart review. Review results were used to estimate the positive predictive value (PPV), sensitivity, andF-score for each EHR component alone and in combination.
RESULTS: The PPVs of single components were inconsistent and inadequate for accurately phenotyping (0.06-0.71). Using two or more ICD codes improved the average PPV to 0.84. We observed a more stable and higher accuracy when using at least two components (mean ± standard deviation: 0.91 ± 0.08). Primary notes offered the best sensitivity (0.77). The sensitivity of ICD codes was 0.67. Again, two or more components provided a reasonably high and stable sensitivity (0.59 ± 0.16). Overall, the best performance (Fscore: 0.70 ± 0.12) was achieved by using two or more components. Although the overall performance of using ICD codes (0.67 ± 0.14) was only slightly lower than using two or more components, its PPV (0.71 ± 0.13) is substantially worse (0.91 ± 0.08).
CONCLUSION: Multiple EHR components provide a more consistent and higher performance than a single one for the selected phenotypes. We suggest considering multiple EHR components for future phenotyping design in order to obtain an ideal result.
© The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  International Classification of Diseases; clinical notes; diagnosis codes; electronic health records; medications; phenotype; problem lists

Mesh:

Year:  2015        PMID: 26338219      PMCID: PMC4954637          DOI: 10.1093/jamia/ocv130

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  45 in total

1.  Development of an ensemble resource linking MEDications to their Indications (MEDI).

Authors:  Wei-Qi Wei; Robert M Cronin; Hua Xu; Thomas A Lasko; Lisa Bastarache; Joshua C Denny
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2013-03-18

2.  Evaluation of a method to identify and categorize section headers in clinical documents.

Authors:  Joshua C Denny; Anderson Spickard; Kevin B Johnson; Neeraja B Peterson; Josh F Peterson; Randolph A Miller
Journal:  J Am Med Inform Assoc       Date:  2009-08-28       Impact factor: 4.497

3.  Development and validation of an algorithm to identify patients newly diagnosed with HIV infection from electronic health records.

Authors:  Matthew Bidwell Goetz; Tuyen Hoang; Virginia L Kan; David Rimland; Maria Rodriguez-Barradas
Journal:  AIDS Res Hum Retroviruses       Date:  2014-03-20       Impact factor: 2.205

4.  Failure of ICD-9-CM codes to identify patients with comorbid chronic kidney disease in diabetes.

Authors:  Elizabeth F O Kern; Miriam Maney; Donald R Miller; Chin-Lin Tseng; Anjali Tiwari; Mangala Rajan; David Aron; Leonard Pogach
Journal:  Health Serv Res       Date:  2006-04       Impact factor: 3.402

5.  Characterization of statin dose response in electronic medical records.

Authors:  W-Q Wei; Q Feng; L Jiang; M S Waitara; O F Iwuchukwu; D M Roden; M Jiang; H Xu; R M Krauss; J I Rotter; D A Nickerson; R L Davis; R L Berg; P L Peissig; C A McCarty; R A Wilke; J C Denny
Journal:  Clin Pharmacol Ther       Date:  2013-10-04       Impact factor: 6.875

6.  The granularity of medical narratives and its effect on the speed and completeness of information retrieval.

Authors:  H J Tange; H C Schouten; A D Kester; A Hasman
Journal:  J Am Med Inform Assoc       Date:  1998 Nov-Dec       Impact factor: 4.497

7.  Validation and enhancement of a computable medication indication resource (MEDI) using a large practice-based dataset.

Authors:  Wei-Qi Wei; Jonathan D Mosley; Lisa Bastarache; Joshua C Denny
Journal:  AMIA Annu Symp Proc       Date:  2013-11-16

8.  The validity of using ICD-9 codes and pharmacy records to identify patients with chronic obstructive pulmonary disease.

Authors:  Colin R Cooke; Min J Joo; Stephen M Anderson; Todd A Lee; Edmunds M Udris; Eric Johnson; David H Au
Journal:  BMC Health Serv Res       Date:  2011-02-16       Impact factor: 2.655

9.  Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues.

Authors:  Hua Xu; Marianthi Markatou; Rositsa Dimova; Hongfang Liu; Carol Friedman
Journal:  BMC Bioinformatics       Date:  2006-07-05       Impact factor: 3.169

10.  Extracting research-quality phenotypes from electronic health records to support precision medicine.

Authors:  Wei-Qi Wei; Joshua C Denny
Journal:  Genome Med       Date:  2015-04-30       Impact factor: 11.117

View more
  73 in total

1.  Deep learning approaches for plethysmography signal quality assessment in the presence of atrial fibrillation.

Authors:  Tania Pereira; Cheng Ding; Kais Gadhoumi; Nate Tran; Rene A Colorado; Karl Meisel; Xiao Hu
Journal:  Physiol Meas       Date:  2019-12-27       Impact factor: 2.833

2.  Leveraging Big Data to Transform Drug Discovery.

Authors:  Benjamin S Glicksberg; Li Li; Rong Chen; Joel Dudley; Bin Chen
Journal:  Methods Mol Biol       Date:  2019

3.  A Roadmap for Foundational Research on Artificial Intelligence in Medical Imaging: From the 2018 NIH/RSNA/ACR/The Academy Workshop.

Authors:  Curtis P Langlotz; Bibb Allen; Bradley J Erickson; Jayashree Kalpathy-Cramer; Keith Bigelow; Tessa S Cook; Adam E Flanders; Matthew P Lungren; David S Mendelson; Jeffrey D Rudie; Ge Wang; Krishna Kandarpa
Journal:  Radiology       Date:  2019-04-16       Impact factor: 11.105

4.  Deep representation learning of electronic health records to unlock patient stratification at scale.

Authors:  Isotta Landi; Benjamin S Glicksberg; Hao-Chih Lee; Sarah Cherng; Giulia Landi; Matteo Danieletto; Joel T Dudley; Cesare Furlanello; Riccardo Miotto
Journal:  NPJ Digit Med       Date:  2020-07-17

Review 5.  Aspiring to Unintended Consequences of Natural Language Processing: A Review of Recent Developments in Clinical and Consumer-Generated Text Processing.

Authors:  D Demner-Fushman; N Elhadad
Journal:  Yearb Med Inform       Date:  2016-11-10

6.  Automated disease cohort selection using word embeddings from Electronic Health Records.

Authors:  Benjamin S Glicksberg; Riccardo Miotto; Kipp W Johnson; Khader Shameer; Li Li; Rong Chen; Joel T Dudley
Journal:  Pac Symp Biocomput       Date:  2018

7.  Automatic Methods to Extract New York Heart Association Classification from Clinical Notes.

Authors:  Rui Zhang; Sisi Ma; Liesa Shanahan; Jessica Munroe; Sarah Horn; Stuart Speedie
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2017-12-18

8.  Automating the Determination of Prostate Cancer Risk Strata From Electronic Medical Records.

Authors:  Justin R Gregg; Maximilian Lang; Lucy L Wang; Matthew J Resnick; Sandeep K Jain; Jeremy L Warner; Daniel A Barocas
Journal:  JCO Clin Cancer Inform       Date:  2017-06-08

9.  Comparison of the cohort selection performance of Australian Medicines Terminology to Anatomical Therapeutic Chemical mappings.

Authors:  Guan N Guo; Jitendra Jonnagaddala; Sanjay Farshid; Vojtech Huser; Christian Reich; Siaw-Teng Liaw
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

10.  Sleep health, diseases, and pain syndromes: findings from an electronic health record biobank.

Authors:  Hassan S Dashti; Brian E Cade; Gerda Stutaite; Richa Saxena; Susan Redline; Elizabeth W Karlson
Journal:  Sleep       Date:  2021-03-12       Impact factor: 5.849

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.