Literature DB >> 28410982

EHR-based phenotyping: Bulk learning and evaluation.

Po-Hsiang Chiu1, George Hripcsak2.   

Abstract

In data-driven phenotyping, a core computational task is to identify medical concepts and their variations from sources of electronic health records (EHR) to stratify phenotypic cohorts. A conventional analytic framework for phenotyping largely uses a manual knowledge engineering approach or a supervised learning approach where clinical cases are represented by variables encompassing diagnoses, medicinal treatments and laboratory tests, among others. In such a framework, tasks associated with feature engineering and data annotation remain a tedious and expensive exercise, resulting in poor scalability. In addition, certain clinical conditions, such as those that are rare and acute in nature, may never accumulate sufficient data over time, which poses a challenge to establishing accurate and informative statistical models. In this paper, we use infectious diseases as the domain of study to demonstrate a hierarchical learning method based on ensemble learning that attempts to address these issues through feature abstraction. We use a sparse annotation set to train and evaluate many phenotypes at once, which we call bulk learning. In this batch-phenotyping framework, disease cohort definitions can be learned from within the abstract feature space established by using multiple diseases as a substrate and diagnostic codes as surrogates. In particular, using surrogate labels for model training renders possible its subsequent evaluation using only a sparse annotated sample. Moreover, statistical models can be trained and evaluated, using the same sparse annotation, from within the abstract feature space of low dimensionality that encapsulates the shared clinical traits of these target diseases, collectively referred to as the bulk learning set.
Copyright © 2017 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Disease modeling; EHR phenotyping; Ensemble learning; Feature learning; Knowledge representation; Stacked generalization

Mesh:

Year:  2017        PMID: 28410982      PMCID: PMC5934756          DOI: 10.1016/j.jbi.2017.04.009

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  26 in total

1.  Orthogonal forward selection and backward elimination algorithms for feature subset selection.

Authors:  K Z Mao
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  2004-02

2.  Deep phenotyping: The details of disease.

Authors:  Cathryn M Delude
Journal:  Nature       Date:  2015-11-05       Impact factor: 49.962

3.  Using stacked generalization to predict membrane protein types based on pseudo-amino acid composition.

Authors:  Shuang-Quan Wang; Jie Yang; Kuo-Chen Chou
Journal:  J Theor Biol       Date:  2006-05-16       Impact factor: 2.691

4.  Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network.

Authors:  Katherine M Newton; Peggy L Peissig; Abel Ngo Kho; Suzette J Bielinski; Richard L Berg; Vidhu Choudhary; Melissa Basford; Christopher G Chute; Iftikhar J Kullo; Rongling Li; Jennifer A Pacheco; Luke V Rasmussen; Leslie Spangler; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2013-03-26       Impact factor: 4.497

5.  Identification of type 2 diabetes subgroups through topological analysis of patient similarity.

Authors:  Li Li; Wei-Yi Cheng; Benjamin S Glicksberg; Omri Gottesman; Ronald Tamler; Rong Chen; Erwin P Bottinger; Joel T Dudley
Journal:  Sci Transl Med       Date:  2015-10-28       Impact factor: 17.956

6.  Toward high-throughput phenotyping: unbiased automated feature extraction and selection from knowledge sources.

Authors:  Sheng Yu; Katherine P Liao; Stanley Y Shaw; Vivian S Gainer; Susanne E Churchill; Peter Szolovits; Shawn N Murphy; Isaac S Kohane; Tianxi Cai
Journal:  J Am Med Inform Assoc       Date:  2015-04-29       Impact factor: 4.497

Review 7.  Deep phenotyping for precision medicine.

Authors:  Peter N Robinson
Journal:  Hum Mutat       Date:  2012-05       Impact factor: 4.878

8.  Identifying phenotypic signatures of neuropsychiatric disorders from electronic medical records.

Authors:  Svetlana Lyalina; Bethany Percha; Paea LePendu; Srinivasan V Iyer; Russ B Altman; Nigam H Shah
Journal:  J Am Med Inform Assoc       Date:  2013-08-16       Impact factor: 4.497

9.  Representation learning: a unified deep learning framework for automatic prostate MR segmentation.

Authors:  Shu Liao; Yaozong Gao; Aytekin Oto; Dinggang Shen
Journal:  Med Image Comput Comput Assist Interv       Date:  2013

10.  Electronic medical record phenotyping using the anchor and learn framework.

Authors:  Yoni Halpern; Steven Horng; Youngduck Choi; David Sontag
Journal:  J Am Med Inform Assoc       Date:  2016-04-23       Impact factor: 4.497

View more
  10 in total

1.  High-throughput multimodal automated phenotyping (MAP) with application to PheWAS.

Authors:  Katherine P Liao; Jiehuan Sun; Tianrun A Cai; Nicholas Link; Chuan Hong; Jie Huang; Jennifer E Huffman; Jessica Gronsbell; Yichi Zhang; Yuk-Lam Ho; Victor Castro; Vivian Gainer; Shawn N Murphy; Christopher J O'Donnell; J Michael Gaziano; Kelly Cho; Peter Szolovits; Isaac S Kohane; Sheng Yu; Tianxi Cai
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

2.  Enabling phenotypic big data with PheNorm.

Authors:  Sheng Yu; Yumeng Ma; Jessica Gronsbell; Tianrun Cai; Ashwin N Ananthakrishnan; Vivian S Gainer; Susanne E Churchill; Peter Szolovits; Shawn N Murphy; Isaac S Kohane; Katherine P Liao; Tianxi Cai
Journal:  J Am Med Inform Assoc       Date:  2018-01-01       Impact factor: 4.497

3.  Defining Phenotypes from Clinical Data to Drive Genomic Research.

Authors:  Jamie R Robinson; Wei-Qi Wei; Dan M Roden; Joshua C Denny
Journal:  Annu Rev Biomed Data Sci       Date:  2018-04-25

4.  Deep Phenotyping of Chinese Electronic Health Records by Recognizing Linguistic Patterns of Phenotypic Narratives With a Sequence Motif Discovery Tool: Algorithm Development and Validation.

Authors:  Shicheng Li; Lizong Deng; Xu Zhang; Luming Chen; Tao Yang; Yifan Qi; Taijiao Jiang
Journal:  J Med Internet Res       Date:  2022-06-03       Impact factor: 7.076

5.  Automated disease cohort selection using word embeddings from Electronic Health Records.

Authors:  Benjamin S Glicksberg; Riccardo Miotto; Kipp W Johnson; Khader Shameer; Li Li; Rong Chen; Joel T Dudley
Journal:  Pac Symp Biocomput       Date:  2018

Review 6.  Electronic health records and polygenic risk scores for predicting disease risk.

Authors:  Ruowang Li; Yong Chen; Marylyn D Ritchie; Jason H Moore
Journal:  Nat Rev Genet       Date:  2020-03-31       Impact factor: 53.242

7.  High-throughput phenotyping with temporal sequences.

Authors:  Hossein Estiri; Zachary H Strasser; Shawn N Murphy
Journal:  J Am Med Inform Assoc       Date:  2021-03-18       Impact factor: 4.497

8.  Heterogeneity introduced by EHR system implementation in a de-identified data resource from 100 non-affiliated organizations.

Authors:  Earl F Glynn; Mark A Hoffman
Journal:  JAMIA Open       Date:  2019-08-07

9.  Generative transfer learning for measuring plausibility of EHR diagnosis records.

Authors:  Hossein Estiri; Sebastien Vasey; Shawn N Murphy
Journal:  J Am Med Inform Assoc       Date:  2021-03-01       Impact factor: 4.497

10.  Automated Phenotyping Tool for Identifying Developmental Language Disorder Cases in Health Systems Data (APT-DLD): A New Research Algorithm for Deployment in Large-Scale Electronic Health Record Systems.

Authors:  Courtney E Walters; Rachana Nitin; Katherine Margulis; Olivia Boorom; Daniel E Gustavson; Catherine T Bush; Lea K Davis; Jennifer E Below; Nancy J Cox; Stephen M Camarata; Reyna L Gordon
Journal:  J Speech Lang Hear Res       Date:  2020-08-11       Impact factor: 2.297

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.