Literature DB >> 26958183

Semi-supervised Learning for Phenotyping Tasks.

Dmitriy Dligach1, Timothy Miller1, Guergana K Savova1.   

Abstract

Supervised learning is the dominant approach to automatic electronic health records-based phenotyping, but it is expensive due to the cost of manual chart review. Semi-supervised learning takes advantage of both scarce labeled and plentiful unlabeled data. In this work, we study a family of semi-supervised learning algorithms based on Expectation Maximization (EM) in the context of several phenotyping tasks. We first experiment with the basic EM algorithm. When the modeling assumptions are violated, basic EM leads to inaccurate parameter estimation. Augmented EM attenuates this shortcoming by introducing a weighting factor that downweights the unlabeled data. Cross-validation does not always lead to the best setting of the weighting factor and other heuristic methods may be preferred. We show that accurate phenotyping models can be trained with only a few hundred labeled (and a large number of unlabeled) examples, potentially providing substantial savings in the amount of the required manual chart review.

Entities:  

Mesh:

Year:  2015        PMID: 26958183      PMCID: PMC4765699     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  13 in total

1.  Exploring semantic groups through visual approaches.

Authors:  Olivier Bodenreider; Alexa T McCray
Journal:  J Biomed Inform       Date:  2003-12       Impact factor: 6.317

2.  A translational engine at the national scale: informatics for integrating biology and the bedside.

Authors:  Isaac S Kohane; Susanne E Churchill; Shawn N Murphy
Journal:  J Am Med Inform Assoc       Date:  2011-11-10       Impact factor: 4.497

3.  What to expect from the Pharmacogenomics Research Network.

Authors:  R M Long; J M Berg
Journal:  Clin Pharmacol Ther       Date:  2011-03       Impact factor: 6.875

4.  Improving case definition of Crohn's disease and ulcerative colitis in electronic medical records using natural language processing: a novel informatics approach.

Authors:  Ashwin N Ananthakrishnan; Tianxi Cai; Guergana Savova; Su-Chun Cheng; Pei Chen; Raul Guzman Perez; Vivian S Gainer; Shawn N Murphy; Peter Szolovits; Zongqi Xia; Stanley Shaw; Susanne Churchill; Elizabeth W Karlson; Isaac Kohane; Robert M Plenge; Katherine P Liao
Journal:  Inflamm Bowel Dis       Date:  2013-06       Impact factor: 5.325

5.  Electronic medical records for discovery research in rheumatoid arthritis.

Authors:  Katherine P Liao; Tianxi Cai; Vivian Gainer; Sergey Goryachev; Qing Zeng-treitler; Soumya Raychaudhuri; Peter Szolovits; Susanne Churchill; Shawn Murphy; Isaac Kohane; Elizabeth W Karlson; Robert M Plenge
Journal:  Arthritis Care Res (Hoboken)       Date:  2010-08       Impact factor: 4.794

6.  The eMERGE Network: a consortium of biorepositories linked to electronic medical records data for conducting genomic studies.

Authors:  Catherine A McCarty; Rex L Chisholm; Christopher G Chute; Iftikhar J Kullo; Gail P Jarvik; Eric B Larson; Rongling Li; Daniel R Masys; Marylyn D Ritchie; Dan M Roden; Jeffery P Struewing; Wendy A Wolf
Journal:  BMC Med Genomics       Date:  2011-01-26       Impact factor: 3.063

7.  Psychiatric co-morbidity is associated with increased risk of surgery in Crohn's disease.

Authors:  A N Ananthakrishnan; V S Gainer; R G Perez; T Cai; S-C Cheng; G Savova; P Chen; P Szolovits; Z Xia; P L De Jager; S Y Shaw; S Churchill; E W Karlson; I Kohane; R H Perlis; R M Plenge; S N Murphy; K P Liao
Journal:  Aliment Pharmacol Ther       Date:  2013-01-07       Impact factor: 8.171

8.  Semi-supervised clinical text classification with Laplacian SVMs: an application to cancer case management.

Authors:  Vijay Garla; Caroline Taylor; Cynthia Brandt
Journal:  J Biomed Inform       Date:  2013-07-08       Impact factor: 6.317

Review 9.  A review of approaches to identifying patient phenotype cohorts using electronic health records.

Authors:  Chaitanya Shivade; Preethi Raghavan; Eric Fosler-Lussier; Peter J Embi; Noemie Elhadad; Stephen B Johnson; Albert M Lai
Journal:  J Am Med Inform Assoc       Date:  2013-11-07       Impact factor: 4.497

10.  Automatic prediction of rheumatoid arthritis disease activity from the electronic medical records.

Authors:  Chen Lin; Elizabeth W Karlson; Helena Canhao; Timothy A Miller; Dmitriy Dligach; Pei Jun Chen; Raul Natanael Guzman Perez; Yuanyan Shen; Michael E Weinblatt; Nancy A Shadick; Robert M Plenge; Guergana K Savova
Journal:  PLoS One       Date:  2013-08-16       Impact factor: 3.240

View more
  2 in total

1.  Clinical Document Classification Using Labeled and Unlabeled Data Across Hospitals.

Authors:  Hamed Hassanzadeh; Mahnoosh Kholghi; Anthony Nguyen; Kevin Chu
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

2.  Pre-training phenotyping classifiers.

Authors:  Dmitriy Dligach; Majid Afshar; Timothy Miller
Journal:  J Biomed Inform       Date:  2020-11-28       Impact factor: 6.317

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.