Literature DB >> 24076750

Supervised embedding of textual predictors with applications in clinical diagnostics for pediatric cardiology.

Thomas Ernest Perry1, Hongyuan Zha, Ke Zhou, Patricio Frias, Dadan Zeng, Mark Braunstein.   

Abstract

OBJECTIVE: Electronic health records possess critical predictive information for machine-learning-based diagnostic aids. However, many traditional machine learning methods fail to simultaneously integrate textual data into the prediction process because of its high dimensionality. In this paper, we present a supervised method using Laplacian Eigenmaps to enable existing machine learning methods to estimate both low-dimensional representations of textual data and accurate predictors based on these low-dimensional representations at the same time.
MATERIALS AND METHODS: We present a supervised Laplacian Eigenmap method to enhance predictive models by embedding textual predictors into a low-dimensional latent space, which preserves the local similarities among textual data in high-dimensional space. The proposed implementation performs alternating optimization using gradient descent. For the evaluation, we applied our method to over 2000 patient records from a large single-center pediatric cardiology practice to predict if patients were diagnosed with cardiac disease. In our experiments, we consider relatively short textual descriptions because of data availability. We compared our method with latent semantic indexing, latent Dirichlet allocation, and local Fisher discriminant analysis. The results were assessed using four metrics: the area under the receiver operating characteristic curve (AUC), Matthews correlation coefficient (MCC), specificity, and sensitivity. RESULTS AND DISCUSSION: The results indicate that supervised Laplacian Eigenmaps was the highest performing method in our study, achieving 0.782 and 0.374 for AUC and MCC, respectively. Supervised Laplacian Eigenmaps showed an increase of 8.16% in AUC and 20.6% in MCC over the baseline that excluded textual data and a 2.69% and 5.35% increase in AUC and MCC, respectively, over unsupervised Laplacian Eigenmaps.
CONCLUSIONS: As a solution, we present a supervised Laplacian Eigenmap method to embed textual predictors into a low-dimensional Euclidean space. This method allows many existing machine learning predictors to effectively and efficiently capture the potential of textual predictors, especially those based on short texts.

Entities:  

Keywords:  Clinical Diagnostics; Eigenmaps; Embedding; Supervised Learning

Mesh:

Year:  2013        PMID: 24076750      PMCID: PMC3957389          DOI: 10.1136/amiajnl-2013-001792

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  10 in total

1.  Analysing and improving the diagnosis of ischaemic heart disease with machine learning.

Authors:  M Kukar; I Kononenko; C Groselj; K Kralj; J Fettich
Journal:  Artif Intell Med       Date:  1999-05       Impact factor: 5.326

2.  Identification of documented medication non-adherence in physician notes.

Authors:  Alexander Turchin; Holly I Wheeler; Matthew Labreche; Julia T Chu; Merri L Pendergrass; Jonathan S Einbinder; Jonathan Seth Einbinder
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

3.  Small-sample precision of ROC-related estimates.

Authors:  Blaise Hanczar; Jianping Hua; Chao Sima; John Weinstein; Michael Bittner; Edward R Dougherty
Journal:  Bioinformatics       Date:  2010-02-03       Impact factor: 6.937

4.  Primary care physicians should be coordinators, not gatekeepers.

Authors:  T Bodenheimer; B Lo; L Casalino
Journal:  JAMA       Date:  1999-06-02       Impact factor: 56.272

5.  Utilization of critical care services among patients undergoing total hip and knee arthroplasty: epidemiology and risk factors.

Authors:  Stavros G Memtsoudis; Xuming Sun; Ya-Lin Chiu; Michael Nurok; Ottokar Stundner; Stephen M Pastores; Madhu Mazumdar
Journal:  Anesthesiology       Date:  2012-07       Impact factor: 7.892

6.  Evaluation of radiological features for breast tumour classification in clinical screening with machine learning methods.

Authors:  Tim W Nattkemper; Bert Arnrich; Oliver Lichte; Wiebke Timm; Andreas Degenhard; Linda Pointon; Carmel Hayes; Martin O Leach
Journal:  Artif Intell Med       Date:  2004-12-16       Impact factor: 5.326

7.  Cardiologist versus internist management of patients with unstable angina: treatment patterns and outcomes.

Authors:  T L Schreiber; A Elkhatib; C L Grines; W W O'Neill
Journal:  J Am Coll Cardiol       Date:  1995-09       Impact factor: 24.094

8.  A comparison of methods for assessing penetrating trauma on retrospective multi-center data.

Authors:  Bilal A Ahmed; Michael E Matheny; Phillip L Rice; John R Clarke; Omolola I Ogunyemi
Journal:  J Biomed Inform       Date:  2008-10-01       Impact factor: 6.317

9.  Usefulness of patient symptoms and nasal endoscopy in the diagnosis of chronic sinusitis.

Authors:  K W Rosbe; K R Jones
Journal:  Am J Rhinol       Date:  1998 May-Jun

10.  Utility of a clinical support tool for outpatient evaluation of pediatric chest pain.

Authors:  Thomas Perry; Hongyuan Zha; Matthew E Oster; Patricio A Frias; Mark Braunstein
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.