Literature DB >> 19328036

Combining hidden Markov models and latent semantic analysis for topic segmentation and labeling: method and clinical application.

Filip Ginter1, Hanna Suominen, Sampo Pyysalo, Tapio Salakoski.   

Abstract

MOTIVATION: Topic segmentation and labeling systems enable fine-grained information search. However, previously proposed methods require annotated data to adapt to different information needs and have limited applicability to texts with short segment length.
METHODS: We introduce an unsupervised method based on a combination of hidden Markov models and latent semantic analysis which allows the topics of interest to be defined freely, without the need for data annotation, and can identify short segments.
RESULTS: The method is evaluated on intensive care nursing narratives and motivated by information needs in this domain. The method is shown to considerably outperform a keyword-based heuristic baseline and to achieve a level of performance comparable to that of a related supervised method trained on 3600 manually annotated words.

Mesh:

Year:  2009        PMID: 19328036     DOI: 10.1016/j.ijmedinf.2009.02.003

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.046


  1 in total

1.  Extracting diagnoses and investigation results from unstructured text in electronic health records by semi-supervised machine learning.

Authors:  Zhuoran Wang; Anoop D Shah; A Rosemary Tate; Spiros Denaxas; John Shawe-Taylor; Harry Hemingway
Journal:  PLoS One       Date:  2012-01-19       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.