Literature DB >> 29208328

Different approaches for identifying important concepts in probabilistic biomedical text summarization.

Milad Moradi1, Nasser Ghadiri2.   

Abstract

Automatic text summarization tools help users in the biomedical domain to acquire their intended information from various textual resources more efficiently. Some of biomedical text summarization systems put the basis of their sentence selection approach on the frequency of concepts extracted from the input text. However, it seems that exploring other measures rather than the raw frequency for identifying valuable contents within an input document, or considering correlations existing between concepts, may be more useful for this type of summarization. In this paper, we describe a Bayesian summarization method for biomedical text documents. The Bayesian summarizer initially maps the input text to the Unified Medical Language System (UMLS) concepts; then it selects the important ones to be used as classification features. We introduce six different feature selection approaches to identify the most important concepts of the text and select the most informative contents according to the distribution of these concepts. We show that with the use of an appropriate feature selection approach, the Bayesian summarizer can improve the performance of biomedical summarization. Using the Recall-Oriented Understudy for Gisting Evaluation (ROUGE) toolkit, we perform extensive evaluations on a corpus of scientific papers in the biomedical domain. The results show that when the Bayesian summarizer utilizes the feature selection methods that do not use the raw frequency, it can outperform the biomedical summarizers that rely on the frequency of concepts, domain-independent and baseline methods.
Copyright © 2017 Elsevier B.V. All rights reserved.

Keywords:  Bayesian classification; Data mining; Feature selection; Medical text mining; Sentence classification; UMLS concept

Mesh:

Year:  2017        PMID: 29208328     DOI: 10.1016/j.artmed.2017.11.004

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  1 in total

1.  A systematic review of automatic text summarization for biomedical literature and EHRs.

Authors:  Mengqian Wang; Manhua Wang; Fei Yu; Yue Yang; Jennifer Walker; Javed Mostafa
Journal:  J Am Med Inform Assoc       Date:  2021-09-18       Impact factor: 7.942

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.