Literature DB >> 32439479

Summarization of biomedical articles using domain-specific word embeddings and graph ranking.

Milad Moradi1, Maedeh Dashti2, Matthias Samwald3.   

Abstract

Text summarization tools can help biomedical researchers and clinicians reduce the time and effort needed for acquiring important information from numerous documents. It has been shown that the input text can be modeled as a graph, and important sentences can be selected by identifying central nodes within the graph. However, the effective representation of documents, quantifying the relatedness of sentences, and selecting the most informative sentences are main challenges that need to be addressed in graph-based summarization. In this paper, we address these challenges in the context of biomedical text summarization. We evaluate the efficacy of a graph-based summarizer using different types of context-free and contextualized embeddings. The word representations are produced by pre-training neural language models on large corpora of biomedical texts. The summarizer models the input text as a graph in which the strength of relations between sentences is measured using the domain specific vector representations. We also assess the usefulness of different graph ranking techniques in the sentence selection step of our summarization method. Using the common Recall-Oriented Understudy for Gisting Evaluation (ROUGE) metrics, we evaluate the performance of our summarizer against various comparison methods. The results show that when the summarizer utilizes proper combinations of context-free and contextualized embeddings, along with an effective ranking method, it can outperform the other methods. We demonstrate that the best settings of our graph-based summarizer can efficiently improve the informative content of summaries and decrease the redundancy.
Copyright © 2020 Elsevier Inc. All rights reserved.

Keywords:  Deep learning; Graph ranking; Medical text mining; Natural language processing; Text summarization; Word embedding

Mesh:

Year:  2020        PMID: 32439479     DOI: 10.1016/j.jbi.2020.103452

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  1 in total

1.  A systematic review of automatic text summarization for biomedical literature and EHRs.

Authors:  Mengqian Wang; Manhua Wang; Fei Yu; Yue Yang; Jennifer Walker; Javed Mostafa
Journal:  J Am Med Inform Assoc       Date:  2021-09-18       Impact factor: 7.942

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.