Literature DB >> 23115581

Extractive summarisation of medical documents using domain knowledge and corpus statistics.

Abeed Sarker1, Diego Mollá, Cecile Paris.   

Abstract

BACKGROUND: Evidence Based Medicine (EBM) practice requires practitioners to extract evidence from published medical research when answering clinical queries. Due to the time- consuming nature of this practice, there is a strong motivation for systems that can automatically summarise medical documents and help practitioners find relevant information. AIM: The aim of this work is to propose an automatic query- focused, extractive summarisation approach that selects informative sentences from medical documents.
METHOD: We use a corpus that is specifically designed for summarisation in the EBM domain. We use approximately half the corpus for deriving important statistics associated with the best possible extractive summaries. We take into account factors such as sentence position, length, sentence content, and the type of the query posed. Using the statistics from the first set, we evaluate our approach on a separate set. Evaluation of the qualities of the generated summaries is performed automatically using ROUGE, which is a popular tool for evaluating automatic summaries.
RESULTS: Our summarisation approach outperforms all baselines (best baseline score: 0.1594; our score 0.1653). Further improvements are achieved when query types are taken into account.
CONCLUSION: The quality of extractive summarisation in the medical domain can be significantly improved by incorporating domain knowledge and statistics derived from a specialised corpus. Such techniques can therefore be applied for content selection in end-to-end summarisation systems.

Keywords:  Automatic summarisation; extractive summarisation evidence based medicine; medical document summarisation

Year:  2012        PMID: 23115581      PMCID: PMC3477776          DOI: 10.4066/AMJ.2012.1361

Source DB:  PubMed          Journal:  Australas Med J        ISSN: 1836-1935


  3 in total

1.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Authors:  A R Aronson
Journal:  Proc AMIA Symp       Date:  2001

2.  Using outcome polarity in sentence extraction for medical question-answering.

Authors:  Yun Niu; Xiaodan Zhu; Graeme Hirst
Journal:  AMIA Annu Symp Proc       Date:  2006

3.  Automatic classification of sentences to support Evidence Based Medicine.

Authors:  Su Nam Kim; David Martinez; Lawrence Cavedon; Lars Yencken
Journal:  BMC Bioinformatics       Date:  2011-03-29       Impact factor: 3.169

  3 in total
  1 in total

Review 1.  Text summarization in the biomedical domain: a systematic review of recent research.

Authors:  Rashmi Mishra; Jiantao Bian; Marcelo Fiszman; Charlene R Weir; Siddhartha Jonnalagadda; Javed Mostafa; Guilherme Del Fiol
Journal:  J Biomed Inform       Date:  2014-07-10       Impact factor: 6.317

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.