
MEDRank: using graph-based concept ranking to index biomedical texts.

Jorge R Herskovic, Trevor Cohen, Devika Subramanian, M Sriram Iyengar, Jack W Smith, Elmer V Bernstam.

Abstract

BACKGROUND: As the volume of biomedical text increases exponentially, automatic indexing becomes increasingly important. However, existing approaches do not distinguish central (or core) concepts from concepts that were mentioned in passing. We focus on the problem of indexing MEDLINE records, a process that is currently performed by highly trained humans at the National Library of Medicine (NLM). NLM indexers are assisted by a system called the Medical Text Indexer (MTI) that suggests candidate indexing terms.
OBJECTIVE: To improve the ability of MTI to select the core terms in MEDLINE abstracts. These core concepts are deemed to be most important and are designated as "major headings" by MEDLINE indexers. We introduce and evaluate a graph-based indexing methodology called MEDRank that generates concept graphs from biomedical text and then ranks the concepts within these graphs to identify the most important ones.
METHODS: We inserted a MEDRank step into the MTI and compared MTI's output with and without MEDRank to the MEDLINE indexers' selected terms for a sample of 11,803 PubMed Central articles. We also tested whether human raters preferred terms generated by the MEDLINE indexers, MTI without MEDRank, or MTI with MEDRank for a sample of 36 PubMed Central articles.
RESULTS: MEDRank improved recall of the major headings designated by MEDLINE indexers by 30% over MTI without MEDRank (0.489 vs. 0.376). Overall recall was only slightly (6.5%) higher (0.490 vs. 0.460), as was F(2) (3%, 0.408 vs. 0.396). However, overall precision was 3.9% lower (0.268 vs. 0.279). Human raters preferred terms generated by MTI with MEDRank over terms generated by MTI without MEDRank (by an average of 1.00 more term per article), and preferred terms generated by MTI with MEDRank and by the MEDLINE indexers at the same rate.
CONCLUSIONS: The addition of MEDRank to MTI significantly improved the retrieval of core concepts in MEDLINE abstracts and more closely matched human expectations compared to MTI without MEDRank. In addition, MEDRank slightly improved overall recall and F(2).
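The recall, precision, and F(2) figures above can be related to each other with a short sketch. It assumes the standard set-based definitions of precision and recall and the standard F-beta formula with beta = 2 (which weights recall more heavily than precision); the abstract does not say how scores were averaged across articles, so plugging the reported aggregate precision and recall into `f_beta` need not reproduce the reported F(2). The function names and the toy term sets are hypothetical and are not taken from the paper.

```python
def precision_recall(suggested: set, gold: set) -> tuple:
    """Set-based precision and recall of suggested terms against gold terms."""
    true_positives = len(suggested & gold)
    precision = true_positives / len(suggested) if suggested else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


def f_beta(precision: float, recall: float, beta: float = 2.0) -> float:
    """Standard F-beta score; beta > 1 favors recall over precision."""
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def relative_change(new: float, old: float) -> float:
    """Relative change of `new` over `old`, in percent."""
    return (new - old) / old * 100.0


# Toy example (hypothetical term sets, for illustration only).
suggested = {"Humans", "MEDLINE", "Abstracting and Indexing", "Algorithms"}
gold = {"MEDLINE", "Abstracting and Indexing", "Medical Subject Headings"}
p, r = precision_recall(suggested, gold)
toy_f2 = f_beta(p, r)

# Relative changes recomputed from the values reported in RESULTS.
major_heading_recall_gain = relative_change(0.489, 0.376)  # about 30%
overall_recall_gain = relative_change(0.490, 0.460)        # about 6.5%
f2_gain = relative_change(0.408, 0.396)                    # about 3%
precision_change = relative_change(0.268, 0.279)           # about -3.9%
```

Recomputing the relative changes this way agrees with the percentages quoted in the abstract.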
Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
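The general idea of ranking concepts within a concept graph can be illustrated with a minimal sketch. The abstract does not specify how MEDRank builds its graphs or which ranking algorithm it uses, so everything below is an assumption made for illustration: concepts that co-occur in a sentence are linked by a weighted edge, and nodes are scored with a plain PageRank power iteration. The function names and the example concepts are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def build_cooccurrence_graph(sentences):
    """Undirected weighted graph: each sentence-level co-occurrence adds 1 to an edge.

    ASSUMPTION: sentence co-occurrence is a stand-in; the paper's graph
    construction is not described in the abstract.
    """
    graph = defaultdict(lambda: defaultdict(float))
    for concepts in sentences:
        for a, b in combinations(sorted(set(concepts)), 2):
            graph[a][b] += 1.0
            graph[b][a] += 1.0
    return graph


def pagerank(graph, damping=0.85, iterations=50):
    """Weighted PageRank by power iteration (ASSUMPTION: stand-in ranking method)."""
    nodes = list(graph)
    if not nodes:
        return {}
    n = len(nodes)
    # Total outgoing edge weight per node; every node has at least one edge.
    strength = {node: sum(graph[node].values()) for node in nodes}
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        rank = {
            node: (1 - damping) / n
            + damping * sum(rank[nbr] * w / strength[nbr]
                            for nbr, w in graph[node].items())
            for node in nodes
        }
    return rank


# Hypothetical concepts extracted from three sentences of one abstract.
sentences = [
    ["Neoplasms", "Drug Therapy"],
    ["Neoplasms", "Mutation"],
    ["Neoplasms", "Drug Therapy", "Humans"],
]
scores = pagerank(build_cooccurrence_graph(sentences))
ranked = sorted(scores, key=scores.get, reverse=True)
```

In this toy input the concept linked to the most other concepts ends up first in `ranked`, which is the intuition behind separating core concepts from those mentioned in passing.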

Year: 2011   PMID: 21439897   PMCID: PMC3090689   DOI: 10.1016/j.ijmedinf.2011.02.008

Source DB: PubMed   Journal: Int J Med Inform   ISSN: 1386-5056   Impact factor: 4.046

