Literature DB >> 24625719

Learning to rank figures within a biomedical article.

Feifan Liu1, Hong Yu2.   

Abstract

Hundreds of millions of figures are available in biomedical literature, representing important biomedical experimental evidence. This ever-increasing sheer volume has made it difficult for scientists to effectively and accurately access figures of their interest, the process of which is crucial for validating research facts and for formulating or testing novel research hypotheses. Current figure search applications can't fully meet this challenge as the "bag of figures" assumption doesn't take into account the relationship among figures. In our previous study, hundreds of biomedical researchers have annotated articles in which they serve as corresponding authors. They ranked each figure in their paper based on a figure's importance at their discretion, referred to as "figure ranking". Using this collection of annotated data, we investigated computational approaches to automatically rank figures. We exploited and extended the state-of-the-art listwise learning-to-rank algorithms and developed a new supervised-learning model BioFigRank. The cross-validation results show that BioFigRank yielded the best performance compared with other state-of-the-art computational models, and the greedy feature selection can further boost the ranking performance significantly. Furthermore, we carry out the evaluation by comparing BioFigRank with three-level competitive domain-specific human experts: (1) First Author, (2) Non-Author-In-Domain-Expert who is not the author nor co-author of an article but who works in the same field of the corresponding author of the article, and (3) Non-Author-Out-Domain-Expert who is not the author nor co-author of an article and who may or may not work in the same field of the corresponding author of an article. Our results show that BioFigRank outperforms Non-Author-Out-Domain-Expert and performs as well as Non-Author-In-Domain-Expert. Although BioFigRank underperforms First Author, since most biomedical researchers are either in- or out-domain-experts for an article, we conclude that BioFigRank represents an artificial intelligence system that offers expert-level intelligence to help biomedical researchers to navigate increasingly proliferated big data efficiently.

Entities:  

Mesh:

Year:  2014        PMID: 24625719      PMCID: PMC3953065          DOI: 10.1371/journal.pone.0061567

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


  19 in total

1.  Finding correlations in big data.

Authors: 
Journal:  Nat Biotechnol       Date:  2012-04-10       Impact factor: 54.908

2.  How will big pictures emerge from a sea of biological data?

Authors:  Elizabeth Pennisi
Journal:  Science       Date:  2005-07-01       Impact factor: 47.728

3.  Towards answering biological questions with experimental evidence: automatically identifying text that summarize image content in full-text articles.

Authors:  Hong Yu
Journal:  AMIA Annu Symp Proc       Date:  2006

4.  GoldMiner: a radiology image search engine.

Authors:  Charles E Kahn; Cheng Thao
Journal:  AJR Am J Roentgenol       Date:  2007-06       Impact factor: 3.959

5.  Yale Image Finder (YIF): a new search engine for retrieving biomedical images.

Authors:  Songhua Xu; James McCusker; Michael Krauthammer
Journal:  Bioinformatics       Date:  2008-07-09       Impact factor: 6.937

6.  FigSum: automatically generating structured text summaries for figures in biomedical literature.

Authors:  Shashank Agarwal; Hong Yu
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

7.  Annotation and retrieval of clinically relevant images.

Authors:  Dina Demner-Fushman; Sameer Antani; Matthew Simpson; George R Thoma
Journal:  Int J Med Inform       Date:  2009-07-09       Impact factor: 4.046

8.  GeneTUKit: a software for document-level gene normalization.

Authors:  Minlie Huang; Jingchen Liu; Xiaoyan Zhu
Journal:  Bioinformatics       Date:  2011-02-08       Impact factor: 6.937

9.  Towards big data science in the decade ahead from ten years of InCoB and the 1st ISCB-Asia Joint Conference.

Authors:  Shoba Ranganathan; Christian Schönbach; Janet Kelso; Burkhard Rost; Sheila Nathan; Tin Wee Tan
Journal:  BMC Bioinformatics       Date:  2011-11-30       Impact factor: 3.169

10.  Beyond captions: linking figures with abstract sentences in biomedical articles.

Authors:  Joseph P Bockhorst; John M Conroy; Shashank Agarwal; Dianne P O'Leary; Hong Yu
Journal:  PLoS One       Date:  2012-07-18       Impact factor: 3.240

View more
  1 in total

1.  DeTEXT: A Database for Evaluating Text Extraction from Biomedical Literature Figures.

Authors:  Xu-Cheng Yin; Chun Yang; Wei-Yi Pei; Haixia Man; Jun Zhang; Erik Learned-Miller; Hong Yu
Journal:  PLoS One       Date:  2015-05-07       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.