Literature DB >> 24991197

Structured Literature Image Finder: Parsing Text and Figures in Biomedical Literature.

Amr Ahmed1, Andrew Arnold2, Luis Pedro Coelho3, Joshua Kangas3, Abdul-Saboor Sheikh4, Eric Xing5, William Cohen6, Robert F Murphy7.   

Abstract

The SLIF project combines text-mining and image processing to extract structured information from biomedical literature. SLIF extracts images and their captions from published papers. The captions are automatically parsed for relevant biological entities (protein and cell type names), while the images are classified according to their type (e.g., micrograph or gel). Fluorescence microscopy images are further processed and classified according to the depicted subcellular localization. The results of this process can be queried online using either a user-friendly web-interface or an XML-based web-service. As an alternative to the targeted query paradigm, SLIF also supports browsing the collection based on latent topic models which are derived from both the annotated text and the image data. The SLIF web application, as well as labeled datasets used for training system components, is publicly available at http://slif.cbi.cmu.edu.

Entities:  

Year:  2010        PMID: 24991197      PMCID: PMC4075770          DOI: 10.1016/j.websem.2010.04.002

Source DB:  PubMed          Journal:  Web Semant        ISSN: 1570-8268            Impact factor:   1.897


  4 in total

1.  High-recall protein entity recognition using a dictionary.

Authors:  Zhenzhen Kou; William W Cohen; Robert F Murphy
Journal:  Bioinformatics       Date:  2005-06       Impact factor: 6.937

2.  Integrating image data into biomedical text categorization.

Authors:  Hagit Shatkay; Nawei Chen; Dorothea Blostein
Journal:  Bioinformatics       Date:  2006-07-15       Impact factor: 6.937

3.  A stacked graphical model for associating sub-images with sub-captions.

Authors:  Zhenzhen Kou; William W Cohen; Robert F Murphy
Journal:  Pac Symp Biocomput       Date:  2007

4.  Structured Correspondence Topic Models for Mining Captioned Figures in Biological Literature.

Authors:  Amr Ahmed; Eric P Xing; William W Cohen; Robert F Murphy
Journal:  KDD       Date:  2009
  4 in total
  2 in total

1.  Bar charts detection and analysis in biomedical literature of PubMed Central.

Authors:  Ying He; Xiaohan Yu; Yangjing Gan; Tujin Zhu; Shengwu Xiong; Jing Peng; Lun Hu; Guang Xu; Xiaohui Yuan
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

2.  DeTEXT: A Database for Evaluating Text Extraction from Biomedical Literature Figures.

Authors:  Xu-Cheng Yin; Chun Yang; Wei-Yi Pei; Haixia Man; Jun Zhang; Erik Learned-Miller; Hong Yu
Journal:  PLoS One       Date:  2015-05-07       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.