Literature DB >> 18229720

Filling the gaps between tools and users: a tool comparator, using protein-protein interaction as an example.

Yoshinobu Kano1, Ngan Nguyen, Rune Saetre, Kazuhiro Yoshida, Yusuke Miyao, Yoshimasa Tsuruoka, Yuichiro Matsubayashi, Sophia Ananiadou, Jun'ichi Tsujii.   

Abstract

Recently, several text mining programs have reached a near-practical level of performance. Some systems are already being used by biologists and database curators. However, it has also been recognized that current Natural Language Processing (NLP) and Text Mining (TM) technology is not easy to deploy, since research groups tend to develop systems that cater specifically to their own requirements. One of the major reasons for the difficulty of deployment of NLP/TM technology is that re-usability and interoperability of software tools are typically not considered during development. While some effort has been invested in making interoperable NLP/TM toolkits, the developers of end-to-end systems still often struggle to reuse NLP/TM tools, and often opt to develop similar programs from scratch instead. This is particularly the case in BioNLP, since the requirements of biologists are so diverse that NLP tools have to be adapted and re-organized in a much more extensive manner than was originally expected. Although generic frameworks like UIMA (Unstructured Information Management Architecture) provide promising ways to solve this problem, the solution that they provide is only partial. In order for truly interoperable toolkits to become a reality, we also need sharable type systems and a developer-friendly environment for software integration that includes functionality for systematic comparisons of available tools, a simple I/O interface, and visualization tools. In this paper, we describe such an environment that was developed based on UIMA, and we show its feasibility through our experience in developing a protein-protein interaction (PPI) extraction system.

Mesh:

Year:  2008        PMID: 18229720

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  6 in total

1.  TRANSLATING BIOLOGY: TEXT MINING TOOLS THAT WORK.

Authors:  K Bretonnel Cohen; Hong Yu; Philip E Bourne; Lynette Hirschman
Journal:  Pac Symp Biocomput       Date:  2008-01-01

2.  U-Compare: share and compare text mining tools with UIMA.

Authors:  Yoshinobu Kano; William A Baumgartner; Luke McCrohon; Sophia Ananiadou; K Bretonnel Cohen; Lawrence Hunter; Jun'ichi Tsujii
Journal:  Bioinformatics       Date:  2009-05-04       Impact factor: 6.937

3.  Construction and Analysis of the Cell Surface's Protein Network for Human Sperm-Egg Interaction.

Authors:  Soudabeh Sabetian Fard Jahromi; Mohd Shahir Shamsir
Journal:  ISRN Bioinform       Date:  2013-08-12

4.  Automatic extraction of protein-protein interactions using grammatical relationship graph.

Authors:  Kaixian Yu; Pei-Yau Lung; Tingting Zhao; Peixiang Zhao; Yan-Yuan Tseng; Jinfeng Zhang
Journal:  BMC Med Inform Decis Mak       Date:  2018-07-23       Impact factor: 2.796

5.  An open-source framework for large-scale, flexible evaluation of biomedical text mining systems.

Authors:  William A Baumgartner; K Bretonnel Cohen; Lawrence Hunter
Journal:  J Biomed Discov Collab       Date:  2008-01-29

6.  BioC: a minimalist approach to interoperability for biomedical text processing.

Authors:  Donald C Comeau; Rezarta Islamaj Doğan; Paolo Ciccarese; Kevin Bretonnel Cohen; Martin Krallinger; Florian Leitner; Zhiyong Lu; Yifan Peng; Fabio Rinaldi; Manabu Torii; Alfonso Valencia; Karin Verspoor; Thomas C Wiegers; Cathy H Wu; W John Wilbur
Journal:  Database (Oxford)       Date:  2013-09-18       Impact factor: 3.451

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.