Literature DB >> 16085453

Distributed modules for text annotation and IE applied to the biomedical domain.

Harald Kirsch1, Sylvain Gaudan, Dietrich Rebholz-Schuhmann.   

Abstract

Biological databases contain facts from scientific literature that have been curated by hand to ensure high quality. Curation is time-consuming and can be supported by information extraction methods. We present a server software infrastructure which allows to easily plug in modules to identify biologically interesting pieces of text to be then presented in a web interface to the curator. There are modules which identify UniProt, UMLS and GO terminology, gene and protein names, mutations and protein-protein interactions. UniProt, UMLS and GO concepts are automatically linked to the original source. The module for mutations is based on syntax patterns and the one for protein-protein interactions relies on chunk parsing. All modules work as separate servers possibly distributed on different machines and can be combined into processing pipelines as necessary. Communication is based on XML annotated text streams, each server processing the XML elements it is designed for, and possibly adding more information in the form of XML annotation. The server and the underlying software are available to the public.

Mesh:

Year:  2005        PMID: 16085453     DOI: 10.1016/j.ijmedinf.2005.06.011

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.046


  8 in total

1.  PaperMaker: validation of biomedical scientific publications.

Authors:  D Rebholz-Schuhmann; S Kavaliauskas; P Pezik
Journal:  Bioinformatics       Date:  2010-03-03       Impact factor: 6.937

2.  Featured Article: Genotation: Actionable knowledge for the scientific reader.

Authors:  Panduka Nagahawatte; Ethan Willis; Mark Sakauye; Rony Jose; Hao Chen; Robert L Davis
Journal:  Exp Biol Med (Maywood)       Date:  2016-02-21

Review 3.  SYMBIOmatics: synergies in Medical Informatics and Bioinformatics--exploring current scientific literature for emerging topics.

Authors:  Dietrich Rebholz-Schuhman; Graham Cameron; Dominic Clark; Erik van Mulligen; Jean-Louis Coatrieux; Eva Del Hoyo Barbolla; Fernando Martin-Sanchez; Luciano Milanesi; Ivan Porro; Francesco Beltrame; Ioannis Tollis; Johan Van der Lei
Journal:  BMC Bioinformatics       Date:  2007-03-08       Impact factor: 3.169

4.  Evaluation and cross-comparison of lexical entities of biological interest (LexEBI).

Authors:  Dietrich Rebholz-Schuhmann; Jee-Hyub Kim; Ying Yan; Abhishek Dixit; Caroline Friteyre; Robert Hoehndorf; Rolf Backofen; Ian Lewin
Journal:  PLoS One       Date:  2013-10-04       Impact factor: 3.240

5.  Evaluating gold standard corpora against gene/protein tagging solutions and lexical resources.

Authors:  Dietrich Rebholz-Schuhmann; Senay Kafkas; Jee-Hyub Kim; Chen Li; Antonio Jimeno Yepes; Robert Hoehndorf; Rolf Backofen; Ian Lewin
Journal:  J Biomed Semantics       Date:  2013-10-11

Review 6.  Semantic annotation in biomedicine: the current landscape.

Authors:  Jelena Jovanović; Ebrahim Bagheri
Journal:  J Biomed Semantics       Date:  2017-09-22

7.  Rapid identification of PAX2/5/8 direct downstream targets in the otic vesicle by combinatorial use of bioinformatics tools.

Authors:  Mirana Ramialison; Baubak Bajoghli; Narges Aghaallaei; Laurence Ettwiller; Sylvain Gaudan; Beate Wittbrodt; Thomas Czerny; Joachim Wittbrodt
Journal:  Genome Biol       Date:  2008-10-01       Impact factor: 13.583

8.  Monitoring named entity recognition: the League Table.

Authors:  Dietrich Rebholz-Schuhmann; Senay Kafkas; Jee-Hyub Kim; Antonio Jimeno Yepes; Ian Lewin
Journal:  J Biomed Semantics       Date:  2013-09-13
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.