Literature DB >> 23255168

A survey on annotation tools for the biomedical literature.

Mariana Neves1, Ulf Leser.   

Abstract

New approaches to biomedical text mining crucially depend on the existence of comprehensive annotated corpora. Such corpora, commonly called gold standards, are important for learning patterns or models during the training phase, for evaluating and comparing the performance of algorithms and also for better understanding the information sought for by means of examples. Gold standards depend on human understanding and manual annotation of natural language text. This process is very time-consuming and expensive because it requires high intellectual effort from domain experts. Accordingly, the lack of gold standards is considered as one of the main bottlenecks for developing novel text mining methods. This situation led the development of tools that support humans in annotating texts. Such tools should be intuitive to use, should support a range of different input formats, should include visualization of annotated texts and should generate an easy-to-parse output format. Today, a range of tools which implement some of these functionalities are available. In this survey, we present a comprehensive survey of tools for supporting annotation of biomedical texts. Altogether, we considered almost 30 tools, 13 of which were selected for an in-depth comparison. The comparison was performed using predefined criteria and was accompanied by hands-on experiences whenever possible. Our survey shows that current tools can support many of the tasks in biomedical text annotation in a satisfying manner, but also that no tool can be considered as a true comprehensive solution.

Entities:  

Keywords:  annotation tools; curation tools; gold standard corpora; text mining

Mesh:

Year:  2012        PMID: 23255168     DOI: 10.1093/bib/bbs084

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  22 in total

1.  Expert guided natural language processing using one-class classification.

Authors:  Erel Joffe; Emily J Pettigrew; Jorge R Herskovic; Charles F Bearden; Elmer V Bernstam
Journal:  J Am Med Inform Assoc       Date:  2015-06-10       Impact factor: 4.497

2.  TeamTat: a collaborative text annotation tool.

Authors:  Rezarta Islamaj; Dongseop Kwon; Sun Kim; Zhiyong Lu
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

3.  RegEl corpus: identifying DNA regulatory elements in the scientific literature.

Authors:  Samuele Garda; Freyda Lenihan-Geels; Sebastian Proft; Stefanie Hochmuth; Markus Schülke; Dominik Seelow; Ulf Leser
Journal:  Database (Oxford)       Date:  2022-06-27       Impact factor: 4.462

4.  An extensive review of tools for manual annotation of documents.

Authors:  Mariana Neves; Jurica Ševa
Journal:  Brief Bioinform       Date:  2021-01-18       Impact factor: 11.622

Review 5.  An analysis on the entity annotations in biological corpora.

Authors:  Mariana Neves
Journal:  F1000Res       Date:  2014-04-25

6.  The CHEMDNER corpus of chemicals and drugs and its annotation principles.

Authors:  Martin Krallinger; Obdulia Rabal; Florian Leitner; Miguel Vazquez; David Salgado; Zhiyong Lu; Robert Leaman; Yanan Lu; Donghong Ji; Daniel M Lowe; Roger A Sayle; Riza Theresa Batista-Navarro; Rafal Rak; Torsten Huber; Tim Rocktäschel; Sérgio Matos; David Campos; Buzhou Tang; Hua Xu; Tsendsuren Munkhdalai; Keun Ho Ryu; S V Ramanan; Senthil Nathan; Slavko Žitnik; Marko Bajec; Lutz Weber; Matthias Irmer; Saber A Akhondi; Jan A Kors; Shuo Xu; Xin An; Utpal Kumar Sikdar; Asif Ekbal; Masaharu Yoshioka; Thaer M Dieb; Miji Choi; Karin Verspoor; Madian Khabsa; C Lee Giles; Hongfang Liu; Komandur Elayavilli Ravikumar; Andre Lamurias; Francisco M Couto; Hong-Jie Dai; Richard Tzong-Han Tsai; Caglar Ata; Tolga Can; Anabel Usié; Rui Alves; Isabel Segura-Bedmar; Paloma Martínez; Julen Oyarzabal; Alfonso Valencia
Journal:  J Cheminform       Date:  2015-01-19       Impact factor: 5.514

7.  PubTator: a web-based text mining tool for assisting biocuration.

Authors:  Chih-Hsuan Wei; Hung-Yu Kao; Zhiyong Lu
Journal:  Nucleic Acids Res       Date:  2013-05-22       Impact factor: 16.971

8.  Preliminary evaluation of the CellFinder literature curation pipeline for gene expression in kidney cells and anatomical parts.

Authors:  Mariana Neves; Alexander Damaschun; Nancy Mah; Fritz Lekschas; Stefanie Seltmann; Harald Stachelscheid; Jean-Fred Fontaine; Andreas Kurtz; Ulf Leser
Journal:  Database (Oxford)       Date:  2013-04-18       Impact factor: 3.451

9.  Natural language processing of radiology reports for the detection of thromboembolic diseases and clinically relevant incidental findings.

Authors:  Anne-Dominique Pham; Aurélie Névéol; Thomas Lavergne; Daisuke Yasunaga; Olivier Clément; Guy Meyer; Rémy Morello; Anita Burgun
Journal:  BMC Bioinformatics       Date:  2014-08-07       Impact factor: 3.169

10.  Assisting manual literature curation for protein-protein interactions using BioQRator.

Authors:  Dongseop Kwon; Sun Kim; Soo-Yong Shin; Andrew Chatr-aryamontri; W John Wilbur
Journal:  Database (Oxford)       Date:  2014-07-22       Impact factor: 3.451

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.