Literature DB >> 32383756

TeamTat: a collaborative text annotation tool.

Rezarta Islamaj1, Dongseop Kwon2, Sun Kim1, Zhiyong Lu1.   

Abstract

Manually annotated data is key to developing text-mining and information-extraction algorithms. However, human annotation requires considerable time, effort and expertise. Given the rapid growth of biomedical literature, it is paramount to build tools that facilitate speed and maintain expert quality. While existing text annotation tools may provide user-friendly interfaces to domain experts, limited support is available for figure display, project management, and multi-user team annotation. In response, we developed TeamTat (https://www.teamtat.org), a web-based annotation tool (local setup available), equipped to manage team annotation projects engagingly and efficiently. TeamTat is a novel tool for managing multi-user, multi-label document annotation, reflecting the entire production life cycle. Project managers can specify annotation schema for entities and relations and select annotator(s) and distribute documents anonymously to prevent bias. Document input format can be plain text, PDF or BioC (uploaded locally or automatically retrieved from PubMed/PMC), and output format is BioC with inline annotations. TeamTat displays figures from the full text for the annotator's convenience. Multiple users can work on the same document independently in their workspaces, and the team manager can track task completion. TeamTat provides corpus quality assessment via inter-annotator agreement statistics, and a user-friendly interface convenient for annotation review and inter-annotator disagreement resolution to improve corpus quality. Published by Oxford University Press on behalf of Nucleic Acids Research 2020.

Entities:  

Year:  2020        PMID: 32383756      PMCID: PMC7319445          DOI: 10.1093/nar/gkaa333

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  24 in total

1.  PMC text mining subset in BioC: about three million full-text articles and growing.

Authors:  Donald C Comeau; Chih-Hsuan Wei; Rezarta Islamaj Doğan; Zhiyong Lu
Journal:  Bioinformatics       Date:  2019-09-15       Impact factor: 6.937

2.  PubTator central: automated concept annotation for biomedical full text articles.

Authors:  Chih-Hsuan Wei; Alexis Allot; Robert Leaman; Zhiyong Lu
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

3.  tmVar: a text mining approach for extracting sequence variants in biomedical literature.

Authors:  Chih-Hsuan Wei; Bethany R Harris; Hung-Yu Kao; Zhiyong Lu
Journal:  Bioinformatics       Date:  2013-04-05       Impact factor: 6.937

4.  NCBI disease corpus: a resource for disease name recognition and concept normalization.

Authors:  Rezarta Islamaj Doğan; Robert Leaman; Zhiyong Lu
Journal:  J Biomed Inform       Date:  2014-01-03       Impact factor: 6.317

Review 5.  A survey on annotation tools for the biomedical literature.

Authors:  Mariana Neves; Ulf Leser
Journal:  Brief Bioinform       Date:  2012-12-18       Impact factor: 11.622

6.  Argo: an integrative, interactive, text mining-based workbench supporting curation.

Authors:  Rafal Rak; Andrew Rowley; William Black; Sophia Ananiadou
Journal:  Database (Oxford)       Date:  2012-03-20       Impact factor: 3.451

7.  EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation.

Authors:  Evangelos Pafilis; Pier Luigi Buttigieg; Barbra Ferrell; Emiliano Pereira; Julia Schnetzer; Christos Arvanitidis; Lars Juhl Jensen
Journal:  Database (Oxford)       Date:  2016-02-20       Impact factor: 3.451

8.  Textpresso Central: a customizable platform for searching, text mining, viewing, and curating biomedical literature.

Authors:  H-M Müller; K M Van Auken; Y Li; P W Sternberg
Journal:  BMC Bioinformatics       Date:  2018-03-09       Impact factor: 3.169

9.  BC4GO: a full-text corpus for the BioCreative IV GO task.

Authors:  Kimberly Van Auken; Mary L Schaeffer; Peter McQuilton; Stanley J F Laulederkind; Donghui Li; Shur-Jen Wang; G Thomas Hayman; Susan Tweedie; Cecilia N Arighi; James Done; Hans-Michael Müller; Paul W Sternberg; Yuqing Mao; Chih-Hsuan Wei; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2014-07-28       Impact factor: 3.451

10.  Assisting manual literature curation for protein-protein interactions using BioQRator.

Authors:  Dongseop Kwon; Sun Kim; Soo-Yong Shin; Andrew Chatr-aryamontri; W John Wilbur
Journal:  Database (Oxford)       Date:  2014-07-22       Impact factor: 3.451

View more
  4 in total

1.  Machine Learning Approach to Facilitate Knowledge Synthesis at the Intersection of Liver Cancer, Epidemiology, and Health Disparities Research.

Authors:  Travis C Hyams; Ling Luo; Brionna Hair; Kyubum Lee; Zhiyong Lu; Daniela Seminara
Journal:  JCO Clin Cancer Inform       Date:  2022-05

2.  NLM-Chem, a new resource for chemical entity recognition in PubMed full text literature.

Authors:  Rezarta Islamaj; Robert Leaman; Sun Kim; Dongseop Kwon; Chih-Hsuan Wei; Donald C Comeau; Yifan Peng; David Cissel; Cathleen Coss; Carol Fisher; Rob Guzman; Preeti Gokal Kochar; Stella Koppel; Dorothy Trinh; Keiko Sekiya; Janice Ward; Deborah Whitman; Susan Schmidt; Zhiyong Lu
Journal:  Sci Data       Date:  2021-03-25       Impact factor: 6.444

3.  MedTAG: a portable and customizable annotation tool for biomedical documents.

Authors:  Fabio Giachelle; Ornella Irrera; Gianmaria Silvello
Journal:  BMC Med Inform Decis Mak       Date:  2021-12-18       Impact factor: 2.796

4.  Auto-CORPus: A Natural Language Processing Tool for Standardizing and Reusing Biomedical Literature.

Authors:  Tim Beck; Tom Shorter; Yan Hu; Zhuoyu Li; Shujian Sun; Casiana M Popovici; Nicholas A R McQuibban; Filip Makraduli; Cheng S Yeung; Thomas Rowlands; Joram M Posma
Journal:  Front Digit Health       Date:  2022-02-15
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.