Literature DB >> 29295103

Inter-Annotator Agreement and the Upper Limit on Machine Performance: Evidence from Biomedical Natural Language Processing.

Mayla Boguslav1, Kevin Bretonnel Cohen1.   

Abstract

Human-annotated data is a fundamental part of natural language processing system development and evaluation. The quality of that data is typically assessed by calculating the agreement between the annotators. It is widely assumed that this agreement between annotators is the upper limit on system performance in natural language processing: if humans can't agree with each other about the classification more than some percentage of the time, we don't expect a computer to do any better. We trace the logical positivist roots of the motivation for measuring inter-annotator agreement, demonstrate the prevalence of the widely-held assumption about the relationship between inter-annotator agreement and system performance, and present data that suggest that inter-annotator agreement is not, in fact, an upper bound on language processing system performance.

Entities:  

Keywords:  Evaluation Studies; Natural Language Processing; Supervised Machine Learning

Mesh:

Year:  2017        PMID: 29295103

Source DB:  PubMed          Journal:  Stud Health Technol Inform        ISSN: 0926-9630


  2 in total

1.  Constructing fine-grained entity recognition corpora based on clinical records of traditional Chinese medicine.

Authors:  Tingting Zhang; Yaqiang Wang; Xiaofeng Wang; Yafei Yang; Ying Ye
Journal:  BMC Med Inform Decis Mak       Date:  2020-04-06       Impact factor: 2.796

2.  Identifying and classifying goals for scientific knowledge.

Authors:  Mayla R Boguslav; Nourah M Salem; Elizabeth K White; Sonia M Leach; Lawrence E Hunter
Journal:  Bioinform Adv       Date:  2021-07-28
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.