Literature DB >> 28389234

Crowd control: Effectively utilizing unscreened crowd workers for biomedical data annotation.

Anne Cocos1, Ting Qian2, Chris Callison-Burch3, Aaron J Masino4.   

Abstract

Annotating unstructured texts in Electronic Health Records data is usually a necessary step for conducting machine learning research on such datasets. Manual annotation by domain experts provides data of the best quality, but has become increasingly impractical given the rapid increase in the volume of EHR data. In this article, we examine the effectiveness of crowdsourcing with unscreened online workers as an alternative for transforming unstructured texts in EHRs into annotated data that are directly usable in supervised learning models. We find the crowdsourced annotation data to be just as effective as expert data in training a sentence classification model to detect the mentioning of abnormal ear anatomy in radiology reports of audiology. Furthermore, we have discovered that enabling workers to self-report a confidence level associated with each annotation can help researchers pinpoint less-accurate annotations requiring expert scrutiny. Our findings suggest that even crowd workers without specific domain knowledge can contribute effectively to the task of annotating unstructured EHR datasets.
Copyright © 2017 Elsevier Inc. All rights reserved.

Keywords:  Crowdsourcing; EHR data; Logistic regression; Sentence classification; Text annotations

Mesh:

Year:  2017        PMID: 28389234     DOI: 10.1016/j.jbi.2017.04.003

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  4 in total

1.  Crowdsourcing to delineate skin affected by chronic graft-vs-host disease.

Authors:  Eric R Tkaczyk; Joseph R Coco; Jianing Wang; Fuyao Chen; Cheng Ye; Madan H Jagasia; Benoit M Dawant; Daniel Fabbri
Journal:  Skin Res Technol       Date:  2019-02-20       Impact factor: 2.365

2.  A hybrid approach toward biomedical relation extraction training corpora: combining distant supervision with crowdsourcing.

Authors:  Diana Sousa; Andre Lamurias; Francisco M Couto
Journal:  Database (Oxford)       Date:  2020-12-01       Impact factor: 3.451

3.  A systematic review of natural language processing applied to radiology reports.

Authors:  Arlene Casey; Emma Davidson; Michael Poon; Hang Dong; Daniel Duma; Andreas Grivas; Claire Grover; Víctor Suárez-Paniagua; Richard Tobin; William Whiteley; Honghan Wu; Beatrice Alex
Journal:  BMC Med Inform Decis Mak       Date:  2021-06-03       Impact factor: 2.796

Review 4.  Crowdsourcing in health and medical research: a systematic review.

Authors:  Cheng Wang; Larry Han; Gabriella Stein; Suzanne Day; Cedric Bien-Gund; Allison Mathews; Jason J Ong; Pei-Zhen Zhao; Shu-Fang Wei; Jennifer Walker; Roger Chou; Amy Lee; Angela Chen; Barry Bayus; Joseph D Tucker
Journal:  Infect Dis Poverty       Date:  2020-01-20       Impact factor: 4.520

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.