Use of ontology structure and Bayesian models to aid the crowdsourcing of ICD-11 sanctioning rules.

Yun Lou, Samson W Tu, Csongor Nyulas, Tania Tudorache, Robert J G Chalmers, Mark A Musen.

Abstract

The International Classification of Diseases (ICD) is the de facto standard international classification for mortality reporting and for many epidemiological, clinical, and financial use cases. The next version of ICD, ICD-11, will be submitted for approval by the World Health Assembly in 2018. In previous versions of ICD, coders mostly selected single codes from pre-enumerated disease and disorder codes. ICD-11 coding, by contrast, will allow extensive use of multiple codes to give more detailed disease descriptions. For example, "severe malignant neoplasms of left breast" may be coded by combining a "stem code" (e.g., the code for malignant neoplasms of breast) with a variety of "extension codes" (e.g., codes for laterality and severity). The use of multiple codes (a process called post-coordination) avoids the pitfall of having to pre-enumerate a vast number of possible disease and qualifier combinations, but it risks the creation of meaningless expressions that combine stem codes with inappropriate qualifiers. To prevent this, "sanctioning rules" that define legal combinations are necessary. In this work, we developed a crowdsourcing method for obtaining sanctioning rules for the post-coordination of concepts in ICD-11. Our method used the hierarchical structures in the domain to improve the accuracy of the sanctioning rules and to lower the crowdsourcing cost. We used Bayesian networks to model crowd workers' skills, the accuracy of their responses, and our confidence in the acquired sanctioning rules. We applied reinforcement learning to develop an agent that continually adjusted the confidence cutoffs during the crowdsourcing process to maximize the overall quality of sanctioning rules under a fixed budget. Finally, we performed formative evaluations using a skin-disease branch of the draft ICD-11 and demonstrated that the crowdsourced sanctioning rules replicated those defined by an expert dermatologist with high precision and recall.
This work demonstrated that a crowdsourcing approach could offer a reasonably efficient method for generating a first draft of sanctioning rules that subject matter experts could verify and edit, thus relieving them of the tedium and cost of formulating the initial set of rules.
Copyright © 2017 Elsevier Inc. All rights reserved.
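
Illustrative sketch (not from the article): the abstract describes keeping a confidence value for each candidate sanctioning rule, informed by crowd workers' skills, and comparing it against a cutoff. The Python below shows one simple way such a confidence could be computed. It is a naive-Bayes vote aggregation that assumes independent workers with known accuracies, not the authors' Bayesian network. The fixed cutoff stands in for the cutoffs that the article's reinforcement-learning agent adjusts. The function names, the prior, and the cutoff value are all hypothetical.

```python
from math import prod


def posterior_rule_valid(votes, prior=0.5):
    """Return P(rule is valid | votes) under a naive independence assumption.

    votes: list of (answer, accuracy) pairs, where answer is True if the
           worker said the stem code / qualifier combination is sensible and
           accuracy is that worker's assumed probability of answering
           correctly (strictly between 0 and 1).
    prior: assumed prior probability that the rule is valid. This is a
           hypothetical hook where ontology structure could enter, e.g. a
           higher prior when the parent concept already allows the qualifier.
    """
    # Likelihood of the observed answers if the rule is truly valid.
    like_valid = prod(acc if ans else 1.0 - acc for ans, acc in votes)
    # Likelihood of the observed answers if the rule is truly invalid.
    like_invalid = prod(1.0 - acc if ans else acc for ans, acc in votes)
    numerator = prior * like_valid
    return numerator / (numerator + (1.0 - prior) * like_invalid)


def decide(posterior, cutoff=0.9):
    """Accept, reject, or request another judgment for a candidate rule."""
    if posterior >= cutoff:
        return "accept"
    if posterior <= 1.0 - cutoff:
        return "reject"
    return "ask_more"


# Three workers judge whether a laterality qualifier applies to a stem code.
votes = [(True, 0.8), (True, 0.7), (False, 0.6)]
p3 = posterior_rule_valid(votes)
print(round(p3, 3), decide(p3))

# A fourth, fairly reliable worker agrees; confidence now passes the cutoff.
p4 = posterior_rule_valid(votes + [(True, 0.8)])
print(round(p4, 3), decide(p4))
```

With these made-up accuracies, three votes leave the rule undecided and a fourth agreeing vote pushes it past the cutoff. This is the kind of trade-off between extra cost and extra confidence that a fixed budget forces.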

Keywords:  Bayesian network; Crowdsourcing; ICD; Ontology; Post-coordination; Sanctioning rules

Year: 2017; PMID: 28192233; PMCID: PMC5428551; DOI: 10.1016/j.jbi.2017.02.004

Source DB: PubMed; Journal: J Biomed Inform; ISSN: 1532-0464; Impact factor: 6.317

