Literature DB >> 26873781

Is the crowd better as an assistant or a replacement in ontology engineering? An exploration through the lens of the Gene Ontology.

Jonathan M Mortensen1, Natalie Telis2, Jacob J Hughey3, Hua Fan-Minogue4, Kimberly Van Auken5, Michel Dumontier6, Mark A Musen7.   

Abstract

Biomedical ontologies contain errors. Crowdsourcing, defined as taking a job traditionally performed by a designated agent and outsourcing it to an undefined large group of people, provides scalable access to humans. Therefore, the crowd has the potential to overcome the limited accuracy and scalability found in current ontology quality assurance approaches. Crowd-based methods have identified errors in SNOMED CT, a large, clinical ontology, with an accuracy similar to that of experts, suggesting that crowdsourcing is indeed a feasible approach for identifying ontology errors. This work uses that same crowd-based methodology, as well as a panel of experts, to verify a subset of the Gene Ontology (200 relationships). Experts identified 16 errors, generally in relationships referencing acids and metals. The crowd performed poorly in identifying those errors, with an area under the receiver operating characteristic curve ranging from 0.44 to 0.73, depending on the methods configuration. However, when the crowd verified what experts considered to be easy relationships with useful definitions, they performed reasonably well. Notably, there are significantly fewer Google search results for Gene Ontology concepts than SNOMED CT concepts. This disparity may account for the difference in performance - fewer search results indicate a more difficult task for the worker. The number of Internet search results could serve as a method to assess which tasks are appropriate for the crowd. These results suggest that the crowd fits better as an expert assistant, helping experts with their verification by completing the easy tasks and allowing experts to focus on the difficult tasks, rather than an expert replacement.
Copyright © 2016 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Crowdsourcing; Gene Ontology; Ontology engineering

Mesh:

Year:  2016        PMID: 26873781      PMCID: PMC4836980          DOI: 10.1016/j.jbi.2016.02.005

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  17 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

Review 2.  Bio-ontologies: current trends and future directions.

Authors:  Olivier Bodenreider; Robert Stevens
Journal:  Brief Bioinform       Date:  2006-08-09       Impact factor: 11.622

Review 3.  Biomedical ontologies: a functional perspective.

Authors:  Daniel L Rubin; Nigam H Shah; Natalya F Noy
Journal:  Brief Bioinform       Date:  2007-12-12       Impact factor: 11.622

4.  Special issue on auditing of terminologies.

Authors:  J Geller; Y Perl; M Halper; R Cornet
Journal:  J Biomed Inform       Date:  2009-06       Impact factor: 6.317

5.  Biomedical ontologies in action: role in knowledge management, data integration and decision support.

Authors:  O Bodenreider
Journal:  Yearb Med Inform       Date:  2008

6.  Applying evolutionary terminology auditing to the Gene Ontology.

Authors:  Werner Ceusters
Journal:  J Biomed Inform       Date:  2008-12-31       Impact factor: 6.317

7.  Using the wisdom of the crowds to find critical errors in biomedical ontologies: a study of SNOMED CT.

Authors:  Jonathan M Mortensen; Evan P Minty; Michael Januszyk; Timothy E Sweeney; Alan L Rector; Natalya F Noy; Mark A Musen
Journal:  J Am Med Inform Assoc       Date:  2014-10-23       Impact factor: 4.497

8.  Predicting protein structures with a multiplayer online game.

Authors:  Seth Cooper; Firas Khatib; Adrien Treuille; Janos Barbero; Jeehyung Lee; Michael Beenen; Andrew Leaver-Fay; David Baker; Zoran Popović; Foldit Players
Journal:  Nature       Date:  2010-08-05       Impact factor: 49.962

9.  Ontology Design Patterns for bio-ontologies: a case study on the Cell Cycle Ontology.

Authors:  Mikel Egaña Aranguren; Erick Antezana; Martin Kuiper; Robert Stevens
Journal:  BMC Bioinformatics       Date:  2008-04-29       Impact factor: 3.169

10.  OpenDMAP: an open source, ontology-driven concept analysis engine, with applications to capturing knowledge regarding protein transport, protein interactions and cell-type-specific gene expression.

Authors:  Lawrence Hunter; Zhiyong Lu; James Firby; William A Baumgartner; Helen L Johnson; Philip V Ogren; K Bretonnel Cohen
Journal:  BMC Bioinformatics       Date:  2008-01-31       Impact factor: 3.169

View more
  2 in total

1.  Use of ontology structure and Bayesian models to aid the crowdsourcing of ICD-11 sanctioning rules.

Authors:  Yun Lou; Samson W Tu; Csongor Nyulas; Tania Tudorache; Robert J G Chalmers; Mark A Musen
Journal:  J Biomed Inform       Date:  2017-02-10       Impact factor: 6.317

Review 2.  Assessing the practice of biomedical ontology evaluation: Gaps and opportunities.

Authors:  Muhammad Amith; Zhe He; Jiang Bian; Juan Antonio Lossio-Ventura; Cui Tao
Journal:  J Biomed Inform       Date:  2018-02-17       Impact factor: 6.317

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.