
Applying citizen science to gene, drug and disease relationship extraction from biomedical abstracts.

Ginger Tsueng, Max Nanis, Jennifer T Fouquier, Michael Mayers, Benjamin M Good, Andrew I Su.

Abstract

MOTIVATION: Biomedical literature is growing at a rate that outpaces our ability to harness the knowledge contained therein. To mine valuable inferences from the large volume of literature, many researchers use information extraction algorithms to harvest information in biomedical texts. Information extraction is usually accomplished via a combination of manual expert curation and computational methods. Advances in computational methods usually depend on the time-consuming generation of gold standards by a limited number of expert curators. Citizen science is public participation in scientific research. We previously found that citizen scientists are willing and able to perform named entity recognition of disease mentions in biomedical abstracts, but did not know whether this was also true of relationship extraction (RE).
RESULTS: In this article, we introduce the Relationship Extraction Module of the web-based application Mark2Cure (M2C) and demonstrate that citizen scientists can perform RE. We confirm the importance of accurate named entity recognition on user performance of RE and identify design issues that impacted data quality. We find that the data generated by citizen scientists can be used to identify relationship types not currently available in the M2C Relationship Extraction Module. We compare the citizen science-generated data with algorithm-mined data and identify ways in which the two approaches may complement one another. We also discuss opportunities for future improvement of this system, as well as the potential synergies between citizen science, manual biocuration and natural language processing.
AVAILABILITY AND IMPLEMENTATION: Mark2Cure platform: https://mark2cure.org; Mark2Cure source code: https://github.com/sulab/mark2cure; and data and analysis code for this article: https://github.com/gtsueng/M2C_rel_nb.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Year: 2020; PMID: 31504205; PMCID: PMC8104067; DOI: 10.1093/bioinformatics/btz678

Source DB: PubMed; Journal: Bioinformatics; ISSN: 1367-4803; Impact factor: 6.937


  3 in total

1.  A hybrid approach toward biomedical relation extraction training corpora: combining distant supervision with crowdsourcing.

Authors:  Diana Sousa; Andre Lamurias; Francisco M Couto
Journal:  Database (Oxford)       Date:  2020-12-01       Impact factor: 3.451

2.  Outbreak.info Research Library: A standardized, searchable platform to discover and explore COVID-19 resources.

Authors:  Ginger Tsueng; Julia L Mullen; Manar Alkuzweny; Marco Cano; Benjamin Rush; Emily Haag; Alaa Abdel Latif; Xinghua Zhou; Zhongchao Qian; Emory Hufbauer; Mark Zeller; Kristian G Andersen; Chunlei Wu; Andrew I Su; Karthik Gangavarapu; Laura D Hughes
Journal:  bioRxiv       Date:  2022-06-02

3.  Building a pipeline to solicit expert knowledge from the community to aid gene summary curation.

Authors:  Giulia Antonazzo; Jose M Urbano; Steven J Marygold; Gillian H Millburn; Nicholas H Brown
Journal:  Database (Oxford)       Date:  2020-01-01       Impact factor: 3.451
