Literature DB >> 26883486

Beyond accuracy: creating interoperable and scalable text-mining web services.

Chih-Hsuan Wei1, Robert Leaman1, Zhiyong Lu1.   

Abstract

UNLABELLED: The biomedical literature is a knowledge-rich resource and an important foundation for future research. With over 24 million articles in PubMed and an increasing growth rate, research in automated text processing is becoming increasingly important. We report here our recently developed web-based text mining services for biomedical concept recognition and normalization. Unlike most text-mining software tools, our web services integrate several state-of-the-art entity tagging systems (DNorm, GNormPlus, SR4GN, tmChem and tmVar) and offer a batch-processing mode able to process arbitrary text input (e.g. scholarly publications, patents and medical records) in multiple formats (e.g. BioC). We support multiple standards to make our service interoperable and allow simpler integration with other text-processing pipelines. To maximize scalability, we have preprocessed all PubMed articles, and use a computer cluster for processing large requests of arbitrary text.
AVAILABILITY AND IMPLEMENTATION: Our text-mining web service is freely available at http://www.ncbi.nlm.nih.gov/CBBresearch/Lu/Demo/tmTools/#curl CONTACT: : Zhiyong.Lu@nih.gov. Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the US.

Mesh:

Year:  2016        PMID: 26883486      PMCID: PMC4908316          DOI: 10.1093/bioinformatics/btv760

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  25 in total

1.  MyMiner: a web application for computer-assisted biocuration and text annotation.

Authors:  David Salgado; Martin Krallinger; Marc Depaule; Elodie Drula; Ashish V Tendulkar; Florian Leitner; Alfonso Valencia; Christophe Marcelle
Journal:  Bioinformatics       Date:  2012-07-12       Impact factor: 6.937

2.  NERBio: using selected word conjunctions, term normalization, and global patterns to improve biomedical named entity recognition.

Authors:  Richard Tzong-Han Tsai; Cheng-Lung Sung; Hong-Jie Dai; Hsieh-Chuan Hung; Ting-Yi Sung; Wen-Lian Hsu
Journal:  BMC Bioinformatics       Date:  2006-12-18       Impact factor: 3.169

3.  A navigator for human genome epidemiology.

Authors:  Wei Yu; Marta Gwinn; Melinda Clyne; Ajay Yesupriya; Muin J Khoury
Journal:  Nat Genet       Date:  2008-02       Impact factor: 38.330

4.  High-performance gene name normalization with GeNo.

Authors:  Joachim Wermter; Katrin Tomanek; Udo Hahn
Journal:  Bioinformatics       Date:  2009-02-02       Impact factor: 6.937

5.  LINNAEUS: a species name identification system for biomedical literature.

Authors:  Martin Gerner; Goran Nenadic; Casey M Bergman
Journal:  BMC Bioinformatics       Date:  2010-02-11       Impact factor: 3.169

6.  tmVar: a text mining approach for extracting sequence variants in biomedical literature.

Authors:  Chih-Hsuan Wei; Bethany R Harris; Hung-Yu Kao; Zhiyong Lu
Journal:  Bioinformatics       Date:  2013-04-05       Impact factor: 6.937

7.  MutationFinder: a high-performance system for extracting point mutation mentions from text.

Authors:  J Gregory Caporaso; William A Baumgartner; David A Randolph; K Bretonnel Cohen; Lawrence Hunter
Journal:  Bioinformatics       Date:  2007-05-11       Impact factor: 6.937

8.  Overview of BioCreative II gene normalization.

Authors:  Alexander A Morgan; Zhiyong Lu; Xinglong Wang; Aaron M Cohen; Juliane Fluck; Patrick Ruch; Anna Divoli; Katrin Fundel; Robert Leaman; Jörg Hakenberg; Chengjie Sun; Heng-hui Liu; Rafael Torres; Michael Krauthammer; William W Lau; Hongfang Liu; Chun-Nan Hsu; Martijn Schuemie; K Bretonnel Cohen; Lynette Hirschman
Journal:  Genome Biol       Date:  2008-09-01       Impact factor: 13.583

9.  DNorm: disease name normalization with pairwise learning to rank.

Authors:  Robert Leaman; Rezarta Islamaj Dogan; Zhiyong Lu
Journal:  Bioinformatics       Date:  2013-08-21       Impact factor: 6.937

10.  Web services-based text-mining demonstrates broad impacts for interoperability and process simplification.

Authors:  Thomas C Wiegers; Allan Peter Davis; Carolyn J Mattingly
Journal:  Database (Oxford)       Date:  2014-06-10       Impact factor: 3.451

View more
  11 in total

1.  HiPub: translating PubMed and PMC texts to networks for knowledge discovery.

Authors:  Kyubum Lee; Wonho Shin; Byounggun Kim; Sunwon Lee; Yonghwa Choi; Sunkyu Kim; Minji Jeon; Aik Choon Tan; Jaewoo Kang
Journal:  Bioinformatics       Date:  2016-08-02       Impact factor: 6.937

2.  tmVar 2.0: integrating genomic variant information from literature with dbSNP and ClinVar for precision medicine.

Authors:  Chih-Hsuan Wei; Lon Phan; Juliana Feltz; Rama Maiti; Tim Hefferon; Zhiyong Lu
Journal:  Bioinformatics       Date:  2018-01-01       Impact factor: 6.937

3.  DES-Tcell is a knowledgebase for exploring immunology-related literature.

Authors:  Ahdab AlSaieedi; Adil Salhi; Faroug Tifratene; Arwa Bin Raies; Arnaud Hungler; Mahmut Uludag; Christophe Van Neste; Vladimir B Bajic; Takashi Gojobori; Magbubah Essack
Journal:  Sci Rep       Date:  2021-07-12       Impact factor: 4.379

4.  PubMedPortable: A Framework for Supporting the Development of Text Mining Applications.

Authors:  Kersten Döring; Björn A Grüning; Kiran K Telukunta; Philippe Thomas; Stefan Günther
Journal:  PLoS One       Date:  2016-10-05       Impact factor: 3.240

5.  Overview of the interactive task in BioCreative V.

Authors:  Qinghua Wang; Shabbir S Abdul; Lara Almeida; Sophia Ananiadou; Yalbi I Balderas-Martínez; Riza Batista-Navarro; David Campos; Lucy Chilton; Hui-Jou Chou; Gabriela Contreras; Laurel Cooper; Hong-Jie Dai; Barbra Ferrell; Juliane Fluck; Socorro Gama-Castro; Nancy George; Georgios Gkoutos; Afroza K Irin; Lars J Jensen; Silvia Jimenez; Toni R Jue; Ingrid Keseler; Sumit Madan; Sérgio Matos; Peter McQuilton; Marija Milacic; Matthew Mort; Jeyakumar Natarajan; Evangelos Pafilis; Emiliano Pereira; Shruti Rao; Fabio Rinaldi; Karen Rothfels; David Salgado; Raquel M Silva; Onkar Singh; Raymund Stefancsik; Chu-Hsien Su; Suresh Subramani; Hamsa D Tadepally; Loukia Tsaprouni; Nicole Vasilevsky; Xiaodong Wang; Andrew Chatr-Aryamontri; Stanley J F Laulederkind; Sherri Matis-Mitchell; Johanna McEntyre; Sandra Orchard; Sangya Pundir; Raul Rodriguez-Esteban; Kimberly Van Auken; Zhiyong Lu; Mary Schaeffer; Cathy H Wu; Lynette Hirschman; Cecilia N Arighi
Journal:  Database (Oxford)       Date:  2016-09-01       Impact factor: 3.451

6.  Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges.

Authors:  Ayush Singhal; Robert Leaman; Natalie Catlett; Thomas Lemberger; Johanna McEntyre; Shawn Polson; Ioannis Xenarios; Cecilia Arighi; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2016-12-26       Impact factor: 3.451

7.  Triage of documents containing protein interactions affected by mutations using an NLP based machine learning approach.

Authors:  Jinchan Qu; Albert Steppi; Dongrui Zhong; Jie Hao; Jian Wang; Pei-Yau Lung; Tingting Zhao; Zhe He; Jinfeng Zhang
Journal:  BMC Genomics       Date:  2020-11-10       Impact factor: 3.969

8.  GeneCup: mining PubMed and GWAS catalog for gene-keyword relationships.

Authors:  Mustafa H Gunturkun; Efraim Flashner; Tengfei Wang; Megan K Mulligan; Robert W Williams; Pjotr Prins; Hao Chen
Journal:  G3 (Bethesda)       Date:  2022-05-06       Impact factor: 3.542

9.  Integrated Approaches to Identify miRNA Biomarkers Associated with Cognitive Dysfunction in Multiple Sclerosis Using Text Mining, Gene Expression, Pathways, and GWAS.

Authors:  Archana Prabahar; Kalpana Raja
Journal:  Diagnostics (Basel)       Date:  2022-08-08

10.  Overview of the BioCreative VI Precision Medicine Track: mining protein interactions and mutations for precision medicine.

Authors:  Rezarta Islamaj Dogan; Sun Kim; Andrew Chatr-Aryamontri; Chih-Hsuan Wei; Donald C Comeau; Rui Antunes; Sérgio Matos; Qingyu Chen; Aparna Elangovan; Nagesh C Panyam; Karin Verspoor; Hongfang Liu; Yanshan Wang; Zhuang Liu; Berna Altinel; Zehra Melce Hüsünbeyi; Arzucan Özgür; Aris Fergadis; Chen-Kai Wang; Hong-Jie Dai; Tung Tran; Ramakanth Kavuluru; Ling Luo; Albert Steppi; Jinfeng Zhang; Jinchan Qu; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.