Literature DB >> 12855455

GENIA corpus--semantically annotated corpus for bio-textmining.

J-D Kim1, T Ohta, Y Tateisi, J Tsujii.   

Abstract

MOTIVATION: Natural language processing (NLP) methods are regarded as being useful to raise the potential of text mining from biological literature. The lack of an extensively annotated corpus of this literature, however, causes a major bottleneck for applying NLP techniques. GENIA corpus is being developed to provide reference materials to let NLP techniques work for bio-textmining.
RESULTS: GENIA corpus version 3.0 consisting of 2000 MEDLINE abstracts has been released with more than 400,000 words and almost 100,000 annotations for biological terms.

Mesh:

Year:  2003        PMID: 12855455     DOI: 10.1093/bioinformatics/btg1023

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  155 in total

1.  Cross-species gene normalization by species inference.

Authors:  Chih-Hsuan Wei; Hung-Yu Kao
Journal:  BMC Bioinformatics       Date:  2011-10-03       Impact factor: 3.169

2.  Automatic discourse connective detection in biomedical text.

Authors:  Balaji Polepalli Ramesh; Rashmi Prasad; Tim Miller; Brian Harrington; Hong Yu
Journal:  J Am Med Inform Assoc       Date:  2012-06-28       Impact factor: 4.497

3.  Bio-Ontology and text: bridging the modeling gap.

Authors:  Carol Friedman; Tara Borlawsky; Lyudmila Shagina; H Rosie Xing; Yves A Lussier
Journal:  Bioinformatics       Date:  2006-07-26       Impact factor: 6.937

4.  SemCat: semantically categorized entities for genomics.

Authors:  Lorraine Tanabe; Lynne H Thom; Wayne Matten; Donald C Comeau; W John Wilbur
Journal:  AMIA Annu Symp Proc       Date:  2006

Review 5.  Frontiers of biomedical text mining: current progress.

Authors:  Pierre Zweigenbaum; Dina Demner-Fushman; Hong Yu; Kevin B Cohen
Journal:  Brief Bioinform       Date:  2007-10-30       Impact factor: 11.622

6.  Biological entity recognition with conditional random fields.

Authors:  Ying He; Mehmet Kayaalp
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

7.  Narratives in the network: interactive methods for mining cell signaling networks.

Authors:  M Shahriar Hossain; Monika Akbar; Nicholas F Polys
Journal:  J Comput Biol       Date:  2012-08-16       Impact factor: 1.479

8.  Document classification for mining host pathogen protein-protein interactions.

Authors:  Lanlan Yin; Guixian Xu; Manabu Torii; Zhendong Niu; Jose M Maisog; Cathy Wu; Zhangzhi Hu; Hongfang Liu
Journal:  Artif Intell Med       Date:  2010-05-15       Impact factor: 5.326

Review 9.  Recent progress in automatically extracting information from the pharmacogenomic literature.

Authors:  Yael Garten; Adrien Coulet; Russ B Altman
Journal:  Pharmacogenomics       Date:  2010-10       Impact factor: 2.533

Review 10.  Overview of the First Natural Language Processing Challenge for Extracting Medication, Indication, and Adverse Drug Events from Electronic Health Record Notes (MADE 1.0).

Authors:  Abhyuday Jagannatha; Feifan Liu; Weisong Liu; Hong Yu
Journal:  Drug Saf       Date:  2019-01       Impact factor: 5.606

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.