Literature DB >> 17238331

dTagger: a POS tagger.

Guy Divita1, Allen C Browne, Russell Loane.   

Abstract

The Lexical Systems Group at the National Library of Medicine (NLM) has developed a Part-of-Speech (POS) tagger to be freely distributed with the SPECIALIST NLP Tools. dTagger is specifically designed for use with the SPECIALIST lexicon but it can be used with an arbitrary tag set. It is capable of single or multi-word chunking. It is trainable with previously annotated text and in development is a version that is tunable with untagged text. The tagger allows users to add local lexicon content. It can report likelihoods for each sentence tagged. New words seen while tagging (the unknowns) are handled by shape identification including heuristics based on suffix statistics gleaned during the training. The performance of the supervised training is noted to be 95% on a modified version of the MedPost hand annotated Medline abstracts. Eight percent of the terms within this corpus were multi-word entities.

Mesh:

Year:  2006        PMID: 17238331      PMCID: PMC1839340     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  1 in total

1.  MedPost: a part-of-speech tagger for bioMedical text.

Authors:  L Smith; T Rindflesch; W J Wilbur
Journal:  Bioinformatics       Date:  2004-04-08       Impact factor: 6.937

  1 in total
  4 in total

1.  Part-of-speech tagging for clinical text: wall or bridge between institutions?

Authors:  Jung-wei Fan; Rashmi Prasad; Rommel M Yabut; Richard M Loomis; Daniel S Zisook; John E Mattison; Yang Huang
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  Heuristic sample selection to minimize reference standard training set for a part-of-speech tagger.

Authors:  Kaihong Liu; Wendy Chapman; Rebecca Hwa; Rebecca S Crowley
Journal:  J Am Med Inform Assoc       Date:  2007-06-28       Impact factor: 4.497

3.  De-identification of Address, Date, and Alphanumeric Identifiers in Narrative Clinical Reports.

Authors:  Mehmet Kayaalp; Allen C Browne; Zeyno A Dodd; Pamela Sagan; Clement J McDonald
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

Review 4.  Linking genes to literature: text mining, information extraction, and retrieval applications for biology.

Authors:  Martin Krallinger; Alfonso Valencia; Lynette Hirschman
Journal:  Genome Biol       Date:  2008-09-01       Impact factor: 13.583

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.