Literature DB >> 18999169

Unsupervised method for automatic construction of a disease dictionary from a large free text collection.

Rong Xu1, Kaustubh Supekar, Alex Morgan, Amar Das, Alan Garber.   

Abstract

Concept specific lexicons (e.g. diseases, drugs, anatomy) are a critical source of background knowledge for many medical language-processing systems. However, the rapid pace of biomedical research and the lack of constraints on usage ensure that such dictionaries are incomplete. Focusing on disease terminology, we have developed an automated, unsupervised, iterative pattern learning approach for constructing a comprehensive medical dictionary of disease terms from randomized clinical trial (RCT) abstracts, and we compared different ranking methods for automatically extracting con-textual patterns and concept terms. When used to identify disease concepts from 100 randomly chosen, manually annotated clinical abstracts, our disease dictionary shows significant performance improvement (F1 increased by 35-88%) over available, manually created disease terminologies.

Mesh:

Year:  2008        PMID: 18999169      PMCID: PMC2656087     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  2 in total

1.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Authors:  A R Aronson
Journal:  Proc AMIA Symp       Date:  2001

2.  A study of biomedical concept identification: MetaMap vs. people.

Authors:  Wanda Pratt; Meliha Yetisgen-Yildiz
Journal:  AMIA Annu Symp Proc       Date:  2003
  2 in total
  21 in total

1.  An evaluation of the UMLS in representing corpus derived clinical concepts.

Authors:  Jeff Friedlin; Marc Overhage
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  Unsupervised method for extracting machine understandable medical knowledge from a large free text collection.

Authors:  Rong Xu; Amar K Das; Alan M Garber
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

3.  Using text to build semantic networks for pharmacogenomics.

Authors:  Adrien Coulet; Nigam H Shah; Yael Garten; Mark Musen; Russ B Altman
Journal:  J Biomed Inform       Date:  2010-08-17       Impact factor: 6.317

4.  Towards building a disease-phenotype knowledge base: extracting disease-manifestation relationship from literature.

Authors:  Rong Xu; Li Li; Quanqiu Wang
Journal:  Bioinformatics       Date:  2013-07-04       Impact factor: 6.937

Review 5.  Recent progress in automatically extracting information from the pharmacogenomic literature.

Authors:  Yael Garten; Adrien Coulet; Russ B Altman
Journal:  Pharmacogenomics       Date:  2010-10       Impact factor: 2.533

6.  Extraction of genotype-phenotype-drug relationships from text: from entity recognition to bioinformatics application.

Authors:  Adrien Coulet; Nigam Shah; Lawrence Hunter; Chitta Barral; Russ B Altman
Journal:  Pac Symp Biocomput       Date:  2010

7.  A Comprehensive Analysis of Five Million UMLS Metathesaurus Terms Using Eighteen Million MEDLINE Citations.

Authors:  Rong Xu; Mark A Musen; Nigam H Shah
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

8.  The Lexicon Builder Web service: Building Custom Lexicons from two hundred Biomedical Ontologies.

Authors:  Gautam K Parai; Clement Jonquet; Rong Xu; Mark A Musen; Nigam H Shah
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

9.  Syntactic dependency parsers for biomedical-NLP.

Authors:  Raphael Cohen; Michael Elhadad
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

10.  An iterative searching and ranking algorithm for prioritising pharmacogenomics genes.

Authors:  Rong Xu; Quanqiu Wang
Journal:  Int J Comput Biol Drug Des       Date:  2013-02-21
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.