Literature DB >> 21347110

A Comprehensive Analysis of Five Million UMLS Metathesaurus Terms Using Eighteen Million MEDLINE Citations.

Rong Xu1, Mark A Musen, Nigam H Shah.   

Abstract

The Unified Medical Language System (UMLS) Metathesaurus is widely used for biomedical natural language processing (NLP) tasks. In this study, we systematically analyzed UMLS Metathesaurus terms by analyzing their occurrences in over 18 million MEDLINE abstracts. Our goals were: 1. analyze the frequency and syntactic distribution of Metathesaurus terms in MEDLINE; 2. create a filtered UMLS Metathesaurus based on the MEDLINE analysis; 3. augment the UMLS Metathesaurus where each term is associated with metadata on its MEDLINE frequency and syntactic distribution statistics. After MEDLINE frequency-based filtering, the augmented UMLS Metathesaurus contains 518,835 terms and is roughly 13% of its original size. We have shown that the syntactic and frequency information is useful to identify errors in the Metathesaurus. This filtered and augmented UMLS Metathesaurus can potentially be used to improve efficiency and precision of UMLS-based information retrieval and NLP tasks.

Mesh:

Year:  2010        PMID: 21347110      PMCID: PMC3041393     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  12 in total

1.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Authors:  A R Aronson
Journal:  Proc AMIA Symp       Date:  2001

2.  Finding UMLS Metathesaurus concepts in MEDLINE.

Authors:  Suresh Srinivasan; Thomas C Rindflesch; William T Hole; Alan R Aronson; James G Mork
Journal:  Proc AMIA Symp       Date:  2002

3.  The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text.

Authors:  Thomas C Rindflesch; Marcelo Fiszman
Journal:  J Biomed Inform       Date:  2003-12       Impact factor: 6.317

4.  A study of biomedical concept identification: MetaMap vs. people.

Authors:  Wanda Pratt; Meliha Yetisgen-Yildiz
Journal:  AMIA Annu Symp Proc       Date:  2003

5.  Unsupervised method for extracting machine understandable medical knowledge from a large free text collection.

Authors:  Rong Xu; Amar K Das; Alan M Garber
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

6.  Analysis of a study of the users, uses, and future agenda of the UMLS.

Authors:  Yan Chen; Yehoshua Perl; James Geller; James J Cimino
Journal:  J Am Med Inform Assoc       Date:  2007-01-09       Impact factor: 4.497

7.  Unsupervised method for automatic construction of a disease dictionary from a large free text collection.

Authors:  Rong Xu; Kaustubh Supekar; Alex Morgan; Amar Das; Alan Garber
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

8.  The Unified Medical Language System.

Authors:  D A Lindberg; B L Humphreys; A T McCray
Journal:  Methods Inf Med       Date:  1993-08       Impact factor: 2.176

9.  The open biomedical annotator.

Authors:  Clement Jonquet; Nigam H Shah; Mark A Musen
Journal:  Summit Transl Bioinform       Date:  2009-03-01

10.  Comparison of concept recognizers for building the Open Biomedical Annotator.

Authors:  Nigam H Shah; Nipun Bhatia; Clement Jonquet; Daniel Rubin; Annie P Chiang; Mark A Musen
Journal:  BMC Bioinformatics       Date:  2009-09-17       Impact factor: 3.169

View more
  26 in total

1.  Usability-driven pruning of large ontologies: the case of SNOMED CT.

Authors:  Pablo López-García; Martin Boeker; Arantza Illarramendi; Stefan Schulz
Journal:  J Am Med Inform Assoc       Date:  2012-01-19       Impact factor: 4.497

2.  Using ontology-based annotation to profile disease research.

Authors:  Yi Liu; Adrien Coulet; Paea LePendu; Nigam H Shah
Journal:  J Am Med Inform Assoc       Date:  2012-04-11       Impact factor: 4.497

3.  Text mining for adverse drug events: the promise, challenges, and state of the art.

Authors:  Rave Harpaz; Alison Callahan; Suzanne Tamang; Yen Low; David Odgers; Sam Finlayson; Kenneth Jung; Paea LePendu; Nigam H Shah
Journal:  Drug Saf       Date:  2014-10       Impact factor: 5.606

4.  Neophilia Ranking of Scientific Journals.

Authors:  Mikko Packalen; Jay Bhattacharya
Journal:  Scientometrics       Date:  2016-10-22       Impact factor: 3.238

Review 5.  Recent progress in automatically extracting information from the pharmacogenomic literature.

Authors:  Yael Garten; Adrien Coulet; Russ B Altman
Journal:  Pharmacogenomics       Date:  2010-10       Impact factor: 2.533

6.  The Lexicon Builder Web service: Building Custom Lexicons from two hundred Biomedical Ontologies.

Authors:  Gautam K Parai; Clement Jonquet; Rong Xu; Mark A Musen; Nigam H Shah
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

7.  Molecularly and clinically related drugs and diseases are enriched in phenotypically similar drug-disease pairs.

Authors:  Ingo Vogt; Jeanette Prinz; Mónica Campillos
Journal:  Genome Med       Date:  2014-08-17       Impact factor: 11.117

8.  Identifying phenotypic signatures of neuropsychiatric disorders from electronic medical records.

Authors:  Svetlana Lyalina; Bethany Percha; Paea LePendu; Srinivasan V Iyer; Russ B Altman; Nigam H Shah
Journal:  J Am Med Inform Assoc       Date:  2013-08-16       Impact factor: 4.497

9.  Mining clinical text for signals of adverse drug-drug interactions.

Authors:  Srinivasan V Iyer; Rave Harpaz; Paea LePendu; Anna Bauer-Mehren; Nigam H Shah
Journal:  J Am Med Inform Assoc       Date:  2013-10-24       Impact factor: 4.497

10.  Cross-domain targeted ontology subsets for annotation: the case of SNOMED CORE and RxNorm.

Authors:  Pablo López-García; Paea Lependu; Mark Musen; Arantza Illarramendi
Journal:  J Biomed Inform       Date:  2013-10-01       Impact factor: 6.317

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.