
Semantic classification of biomedical concepts using distributional similarity.

Jung-Wei Fan, Carol Friedman.

Abstract

OBJECTIVE: To develop an automated, high-throughput, and reproducible method for reclassifying and validating ontological concepts for natural language processing applications.
DESIGN: We developed a distributional similarity approach to classify the Unified Medical Language System (UMLS) concepts. Classification models were built for seven broad biomedically relevant semantic classes created by grouping subsets of the UMLS semantic types. We used contextual features based on syntactic properties obtained from two different large corpora and used alpha-skew divergence as the similarity measure.
MEASUREMENTS: The testing sets were automatically generated based on the changes by the National Library of Medicine to the semantic classification of concepts from the UMLS 2005AA to the 2006AA release. Error rates were calculated and a misclassification analysis was performed.
RESULTS: The estimated lowest error rates were 0.198 and 0.116 when considering the correct classification to be covered by our top prediction and top 2 predictions, respectively.
CONCLUSION: The results demonstrated that the distributional similarity approach can recommend high-level semantic classifications suitable for use in natural language processing.
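
ILLUSTRATION: The approach summarized above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the authors' implementation. It uses the standard definition of alpha-skew divergence, s_alpha(q, r) = KL(r || alpha*q + (1 - alpha)*r), ranks candidate classes by increasing divergence from a concept's feature distribution, and computes a top-k error rate. The abstract does not give the value of alpha, the argument order, the feature set, or the class names, so alpha = 0.99, the feature strings, and the two toy classes below are all hypothetical.

```python
import math
from collections import Counter


def normalize(counts):
    """Turn raw feature counts into a probability distribution."""
    total = sum(counts.values())
    return {feature: count / total for feature, count in counts.items()}


def alpha_skew_divergence(q, r, alpha=0.99):
    """KL(r || alpha*q + (1 - alpha)*r); alpha=0.99 is an assumed default.

    Mixing a little of r into q keeps the divergence finite even when
    q assigns zero probability to a feature observed in r.
    """
    divergence = 0.0
    for feature, r_f in r.items():
        if r_f > 0:
            mixed = alpha * q.get(feature, 0.0) + (1 - alpha) * r_f
            divergence += r_f * math.log(r_f / mixed)
    return divergence


def rank_classes(concept_counts, class_counts, alpha=0.99):
    """Return class names ordered from most to least similar."""
    r = normalize(concept_counts)
    scores = {
        name: alpha_skew_divergence(normalize(counts), r, alpha)
        for name, counts in class_counts.items()
    }
    return sorted(scores, key=scores.get)


def top_k_error(ranked_lists, gold_labels, k):
    """Fraction of concepts whose gold class is not in the top k predictions."""
    misses = sum(
        1 for ranked, gold in zip(ranked_lists, gold_labels)
        if gold not in ranked[:k]
    )
    return misses / len(gold_labels)


# Hypothetical class profiles built from syntactic context features.
class_profiles = {
    "disorder": Counter({"obj_of:treat": 8, "subj_of:cause": 2}),
    "drug": Counter({"subj_of:treat": 7, "obj_of:administer": 3}),
}
# Hypothetical context counts for one concept to be classified.
concept = Counter({"obj_of:treat": 3, "subj_of:cause": 1})

ranking = rank_classes(concept, class_profiles)
print(ranking)
```

With these toy counts the concept shares its context features only with the "disorder" profile, so that class is ranked first. The top-1 and top-2 error rates reported in RESULTS correspond to calling top_k_error with k=1 and k=2 over a test set.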

Year:  2007        PMID: 17460124      PMCID: PMC2244895          DOI: 10.1197/jamia.M2314

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


