Literature DB >> 20723615

Using text to build semantic networks for pharmacogenomics.

Adrien Coulet1, Nigam H Shah, Yael Garten, Mark Musen, Russ B Altman.   

Abstract

Most pharmacogenomics knowledge is contained in the text of published studies, and is thus not available for automated computation. Natural Language Processing (NLP) techniques for extracting relationships in specific domains often rely on hand-built rules and domain-specific ontologies to achieve good performance. In a new and evolving field such as pharmacogenomics (PGx), rules and ontologies may not be available. Recent progress in syntactic NLP parsing in the context of a large corpus of pharmacogenomics text provides new opportunities for automated relationship extraction. We describe an ontology of PGx relationships built starting from a lexicon of key pharmacogenomic entities and a syntactic parse of more than 87 million sentences from 17 million MEDLINE abstracts. We used the syntactic structure of PGx statements to systematically extract commonly occurring relationships and to map them to a common schema. Our extracted relationships have a 70-87.7% precision and involve not only key PGx entities such as genes, drugs, and phenotypes (e.g., VKORC1, warfarin, clotting disorder), but also critical entities that are frequently modified by these key entities (e.g., VKORC1 polymorphism, warfarin response, clotting disorder treatment). The result of our analysis is a network of 40,000 relationships between more than 200 entity types with clear semantics. This network is used to guide the curation of PGx knowledge and provide a computable resource for knowledge discovery.
Copyright © 2010 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2010        PMID: 20723615      PMCID: PMC2991587          DOI: 10.1016/j.jbi.2010.08.005

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  15 in total

1.  Automatic extraction of biological information from scientific text: protein-protein interactions.

Authors:  C Blaschke; M A Andrade; C Ouzounis; A Valencia
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1999

2.  GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles.

Authors:  C Friedman; P Kra; H Yu; M Krauthammer; A Rzhetsky
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

3.  Integrating genotype and phenotype information: an overview of the PharmGKB project. Pharmacogenetics Research Network and Knowledge Base.

Authors:  T E Klein; J T Chang; M K Cho; K L Easton; R Fergerson; M Hewett; Z Lin; Y Liu; S Liu; D E Oliver; D L Rubin; F Shafa; J M Stuart; R B Altman
Journal:  Pharmacogenomics J       Date:  2001       Impact factor: 3.550

4.  Semantic relations asserting the etiology of genetic diseases.

Authors:  Thomas C Rindflesch; Bisharah Libbus; Dimitar Hristovski; Alan R Aronson; Halil Kilicoglu
Journal:  AMIA Annu Symp Proc       Date:  2003

5.  Extraction of regulatory gene/protein networks from Medline.

Authors:  Jasmin Saric; Lars Juhl Jensen; Rossitza Ouzounova; Isabel Rojas; Peer Bork
Journal:  Bioinformatics       Date:  2005-07-26       Impact factor: 6.937

6.  RelEx--relation extraction using dependency parse trees.

Authors:  Katrin Fundel; Robert Küffner; Ralf Zimmer
Journal:  Bioinformatics       Date:  2006-12-01       Impact factor: 6.937

7.  Extracting semantic predications from Medline citations for pharmacogenomics.

Authors:  Caroline B Ahlers; Marcelo Fiszman; Dina Demner-Fushman; François-Michel Lang; Thomas C Rindflesch
Journal:  Pac Symp Biocomput       Date:  2007

8.  Querying parse tree database of Medline text to synthesize user-specific biomolecular networks.

Authors:  Luis Tari; Jörg Hakenberg; Graciela Gonzalez; Chitta Baral
Journal:  Pac Symp Biocomput       Date:  2009

9.  Evaluation of text-mining systems for biology: overview of the Second BioCreative community challenge.

Authors:  Martin Krallinger; Alexander Morgan; Larry Smith; Florian Leitner; Lorraine Tanabe; John Wilbur; Lynette Hirschman; Alfonso Valencia
Journal:  Genome Biol       Date:  2008-09-01       Impact factor: 13.583

10.  OpenDMAP: an open source, ontology-driven concept analysis engine, with applications to capturing knowledge regarding protein transport, protein interactions and cell-type-specific gene expression.

Authors:  Lawrence Hunter; Zhiyong Lu; James Firby; William A Baumgartner; Helen L Johnson; Philip V Ogren; K Bretonnel Cohen
Journal:  BMC Bioinformatics       Date:  2008-01-31       Impact factor: 3.169

View more
  43 in total

1.  Bridging semantics and syntax with graph algorithms-state-of-the-art of extracting biomedical relations.

Authors:  Yuan Luo; Özlem Uzuner; Peter Szolovits
Journal:  Brief Bioinform       Date:  2016-02-05       Impact factor: 11.622

2.  A knowledge-driven conditional approach to extract pharmacogenomics specific drug-gene relationships from free text.

Authors:  Rong Xu; Quanqiu Wang
Journal:  J Biomed Inform       Date:  2012-04-27       Impact factor: 6.317

Review 3.  Recent progress in automatically extracting information from the pharmacogenomic literature.

Authors:  Yael Garten; Adrien Coulet; Russ B Altman
Journal:  Pharmacogenomics       Date:  2010-10       Impact factor: 2.533

4.  EliXR: an approach to eligibility criteria extraction and representation.

Authors:  Chunhua Weng; Xiaoying Wu; Zhihui Luo; Mary Regina Boland; Dimitri Theodoratos; Stephen B Johnson
Journal:  J Am Med Inform Assoc       Date:  2011-07-31       Impact factor: 4.497

5.  Trends in computational biology—2010.

Authors:  H Craig Mak
Journal:  Nat Biotechnol       Date:  2011-01       Impact factor: 54.908

6.  Recurrent neural networks for classifying relations in clinical notes.

Authors:  Yuan Luo
Journal:  J Biomed Inform       Date:  2017-07-08       Impact factor: 6.317

7.  Automated Metabolic Phenotyping of Cytochrome Polymorphisms Using PubMed Abstract Mining.

Authors:  Luoxin Chen; Carol Friedman; Joseph Finkelstein
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

8.  Mining the pharmacogenomics literature--a survey of the state of the art.

Authors:  Udo Hahn; K Bretonnel Cohen; Yael Garten; Nigam H Shah
Journal:  Brief Bioinform       Date:  2012-07       Impact factor: 11.622

9.  An iterative searching and ranking algorithm for prioritising pharmacogenomics genes.

Authors:  Rong Xu; Quanqiu Wang
Journal:  Int J Comput Biol Drug Des       Date:  2013-02-21

10.  A gene ontology inferred from molecular networks.

Authors:  Janusz Dutkowski; Michael Kramer; Michal A Surma; Rama Balakrishnan; J Michael Cherry; Nevan J Krogan; Trey Ideker
Journal:  Nat Biotechnol       Date:  2013-01       Impact factor: 54.908

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.