Literature DB >> 16870928

Bio-Ontology and text: bridging the modeling gap.

Carol Friedman1, Tara Borlawsky, Lyudmila Shagina, H Rosie Xing, Yves A Lussier.   

Abstract

MOTIVATION: Natural language processing (NLP) techniques are increasingly being used in biology to automate the capture of new biological discoveries in text, which are being reported at a rapid rate. Yet, information represented in NLP data structures is classically very different from information organized with ontologies as found in model organisms or genetic databases. To facilitate the computational reuse and integration of information buried in unstructured text with that of genetic databases, we propose and evaluate a translational schema that represents a comprehensive set of phenotypic and genetic entities, as well as their closely related biomedical entities and relations as expressed in natural language. In addition, the schema connects different scales of biological information, and provides mappings from the textual information to existing ontologies, which are essential in biology for integration, organization, dissemination and knowledge management of heterogeneous phenotypic information. A common comprehensive representation for otherwise heterogeneous phenotypic and genetic datasets, such as the one proposed, is critical for advancing systems biology because it enables acquisition and reuse of unprecedented volumes of diverse types of knowledge and information from text.
RESULTS: A novel representational schema, PGschema, was developed that enables translation of phenotypic, genetic and their closely related information found in textual narratives to a well-defined data structure comprising phenotypic and genetic concepts from established ontologies along with modifiers and relationships. Evaluation for coverage of a selected set of entities showed that 90% of the information could be represented (95% confidence interval: 86-93%; n = 268). Moreover, PGschema can be expressed automatically in an XML format using natural language techniques to process the text. To our knowledge, we are providing the first evaluation of a translational schema for NLP that contains declarative knowledge about genes and their associated biomedical data (e.g. phenotypes). AVAILABILITY: http://zellig.cpmc.columbia.edu/PGschema

Mesh:

Year:  2006        PMID: 16870928      PMCID: PMC2879055          DOI: 10.1093/bioinformatics/btl405

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  50 in total

1.  NLP techniques associated with the OpenGALEN ontology for semi-automatic textual extraction of medical knowledge: abstracting and mapping equivalent linguistic and logical constructs.

Authors:  M B do Amaral; A Roberts; A L Rector
Journal:  Proc AMIA Symp       Date:  2000

2.  Bio-ontologies-fast and furious.

Authors:  Judith Blake
Journal:  Nat Biotechnol       Date:  2004-06       Impact factor: 54.908

3.  Systems biology of the 2-cell mouse embryo.

Authors:  A V Evsikov; W N de Vries; A E Peaston; E E Radford; K S Fancher; F H Chen; J A Blake; C J Bult; K E Latham; D Solter; B B Knowles
Journal:  Cytogenet Genome Res       Date:  2004       Impact factor: 1.636

4.  SNOMED CT milestones: endorsements are added to already-impressive standards credentials.

Authors:  Kent A Spackman
Journal:  Healthc Inform       Date:  2004-09

5.  Beyond the clause: extraction of phosphorylation information from medline abstracts.

Authors:  M Narayanaswamy; K E Ravikumar; K Vijay-Shanker
Journal:  Bioinformatics       Date:  2005-06       Impact factor: 6.937

Review 6.  Ontologies and semantic data integration.

Authors:  Stephen P Gardner
Journal:  Drug Discov Today       Date:  2005-07-15       Impact factor: 7.851

7.  How will big pictures emerge from a sea of biological data?

Authors:  Elizabeth Pennisi
Journal:  Science       Date:  2005-07-01       Impact factor: 47.728

8.  Medical-concept models and medical records: an approach based on GALEN and PEN&PAD.

Authors:  A L Rector; A J Glowinski; W A Nowlan; A Rossi-Mori
Journal:  J Am Med Inform Assoc       Date:  1995 Jan-Feb       Impact factor: 4.497

9.  Overview of BioCreAtIvE: critical assessment of information extraction for biology.

Authors:  Lynette Hirschman; Alexander Yeh; Christian Blaschke; Alfonso Valencia
Journal:  BMC Bioinformatics       Date:  2005-05-24       Impact factor: 3.169

10.  An ontology for cell types.

Authors:  Jonathan Bard; Seung Y Rhee; Michael Ashburner
Journal:  Genome Biol       Date:  2005-01-14       Impact factor: 13.583

View more
  6 in total

Review 1.  Natural Language Processing methods and systems for biomedical ontology learning.

Authors:  Kaihong Liu; William R Hogan; Rebecca S Crowley
Journal:  J Biomed Inform       Date:  2010-07-18       Impact factor: 6.317

Review 2.  Computational approaches to phenotyping: high-throughput phenomics.

Authors:  Yves A Lussier; Yang Liu
Journal:  Proc Am Thorac Soc       Date:  2007-01

Review 3.  Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review.

Authors:  Kory Kreimeyer; Matthew Foster; Abhishek Pandey; Nina Arya; Gwendolyn Halford; Sandra F Jones; Richard Forshee; Mark Walderhaug; Taxiarchis Botsis
Journal:  J Biomed Inform       Date:  2017-07-17       Impact factor: 6.317

4.  Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes.

Authors:  Spiro P Pantazatos; Jianrong Li; Paul Pavlidis; Yves A Lussier
Journal:  Cancer Inform       Date:  2009-06-08

5.  Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes.

Authors:  Spiro P Pantazatos; Jianrong Li; Paul Pavlidis; Yves A Lussier
Journal:  Summit Transl Bioinform       Date:  2009-03-01

6.  Developing a sampling method and preliminary taxonomy for classifying COVID-19 public health guidance for healthcare organizations and the general public.

Authors:  Peter Taber; Catherine J Staes; Saifon Phengphoo; Elisa Rocha; Adria Lam; Guilherme Del Fiol; Saverio M Maviglia; Roberto A Rocha
Journal:  J Biomed Inform       Date:  2021-06-28       Impact factor: 8.000

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.