Literature DB >> 19621901

Tunable machine vision-based strategy for automated annotation of chemical databases.

Jungkap Park1, Gus R Rosania, Kazuhiro Saitou.   

Abstract

We present a tunable, machine vision-based strategy for automated annotation of virtual small molecule databases. The proposed strategy is based on the use of a machine vision-based tool for extracting structure diagrams in research articles and converting them into connection tables, a virtual "Chemical Expert" system for screening the converted structures based on the adjustable levels of estimated conversion accuracy, and a fragment-based measure for calculating intermolecular similarity. For annotation, calculated chemical similarity between the converted structures and entries in a virtual small molecule database is used to establish the links. The overall annotation performances can be tuned by adjusting the cutoff threshold of the estimated conversion accuracy. We perform an annotation test which attempts to link 121 journal articles registered in PubMed to entries in PubChem which is the largest, publicly accessible chemical database. Two cases of tests are performed, and their results are compared to see how the overall annotation performances are affected by the different threshold levels of the estimated accuracy of the converted structure. Our work demonstrates that over 45% of the articles could have true positive links to entries in the PubChem database with promising recall and precision rates in both tests. Furthermore, we illustrate that the Chemical Expert system which can screen converted structures based on the adjustable levels of estimated conversion accuracy is a key factor impacting the overall annotation performance. We propose that this machine vision-based strategy can be incorporated with the text-mining approach to facilitate extraction of contextual scientific knowledge about a chemical structure, from the scientific literature.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19621901      PMCID: PMC2907084          DOI: 10.1021/ci900029v

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  25 in total

1.  Analysis of biomedical text for chemical names: a comparison of three methods.

Authors:  W J Wilbur; G F Hazard; G Divita; J G Mork; A R Aronson; A C Browne
Journal:  Proc AMIA Symp       Date:  1999

Review 2.  Target-oriented and diversity-oriented organic synthesis in drug discovery.

Authors:  S L Schreiber
Journal:  Science       Date:  2000-03-17       Impact factor: 47.728

3.  Tagging gene and protein names in biomedical text.

Authors:  Lorraine Tanabe; W John Wilbur
Journal:  Bioinformatics       Date:  2002-08       Impact factor: 6.937

Review 4.  Combinatorial informatics in the post-genomics ERA.

Authors:  Dimitris K Agrafiotis; Victor S Lobanov; F Raymond Salemme
Journal:  Nat Rev Drug Discov       Date:  2002-05       Impact factor: 84.694

Review 5.  Chemical database techniques in drug discovery.

Authors:  Mitchell A Miller
Journal:  Nat Rev Drug Discov       Date:  2002-03       Impact factor: 84.694

6.  Chemical machine vision: automated extraction of chemical metadata from raster images.

Authors:  Georgios V Gkoutos; Henry Rzepa; Richard M Clark; Osei Adjei; Harpal Johal
Journal:  J Chem Inf Comput Sci       Date:  2003 Sep-Oct

7.  Science resources. Chemists want NIH to curtail database.

Authors:  Jocelyn Kaiser
Journal:  Science       Date:  2005-05-06       Impact factor: 47.728

Review 8.  A cheminformatic toolkit for mining biomedical knowledge.

Authors:  Gus R Rosania; Gordon Crippen; Peter Woolf; David States; Kerby Shedden
Journal:  Pharm Res       Date:  2007-03-24       Impact factor: 4.200

9.  Mining patents using molecular similarity search.

Authors:  James Rhodes; Stephen Boyer; Jeffrey Kreulen; Ying Chen; Patricia Ordonez
Journal:  Pac Symp Biocomput       Date:  2007

10.  Reconstruction of chemical molecules from images.

Authors:  Maria-Elena Algorri; Marc Zimmermann; Christoph M Friedrich; Santiago Akle; Martin Hofmann-Apitius
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2007
View more
  2 in total

1.  Learning to predict chemical reactions.

Authors:  Matthew A Kayala; Chloé-Agathe Azencott; Jonathan H Chen; Pierre Baldi
Journal:  J Chem Inf Model       Date:  2011-09-02       Impact factor: 4.956

2.  Academic librarians at play in the field of cheminformatics: building the case for chemistry research data management.

Authors:  Leah McEwen; Ye Li
Journal:  J Comput Aided Mol Des       Date:  2014-07-20       Impact factor: 3.686

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.