
Optical structure recognition software to recover chemical information: OSRA, an open source solution.

Igor V Filippov, Marc C Nicklaus.

Abstract

Until recently, most scientific and patent documents dealing with chemistry have described molecular structures either with systematic names or with graphical images of Kekulé structures. The latter method poses inherent problems for the automated processing that is needed when the number of documents ranges in the hundreds of thousands or even millions, since graphical representations cannot be directly interpreted by a computer. To recover this structural information, which is otherwise all but lost, we have built OSRA, an optical structure recognition application based on modern advances in image processing implemented in open source tools. OSRA can read documents in over 90 graphical formats, including GIF, JPEG, PNG, TIFF, PDF, and PS; it automatically recognizes and extracts the graphical information representing chemical structures in such documents and generates SMILES or SD representations of the molecular structure images it encounters.

Year:  2009        PMID: 19434905      PMCID: PMC2889020          DOI: 10.1021/ci800067r

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956

