Literature DB >> 21384929

Chemical name to structure: OPSIN, an open source solution.

Daniel M Lowe1, Peter T Corbett, Peter Murray-Rust, Robert C Glen.   

Abstract

We have produced an open source, freely available, algorithm (Open Parser for Systematic IUPAC Nomenclature, OPSIN) that interprets the majority of organic chemical nomenclature in a fast and precise manner. This has been achieved using an approach based on a regular grammar. This grammar is used to guide tokenization, a potentially difficult problem in chemical names. From the parsed chemical name, an XML parse tree is constructed that is operated on in a stepwise manner until the structure has been reconstructed from the name. Results from OPSIN on various computer generated name/structure pair sets are presented. These show exceptionally high precision (99.8%+) and, when using general organic chemical nomenclature, high recall (98.7-99.2%). This software can serve as the basis for future open source developments of chemical name interpretation.

Mesh:

Year:  2011        PMID: 21384929     DOI: 10.1021/ci100384d

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  46 in total

Review 1.  Open source molecular modeling.

Authors:  Somayeh Pirhadi; Jocelyn Sunseri; David Ryan Koes
Journal:  J Mol Graph Model       Date:  2016-07-30       Impact factor: 2.518

2.  Molecular docking analysis of curcumin analogues against kinase domain of ALK5.

Authors:  Shivananda Kandagalla; B S Sharath; Basavapattana Rudresh Bharath; Umme Hani; Hanumanthappa Manjunatha
Journal:  In Silico Pharmacol       Date:  2017-11-18

3.  SimConcept: A Hybrid Approach for Simplifying Composite Named Entities in Biomedicine.

Authors:  Chih-Hsuan Wei; Robert Leaman; Zhiyong Lu
Journal:  ACM BCB       Date:  2014

4.  SimConcept: a hybrid approach for simplifying composite named entities in biomedical text.

Authors:  Chih-Hsuan Wei; Robert Leaman; Zhiyong Lu
Journal:  IEEE J Biomed Health Inform       Date:  2015-04-13       Impact factor: 5.772

5.  tmChem: a high performance approach for chemical named entity recognition and normalization.

Authors:  Robert Leaman; Chih-Hsuan Wei; Zhiyong Lu
Journal:  J Cheminform       Date:  2015-01-19       Impact factor: 5.514

6.  Recognition of chemical entities: combining dictionary-based and grammar-based approaches.

Authors:  Saber A Akhondi; Kristina M Hettne; Eelke van der Horst; Erik M van Mulligen; Jan A Kors
Journal:  J Cheminform       Date:  2015-01-19       Impact factor: 5.514

7.  Consistency of systematic chemical identifiers within and between small-molecule databases.

Authors:  Saber A Akhondi; Jan A Kors; Sorel Muresan
Journal:  J Cheminform       Date:  2012-12-13       Impact factor: 5.514

8.  Structure-based classification and ontology in chemistry.

Authors:  Janna Hastings; Despoina Magka; Colin Batchelor; Lian Duan; Robert Stevens; Marcus Ennis; Christoph Steinbeck
Journal:  J Cheminform       Date:  2012-04-05       Impact factor: 5.514

9.  Transformer-based artificial neural networks for the conversion between chemical notations.

Authors:  Lev Krasnov; Ivan Khokhlov; Maxim V Fedorov; Sergey Sosnin
Journal:  Sci Rep       Date:  2021-07-20       Impact factor: 4.379

10.  Extracting and connecting chemical structures from text sources using chemicalize.org.

Authors:  Christopher Southan; Andras Stracz
Journal:  J Cheminform       Date:  2013-04-23       Impact factor: 5.514

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.