Literature DB >> 17990501

Mining patents using molecular similarity search.

James Rhodes1, Stephen Boyer, Jeffrey Kreulen, Ying Chen, Patricia Ordonez.   

Abstract

Text analytics is becoming an increasingly important tool used in biomedical research. While advances continue to be made in the core algorithms for entity identification and relation extraction, a need for practical applications of these technologies arises. We developed a system that allows users to explore the US Patent corpus using molecular information. The core of our system contains three main technologies: A high performing chemical annotator which identifies chemical terms and converts them to structures, a similarity search engine based on the emerging IUPAC International Chemical Identifier (InChI) standard, and a set of on demand data mining tools. By leveraging this technology we were able to rapidly identify and index 3,623,248 unique chemical structures from 4,375,036 US Patents and Patent Applications. Using this system a user may go to a web page, draw a molecule, search for related Intellectual Property (IP) and analyze the results. Our results prove that this is a far more effective way for identifying IP than traditional keyword based approaches.

Mesh:

Year:  2007        PMID: 17990501

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  6 in total

Review 1.  Frontiers of biomedical text mining: current progress.

Authors:  Pierre Zweigenbaum; Dina Demner-Fushman; Hong Yu; Kevin B Cohen
Journal:  Brief Bioinform       Date:  2007-10-30       Impact factor: 11.622

Review 2.  Lowering industry firewalls: pre-competitive informatics initiatives in drug discovery.

Authors:  Michael R Barnes; Lee Harland; Steven M Foord; Matthew D Hall; Ian Dix; Scott Thomas; Bryn I Williams-Jones; Cory R Brouwer
Journal:  Nat Rev Drug Discov       Date:  2009-07-17       Impact factor: 84.694

3.  Exploring sets of molecules from patents and relationships to other active compounds in chemical space networks.

Authors:  Ryo Kunimoto; Jürgen Bajorath
Journal:  J Comput Aided Mol Des       Date:  2017-09-04       Impact factor: 3.686

4.  Tunable machine vision-based strategy for automated annotation of chemical databases.

Authors:  Jungkap Park; Gus R Rosania; Kazuhiro Saitou
Journal:  J Chem Inf Model       Date:  2009-08       Impact factor: 4.956

5.  Detection of IUPAC and IUPAC-like chemical names.

Authors:  Roman Klinger; Corinna Kolárik; Juliane Fluck; Martin Hofmann-Apitius; Christoph M Friedrich
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

6.  Application of deep metric learning to molecular graph similarity.

Authors:  Damien E Coupry; Peter Pogány
Journal:  J Cheminform       Date:  2022-03-12       Impact factor: 5.514

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.