Literature DB >> 22639789

Exploiting structural information in patent specifications for key compound prediction.

Christian Tyrchan1, Jonas Boström, Fabrizio Giordanetto, Jon Winter, Sorel Muresan.   

Abstract

Patent specifications are one of many information sources needed to progress drug discovery projects. Understanding compound prior art and novelty checking, validation of biological assays, and identification of new starting points for chemical explorations are a few areas where patent analysis is an important component. Cheminformatics methods can be used to facilitate the identification of so-called key compounds in patent specifications. Such methods, relying on structural information extracted from documents by expert curation or text mining, can complement or in some cases replace the traditional manual approach of searching for clues in the text. This paper describes and compares three different methods for the automatic prediction of key compounds in patent specifications using structural information alone. For this data set, the cluster seed analysis described by Hattori et al. (Hattori, K.; Wakabayashi, H.; Tamaki, K. Predicting key example compounds in competitors' patent applications using structural information alone. J. Chem. Inf. Model.2008, 48, 135-142) is superior in terms of prediction accuracy with 26 out of 48 drugs (54%) correctly predicted from their corresponding patents. Nevertheless, the two new methods, based on frequency of R-groups (FOG) and maximum common substructure (MCS) similarity measures, show significant advantages due to their inherent ability to visualize relevant structural features. The results of the FOG method can be enhanced by manual selection of the scaffolds used in the analysis. Finally, a successful example of applying FOG analysis for designing potent ATP-competitive AXL kinase inhibitors with improved properties is described.

Entities:  

Mesh:

Year:  2012        PMID: 22639789     DOI: 10.1021/ci3001293

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  6 in total

1.  Managing expectations: assessment of chemistry databases generated by automated extraction of chemical structures from patents.

Authors:  Stefan Senger; Luca Bartek; George Papadatos; Anna Gaulton
Journal:  J Cheminform       Date:  2015-10-06       Impact factor: 5.514

2.  Annotated chemical patent corpus: a gold standard for text mining.

Authors:  Saber A Akhondi; Alexander G Klenner; Christian Tyrchan; Anil K Manchala; Kiran Boppana; Daniel Lowe; Marc Zimmermann; Sarma A R P Jagarlapudi; Roger Sayle; Jan A Kors; Sorel Muresan
Journal:  PLoS One       Date:  2014-09-30       Impact factor: 3.240

3.  Automatic identification of relevant chemical compounds from patents.

Authors:  Saber A Akhondi; Hinnerk Rey; Markus Schwörer; Michael Maier; John Toomey; Heike Nau; Gabriele Ilchmann; Mark Sheehan; Matthias Irmer; Claudia Bobach; Marius Doornenbal; Michelle Gregory; Jan A Kors
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

4.  SureChEMBL: a large-scale, chemically annotated patent document database.

Authors:  George Papadatos; Mark Davies; Nathan Dedman; Jon Chambers; Anna Gaulton; James Siddle; Richard Koks; Sean A Irvine; Joe Pettersson; Nicko Goncharoff; Anne Hersey; John P Overington
Journal:  Nucleic Acids Res       Date:  2015-11-17       Impact factor: 16.971

5.  Chemical entity recognition in patents by combining dictionary-based and statistical approaches.

Authors:  Saber A Akhondi; Ewoud Pons; Zubair Afzal; Herman van Haagen; Benedikt F H Becker; Kristina M Hettne; Erik M van Mulligen; Jan A Kors
Journal:  Database (Oxford)       Date:  2016-05-02       Impact factor: 3.451

6.  A-loop interactions in Mer tyrosine kinase give rise to inhibitors with two-step mechanism and long residence time of binding.

Authors:  Alexander Pflug; Marianne Schimpl; J Willem M Nissink; Ross C Overman; Philip B Rawlins; Caroline Truman; Elizabeth Underwood; Juli Warwicker; Jon Winter-Holt; William McCoull
Journal:  Biochem J       Date:  2020-11-27       Impact factor: 3.857

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.