Literature DB >> 22753780

AutoBind: automatic extraction of protein-ligand-binding affinity data from biological literature.

Darby Tien-Hao Chang1, Chao-Hsuan Ke, Jung-Hsin Lin, Jung-Hsien Chiang.   

Abstract

MOTIVATION: Determination of the binding affinity of a protein-ligand complex is important to quantitatively specify whether a particular small molecule will bind to the target protein. Besides, collection of comprehensive datasets for protein-ligand complexes and their corresponding binding affinities is crucial in developing accurate scoring functions for the prediction of the binding affinities of previously unknown protein-ligand complexes. In the past decades, several databases of protein-ligand-binding affinities have been created via visual extraction from literature. However, such approaches are time-consuming and most of these databases are updated only a few times per year. Hence, there is an immediate demand for an automatic extraction method with high precision for binding affinity collection. RESULT: We have created a new database of protein-ligand-binding affinity data, AutoBind, based on automatic information retrieval. We first compiled a collection of 1586 articles where the binding affinities have been marked manually. Based on this annotated collection, we designed four sentence patterns that are used to scan full-text articles as well as a scoring function to rank the sentences that match our patterns. The proposed sentence patterns can effectively identify the binding affinities in full-text articles. Our assessment shows that AutoBind achieved 84.22% precision and 79.07% recall on the testing corpus. Currently, 13 616 protein-ligand complexes and the corresponding binding affinities have been deposited in AutoBind from 17 221 articles. AVAILABILITY: AutoBind is automatically updated on a monthly basis, and it is freely available at http://autobind.csie.ncku.edu.tw/ and http://autobind.mc.ntu.edu.tw/. All of the deposited binding affinities have been refined and approved manually before being released.

Mesh:

Substances:

Year:  2012        PMID: 22753780     DOI: 10.1093/bioinformatics/bts367

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  3 in total

1.  Quantum Mechanics Approaches to Drug Research in the Era of Structural Chemogenomics.

Authors:  Andrey V Ilatovskiy; Ruben Abagyan; Irina Kufareva
Journal:  Int J Quantum Chem       Date:  2013-06-15       Impact factor: 2.444

Review 2.  Compound activity prediction using models of binding pockets or ligand properties in 3D.

Authors:  Irina Kufareva; Yu-Chen Chen; Andrey V Ilatovskiy; Ruben Abagyan
Journal:  Curr Top Med Chem       Date:  2012       Impact factor: 3.295

3.  Detection and categorization of bacteria habitats using shallow linguistic analysis.

Authors:  İlknur Karadeniz; Arzucan Özgür
Journal:  BMC Bioinformatics       Date:  2015-07-13       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.